arxiv:2410.13859
YaxinLuo PRO
AI & ML interests
AudioVisual Speaker extraction, video understanding, self-supervised large speech models
Models (21)
YaxinLuo/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
YaxinLuo/vqvae_gpt2_codebook_imagenet-stage1-resolution-256
YaxinLuo/vqvae_gpt2_codebook_imagenet-stage1
YaxinLuo/llava-v1.5-7b-pretrained-gpt2vision-stage2 • 7B • 6
YaxinLuo/llava-v1.5-7b-gpt2vision-pretrain • 10
YaxinLuo/llava-v1.5-7b-gpt2vision-scratch-stage2 • 7B • 7
YaxinLuo/segmenter-tiny-mask-pcontext-gpt2
YaxinLuo/segmenter-tiny-mask-pcontext-baseline
YaxinLuo/segmenter-tiny-mask-gpt2-ade20k
YaxinLuo/segmenter-tiny-mask-baseline-ade20k