YaxinLuo PRO

https://yaxin9luo.github.io./

Yaxin9Luo

AI & ML interests

Audio-visual speaker extraction, video understanding, self-supervised large speech models

Papers 1

arxiv:2410.13859

Spaces 4

Open CaptchaWorld

Solve CAPTCHA puzzles to test accuracy

Catering Service Tool

Generate superhero party themes

AlfredAgent

First Agent Template

Suggest activities based on preferences and time

Models 21

YaxinLuo/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Apr 21, 2025

YaxinLuo/vqvae_gpt2_codebook_imagenet-stage1-resolution-256

Updated Apr 7, 2025

YaxinLuo/vqvae_gpt2_codebook_imagenet-stage1

Updated Apr 2, 2025

YaxinLuo/llava-v1.5-7b-pretrained-gpt2vision-stage2

7B • Updated Apr 1, 2025 • 6

YaxinLuo/llava-v1.5-7b-gpt2vision-pretrain

Updated Apr 1, 2025 • 10

YaxinLuo/llava-v1.5-7b-gpt2vision-scratch-stage2

7B • Updated Apr 1, 2025 • 7

YaxinLuo/segmenter-tiny-mask-pcontext-gpt2

Updated Apr 1, 2025

YaxinLuo/segmenter-tiny-mask-pcontext-baseline

Updated Apr 1, 2025

YaxinLuo/segmenter-tiny-mask-gpt2-ade20k

Updated Apr 1, 2025

YaxinLuo/segmenter-tiny-mask-baseline-ade20k

Updated Apr 1, 2025

Datasets 2

YaxinLuo/Open_CaptchaWorld

Viewer • Updated Jun 4, 2025 • 490 • 496

YaxinLuo/mmbench

Preview • Updated Apr 1, 2025 • 8