- Towards Scalable Pre-training of Visual Tokenizers for Generation
  Paper • 2512.13687 • Published • 106
- MMGR: Multi-Modal Generative Reasoning
  Paper • 2512.14691 • Published • 121
- Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
  Paper • 2512.23447 • Published • 99
- LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
  Paper • 2512.23576 • Published • 66
diege
fulandiege
AI & ML interests
None yet
Recent Activity
upvoted a collection 3 days ago
OneVL Models
updated a collection 5 days ago
papers
updated a collection 18 days ago
papers
Organizations
None yet