arxiv:2502.11663
Liu Yichen
lyclyc52
·
AI & ML interests
Computer Vision
Recent Activity
upvoted
a
paper
about 5 hours ago
SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning
upvoted
a
paper
3 months ago
From Pixels to Words -- Towards Native Vision-Language Primitives at
Scale
upvoted
a
paper
3 months ago
InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn
Dialogue