Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 20 days ago • 66
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence Paper • 2603.13398 • Published 28 days ago • 152
CodePercept: Code-Grounded Visual STEM Perception for MLLMs Paper • 2603.10757 • Published 29 days ago • 14
OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution Paper • 2601.20380 • Published Jan 28 • 9
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control Paper • 2601.05138 • Published Jan 8 • 18
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published Jan 8 • 57
RelayLLM: Efficient Reasoning via Collaborative Decoding Paper • 2601.05167 • Published Jan 8 • 31