VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control Paper • 2601.05138 • Published about 22 hours ago • 11
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice Paper • 2601.05175 • Published about 21 hours ago • 13
ThinkRL-Edit: Thinking in Reinforcement Learning for Reasoning-Centric Image Editing Paper • 2601.03467 • Published 3 days ago • 4
Klear: Unified Multi-Task Audio-Video Joint Generation Paper • 2601.04151 • Published 2 days ago • 12
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks Paper • 2601.02439 • Published 4 days ago • 13
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 5 days ago • 32
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 4 days ago • 23
OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment Paper • 2601.01576 • Published 5 days ago • 8
Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes Paper • 2601.02356 • Published 4 days ago • 13
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published 4 days ago • 28