mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition • Updated 3 days ago • 5.99k • 532
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 225
Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation Paper • 2602.01756 • Published 14 days ago • 22
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation Paper • 2602.03796 • Published 12 days ago • 57
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published 16 days ago • 181
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation Paper • 2601.22153 • Published 17 days ago • 68
The Eiffel Tower Llama 📝 Running • 108 • Explore the Eiffel Tower Llama experiment with open-source models
Z Image Turbo 🏃 Running on Zero • MCP • Featured • 1.71k • Generate images from text prompts with adjustable size and seed
Supertonic TTS WebGPU ⚡ Running • Featured • 104 • Blazingly fast text-to-speech 100% locally in your browser
Transformers v5: Simple model definitions powering the AI ecosystem Article • Dec 1, 2025 • 298