AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation • Paper • 2604.08540 • Published Apr 9 • 5
Cohere Multilingual ASR 🎙 • Transcribe audio clips to text in multiple languages • Running on CPU Upgrade • Agents • Featured • 116
Wan2.2 14B Preview 🐌 • Generate a video from an image with a text prompt • Running on Zero • MCP • 2.74k
Trackers 🔥 • Track objects in your video and get an annotated result • Running on T4 • Agents • Featured • 80