openai/whisper-large-v3 • Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 6.87M • 5.31k
google/pix2struct-widget-captioning-large • Visual Question Answering • 1B • Updated Apr 10, 2024 • 51 • 20
Salesforce/blip-image-captioning-large • Image-to-Text • 0.5B • Updated Feb 3, 2025 • 1.38M • 1.44k
mistralai/Mistral-7B-Instruct-v0.1 • Text Generation • 7B • Updated Jul 24, 2025 • 321k • 1.82k
Edit Video By Editing Text • Featured • Runtime error • 272 • Audio-based video editing using AI-generated transcription
AudioLDM2 Text2Audio Text2Music Generation • Featured • Runtime error • 307 • Generate audio and waveform video from text