Models - Video
VideoPrism: A Foundational Visual Encoder for Video Understanding
Paper • 2402.13217 • Published • 38
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper • 2402.17485 • Published • 194
Text Generation • Updated • 39.7k • 380
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Paper • 2403.01422 • Published • 30
World Model on Million-Length Video And Language With RingAttention
Paper • 2402.08268 • Published • 40
Valley: Video Assistant with Large Language model Enhanced abilitY
Paper • 2306.07207 • Published • 2
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
Paper • 2306.05424 • Published • 7
Image-to-Video • Updated • 160k • 2.08k
Text-to-Video • Updated • 3.05k • 1.3k
Text-to-Video • Updated • 69 • 191
FastVideo/FastMochi-diffusers
Text-to-Video • Updated • 12 • 19
Text-to-Video • Updated • 1.11k • 2.1k