Models - Video
updated
VideoPrism: A Foundational Visual Encoder for Video Understanding
Paper
• 2402.13217
• Published • 38
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with
Audio2Video Diffusion Model under Weak Conditions
Paper
• 2402.17485
• Published • 194
Text Generation
• Updated • 88.1k
• 381
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Paper
• 2403.01422
• Published • 30
World Model on Million-Length Video And Language With RingAttention
Paper
• 2402.08268
• Published • 40
Valley: Video Assistant with Large Language model Enhanced abilitY
Paper
• 2306.07207
• Published • 3
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and
Language Models
Paper
• 2306.05424
• Published • 7
Image-to-Video
• Updated • 403k
• • 2.13k
Text-to-Video
• Updated • 11.5k
• • 1.31k
Text-to-Video
• Updated • 30
• 191
FastVideo/FastMochi-diffusers
Text-to-Video
• Updated • 5
• 19
Text-to-Video
• Updated • 831
• • 2.14k