Extract tables, figures, and captions from documents
Higgs Audio Demo
Audio-Driven Multi-Person Conversational Video Generation
Detect and segment objects in images
On-Device Track Anything Model