AI & ML interests

Omni-modal Large Language Models, Multi-modal Large Language Models (MLLMs), Emotional spoken dialogue

Organization Card

👋 Welcome to EMOVA! We are a team focused on fully open-source omni-modal foundation models with visual, textual, and speech capabilities. EMOVA (EMotionally Omni-present Voice Assistant) is a novel omni-modal large language model that offers end-to-end speech capabilities while maintaining state-of-the-art vision-language performance. We aim to advance omni-modal interaction between humans and intelligent models!