Adaptive Preference Optimization with Uncertainty-aware Utility Anchor Paper • 2509.10515 • Published Sep 3, 2025 • 1
UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models Paper • 2510.22588 • Published Oct 26, 2025 • 1
IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer Paper • 2511.22167 • Published Nov 27, 2025
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published Dec 8, 2025 • 78
Make an Offer They Can't Refuse: Grounding Bayesian Persuasion in Real-World Dialogues without Pre-Commitment Paper • 2510.13387 • Published Oct 15, 2025
Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia Paper • 2512.03318 • Published Dec 3, 2025 • 4
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs Paper • 2510.10689 • Published Oct 12, 2025 • 47
V-HUB: A Visual-Centric Humor Understanding Benchmark for Video LLMs Paper • 2509.25773 • Published Sep 30, 2025
Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception Paper • 2510.12720 • Published Oct 14, 2025 • 2
Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space Paper • 2505.13308 • Published May 19, 2025 • 27
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts Paper • 2503.22952 • Published Mar 29, 2025 • 17