Behavior Knowledge Merge in Reinforced Agentic Models Paper • 2601.13572 • Published 5 days ago • 21
Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning Paper • 2601.16163 • Published 2 days ago • 9
knowledgator/modern-gliner-bi-large-v1.0 Token Classification • Updated Feb 26, 2025 • 173 • 62
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 9 items • Updated 3 days ago • 197
LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR Paper • 2601.14251 • Published 4 days ago • 21
Language of Thought Shapes Output Diversity in Large Language Models Paper • 2601.11227 • Published 8 days ago • 7
BigVGAN Collection BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input. • 11 items • Updated 4 days ago • 16