LGAI-EXAONE/K-EXAONE-236B-A23B Text Generation • 237B • Updated about 8 hours ago • 250 • 267
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7, 2025 • 141
Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought Paper • 2510.04230 • Published Oct 5, 2025 • 26
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published Apr 18, 2025 • 139
trillionlabs/Trillion-LLaVA-7B Visual Question Answering • 8B • Updated Apr 20, 2025 • 36 • 11
meta-llama/Llama-3.3-70B-Instruct Text Generation • 71B • Updated Dec 21, 2024 • 323k • 2.61k
Qwen/Qwen2.5-Coder-32B-Instruct Text Generation • 33B • Updated Jan 12, 2025 • 199k • 1.96k