MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 4 days ago • 29
Demystifying When Pruning Works via Representation Hierarchies Paper • 2603.24652 • Published 6 days ago • 14
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published 4 days ago • 60
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 6 days ago • 99
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook Paper • 2604.02029 • Published 10 days ago • 137
PRBench: End-to-end Paper Reproduction in Physics Research Paper • 2603.27646 • Published 14 days ago • 29
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models Paper • 2603.27481 • Published 14 days ago • 35
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward Paper • 2603.26599 • Published 15 days ago • 61
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 23 days ago • 330
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 14 days ago • 137
Emergent Social Intelligence Risks in Generative Multi-Agent Systems Paper • 2603.27771 • Published 13 days ago • 50
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published 17 days ago • 49
DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models Paper • 2603.23499 • Published 18 days ago • 51
Vega: Learning to Drive with Natural Language Instructions Paper • 2603.25741 • Published 16 days ago • 6
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 17 days ago • 126
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation Paper • 2603.22117 • Published 19 days ago • 29
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 20 days ago • 122
Alignment Makes Language Models Normative, Not Descriptive Paper • 2603.17218 • Published 25 days ago • 46
Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding Paper • 2603.13366 • Published Mar 9 • 94