LRAT: Learning to Retrieve from Agent Trajectories Collection Official resources for LRAT, including trajectory-trained dense retrievers and the LRAT training dataset for agentic search. • 3 items • Updated about 6 hours ago • 1
When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs Paper • 2601.11000 • Published Jan 16 • 27
MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching Paper • 2601.10712 • Published Jan 15 • 24
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data Paper • 2511.12609 • Published Nov 16, 2025 • 106
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2, 2025 • 190
NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search Paper • 2505.14680 • Published May 20, 2025 • 9
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published May 8, 2025 • 187