SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 4 days ago • 19
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 4 days ago • 19
Bridging the Long-Term Gap: A Memory-Active Policy for Multi-Session Task-Oriented Dialogue Paper • 2505.20231 • Published May 26, 2025
ReSURE: Regularizing Supervision Unreliability for Multi-turn Dialogue Fine-tuning Paper • 2508.19996 • Published Aug 27, 2025
Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents Paper • 2512.20092 • Published 17 days ago • 8
Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents Paper • 2512.20092 • Published 17 days ago • 8