TokenPilot: Cache-Efficient Context Management for LLM Agents Paper • 2606.17016 • Published 9 days ago • 16
LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories Paper • 2606.13578 • Published 13 days ago • 54
When Should Models Change Their Minds? Contextual Belief Management in Large Language Models Paper • 2605.30219 • Published 27 days ago • 26
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning Paper • 2605.30260 • Published 27 days ago • 44
Rethinking Memory as Continuously Evolving Connectivity Paper • 2605.28773 • Published 28 days ago • 34
MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems Paper • 2605.28732 • Published 28 days ago • 40
SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research Paper • 2605.22878 • Published May 20 • 59
OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models Paper • 2605.00877 • Published Apr 25 • 15
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language Paper • 2604.19667 • Published Apr 21 • 23
SkillX: Automatically Constructing Skill Knowledge Bases for Agents Paper • 2604.04804 • Published Apr 6 • 35
view article Article Create, Evaluate, and Connect AI Skills | SkillNet: A Large-Scale Agentic "Skill Graph" Knowledge Base xzwnlp • Feb 28 • 13
From Data to Behavior: Predicting Unintended Model Behaviors Before Training Paper • 2602.04735 • Published Feb 4 • 15
Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics Paper • 2602.02343 • Published Feb 2 • 13
Aligning Agentic World Models via Knowledgeable Experience Learning Paper • 2601.13247 • Published Jan 19 • 15
Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency Paper • 2601.05905 • Published Jan 9 • 21
InnoGym: Benchmarking the Innovation Potential of AI Agents Paper • 2512.01822 • Published Dec 1, 2025 • 36