From Model Scaling to System Scaling: Scaling the Harness in Agentic AI • Paper • 2605.26112 • Published May 25 • 9
COMPASS: COntinual Multilingual PEFT with Adaptive Semantic Sampling • Paper • 2604.20720 • Published Apr 22 • 2
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages • Paper • 2406.10118 • Published Jun 14, 2024 • 32
Grid2Matrix: Revealing Digital Agnosia in Vision-Language Models • Paper • 2604.09687 • Published Apr 14 • 8
Combee: Scaling Prompt Learning for Self-Improving Language Model Agents • Paper • 2604.04247 • Published Apr 5 • 31
Can AI Agents Answer Your Data Questions? A Benchmark for Data Agents • Paper • 2603.20576 • Published Mar 21 • 4
SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing • Paper • 2603.08982 • Published Mar 9 • 16
V_1: Unifying Generation and Self-Verification for Parallel Reasoners • Paper • 2603.04304 • Published Mar 4 • 14
SLA2: Sparse-Linear Attention with Learnable Routing and QAT • Paper • 2602.12675 • Published Feb 13 • 59
AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions • Paper • 2602.06008 • Published Feb 5 • 5
MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning • Paper • 2512.16909 • Published Dec 18, 2025 • 3
Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware • Paper • 2505.09601 • Published May 14, 2025 • 6
In-Context Imitation Learning via Next-Token Prediction • Paper • 2408.15980 • Published Aug 28, 2024 • 10