Building AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned Paper • 2603.05344 • Published Mar 5, 2026 • 7
SpareCodeSearch: Searching for Code Context When You Have No Spare GPU Paper • 2510.12948 • Published Oct 14, 2025
SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios Paper • 2512.18470 • Published Dec 20, 2025 • 12
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology Paper • 2406.11912 • Published Jun 16, 2024 • 27
CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition Paper • 2505.13380 • Published May 19, 2025 • 5
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking Paper • 2503.00955 • Published Mar 2, 2025 • 28
The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation Paper • 2305.06156 • Published May 9, 2023 • 2
XMainframe: A Large Language Model for Mainframe Modernization Paper • 2408.04660 • Published Aug 5, 2024
Improving the detection of technical debt in Java source code with an enriched dataset Paper • 2411.05457 • Published Nov 8, 2024 • 2
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models Paper • 2411.00918 • Published Nov 1, 2024 • 9
CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities of CodeLLMs Paper • 2410.01999 • Published Oct 2, 2024 • 10