Add GSPO to verl-agent (#179) 873b913 unverified Markus Kristian Junttila MarkusNokia committed on Oct 15, 2025
Add search-r1 experiments (tool-calling) & the results of GiGPO on search-r1 experiments & similarity-based GiGPO (#159) 44be5f4 unverified Lang Feng committed on Sep 3, 2025
add 'resources_per_worker' config for easily managing cpus/gpus of each env worker (#148) bae6790 unverified Lang Feng committed on Aug 20, 2025
Add memory manager & Use config in env_manager (#90) cfe3636 unverified Lang Feng committed on Jun 29, 2025
remove appworld folder and adjust the appworld worker (#80) 37a9e4a unverified Lang Feng committed on Jun 19, 2025
Major Update: merge latest verl (#54) 823e1b4 unverified Lang Feng 湛露先生 AoShen Franz Srambical BaiqingL BearBiscuit HL Blue Space tongyx361 Mantas Bakšys Mert Unsal yangwang92 Patrik Bartak Junrong Lin BearBiscuit05 Leege233 sicer wuxibin shengguangming wangfuchun-fc DuCross Qunhong Zeng Dai, Weinan runluo PPPisOK hiyouga Haibin Lin pengsun000 Hunter Zhang Changlong Xiang Long zyzshishui wangyeyeye zhaochenyang20 jrlin HollowMan6 Tianyun Zhao icejieke lixiaoguang12 Earl St Sauver Hongpeng Guo Wang Zhang Wang Zhang Jinn yhyang201 QQSong Changlong Yu changlyu chenhaiquan yushengsu Yusheng Su none06630663 ShareLer Xiang Long Rihong Qiu Yuhua Jiang Swtheking openhands Wei Wu committed on Jun 3, 2025
Achieve unbiased estimate for GiGPO; change the history length of Alfworld to 2 eae9a00 langfeng01 committed on Apr 24, 2025
IMPORTANT: achieve two modes of GiGPO: 'mean_std_norm' and 'leave_one_out' 9d6a877 Feng Lang committed on Apr 23, 2025
IMPORTANT: DAPO (https://arxiv.org/abs/2503.14476) support! 85ac871 Feng Lang committed on Apr 21, 2025
address issue of info['won'] and update readme and setup file 5401fe7 Feng Lang committed on Apr 20, 2025
IMPORTANT: achieve the convergence of sokoban and ezpoints 01e15c5 langfeng01 committed on Apr 19, 2025
IMPORTANT: implementation of sokoban environments: support two modalities: text and visual 749fe10 langfeng01 committed on Apr 14, 2025