verl-agent / examples /data_preprocess /preprocess_search_r1_dataset.py

Commit History

Add search-r1 experiments (tool-calling) & the resutls of GiGPO on search-r1 experiments & similarity-based GiGPO (#159)
44be5f4
unverified

Lang Feng commited on

Major Update: merge latest verl (#54)
823e1b4
unverified

Lang Feng 湛露先生 AoShen Franz Srambical BaiqingL BearBiscuit HL Blue Space tongyx361 Mantas Bakšys Mert Unsal yangwang92 Patrik Bartak Junrong Lin BearBiscuit05 Leege233 sicer wuxibin shengguangming wangfuchun-fc DuCross Qunhong Zeng Dai, Weinan runluo PPPisOK hiyouga Haibin Lin pengsun000 Hunter Zhang Changlong Xiang Long zyzshishui wangyeyeye zhaochenyang20 jrlin HollowMan6 Tianyun Zhao icejieke lixiaoguang12 Earl St Sauver Hongpeng Guo Wang Zhang Wang Zhang Jinn yhyang201 QQSong Changlong Yu changlyu chenhaiquan yushengsu Yusheng Su none06630663 ShareLer Xiang Long Rihong Qiu Yuhua Jiang Swtheking openhands Wei Wu commited on