zhangzhifang/verl-agent
Safetensors
arxiv:
12 papers
main
verl-agent/examples/search (14.9 kB)
100 contributors
History:
1 commit
Lang Feng
Add search-r1 experiments (tool-calling) & the results of GiGPO on search-r1 experiments & similarity-based GiGPO (#159)
44be5f4
unverified
8 months ago
retriever
Add search-r1 experiments (tool-calling) & the results of GiGPO on search-r1 experiments & similarity-based GiGPO (#159)
8 months ago
searchr1_download.py
814 Bytes
Add search-r1 experiments (tool-calling) & the results of GiGPO on search-r1 experiments & similarity-based GiGPO (#159)
8 months ago