AI & ML interests
Reasoning and Alignment for Large Language Models
dongguanting/ARPO-RL-DeepSearch-1K • 1.07k rows • 84 downloads • 6 likes
dongguanting/ARPO-RL-Reasoning-10K • 10k rows • 153 downloads • 4 likes
dongguanting/ARPO-SFT-54K • 54.6k rows • 142 downloads • 14 likes
dongguanting/RAG-Error-Critic-100K • 100k rows • 40 downloads • 3 likes
dongguanting/Tool-Star-SFT-54K • 54k rows • 112 downloads • 10 likes
dongguanting/Multi-Tool-RL-10K • 10k rows • 85 downloads • 5 likes
[dataset name missing] • 32.8k rows • 56 downloads • 2 likes
dongguanting/ShareGPT-12K • 12.9k rows • 64 downloads • 1 like
dongguanting/VIF-RAG-QA-110K • 111k rows • 82 downloads • 7 likes
[dataset name missing] • 574k rows • 64 downloads • 2 likes
dongguanting/VIF-RAG-QA-20K • 20k rows • 12 downloads • 4 likes