KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper 5 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

upvoted a paper 6 days ago

Latent Collaboration in Multi-Agent Systems

upvoted a paper 13 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Organizations

liked 3 datasets about 1 month ago

liked a model 3 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24 • 22.2k • 508

liked a dataset 3 months ago

inclusionAI/ASearcher-train-data

Preview • Updated Aug 13 • 237 • 21

liked 2 datasets 4 months ago

We-Math/We-Math2.0-Pro

Viewer • Updated Aug 19 • 4.55k • 186 • 21

We-Math/We-Math2.0-Standard

Viewer • Updated Aug 19 • 5.84k • 163 • 22

liked 2 models 4 months ago

Kwai-Klear/Klear-Reasoner-8B

8B • Updated Sep 27 • 22 • 19

dongguanting/RAG-Critic-3B

Text Generation • 3B • Updated Jun 28 • 83 • 3

liked 3 datasets 5 months ago

dongguanting/ARPO-SFT-54K

Viewer • Updated Oct 17 • 54.6k • 186 • 14

dongguanting/ARPO-RL-DeepSearch-1K

Viewer • Updated Oct 17 • 1.07k • 107 • 5

dongguanting/ARPO-RL-Reasoning-10K

Viewer • Updated Oct 17 • 10k • 123 • 3

liked 8 models 5 months ago

dongguanting/Llama3.1-8B-ARPO

Text Generation • 8B • Updated Aug 12 • 16 • 1

dongguanting/Qwen3-14B-ARPO-DeepSearch

Text Generation • 15B • Updated Aug 12 • 18 • 5

dongguanting/Qwen2.5-7B-ARPO

Text Generation • 8B • Updated Aug 19 • 923 • 2

dongguanting/Qwen3-8B-ARPO-DeepSearch

8B • Updated Jul 29 • 9 • 2

dongguanting/Qwen2.5-3B-ARPO

Text Generation • 3B • Updated Aug 12 • 13 • 3

dongguanting/Tool-Star-Qwen-1.5B

Text Generation • 2B • Updated Jun 6 • 7 • 2

dongguanting/Tool-Star-Qwen-0.5B

Text Generation • 0.6B • Updated Jun 6 • 4 • 1

dongguanting/Tool-Star-Qwen-7B

Text Generation • 8B • Updated Jun 30 • 46 • 2
