Yihe Deng PRO
ydeng9
AI & ML interests
LLM post-training
Recent Activity
published a dataset 13 days ago: DuoGuard/duoguard-iter1-data
published a dataset 13 days ago: DuoGuard/duoguard-seed-data
updated a dataset 2 months ago: ydeng9/OpenVLThinker-grpo-hard