jasonjiang
mikinyaa
AI & ML interests
None yet
Recent Activity
upvoted a paper about 3 hours ago
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought upvoted a paper 1 day ago
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation upvoted a paper 4 days ago
On the Direction of RLVR Updates for LLM Reasoning: Identification and ExploitationOrganizations
None yet