-
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper • 2509.07980 • Published • 101 -
Robot Learning from a Physical World Model
Paper • 2511.07416 • Published • 30 -
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Paper • 2511.06805 • Published • 12 -
GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms
Paper • 2511.17592 • Published • 118
Harihara Valliappan
HarishValliappan
·
AI & ML interests
None yet
Recent Activity
updated
a collection
about 7 hours ago
RL
upvoted
a
paper
about 7 hours ago
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
updated
a collection
about 10 hours ago
RL
Organizations
None yet