arxiv:2509.20712
Fuzheng Zhang
Edrex
AI & ML interests
None yet
Recent Activity
updated a model 28 days ago: Edrex/gemma-3-1B-it-reasoning
published a model 28 days ago: Edrex/gemma-3-1B-it-reasoning
authored a paper 2 months ago: CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning
Organizations
None yet