VIDEOP2R: Video Understanding from Perception to Reasoning Paper β’ 2511.11113 β’ Published Nov 14, 2025 β’ 112
Quantile Advantage Estimation for Entropy-Safe Reasoning Paper β’ 2509.22611 β’ Published Sep 26, 2025 β’ 118
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper β’ 2504.13837 β’ Published Apr 18, 2025 β’ 139
Configuration error 121 Berkeley Function Calling Leaderboard π 121 Display Berkeley Function-Calling Leaderboard