Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation • Paper • 2507.10524 • Published Jul 14, 2025
Ranger21: a synergistic deep learning optimizer • Paper • 2106.13731 • Published Jun 25, 2021