The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward Paper • 2509.07430 • Published Sep 9, 2025 • 3 • 2