Neural Additive Experts: Context-Gated Experts for Controllable Model Additivity Paper • 2602.10585 • Published Feb 11 • 2
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published Feb 12 • 91
Routing the Lottery: Adaptive Subnetworks for Heterogeneous Data Paper • 2601.22141 • Published Jan 29 • 3
ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas Paper • 2601.21558 • Published Jan 29 • 59