MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published Jan 12 • 52
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10, 2025 • 190
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Paper • 2507.10524 • Published Jul 14, 2025 • 71
Article Bringing Fusion Down to Earth: ML for Stellarator Optimization Jul 2, 2025 • 78
Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance May 21, 2025 • 39
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18, 2025 • 144
Hymba Collection A series of Hybrid Small Language Models. • 3 items • Updated 16 days ago • 32
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1, 2024 • 151
Hyena Hierarchy: Towards Larger Convolutional Language Models Paper • 2302.10866 • Published Feb 21, 2023 • 7