Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers Paper • 2601.04890 • Published 1 day ago • 27
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 4 days ago • 23
PORT: Preference Optimization on Reasoning Traces Paper • 2406.16061 • Published Jun 23, 2024 • 1
Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets Paper • 2504.19981 • Published Apr 28, 2025
PORT: Preference Optimization on Reasoning Traces Paper • 2406.16061 • Published Jun 23, 2024 • 1
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 4 days ago • 23
view article Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance May 21, 2025 • 38
Investigating Regularization of Self-Play Language Models Paper • 2404.04291 • Published Apr 4, 2024 • 1
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published Jul 30, 2025 • 68