In a Training Loop 🔄

Louis Ulmer

lulmer

AI & ML interests

NLP (semantic search, topic generation), computer vision (object detection), diffusion models

Recent Activity

liked a model about 1 month ago

zai-org/GLM-4.7-Flash

new activity about 1 month ago

stas/openwebtext-10k:Convert dataset to Parquet

upvoted a paper about 1 month ago

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published Jan 12 • 52

upvoted a paper 5 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

upvoted a paper 7 months ago

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

Paper • 2507.10524 • Published Jul 14, 2025 • 71

upvoted an article 8 months ago

Article

Bringing Fusion Down to Earth: ML for Stellarator Optimization

Jul 2, 2025

upvoted 2 articles 9 months ago

Article

🐯 Liger GRPO meets TRL

May 25, 2025

Article

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

May 21, 2025

upvoted a paper 11 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144

upvoted a collection about 1 year ago

Hymba

Collection

A series of Hybrid Small Language Models. • 3 items • Updated 16 days ago • 32

upvoted a paper over 1 year ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 151

upvoted an article almost 2 years ago

Article

Introduction to State Space Models (SSM)

Jul 19, 2024 • 208

upvoted a paper about 2 years ago

Hyena Hierarchy: Towards Larger Convolutional Language Models

Paper • 2302.10866 • Published Feb 21, 2023 • 7
