ben burtenshaw
AI & ML interests
None yet
Recent Activity
updated a dataset about 5 hours ago
context-course/certificates
updated a dataset about 6 hours ago
huggingface-course/supervised-finetuning_quiz_student_responses
updated a dataset about 9 hours ago
agents-course/certificates
Organizations
Posts 39
Post
8440
The smol course has a distinctive approach to teaching post-training, so I'm posting about how it’s different from other post-training courses, including the LLM course that’s already available.
In short, the smol course is more direct than any of the other courses, and is intended for semi-pro post-trainers.
- It’s a minimal set of instructions on the core parts.
- It’s intended to bootstrap real projects you're working on.
- The material hands over to existing documentation for details.
- Likewise, it hands over to the LLM course for basics.
- Assessment is based on a leaderboard, so you can take part without reading all the material.
To start the smol course, follow here:
smol-course
Articles 37
Article
47
DeepSeek-V4: a million-token context that agents can actually use
VLMs playing atari
- Running · RL · 2
Agentic Environment - Atari Pong
🎮 2 · Play and control Atari 2600 games through a web interface
- Running
Agentic Environment - Atari PacMan
🎮 Control and monitor Atari games through a web interface
- Running · RL · 1
Agentic Environment - Atari Breakout
🎮 1 · Play Atari games and view results via a web UI
- Runtime error · Agents · Featured · 15
Qwen Atari
😻 15 · Play Atari games using a vision-language model
RL Environments
spaces 106
Running
Terminus Pi Trl Static 3bab6f
🎯
Explore your data with an interactive Trackio dashboard
Running
Terminus Pi Trl Static 12edaf
🎯
Visualize your data with an interactive Trackio dashboard
Running
Terminus Pi Trl Static 745011
🎯
View and monitor your data in an interactive dashboard
Running
Terminus Pi Trl Static A131d4
🎯
View and manage your tracking data in an interactive dashboard
Running
Terminus Pi Trl Static Fc86e2
🎯
View your project metrics in an interactive dashboard
Running
Agents
Terminus Pi Trl Trackio
🎯
Display an interactive map of your GPS tracks
models 74
burtenshaw/terminus-pi-trl-qwen3-4b-rollout-22189657
Text Generation • 4B • Updated • 16
burtenshaw/terminus-pi-trl-qwen3-4b-rollout-22189531
Updated
burtenshaw/terminus-pi-trl-qwen3-4b-rollout-22189522
Updated
burtenshaw/terminus-pi-trl-qwen3-4b-rollout-22189282
Updated
burtenshaw/terminus-pi-trl-async-grpo-qwen3-4b
Text Generation • 4B • Updated • 32
burtenshaw/terminus-pi-trl-qwen35-4b-200-beta
Text Generation • 4B • Updated • 49
burtenshaw/terminus-pi-trl-qwen35-4b-200-alpha
Text Generation • 4B • Updated • 70
burtenshaw/terminus-pi-trl-async-grpo-qwen35-4b-range-04-20260529163009
Text Generation • 4B • Updated • 46
burtenshaw/terminus-pi-trl-async-grpo-qwen35-4b-range-03-20260529163004
Text Generation • 4B • Updated • 26
burtenshaw/terminus-pi-trl-async-grpo-qwen35-4b-range-02-20260529162322
Text Generation • 4B • Updated • 123
datasets 66
burtenshaw/terminus-pi-trl-tasks
Viewer • Updated • 8 • 44
burtenshaw/transformers-pr-slop-dataset
Viewer • Updated • 1.46M • 2.18k • 2
burtenshaw/1-million-rows
Updated • 31
burtenshaw/european-cities
Viewer • Updated • 40 • 30
burtenshaw/european-countries
Viewer • Updated • 47 • 22
burtenshaw/ptc-optimized-kernel-job-inputs
Updated • 13
burtenshaw/kernel-skill-source
Updated • 123
burtenshaw/qwen3-5-0-8b-rmsnorm-experiment
Viewer • Updated • 33 • 138
burtenshaw/hub-stats-papers-last-week-2026-03-28
Preview • Updated • 47
burtenshaw/test-rlm-sft
Viewer • Updated • 11 • 24