Daniil Tiapkin's picture

2 8

Daniil Tiapkin

dtiapkin

·

https://d-tiapkin.github.io/

AI & ML interests

Reinforcement learning enjoyer

Recent Activity

upvoted a paper 22 days ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

upvoted a paper 2 months ago

GAS: Improving Discretization of Diffusion ODEs via Generalized Adversarial Solver

upvoted an article 6 months ago

SmolLM3: smol, multilingual, long-context reasoner

View all activity

Organizations

None yet

commented a paper 7 months ago

Accelerating Nash Learning from Human Feedback via Mirror Prox

Paper • 2505.19731 • Published May 26, 2025 • 6 •

commented a paper 11 months ago

On Teacher Hacking in Language Model Distillation

Paper • 2502.02671 • Published Feb 4, 2025 • 18 •