Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
amang1802 's Collections
ThinkTransformer experiments
Smol-Math
Small model pretraining experiments
PPO experiments
Synthetic Data rewrite (model checkpoints)
Synthetic Data rewrite research (training and eval datasets)
WildeWeb Research

ThinkTransformer experiments

updated Feb 22, 2025

Experiments with new architecture that enables latent space reasoning

Upvote
-

  • amang1802/think_fineweb-edu_chkpts_exp2

    Updated Feb 20, 2025

  • amang1802/think_fineweb-edu_chkpts_exp11

    Updated Feb 22, 2025
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs