Running 89 Unlocking On-Policy Distillation for Any Model Family 📝 89 Visualize on-policy distillation for any model family
Running on Zero 31 Gpt2 Multiplication Predictor 📈 31 Multiply large numbers using different reasoning methods