Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ldwang 's Collections
MiscSpaces
MiscAgentic
MiscIndustry
MiscKernel
MiscR1
MiscModels
MiscDatasets
MiscTools

MiscSpaces

updated Nov 6, 2025
Upvote
1

  • Running
    588

    Scaling test-time compute

    πŸ“ˆ
    588

    Implement test-time compute scaling for math problems


  • Running
    Featured
    1.25k

    FineWeb: decanting the web for the finest text data at scale

    🍷
    1.25k

    Generate high-quality text data for LLMs using FineWeb


  • Running
    3.62k

    The Ultra-Scale Playbook

    🌌
    3.62k

    The ultimate guide to training LLM on large GPU Clusters


  • Running
    215

    FineVision: Open Data is All You Need

    πŸ“
    215

    A new open-source dataset for training VLMs


  • Sleeping
    19

    Megatron Memory Estimator

    πŸ‘
    19

    Estimate GPU memory usage for Megatron models


  • Running on Zero
    19

    Smol2Operator Demo

    🐒
    19

    Smol2Operator Demo: GUI Agent Model


  • Running on CPU Upgrade
    Featured
    2.78k

    The Smol Training Playbook

    πŸ“š
    2.78k

    The secrets to building world-class LLMs


  • Running
    74

    Unlocking On-Policy Distillation for Any Model Family

    πŸ“
    74

    Apply on-policy distillation to any model family

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs