avewright/chess-positions-lichess-sf
Viewer β’ Updated β’ 885M β’ 227 β’ 1
A 204M parameter chess transformer trained on Stockfish-labeled positions from Lichess games.
| Step | Positions | Accuracy | Top-3 | SF Rank | Value Acc |
|---|---|---|---|---|---|
| 0 | 0 | 16.3% | 41.8% | 66.6 | 78.5% |
ChessTransformer200M (~204M params)
βββ FusedBoardEncoder (embed_dim=256)
βββ Linear projection (256 β 1024)
βββ CLS token + positional embeddings (68 positions)
βββ TransformerEncoder (16 layers, 16 heads, FFN 4096, GELU, norm_first)
βββ LayerNorm
βββ SpatialPolicyHead (head_dim=512) β 1968 moves
βββ Value head (1024 β 512 β 3 WDL)
best_model.pt β best checkpoint (state_dict only)training_log.json β full eval history config.json β training configurationtrain.log β aggregated worker logsfrom huggingface_hub import hf_hub_download
import torch
path = hf_hub_download("avewright/chess-transformer-200m-v2", "best_model.pt")
state_dict = torch.load(path, map_location="cpu", weights_only=True)
# Load into ChessTransformer200M architecture