Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
khazarai
/
Math-RL
like
1
Follow
KhazarAI
10
Text Generation
Transformers
Safetensors
HoangHa/pensez-grpo
English
qwen2
math
trl
unsloth
grpo
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Math-RL
Commit History
Upload 5 files
1d78f60
verified
Rustamshry
commited on
7 days ago
Delete tokenizer_config.json
5295bc6
verified
Rustamshry
commited on
7 days ago
Update README.md
40df86c
verified
Rustamshry
commited on
13 days ago
Update README.md
81019de
verified
Rustamshry
commited on
13 days ago
Upload tokenizer
8f3cab6
verified
Rustamshry
commited on
13 days ago
Upload Qwen2ForCausalLM
6700da7
verified
Rustamshry
commited on
13 days ago
initial commit
b256aae
verified
Rustamshry
commited on
13 days ago