Robert Mueller
bordeauxred
AI & ML interests
RL, RLHF, RLAIF, meta learning
Recent Activity
updated a model 1 day ago
GoodStartLabs/kimi-k26-textarena-async-64tok-200iter
published a model 1 day ago
GoodStartLabs/kimi-k26-textarena-async-64tok-200iter
updated a model 2 days ago
GoodStartLabs/qwen3-8b-openspiel-mix8-selfplay-randmix-1000iter