byteshape/Devstral-Small-2-24B-Instruct-2512-GGUF • Text Generation • 24B • Updated 8 days ago • 8.79k downloads • 20 likes
Post: We collaborated with Hugging Face to enable you to train MoE models 12× faster with 35% less VRAM via our new Triton kernels (no accuracy loss). 🤗 Train gpt-oss locally on 12.8GB VRAM with our free notebooks: https://unsloth.ai/docs/new/faster-moe
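A minimal sketch of what a local Unsloth fine-tuning setup along these lines typically looks like. The checkpoint name `unsloth/gpt-oss-20b`, sequence length, and LoRA settings below are illustrative assumptions, not taken from the post; the linked notebooks are the authoritative configuration:

```python
from unsloth import FastLanguageModel

# Load the model in 4-bit to keep VRAM usage low (assumed checkpoint name).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/gpt-oss-20b",
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters so only a small fraction of weights are trained;
# rank and target modules here are typical defaults, not the post's values.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)
```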
noctrex/Nemotron-3-Nano-30B-A3B-MXFP4_MOE-GGUF • Text Generation • 32B • Updated Dec 21, 2025 • 4.11k downloads • 17 likes