view article Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 626
view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 389
view article Article Building for an Open Future - our new partnership with Google Cloud jeffboudier, pagezyhf • Nov 13, 2025 • 48
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference mfuntowicz, hlarcher • Jan 16, 2025 • 76
view article Article Deploy models on AWS Inferentia2 from Hugging Face jeffboudier, philschmid • May 22, 2024 • 14