view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 19 days ago • 480
Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery Paper • 2601.20088 • Published Jan 27 • 3