RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation • 8B • Updated 24 days ago • 37.5k • 9
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3 Text Generation • 1.0B • Updated 17 days ago • 6.27k • 1
Quantization Formats & CUDA Compute Capability Support Space • Running • 1