Active filters: 4-bit

Text Generation • 358B • Updated • 8.57k • 12

Disty0/Qwen-Image-Edit-2511-SDNQ-uint4-svd-r32
Image-to-Image • Updated • 241 • 6

tencent/HY-MT1.5-1.8B-GPTQ-Int4
Translation • 2B • Updated • 5

tencent/HY-MT1.5-7B-GPTQ-Int4
Translation • 8B • Updated • 5

QuantTrio/GLM-4.7-GPTQ-Int4-Int8Mix
Text Generation • 390B • Updated • 80 • 4

mlx-community/MiniMax-M2.1-4bit
Text Generation • 229B • Updated • 680 • 4

QuantTrio/MiniMax-M2.1-AWQ
Text Generation • 229B • Updated • 586 • 4

TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ
7B • Updated • 195 • 61

MaziyarPanahi/TheTop-5x7B-Instruct-S5-v0.1-GGUF
Text Generation • 7B • Updated • 45 • 3

unsloth/Phi-3-mini-4k-instruct-bnb-4bit
Text Generation • 4B • Updated • 37.5k • 42

MaziyarPanahi/Mistral-Nemo-Instruct-2407-GGUF
Text Generation • 12B • Updated • 165k • 50

ICEPVP8977/Uncensored_Qwen2.5_Coder_7B_4_bit_quantized_Seaftensors
8B • Updated • 35 • 3

Text Generation • 0.6B • Updated • 70.8k • 22

lmstudio-community/Devstral-Small-2507-MLX-4bit
Text Generation • 24B • Updated • 27.6k • 5

mlx-community/gpt-oss-20b-MXFP4-Q8
Text Generation • Updated • 676k • 22

nota-ai/Qwen3-30B-A3B-NotaMoEQuant-Int4
Text Generation • 0.6B • Updated • 129 • 4

Disty0/Qwen-Image-Layered-SDNQ-uint4-svd-r32
Updated • 48 • 3

nota-ai/GLM-4.5-Air-NotaMoeQuant-Int4
Text Generation • 1B • Updated • 63 • 2

nightmedia/Qwen3-4B-Agent-F32-dwq4-mlx
Text Generation • 0.8B • Updated • 183 • 2

fifrio/gemma-3-4b-it-gptq-4bit-calibration-Swahili-128samples
4B • Updated • 77 • 2

Text-to-Speech • 0.5B • Updated • 25 • 2

TevunahAi/Nemotron-3-Nano-30B-A3B-GPTQ
Text Generation • 6B • Updated • 856 • 2

Intel/GLM-4.7-int4-mixed-AutoRound
Text Generation • 2B • Updated • 22 • 2

TheBloke/WizardLM-33B-V1-0-Uncensored-SuperHOT-8K-GPTQ
Text Generation • 33B • Updated • 40 • 93

MaziyarPanahi/TheTop-5x7B-Instruct-T-v0.1-GGUF
Text Generation • 7B • Updated • 51 • 1

CohereLabs/c4ai-command-r-v01-4bit
Text Generation • 35B • Updated • 31 • 176

Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4
Text Generation • 14B • Updated • 1.34k • 49

casperhansen/llama-3-8b-instruct-awq
Text Generation • 8B • Updated • 8.46k • 26

casperhansen/llama-3-70b-instruct-awq
Text Generation • 71B • Updated • 1.1k • 70

solidrust/Llama-3-8B-Lexi-Uncensored-AWQ
Text Generation • 8B • Updated • 81.4k • 4