-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
Text Generation
•
22B
•
Updated
•
6.67M
•
•
4.15k
Text Generation
•
120B
•
Updated
•
3.67M
•
•
4.31k
openai/gpt-oss-safeguard-20b
Text Generation
•
22B
•
Updated
•
36k
•
•
174
openai/gpt-oss-safeguard-120b
Text Generation
•
120B
•
Updated
•
17.1k
•
79
lukealonso/MiniMax-M2.1-NVFP4
115B
•
Updated
•
624
•
3
nvidia/Llama-3.3-70B-Instruct-NVFP4
41B
•
Updated
•
37.2k
•
28
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4
56B
•
Updated
•
24.1k
•
15
tiiuae/Falcon-H1-7B-Instruct-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
35
•
2
NVFP4/Qwen3-235B-A22B-Instruct-2507-FP4
Text Generation
•
118B
•
Updated
•
9.58k
•
3
Firworks/Kimi-Linear-48B-A3B-Instruct-nvfp4
28B
•
Updated
•
275
•
9
Text Generation
•
22B
•
Updated
•
12
•
2
speakleash/Bielik-11B-v3.0-Instruct-MLX-8bit
Text Generation
•
11B
•
Updated
•
39
•
2
MaziyarPanahi/TheTop-5x7B-Instruct-S5-v0.1-GGUF
Text Generation
•
7B
•
Updated
•
32
•
3
MaziyarPanahi/gemma-7b-GGUF
Text Generation
•
9B
•
Updated
•
1.35k
•
12
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF
Text Generation
•
7B
•
Updated
•
169k
•
129
AayushMathur/manim-codellama-7b
Updated
•
3
•
2
MaziyarPanahi/Mistral-Nemo-Instruct-2407-GGUF
Text Generation
•
12B
•
Updated
•
165k
•
50
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation
•
8B
•
Updated
•
152k
•
31
MaziyarPanahi/reader-lm-0.5b-GGUF
Text Generation
•
0.5B
•
Updated
•
1.42k
•
4
MaziyarPanahi/reader-lm-1.5b-GGUF
Text Generation
•
2B
•
Updated
•
964
•
2
MaziyarPanahi/Llama-3.2-1B-Instruct-GGUF
Text Generation
•
1B
•
Updated
•
158k
•
17
mlx-community/Qwen3-0.6B-8bit
Text Generation
•
Updated
•
5.38k
•
6
MaziyarPanahi/Qwen3-14B-GGUF
Text Generation
•
15B
•
Updated
•
220k
•
4
tranhuonglan/qwen3-06B-base-smoothquant08-gptqmodifier-w4a8-linear
0.8B
•
Updated
•
11
•
1
RedHatAI/gemma-3-1b-it-quantized.w8a8
Text Generation
•
1B
•
Updated
•
60.5k
•
1
Text Generation
•
19B
•
Updated
•
426
•
5
osxest/Qwen2.5-Coder-32B-Instruct-Uncensored-mlx-8Bit
Text Generation
•
9B
•
Updated
•
72
•
1
mlx-community/gemma-3n-E2B-8bit
Image-Text-to-Text
•
Updated
•
30
•
2
mlx-community/gemma-3n-E4B-it-8bit
Image-Text-to-Text
•
Updated
•
48
•
2
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation
•
133B
•
Updated
•
3.4k
•
9