Inference Providers
Active filters: sglang
NinjaBoffin/MiniMax-M2.7-NVFP4
Text Generation
• 116B • Updated • 1.46k
• 7
mattbucci/Qwen3.6-35B-A3B-AWQ-CT
35B • Updated • 113
• 2
mattbucci/Qwen3.6-27B-AWQ
Updated • 481
• 2
AxionML/Qwen3.5-27B-NVFP4
Image-Text-to-Text
• 17B • Updated • 9.12k
• 10
AxionML/Qwen3.5-35B-A3B-NVFP4
Image-Text-to-Text
• Updated • 8k
• 6
Image-Text-to-Text
• 2B • Updated • 1.46k
• 2
thoughtworks/Gemma-4-31B-Eagle3
Text Generation
• 0.6B • Updated • 1.15k
• 3
thoughtworks/MiniMax-M2.5-Eagle3
Text Generation
• 0.2B • Updated • 3.72k
• 4
mattbucci/Qwen3.5-27B-AWQ
Text Generation
• Updated • 312
• 1
scottgl/Qwen3.5-122B-A10B-NVFP4-GB10
Text Generation
• 27B • Updated • 2.96k
• 3
dervig/m51Lab-MiniMax-M2.7-REAP-139B-A10B-NVFP4-GB10
Text Generation
• 79B • Updated • 1.28k
• 3
BBuf/ltx2-modelopt-fp8-sglang-transformer
19B • Updated • 16
• 1
0xSero/GLM-5.1-478B-A42B-REAP-NVFP4
Text Generation
• 280B • Updated • 1.69k
• 8
Text Generation
• Updated • 4
mattbucci/Qwen3.6-27B-AWQ-CT
27B • Updated • 1.79k
• 1
mattbucci/Qwen3.6-35B-A3B-AWQ
Updated • 468
• 1
SurfaceData/llava-v1.6-mistral-7b-sglang
Image-Text-to-Text
• 8B • Updated • 25
• 9
SurfaceData/llava-v1.6-vicuna-7b-sglang
Image-Text-to-Text
• 7B • Updated • 35
• 1
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 94
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 72
alvarobartt/grok-2-tokenizer
Text Generation
• Updated • 77
• 5
173B • Updated • 2.79k
• 35
mradermacher/MiniMax-M2-THRIFT-GGUF
JasmineBBB/Kimi-Linear-48B-A3B-Instruct-bnb-4bit
Text Generation
• 49B • Updated • 16
• 1
mradermacher/MiniMax-M2-THRIFT-i1-GGUF
173B • Updated • 135
• 10
bartowski/VibeStudio_MiniMax-M2-THRIFT-GGUF
Text Generation
• 173B • Updated • 405
• 8
osmapi/MiniMax-M2-THRIFT-55
106B • Updated • 261
• 5
JinnP/SGLang-EAGLE3-Qwen3-Coder-30B-A3B-Instruct
Text Generation
• 0.2B • Updated • 92
• 1
mradermacher/MiniMax-M2-THRIFT-55-GGUF
106B • Updated • 169
• 2