Active filters: int4
codgician/Qwen3.5-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-GPTQ-int4
Image-Text-to-Text • 36B • Updated • 1.28k • 4
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation • 14B • Updated • 765k • 20
0xSero/Kimi-K2.5-PRISM-REAP-72
Text Generation • 91B • Updated • 394 • 8
codgician/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GPTQ-int4
Image-Text-to-Text • 28B • Updated • 8.12k • 4
happypatrick/Qwen3.5-122B-A10B-heretic-int4-AutoRound
Image-Text-to-Text • 19B • Updated • 26.6k • 6
YCWTG/Qwen3.5-35B-A3B-Instruct-int4-mixed-AutoRound
Text Generation • 7B • Updated • 323 • 2
INSAIT-Institute/BgGPT-Gemma-3-12B-IT-GPTQ-W4A16
Image-Text-to-Text • 4B • Updated • 2
INSAIT-Institute/BgGPT-Gemma-3-27B-IT-GPTQ-W4A16
Image-Text-to-Text • 7B • Updated • 2
Ex0bit/Kimi-K2.5-PRISM-REAP-530B-A32B
Text Generation • 91B • Updated • 1.41k • 19
apolo13x/Qwen3.5-35B-A3B-quantized.w4a16
Image-Text-to-Text • 6B • Updated • 10.1k • 2
oxzoid/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GPTQ-int4
Image-Text-to-Text • 28B • Updated • 8.59k • 3
chankhavu/Nemotron-Cascade-2-30B-A3B-AWQ-INT4
5B • Updated • 542 • 1
Shoolife/Qwen2.5-1.5B-Instruct-LiteRT-LM-INT4
Text Generation • Updated • 1
happypatrick/Qwen3.5-397B-A17B-heretic-int4-AutoRound
Text Generation • Updated • 1
Advantech-EIOT/intel_llama-2-chat-7b
Text Generation • Updated • 3
RedHatAI/zephyr-7b-beta-marlin
Text Generation • 1B • Updated • 20
RedHatAI/TinyLlama-1.1B-Chat-v1.0-marlin
Text Generation • 0.3B • Updated • 4.44k • 2
RedHatAI/OpenHermes-2.5-Mistral-7B-marlin
Text Generation • 1B • Updated • 11 • 2
RedHatAI/Nous-Hermes-2-Yi-34B-marlin
Text Generation • 5B • Updated • 7 • 5
ecastera/ecastera-eva-westlake-7b-spanish-int4-gguf
7B • Updated • 16 • 2
softmax/Llama-2-70b-chat-hf-marlin
Text Generation • 10B • Updated • 9
softmax/falcon-180B-chat-marlin
Text Generation • 26B • Updated • 26
study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int4
Text Generation • 8B • Updated • 6
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4
Text Generation • 71B • Updated • 21 • 6
study-hjt/Meta-Llama-3-70B-Instruct-AWQ
Text Generation • 71B • Updated • 1
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation • 111B • Updated • 10 • 2
study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int4
Text Generation • 7B • Updated • 14
study-hjt/Qwen1.5-110B-Chat-AWQ
Text Generation • 111B • Updated • 6
modelscope/Yi-1.5-34B-Chat-AWQ
Text Generation • 34B • Updated • 83 • 2
modelscope/Yi-1.5-6B-Chat-GPTQ
Text Generation • 6B • Updated • 2