Active filters: gptq
ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-BitBLAS
Text Generation
• 51B • Updated
ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-BitBLAS
Text Generation
• 51B • Updated
ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-GPTQ
Text Generation
• 13B • Updated
• 3
ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-BitBLAS
Text Generation
• 51B • Updated
• 2
Xu-Ouyang/pythia-2.8b-deduped-int4-step129000-GPTQ-wikitext2
Text Generation
• 3B • Updated
• 4
ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-GPTQ
Text Generation
• 13B • Updated
ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-BitBLAS
Text Generation
• 274B • Updated
ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-GPTQ
Text Generation
• 69B • Updated
• 3
ChenMnZ/Llama-2-70b-EfficientQAT-w2g64-GPTQ
Text Generation
• 69B • Updated
• 2
ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-BitBLAS
Text Generation
• 275B • Updated
ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-GPTQ
Text Generation
• 69B • Updated
• 4
Xu-Ouyang/pythia-2.8b-deduped-int3-step14000-GPTQ-wikitext2
Text Generation
• 3B • Updated
Xu-Ouyang/pythia-12b-deduped-int3-step14000-GPTQ-wikitext2
Text Generation
• 11B • Updated
• 3
ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-GPTQ
Text Generation
• 7B • Updated
• 1
ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-GPTQ
Text Generation
• 7B • Updated
• 3
• 1
Xu-Ouyang/pythia-2.8b-deduped-int3-step29000-GPTQ-wikitext2
Text Generation
• 3B • Updated
• 1
ModelCloud/gemma-2-27b-it-gptq-4bit
Text Generation
• 28B • Updated
• 43
• 12
ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-GPTQ
Text Generation
• 7B • Updated
• 3
ChenMnZ/Llama-3-70b-EfficientQAT-w2g128-GPTQ
Text Generation
• 71B • Updated
ChenMnZ/Llama-3-70b-EfficientQAT-w2g64-GPTQ
Text Generation
• 71B • Updated
• 5
ChenMnZ/Llama-3-70b-EfficientQAT-w4g128-GPTQ
Text Generation
• 71B • Updated
• 4
Xu-Ouyang/pythia-2.8b-deduped-int3-step43000-GPTQ-wikitext2
Text Generation
• 3B • Updated
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-GPTQ
Text Generation
• 71B • Updated
• 1
Llamarider222/Mixtral_8x7B_GPTQ
Text Generation
• 47B • Updated
• 2
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64-GPTQ
Text Generation
• 71B • Updated
• 2
ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-BitBLAS
Text Generation
• 26B • Updated
• 1
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-GPTQ
Text Generation
• 71B • Updated
• 2
ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-BitBLAS
Text Generation
• 26B • Updated
• 1
Xu-Ouyang/pythia-2.8b-deduped-int3-step57000-GPTQ-wikitext2
Text Generation
• 3B • Updated
• 5
ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-BitBLAS
Text Generation
• 26B • Updated