How to use from the
Use from the
Transformers.js library
// npm i @huggingface/transformers
import { pipeline } from '@huggingface/transformers';

// Allocate pipeline
const pipe = await pipeline('text-generation', 'BricksDisplay/TinyLlama-1.1B-Chat-v1.0-q4');

Convert from TinyLlama/TinyLlama-1.1B-Chat-v1.0 and 4 bits quantized.

Require onnxruntime>=0.17.0

Downloads last month
5
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for BricksDisplay/TinyLlama-1.1B-Chat-v1.0-q4

Quantized
(148)
this model

Collection including BricksDisplay/TinyLlama-1.1B-Chat-v1.0-q4