vibevoice-asr-it-yodas-only-ggml

GGUF quantizations of LocalAI-io/vibevoice-asr-it-yodas-only for use with vibevoice.cpp. The source model is VibeVoice-ASR fine-tuned on Italian data from YODAS only (asr_only ~87k + ast capped at 200k, conversational YouTube audio).

Files

| File | Quantization | Size | Notes |
|---|---|---|---|
| vibevoice-asr-it-yodas-only-q8_0.gguf | Q8_0 | ~13 GB | Near-lossless, recommended for accuracy |
| vibevoice-asr-it-yodas-only-q5_0.gguf | Q5_0 | ~11 GB | Sweet spot between quality and size |
| vibevoice-asr-it-yodas-only-q4_0.gguf | Q4_0 | ~10 GB | Smallest, fastest |
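
As a rough illustration of the size trade-off in the table above, the sketch below picks the largest file that fits a given size budget. `pick_quant` is a hypothetical helper, not part of vibevoice.cpp, and it only compares against the approximate file sizes listed here; actual memory use at inference time is not stated in this card and may be higher.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the largest quantization file whose approximate
# size (from the Files table) fits within a whole-number budget in GB.
pick_quant() {
  local budget_gb=$1
  if   [ "$budget_gb" -ge 13 ]; then echo "vibevoice-asr-it-yodas-only-q8_0.gguf"
  elif [ "$budget_gb" -ge 11 ]; then echo "vibevoice-asr-it-yodas-only-q5_0.gguf"
  elif [ "$budget_gb" -ge 10 ]; then echo "vibevoice-asr-it-yodas-only-q4_0.gguf"
  else
    # Nothing in the table is smaller than ~10 GB.
    echo "no quantization fits in ${budget_gb} GB" >&2
    return 1
  fi
}
```

For example, `pick_quant 12` prints the Q5_0 filename, which can then be passed to `hf download`.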

Usage

hf download LocalAI-io/vibevoice-asr-it-yodas-only-ggml vibevoice-asr-it-yodas-only-q5_0.gguf --local-dir .
./build/bin/vibevoice-cli asr --model vibevoice-asr-it-yodas-only-q5_0.gguf --audio audio.wav

See vibevoice.cpp for build and tokenizer setup instructions.
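
To transcribe several recordings, the single-file command above can be wrapped in a loop. This is a minimal sketch that uses only the `asr`, `--model`, and `--audio` arguments shown in this card. `transcribe_dir` and the `VIBEVOICE_CLI` and `MODEL` variables are hypothetical names introduced for illustration; vibevoice.cpp itself does not read them.

```shell
#!/usr/bin/env bash
# Hypothetical overrides; the defaults mirror the command shown in Usage.
VIBEVOICE_CLI="${VIBEVOICE_CLI:-./build/bin/vibevoice-cli}"
MODEL="${MODEL:-vibevoice-asr-it-yodas-only-q5_0.gguf}"

# Run the ASR command once per .wav file in the given directory.
transcribe_dir() {
  local dir=$1 f
  for f in "$dir"/*.wav; do
    [ -e "$f" ] || continue   # skip the literal glob when no .wav files exist
    echo "== $f" >&2          # progress marker on stderr
    "$VIBEVOICE_CLI" asr --model "$MODEL" --audio "$f"
  done
}
```

Calling `transcribe_dir ./recordings` would process each file in turn; where the CLI writes its transcript is not covered here, so check the vibevoice.cpp documentation before redirecting output.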

Format: GGUF
Model size: 9B params
Architecture: vibevoice