vibevoice-asr-it-yodas-only

Fine-tuned microsoft/VibeVoice-ASR on YODAS-Granary Italian (asr_only ~87k + ast 200k cap, conversational YouTube audio).

The LoRA adapter has been merged into the base model — drop-in replacement.

Usage

from vibevoice.modular.modeling_vibevoice_asr import VibeVoiceASRForConditionalGeneration
from vibevoice.processor.vibevoice_asr_processor import VibeVoiceASRProcessor

processor = VibeVoiceASRProcessor.from_pretrained("LocalAI-io/vibevoice-asr-it-yodas-only")
model = VibeVoiceASRForConditionalGeneration.from_pretrained("LocalAI-io/vibevoice-asr-it-yodas-only", dtype="bfloat16")

For just the LoRA adapter, see LocalAI-io/vibevoice-asr-it-yodas-only-lora.

Downloads last month
21
Safetensors
Model size
9B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for LocalAI-io/vibevoice-asr-it-yodas-only

Finetuned
(17)
this model

Dataset used to train LocalAI-io/vibevoice-asr-it-yodas-only