Parakeet-TDT 0.6B HD Phonation
Official checkpoint for the paper "Huntington Disease Automatic Speech Recognition with Biomarker Supervision."
Model description
This model is a phonation-aware variant of Parakeet-TDT 0.6B HD, tuned for automatic speech recognition on speech affected by Huntington disease (HD). It extends the HD-adapted base model with auxiliary supervision from phonatory biomarker labels.
What this model does
The model transcribes English read / controlled clinical speech from speakers with Huntington disease and healthy controls. It is intended as a research model for studying robust ASR under hyperkinetic motor-speech disruption and for analyzing the effect of phonatory supervision on transcription behavior.
Training
The model was initialized from charleslwang/parakeet-tdt-0.6b-HD and further adapted using parameter-efficient encoder-side adapters with an auxiliary objective based on phonatory biomarker labels, while keeping the pretrained backbone frozen.
Evaluation
On the reported HD test set, this model achieved:
- WER: 6.07
- Substitutions: 1.92
- Deletions: 2.77
- Insertions: 1.38
Intended use
This model is intended for:
- research on pathological / atypical speech recognition,
- benchmarking ASR on Huntington disease speech,
- studying how phonatory auxiliary supervision reshapes error behavior.
It is not intended for clinical diagnosis, treatment decisions, or standalone medical use.
Limitations
- Trained and evaluated on a relatively small, high-fidelity clinical corpus.
- Primarily reflects controlled / read speech rather than spontaneous conversational speech.
- Did not outperform the plain HD-adapted model on overall WER.
- May not generalize to severe out-of-distribution impairment, other languages, or other recording conditions.
Citation
If you use this model, please cite:
@article{wang2026huntington,
title={Huntington Disease Automatic Speech Recognition with Biomarker Supervision},
author={Wang, Charles L. and Chen, Cady and Gong, Ziwei and Hirschberg, Julia},
journal={arXiv preprint arXiv:2603.11168},
year={2026},
url={https://arxiv.org/abs/2603.11168}
}
- Downloads last month
- 133
Model tree for charleslwang/parakeet-tdt-0.6b-HD-phonation
Base model
nvidia/parakeet-tdt-0.6b-v2Collection including charleslwang/parakeet-tdt-0.6b-HD-phonation
Paper for charleslwang/parakeet-tdt-0.6b-HD-phonation
Evaluation results
- WER (%) on Huntington Disease clinical speech test setself-reported6.070