Commit
·
ff022d0
1
Parent(s):
fa4264f
Update README.md
Browse files
README.md
CHANGED
|
@@ -11,7 +11,9 @@ license: apache-2.0
|
|
| 11 |
|
| 12 |
[Facebook's Hubert](https://ai.facebook.com/blog/hubert-self-supervised-representation-learning-for-speech-recognition-generation-and-compression)
|
| 13 |
|
| 14 |
-
The large model pretrained on 16kHz sampled speech audio. When using the model make sure that your speech input is also sampled at 16Khz.
|
|
|
|
|
|
|
| 15 |
|
| 16 |
The model was pretrained on [Libri-Light](https://github.com/facebookresearch/libri-light).
|
| 17 |
|
|
|
|
| 11 |
|
| 12 |
[Facebook's Hubert](https://ai.facebook.com/blog/hubert-self-supervised-representation-learning-for-speech-recognition-generation-and-compression)
|
| 13 |
|
| 14 |
+
The large model pretrained on 16kHz sampled speech audio. When using the model make sure that your speech input is also sampled at 16Khz.
|
| 15 |
+
|
| 16 |
+
**Note**: This model does not have a tokenizer as it was pretrained on audio alone. In order to use this model **speech recognition**, a tokenizer should be created and the model should be fine-tuned on labeled text data. Check out [this blog](https://huggingface.co/blog/fine-tune-wav2vec2-english) for more in-detail explanation of how to fine-tune the model.
|
| 17 |
|
| 18 |
The model was pretrained on [Libri-Light](https://github.com/facebookresearch/libri-light).
|
| 19 |
|