Commit ·
76fcc9f
1
Parent(s): e585064
Update README.md
Browse files
README.md
CHANGED
|
@@ -31,7 +31,7 @@ This repository provides a pretrained E-TDNN model (x-vector) using SpeechBrain.
|
|
| 31 |
|
| 32 |
This system is composed of an E-TDNN model (x-vector). It is a combination of convolutional and residual blocks. The embeddings are extracted using temporal statistical pooling. The system is trained with Additive Margin Softmax Loss.
|
| 33 |
|
| 34 |
-
We use FBank (16kHz, 25ms frame length, 10ms hop length, 80 filter-bank channels) as the input features. It was trained using initial learning rate of 0.001 and batch size of 512 with linear scheduler for
|
| 35 |
|
| 36 |
# Performance
|
| 37 |
|
|
@@ -39,7 +39,7 @@ We use FBank (16kHz, 25ms frame length, 10ms hop length, 80 filter-bank channels
|
|
| 39 |
|
| 40 |
| Splits | Backend | S-norm | EER(%) | minDCF(0.01) |
|
| 41 |
|:-------------:|:--------------:|:--------------:|:--------------:|:--------------:|
|
| 42 |
-
| VoxCeleb1-O | cosine | no |
|
| 43 |
| VoxCeleb1-E | cosine | no | TBD | TBD |
|
| 44 |
| VoxCeleb1-H | cosine | no | TBD | TBD |
|
| 45 |
|
|
|
|
| 31 |
|
| 32 |
This system is composed of an E-TDNN model (x-vector). It is a combination of convolutional and residual blocks. The embeddings are extracted using temporal statistical pooling. The system is trained with Additive Margin Softmax Loss.
|
| 33 |
|
| 34 |
+
We use FBank (16kHz, 25ms frame length, 10ms hop length, 80 filter-bank channels) as the input features. It was trained using initial learning rate of 0.001 and batch size of 512 with linear scheduler for 40 epochs on 4 A100 GPUs. We employ additive noises and reverberation from [MUSAN](http://www.openslr.org/17/) and [RIR](http://www.openslr.org/28/) datasets to enrich the supervised information. The pre-training progress takes approximately seven days for the E-TDNN model.
|
| 35 |
|
| 36 |
# Performance
|
| 37 |
|
|
|
|
| 39 |
|
| 40 |
| Splits | Backend | S-norm | EER(%) | minDCF(0.01) |
|
| 41 |
|:-------------:|:--------------:|:--------------:|:--------------:|:--------------:|
|
| 42 |
+
| VoxCeleb1-O | cosine | no | 1.91 | 0.20 |
|
| 43 |
| VoxCeleb1-E | cosine | no | TBD | TBD |
|
| 44 |
| VoxCeleb1-H | cosine | no | TBD | TBD |
|
| 45 |
|