Skip to content

Commit 3cfb98c

Browse files
Update README.md
1 parent 773c664 commit 3cfb98c

File tree

1 file changed

+6
-1
lines changed

1 file changed

+6
-1
lines changed

README.md

+6-1
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,12 @@ Note that whilst this code is MIT licensed, the model weights are covered by the
1414

1515
## Additional training
1616

17-
In addition to the training described in the paper, the released weights have also undergone 500k steps of finetuning with force-aligned data from LibriLight and the English portion Multilingual LibriSpeech. This was performed by using a CTC head to regress the force-aligned tags from pre-bottleneck latents. We found that this additional training significantly boosted the applicability of the codec tokens to downstream tasks like TTS.
17+
In addition to the training described in the paper, the released weights have also undergone 500k steps of finetuning with force-aligned data from LibriLight and the English portion Multilingual LibriSpeech. This was performed by using a CTC head to regress the force-aligned tags from pre-bottleneck latents. We found that this additional training significantly boosted the applicability of the codec tokens to downstream tasks like TTS, at a small cost to reconstruction metrics.
18+
19+
| Model | SI-SDR | Mel Dis | STFT Dis | PESQ | STOI |
20+
|---------------------------|-------:|--------:|---------:|-----:|-----:|
21+
| base | 4.73 | 0.86 | 1.26 | 3.09 | 0.92 |
22+
| CTC finefune | 3.58 | 0.90 | 1.30 | 3.01 | 0.90 |
1823

1924
## Install
2025

0 commit comments

Comments
 (0)