Skip to content

Commit eb9a237

Browse files
authored
[release 0.12] Remove examples code using prototype features from release (#2497)
1 parent a188711 commit eb9a237

File tree

11 files changed

+2
-1069
lines changed

11 files changed

+2
-1069
lines changed

examples/asr/emformer_rnnt/README.md

+1-3
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ This directory contains sample implementations of training and evaluation pipeli
1515
### Pipeline Demo
1616

1717
[`pipeline_demo.py`](./pipeline_demo.py) demonstrates how to use the `EMFORMER_RNNT_BASE_LIBRISPEECH`
18-
or `EMFORMER_RNNT_BASE_TEDLIUM3` bundle that wraps a pre-trained Emformer RNN-T produced by the corresponding recipe below to perform streaming and full-context ASR on several audio samples.
18+
bundle that wraps a pre-trained Emformer RNN-T produced by the LibriSpeech recipe below to perform streaming and full-context ASR on several audio samples.
1919

2020
## Model Types
2121

@@ -67,8 +67,6 @@ The table below contains WER results for dev and test subsets of TED-LIUM releas
6767
| dev | 0.108 |
6868
| test | 0.098 |
6969

70-
[`tedlium3/eval_pipeline.py`](./tedlium3/eval_pipeline.py) evaluates the pre-trained `EMFORMER_RNNT_BASE_TEDLIUM3` bundle on the dev and test sets of TED-LIUM release 3. Running the script should produce WER results that are identical to those in the above table.
71-
7270
### MuST-C release v2.0
7371

7472
The MuST-C model is configured with a vocabulary size of 500. Consequently, the MuST-C model's last linear layer in the joiner has an output dimension of 501 (500 + 1 to account for the blank symbol). In contrast to those of the datasets for the above two models, MuST-C's transcripts are cased and punctuated; we preserve the casing and punctuation when training the SentencePiece model.

examples/asr/emformer_rnnt/pipeline_demo.py

+1-14
Original file line numberDiff line numberDiff line change
@@ -13,13 +13,8 @@
1313

1414
import torch
1515
import torchaudio
16-
from common import MODEL_TYPE_LIBRISPEECH, MODEL_TYPE_MUSTC, MODEL_TYPE_TEDLIUM3
17-
from mustc.dataset import MUSTC
16+
from common import MODEL_TYPE_LIBRISPEECH
1817
from torchaudio.pipelines import EMFORMER_RNNT_BASE_LIBRISPEECH, RNNTBundle
19-
from torchaudio.prototype.pipelines import (
20-
EMFORMER_RNNT_BASE_MUSTC,
21-
EMFORMER_RNNT_BASE_TEDLIUM3,
22-
)
2318

2419
logger = logging.getLogger(__name__)
2520

@@ -35,14 +30,6 @@ class Config:
3530
partial(torchaudio.datasets.LIBRISPEECH, url="test-clean"),
3631
EMFORMER_RNNT_BASE_LIBRISPEECH,
3732
),
38-
MODEL_TYPE_MUSTC: Config(
39-
partial(MUSTC, subset="tst-COMMON"),
40-
EMFORMER_RNNT_BASE_MUSTC,
41-
),
42-
MODEL_TYPE_TEDLIUM3: Config(
43-
partial(torchaudio.datasets.TEDLIUM, release="release3", subset="test"),
44-
EMFORMER_RNNT_BASE_TEDLIUM3,
45-
),
4633
}
4734

4835

examples/asr/emformer_rnnt/tedlium3/eval_pipeline.py

-90
This file was deleted.

examples/asr/librispeech_conformer_rnnt/README.md

-49
This file was deleted.

examples/asr/librispeech_conformer_rnnt/data_module.py

-194
This file was deleted.

0 commit comments

Comments
 (0)