
Evaluating the performance of a baseline Automatic Speech Recognition (ASR) model on 5 different speech/audio datasets.


srilaasya/asr_eval


🌱 ASR Wav2Vec2 model performance evaluation | Access our final thesis here

The aim of this study is to investigate the performance of the Wav2Vec2 model trained on five mutually exclusive subsets of the GigaSpeech dataset (a multi-domain English speech recognition corpus), each containing samples from a specific category. Our goal is to train the model separately on each subset and evaluate its performance using a confusion matrix to draw conclusions about how well the model generalizes to specific categories. The metric we use is word error rate (WER). The Wav2Vec2 model has not previously been exposed to this dataset, and we seek to determine how it performs on these subsets. We describe the methodology used to create the subsets and preprocess the data for training the model. We then evaluate the performance of the model on each subset separately and discuss the correlations between the categories. We expect the results to show that training the Wav2Vec2 model on each subset separately can lead to improved performance and better generalization to the specific categories. Additionally, the confusion matrix analysis can provide insight into correlations between the different categories and whether combining such data can improve the generalization of the model.
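As a rough illustration of the evaluation step, the sketch below computes WER for a Wav2Vec2 CTC model on one subset. The checkpoint name, dataset config, and split are placeholders rather than the exact ones used in this study, and GigaSpeech-specific text normalization (e.g. punctuation tags) is omitted for brevity.

```python
import torch
import evaluate
from datasets import Audio, load_dataset
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

# Placeholder checkpoint; substitute the fine-tuned checkpoint under test.
MODEL_ID = "facebook/wav2vec2-base-960h"
processor = Wav2Vec2Processor.from_pretrained(MODEL_ID)
model = Wav2Vec2ForCTC.from_pretrained(MODEL_ID).eval()

wer_metric = evaluate.load("wer")

# GigaSpeech on the Hugging Face Hub is gated; "xs"/"test" here stand in
# for whichever category subset is being evaluated.
ds = load_dataset("speechcolab/gigaspeech", "xs", split="test")
ds = ds.cast_column("audio", Audio(sampling_rate=16_000))

predictions, references = [], []
for sample in ds:
    inputs = processor(
        sample["audio"]["array"], sampling_rate=16_000, return_tensors="pt"
    )
    with torch.no_grad():
        logits = model(inputs.input_values).logits
    # Greedy CTC decoding of the most likely token at each frame.
    pred_ids = torch.argmax(logits, dim=-1)
    predictions.append(processor.batch_decode(pred_ids)[0].lower())
    references.append(sample["text"].lower())

print("WER:", wer_metric.compute(predictions=predictions, references=references))
```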

Final working version: Colab notebook link
To run evaluation, after training finishes (after the trainer.train() cell), move the vocab.json file into the /wav2vec2-base-speechcolab-demo/checkpoint-1000 folder, then run the eval cells.
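For example, the vocab.json move and the subsequent reload of the checkpoint could look roughly like this; the paths follow the notebook, but the exact reload code there may differ.

```python
import shutil
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

CKPT = "wav2vec2-base-speechcolab-demo/checkpoint-1000"

# Copy the tokenizer vocabulary next to the fine-tuned weights so the
# processor can be reloaded from the same directory as the model.
shutil.copy("vocab.json", f"{CKPT}/vocab.json")

# Assumes the Trainer also saved the preprocessor/tokenizer configs into the
# checkpoint directory; otherwise load the processor from its original path.
processor = Wav2Vec2Processor.from_pretrained(CKPT)
model = Wav2Vec2ForCTC.from_pretrained(CKPT)
```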

References:

wav2vec2 documentation
wav2vec2 description on Hugging Face
wav2vec2 fine-tuning example with TIMIT
wav2vec2 fine-tuning example with TIMIT (Colab notebook)
