How to run batch inference? #17

Open
dinoSpeech opened this issue Jan 13, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@dinoSpeech

Hello, thank you for sharing this really nice code.

However, I cannot find batch-wise inference code for transcribing.

(I referred to the quick-start example code in the README.)

Is there any batch-wise inference code?

Best regards

@YuanGongND YuanGongND added the enhancement New feature or request label Jan 13, 2024
@YuanGongND
Owner

YuanGongND commented Jan 13, 2024

hi there,

Thanks for the question.

No, the current version does not support batch inference. The reason is that this repo is based on the official OpenAI Whisper repo, which does not support batch inference either.
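In practice that means transcribing files one at a time. A minimal sketch of a sequential loop, assuming the quick-start interface shown in the README (`whisper_at.load_model` / `model.transcribe`); the file list is illustrative:

```python
import whisper_at as whisper

# No batch API, so process files sequentially via the quick-start interface.
model = whisper.load_model("large-v1")
for path in ["a.wav", "b.wav"]:  # illustrative file list
    result = model.transcribe(path, at_time_res=10)  # at_time_res: audio-tagging time resolution (seconds)
    print(path, result["text"])
```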

However, when evaluating Whisper-AT on large datasets (e.g., AudioSet), we do use batch inference. The implementation is to pre-extract the Whisper encoder features and store them on disk (done one file at a time), and then feed batches to the TLTR module for training or inference (which does support batched input); see the sketch below.
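For illustration, a minimal sketch of that two-stage pattern. Feature extraction uses the official `whisper` package's encoder; the `.feat.pt` cache layout is made up, `tltr_model` is a placeholder for the repo's TLTR module, and note that the real TLTR consumes intermediate-layer representations rather than only the final encoder output:

```python
import torch
from torch.utils.data import Dataset, DataLoader
import whisper  # official openai-whisper package

# Stage 1: extract encoder features one file at a time and cache them on disk.
model = whisper.load_model("base")
paths = ["a.wav", "b.wav"]  # illustrative file list
for path in paths:
    audio = whisper.pad_or_trim(whisper.load_audio(path))       # 30 s of 16 kHz audio
    mel = whisper.log_mel_spectrogram(audio).to(model.device)   # (n_mels, 3000)
    with torch.no_grad():
        feat = model.encoder(mel.unsqueeze(0)).squeeze(0).cpu() # (1500, d_model)
    torch.save(feat, path + ".feat.pt")  # hypothetical cache layout

# Stage 2: load the cached features and feed them to TLTR in batches.
class FeatureDataset(Dataset):
    def __init__(self, paths):
        self.paths = paths
    def __len__(self):
        return len(self.paths)
    def __getitem__(self, i):
        return torch.load(self.paths[i] + ".feat.pt")

tltr_model = torch.nn.Identity()  # placeholder for the repo's TLTR module
loader = DataLoader(FeatureDataset(paths), batch_size=2)
for batch in loader:           # batch: (B, 1500, d_model)
    preds = tltr_model(batch)  # the TLTR module accepts batched input
```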

There are also third-party implementations of Whisper that support batch inference; as long as their encoder features match the official Whisper's, you can use them to extract the Whisper features in batches.
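If you go that route, it is worth sanity-checking one file first by comparing the third-party encoder output against the official one. A minimal sketch, where `third_party_encode` is a hypothetical stand-in for whatever batched implementation you pick:

```python
import torch
import whisper  # official openai-whisper package

model = whisper.load_model("base")
audio = whisper.pad_or_trim(whisper.load_audio("audio.mp3"))
mel = whisper.log_mel_spectrogram(audio).to(model.device)
with torch.no_grad():
    official = model.encoder(mel.unsqueeze(0))  # (1, 1500, d_model)

third_party = third_party_encode("audio.mp3")  # hypothetical: your batched implementation
print(torch.allclose(official.cpu(), third_party.cpu(), atol=1e-4))
```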

-Yuan
