-
Notifications
You must be signed in to change notification settings - Fork 332
[Docs]: Add whisper word level timestamps #3231
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
[Docs]: Add whisper word level timestamps #3231
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull request overview
This PR adds documentation and examples for word-level timestamp prediction in Whisper speech recognition, enhancing the existing timestamp capabilities.
Changes:
- Added word-level timestamps feature to the speech recognition use case overview
- Documented word-level timestamp prediction with Python and C++ code examples
- Updated Python and C++ sample code to demonstrate word-level timestamp usage
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| site/src/pages/_sections/UseCasesSection/components/speech-recognition.tsx | Added word-level timestamps as a new feature in the UI component |
| site/docs/use-cases/speech-recognition/_sections/_usage_options/index.mdx | Added comprehensive documentation section explaining word-level timestamp prediction with code examples |
| samples/python/whisper_speech_recognition/whisper_speech_recognition.py | Updated Python sample to enable and demonstrate word-level timestamp output |
| samples/cpp/whisper_speech_recognition/whisper_speech_recognition.cpp | Updated C++ sample to enable and demonstrate word-level timestamp output |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull request overview
Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
site/docs/use-cases/speech-recognition/_sections/_usage_options/index.mdx
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull request overview
Copilot reviewed 6 out of 6 changed files in this pull request and generated 1 comment.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
|
|
||
| # x86_64 macOS does not (and will not) support newer versions of torch > 2.2 which are used in the newer transformers versions. It's not possible to lower transformer version in requirements.txt because that triggers vulnerability alert: https://github.com/openvinotoolkit/openvino_tokenizers/security/dependabot/11 | ||
| - run: python -m pip install "transformers<4.52" | ||
| # - run: python -m pip install "transformers<4.52" |
Copilot
AI
Jan 28, 2026
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The comment explaining the transformers version constraint (lines 605-607) should be removed or updated since the pip install command is now commented out. The vulnerability and compatibility context no longer applies to active code.
Description
CVS-179419
CVS-179417
Checklist: