add OpenVoice #1633

Merged
merged 29 commits into from
Feb 5, 2024
c66e067
initial
pavel-esir Jan 22, 2024
04c74bc
moved some parts to openvoice_utils
pavel-esir Jan 22, 2024
3cba802
move wrapper to separate classes
pavel-esir Jan 22, 2024
d2d2d94
successfully moved all infer into openvino_utils.py
pavel-esir Jan 22, 2024
867e25e
successully run gradio
pavel-esir Jan 22, 2024
8dd337d
ready for review
pavel-esir Jan 22, 2024
05231ab
add Readme, add steps explanations
pavel-esir Jan 26, 2024
caf2101
Update README.md
pavel-esir Jan 26, 2024
594d99b
fetch notebook_utils.py, delete quantization
pavel-esir Jan 26, 2024
dc69731
fix pip conflicts
pavel-esir Jan 29, 2024
d1344dc
spellcheck fix
pavel-esir Jan 29, 2024
2617de7
remove webm
pavel-esir Jan 29, 2024
5efe08b
update readme, install ffmpeg
pavel-esir Jan 29, 2024
0977b0e
Merge remote-tracking branch 'upstream/main' into add_openvoice
pavel-esir Jan 29, 2024
a9c8ec5
typo fix
pavel-esir Jan 29, 2024
2443277
remove webm file
pavel-esir Jan 29, 2024
a5614d2
typo fix
pavel-esir Jan 29, 2024
1f52fe7
apply comments
pavel-esir Jan 31, 2024
5d6b33d
Merge remote-tracking branch 'upstream/main' into add_openvoice
pavel-esir Jan 31, 2024
1dc4719
fix enable_chinese_lang = False
pavel-esir Jan 31, 2024
64d56e6
typos fix
pavel-esir Jan 31, 2024
031fe17
ignore openvoice on treon mac
pavel-esir Jan 31, 2024
db54a8f
add -q to pip install
pavel-esir Jan 31, 2024
d86bc6c
remove redundant close for gradio
pavel-esir Jan 31, 2024
d2aa108
Merge branch 'main' into add_openvoice
pavel-esir Feb 1, 2024
9efcc85
make notebook independable from download utils
pavel-esir Feb 2, 2024
c86e60b
cleanup and add metadata
pavel-esir Feb 2, 2024
8d934ad
make linter happy
pavel-esir Feb 2, 2024
932d259
make manually selected voice more natural: fix source source speaker …
pavel-esir Feb 5, 2024
1 change: 1 addition & 0 deletions .ci/ignore_treon_mac.txt
@@ -47,4 +47,5 @@
281-kosmos2-multimodal-large-language-model
279-mobilevlm-language-assistant
283-photo-maker
284-openvoice
404-style-transfer-webcam
9 changes: 8 additions & 1 deletion .ci/spellcheck/.pyspelling.wordlist.txt
@@ -780,4 +780,11 @@ ZavyChromaXL
Zongyuan
ZeroScope
zeroscope
xformers
xformers
OpenVoice
BaseSpeakerTTS
ToneColorConverter
nn
lang
OpenVoiceBaseClass
processings
1 change: 1 addition & 0 deletions README.md
@@ -55,6 +55,7 @@ Check out the latest notebooks that show how to optimize and deploy popular mode
| [DepthAnything](notebooks/280-depth-anything)<br>[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/openvinotoolkit/openvino_notebooks/HEAD?filepath=notebooks%2F280-depth-anythingh%2F280-depth-anything.ipynb)<br>[![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/openvinotoolkit/openvino_notebooks/blob/main/notebooks/280-depth-anything/280-depth-anything.ipynb) | Monocular Depth estimation with DepthAnything and OpenVINO | <img src=https://github.com/openvinotoolkit/openvino_notebooks/assets/29454499/a9a16658-512f-470c-a33c-0e1f9d0ae72c width=300> |
| [Kosmos-2: Grounding Multimodal Large Language Models](notebooks/281-kosmos2-multimodal-large-language-model)<br> | Kosmos-2: Grounding Multimodal Large Language Model and OpenVINO™ | <img src=https://huggingface.co/microsoft/kosmos-2-patch14-224/resolve/main/annotated_snowman.jpg width=225> |
| [PhotoMaker](notebooks/283-photo-maker)<br> | Text-to-image generation using PhotoMaker and OpenVINO | <img src=https://github.com/openvinotoolkit/openvino_notebooks/assets/91237924/88bccc4a-5789-42ca-8a68-f402c3e7c5a4 width=300> |
| [OpenVoice](notebooks/284-openvoice)<br>[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/openvinotoolkit/openvino_notebooks/HEAD?filepath=notebooks%2F284-openvoice%2F284-openvoice.ipynb)[![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/openvinotoolkit/openvino_notebooks/blob/main/notebooks/284-openvoice/284-openvoice.ipynb) | OpenVoice a versatile instant voice tone transferring and generating speech in various languages. |<img src=https://github.com/openvinotoolkit/openvino_notebooks/assets/5703039/ca7eab80-148d-45b0-84e8-a5a279846b51 width=300> |

## Table of Contents
