Project is continued on: https://github.com/codename0og/codename-rvc-fork-4
ㅤㅤ👇 You can join my discord server below ( RVC / AI Audio friendly ) 👇ㅤㅤ
Goal of this project is for me to have a much more flexible base than mainline rvc.
ㅤ
- Normalization
- Peak-compression
- Silence-truncation
- 'simple' method chosen for preprocessing ( 3sec segments. ) Expect issues with PESQ and data alignment If the following requirements are not met. ㅤ
-
Hold-Out type validation mechanism during training. ( L1 MEL, mrSTFT, PESQ, SI-SDR )
In between epochs.
-
BF16-AMP, TF32, FP32 Training modes available.
BF16 & TF32 require Ampere or newer GPUs.
BF16 and TF32 can be used simultaneously for extra speed gains.
NOTE: BF16 is used by default. If unsupported hardware detected, switched back to FP32. Inference is only in FP32.
-
Support for 'Spin' embedder.
Needs proper pretrains.
-
Ability to choose an optimizer.
( Currently supporting: AdamW, RAdam, Ranger21 )
-
Double-update strategy for Discriminator.
-
Support for custom input-samples used during training for live-preview of model's reconstruction performance.
-
Mel spectrogram %-based similarity metric.
-
Support for Multi-scale and classic L1 mel loss.
-
Support for the following vocoders: HiFi-GAN, MRF-HiFi-gan and Refine-GAN.
( And their respective pretrains. )
-
Checkpointing and various speed / memory optimizations compared to og RVC.
-
New logging mechanism for losses: Average loss per epoch logged as the standard loss,
and rolling average loss over 50 steps to evaluate general trends and the model's performance over time. -
From-ui quick tweaks; lr for g/d, scheduler, linear warmup etc.
⚠️ 1: HiFi-gan is the stock rvc/applio vocoder, hence it's what you use for og pretrains and hifigan-based customs.
⚠️ 2: MRF-HiFi-GAN and Refine-GAN require new pretrained models. They can't be used with original rvc's G/D pretrains.
✨ to-do list ✨
- Ability to choose lr_decay from the ui.
💡 Ideas / concepts 💡
Propably improving the mel-similarity.. or generally expanding the idea.Irrelevant. Functionality taken over by hold-out validation mechanism.- and more.. perhaps ..
Run the installation script:
- Double-click
run-install.bat
.
Start Applio using:
- Double-click
run-fork.bat
.
This launches the Gradio interface in your default browser.
To monitor training or visualize data:
- Run the " run_tensorboard_in_model_folder.bat " file from logs folder and paste in there path to your model's folder
( containing 'eval' folder or tfevents file/s. )
If it doesn't work for you due to blocked port, open up CMD with admin rights and use this command:netsh advfirewall firewall add rule name="Open Port 25565" dir=in action=allow protocol=TCP localport=25565
The creators of the original Applio repository, Applio's contributors, and the maintainer of this fork (Codename;0), built upon Applio, are not responsible for any legal issues, damages, or consequences arising from the use of this repository or the content generated from it. By using this fork, you acknowledge that:
- The use of this fork is at your own risk.
- This repository is intended solely for educational, and experimental purposes.
- Any misuse, including but not limited to illegal activities or violation of third-party rights,
is not the responsibility of the original creators, contributors, or this fork’s maintainer. - You willingly agree to comply with this repository's Terms of Use