Voicetral

Overview

This project provides an interface between the Ollama model and Applio's text-to-speech (TTS) and voice conversion services. It converts user speech input into text, generates responses using Ollama, and then synthesizes and plays back the response using Applio.

Features

Speech-to-text conversion using speech_recognition.
Text generation using the Ollama model.
Text-to-speech conversion and voice conversion using Applio.
Audio playback using sounddevice.
Audio resampling and processing with pydub.

Requirements

Software Dependencies

Python 3.9
FFmpeg (for audio processing)
Ollama: A model service for text generation. Visit Ollama's website for installation and usage instructions.
Applio: A service for text-to-speech and voice conversion. Visit Applio's website for installation and usage instructions.

Python Packages

The required Python packages are listed in requirements.txt. To install them, use the following command:

pip install -r requirements.txt

Configuration

FFmpeg: Ensure that FFmpeg is installed and accessible in your system's PATH. You can download FFmpeg from here and follow the installation instructions for your operating system.
Ollama: Install and run the Ollama service according to the instructions on their website. Make sure it's accessible at the specified URL.
Applio: Install and run the Applio service according to the instructions on their website. Ensure it is running locally on the specified port (default: http://127.0.0.1:6969/).
Configuration File: Update the config.ini file with the appropriate paths and settings for your environment.
- START_PROMPT: Your initial prompt for the Ollama model.
- OLLAMA_MODEL: The name of the Ollama model to use.
- APPLIO_TTS_VOICE: The voice configuration for Applio's TTS.
- APPLIO_PTH_PATH: Path to Applio's model file.
- APPLIO_INDEX_PATH: Path to Applio's index file.
- APPLIO_TTS_OUTPUT_PATH: Path where the TTS output will be saved.
- APPLIO_RVC_OUTPUT_PATH: Path where the RVC output will be saved.

Installation

Clone the repository:

git clone https://github.com/Skulux/Voicetral
cd Voicetral

Create and activate a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows, use `venv\Scripts\activate`

Install the required packages:
```
pip install -r requirements.txt
```
Ensure FFmpeg is installed and properly configured in your PATH.
Install and start the Ollama and Applio services as per their respective instructions.

Usage

Configure your config.ini file with the necessary settings as described in the Configuration section.
Run the main script:
```
python main.py
```
Follow the on-screen prompts. Speak into your microphone to interact with the bot.
Say "exit" to stop the program. It is important if you want to save your conversation history.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Feel free to submit issues or pull requests if you have suggestions or improvements. For significant changes, please open an issue first to discuss what you would like to change.

Contact

For questions or feedback, please contact [email protected] or open an issue on the project's GitHub repository.

External Services

Ollama: Installation and usage instructions
Applio: Installation and usage instructions

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
LICENSE		LICENSE
config.ini		config.ini
main.py		main.py
readme.md		readme.md
requirements.txt		requirements.txt
short_term_memory.py		short_term_memory.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voicetral

Overview

Features

Requirements

Software Dependencies

Python Packages

Configuration

Installation

Usage

License

Contributing

Contact

External Services

About

Releases

Packages

Languages

License

Skulux/Voicetral

Folders and files

Latest commit

History

Repository files navigation

Voicetral

Overview

Features

Requirements

Software Dependencies

Python Packages

Configuration

Installation

Usage

License

Contributing

Contact

External Services

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages