QwQ Edge 💬

- title: QWQ EDGE
- emoji: 💬
- colorFrom: red
- colorTo: gray
- sdk: gradio
- sdk_version: 5.15.0
- app_file: app.py
- pinned: true
- license: creativeml-openrail-m
- about: demo space to try how multi model selection function works
- short_description: Multimodality

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

QwQ Edge is a multimodal chatbot that combines text-to-speech (TTS) generation, image generation using Stable Diffusion XL (SDXL), and a conversational interface powered by large language models. The app supports real-time conversations with multimedia output, including images and speech.

Key Features:

- Multimodal Conversational AI: Supports text, image, and speech as input/output.
- Text-to-Speech (TTS): Convert text responses into speech with multiple voice options using Edge TTS.
- Image Generation: Generate high-quality images based on prompts using the Stable Diffusion XL pipeline.
- Multimodal Inputs: Handle text, image files, and spoken queries for generating appropriate outputs.
- Real-time Interaction: Provides instant responses with streaming output and live updates for image generation and TTS processing.
- Customizable Parameters: Fine-tune token generation parameters such as temperature, top-p, top-k, and repetition penalties for more controlled responses.
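
To make the sampling parameters concrete, here is a minimal, library-free sketch of how temperature, top-k, and top-p are commonly applied to a model's output scores before a token is sampled. The function name `filter_probs` and its defaults are illustrative assumptions, not code from this app; real inference libraries differ in details, and the repetition penalty is left out.

```python
import math


def filter_probs(logits, temperature=1.0, top_k=0, top_p=1.0):
    """Return {token_id: probability} after temperature, top-k and top-p filtering.

    Hypothetical helper for illustration only; not taken from app.py.
    """
    # Temperature: values below 1 sharpen the distribution, values above 1 flatten it.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]  # subtract the max for numerical stability
    total = sum(exps)
    probs = [e / total for e in exps]

    # Rank token ids from most to least likely.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # Top-k: keep only the k most likely tokens (0 disables the filter).
    if top_k > 0:
        ranked = ranked[:top_k]
    mass = sum(probs[i] for i in ranked)

    # Top-p: keep the smallest prefix whose renormalized cumulative mass reaches top_p.
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i] / mass
        if cumulative >= top_p:
            break

    # Renormalize the surviving tokens so their probabilities sum to 1.
    norm = sum(probs[i] for i in kept)
    return {i: probs[i] / norm for i in kept}
```

A sampler would then draw the next token from the returned distribution, for example with `random.choices`.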

Technologies Used:

- Gradio: For creating the user-friendly interface.
- Transformers: For NLP and text processing (Hugging Face models).
- Stable Diffusion XL: For high-quality image generation.
- Edge TTS: For converting text into speech.
- Python: For the backend logic and model integration.
- PyTorch: For deep learning model inference and acceleration.

Supported Commands:

Text-to-Speech (TTS):
- @tts1: Use the "en-US-JennyNeural" voice.
- @tts2: Use the "en-US-GuyNeural" voice.
Image Generation:
- @image <description>: Generate an image from the given description using Stable Diffusion XL.
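
The commands above are plain prefixes on the chat message, so they can be recognized with simple string handling. The following is a minimal sketch of that idea, assuming the command is the first whitespace-separated word of the message; `parse_command` and the returned dictionary shape are hypothetical and may not match how app.py does it.

```python
# Voice names come from the command list above; the mapping itself is illustrative.
TTS_VOICES = {
    "@tts1": "en-US-JennyNeural",
    "@tts2": "en-US-GuyNeural",
}


def parse_command(message):
    """Split a chat message into a command kind and its payload (hypothetical helper)."""
    text = message.strip()
    head, _, rest = text.partition(" ")
    key = head.lower()
    if key in TTS_VOICES:
        return {"kind": "tts", "voice": TTS_VOICES[key], "text": rest.strip()}
    if key == "@image":
        return {"kind": "image", "prompt": rest.strip()}
    # Anything without a recognized prefix is treated as a normal chat message.
    return {"kind": "chat", "text": text}
```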

Environment Setup

Clone the repository:

```
git clone https://github.com/PRITHIVSAKTHIUR/QwQ-Edge.git
cd QwQ-Edge
```

Install the required dependencies:
```
pip install -r requirements.txt
```
Set up the environment variables for model paths and parameters:
- MODEL_VAL_PATH: Path to the SDXL model.
- MAX_INPUT_TOKEN_LENGTH: Maximum token length for model inputs.
- Other environment variables for batch size, resolution, and CPU/GPU usage.
Launch the app:
```
python app.py
```
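
As an illustration of the environment-variable step, the sketch below reads the two variables named in the setup instructions. The helper `load_settings`, the dictionary keys, and the fallback value of 4096 are assumptions made for this example; check app.py for the defaults it actually uses.

```python
import os


def load_settings(env=None):
    """Read model settings from environment variables (hypothetical helper)."""
    env = os.environ if env is None else env
    return {
        # Path to the SDXL model; None when the variable is not set.
        "model_val_path": env.get("MODEL_VAL_PATH"),
        # The fallback of 4096 is an assumed default, not confirmed by the README.
        "max_input_token_length": int(env.get("MAX_INPUT_TOKEN_LENGTH", "4096")),
    }
```

Passing a plain dictionary instead of `os.environ` makes the function easy to try without touching the real environment.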

Example Usage:

TTS Example:
- Type: @tts1 What is quantum computing?
- The app will convert this text into speech using the JennyNeural voice.
Image Generation Example:
- Type: @image A futuristic city skyline at sunset with neon lights
- The app will generate an image based on the prompt using the SDXL model.
Chatbot Conversation Example:
- Type: What is the capital of France?
- The app will respond with a chatbot-generated text response.
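
For the TTS example, a minimal sketch of the speech step using the `edge-tts` package might look like the following. The function name `synthesize` and the output file name are assumptions for illustration; the app itself may wire this differently, and the call needs network access to reach the Edge TTS service.

```python
import asyncio


async def synthesize(text, voice="en-US-JennyNeural", out_path="output.mp3"):
    """Convert text to speech and save it as an audio file (illustrative sketch)."""
    # Imported lazily so this module loads even when edge-tts is not installed.
    import edge_tts

    communicate = edge_tts.Communicate(text, voice)
    await communicate.save(out_path)
    return out_path
```

It can be run with `asyncio.run(synthesize("What is quantum computing?"))`, which writes the audio to `output.mp3` in the current directory.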

Technical Architecture

```mermaid
graph TD
    A[User Interface] --> B[Chat Logic]
    B --> C{Command Type}
    C -->|Text| D[FastThink-0.5B]
    C -->|Image| E[Qwen2-VL-OCR-2B]
    C -->|@image| F[Stable Diffusion XL]
    C -->|@tts| G[Edge TTS]
    D --> H[Response]
    E --> H
    F --> H
    G --> H
```
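
The branching in the diagram can be read as a small routing function. The sketch below mirrors the four branches; the function name `select_backend`, the `has_image_files` flag, and the order in which the checks run are assumptions, since the diagram does not say which branch wins when a message matches more than one.

```python
def select_backend(text, has_image_files=False):
    """Pick the backend named in the architecture diagram (illustrative sketch)."""
    # The first whitespace-separated word decides whether this is a command.
    command = text.strip().split(" ", 1)[0].lower()
    if command == "@image":
        return "Stable Diffusion XL"
    if command in ("@tts1", "@tts2"):
        return "Edge TTS"
    if has_image_files:
        # Messages with attached images go to the vision-language model.
        return "Qwen2-VL-OCR-2B"
    # Plain text falls through to the text model.
    return "FastThink-0.5B"
```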

