DiffusionDetect is a project that combines the power of Stable Diffusion models for image generation with object detection using YOLOv5. This project allows users to generate images based on text prompts using the Stable Diffusion model and then apply object detection to the generated images using YOLOv5.
- Generate images based on text prompts using Stable Diffusion models.
- Apply object detection to the generated images using YOLOv5.
- View both the generated and filtered images side by side in a user-friendly interface.
To run the project, follow these steps:
Install the required libraries by running the following command in your terminal:
pip install torch diffusers gradiogit clone github.com/rahulsharmavishwakarma/diffusiondetect.git3. Run the provided Python script to launch the Gradio interface for image generation and object detection. Navigate to the project directory and run the following command:
python app.py- Enter a text prompt in the input textbox on the Gradio interface.
- Click the "Generate" button to create an image based on the prompt.
- View the generated image and the filtered image with object detection results side by side.
- Explore different prompts and observe the variations in generated images and object detection outcomes.
- torch
- diffusers
- gradio
- PIL
- ultralytics/yolov5
- Stable Diffusion models by StabilityAI
- YOLOv5 model by Ultralytics
