GitHub - coeyliang20/stable_diffision_from_scratch

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
0.quickstart.py		0.quickstart.py
1.inference.py		1.inference.py
2.train.py		2.train.py
LICENSE		LICENSE
README.md		README.md
demo.py		demo.py
my_plot.png		my_plot.png
reset.sh		reset.sh
set.sh		set.sh

Repository files navigation

Stable Diffusion Intro:
- Text-to-image latent diffusion model
- Created by CompVis, Stability AI, and LAION
- Trained on 512x512 images from LAION-5B database subset
- Uses frozen CLIP ViT-L/14 text encoder
  - part of the CLIP
- 860M UNet and 123M text encoder
- Lightweight, runs on consumer GPUs

- Try Stable Diffusion from Huggingface - install some packages ```bash pip install -r requirements.txt

# if you are in China
pip install -r requirements.txt -i http://pypi.douban.com/simple/ --trusted-host pypi.douban.com
```

Pipeline
- end2end
- pretrained model on huggingface:
  - CompVis/stable-diffusion-v1-4。 512x512
  - runwayml/stable-diffusion-v1-5. 512x512
  - stabilityai/stable-diffusion-2-1-base. 512x512
  - stabilityai/stable-diffusion-2-1. 768x768
- for a faster inference and lower memory usage, use fp16, also pass torch_dtype = torch.float16

About

No description, website, or topics provided.

Report repository

Releases

No releases published

Packages

No packages published

Languages