Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation

This repository contains the official implementation of our paper "Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation". Our work introduces a novel approach to estimate pixel-wise uncertainty in diffusion models, enabling more reliable and controlled image generation.

The method provides:

Pixel-wise uncertainty maps for generated images
Improved sampling guidance based on uncertainty estimates
Evaluation metrics for uncertainty quality assessment
Implementation for various datasets including ImageNet and CIFAR-10

For implementation details and usage instructions, please refer to the individual scripts described below.

Installing environment

To install the environment, use hatch:

# Install hatch if not already installed
pip install hatch

# Create and activate default environment
hatch shell

# Alternatively, for CPU-only installation
hatch -e cpu shell

Download models:

# Download U-ViT models
hatch run download-uvit-imagenet64-M
hatch run download-uvit-256
hatch run download-uvit-512

# Download ADM models
hatch run download-adm-imagenet128
hatch run download-adm-imagenet64

# Download other models
hatch run download-uvit-autoencoder
hatch run download-ilora-sd-depth
hatch run download-imagenet64-classifier
hatch run download-imagenet128-classifier

Dataset download for FID calculation

ImageNet Dataset Download

To download the ImageNet dataset for FID calculation, use the provided script:

# Run the download script
bash scripts/download_imagenet.sh

Note: You need to obtain valid ImageNet download links and add them to the script at scripts/download_imagenet.sh before running. The links are available from the official ImageNet website after registration.

Resizing dataset

To get different versions of Imagenet, namely Imagenet64, Imagenet128, Imagenet256 and Imagenet512 please use the script in this script

as follow:

# Resize to 64x64
python image_resizer_imagenet.py -i PATH/TO/IMAGENET -o OUTPUT/FOLDER -r -s 64

# Resize to 128x128
python image_resizer_imagenet.py -i PATH/TO/IMAGENET -o OUTPUT/FOLDER -r -s 128

# Resize to 256x256
python image_resizer_imagenet.py -i PATH/TO/IMAGENET -o OUTPUT/FOLDER -r -s 256

# Resize to 512x512 
python image_resizer_imagenet.py -i PATH/TO/IMAGENET -o OUTPUT/FOLDER -r -s 512

CIFAR-10 Download

You can download CIFAR-10 from the following link and put under data/cifar10

Experiments

Generate uncertainty maps

Once download the models you can generate the uncertainty maps for imagenet with the following command:

python scripts/generate_dataset_score_uncertainty_imagenet.py --num-samples 10_000 --batch-size 128 -M 5 --dropout 0.5 --multi-gpu --scheduler uncertainty_zigzag_centered --image-size 128 --generation-steps 50 --start-step-uc 40 --num-steps-uc 10  --index-seed 3

FID Calculation

To compute FID first you need to compute true dataset distribution:

python compute_dataset_fid.py

Then you can compute FID score with the following command:

  python compute_fid_imagenet.py --config imagenet256_1000_samples

Uncertainty guidance

Once download the models you can use the uncertainty to guide the generative process with the following command:

python scripts/generate_with_uncertainty_threshold_stable_diffusion.py --prompt "a beautiful mountain landscape" --num-steps 20 --seed 123 --percentile 0.9 --strength 1.0 --num-steps-threshold 2

usage: generate_with_uncertainty_threshold_stable_diffusion.py [-h] [--num-steps NUM_STEPS] [--prompt PROMPT]
                 [--prompt-negative PROMPT_NEGATIVE] [--seed SEED]
                 [--start-step-threshold START_STEP_THRESHOLD]
                 [--num-steps-threshold NUM_STEPS_THRESHOLD]
                 [--percentile PERCENTILE] [--skip-original] [--use-posterior]
                 [--strength STRENGTH] [--config CONFIG]

options:
  -h, --help            show this help message and exit
  --num-steps NUM_STEPS
                        number of steps to generate (default: 20)
  --prompt PROMPT       prompt for the model (default: a photo of a cat)
  --prompt-negative PROMPT_NEGATIVE
                        negative prompt for the model
  --seed SEED           seed for the model (default: 491)
  --start-step-threshold START_STEP_THRESHOLD
                        step to start estimating the threshold (default: 0)
  --num-steps-threshold NUM_STEPS_THRESHOLD
                        number of steps to estimate the threshold (default: 20)
  --percentile PERCENTILE, --perc PERCENTILE, -p PERCENTILE
                        percentile for the threshold (default: 0.95)
  --skip-original       skip original
  --use-posterior       use posterior
  --strength STRENGTH   strength of the uncertainty guidance (default: 0.99)
  --config CONFIG       Path to the configuration file

Citation

@InProceedings{De_Vita_2025_WACV,
    author    = {De Vita, Michele and Belagiannis, Vasileios},
    title     = {Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation},
    booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)},
    month     = {February},
    year      = {2025},
    pages     = {3844-3854}
}

License

This project is licensed under a Creative Commons Attribution 4.0 International License.

See LICENSE for details.

Scripts

This directory contains the following Python scripts:

clean_empty_runs.py: Removes empty run directories to clean up storage.
compute_ause.py: Calculates the Area Under the Sparsification Error (AUSE) for uncertainty estimation.
compute_dataset_fid_imagenet64_npz.py: Computes the Fréchet Inception Distance (FID) for ImageNet64 datasets in NPZ format.
compute_dataset_fid_old.py: An earlier version of the FID computation script.
compute_dataset_fid.py: Computes the FID for generated datasets.
compute_fid_imagenet128.py: Calculates the FID for the ImageNet128 dataset.
compute_fid_imagenet.py: Computes the FID for the ImageNet dataset.
compute_nll.py: Calculates the Negative Log-Likelihood (NLL) for model evaluation.
compute_pr_generated_samples.py: Computes Precision and Recall metrics for generated samples.
compute_pr_true_dataset.py: Computes Precision and Recall for the true dataset.
compute_statistics_fid_score.py: Gathers statistics related to FID scores.
compute_threshold_pixel_wise.py: Determines pixel-wise thresholds for uncertainty estimation.
eval_fid_lsun_churches256.py: Evaluates FID on the LSUN Churches256 dataset.
generate_compute_fid_score_guided_diffusion_imagenet128.py: Generates images using guided diffusion and computes FID scores on ImageNet128.
generate_dataset_score_uncertainty_cifar10.py: Generates datasets and computes uncertainty scores on CIFAR-10.
generate_dataset_score_uncertainty_imagenet_classifier_guidance.py: Generates datasets with classifier guidance and computes uncertainty scores on ImageNet.
generate_dataset_score_uncertainty_imagenet.py: Generates datasets and computes uncertainty scores on ImageNet.
generate_diffusion_starting_data.py: Generates initial data for diffusion models.
generate_images_with_uncertainty_threshold.py: Generates images using specified uncertainty thresholds.
generate_with_uncertainty_threshold_flux.py: Generates images with uncertainty thresholds using flux methods.
generate_with_uncertainty_threshold_stable_diffusion_3.py: Uses stable diffusion models to generate images with uncertainty guidance.
generate_with_uncertainty_threshold_stable_diffusion.py: Generates images using stable diffusion and uncertainty guidance.
measure_times_cifar10.py: Measures processing times on the CIFAR-10 dataset.
measure_times_imagenet.py: Measures processing times on the ImageNet dataset.
plot_curve_M.py: Plots evaluation curves for analysis.
summary_experiments.py: Summarizes results from various experiments.
uncertainty_benchmark_imagenet.py: Benchmarks uncertainty estimation methods on ImageNet.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
config		config
diffusion_uncertainty		diffusion_uncertainty
img		img
scripts		scripts
tests		tests
cover_image.png		cover_image.png
license.md		license.md
pyproject.toml		pyproject.toml
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Table of Contents

Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation

Installing environment

Download models:

Dataset download for FID calculation

ImageNet Dataset Download

Resizing dataset

CIFAR-10 Download

Experiments

Generate uncertainty maps

FID Calculation

Uncertainty guidance

Citation

License

Scripts

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

Michedev/diffusion-uncertainty

Folders and files

Latest commit

History

Repository files navigation

Table of Contents

Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation

Installing environment

Download models:

Dataset download for FID calculation

ImageNet Dataset Download

Resizing dataset

CIFAR-10 Download

Experiments

Generate uncertainty maps

FID Calculation

Uncertainty guidance

Citation

License

Scripts

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages