library-of-code
diff --git a/‎generative_models/InfoGAN_TensorFlow/README.md
Lines changed: 153 additions & 0 deletions b/‎generative_models/InfoGAN_TensorFlow/README.md
Lines changed: 153 additions & 0 deletions
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARbackground.png
448 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARbackground.png
448 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARfakeaccuracy.png
40.6 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARfakeaccuracy.png
40.6 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARfinal.png
468 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARfinal.png
468 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARforeground.png
447 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARforeground.png
447 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARloss.png
31.4 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARloss.png
31.4 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARrealaccuracy.png
43.4 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARrealaccuracy.png
43.4 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistfakeaccuracy.png
43 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistfakeaccuracy.png
43 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistfinal.png
147 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistfinal.png
147 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistloss.png
32 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistloss.png
32 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistrealaccuracy.png
39.6 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistrealaccuracy.png
39.6 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistthick.png
149 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistthick.png
149 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnisttilt.png
148 KB b/‎generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnisttilt.png
148 KB
diff --git a/‎generative_models/InfoGAN_TensorFlow/main.py
Lines changed: 224 additions & 0 deletions b/‎generative_models/InfoGAN_TensorFlow/main.py
Lines changed: 224 additions & 0 deletions
@@ -0,0 +1,153 @@
+# TensorFlow Implementation of InfoGAN 
+## Usage
+```bash
+$ python3 main.py --dataset CIFAR10 --noise_dim 64
+```
+> **_NOTE:_** on Colab Notebook use following command:
+```python
+!git clone link-to-repo
+%run main.py main.py --dataset CIFAR10 --noise_dim 64
+```
+
+## Help Log
+```
+usage: main.py [-h] [--dataset DATASET] [--epochs EPOCHS]
+               [--noise_dim NOISE_DIM] [--continuous_weight CONTINUOUS_WEIGHT]
+               [--batch_size BATCH_SIZE] [--outdir OUTDIR]
+
+optional arguments:
+  -h, --help            show this help message and exit
+  --dataset DATASET     Name of dataset: MNIST (default) or CIFAR10
+  --epochs EPOCHS       No of epochs: default 50 for MNIST, 150 for CIFAR10
+  --noise_dim NOISE_DIM
+                        No of latent Noise variables, default 62 for MNIST, 64
+                        for CIFAR10
+  --continuous_weight CONTINUOUS_WEIGHT
+                        Weight given to continuous Latent codes in loss
+                        calculation, default 0.5 for MNIST, 1 for CIFAR10
+  --batch_size BATCH_SIZE
+                        Batch size, default 256
+  --outdir OUTDIR       Directory in which to store data, don't put '/' at the
+                        end!
+```
+
+## Contributed by:
+* [Atharv Singh Patlan](https://github.com/AthaSSiN)
+
+## References
+
+* **Title**: InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
+* **Authors**: Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter Abbeel
+* **Link**: https://arxiv.org/pdf/1606.03657.pdf
+* **Tags**: Neural Network, Generative Networks, GANs
+* **Year**: 2016
+
+# Summary 
+
+## Introduction
+
+Generative adversarial nets were recently introduced as a novel way to train a generative model.
+They consist of two ‘adversarial’ models: a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
+
+However, the above specified GAN, termed as VanillaGAN, is not good in classifying the inputs provided to it, and hence generate an image as per our specifications. In order to do this, we need to tune the noise provided in the input provided to the GAN, and hence define a way so that the GAN learns to classify an image as belong to a given class, and also determine if it is real or fake. 
+
+Enter InfoGAN!
+
+## InfoGAN
+
+The idea is to provide a latent code, which has meaningful and consistent effects on the output. For instance, consider the MNIST dataset, where we have 10 digits. It would be helpful if we could use the property of the dataset having 10 classes and be able to assign a given digit with a particular value. This can be done by assigning part of the input to a 10-state discrete variable. The hope is that if you keep the code the same and randomly change the noise, you get variations of the same digit.
+
+The way InfoGAN approaches this problem is by splitting the Generator input into two parts: the traditional noise vector and a new “latent code” vector. The codes are then made meaningful by maximizing the __Mutual Information__ between the code and the generator output.
+
+![Eqn1](https://miro.medium.com/max/552/1*rSZXfx4_xcC-5z4LirNDRQ.png)
+
+Here *V(D,G)* is the standard Vanilla Gan loss, and *I(c;G(z,c))* is the mutual information loss, with Lambda being sort of a regularization constant (the mutual information loss can be seen as a regularizing term
+
+However, int the calculation of *I(c;G(z,c))*, we need to sample from the posterior distribution of the latent codes, which is usually intractable, and hence we replace it with a lower bound, calculated by approximating the posterior using an auxiliary distribution *Q(c|x)* and the reparameterization trick.
+
+![Eqn2](https://miro.medium.com/max/552/1*NTYmbgNBT9RzhdLl71-koA.png)  
+
+Where  
+![Eqn3](https://miro.medium.com/max/552/1*92L-ml_k7iQcPIWcvT7TIw.png)  
+
+Hence the final form of the loss function becomes:  
+![Eqn4](https://miro.medium.com/max/552/1*W2G0DFBQUa52Piy1snYVjQ.png)
+
+Thus, the problem basically reduces to the following process:
+1. Sample a value for the latent code c from a prior of your choice
+2. Sample a value for the noise z from a prior of your choice
+3. Generate x = *G(c,z)*
+4. Calculate *Q(c|x=G(c,z))*
+
+## Implementation
+
+In the implementation, we input a user defined number of noise variables, 10 categorical latent codes (hoping that in the output, each corresponds to a class of the datasets), and 2 uniform continuous latent codes (with values from -1 to 1), hoping that the correspond to some other features in the dataset
+
+![Model](https://miro.medium.com/max/1104/1*dXLgTV8lNiTInvxomgZSAg.png)
+
+We use the following default configuration: 
+- Binary CE to calculate the loss in real and fake samples detection
+- Categorical CE to calculate the loss in categorical classification
+- Ordinary Least Squares to calculate the loss in continuous variable detection (The continuous variables are uniform in the input but in the architecture predicts them in the form of a Gaussian Distribution. So i tried outputting the mean and log variance of the predictions and hence calculating the losses using the reparameterization trick, but upon applying some basic mathematics, I realized that it all boils down to calculating the OLS of the predicted values)
+- Lambda = 1, however, the weight given to the loss of the continuous codes can be varied (we used 0.5 for MNIST and 1 for CIFAR10)
+
+# Results
+
+## On MNIST Dataset
+
+Results after training for 50 epochs:
+![mnistFinal](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistfinal.png)
+
+> **_NOTE:_** In this graph orange plot corresponds to dicriminator loss, blue to generator loss, Green to loss of continuous variables and Gray to loss in categorical variables.
+
+
+Loss:  
+![mnistloss](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistloss.png)
+
+Plot of Real and Fake detection accuracies:  
+![mnistreal](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistrealaccuracy.png)
+![mnistfake](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistfakeaccuracy.png)
+
+Here is the final image generated by the generator for a randomly generated noise and label, with one continuous code being varied along the rows.
+
+In this one, the tilt in the images seems to change as we move left to right:  
+![mnisttilt](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnisttilt.png)
+
+While in this, the thickness of the digits seems to change:  
+![mnistthick](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/mnistthick.png)
+
+Note: In some cases, the digits have also changed while varying the continuous codes. I think that this is because there are many possible characters that the uniform codes can comply to, and its actually quite possible that they do not apply only to thickness / tilt etc, but can apply to curviness, or number of lines in a digit etc, which can make digits which look similar to each other, be generated by the same categorical code.
+
+## On CIFAR10 Dataset
+
+> **_NOTE_**: The paper does not have an implementation for the CIFAR10 dataset and hence the results aren't very good.
+
+Results after training for 137 epochs
+
+![cifargif](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARfinal.png)
+
+> **_NOTE:_** In this graph blue plot corresponds to generator loss and orange to discriminator loss
+
+Here is the loss graph  
+![cifarloss](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARloss.png)
+
+Plot of Real and Fake detection accuracies:  
+![CIFARreal](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARrealaccuracy.png)
+![CIFARfake](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARfakeaccuracy.png)
+
+Here is the final image generated by the generator for a randomly generated noise and label.
+
+In this one, the background color varies as we move left to right:  
+![cifarbg](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARbackground.png)
+
+While in this, the foreground color/size varies:  
+![cifarfg](https://github.com/AthaSSiN/model-zoo/blob/master/generative_models/InfoGAN_TensorFlow/assets/ReadmeImages/CIFARforeground.png)
+
+It seems the continuous latent codes are working fine, but the categorical codes weren't able to represent the different classes too well, hence there is room for a lot of experiments!
+
+# Sources
+
+- [InfoGAN — Generative Adversarial Networks Part III](https://towardsdatascience.com/infogan-generative-adversarial-networks-part-iii-380c0c6712cd)  
+Template on which the code was built:  
+- [DCGAN on TensorFlow tutorials](https://www.tensorflow.org/tutorials/generative/dcgan)
+
@@ -0,0 +1,224 @@
+import tensorflow as tf
+import glob
+import matplotlib.pyplot as plt
+import os
+import time
+import datetime
+import argparse
+from tensorflow.keras import layers
+
+print(tf.__version__)
+from utils import run_from_ipython, generate_latent_points, generate_and_save_images, save_gif, generate_varying_outputs
+
+parser = argparse.ArgumentParser()
+ipython = run_from_ipython()
+
+if ipython:
+    from IPython import display
+
+parser.add_argument('--dataset', type = str, default = "MNIST", help = "Name of dataset: MNIST (default) or CIFAR10")
+parser.add_argument('--epochs', type = int, default = 0, help = "No of epochs: default 50 for MNIST, 150 for CIFAR10")
+parser.add_argument('--noise_dim', type = int, default = 0, help = "No of latent Noise variables, default 62 for MNIST, 64 for CIFAR10")
+parser.add_argument('--continuous_weight', type = float, default = 0.0, help = "Weight given to continuous Latent codes in loss calculation, default 0.5 for MNIST, 1 for CIFAR10")
+parser.add_argument('--batch_size', type = int, default = 256, help = "Batch size, default 256")
+parser.add_argument('--outdir', type = str, default = '.', help = "Directory in which to store data, don't put '/' at the end!")
+
+args = parser.parse_args()
+
+if args.dataset == "MNIST":
+    from model_MNIST import make_generator_model, make_discriminator_model
+    (train_images, _), (_, _) = tf.keras.datasets.mnist.load_data()
+    train_images = train_images.reshape(train_images.shape[0], 28, 28, 1).astype('float32')
+    if args.epochs == 0 :
+        args.epochs = 50
+    if args.noise_dim == 0 :
+        args.noise_dim = 62
+    if args.continuous_weight == 0.0:
+        args.continuous_weight = 0.5
+    
+else :
+    from model_CIFAR10 import make_generator_model, make_discriminator_model
+    (train_images, _), (_, _) = tf.keras.datasets.cifar10.load_data()
+    train_images = train_images.reshape(train_images.shape[0], 32, 32, 3).astype('float32')
+    if args.epochs == 0 :
+        args.epochs = 150
+    if args.noise_dim == 0 :
+        args.noise_dim = 64
+    if args.continuous_weight == 0.0:
+        args.continuous_weight = 1
+            
+if not os.path.exists(f"{args.outdir}/assets/{args.dataset}"):
+    os.makedirs(f"{args.outdir}/assets/{args.dataset}")
+
+#normalizing the images
+train_images = (train_images - 127.5) / 127.5
+
+##### DEFINE GLOBAL VARIABLES AND OBJECTS ######
+BUFFER_SIZE = 600000
+BATCH_SIZE = args.batch_size
+epochs = args.epochs
+noise_dim = args.noise_dim
+continuous_dim = 2
+categorical_dim = 10
+num_examples_to_generate = 100
+continuous_weight = args.continuous_weight
+seed, _, _ = generate_latent_points(num_examples_to_generate, noise_dim, categorical_dim, continuous_dim) # A constant sample of latent points so as to create images
+
+ # Define Generator
+generator = make_generator_model(noise_dim)
+print("\nGenerator : ")
+print(generator.summary())
+discriminator = make_discriminator_model()
+print("\nDiscriminator : ")
+print(discriminator.summary())
+
+print("Dataset : ", args.dataset)
+###########################################
+
+# Converting data to tf Dataset
+train_dataset = tf.data.Dataset.from_tensor_slices(train_images).shuffle(BUFFER_SIZE).batch(BATCH_SIZE)
+
+# defining losses
+binary_cross_entropy = tf.keras.losses.BinaryCrossentropy(from_logits=True)
+categorical_cross_entropy = tf.keras.losses.CategoricalCrossentropy(from_logits=True)
+
+#defining optimizers
+generator_optimizer = tf.keras.optimizers.Adam(1e-3, beta_1=0.5 )
+discriminator_optimizer = tf.keras.optimizers.Adam(2e-4, beta_1=0.5)
+
+#defining storage points for checkpoints
+checkpoint_dir = f'{args.outdir}/training_checkpoints'
+checkpoint_prefix = os.path.join(checkpoint_dir, "ckpt")
+checkpoint = tf.train.Checkpoint(generator_optimizer=generator_optimizer, discriminator_optimizer =discriminator_optimizer,generator=generator,discriminator=discriminator)
+
+#defining loss metrics for Plotting purposes with tensorboard
+discriminator_loss_metric = tf.keras.metrics.Mean('discriminator_loss', dtype=tf.float32)
+discriminator_real_accuracy_metric = tf.keras.metrics.BinaryCrossentropy('discriminator_real_accuracy', from_logits=True)
+discriminator_fake_accuracy_metric = tf.keras.metrics.BinaryCrossentropy('discriminator_fake_accuracy', from_logits=True)
+generator_loss_metric = tf.keras.metrics.Mean('generator_loss', dtype=tf.float32)
+categorical_loss_metric = tf.keras.metrics.Mean('categorical_loss', dtype=tf.float32)
+continuous_loss_metric = tf.keras.metrics.Mean('continuous_loss', dtype=tf.float32)
+
+# Save points for metrics
+current_time = datetime.datetime.now().strftime("%Y%m%d-%H%M%S")
+base = f"{args.outdir}/logs/gradientTape/{current_time}"
+disc_log_dir = base + '/discriminator'
+gen_log_dir = base + '/generator'
+cont_log_dir = base + '/cont'
+cat_log_dir = base + '/cat'
+
+# Create summary writers
+disc_summary_writer = tf.summary.create_file_writer(disc_log_dir)
+gen_summary_writer = tf.summary.create_file_writer(gen_log_dir)
+cat_summary_writer = tf.summary.create_file_writer(cont_log_dir)
+cont_summary_writer = tf.summary.create_file_writer(cat_log_dir)
+
+##################################
+# A train step to train the model on a minibatch
+
+def train_step(images):
+    noise, categorical_input, continuous_input = generate_latent_points(BATCH_SIZE, noise_dim, categorical_dim, continuous_dim)
+
+    with tf.GradientTape() as gen_tape, tf.GradientTape() as disc_tape:
+      generated_images = generator(noise, training=True)
+
+      real_output = discriminator(images, training=True)
+      fake_output = discriminator(generated_images, training=True)
+
+      disc_loss, real_loss, fake_loss, categorical_loss, continuous_loss = discriminator_loss(real_output, fake_output, categorical_input, continuous_input)
+      gen_loss = generator_loss(fake_output, categorical_loss, continuous_loss)
+
+    discriminator_loss_metric(disc_loss)
+    generator_loss_metric(gen_loss)
+    discriminator_real_accuracy_metric(tf.ones_like(real_output[:,0]), real_output[:,0])
+    discriminator_fake_accuracy_metric(tf.zeros_like(fake_output[:,0]), fake_output[:,0])
+    categorical_loss_metric(categorical_loss)
+    continuous_loss_metric(continuous_loss)
+
+    print(f"Losses - Disc : [{disc_loss}], Gen : [{gen_loss}], \n categorical loss : {categorical_loss}, continuous loss : {continuous_loss}")
+    
+    gradients_of_generator = gen_tape.gradient(gen_loss, generator.trainable_variables)
+    gradients_of_discriminator = disc_tape.gradient(disc_loss, discriminator.trainable_variables)
+
+    generator_optimizer.apply_gradients(zip(gradients_of_generator, generator.trainable_variables))
+    discriminator_optimizer.apply_gradients(zip(gradients_of_discriminator, discriminator.trainable_variables))
+    
+####################################
+
+def discriminator_loss(real_output, fake_output, categorical_input, continuous_input):
+    real_loss = binary_cross_entropy(tf.ones_like(real_output[:,0]), real_output[:,0])
+    fake_loss = binary_cross_entropy(tf.zeros_like(fake_output[:,0]), fake_output[:,0])
+    
+    categorical_output = fake_output[:,1:1 + categorical_dim]
+    continuous_output = fake_output[:, 1+categorical_dim : ]
+
+    categorical_loss = categorical_cross_entropy(categorical_input, categorical_output)
+    continuous_loss = tf.reduce_mean((2*(continuous_output - continuous_input))**2)
+    
+    total_loss = real_loss + fake_loss + continuous_weight*continuous_loss + categorical_loss
+    return total_loss, real_loss, fake_loss, categorical_loss, continuous_loss
+
+#####################################
+
+def generator_loss(fake_output, categorical_loss, continuous_loss):
+    gen_loss = binary_cross_entropy(tf.ones_like(fake_output[:,0]), fake_output[:,0])
+    return gen_loss +  continuous_weight*continuous_loss + categorical_loss
+    
+#####################################
+
+def main():
+    # begin the training loop
+    
+    for epoch in range(epochs):
+        start = time.time()
+        print(f"EPOCH : {epoch+1}")
+        for image_batch in train_dataset:
+            train_step(image_batch)
+        # Produce images for the GIF
+        if ipython:
+            display.clear_output(wait=True)
+        generate_and_save_images(generator, epoch + 1, seed, outdir = args.outdir, dataset = args.dataset)
+    
+        # Save the model every 15 epochs
+        if (epoch + 1) % 15 == 0:
+          checkpoint.save(file_prefix = checkpoint_prefix)
+        
+        # writing to summary writers
+        with disc_summary_writer.as_default():
+          tf.summary.scalar('Loss', discriminator_loss_metric.result(), step = epoch)
+          tf.summary.scalar('Real Accuracy', discriminator_real_accuracy_metric.result(), step = epoch)
+          tf.summary.scalar('Fake Accuracy', discriminator_fake_accuracy_metric.result(), step = epoch)
+        
+        with cat_summary_writer.as_default():
+          tf.summary.scalar('Loss', categorical_loss_metric.result(), step = epoch)
+        
+        with cont_summary_writer.as_default():
+          tf.summary.scalar('Loss', continuous_loss_metric.result(), step = epoch)
+        
+        with gen_summary_writer.as_default():
+          tf.summary.scalar('Loss', generator_loss_metric.result(), step = epoch)
+    
+        print('Time for epoch {} is {} sec'.format(epoch + 1, time.time()-start))
+        print(f'Epoch results: Discriminator Loss: {discriminator_loss_metric.result()}, Real Accuracy: {discriminator_real_accuracy_metric.result()}, Fake Accuracy: {discriminator_fake_accuracy_metric.result()}')
+        print(f'               Generator Loss: {generator_loss_metric.result()}')
+    
+        discriminator_loss_metric.reset_states()
+        discriminator_real_accuracy_metric.reset_states()
+        discriminator_fake_accuracy_metric.reset_states()
+        generator_loss_metric.reset_states()
+        categorical_loss_metric.reset_states()
+        continuous_loss_metric.reset_states()
+    
+    # Generate after the final epoch
+    if ipython:
+        display.clear_output(wait=True)
+    generate_and_save_images(generator, epochs, seed, outdir = args.outdir, dataset = args.dataset)
+                             
+    save_gif(args.outdir, args.dataset)
+    
+    # For producing outputs with constant noise and varying continuous and categorical latent codes
+    
+    generate_varying_outputs(generator, num_examples_to_generate, noise_dim, args.dataset, args.outdir)
+    
+if __name__ == '__main__':
+    main()