Description
2017-11-10
1)Image-to-Image Translation with Conditional Adversarial Nets
阅读总结:
训练input: 一个图片和随机高斯噪声(dropout) ,训练output:一个逼真的和输入图片相关的图片
测试input: 一个图片和随机高斯噪声(dropout) 。 测试output:一个逼真的和输入图片相关的图片
G: U-NET是an encoder-decoder with skip connections**
(encoder:
C64-C128-C256-C512-C512-C512-C512-C512
U-Net decoder:
CD512-CD1024-CD1024-C1024-C1024-C512-C256-C128
After the last layer in the decoder, a convolution is ap- plied to map to the number of output channels (3 in general, except in colorization, where it is 2), followed by a Tanh function. As an exception to the above notation, Batch- Norm is not applied to the first C64 layer in the encoder. All ReLUs in the encoder are leaky, with slope 0.2, while ReLUs in the decoder are not leaky.)
D:论文采用70x70的patchGAN
(C64-C128-C256-C512,After the last layer, a convolution is applied to map to a 1 dimensional output, followed by a Sigmoid function. As an exception to the above notation, BatchNorm is not applied to the first C64 layer. All ReLUs are leaky, with slope 0.2.)
优势:可以用于处理任意大的图片
loss函数:G⇤ = arg min max LcGAN (G, D) + lamda*LL1(G).
(Adding both terms together (with lamda = 100) reduces these artifacts.)
训练细节:
1)Weights were initialized from a Gaussian distribution with mean 0 and standard deviation 0.02.
2) apply batch normalization,use batch size 1 for certain experiments and 4 for others,noting little difference