Skip to content

Commit b6a0836

Browse files
authored
Update pixelrnn.md
1 parent 4ed7d06 commit b6a0836

File tree

1 file changed

+10
-7
lines changed

1 file changed

+10
-7
lines changed

pixelrnn.md

+10-7
Original file line numberDiff line numberDiff line change
@@ -29,12 +29,8 @@ principles apply here; more precisely, we generate image pixels starting
2929
from the top left corner, and we model each pixel’s dependency on
3030
previous pixels using an RNN (LSTM).
3131

32-
![(From CS231N Slides) Sequential Image Generation using
33-
PixelRNN](imports/pixelrnn.png "fig:") [fig:my~l~abel]
34-
3532
<div class="fig figcenter fighighlight">
36-
<img src="/assets/nn2/prepro1.jpeg">
37-
<div class="figcaption">Common data preprocessing pipeline. <b>Left</b>: Original toy, 2-dimensional input data. <b>Middle</b>: The data is zero-centered by subtracting the mean in each dimension. The data cloud is now centered around the origin. <b>Right</b>: Each dimension is additionally scaled by its standard deviation. The red lines indicate the extent of the data - they are of unequal length in the middle, but of equal length on the right.</div>
33+
<img src="/assets/pixelrnn.png">
3834
</div>
3935

4036
Specifically, the PixelRNN framework is made up of twelve
@@ -89,8 +85,9 @@ As we have seen, there are two distinct components to the
8985
illustrates how each of these two LSTMs operates, when applied to an RGB
9086
image.
9187

92-
![Visualization of the mappings for Row LSTM and Diagonal
93-
BiLSTM](Screen Shot 2021-06-15 at 9.41.08 AM.png "fig:") [fig:my~l~abel]
88+
<div class="fig figcenter fighighlight">
89+
<img src="/assets/Screen Shot 2021-06-15 at 9.41.08 AM.png">
90+
</div>
9491

9592
**Row LSTM** is a unidirectional layer that processes the image row by
9693
row from top to bottom computing features for a whole row at once using
@@ -140,3 +137,9 @@ On the ImageNet data set, achieved an NLL score of 3.86 and 3.63 on the
140137
the 32x32 and 64x64 image sizes respectively. On CiFAR-10, it achievied
141138
a NLL score of 3.00, which was state-of-the-art at the time of
142139
publication.
140+
141+
References
142+
==========
143+
144+
1) CS231n Lecture 11 'Generative Modeling'
145+
2) Pixel Recurrent Neural Networks (Oord et. al.) 2016

0 commit comments

Comments
 (0)