You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<divclass="figcaption">Common data preprocessing pipeline. <b>Left</b>: Original toy, 2-dimensional input data. <b>Middle</b>: The data is zero-centered by subtracting the mean in each dimension. The data cloud is now centered around the origin. <b>Right</b>: Each dimension is additionally scaled by its standard deviation. The red lines indicate the extent of the data - they are of unequal length in the middle, but of equal length on the right.</div>
33
+
<imgsrc="/assets/pixelrnn.png">
38
34
</div>
39
35
40
36
Specifically, the PixelRNN framework is made up of twelve
@@ -89,8 +85,9 @@ As we have seen, there are two distinct components to the
89
85
illustrates how each of these two LSTMs operates, when applied to an RGB
90
86
image.
91
87
92
-
 [fig:my~l~abel]
88
+
<divclass="fig figcenter fighighlight">
89
+
<imgsrc="/assets/Screen Shot 2021-06-15 at 9.41.08 AM.png">
90
+
</div>
94
91
95
92
**Row LSTM** is a unidirectional layer that processes the image row by
96
93
row from top to bottom computing features for a whole row at once using
@@ -140,3 +137,9 @@ On the ImageNet data set, achieved an NLL score of 3.86 and 3.63 on the
140
137
the 32x32 and 64x64 image sizes respectively. On CiFAR-10, it achievied
141
138
a NLL score of 3.00, which was state-of-the-art at the time of
0 commit comments