Hi! After conv5, should we apply maxpool2d first or ReLU first?
From the instructions in the Jupyter notebook, it seems we apply maxpool2d before ReLU. I thought it should be the other way round.
According to the notebook, the architecture should be:
(* skipped ... *)
conv2d, 5x5, 64->128, padding=2
maxpool2d
ReLU()
flatten the feature map
...
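
For what it's worth, here is a quick sanity check on whether the order changes the result at all. ReLU is monotonically non-decreasing, so it should commute with taking a max, which would make the two orderings produce the same output. The sketch below is framework-free plain Python, not the notebook's code: the `relu` and `maxpool2d` helpers are hypothetical stand-ins, and the 2x2 window with stride 2 and the 8x8 input are assumptions, since the listing above does not give the pooling parameters.

```python
import random

def relu(fmap):
    """Element-wise ReLU on a 2D feature map stored as a list of lists."""
    return [[max(0.0, v) for v in row] for row in fmap]

def maxpool2d(fmap, k=2):
    """Non-overlapping k x k max pooling (stride = k).

    k=2 is an assumption; the post does not state the pooling window.
    """
    h, w = len(fmap), len(fmap[0])
    return [
        [max(fmap[i + di][j + dj] for di in range(k) for dj in range(k))
         for j in range(0, w - k + 1, k)]
        for i in range(0, h - k + 1, k)
    ]

# Random 8x8 single-channel feature map with both positive and negative values.
random.seed(0)
fmap = [[random.uniform(-1.0, 1.0) for _ in range(8)] for _ in range(8)]

pool_then_relu = relu(maxpool2d(fmap))   # order shown in the listing above
relu_then_pool = maxpool2d(relu(fmap))   # the order I expected

# Compare the two orderings element by element.
same = pool_then_relu == relu_then_pool
print(same)
```

If the two outputs match, the choice would only affect cost: with a 2x2 window, pooling first means ReLU runs on a quarter as many elements. This only covers max pooling; average pooling would not commute with ReLU in the same way.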