I trained a CNN w/o pooling on MNIST and produced a colored visualization of the feature maps you get when you translate a digit on an input canvas. This illustrates the equivariance property of convolutions. More info (why convolutions?) in a blog post: medium.com/@chriswolfvision/…
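The equivariance property the tweet illustrates (shift the input, and the feature maps shift the same way) can be checked numerically. A minimal sketch, not the author's ~500-line PyTorch code: a hand-rolled 2-D cross-correlation with circular (wrap-around) boundaries, where the equivariance is exact.

```python
import numpy as np

def circ_conv2d(x, k):
    # Circular cross-correlation: out[i, j] = sum_{a,b} x[(i+a) % H, (j+b) % W] * k[a, b]
    out = np.zeros_like(x, dtype=float)
    for a in range(k.shape[0]):
        for b in range(k.shape[1]):
            out += k[a, b] * np.roll(np.roll(x, -a, axis=0), -b, axis=1)
    return out

rng = np.random.default_rng(0)
x = rng.standard_normal((8, 8))   # stand-in for a digit on an input canvas
k = rng.standard_normal((3, 3))   # stand-in for a learned filter

# Translating the input, then convolving, equals convolving, then translating:
shift_then_conv = circ_conv2d(np.roll(x, 2, axis=1), k)
conv_then_shift = np.roll(circ_conv2d(x, k), 2, axis=1)
print(np.allclose(shift_then_conv, conv_then_shift))  # True
```

With circular boundaries the two orders agree to machine precision; zero padding would break the exact equality near the borders, which is exactly the caveat raised further down the thread.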

Oct 5, 2020 · 10:12 AM UTC

Replying to @chriswolfvision
That's awesome! What did you use to create the animation?
Thanks :) All done by hand, ~500 lines of PyTorch; only ~250 lines are the visualization code, the rest is standard boilerplate for model training and validation. I considered using libraries like pyrender, but found this quicker (4h of coding on a Sunday afternoon).
Replying to @chriswolfvision
Even fully convolutional layers without pooling aren't perfectly translation equivariant: they can and will encode position information. "On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location" arxiv.org/abs/2003.07064
Yes, through boundary effects; I forgot to mention that. Thanks.
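The boundary effect conceded here is easy to see in isolation. A small sketch (not from the paper) using a zero-padded "same" cross-correlation: feed in a constant image, and the output already differs between the border and the interior, so the absolute position leaks into the activations.

```python
import numpy as np

def same_conv2d(x, k):
    # Zero-padded "same" cross-correlation with a 3x3 kernel
    xp = np.pad(x, 1)  # zero padding is the boundary effect in question
    out = np.zeros_like(x, dtype=float)
    for a in range(3):
        for b in range(3):
            out += k[a, b] * xp[a:a + x.shape[0], b:b + x.shape[1]]
    return out

x = np.ones((6, 6))        # locally identical input everywhere
k = np.ones((3, 3))
y = same_conv2d(x, k)
print(y[0, 0], y[3, 3])    # 4.0 9.0: the corner "sees" the padding zeros
```

The local input is the same at every pixel, yet the corner output is 4 and the interior output is 9, so a downstream layer can read off how far a pixel is from the image border.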
Beautiful animation, and I am looking forward to reading your blog post. Looks like gold!
Replying to @chriswolfvision
The problem comes when using strides :/
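Right: with a stride, equivariance only survives for shifts that are multiples of the stride. A minimal 1-D sketch (illustrative, not from the thread's code): a spike shifted by 1 under a stride-2 conv gives an output that is not any shift of the original, while a shift by the stride shifts the output cleanly.

```python
import numpy as np

def strided_conv1d(x, k, stride=2):
    # 'valid' cross-correlation, then stride-2 subsampling
    return np.correlate(x, k, mode="valid")[::stride]

k = np.array([1.0, 2.0, 3.0])
x = np.zeros(12)
x[4] = 1.0                             # a spike at position 4

y0 = strided_conv1d(x, k)              # [0. 3. 1. 0. 0.]
y1 = strided_conv1d(np.roll(x, 1), k)  # [0. 0. 2. 0. 0.]  not a shift of y0
y2 = strided_conv1d(np.roll(x, 2), k)  # [0. 0. 3. 1. 0.]  y0 shifted by one output step
print(y0, y1, y2)
```

Shifting by the stride (2) moves the output by one sample; shifting by 1 lands the spike on the discarded phase of the subsampling grid, so the response changes entirely.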
Replying to @chriswolfvision
We do very similar visualisations to analyse the equivariance of our segmentation algorithms.
Replying to @chriswolfvision
Equivariance to translation, right? Is it equivariant to rotation as well?