Papers Read on AI

May 3, 2022  

Patches Are All You Need?

We propose the ConvMixer, an extremely simple model that is similar in spirit to the ViT and the even-more-basic MLP-Mixer in that it operates directly on patches as input, separates the mixing of spatial and channel dimensions, and maintains equal size and resolution throughout the network. ConvMixer outperforms the ViT, MLP-Mixer, and some of their variants for similar parameter counts and data set sizes, in addition to outperforming classical vision models such as the ResNet.
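The three properties named in the abstract (patches as input, separate spatial and channel mixing, constant size and resolution) can be made concrete with a small shape and parameter trace. What follows is a minimal sketch, not the authors' code. It assumes the layer layout described in the paper: a strided convolution for patch embedding, then repeated blocks of a depthwise convolution (spatial mixing) and a 1x1 pointwise convolution (channel mixing), each followed by batch normalization, and a linear classifier after global pooling. The names `ConvMixerConfig`, `trace_shapes`, and `count_parameters` are hypothetical, and the example sizes are illustrative rather than a configuration reported in the paper.

```python
from dataclasses import dataclass


@dataclass
class ConvMixerConfig:
    # All field names are illustrative; they are not taken from the paper's code.
    hidden_dim: int        # channel width, held constant through every block
    depth: int             # number of mixing blocks
    patch_size: int        # side length of each square input patch
    kernel_size: int       # side length of the depthwise (spatial) kernel
    in_channels: int = 3
    num_classes: int = 10


def trace_shapes(cfg, image_size):
    """Return (stage name, (channels, height, width)) for each stage."""
    if image_size % cfg.patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    side = image_size // cfg.patch_size
    shape = (cfg.hidden_dim, side, side)
    stages = [("patch_embedding", shape)]
    for i in range(cfg.depth):
        # Depthwise convolution with 'same' padding mixes spatial locations only.
        stages.append((f"block_{i}_spatial_mix", shape))
        # 1x1 pointwise convolution mixes channels only.
        stages.append((f"block_{i}_channel_mix", shape))
    return stages


def count_parameters(cfg):
    """Count learnable parameters, assuming biased convolutions and
    batch normalization (scale and shift) after every convolution."""
    h, p, k = cfg.hidden_dim, cfg.patch_size, cfg.kernel_size
    batch_norm = 2 * h
    patch_embedding = cfg.in_channels * h * p * p + h + batch_norm
    depthwise = h * k * k + h + batch_norm   # one k x k filter per channel
    pointwise = h * h + h + batch_norm       # full channel-to-channel mixing
    head = h * cfg.num_classes + cfg.num_classes
    return patch_embedding + cfg.depth * (depthwise + pointwise) + head


if __name__ == "__main__":
    cfg = ConvMixerConfig(hidden_dim=256, depth=8, patch_size=2, kernel_size=5)
    for name, shape in trace_shapes(cfg, image_size=32):
        print(name, shape)
    print("parameters:", count_parameters(cfg))
```

Every stage reports the same shape, which is the "equal size and resolution" property. Under these assumptions the pointwise term grows with the square of the width while the depthwise term grows only linearly, so most parameters sit in the channel-mixing layers.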

2022: Asher Trockman, J. Z. Kolter

Ranked #80 on Image Classification on CIFAR-10

https://arxiv.org/pdf/2201.09792v1.pdf