Abstract: This paper shows that masked autoencoders (MAE) are scalable self-supervised learners for computer vision. Our MAE approach is simple: we mask random patches of the input image and ...
A paper crane that flaps its wings without a single motor inside it sounds like a magic trick. But engineers at Princeton ...