papers_we_read

Siamese Masked Autoencoders

Authors: Agrim Gupta, Jiajun Wu, Jia Deng, Li Fei-Fei
NeurIPS 2023

## Summary

This paper introduces a neural network architecture called SiamMAE Network, which is a simple extension of Masked Autoencoders (MAE) for learning visual correspondence from videos. Here, the asymmetric masking approach is explored, which encourages the network to model object motion, or in other words, to understand what went where

Contributions

Methodology

Key Components

Working

Results

Here, The comparison against prior work is done and shows better performance.


General Qualitative Results are shown here concerning Object propagation, Pose propagation, and Semantic Propagation.

Our Two Cents

Resources

Project Page: https://siam-mae-video.github.io/