Papers We Read


This repo houses summaries of various exciting works in the field of Deep Learning. You can contribute summaries of your own; check out our contributing guide to get started. Happy Reading & Summarizing!




  • Human-level play in the game of Diplomacy by combining language models with strategic reasoning [Paper][Review]

    • Meta Fundamental AI Research Diplomacy Team (FAIR), Anton Bakhtin, Noam Brown, Emily Dinan, Science 2022
  • Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding [Paper][Review]

    • Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. Sara Mahdavi, Rapha Gontijo Lopes, Tim Salimans, Jonathan Ho, David J Fleet, Mohammad Norouzi, NIPS 2022
  • Learning Video Representations from Large Language Models [Paper][Review]

    • Yue Zhao, Ishan Misra, Philipp Krähenbühl, Rohit Girdhar, Facebook AI Research - Meta AI, University of Texas, Austin


  • GANcraft: Unsupervised 3D Neural Rendering of Minecraft Worlds [Paper][Review]

    • Zekun Hao, Arun Mallya, Serge Belongie, Ming-Yu Liu, ICCV 2021
  • GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields [Paper][Review]

    • Michael Niemeyer, Andreas Geiger, CVPR 2021
  • Creative Sketch Generation [Paper][Review]

    • Songwei Ge, Devi Parikh, Vedanuj Goswami & C. Lawrence Zitnick, ICLR 2021
  • Binary TTC: A Temporal Geofence for Autonomous Navigation [Paper][Review]

    • Abhishek Badki, Orazio Gallo, Jan Kautz, Pradeep Sen, CVPR 2021


  • Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild [Paper][Review]

    • Shangzhe Wu, Christian Rupprecht, Andrea Vedaldi, CVPR 2020
  • You Only Train Once: Loss-conditional training of deep networks [Paper][Review]

    • Alexey Dosovitskiy, Josip Djolonga, ICLR 2020
  • GrokNet: Unified Computer Vision Model Trunk and Embeddings For Commerce [Paper][Review]

    • Sean Bell, Yiqun Liu, Sami Alsheikh, Yina Tang, Ed Pizzi, M. Henning, Karun Singh, Omkar Parkhi, Fedor Borisyuk, KDD 2020
  • Semantically Multi-modal Image Synthesis [Paper][Review]

    • Zhen Zhu, Zhiliang Xu, Ansheng You, Xiang Bai, CVPR 2020
  • Learning to Simulate Dynamic Environments with GameGAN [Paper][Review]

    • Seung Wook Kim, Yuhao Zhou, Jonah Philion, Antonio Torralba, Sanja Fidler, CVPR 2020
  • Adversarial Policies: Attacking Deep Reinforcement Learning [Paper][Review]

    • Adam Gleave, Michael Dennis, Cody Wild, Neel Kant, Sergey Levine, Stuart Russell, ICLR 2020
  • Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning [Paper][Review]

    • Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu, Rémi Munos, Michal Valko, NeurIPS 2020


  • ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks [Paper][Review]

    • Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee, NIPS 2019
  • Stand-Alone Self-Attention in Vision Models [Paper][Review]

    • Prajit Ramachandran, Niki Parmar, Ashish Vaswani, Irwan Bello, Anselm Levskaya, Jonathon Shlens, NIPS 2019
  • Zero-Shot Entity Linking by Reading Entity Descriptions [Paper][Review]

    • Lajanugen Logeswaran, Ming-Wei Chang, Kenton Lee, Kristina Toutanova, Jacob Devlin, Honglak Lee, ACL-2019
  • Do you know that Florence is packed with visitors? Evaluating state-of-the-art models of speaker commitment [Paper][Review]

    • Nanjiang Jiang, Marie-Catherine de Marneffe, ACL-2019
  • Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations [Paper][Review]

    • Vincent Sitzmann, Michael Zollhöfer, Gordon Wetzstein, NIPS-2019
  • Emotion-Cause Pair Extraction: A New Task to Emotion Analysis in Texts [Paper][Review]

    • Rui Xia, Zixiang Ding, ACL-2019
  • Putting an End to End-to-End: Gradient-Isolated Learning of Representations [Paper][Review]

    • Sindy Löwe, Peter O’Connor, Bastiaan S. Veeling, NIPS-2019
  • Bridging the Gap between Training and Inference for Neural Machine Translation [Paper][Review]

    • Wen Zhang, Yang Feng, Fandong Meng, Di You, Qun Liu, ACL-2019
  • Designing and Interpreting Probes with Control Tasks [Paper][Review]

    • John Hewitt, Percy Liang, EMNLP-2019
  • Specializing Word Embeddings (for Parsing) by Information Bottleneck [Paper][Review]

    • Xiang Lisa Li, Jason Eisner, EMNLP-2019
  • vGraph: A Generative Model for Joint Community Detection and Node Representational Learning [Paper][Review]

    • Fan-Yun Sun, Meng Qu, Jordan Hoffmann, Chin-Wei Huang, Jian Tang, NIPS-2019
  • Uniform convergence may be unable to explain generalization in deep learning [Paper][Review]

    • Vaishnavh Nagarajan, J. Zico Kolter, NIPS-2019
  • SinGAN: Learning a Generative Model from a Single Natural Image [Paper][Review]

    • Tamar Rott Shaham, Tali Dekel, Tomer Michaeli, ICCV-2019
  • Graph U-Nets [Paper][Review]

    • Hongyang Gao, Shuiwang Ji, ICML-2019
  • Feature Denoising for Improving Adversarial Robustness [Paper][Review]

    • Cihang Xie, Yuxin Wu, Laurens van der Maaten, Alan Yuille, Kaiming He, CVPR-2019
  • This Looks Like That: Deep Learning for Interpretable Image Recognition [Paper][Review]

    • Chaofan Chen, Oscar Li, Chaofan Tao, Alina Jade Barnett, Jonathan Su, Cynthia Rudin, NIPS-2019


  • CyCADA: Cycle-Consistent Adversarial Domain Adaptation [Paper][Review]

    • Judy Hoffman, Eric Tzeng, Taesung Park, Jun-Yan Zhu, Phillip Isola, Kate Saenko, Alexei A. Efros, Trevor Darrell, ICML-2018


  • Unpaired Image-to-Image Translation using Cycle Consistent Adversarial Networks [Paper][Review]

    • Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros, ICCV-2017
  • Densely Connected Convolutional Networks [Paper][Review]

    • Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger, CVPR-2017
  • On Calibration of Modern Neural Networks [Paper][Review]

    • Chuan Guo, Geoff Pleiss, Yu Sun, Kilian Q. Weinberger, ICML-2017


  • Siamese Recurrent Architectures for Learning Sentence Similarity [Paper][Review]

    • Jonas Mueller, Aditya Thyagarajan, AAAI-2016


We appreciate all contributions to the set of summaries. Please refer to the contributing guide for the contribution guidelines.


papers_we_read is an open-source repository that welcomes contributions and feedback. We hope this collection of summaries helps the DL community get into the practice of reading and understanding research papers, a valuable skill in research. Most of our contributors are students enrolled in undergraduate programmes. We are grateful for all the contributions that help improve this collection of summaries.


This repo is open-sourced under the MIT License.