Awesome

Self-Supervised Learning

This repo contains a curated list of self-supervised learning papers with a focus on representation learning and clustering. Feel free to add pull requests or open issues to suggest papers.

Representation Learning
Image Clustering
Geometry

Representation Learning

Analysis

W. Van Gansbeke, S. Vandenhende, S. Georgoulis, L. Van Gool, Revisiting Contrastive Methods for Unsupervised Learning of Visual Representation, NeurIPS, 2021. [Code]
R. Geirhos, K. Narayanappa, B. Mitzkus, M. Bethge, F. A. Wichmann, W. Brendel, On the surprising similarities between supervised and self-supervised models, ICLR, 2021.
X. Liu, F. Zhang, Z. Hou, L. Mian, Z. Wang, J. Zhang, J. Tang, Self-supervised Learning: Generative or Contrastive, Arxiv, 2020.
L. Jing, Y. Tian, Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey, T-PAMI, 2020.
S. Purushwalkam, A. Gupta, Demystifying Contrastive Self-Supervised Learning: Invariances, Augmentations and Dataset Biases, NIPS, 2020.
Alejandro Newell, Jia Deng, How Useful is Self-Supervised Pretraining for Visual Tasks?, CVPR, 2020. [Code]
Y. M. Asano, C. Rupprecht, A. Vedaldi, A critical analysis of self-supervision, or what we can learn from a single image, ICLR, 2020. [Code]

Image-Level Representation Learning

Contrastive Learning

X. Chen, S. Xie, K. He, An Empirical Study of Training Self-Supervised Vision Transformers, ICCV, 2021.
P. Goyal, M. Caron, B. Lefaudeux, M. Xu, P. Wang, V. Pai, M. Singh, V. Liptchinsky, I. Misra, A. Joulin, P. Bojanowski, Self-supervised Pretraining of Visual Features in the Wild, Arxiv, 2021.
X. Chen, K. He, Exploring Simple Siamese Representation Learning, CVPR, 2021.
J. Grill, F. Strub, F. Altché, C. Tallec, P. H. Richemond, E. Buchatskaya, C. Doersch, B. A. Pires, Z. Guo, M. G. Azar, B. Piot, K. Kavukcuoglu, R. Munos, M. Valko, Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning, NeurIPS, 2020. [Code]
T. Chen, S. Kornblith, M. Norouzi, G. Hinton, A Simple Framework for Contrastive Learning of Visual Representations, ICML, 2020. [Code]
M. Tschannen, J. Djolonga, P. K. Rubenstein, S. Gelly, M. Lucic, On Mutual Information Maximization for Representation Learning, ICLR, 2020. [Code]
Y. Tian, D. Krishnan, P. Isola, Contrastive Multiview Coding, ECCV, 2020. [Code]
K. He, H. Fan, Y. Wu, S. Xie, R. Girshick, Momentum Contrast for Unsupervised Visual Representation Learning, CVPR, 2020. [Code]
I. Misra, L. Maaten, Self-Supervised Learning of Pretext-Invariant Representations, CVPR, 2020.
O. Henaff, A. Razavi, C. Doersch, S. Eslami, A. Oord, Data-Efficient Image Recognition with Contrastive Predictive Coding, ICML, 2020.
P. Bachman, R. D. Hjelm, W. Buchwalter, Learning Representations by Maximizing Mutual Information Across Views, NIPS, 2019. [Code]
R. D. Hjelm, A. Fedorov, S. Lavoie-Marchildon, K. Grewal, P. Bachman, A. Trischler, Y. Bengio, Learning deep representations by mutual information estimation and maximization, ICLR, 2019. [Code]
J. Huang, Q. Dong, S. Gong, X. Zhu, Unsupervised Deep Learning by Neighbourhood Discovery, ICML, 2019. [Code]
A. Oord, Y. Li, O. Vinyals, Representation Learning with Contrastive Predictive Coding, Arxiv, 2018.
Z. Wu, Y. Xiong and X. Y. Stella, D. Lin, Unsupervised Feature Learning via Non-parameteric Instance Discrimination, CVPR, 2018. [Code]
Wang, Xiaolong and He, Kaiming and Gupta, Abhinav, Transitive Invariance for Self-supervised Visual Representation Learning, ICCV, 2017.
Li, Dong and Hung, Wei-Chih and Huang, Jia-Bin and Wang, Shengjin and Ahuja, Narendra and Yang, Ming-Hsuan, Unsupervised Visual Representation Learning by Graph-based Consistent Constraints, ECCV, 2016. [Code]

Clustering

M. Caron, I. Misra, J. Mairal, P. Goyal, P. Bojanowski, A. Joulin, Unsupervised Learning of Visual Features by Contrasting Cluster Assignments, NIPS, 2020. [Code]
Y. M. Asano, C. Rupprecht, A. Vedaldi, Self-labelling via simultaneous clustering and representation learning, ICLR, 2020. [Code]
X. Yan, I. Misra, A. Gupta, D. Ghadiyaram, D. Mahajan, ClusterFit: Improving Generalization of Visual Representations, CVPR, 2020.
M. Caron, P. Bojanowski, J. Mairal, A. Joulin, Unsupervised Pre-Training of Image Features on Non-Curated Data, ICCV, 2019. [Code]
M. Caron, P. Bojanowski, A. Joulin, M. Douze, Deep Clustering for Unsupervised Learning of Visual Features, ECCV, 2018. [Code]
J. Yang, D. Parikh, D. Batra, Joint Unsupervised Learning of Deep Representations and Image Clusters, CVPR, 2016. [Code]
J. Xie, R. Girshick, A. Farhadi, Unsupervised Deep Embedding for Clustering Analysis, ICML, 2016. [Code]

Masked Image Modeling

X. Dong, J. Bao, T. Zhang, D. Chen, W. Zhang, L. Yuan, D. Chen, F. Wen, N. Yu, PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers, Arxiv, 2022.
R. Wang, D. Chen, Z. Wu, Y. Chen, X. Dai, M. Liu, Y. Jiang, L. Zhou, L. Yuan, BEVT: BERT Pretraining of Video Transformers, CVPR, 2022.
H. Bao, L. Dong, F. Wei, BEIT: BERT Pre-Training of Image Transformers, Arxiv, 2021.
K. He, X. Chen, S. Xie, Y. Li, P. Dollár, R. Girshick, Masked Autoencoders Are Scalable Vision Learners, Arxiv, 2021.
Z. Xie Z. Zhang, Y. Cao, Y. Lin, J. Bao, Z. Yao, Q. Dai, H. Hu SimMIM: A Simple Framework for Masked Image Modeling, Arxiv, 2021.
M. Chen, A. Radford, R. Child, J. Wu, H. Jun, P. Dhariwal, D. Luan, I Sutskever, Generative Pretraining from Pixels, ICML, 2020.

Proxy Tasks

X. Zhan, X. Pan, Z. Liu, D. Lin, C. C. Loy, Self-Supervised Learning via Conditional Motion Propagation, CVPR, 2019. [Code]
Z. Feng, C. Xu, D. Tao, Self-Supervised Representation Learning by Rotation Feature Decoupling, CVPR, 2019. [Code]
A. Kolesnikov, X. Zhai, L. Beye, Revisiting Self-Supervised Visual Representation Learning, CVPR, 2019. [Code]
T. Chen, X. Zhai, M. Ritter, M. Lucic, N. Houlsby, Self-Supervised GANs via Auxiliary Rotation Loss, CVPR, 2019. [Code]
L. Zhang, G. Qi, L. Wang, J. Luo, AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transformations rather than Data, CVPR, 2019. [Code]
X. Zhai, A. Oliver, A. Kolesnikov, L. Beyer, S4L: Self-Supervised Semi-Supervised Learning, ICCV, 2019. [Code]
S. Gidaris, P. Singh, N. Komodakis, Unsupervised Representation Learning by Predicting Image Rotations, ICLR, 2018 [Code]
S. Jenni, P. Favaro, Self-Supervised Feature Learning by Learning to Spot Artifacts, CVPR, 2018. [Code]
M. Noroozi, A. Vinjimoor, P. Favaro, H. Pirsiavash, Boosting Self-Supervised Learning via Knowledge Transfer, CVPR, 2018.
A. Mahendran, J. Thewlis, A. Vedaldi, Cross Pixel Optical-Flow Similarity for Self-Supervised Learning, ACCV, 2018.
M. Noroozi, H. Pirsiavash, P. Favaro, Representation Learning by Learning to Count, ICCV, 2017. [Code]
R. Zhang, P. Isola, A. Efros, Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction, CVPR, 2017. [Code]
D. Pathak, R. Girshick, P. Dollar, T. Darrell, B. Hariharan, Learning Features by Watching Objects Move, CVPR, 2017. [Code]
G. Larsson, M. Maire, G. Shakhnarovich, Colorization as a Proxy Task for Visual Understanding, CVPR, 2017. [Code]
R. S. Cruz, B. Fernando, A. Cherian, S. Gould, DeepPermNet: Visual Permutation Learning, CVPR, 2017. [Code]
H. Lee, J. Huang, M. K. Singh, M. Yang, Unsupervised Representation Learning by Sorting Sequences, ICCV, 2017. [Code]
P. Bojanowski, A. Joulin, Unsupervised Learning by Predicting Noise, ICML, 2017. [Code]
M. Noroozi, P. Favaro, Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles, ECCV, 2016. [Code]
G. Larsson, M. Maire, G. Shakhnarovich, Learning Representations for Automatic Colorization, ECCV, 2016. [Code]
R. Zhang, P. Isola, A. Efros, Colorful Image Colorization, ECCV, 2016. [Code]
D. Pathak, P. Krahenbuhl, J. Donahue, T. Darrell, A. Efros, Context Encoders: Feature Learning by Inpainting, CVPR, 2016. [Code]
P. Agrawal, J. Carreira, J. Malik, Learning to See by Moving, ICCV, 2015. [Code]
C. Doersch, A. Gupta, A. Efros, Unsupervised Visual Representation Learning by Context Prediction, ICCV, 2015. [Code]

Dense Representation Learning

W. Van Gansbeke, S. Vandenhende, S. Georgoulis, L. Van Gool, Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals, Arxiv, 2021. [Code]
X. Wang, R. Zhang, C. Shen, T. Kong, L. Li, Dense Contrastive Learning for Self-Supervised Visual Pre-Training, Arxiv, 2020.
Z. Xie, Y. Lin, Z. Zhang, Y. Cao, S. Lin, H. Hu, Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, Arxiv, 2020. [Code]
A. Jabri, A, Owens, A. Efros, Space-Time Correspondence as a Contrastive Random Walk, NIPS, 2020. [Code]
P. O. Pinheiro, A. Almahairi, R. Y. Benmalek, F. Golemo, A. Courville, Unsupervised Learning of Dense Visual Representations, NIPS, 2020.
X. Ji, J. F. Henriques, A. Vedaldi, Invariant Information Clustering for Unsupervised Image Classification and Segmentation, ICCV, 2019. [Code]
X. Wang, A. Jabri, A. Efros. Learning Correspondence from the Cycle-Consistency of Time, CVPR, 2019. [Code]
T. Zhou, P. Krahenbuhl, M. Aubry, Q. Huang, A. Efros.Learning dense correspondence via 3d-guided cycle consistency, CVPR, 2016.

Image Clustering

W. Van Gansbeke, S. Vandenhende, S. Georgoulis, M. Proesmans, L. Van Gool, SCAN: Learning to Classify Images without Labels, ECCV, 2020. [Code]
X. Ji, J. F. Henriques, A. Vedaldi, Invariant Information Clustering for Unsupervised Image Classification and Segmentation, ICCV, 2019. [Code]
J. Chang, L. Wang, G. Meng, S. Xiang, C. Pan, Deep Adaptive Image Clustering, ICCV, 2017. [Code]
J. Yang, D. Parikh, D. Batra, Joint unsupervised learning of deep representations and image clusters, CVPR, 2016. [Code]
J. Xie, R. Girshick, A. Farhadi, Unsupervised Deep Embedding for Clustering Analysis, ICML, 2016. [Code]

Geometry

C. Godard, O. M. Aodha, M. Firman, G. J. Brostow, Digging into Self-Supervised Monocular Depth Prediction, ICCV, 2019. [Code]
J. Wang, J. Jiao, L. Bao, S. He, Y. Liu, W. Liu, SelFlow: Self-Supervised Learning of Optical Flow, CVPR, 2019. [Code]
C. Godard, O. <. Aodha, G. J. Brostow, Unsupervised Monocular Depth Estimation with Left-Right Consistency, CVPR, 2017. [Code]
T. Zhou, M. Brown, N. Snavely, D. G. Lowe, Unsupervised Learning of Depth and Ego-Motion from Video, CVPR, 2017. [Code]

Awesome

Self-Supervised Learning

Table of Contents

Representation Learning

Analysis

Image-Level Representation Learning

Contrastive Learning

Clustering

Masked Image Modeling

Proxy Tasks

Dense Representation Learning

Image Clustering

Geometry