Home

Awesome

PASS: Pictures without humAns for Self-Supervised Pretraining

TL;DR: An ImageNet replacement dataset for self-supervised pretraining without humans

img.png

Content

PASS is a large-scale image dataset that does not include any humans, human parts, or other personally identifiable information that can be used for high-quality pretraining while significantly reducing privacy concerns.

pass.gif

Download the dataset

The quickest way:

git clone https://github.com/yukimasano/PASS
cd PASS
source download.sh # maybe change the directory where you want to download it

Generally: all information is on our webpage.

For downloading the dataset, please visit our dataset on zenodo. There you can download it in tar files and find the meta-data.

You can also download the images from their AWS urls, from here.

Pretrained models

PretrainingMethodEpochsIN-1k Acc.Places205 Acc.
(IN-1k)MoCo-v2 20060.650.1visit MoCo-v2 repo
PASSMoCo-v218059.152.8R50 weights
PASSMoCo-v220059.552.8R50 weights
PASSMoCo-v280061.254.0R50 weights
PASSMoCo-v2 (R18)80045.344.4R18 weights
PASSMoCo-v2-CLD20060.253.1R50 weights
PASSSwAV20060.855.5R50 weights
PASSDINO10061.354.6ViT S16 weights
PASSDINO30065.055.7ViT S16 weights

In the table above we give the download links to the full checkpoints (including momentum encoder etc.) to the models we've trained. For comparison, we include MoCo-v2 trained on ILSVRC-12 ("IN-1k") and report linear probing performance on IN-1k and Places205.

Pretrained models from PyTorch Hub

import torch
vits16_100ep = torch.hub.load('yukimasano/PASS:main', 'dino_100ep_vits16')
vits16 = torch.hub.load('yukimasano/PASS:main', 'dino_vits16')
r50_swav_200ep = torch.hub.load('yukimasano/PASS:main', 'swav_resnet50')
r50_moco_800ep = torch.hub.load('yukimasano/PASS:main', 'moco_resnet50')
r50_moco_cld_200ep = torch.hub.load('yukimasano/PASS:main', 'moco_cld_resnet50')

PASSify your dataset

In the folder PASSify of this repo, you can find automated scripts that try to remove humans from image datasets.

Contribute your models

Please let us know if you have a model pretrained on this dataset and I will add this to the list above.

Citation

@Article{asano21pass,
author = "Yuki M. Asano and Christian Rupprecht and Andrew Zisserman and Andrea Vedaldi",
title = "PASS: An ImageNet replacement for self-supervised pretraining without humans",
journal = "NeurIPS Track on Datasets and Benchmarks",
year = "2021"
}