Imagenette

🎶 Imagenette, gentille imagenette,

Imagenette, je te plumerai. 🎶

(Imagenette theme song thanks to Samuel Finlayson)


NB:


The Datasets

Imagenette

Imagenette is a subset of 10 easily classified classes from Imagenet (tench, English springer, cassette player, chain saw, church, French horn, garbage truck, gas pump, golf ball, parachute).

'Imagenette' is pronounced just like 'Imagenet', except with a corny inauthentic French accent. If you've seen Peter Sellers in The Pink Panther, then think something like that. It's important to ham up the accent as much as possible, otherwise people might not be sure whether you're referring to "Imagenette" or "Imagenet". (Note to native French speakers: to avoid confusion, be sure to use a corny inauthentic American accent when saying "Imagenet". Think something like the philosophy restaurant skit from Monty Python's The Meaning of Life.)

The '320 px' and '160 px' versions have their shortest side resized to that size, with their aspect ratio maintained.
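To make the resizing rule concrete, here is a minimal sketch of how the output dimensions could be computed when the shortest side is scaled to a target size and the aspect ratio is kept. The function name `resize_shortest_side` and the round-to-nearest rule for the longer side are assumptions for illustration; the tool actually used to build the datasets is not specified here.

```python
from typing import Tuple


def resize_shortest_side(width: int, height: int, target: int) -> Tuple[int, int]:
    """Return (new_width, new_height) so that the shorter side equals `target`.

    Hypothetical helper: the longer side is scaled by the same factor and
    rounded to the nearest pixel, which is an assumption of this sketch.
    """
    if width <= 0 or height <= 0 or target <= 0:
        raise ValueError("width, height and target must be positive")
    scale = target / min(width, height)
    if width <= height:
        # Width is the shorter (or equal) side, so it becomes exactly `target`.
        return target, max(target, round(height * scale))
    # Height is the shorter side.
    return max(target, round(width * scale)), target


# A 500x375 landscape image in the '160 px' version:
print(resize_shortest_side(500, 375, 160))  # (213, 160)
```

The same call with `target=320` gives the dimensions for the '320 px' version.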

The dataset also comes with a CSV file with 1%, 5%, 25%, and 50% of the labels randomly changed to an incorrect label. More information about the noisy labels is provided in the "noisy_labels" folder. Leaderboards for 5% noise and 50% noise are maintained below.
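As an illustration of what "randomly changed to an incorrect label" means, the sketch below flips a chosen fraction of labels to a different class. It is not the procedure used to build the shipped CSV file (see the "noisy_labels" folder for that); the function name `add_label_noise`, the uniform choice among wrong classes, and the seed handling are all assumptions.

```python
import random
from typing import List, Sequence


def add_label_noise(labels: Sequence[str], classes: Sequence[str],
                    fraction: float, seed: int = 0) -> List[str]:
    """Return a copy of `labels` with `fraction` of entries set to a wrong class."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be in [0, 1]")
    if len(set(classes)) < 2:
        raise ValueError("need at least two classes to pick an incorrect label")
    rng = random.Random(seed)
    noisy = list(labels)
    n_flip = round(fraction * len(noisy))
    # Sample distinct positions so exactly n_flip labels are changed.
    for i in rng.sample(range(len(noisy)), n_flip):
        # Pick uniformly among the other classes, so the new label is always wrong.
        wrong = [c for c in classes if c != noisy[i]]
        noisy[i] = rng.choice(wrong)
    return noisy


classes = ["tench", "church", "parachute"]
clean = classes * 100  # 300 toy labels
noisy = add_label_noise(clean, classes, fraction=0.05)
changed = sum(a != b for a, b in zip(clean, noisy))
print(changed)  # 15
```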

Too easy for you? In that case, you might want to try Imagewoof.

Imagewoof

Imagewoof is a subset of 10 classes from Imagenet that aren't so easy to classify, since they're all dog breeds. The breeds are: Australian terrier, Border terrier, Samoyed, Beagle, Shih-Tzu, English foxhound, Rhodesian ridgeback, Dingo, Golden retriever, Old English sheepdog. (No, we will not enter into any discussion as to whether a dingo is in fact a dog. Any suggestions to the contrary are un-Australian. Thank you for your cooperation.)

The dataset also comes with a CSV file with 1%, 5%, 25%, and 50% of the labels randomly changed to an incorrect label. More information about the noisy labels is provided in the "noisy_labels" folder.

Imagewoof too easy for you too?!? Then get your hands on Image网.

Image网

Image网 is pronounced "Imagewang"; 网 means "net" in Chinese! Image网 contains Imagenette and Imagewoof combined, but with some twists that make it into a tricky semi-supervised unbalanced classification problem:

Why Imagenette?

I (Jeremy Howard, that is) mainly made Imagenette because I wanted a small vision dataset I could use to quickly see if my algorithm ideas might have a chance of working. They normally don't, but testing them on Imagenet takes a really long time for me to find that out, especially because I'm interested in algorithms that perform particularly well at the end of training.

But I think this can be a useful dataset for others as well.

Usage

If you are already using the fastai library, you can download and access these quickly with commands like:

path = untar_data(URLs.IMAGENETTE_160)

where path now holds the local location of the downloaded Imagenette-160 data.
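Once the data is on disk, a quick sanity check is to count the images per class. The sketch below uses only the standard library and assumes a layout with one subfolder per class inside a split folder such as train/; that layout, the helper name `count_images_per_class`, and the list of file extensions are assumptions, so adjust them to what you actually find under path.

```python
from collections import Counter
from pathlib import Path

# Extensions treated as images in this sketch (an assumption).
IMAGE_SUFFIXES = {".jpeg", ".jpg", ".png"}


def count_images_per_class(split_dir: Path) -> Counter:
    """Count image files in each class subfolder of `split_dir`."""
    counts = Counter()
    for class_dir in sorted(p for p in split_dir.iterdir() if p.is_dir()):
        counts[class_dir.name] = sum(
            1 for f in class_dir.iterdir()
            if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
        )
    return counts


# Example, assuming `path` came from untar_data and contains a train/ folder:
# print(count_images_per_class(Path(path) / "train"))
```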

For researchers

For students

Tips

Leaderboard

Generally you'll see +/- 1% differences from run to run, since it's quite a small validation set. So please only send in contributions that beat the reported accuracy more than 80% of the time. Here are the rules:

Imagenette Leaderboard

| Size (px) | Epochs | URL | Accuracy | # Runs |
|---|---|---|---|---|
| 128 | 5 | fastai2 train_imagenette.py 2020-10 + MaxBlurPool + tuned hyperparams | 87.43% | 5, mean |
| 128 | 20 | fastai2 train_imagenette.py 2020-01 + MaxBlurPool | 91.57% | 5, mean |
| 128 | 80 | fastai2 train_imagenette.py 2020-01 | 93.55% | 1 |
| 128 | 200 | fastai2 train_imagenette.py 2020-01 | 94.24% | 1 |
| 192 | 5 | fastai2 train_imagenette.py 2020-01 + MaxBlurPool | 86.76% | 5, mean |
| 192 | 20 | fastai2 train_imagenette.py 2020-01 + MaxBlurPool | 92.50% | 5, mean |
| 192 | 80 | fastai2 train_imagenette.py 2020-01 | 94.50% | 1 |
| 192 | 200 | fastai2 train_imagenette.py 2020-01 | 95.03% | 1 |
| 256 | 5 | fastai2 train_imagenette.py 2020-01 + MaxBlurPool | 86.85% | 5, mean |
| 256 | 20 | fastai2 train_imagenette.py 2020-01 + MaxBlurPool | 93.53% | 5, mean |
| 256 | 80 | fastai2 train_imagenette.py 2020-01 | 94.90% | 1 |
| 256 | 200 | fastai2 train_imagenette.py 2020-01 | 95.11% | 1 |

Imagenette w/Label Noise = 5%

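| Size (px) | Epochs | URL | Accuracy | # Runs |
|---|---|---|---|---|
| 128 | 5 | baseline | 83.44% | 1 |
| 128 | 20 | baseline | 89.53% | 1 |
| 128 | 80 | baseline | 89.30% | 1 |
| 128 | 200 | baseline | 90.04% | 1 |
| 192 | 5 | baseline | 84.13% | 1 |
| 192 | 20 | baseline | 90.65% | 1 |
| 192 | 80 | baseline | 91.01% | 1 |
| 192 | 200 | baseline | 91.08% | 1 |
| 256 | 5 | SESEMI | 88.87% ± 0.67 | 5, mean±std |
| 256 | 20 | baseline | 91.39% | 1 |
| 256 | 80 | SESEMI | 92.95% ± 0.12 | 3, mean±std |
| 256 | 200 | SESEMI | 93.96% ± 0.23 | 3, mean±std |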

Imagenette w/Label Noise = 50%

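| Size (px) | Epochs | URL | Accuracy | # Runs |
|---|---|---|---|---|
| 128 | 5 | baseline | 66.60% | 1 |
| 128 | 20 | baseline | 79.36% | 1 |
| 128 | 80 | baseline | 50.80% | 1 |
| 128 | 200 | baseline | 52.18% | 1 |
| 192 | 5 | baseline | 67.54% | 1 |
| 192 | 20 | baseline | 79.34% | 1 |
| 192 | 80 | baseline | 52.51% | 1 |
| 192 | 200 | baseline | 53.71% | 1 |
| 256 | 5 | SESEMI | 76.72% ± 0.83 | 5, mean±std |
| 256 | 20 | baseline | 79.21% | 1 |
| 256 | 80 | SESEMI | 57.76% ± 0.39 | 3, mean±std |
| 256 | 200 | SESEMI | 61.48% ± 0.33 | 3, mean±std |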

Imagewoof Leaderboard

| Size (px) | Epochs | URL | Accuracy | # Runs |
|---|---|---|---|---|
| 128 | 5 | depthwise(x6) | 76.61% | 5, mean |
| 128 | 20 | depthwise(x4) | 86.27% | 5, mean |
| 128 | 80 | depthwise(x4) | 87.83% | 1 |
| 128 | 200 | fastai2 train_imagenette.py 2020-01 | 87.20% | 1 |
| 192 | 5 | depthwise(x4) | 81.15% | 5, mean |
| 192 | 20 | depthwise(x4) | 88.37% | 5, mean |
| 192 | 80 | depthwise(x2) | 90.30% | 1 |
| 192 | 200 | fastai2 train_imagenette.py 2020-01 | 89.54% | 1 |
| 256 | 5 | Resnet Trick + Mish + Sa + MaxBlurPool | 78.84% | 5, mean |
| 256 | 20 | Resnet Trick + Mish + Sa + MaxBlurPool | 88.58% | 5, mean |
| 256 | 80 | fastai2 train_imagenette.py 2020-01 | 90.48% | 1 |
| 256 | 200 | fastai2 train_imagenette.py 2020-01 | 90.38% | 1 |

Image网 Leaderboard

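| Size (px) | Epochs | URL | Accuracy | # Runs |
|---|---|---|---|---|
| 128 | 5 | SwAV | 72.94% | 5, mean |
| 128 | 20 | SwAV | 72.18% | 3, mean |
| 128 | 80 | SwAV | 69.53% | 1 |
| 128 | 200 | SwAV | 66.04% | 1 |
| 192 | 5 | SwAV | 77.07% | 5, mean |
| 192 | 20 | SwAV | 77.81% | 3, mean |
| 192 | 80 | SwAV | 74.9% | 1 |
| 192 | 200 | SwAV | 71.77% | 1 |
| 256 | 5 | SwAV | 79.56% | 5, mean |
| 256 | 20 | SwAV | 79.2% | 3, mean |
| 256 | 80 | SESEMI | 78.41% ± 0.39 | 5, mean±std |
| 256 | 200 | SESEMI | 79.27% ± 0.20 | 3, mean±std |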
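
Before sending in a result, it can help to summarize your own runs in the same form the leaderboards use (mean, or mean±std) and to check them against the "more than 80% of the time" guidance in the Leaderboard section. The sketch below is one way to do that. The helper name `summarize_runs` and the run accuracies are hypothetical, and using the sample standard deviation is an assumption, since the leaderboards do not say which estimator their ± values use.

```python
from statistics import mean, stdev
from typing import Dict, Sequence


def summarize_runs(accuracies: Sequence[float], reported: float) -> Dict[str, float]:
    """Summarize repeated runs and compare them with a reported accuracy (in %)."""
    if not accuracies:
        raise ValueError("need at least one run")
    beat = sum(1 for a in accuracies if a > reported)
    return {
        "mean": mean(accuracies),
        # Sample standard deviation; undefined for a single run, so use 0.0.
        "std": stdev(accuracies) if len(accuracies) > 1 else 0.0,
        "beat_fraction": beat / len(accuracies),
    }


# Hypothetical accuracies from five runs, compared with the 87.43% entry
# (Imagenette, 128 px, 5 epochs).
runs = [87.9, 87.2, 88.1, 87.6, 87.8]
summary = summarize_runs(runs, reported=87.43)
qualifies = summary["beat_fraction"] > 0.8  # strictly more than 80% of runs

print(f"{summary['mean']:.2f}% ± {summary['std']:.2f}")
print(f"beat reported in {summary['beat_fraction']:.0%} of runs, qualifies: {qualifies}")
```

In this toy example four of the five runs beat the reported number, which is exactly 80% and so does not clear the "more than 80%" bar.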