Awesome
DeepBDC For Few-shot Larning
<div>       <img src="http://peihuali.org/DeepBDC/illustration.gif" width="80%"/> </div>Introduction
In this repo, we provide the implementation of the following paper:<br> "Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification" [Project] [Paper].
In this paper, we propose deep Brownian Distance Covariance (DeepBDC) for few-shot classification. DeepBDC can effectively learn image representations by measuring, for the query and support images, the discrepancy between the joint distribution of their embedded features and product of the marginals. The core of DeepBDC is formulated as a modular and efficient layer, which can be flexibly inserted into deep networks, suitable not only for meta-learning framework based on episodic training, but also for the simple transfer learning (STL) framework of pretraining plus linear classifier.<br>
If you find this repo helpful for your research, please consider citing our paper:<br>
@inproceedings{DeepBDC-CVPR2022,
title={Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification},
author={Jiangtao Xie and Fei Long and Jiaming Lv and Qilong Wang and Peihua Li},
booktitle={CVPR},
year={2022}
}
Few-shot classification Results
Experimental results on miniImageNet, CUB and tieredImageNet. We report average results with 2,000 randomly sampled episodes for both 1-shot and 5-shot evaluation. More details on the experiments can be seen in the paper.
miniImageNet
We followed DeepEMD for data preprocessing.
<table> <tr> <th rowspan="2" style="text-align:center;">Method</th> <th colspan="2" style="text-align:center;">ResNet-12</th> <th colspan="2" style="text-align:center;">Pre-trained models</th> <th colspan="2" style="text-align:center;">Meta-trained models</th> </tr> <tr> <th colspan="1" style="text-align:center;">5-way-1-shot</th> <th colspan="1" style="text-align:center;">5-way-5-shot</th> <th colspan="1" style="text-align:center;">GoogleDrive</th> <th colspan="1" style="text-align:center;">BaiduCloud</th> <th colspan="1" style="text-align:center;">GoogleDrive</th> <th colspan="1" style="text-align:center;">BaiduCloud</th> </tr> <tr> <td style="text-align:center">ProtoNet</td> <td style="text-align:center;">62.11±0.44</td> <td style="text-align:center;">80.77±0.30</td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/1ngzuWjB4btPzGqIX_dr24iUwa0fDWk6p?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1eQJnkxkH0HgB1cBiU4kjuA?pwd=an94">Download</a></td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/1GaBoQh4i9kF13jEXRwORmpXTpsDlcVOE?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/16EV3jsOsEnTdl3DYtLCaMw?pwd=sw8j">Download</a></td> </tr> <tr> <td style="text-align:center">Good-Embed</td> <td style="text-align:center;">64.98±0.44</td> <td style="text-align:center;">82.10±0.30</td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/1C2uIs1t_QJBcol2TKjlwTPAr78IfRZRn?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1z_YCzvhHMLzGPVkxWGqWoA?pwd=8cyz">Download</a></td> <td colspan="2" style="text-align:center;">N/A</td> </tr> <tr> <td style="text-align:center">Meta DeepBDC</td> <td style="text-align:center;">67.34±0.43</td> <td style="text-align:center;">84.46±0.28</td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/10Ej_xZeO_M-aMQkKpYcawAQ0BKV3b8SU?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1encoBx8lJrRTkptBc4O3XQ?pwd=3ee0">Download</a></td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/14_2dqvGSPeQ9sqLjXWpi58YVwfWqufMq?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1vuh08yt02CX2TXnV332frA?pwd=abzh">Download</a></td> </tr> <tr> <td style="text-align:center">STL DeepBDC</td> <td style="text-align:center;">67.83±0.43</td> <td style="text-align:center;">85.45±0.29</td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/1mxacPRdvNayZDrhyprrgOWwyRXRdhdu1?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/10Ft7xvbQQCII3OsL0jFkyw?pwd=ls0a">Download</a></td> <td colspan="2" style="text-align:center;">N/A</td> </tr> </table>Note that for Good-Embed and STL DeepBDC, a sequential self-distillation technique is used to obtain the pre-trained models; See the paper of Good-Embed for details.
CUB
We followed CloserLookFewShot for data preprocessing.
<table> <tr> <th rowspan="2" style="text-align:center;">Method</th> <th colspan="2" style="text-align:center;">ResNet-18</th> <th colspan="2" style="text-align:center;">Pre-trained models</th> <th colspan="2" style="text-align:center;">Meta-trained models</th> </tr> <tr> <th colspan="1" style="text-align:center;">5-way-1-shot</th> <th colspan="1" style="text-align:center;">5-way-5-shot</th> <th colspan="1" style="text-align:center;">GoogleDrive</th> <th colspan="1" style="text-align:center;">BaiduCloud</th> <th colspan="1" style="text-align:center;">GoogleDrive</th> <th colspan="1" style="text-align:center;">BaiduCloud</th> </tr> <tr> <td style="text-align:center">ProtoNet</td> <td style="text-align:center;">80.90±0.43</td> <td style="text-align:center;">89.81±0.23</td> <td style="text-align:center;"><a href="https://drive.google.com/file/d/1rpNH9iAI10KEGacLn55gY3PYoBFDg94W/view?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1CMfyUtpkkTTDF4kT5lJ7Hw?pwd=1din">Download</a></td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/1xPYoVtv0sPa1QY2eq9bmJNGOhqRSMHGD?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/19Y8IaoWNHGgYcjrLd9Mvsw?pwd=cl0t">Download</a></td> </tr> <tr> <td style="text-align:center">Good-Embed</td> <td style="text-align:center;">77.92±0.46</td> <td style="text-align:center;">89.94±0.26</td> <td style="text-align:center;"><a href="https://drive.google.com/file/d/15Cd-bodJUQHH7rB3x4JohA723lN9CuDq/view?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1y85JKAaQaEE8sPTQ6RNqEw?pwd=00qf">Download</a></td> <td colspan="2" style="text-align:center;">N/A</td> </tr> <tr> <td style="text-align:center">Meta DeepBDC</td> <td style="text-align:center;">83.55±0.40</td> <td style="text-align:center;">93.82±0.17</td> <td style="text-align:center;"><a href="https://drive.google.com/file/d/15rXrL2DLw0d5nO2CfGOLNbqYaKLcasAG/view?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1FKRt65qaM2JlTBfsG9l64w?pwd=e3cw">Download</a></td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/1jnK0O4BNfrZnZl9CG3nFeJfJZHJdqVeH?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1yoqXwsqU4T2DK-MgJ_z1JQ?pwd=xro5">Download</a></td> </tr> <tr> <td style="text-align:center">STL DeepBDC</td> <td style="text-align:center;">84.01±0.42</td> <td style="text-align:center;">94.02±0.24</td> <td style="text-align:center;"><a href="https://drive.google.com/file/d/1ZN6DXGPREHNIQJSGeLdfQUWJb90Grjck/view?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/15dXPW3pcs2BaaC0fBtgrTA?pwd=18nw">Download</a></td> <td colspan="2" style="text-align:center;">N/A</td> </tr> </table>Note that for Good-Embed and STL DeepBDC, a sequential self-distillation technique is used to obtain the pre-trained models; See the paper of Good-Embed for details.
tieredImageNet
We followed DeepEMD for data preprocessing.
<table> <tr> <th rowspan="2" style="text-align:center;">Method</th> <th colspan="2" style="text-align:center;">ResNet-12</th> <th colspan="2" style="text-align:center;">Pre-trained models</th> <th colspan="2" style="text-align:center;">Meta-trained models</th> </tr> <tr> <th colspan="1" style="text-align:center;">5-way-1-shot</th> <th colspan="1" style="text-align:center;">5-way-5-shot</th> <th colspan="1" style="text-align:center;">GoogleDrive</th> <th colspan="1" style="text-align:center;">BaiduCloud</th> <th colspan="1" style="text-align:center;">GoogleDrive</th> <th colspan="1" style="text-align:center;">BaiduCloud</th> </tr> <tr> <td style="text-align:center">ProtoNet</td> <td style="text-align:center;">68.31±0.51</td> <td style="text-align:center;">83.85±0.36</td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/15AVdTLq0QyaJF7ccuvQUGK6TXtXuQybW?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1O8Tt-bHJiomz7n4gm2G7gA?pwd=j90s">Download</a></td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/1Q8r0ub2uO0su34daLmBBCkqntfyv0blR?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1BIASKmHXgjwGIqOXGnDELA?pwd=atuw">Download</a></td> </tr> <tr> <td style="text-align:center">Good-Embed</td> <td style="text-align:center;">71.52±0.69</td> <td style="text-align:center;">86.03±0.58</td> <td style="text-align:center;">N/A</td> <td style="text-align:center;">N/A</td> <td colspan="2" style="text-align:center;">N/A</td> </tr> <tr> <td style="text-align:center">Meta DeepBDC</td> <td style="text-align:center;">72.34±0.49</td> <td style="text-align:center;">87.31±0.32</td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/1V7jvTOktv_0LnfDSCawdCFOBSfx5Ttlv?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1zfIkOc2513cv56bksJddIA?pwd=kyil">Download</a></td> <td style="text-align:center;"><a href="https://drive.google.com/drive/folders/1WbQEeM57PORiG5Y2XyhJA6VWm3LDZ0yb?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1LJ9tER3tf5jzj5fHHTeCgw?pwd=offl">Download</a></td> </tr> <tr> <td style="text-align:center">STL DeepBDC</td> <td style="text-align:center;">73.82±0.47</td> <td style="text-align:center;">89.00±0.30</td> <td style="text-align:center;"><a href="https://drive.google.com/file/d/1OWPJaoXC7UpJRsfNb3pNXjbVYAGt7_X4/view?usp=sharing">Download</a></td> <td style="text-align:center;"><a href="https://pan.baidu.com/s/1D-HwS9IM6b1HuE2aLk1n4A?pwd=0o7n">Download</a></td> <td colspan="2" style="text-align:center;">N/A</td> </tr> </table>Note that for Good-Embed and STL DeepBDC, a sequential self-distillation technique is used to obtain the pre-trained models; See the paper of Good-Embed for details.
References
[BDC] G. J. Szekely and M. L. Rizzo. Brownian distance covariance. Annals of Applied Statistics, 3:1236–1265, 2009.<br> [ProtoNet] Jake Snell, Kevin Swersky, and Richard Zemel. Prototypical networks for few-shot learning. In NIPS, 2017.<br> [Good-Embed] Y. Tian, Y. Wang, D. Krishnan, J. B. Tenenbaum, and P. Isola. Rethinking few-shot image classification: a good embedding is all you need? In ECCV, 2020.<br>
Implementation details
Datasets
- miniImageNet: Download Link: [BaiduCloud] [GoogleDrive].
- CUB: Download Link: [BaiduCloud] [GoogleDrive].
- tieredImageNet: Download Link: [BaiduCloud]
Implementation environment
Note that the test accuracy may slightly vary with different Pytorch/CUDA versions, GPUs, etc. <br>
- Linux
- Python 3.8.3
- torch 1.7.1
- GPU (RTX3090) + CUDA11.0 CuDNN
- sklearn1.0.1, pillow8.0.0, numpy1.19.2
Installation
- Clone this repo:
git clone https://github.com/Fei-Long121/DeepBDC.git
cd DeepBDC
For Meta DeepBDC on general object recognition
cd scripts/mini_magenet/run_meta_deepbdc
- modify the dataset path in
run_pretrain.sh
,run_metatrain.sh
andrun_test.sh
bash run.sh
For STL DeepBDC on general object recognition
cd scripts/mini_imagenet/run_stl_deepbdc
- modify the dataset path in
run_pretrain.sh
,run_distillation.sh
andrun_test.sh
bash run.sh
Acknowledgments
Our code builds upon the the following code publicly available:
Contact
If you have any questions or suggestions, please contact us:
Fei Long(longfei121@mail.dlut.edu.cn)
<br>
Jiaming Lv(ljm_vlg@mail.dlut.edu.cn)