<p align="center"> <img src="./resources/logo.png" width="800px"> </p> <p align="center"> <a href="#-introduction">Introduction</a> • <a href="#-methods-reproduced">Methods Reproduced</a> • <a href="#-reproduced-results">Reproduced Results</a> <br /> <a href="#%EF%B8%8F-how-to-use">How to Use</a> • <a href="#-acknowledgments">Acknowledgments</a> • <a href="#-contact">Contact</a> </p><p align="center"> <a href=""><img src="https://img.shields.io/badge/PILOT-v1.0-darkcyan"></a> <a href='https://arxiv.org/abs/2309.07117'><img src='https://img.shields.io/badge/Arxiv-2309.07117-b31b1b.svg?logo=arXiv'></a> <a href=""><img src="https://img.shields.io/github/stars/sun-hailong/LAMDA-PILOT?color=4fb5ee"></a> <a href=""><img src="https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https%3A%2F%2Fgithub.com%2Fsun-hailong%2FLAMDA-PILOT&count_bg=%23FFA500&title_bg=%23555555&icon=&icon_color=%23E7E7E7&title=visitors&edge_flat=false"></a> <a href=""><img src="https://black.readthedocs.io/en/stable/_static/license.svg"></a> <a href=""><img src="https://img.shields.io/github/last-commit/sun-hailong/LAMDA-PILOT?color=blue"></a> </p>
Introduction
Welcome to PILOT, a pre-trained model-based continual learning toolbox <a href="https://arxiv.org/abs/2309.07117">[Paper]</a>. On the one hand, PILOT implements state-of-the-art class-incremental learning algorithms based on pre-trained models, such as L2P, DualPrompt, and CODA-Prompt. On the other hand, PILOT also adapts typical class-incremental learning algorithms (e.g., FOSTER and MEMO) to the pre-trained model setting in order to evaluate their effectiveness.
If you use any content of this repo for your work, please cite the following bib entries:
@article{sun2023pilot,
title={PILOT: A Pre-Trained Model-Based Continual Learning Toolbox},
author={Sun, Hai-Long and Zhou, Da-Wei and Ye, Han-Jia and Zhan, De-Chuan},
journal={arXiv preprint arXiv:2309.07117},
year={2023}
}
@inproceedings{zhou2024continual,
title={Continual learning with pre-trained models: A survey},
author={Zhou, Da-Wei and Sun, Hai-Long and Ning, Jingyi and Ye, Han-Jia and Zhan, De-Chuan},
booktitle={IJCAI},
pages={8363-8371},
year={2024}
}
@article{zhou2024class,
author = {Zhou, Da-Wei and Wang, Qi-Wei and Qi, Zhi-Hong and Ye, Han-Jia and Zhan, De-Chuan and Liu, Ziwei},
title = {Class-Incremental Learning: A Survey},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
volume={46},
number={12},
pages={9851--9873},
year = {2024}
}
What's New
- [2024-10] Check out our latest work on pre-trained model-based domain-incremental learning!
- [2024-08] Check out our latest work on pre-trained model-based class-incremental learning (IJCV 2024)!
- [2024-07] Check out our rigorous and unified survey about class-incremental learning, which introduces some memory-agnostic measures with holistic evaluations from multiple aspects (TPAMI 2024)!
- [2024-07] Check out our work about all-layer margin in class-incremental learning (ICML 2024)!
- [2024-04] Check out our latest survey on pre-trained model-based continual learning (IJCAI 2024)!
- [2024-03] Add EASE. State-of-the-art method of 2024!
- [2024-03] Check out our latest work on pre-trained model-based class-incremental learning (CVPR 2024)!
- [2023-12] Add RanPAC.
- [2023-09] Initial version of PILOT is released.
- [2023-05] Check out our recent work about class-incremental learning with vision-language models!
- [2023-01] As team members are committed to other projects and in light of the intense demands of code reviews, we will prioritize reviewing algorithms that have explicitly cited and implemented methods from our toolbox paper in their publications. Please read the PR policy before submitting your code.
Methods Reproduced
- FineTune: Baseline method which simply updates parameters on new tasks.
- iCaRL: iCaRL: Incremental Classifier and Representation Learning. CVPR 2017 [paper]
- Coil: Co-Transport for Class-Incremental Learning. ACMMM 2021 [paper]
- DER: DER: Dynamically Expandable Representation for Class Incremental Learning. CVPR 2021 [paper]
- FOSTER: Feature Boosting and Compression for Class-incremental Learning. ECCV 2022 [paper]
- MEMO: A Model or 603 Exemplars: Towards Memory-Efficient Class-Incremental Learning. ICLR 2023 Spotlight [paper]
- L2P: Learning to Prompt for Continual Learning. CVPR 2022 [paper]
- DualPrompt: DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning. ECCV 2022 [paper]
- CODA-Prompt: CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning. CVPR 2023 [paper]
- RanPAC: RanPAC: Random Projections and Pre-trained Models for Continual Learning. NeurIPS 2023 [paper]
- LAE: A Unified Continual Learning Framework with General Parameter-Efficient Tuning. ICCV 2023 [paper]
- SLCA: SLCA: Slow Learner with Classifier Alignment for Continual Learning on a Pre-trained Model. ICCV 2023 [paper]
- FeCAM: FeCAM: Exploiting the Heterogeneity of Class Distributions in Exemplar-Free Continual Learning. NeurIPS 2023 [paper]
- DGR: Gradient Reweighting: Towards Imbalanced Class-Incremental Learning. CVPR 2024 [paper]
- Ease: Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning. CVPR 2024 [paper]
- SimpleCIL: Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need. IJCV 2024 [paper]
- Aper: Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need. IJCV 2024 [paper]
Reproduced Results
CIFAR-100
<div align="center"> <img src="./resources/cifarb0inc10.jpg" width="600px"> </div>
ImageNet-R
<div align="center"> <img src="./resources/imagenetRb0inc20.jpg" width="600px"> </div>
For exemplar parameters, Coil, DER, iCaRL, MEMO, and FOSTER set the fixed_memory option to false and use a memory_size of 2000 for CIFAR100, while they set the fixed_memory option to true and use a memory_per_class of 20 for ImageNet-R. The other models are exemplar-free.
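To make the two exemplar settings concrete, here is a minimal sketch of the per-class budget they imply. It follows the rule stated for memory_size in the hyper-parameters list (the total budget divided by the number of classes seen so far, rounded down). The helper name `exemplars_per_class` is hypothetical and is not part of the toolbox.

```python
def exemplars_per_class(num_seen_classes, fixed_memory, memory_size=2000, memory_per_class=20):
    """Return how many exemplars are kept for each class seen so far.

    Hypothetical helper for illustration; it mirrors the README's description,
    not the toolbox's actual implementation.
    """
    if num_seen_classes <= 0:
        raise ValueError("num_seen_classes must be positive")
    if fixed_memory:
        # Fixed budget per class: the total buffer grows with the number of classes.
        return memory_per_class
    # Fixed total budget: split it evenly over the seen classes, rounding down.
    return memory_size // num_seen_classes


# CIFAR100 setting from the text: fixed_memory = false, memory_size = 2000.
for seen in (10, 50, 100):
    print(seen, exemplars_per_class(seen, fixed_memory=False, memory_size=2000))

# ImageNet-R setting from the text: fixed_memory = true, memory_per_class = 20.
print(exemplars_per_class(200, fixed_memory=True, memory_per_class=20))
```

With a fixed total of 2000, the per-class share shrinks from 200 exemplars at 10 classes to 20 exemplars at 100 classes, whereas the fixed per-class setting always keeps 20.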
How to Use
Clone
Clone this GitHub repository:
git clone https://github.com/sun-hailong/LAMDA-PILOT
cd LAMDA-PILOT
Dependencies
Run experiment
- Edit the [MODEL NAME].json file for global settings and hyperparameters.
- Run:
  python main.py --config=./exps/[MODEL NAME].json
- hyper-parameters
  When using PILOT, you can edit the global parameters and algorithm-specific hyper-parameters in the corresponding json file.
  These parameters include:
  - model_name: The model's name should be selected from the 11 methods listed above, i.e., finetune, icarl, coil, der, foster, memo, simplecil, l2p, dualprompt, coda-prompt, and adam.
  - init_cls: The number of classes in the initial incremental stage. As the configuration of CIL includes different settings with varying class numbers at the outset, our framework accommodates diverse options for defining the initial stage.
  - increment: The number of classes in each incremental stage $i$, $i > 1$. By default, the number of classes is equal across all incremental stages.
  - backbone_type: The backbone network of the incremental model. It can be selected from a variety of pre-trained models available in the Timm library, such as ViT-B/16-IN1K and ViT-B/16-IN21K. Both are pre-trained on ImageNet21K, while the former is additionally fine-tuned on ImageNet1K.
  - seed: The random seed used for shuffling the class order. It is set to 1993 by default, following the benchmark setting of iCaRL.
  - fixed_memory: A Boolean parameter. When set to true, the model keeps a fixed number of exemplars per class. When set to false, the per-class memory is allocated dynamically from a fixed total budget (see memory_size).
  - memory_size: The total number of exemplars in the incremental learning process. If fixed_memory is set to false, assuming there are $K$ classes at the current stage, the model will preserve $\left\lfloor \frac{\text{memory\_size}}{K} \right\rfloor$ exemplars for each class. L2P, DualPrompt, SimpleCIL, ADAM, and CODA-Prompt do not require exemplars, so the exemplar-related parameters are not used for them.
  - memory_per_class: If fixed_memory is set to true, the model will preserve a fixed number of memory_per_class exemplars for each class.
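As a rough illustration of how these settings fit together, the sketch below writes a config fragment containing only the keys described above and derives the number of classes per stage from init_cls and increment. This rests on several assumptions: the values are illustrative, the backbone_type value is left as a placeholder, the shipped files in exps/ may contain additional keys or different value formats, and `stage_sizes` is a hypothetical helper rather than a function of the toolbox.

```python
import json
import tempfile
from pathlib import Path


def stage_sizes(total_classes, init_cls, increment):
    """Split total_classes into an initial stage plus equal-sized later stages."""
    if not 0 < init_cls <= total_classes:
        raise ValueError("init_cls must be in (0, total_classes]")
    if increment <= 0:
        raise ValueError("increment must be positive")
    sizes = [init_cls]
    remaining = total_classes - init_cls
    while remaining > 0:
        # Assumption: a final stage may be smaller if the classes do not divide evenly.
        step = min(increment, remaining)
        sizes.append(step)
        remaining -= step
    return sizes


# Illustrative fragment only; check the JSON files shipped in exps/ for the
# full set of keys and the exact value formats the toolbox expects.
config = {
    "model_name": "simplecil",
    "init_cls": 10,
    "increment": 10,
    "backbone_type": "[BACKBONE NAME]",  # placeholder, not a real backbone name
    "seed": 1993,
    "fixed_memory": False,
    "memory_size": 2000,
    "memory_per_class": 20,
}

# Round-trip through a temporary file to show the JSON form of the fragment.
with tempfile.TemporaryDirectory() as tmp:
    config_path = Path(tmp) / (config["model_name"] + ".json")
    config_path.write_text(json.dumps(config, indent=4))
    loaded = json.loads(config_path.read_text())

# CIFAR100 has 100 classes, so init_cls = 10 and increment = 10 give 10 stages.
print(stage_sizes(100, loaded["init_cls"], loaded["increment"]))
print("python main.py --config=./exps/" + loaded["model_name"] + ".json")
```

Setting init_cls to 50 with the same increment would instead give one large initial stage followed by five stages of 10 classes.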
Datasets
We provide pre-processed versions of the following datasets:
- CIFAR100: will be automatically downloaded by the code.
- CUB200: Google Drive: link or Onedrive: link
- ImageNet-R: Google Drive: link or Onedrive: link
- ImageNet-A: Google Drive: link or Onedrive: link
- OmniBenchmark: Google Drive: link or Onedrive: link
- VTAB: Google Drive: link or Onedrive: link
- ObjectNet: Onedrive: link
You can also refer to the filelist if the file is too large to download.
These subsets are sampled from the original datasets. Please note that I do not have the right to distribute these datasets. If the distribution violates the license, I shall provide the filenames instead.
When training on a dataset other than CIFAR100, you should specify the folder of your dataset in utils/data.py.
def download_data(self):
    # Placeholder guard: remove this assert once the paths below point to your dataset.
    assert 0, "You should specify the folder of your dataset"
    train_dir = '[DATA-PATH]/train/'
    test_dir = '[DATA-PATH]/val/'
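Before editing the paths, it can help to confirm that the dataset folder has the train/ and val/ subfolders that the snippet above expects. The following is a minimal sketch using only the standard library; `check_dataset_layout` is a hypothetical helper and not part of utils/data.py, and the temporary directory merely stands in for [DATA-PATH].

```python
import tempfile
from pathlib import Path


def check_dataset_layout(data_path):
    """Return (train_dir, test_dir) if both exist under data_path, else raise."""
    root = Path(data_path)
    train_dir = root / "train"
    test_dir = root / "val"
    missing = [str(d) for d in (train_dir, test_dir) if not d.is_dir()]
    if missing:
        raise FileNotFoundError("Missing dataset folder(s): " + ", ".join(missing))
    return train_dir, test_dir


# Demonstration on a throwaway directory standing in for [DATA-PATH].
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "train").mkdir()
    (Path(tmp) / "val").mkdir()
    train_dir, test_dir = check_dataset_layout(tmp)
    print(train_dir.name, test_dir.name)
```

If either subfolder is missing, the helper raises FileNotFoundError and names the missing path, which is easier to diagnose than a failure later in data loading.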
Acknowledgments
We thank the following repos for providing helpful components/functions in our work.
Contact
If you have any questions, please feel free to open an issue, propose new features, or contact the authors: Hai-Long Sun (sunhl@lamda.nju.edu.cn) and Da-Wei Zhou (zhoudw@lamda.nju.edu.cn). Enjoy the code.