Home

Awesome

Not All Poisons are Created Equal: Robust Training against Data Poisoning (ICML 2022)

[Paper]

Overview

EPIC (Effective Poison Identification) is an efficient defense mechanism that significantly reduces the attack success rate of various data poisoning attacks by iteratively find and drop the isolated points in low-density gradient regions.

Updates

[Aug 6th, 2022] We released the code of EPIC.

[July 20th, 2022] We presented our paper at ICML 2022.

Install requirements

pip install -r requirements.txt

Prepare the Data

Precomputed poisoned datasets that are publicly available can be downloaded with the links below.

Usage

See examples.sh for example usages.

Acknowledgements

Some code in this repo comes from the following repositories:

We thank these authors for making their code open-source.

Citation

Please cite our paper if you find the results or our code useful. :beers:

@inproceedings{yang2022not,
  title={Not All Poisons are Created Equal: Robust Training against Data Poisoning},
  author={Yang, Yu and Liu, Tian Yu and Mirzasoleiman, Baharan},
  booktitle={International Conference on Machine Learning},
  pages={25154--25165},
  year={2022},
  organization={PMLR}
}