Awesome
Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them
This project includes the experiments described in the paper:
"Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them", Hila Gonen and Yoav Goldberg, NAACL 2019.
Full reimplementation of the experiments is available in "remaining_bias_2016.ipynb" for Bolukbasi's embeddings, and in "remaining_bias_2018.ipynb" for Zhao's embeddings.
Prerequisites
- Python 2.7
Download embeddings
As a first step, download the nondebiased and debiased embeddings into data/embeddings/ from this folder (8 files):
- orig_w2v: Bolukbasi's embeddings, nodebiased
- hard_debiased_w2v: Bolukbasi's embeddings, debiased
- orig_glove: Zhao's embeddings, nodebiased
- gn_glove: Zhao's embeddings, debiased
These files are the original embeddings but with a preprocessing step (for fast loading, see source/save_embeds.py):
- Embeddings of Bolukbasi et al. (hard_debiased) are taken from nondebiased and debiased.
- Embeddings of Zhao et al. (gn_glove) are taken from nondebiased and debiased.
Cite
If you find this project useful, please cite the paper:
@inproceedings{GONEN19,
title={Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them},
author={Gonen, Hila and Goldberg, Yoav},
booktitle={Proceedings of NAACL-HLT},
year={2019}
}
Contact
If you have any questions or suggestions, please contact Hila Gonen.
License
This project is licensed under Apache License - see the LICENSE file for details.