


Code and data accompanying the paper "Measuring Gender Bias in Word Embeddings across Domains and Discovering New Gender Bias Word Categories" by Kaytlin Chaloner and Alfredo Maldonado. The paper will be presented at the 1st ACL Workshop on Gender Bias for Natural Language Processing 2019 in Florence.


If you use this code, data, or our results, we would appreciate it if you could cite the paper as follows:

Kaytlin Chaloner and Alfredo Maldonado (2019). Measuring Gender Bias in Word Embeddings across Domains and Discovering New Gender Bias Word Categories. In Proceedings of the 1st ACL Workshop on Gender Bias for Natural Language Processing. Florence.


address = {Florence},
author = {Chaloner, Kaytlin and Maldonado, Alfredo},
booktitle = {Proceedings of the 1st ACL Workshop on Gender Bias for Natural Language Processing},
title = {{Measuring Gender Bias in Word Embeddings across Domains and Discovering New Gender Bias Word Categories}},
year = {2019}


All code needed to run the experiments (apart from standard installable dependencies) is in the code directory.

Each script is documented with instructions on how to run it. You can also type $ <script_name.py> --help in a Linux terminal to print those instructions.
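As a rough illustration of how a --help flag typically works in a Python script, here is a minimal sketch that assumes the standard-library argparse module is used. That is an assumption, not something confirmed for the scripts in this repository, and the argument names (embeddings, --output) are hypothetical placeholders rather than the options of any real script here.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical options for illustration only; the real scripts define their own.
    parser = argparse.ArgumentParser(
        prog="script_name.py",
        description="Hypothetical example script (not part of this repository).",
    )
    parser.add_argument(
        "embeddings",
        help="path to a word embeddings file (hypothetical argument)",
    )
    parser.add_argument(
        "--output",
        default="results.csv",
        help="where to write results (hypothetical option)",
    )
    return parser


# argparse adds -h/--help automatically; this prints the same text
# that running the script with --help would show.
print(build_parser().format_help())
```

With a parser like this, passing --help on the command line prints the usage text and exits without running anything else.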


This data is located inside the data directory:

The rest of the data is located at our <a target='_blank' href='https://drive.google.com/drive/folders/13HSQXJgCSYCgpf1tV3sC37bivCjySP2Q?usp=sharing'>public Google Drive Folder</a>: