Awesome
Bias in NLP
This is a collection of natural language processing papers that deal with bias (mostly gender bias). The list is by no means complete and is just a way to keep up with the large amount of papers in that area. If you miss a paper, please add it.
Papers
Towards Detection of Subjective Bias using Contextualized Word Embeddings
WebConf2020 - Paper, Code
Note: Wikineutrality Corpus.
Joint Multiclass Debiasing of Word Embeddings
ISMIS2020 - Paper, Code
Note: Hard and Soft WEAT
Towards Debiasing Sentence Representations
ACL2020 - Paper, Code
Note: Sentence-level debiasing. Difference between pretraining and finetuning.
Neutralizing Gender Bias in Word Embedding with Latent Disentanglement and Counterfactual Generation
arxiv2020 - Paper
Note: Counterfactual generation.
Unsupervised Discovery of Implicit Gender Bias
arxiv2020 - Paper, Code
Note: Unsupervised bias detection from comments.
StereoSet: Measuring stereotypical bias in pretrained language models
arxiv2020 - Paper, Code
Note: Benchmark and Dataset for measuring bias in 4 domains (gender, profession, race, religion).
Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation
ACL2020 - Paper, Code
Note: Double Hard Debias: mitigigate dataset and then do debiasing
Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer
ACL2020 - Paper
Note: Bias in multilingual embeddings depends on the alignment direction.
Scalable Cross Lingual Pivots to Model Pronoun Gender for Translation
arxiv2020 - Paper
Note: Gender labels for pronouns in MT English-Spanish.
Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases
arxiv2020 - Paper
Note: CEAT
OSCaR: Orthogonal Subspace Correction and Rectification of Biases in Word Embeddings
arxiv2020 - Paper
Note: Preserve semantic meaning of embeddings.
Investigating Gender Bias in BERT
arxiv2020 - Paper
Note: Identify one gender direction per BERT layer.
Type B Reflexivization as an Unambiguous Testbed for Multilingual Multi-Task Gender Bias
arxiv2020 - Paper, Code
Note: Multilingual multitask dataset across 4 languages.
Towards Debiasing NLU Models from Unknown Biases
arxiv2020 - Paper, Code
Note: Unsupervised bias detection.
Robustness and Reliability of Gender Bias Assessment in Word Embeddings: The Role of Base Pairs
arxiv2020 - Paper, Code
Note: Choice of base pairs is relevant.
LOGAN: Local Group Bias Detection by Clustering
arxiv2020 - Paper
Note: Identify biases through clustering.
Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation
arxiv2020 - Paper
Note: Verify whether non-linear debiasing helps. It seems not.
Unmasking Contextual Stereotypes: Measuring and Mitigating BERT’s Gender Bias
GeBNLP2020 - Paper, Code
Note: Verify gender debiasing techniques in German.
Language (Technology) is Power: A Critical Survey of “Bias” in NLP
arxiv2020 - Paper
Note: Metastudy: survey of 146 gender bias papers
Pick a Fight or Bite your Tongue: Investigation of Gender Differences in Idiomatic Language Usage
arxiv2020 - Paper
Note: Idiomatic expressions depending on the speaker.
Evaluating Bias In Dutch Word Embeddings
GeBNLP2020 - Paper, Code
Note: Examining bias in Dutch (using WEAT)
Analyzing Gender Bias within Narrative Tropes
arxiv2020 - Paper, Code
Note: Analyze bias using tropes
Neural Machine Translation Doesn’t Translate Gender Coreference Right Unless You Make It
GeBNLP2020 - Paper, Code
Note: Incorporate explicit word-level gender tags.
The Gap on GAP: Tackling the Problem of Differing Data Distributions in Bias-Measuring Datasets
NeurIPS 2020 - Paper, Code
Note: Distances in GAP play a role.
AraWEAT: Multidimensional Analysis of Biases in Arabic Word Embeddings
arxiv2020 - Paper, Code
Note: Arabic WEAT.
Characterising Bias in Compressed Models
arxiv2020 - Paper
Note: Bias in compressed model is large. Provide method to identify biased examples.
Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them
NAACL2019 - Paper, Code
Note: Debiasing by setting dimensions to zero ist not effective
Equalizing Gender Bias in Neural Machine Translation with Word Embeddings Techniques
GeBNLP 2019 - Paper
Note: Spanisch-Englisch translation with occupations.
Evaluating the Underlying Gender Bias in Contextualized Word Embeddings
GeBNLP 2019 - Paper
Note: Cointextualized embeddings are less biased than static ones.
Mitigating Gender Bias in Natural Language Processing: Literature Review
ACL2019 - Paper
Note: Survey
What's in a Name? Reducing Bias in Bios without Access to Protected Attributes
NAACL2019 - Paper
Note: Work on biographies.
Assessing Social and Intersectional Biases in Contextualized Word Representations
NeurIPS2019 - Paper
Note: Strong bias in contextualized embeddings. Bias not always visible on sentence level.
It’s All in the Name: Mitigating Gender Bias with Name-Based Counterfactual Data Substitution
EMNLP2019 - Paper
Note: Counterfactual Data Substitution (CDS)
Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in Sentiment Analysis
GeBNLP 2019 - Paper, Code
Note: Dataset of 800 sentences analysed with sentiment analysis.
Automatic Gender Identification and Reinflection in Arabic
GeBNLP 2019 - Paper
Note: Arabic English Translation with focus on getting the pronouns right.
Gendered Ambiguous Pronouns Shared Task: Boosting Model Confidence by Evidence Pooling
GeBNLP 2019 - Paper, Code
Note: Shared task winner GAP
Gendered Ambiguous Pronouns (GAP) Shared Task at the Gender Bias in NLP Workshop 2019
GeBNLP 2019 - Paper, Code
Note: GAP shared task description
Conceptor Debiasing of Word Representations Evaluated on WEAT
GeBNLP 2019 - Paper
Note: Proposes Conceptor Debiasing.
On Measuring Gender Bias in Translation of Gender-neutral Pronouns
GeBNLP 2019 - Paper, Code
Note: Gender bias in pronoun translation Korean-English
Measuring Gender Bias in Word Embeddings across Domains and Discovering New Gender Bias Word Categories
GeBNLP 2019 - Paper, Code
Note: Clustering method for discovering new biases.
The Role of Protected Class Word Lists in Bias Identification of Contextualized Word Representations
GeBNLP 2019 - Paper
Note: Uses conceptor debiasing
The Woman Worked as a Babysitter: On Biases in Language Generation
EMNLP2019 - Paper, Code
Note: Regard and Sentiment. Annotations released.
Exploring Human Gender Stereotypes with Word Association Test
EMNLP2019 - Paper, Code
Note: Word association graphs
Gender-preserving Debiasing for Pre-trained Word Embeddings
ACL2019 - Paper, Code
Note: Differentiate between bias and gender information.
Quantifying Social Biases in Contextual Word Representations
GeBNLP 2019 - Paper
Note: Template based method to quantify bias.
Bias in Bios: A Case Study of Semantic Representation Bias in a High-Stakes Setting
ACM Fat 2019 - Paper
Note: Analyze effects of bias.
Gender Bias in Neural Natural Language Processing
Logic, Language, and Security. Springer. 2018 - Paper
Note: Counterfactual Data Augmentation (CDA). Clear definition of Bias. Evaluates on coreference resolution and language modelling.
Gender Bias in Coreference Resolution
NAACL2018 - Paper, Code
Note: Windogender schemes.
Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings
NeurIPS2016 - Paper, Code
Note: Among the first to address gender bias
Rejecting the Gender Binary: A Vector-Space Operation.
2015 - Paper
Note: Blog post: first to propose to remove gender dimension
TODOS
- add https://arxiv.org/pdf/2011.12086.pdf
- add https://arxiv.org/pdf/2011.12096.pdf
- add papers from GeBNLP2020 once they are available.