Bias and Fairness in Large Language Models: A Survey

Isabel O. Gallegos, Ryan A. Rossi, Joe Barrow, Md Mehrab Tanjim, Sungchul Kim, Franck Dernoncourt, Tong Yu, Ruiyi Zhang, and Nesreen K. Ahmed

Pre-print: https://arxiv.org/abs/2309.00770

If you use or discuss our survey in your work, please use the following citation:

@article{gallegos2023bias,
    title={Bias and Fairness in Large Language Models: A Survey},
    author={Gallegos, Isabel O and Rossi, Ryan A and Barrow, Joe and Tanjim, Md Mehrab and Kim, Sungchul and Dernoncourt, Franck and Yu, Tong and Zhang, Ruiyi and Ahmed, Nesreen K},
    journal={arXiv preprint arXiv:2309.00770},
    year={2023}
}

To make bias evaluation datasets easy to use, we compile the publicly available ones and provide access here. Links to the original data sources are listed below. We do not modify any of the datasets, but we do remove unrelated material from the original repositories. Please refer to the original works for more detailed documentation.

| Dataset | Link |
| --- | --- |
| BBQ | https://github.com/nyu-mll/BBQ |
| BEC-Pro | https://github.com/marionbartl/gender-bias-BERT |
| Bias NLI | https://github.com/sunipa/On-Measuring-and-Mitigating-Biased-Inferences-of-Word-Embeddings |
| BOLD | https://github.com/amazon-science/bold |
| BUG | https://github.com/SLAB-NLP/BUG |
| CrowS-Pairs | https://github.com/nyu-mll/crows-pairs/ |
| Equity Evaluation Corpus | http://saifmohammad.com/WebPages/Biases-SA.html |
| GAP | https://github.com/google-research-datasets/gap-coreference |
| Grep-BiasIR | https://github.com/KlaraKrieg/GrepBiasIR |
| HolisticBias | https://github.com/facebookresearch/ResponsibleNLP |
| HONEST | https://github.com/MilaNLProc/honest |
| PANDA | https://github.com/facebookresearch/ResponsibleNLP |
| RealToxicityPrompts | https://toxicdegeneration.allenai.org |
| RedditBias | https://github.com/umanlp/RedditBias |
| StereoSet | https://github.com/McGill-NLP/bias-bench, https://github.com/moinnadeem/stereoset |
| TrustGPT | https://github.com/HowieHwong/TrustGPT |
| UnQover | https://github.com/allenai/unqover |
| WinoBias | https://github.com/uclanlp/corefBias |
| WinoBias+ | https://github.com/vnmssnhv/NeuTralRewriter |
| WinoGender | https://github.com/rudinger/winogender-schemas |
| WinoQueer | https://github.com/katyfelkner/winoqueer |
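
For readers who prefer to fetch a dataset from its original source, the following is a minimal sketch of building a `git clone` command from the links above. It is not part of the survey or of any dataset's tooling: the names `DATASET_SOURCES`, `clone_command`, and `dest_root` are hypothetical, only a few entries of the list are copied in for brevity, and the sketch assumes that every `github.com` link is a cloneable repository while other links must be downloaded manually from their pages.

```python
from urllib.parse import urlparse

# A few source links copied from the list above (subset, for brevity).
DATASET_SOURCES = {
    "BBQ": "https://github.com/nyu-mll/BBQ",
    "CrowS-Pairs": "https://github.com/nyu-mll/crows-pairs/",
    "Equity Evaluation Corpus": "http://saifmohammad.com/WebPages/Biases-SA.html",
    "WinoBias": "https://github.com/uclanlp/corefBias",
}


def clone_command(name, dest_root="data"):
    """Return a `git clone` argument list for a GitHub-hosted dataset, else None.

    Hypothetical helper for illustration; `dest_root` is an assumed local folder.
    """
    url = DATASET_SOURCES[name]
    parsed = urlparse(url)
    if parsed.netloc != "github.com":
        # Not a Git repository: the data must be downloaded from the page itself.
        return None
    repo_url = url.rstrip("/")
    # The last path component is the repository name, e.g. "crows-pairs".
    repo_name = parsed.path.strip("/").split("/")[-1]
    # --depth 1 skips the commit history, which is not needed to read the data.
    return ["git", "clone", "--depth", "1", repo_url, f"{dest_root}/{repo_name}"]
```

The returned list can be passed to `subprocess.run(cmd, check=True)` to perform the clone. Returning the command instead of running it keeps the sketch free of network side effects and easy to inspect first.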