Home

Awesome

KODOLI

KODOLI is a novel KOrean Dataset for Offensive Language Identification.

Warning: it contains highly offensive expressions.

Download

You can download benchmark KODOLI in this repository. Please, follow the data's license.

Dataset Description

Source

source

Statistics

Statistics

Guideline Details

Guideline(ENG.)

[Guideline(KOR.)] Comming Soon

Updates

Citation

@inproceedings{park2023feel,
  title={“Why do I feel offended?”-Korean Dataset for Offensive Language Identification},
  author={Park, San-Hee and Kim, Kang-Min and Lee, O-joun and Kang, Youjin and Lee, Jaewon and Lee, Su-min and Lee, Sangkeun},
  booktitle={Findings of the Association for Computational Linguistics: EACL 2023},
  pages={1112--1123},
  year={2023}
}

Contributors

License

<a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by-sa/4.0/88x31.png" /></a><br />

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.