Home

Awesome

KaWAT: Kata.ai Word Analogy Task

KaWAT contains word analogy task for Indonesian. The raw data is stored under syntax and semantic directory for syntactic and semantic analogy questions respectively. To convert the raw data into Google's Analogy Task format, you must build the dataset by invoking make (make sure to have Python 3.6 in your PATH). The build results will be stored under build directory. Invoke make help to see other available commands.

License

This dataset is licensed under the Creative Commons Attribution-ShareAlike 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-sa/4.0/ or send a letter to Creative Commons, PO Box 1866, Mountain View, CA 94042, USA.

The code is licensed under the Apache License, Version 2.0.

Citation

If you use KaWAT in your work, please cite:

@article{kurniawan2019,
  title={KaWAT: A Word Analogy Task Dataset for Indonesian},
  url={http://arxiv.org/abs/1906.09912},
  journal={arXiv:1906.09912 [cs]},
  author={Kurniawan, Kemal},
  year={2019},
  month={Jun}
}