FrequencyDictionaries

This repo contains frequency dictionaries stored as plain-text files with one word per line.

It is composed of two folders:

freq_dicts_dirty

The files in this folder were obtained from the LuminosoInsight/wordfreq project. The corresponding dictionaries were converted to .txt files with one word per line (most frequent words first), keeping only words longer than 2 characters.

The conversion used the jakm/msgpack-cli tool to turn the .msgpack files into .json files; sed and grep were then used to reduce those to .txt files with one word per line.
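For readers who prefer a script to sed and grep, the second step might look like the Python sketch below. It is not the pipeline actually used here. It assumes the converted .json file is a top-level array whose first element is a header object and whose remaining elements are lists of words, ordered from most to least frequent. The function name json_to_wordlist and the min_length parameter are hypothetical.

```python
import json


def json_to_wordlist(json_text, min_length=3):
    """Flatten a decoded frequency list into words, most frequent first.

    Assumed layout (not confirmed by this repo): a JSON array whose
    non-list entries are headers and whose list entries hold words,
    with earlier lists being more frequent.
    """
    data = json.loads(json_text)
    words = []
    for entry in data:
        if not isinstance(entry, list):
            continue  # skip the header object (assumed layout)
        for word in entry:
            # "longer than 2 characters" means at least 3 characters
            if len(word) >= min_length:
                words.append(word)
    return words


def write_wordlist(words, path):
    """Write one word per line, preserving the frequency order."""
    with open(path, "w", encoding="utf-8") as handle:
        for word in words:
            handle.write(word + "\n")
```

Calling json_to_wordlist on the contents of a converted file and passing the result to write_wordlist would produce a file in the same shape as those in freq_dicts_dirty, provided the assumed layout holds.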

freq_dicts_clean

The files in this folder were derived from the files in the freq_dicts_dirty folder by removing every word that does not appear in the corresponding dictionary from titoBouzout/Dictionaries.
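The filtering step can be sketched in Python as follows. This is an illustration, not the script used to build the folder. It assumes the reference dictionary has already been reduced to a plain list of words and that matching is exact and case-sensitive. If the dictionary files carry affix flags or other markup, those would need to be handled first, which this sketch does not attempt. The function name filter_by_dictionary is hypothetical.

```python
def filter_by_dictionary(freq_words, dictionary_words):
    """Keep only the words found in the reference dictionary.

    The order of freq_words (most frequent first) is preserved.
    Matching is exact and case-sensitive (an assumption).
    """
    allowed = set(dictionary_words)  # set lookup keeps the filter fast
    return [word for word in freq_words if word in allowed]


def read_wordlist(path):
    """Read a one-word-per-line file, skipping blank lines."""
    with open(path, encoding="utf-8") as handle:
        return [line.strip() for line in handle if line.strip()]
```

Using a set for the dictionary keeps each lookup constant-time on average, which matters when both lists contain hundreds of thousands of entries.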