ipa-dict-dsl - IPA Dictionaries in DSL format

This repository provides a collection of IPA dictionaries in DSL format, derived from the IPA Dictionary project. With them you can look up the IPA pronunciation of words in a variety of languages using your favourite dictionary program.

Format

ABBYY Lingvo DSL is a flexible dictionary format that can be read by dictionary applications such as GoldenDict and converted to other formats using tools such as pyglossary. The dsl-tools project also provides a number of tools for creating DSL format dictionaries.

DSL files must be saved as UTF-16 to be usable by dictionary programs. The raw source files in this repository are saved as UTF-8, which makes them significantly smaller and also readable (and diffable) by git. Fully encoded and compressed .dsl.dz dictionaries, ready for use, are available in the Releases section.

You can also use the rezip_dsl.rb and unzip_dsl.rb scripts provided by the dsl-tools repo to encode/compress and decode/decompress the dictionaries yourself, either individually or as a group.
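
As a rough illustration of the re-encoding step only, the following Python sketch converts UTF-8 DSL text to UTF-16. It is not part of this repository or of dsl-tools: the function names are hypothetical, and the choice of little-endian UTF-16 with a byte-order mark is an assumption about what your dictionary program expects. It does not perform the .dz compression.

```python
from pathlib import Path

# Byte-order mark for little-endian UTF-16 (assumed target encoding).
UTF16_LE_BOM = b"\xff\xfe"


def utf8_to_utf16(data: bytes) -> bytes:
    """Re-encode UTF-8 bytes as UTF-16 LE with a leading byte-order mark."""
    # "utf-8-sig" also accepts input that already starts with a UTF-8 BOM.
    text = data.decode("utf-8-sig")
    return UTF16_LE_BOM + text.encode("utf-16-le")


def convert_file(src: Path, dst: Path) -> None:
    """Read a UTF-8 .dsl file and write a UTF-16 copy (hypothetical helper)."""
    # Working on raw bytes leaves the file's line endings untouched.
    dst.write_bytes(utf8_to_utf16(src.read_bytes()))
```

For example, calling `convert_file(Path("data/en_US_ipa.dsl"), Path("en_US_ipa.utf16.dsl"))` would write a UTF-16 copy of the English source file next to your working directory; the output filename here is only a placeholder.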

Download

Pre-compiled dictionary files are available for download from the Releases section for immediate use. These can be added directly to any dictionary program that supports DSL files.

Data

The data directory contains the dictionaries listed according to ISO language code.

The basic filename pattern is [ISO]_ipa.dsl, where [ISO] is the ISO code of the source language, in some cases followed by a region or script suffix (for example en_US or zh_hans). All available languages are listed below.
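
To make the naming pattern concrete, here is a minimal Python sketch that splits a dictionary filename into its language code and optional suffix. The function name and the returned tuple shape are hypothetical, chosen for illustration; the sketch simply assumes that everything before `_ipa.dsl` is the code and that the first underscore, if any, separates the language from its region or script.

```python
from typing import Optional, Tuple

SUFFIX = "_ipa.dsl"


def parse_dictionary_filename(name: str) -> Optional[Tuple[str, Optional[str]]]:
    """Split e.g. 'en_US_ipa.dsl' into ('en', 'US'); return None if no match."""
    # Reject names that lack the suffix or consist of the suffix alone.
    if not name.endswith(SUFFIX) or len(name) == len(SUFFIX):
        return None
    code = name[: -len(SUFFIX)]
    # The part after the first underscore (if present) is the region/script.
    language, _, variant = code.partition("_")
    return language, (variant or None)
```

With the filenames listed in this README, `parse_dictionary_filename("zh_hans_ipa.dsl")` yields the pair `("zh", "hans")`, while a name without the suffix yields `None`.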

Available languages

| Filename | Language |
| --- | --- |
| ar_ipa.dsl | Arabic (Modern Standard) |
| de_ipa.dsl | German |
| en_US_ipa.dsl | English (General American) |
| eo_ipa.dsl | Esperanto |
| es_ES_ipa.dsl | Spanish (Spain) |
| es_MX_ipa.dsl | Spanish (Mexico) |
| fa_ipa.dsl | Persian |
| fi_ipa.dsl | Finnish |
| fr_ipa.dsl | French |
| ja_ipa.dsl | Japanese |
| jam_ipa.dsl | Jamaican Creole |
| ma_ipa.dsl | Malay (Malaysian/Indonesian) |
| nb_ipa.dsl | Norwegian (Bokmål) |
| sv_ipa.dsl | Swedish |
| sw_ipa.dsl | Swahili |
| yue_ipa.dsl | Cantonese |
| zh_hans_ipa.dsl | Mandarin (Simplified) |
| zh_hant_ipa.dsl | Mandarin (Traditional) |

Statistics

Dictionary size

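| Language | # of entries |
| --- | --- |
| ar | 857161 |
| es_ES | 595899 |
| es_MX | 595888 |
| de | 278915 |
| fr | 245150 |
| ja | 221421 |
| en_US | 125927 |
| fi | 92837 |
| yue | 57149 |
| sw | 48308 |
| zh_hant | 48241 |
| zh_hans | 44781 |
| ma | 28215 |
| eo | 23517 |
| sv | 21106 |
| nb | 10139 |
| fa | 8090 |
| jam | 1701 |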

License

MIT.