emmorph2ud
The script converts each output tag of the emMorph morphological analyzer to the corresponding output tag of magyarlanc 3.0.
What's in this repo?
- the main script of the converter: __main__.py (run it with python3 -m emmorph2ud)
- auxiliary files in the folder converterdata
- license
- this readme
The tagsets :hungary:
A detailed description of the tagsets is available here.
emMorph
emMorph is the current morphological analyzer for Hungarian and it is integrated into the e-magyar language processing toolchain. The list of emMorph tags is from here.
UD
The UD tagset follows Universal Dependencies, a framework for cross-linguistically consistent grammatical annotation. It is the output tagset of the third version of magyarlanc, a toolkit for linguistic processing of Hungarian texts. It is also the annotation scheme of UD Hungarian Szeged, a version of the largest fully manually annotated corpus of Hungarian.
How to use the converter?
- standard input: token, lemma, and emMorph tag, separated by tabs
- standard output: UD tag
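The input format above can be prepared and piped to the converter from Python. The following is only a minimal sketch, not part of the repository: the helper names format_rows and run_converter are hypothetical, the sample triple is illustrative, and it assumes one token per line and that the emmorph2ud package is importable from the working directory.

```python
import subprocess
import sys


def format_rows(rows):
    """Join (token, lemma, emMorph tag) triples into tab-separated lines.

    Assumption: the converter reads one token per line.
    """
    return "".join("\t".join(row) + "\n" for row in rows)


def run_converter(rows):
    """Pipe the rows through `python3 -m emmorph2ud` and return stdout lines.

    Assumption: the emmorph2ud package is importable from the current
    working directory (for example, when run from the parent directory
    of the repository checkout).
    """
    result = subprocess.run(
        [sys.executable, "-m", "emmorph2ud"],
        input=format_rows(rows),
        capture_output=True,  # requires Python 3.7+
        text=True,
        check=True,
    )
    return result.stdout.splitlines()


# Example call (illustrative input; not executed here):
# run_converter([("alma", "alma", "[/N][Nom]")])
```

Keeping the formatting step separate from the subprocess call makes it easy to inspect the exact text sent to standard input before running the converter.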
Dependencies
Python 3
License
GNU General Public License v3.0