Home

Awesome

AMR parse(r) evaluation experiment

This is code/data for our paper Better Smatch = better parser? AMR evaluation is not so simple anymore

Data

Data for little prince:

Data for AMR3:

Is found in qualityamr3. It follows the same format as little prince data. Note: Release of reference graphs requires LDC license, therefore this repo does not contain qualityamr3/abparser/tgt.test.amr. For constructing the reference AMR3, please see also Notes below.

Run experiments

Please look in this directory:

evaluation-suite

Notes

Citation

If you find the work interesting, consider citing

@inproceedings{opitz-frank-2022-better,
    title = "Better {S}match = Better Parser? {AMR} evaluation is not so simple anymore",
    author = "Opitz, Juri and Frank, Anette",
    booktitle = "Proceedings of the 3rd Workshop on Evaluation and Comparison of NLP Systems",
    month = nov,
    year = "2022",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.eval4nlp-1.4",
    doi = "10.18653/v1/2022.eval4nlp-1.4",
    pages = "32--43",
}