Awesome

Improving Automated Evaluation of Open Domain Dialog via Diverse Reference Augmentation

Code and Data for our Findings of ACL 2021 paper titled 'Improving Automated Evaluation of Open Domain Dialog via Diverse Reference Augmentation. Varun Gangal *, Harsh Jhamtani *, Eduard Hovy, Taylor Berg-Kirkpatrick'

Data

Relevant original and augmented reference files in are present in 'ref_files/' in the required format
Human ratings file: 'human_rating_correlation/mturk_rating_processed_output.csv'. Please consider citing Gupta et al if you use the human ratings file.

Code

Code and script to compute metric correlations with human ratings can be found in 'human_rating_correlation/' directory

Requirements

Python 3.7.5
bert_score (0.3.7)
nlgeval(Accessed: December 2020)
scipy 1.1.0

Citation

@inproceedings{acl2021dialogeval, 
title={Improving Automated Evaluation of Open Domain Dialog via Diverse Reference Augmentation}, 
author={Gangal, Varun and Jhamtani, Harsh and Hovy, Ed and Berg-Kirkpatrick, Taylor}, 
booktitle={Findings of ACL}, 
year={2021} 
}