A Simple and Effective Model for Answering Multi-span Questions

This repository contains the official implementation of the following paper:
Elad Segal, Avia Efrat, Mor Shoham, Amir Globerson, Jonathan Berant. "A Simple and Effective Model for Answering Multi-span Questions". In EMNLP, 2020.

Citation

@inproceedings{Segal2020TASE,
  title={A Simple and Effective Model for Answering Multi-span Questions},
  author={Segal, Elad and Efrat, Avia and Shoham, Mor and Globerson, Amir and Berant, Jonathan},
  booktitle={EMNLP},
  year={2020},
}

DROP Explorer

Use DROP Explorer to better understand DROP, Quoref and the models' predictions.

Usage

Run the commands in this section from the root directory of the repository.
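
A quick way to confirm that the shell is in the repository root is to check for the paths that the commands in this section refer to. The following is a minimal sketch: the helper name `in_repo_root` is hypothetical, and the list of paths (`requirements.txt`, `configs/`, `src/`, `tools/`) is an assumption inferred from the commands in this README, not an official check.

```shell
# Hypothetical helper, not part of the repository.
# Succeeds only if the current directory contains the paths that the
# commands in this README refer to (assumed layout).
in_repo_root() {
  [ -f requirements.txt ] && [ -d configs ] && [ -d src ] && [ -d tools ]
}

# Example use:
#   in_repo_root || echo "Run this from the repository root" >&2
```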

First, install the prerequisites:
pip install -r requirements.txt

Commands

Train a model:

allennlp train configs/[config file] -s [training_directory] --include-package src

Output predictions by a model:

allennlp predict model.tar.gz drop_data/drop_dataset_dev.json --predictor machine-comprehension \
--cuda-device 0 --output-file predictions.jsonl --use-dataset-reader --include-package src \
-o "{'validation_dataset_reader.pickle.action': 'None'}"
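
`allennlp predict` writes one JSON object per line to the file given by `--output-file`, so counting the non-empty lines is a cheap sanity check that the run produced as many predictions as you expect. A minimal sketch, assuming a POSIX shell; the helper name `count_predictions` is hypothetical and not part of this repository.

```shell
# Hypothetical helper, not part of the repository.
# Prints the number of non-empty lines in a JSON Lines file.
count_predictions() {
  # grep -c prints how many lines match; '[^[:space:]]' matches any line
  # with at least one non-whitespace character. Note that grep exits with
  # status 1 when the count is 0, although it still prints "0".
  grep -c '[^[:space:]]' "$1"
}

# Example use:
#   count_predictions predictions.jsonl
```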

Evaluate a model (unofficial evaluation code, faster):

allennlp evaluate model.tar.gz drop_data/drop_dataset_dev.json --cuda-device 0 --output-file eval.json \
--include-package src -o "{'validation_dataset_reader.pickle.action': 'None'}"

Evaluate a model (official evaluation code, slower):

python tools/generate_submission_predictions.py --archive_file model.tar.gz \
--input_file drop_data/drop_dataset_dev.json --cuda-device 0 --output_file predictions.json \
--include-package src

python -m allennlp.tools.drop_eval --gold_path drop_data/drop_dataset_dev.json \
--prediction_path predictions.json --output_path metrics.json
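
The two official-evaluation steps can be chained so that scoring only runs if prediction generation succeeds. The sketch below wraps the two commands above, with their flags unchanged, in a shell function; the function name `run_official_eval` and its positional arguments are hypothetical conveniences, not part of the repository.

```shell
# Hypothetical wrapper, not part of the repository.
# $1: model archive (e.g. model.tar.gz)
# $2: DROP dataset file, used both as input and as gold data
run_official_eval() {
  # Step 1: write predictions.json; stop if this step fails.
  python tools/generate_submission_predictions.py --archive_file "$1" \
    --input_file "$2" --cuda-device 0 --output_file predictions.json \
    --include-package src || return 1

  # Step 2: score predictions.json against the gold answers.
  python -m allennlp.tools.drop_eval --gold_path "$2" \
    --prediction_path predictions.json --output_path metrics.json
}

# Example use:
#   run_official_eval model.tar.gz drop_data/drop_dataset_dev.json
```

Returning early after the first step avoids scoring a stale `predictions.json` left over from a previous run.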

Trained Models

RoBERTa TASE_IO + SSE - Trained on DROP (config)
RoBERTa TASE_IO - Trained on DROP (config)