Home

Awesome

CommaQA: Communicating with Agents for QA

CommaQA Dataset is a QA benchmark for learning to communicate with agents. It consists of three datasets capturing three forms of multi-hop reasoning -- explicit(CommaQA-E), implicit(CommaQA-I), and numeric(CommaQA-N).

Paper Link: Semantic Scholar

Citation:

@article{Khot2021LearningTS,
  title={Learning to Solve Complex Tasks by Talking to Agents},
  author={Tushar Khot and Kyle Richardson and Daniel Khashabi and Ashish Sabharwal},
  journal={ArXiv},
  year={2021},
  volume={abs/2110.08542}
}

Table of Contents

Dataset

Download

Download the datasets:

Formats

Each dataset contains three formats:

Additional Datasets

can be used to train a model to generate the question (project_values_flat_unique) [text] Who all acted in the movie #1? (string following QS:) given the previous questions and answers.

Models

We also provide the T5-Large models trained to produce the next sub-question based on the oracle decompositions. These models can be used to perform inference as described [here]((commaqa/inference/README.md) to reproduce the TMN results.

Code

Refer to the individual READMEs in each package for instructions on: