InsuranceQA Corpus

This dataset is provided as is and for research purposes only. If you publish anything using this data, please cite our paper: Minwei Feng, Bing Xiang, Michael R. Glass, Lidan Wang, Bowen Zhou. "Applying Deep Learning to Answer Selection: A Study and An Open Task." ASRU 2015.

Introduction

Format

Corpus Statistics

| Split | Question | Answer | Question Running Words |
|-------|----------|--------|------------------------|
| Train | 12,889   | 21,325 | 107,889                |
| Valid | 2,000    | 3,354  | 16,931                 |
| Test  | 2,000    | 3,308  | 16,815                 |

The answer set contains 27,413 answers in total, comprising 3,065,492 running words.
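
As a rough illustration of what these figures imply, the sketch below derives a few averages from the statistics above. It assumes that the Answer column counts the ground-truth answers attached to the questions of each split and that "running words" means total token count; neither is stated explicitly here. The names `SPLITS`, `ANSWER_SET_SIZE`, `ANSWER_WORDS`, and `derived_stats` are hypothetical and not part of the corpus distribution.

```python
# Figures copied from the Corpus Statistics section; nothing is read from the corpus files.
SPLITS = {
    "Train": {"questions": 12_889, "answers": 21_325, "question_words": 107_889},
    "Valid": {"questions": 2_000, "answers": 3_354, "question_words": 16_931},
    "Test": {"questions": 2_000, "answers": 3_308, "question_words": 16_815},
}
ANSWER_SET_SIZE = 27_413   # total answers in the answer set
ANSWER_WORDS = 3_065_492   # running words across all answers


def derived_stats(splits):
    """Return per-split averages derived from the raw counts."""
    stats = {}
    for name, row in splits.items():
        stats[name] = {
            # Average question length in words.
            "words_per_question": row["question_words"] / row["questions"],
            # Assumption: the Answer column counts ground-truth answers per split.
            "answers_per_question": row["answers"] / row["questions"],
        }
    return stats


stats = derived_stats(SPLITS)
words_per_answer = ANSWER_WORDS / ANSWER_SET_SIZE

for name, row in stats.items():
    print(f"{name}: {row['words_per_question']:.2f} words/question, "
          f"{row['answers_per_question']:.2f} answers/question")
print(f"Answer set: {words_per_answer:.2f} words/answer")
```

Under these assumptions, questions average about 8 words while answers average over 100, so answers are more than ten times longer than the questions they are matched against.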