Home

Awesome

Source Code for the ACL 2020 paper "Paraphrase Augmented Task-Oriented Dialog Generation"

Construct Paraphrase for CamRest676

python analysis.py

Construct Paraphrase for MultiWOZ

python analysis_multi.py

Training with default parameters

python model.py -mode train -model [tsdf-camrest|tsdf-multiwoz]

(optional: configuring hyperparameters with cmdline)

python model.py -mode train -model [tsdf-camrest|tsdf-multiwoz] -cfg lr=0.003 batch_size=32

Testing

python model.py -mode test -model [tsdf-camrest|tsdf-multiwoz]

Reinforcement fine-tuning

python model.py -mode rl -model [tsdf-camrest|tsdf-multiwoz] -cfg lr=0.0001

Before running

  1. Install required python packages. We used pytorch 0.3.0 and python 3.6 under Linux operating system.
pip install -r requirements.txt
  1. Download CamRest676 dataset and MultiWOZ dataset and place them in PROJECT_ROOT/data/CamRest676 and PROJECT_ROOT/data/MultiWOZ, respectively.

  2. Make directories under PROJECT_ROOT.

mkdir vocab
mkdir log
mkdir results
mkdir models
mkdir sheets
  1. Download pretrained Glove word vectors and place them in PROJECT_ROOT/data/glove.