WMT17 TRANSFORMER SCRIPTS
This is a fork of https://github.com/EdinburghNLP/wmt17-scripts that demonstrates how to train Nematus models with a Transformer architecture.
The script training/scripts/train.sh shows a configuration for training a Transformer-base model.
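For orientation, the core of such a configuration comes down to a handful of Nematus training flags. The sketch below is illustrative only: the data paths are placeholders, $nematus is assumed to point at the Nematus checkout (as configured in training/vars), and the authoritative flags and values are the ones in training/scripts/train.sh.

    # hedged sketch of a Transformer-base Nematus invocation;
    # paths are placeholders, see training/scripts/train.sh for the real settings
    python3 $nematus/nematus/train.py \
        --source_dataset corpus.bpe.en \
        --target_dataset corpus.bpe.de \
        --dictionaries corpus.bpe.en.json corpus.bpe.de.json \
        --model model/model \
        --model_type transformer \
        --embedding_size 512 \
        --state_size 512 \
        --transformer_enc_depth 6 \
        --transformer_dec_depth 6 \
        --transformer_ffn_hidden_size 2048 \
        --transformer_num_heads 8 \
        --label_smoothing 0.1 \
        --learning_schedule transformer \
        --warmup_steps 4000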
Scripts for preprocessing, validation, and evaluation are also provided; they mirror the WMT17 setup of the University of Edinburgh, with minor tweaks such as a reduced BPE vocabulary size.
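As an illustration of the BPE step, a joint segmentation model with a reduced vocabulary can be learned with subword-nmt along the following lines; the corpus names and the merge count of 32000 are placeholders, and the actual values are set in the preprocessing script.

    # learn a joint BPE model plus per-language vocabularies (sizes illustrative)
    subword-nmt learn-joint-bpe-and-vocab \
        --input corpus.en corpus.de -s 32000 -o bpe.codes \
        --write-vocabulary vocab.en vocab.de

    # segment the source side, suppressing merges that are rare in its vocabulary
    subword-nmt apply-bpe -c bpe.codes \
        --vocabulary vocab.en --vocabulary-threshold 50 \
        < corpus.en > corpus.bpe.en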
REQUIREMENTS
Training and evaluation rely on the following software:
- Moses decoder (scripts only; no compilation required): https://github.com/moses-smt/mosesdecoder
- Nematus: https://github.com/EdinburghNLP/nematus
- subword-nmt: https://github.com/rsennrich/subword-nmt
Please set the appropriate paths in the 'training/vars' file.
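As a rough sketch, 'training/vars' might look like the following; the variable names here are assumptions, so match whatever names the scripts in this fork actually read.

    # language pair processed by the scripts
    SRC=en
    TRG=de

    # path to the Moses decoder checkout (scripts only, no compiled binaries needed)
    mosesdecoder=/path/to/mosesdecoder

    # path to the Nematus checkout
    nematus=/path/to/nematus

    # path to the subword-nmt checkout
    subword_nmt=/path/to/subword-nmt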
USAGE INSTRUCTIONS
For training, follow the instructions in training/README.md.
LICENSE
All scripts in this directory are distributed under the MIT license.