Home

Awesome

CUDA Optimized Simple Recurrent Unit (SRU)

Todo

Requirements

Language Modeling

Penn Treebank

cd examples/ptb
python3 train.py -b 32 -nf 128 -l 35 -opt msgd -lr 1 -g 0 -e 300 -lrd 30 -nl 2 -m 128x2.hdf5 -dos 0.5 -dor 0.2 -tanh
python3 train.py -b 32 -nf 320 -l 35 -opt msgd -lr 1 -g 0 -e 300 -lrd 30 -nl 2 -m 320x2.hdf5 -dos 0.5 -dor 0.2 -tanh
Model#layersdPerplexity
LSTM264089
SRU264092
SRU232092
LSTM232093
SRU2128110
LSTM2128117

Benchmark

result