Home

Awesome

SE_ASTER

Introduction

This is the implementation of the paper "SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition" This code is based on the aster.pytorch, we sincerely thank ayumiymk for his awesome repo and help.

How to use

Env

PyTorch == 1.1.0
torchvision == 0.3.0
fasttext == 0.9.1

Details can be found in requirements.txt

Train

Prepare your data
Run
sh train.sh

Test

sh test.sh

Experiments

Evaluation on benchmarks

CheckpointIIIT5KIC13-1015IC13-857IC15-1811IC15-2077SVTSVTPCUTE
OneDrive BaiduYun(key: x54e)93.493.594.579.875.888.482.084.0

Evalution with lexicons

MethodsIIIT5K-50IIIT5K-1KSVT-50IC13IC15
ED99.0697.8796.3697.4487.76
ED + SS<b>99.27<b>97.93<b>96.45<b>97.64<b>88.07

About the word embedding

IIIT5KIC13IC15-1811IC15-2077SVTSVTPCUTE
94.693.885.079.690.984.285.4

Exploration on global information

IIIT5KIC13IC15-1811IC15-2077SVTSVTPCUTE
93.891.378.7-90.181.681.9

Citation

@inproceedings{qiao2020seed,
  title={{SEED}: Semantics enhanced encoder-decoder framework for scene text recognition},
  author={Qiao, Zhi and Zhou, Yu and Yang, Dongbao and Zhou, Yucan and Wang, Weiping},
  booktitle={CVPR},
  year={2020},
}
@article{shi2018aster,
  title={{ASTER}: An attentional scene text recognizer with flexible rectification},
  author={Shi, Baoguang and Yang, Mingkun and Wang, Xinggang and Lyu, Pengyuan and Yao, Cong and Bai, Xiang},
  journal={TPAMI},
  volume={41},
  number={9},
  pages={2035--2048},
  year={2018},
  publisher={IEEE}
}