Home

Awesome

CRASpell

Source code for the paper "CRASpell: A Contextual Typo Robust Approach to Improve Chinese Spelling Correction" in ACL2022 findings https://aclanthology.org/2022.findings-acl.237.pdf .

1. Requirements

-python 3.7

-tensorflow 1.14

2. Instructions

Step1: Download the pretrained cBERT from https://drive.google.com/file/d/1cqSTpn7r9pnDcvMoM3BbX1X67JsPdZ8_/view?usp=sharing (our previous work), 
and save it in ./datas/init_bert/cbert

Step2: Run the training script: sh start_train.sh
      The best model will be saved when it is finished.

Step3: Run the evaluation script to obtain the results on whole set and multi-typo set, respectively:
      sh start_eval.sh sighan15_test.sh
      sh start_eval.sh sighan15_multierror.txt
      sh start_eval.sh sighan14_test.sh
      sh start_eval.sh sighan14_multierror.txt