Awesome

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

</div> <p align="center"> <img src="images/motivation.jpg" width="400"> </p>

This is the official implementation of our paper DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks.

News

🔥 A comprehensive Recommendation for Document Image Processing is available.

Inference

Put MBD model weights mbd.pkl to ./data/MBD/checkpoint/
Put DocRes model weights docres.pkl to ./checkpoints/
Run the following script and the results will be saved in ./restorted/. We have provided some distorted examples in ./input/.

python inference.py --im_path ./input/for_dewarping.png --task dewarping --save_dtsprompt 1

--im_path: the path of input document image
--task: task that need to be executed, it must be one of dewarping, deshadowing, appearance, deblurring, binarization, or end2end
--save_dtsprompt: whether to save the DTSPrompt

Evaluation

Dataset preparation, see dataset instruction
Put MBD model weights mbd.pkl to data/MBD/checkpoint/
Put DocRes model weights docres.pkl to ./checkpoints/
Run the following script

python eval.py --dataset realdae

--dataset: dataset that need to be evaluated, it can be set as dir300, kligler, jung, osr, docunet_docaligner, realdae, tdd, and dibco18.

Training

Dataset preparation, see dataset instruction
Specify the datasets_setting within train.py based on your dataset path and experimental setting.
Run the following script

bash start_train.sh

Citation:

@inproceedings{zhangdocres2024, 
Author = {Jiaxin Zhang, Dezhi Peng, Chongyu Liu , Peirong Zhang and Lianwen Jin}, 
Booktitle = {In Proceedings of the IEEE/CV Conference on Computer Vision and Pattern Recognition}, 
Title = {DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks}, 
Year = {2024}}

Awesome

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

News

Inference

Evaluation

Training

Citation:

⭐ Star Rising