Home

Awesome

SignGraph: A Sign Sequence is Worth Graphs of Nodes

An implementation of the paper: SignGraph: A Sign Sequence is Worth Graphs of Nodes. (CVPR 2024) [paper]

Prerequisites

Data Preparation

  1. PHOENIX2014 dataset: Download the RWTH-PHOENIX-Weather 2014 Dataset [download link].

  2. PHOENIX2014-T datasetDownload the RWTH-PHOENIX-Weather 2014 Dataset [download link]

  3. CSL dataset: Request the CSL Dataset from this website [download link]

Download datasets and extract them, no further data preprocessing needed.

Weights

We make some imporvments of our code, and provide newest checkpoionts and better performance.

DatasetBackboneDev WERDel / InsTest WERDel / InsPretrained model
Phoenix14TSignGraph17.004.99/2.3219.445.14/3.38[Google Drive]
Phoenix14SignGraph17.136.00/2.1718.175.65/2.23[Google Drive]
CSL-DailySignGraph26.389.92/2.6225.849.39/2.58[Google Drive]

​To evaluate the pretrained model, choose the dataset from phoenix2014/phoenix2014-T/CSL/CSL-Daily in line 3 in ./config/baseline.yaml first, and run the command below:
python main.py --device your_device --load-weights path_to_weight.pt --phase test

Training

The priorities of configuration files are: command line > config file > default values of argparse. To train the SLR model, run the command below:

python main.py --device your_device

Note that you can choose the target dataset from phoenix2014/phoenix2014-T/CSL/CSL-Daily in line 3 in ./config/baseline.yaml.

Thanks

This repo is based on VAC (ICCV 2021), VIT (NIPS 2022) and RTG-Net (ACM MM2023)

Citation

If you find this repo useful in your research works, please consider citing:

@inproceedings{gan2024signgraph,
  title={SignGraph: A Sign Sequence is Worth Graphs of Nodes},
  author={Gan, Shiwei and Yin, Yafeng and Jiang, Zhiwei and Wen, Hongkai and Xie, Lei and Lu, Sanglu},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={13470--13479},
  year={2024}
}

@inproceedings{gan2023towards,
  title={Towards Real-Time Sign Language Recognition and Translation on Edge Devices},
  author={Gan, Shiwei and Yin, Yafeng and Jiang, Zhiwei and Xie, Lei and Lu, Sanglu},
  booktitle={Proceedings of the 31st ACM International Conference on Multimedia},
  pages={4502--4512},
  year={2023}
}

@inproceedings{gan2023contrastive,
  title={Contrastive learning for sign language recognition and translation},
  author={Gan, Shiwei and Yin, Yafeng and Jiang, Zhiwei and Xia, Kang and Xie, Lei and Lu, Sanglu},
  booktitle={Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI-23},
  pages={763--772},
  year={2023}
}

@article{han2022vision,
  title={Vision gnn: An image is worth graph of nodes},
  author={Han, Kai and Wang, Yunhe and Guo, Jianyuan and Tang, Yehui and Wu, Enhua},
  journal={Advances in neural information processing systems},
  volume={35},
  pages={8291--8303},
  year={2022}
}