BiDRN: Binarized Dual Residual Network for 3D Whole-body Human Mesh Recovery

Zhiteng Li, Yulun Zhang, Jing Lin, Haotong Qin, Jinjin Gu, Xin Yuan, Linghe Kong, and Xiaokang Yang, "Binarized 3D Whole-body Human Mesh Recovery", arXiv, 2023

[arXiv] [supplementary material] [visual results] [pretrained models]

Abstract: 3D whole-body human mesh recovery aims to reconstruct the 3D human body, face, and hands from a single image. Although powerful deep learning models have achieved accurate estimation in this task, they require enormous memory and computational resources. Consequently, these methods can hardly be deployed on resource-limited edge devices. In this work, we propose a Binarized Dual Residual Network (BiDRN), a novel quantization method to estimate the 3D human body, face, and hands parameters efficiently. Specifically, we design a basic unit, the Binarized Dual Residual Block (BiDRB), composed of a Local Convolution Residual (LCR) and a Block Residual (BR), which preserves as much full-precision information as possible. We generalize LCR to four kinds of convolutional modules so that full-precision information can be propagated even between mismatched dimensions. We also binarize the face and hands box-prediction network as Binarized BoxNet, which further reduces model redundancy. Comprehensive quantitative and qualitative experiments demonstrate the effectiveness of BiDRN, which significantly improves over state-of-the-art binarization algorithms. Moreover, BiDRN achieves performance comparable to the full-precision method Hand4Whole while using just 22.1% of the parameters and 14.8% of the operations. We will release all the code and pretrained models.
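
To make the residual idea in the abstract concrete, here is a minimal pure-Python sketch of a binarized 1D convolution with an identity shortcut. It is not the released BiDRN code: the function names, the sign-based binarization, the `mean(|w|)` scaling factor, and the 1D setting are all assumptions chosen for illustration, and it covers only the case where input and output dimensions match.

```python
def sign(x):
    """Binarize a real value to +1 or -1 (zero maps to +1 by convention here)."""
    return 1.0 if x >= 0 else -1.0


def binary_conv1d(x, w):
    """Zero-padded 'same' 1D convolution with binarized activations and weights.

    Assumes an odd kernel length. The scale alpha = mean(|w|) is a common
    choice in the BNN literature and is an assumption of this sketch.
    """
    k = len(w)
    pad = k // 2
    alpha = sum(abs(v) for v in w) / k
    xb = [0.0] * pad + [sign(v) for v in x] + [0.0] * pad
    wb = [sign(v) for v in w]
    return [alpha * sum(xb[i + j] * wb[j] for j in range(k)) for i in range(len(x))]


def local_conv_residual(x, w):
    """Binarized convolution plus an identity shortcut carrying full-precision x."""
    y = binary_conv1d(x, w)
    return [yi + xi for yi, xi in zip(y, x)]


x = [0.5, -1.5, 2.0]
x_scaled = [2 * v for v in x]   # same signs, different magnitudes
w = [0.2, -0.4, 0.6]

# Binarization discards magnitude, so both inputs give the same output.
print(binary_conv1d(x, w) == binary_conv1d(x_scaled, w))
# The shortcut restores the full-precision difference between them.
print(local_conv_residual(x, w) == local_conv_residual(x_scaled, w))
```

The first comparison prints `True` and the second prints `False`: without the shortcut the two inputs are indistinguishable after binarization, while the residual path keeps their full-precision values in the output.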


| Image | Hand4Whole<br>(full-precision) | BNN | BiDRN (ours) |
| :---: | :---: | :---: | :---: |
| <img src="figs/ori_coco_4134_2.png" height=260> | <img src="figs/fp_coco_4134_2.png" height=260> | <img src="figs/bnn_coco_4134_2.png" height=260> | <img src="figs/ours_coco_4134_2.png" height=260> |
| Params (M) / OPs (G) | 77.84 / 16.85 | 21.61 / 2.63 | 17.22 / 2.50 |
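
The 22.1% and 14.8% figures quoted in the abstract can be checked against the parameter and operation counts listed above. The short script below is only an arithmetic check; the dictionary and function names are made up for this example.

```python
# Params (M) and OPs (G) as reported in the comparison above.
models = {
    "Hand4Whole": (77.84, 16.85),
    "BNN": (21.61, 2.63),
    "BiDRN": (17.22, 2.50),
}


def ratio_to_full_precision(name, baseline="Hand4Whole"):
    """Return (params %, OPs %) of `name` relative to the full-precision baseline."""
    params, ops = models[name]
    base_params, base_ops = models[baseline]
    return 100 * params / base_params, 100 * ops / base_ops


for name in ("BNN", "BiDRN"):
    p, o = ratio_to_full_precision(name)
    print(f"{name}: {p:.1f}% params, {o:.1f}% OPs")
# BNN: 27.8% params, 15.6% OPs
# BiDRN: 22.1% params, 14.8% OPs
```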

🔎 Results

BiDRN outperforms existing state-of-the-art BNN-based methods and achieves performance comparable to full-precision methods. More results can be found in the paper.

Quantitative Comparison

<p align="center"> <img width="900" src="figs/quantitative.png"> </p>

Visual Comparison

<p align="center"> <img width="900" src="figs/EHF.png"> </p>

Citation

If you find the code helpful in your research or work, please cite the following paper.

@article{li2023binarized,
      title={Binarized 3D Whole-body Human Mesh Recovery}, 
      author={Zhiteng Li and Yulun Zhang and Jing Lin and Haotong Qin and Jinjin Gu and Xin Yuan and Linghe Kong and Xiaokang Yang},
      year={2023},
      eprint={2311.14323},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

💡 Acknowledgements

This work is released under the Apache 2.0 license. The code is based on Hand4Whole; please also follow its license. Thanks to the authors for their awesome work.