Awesome
Awesome Human Pose Estimation
<p align="center"> <img src="1.png" width=700> </p> <p align="center"> <img src="2.png" width=700> </p> <p align="center"> <img src="vis.png" width=700> </p> <p align="center"> <img src="sfv.png" width=700> </p>A collection of resources on human pose related problem: mainly focus on human pose estimation, and will include mesh representation, flow calculation, (inverse) kinematics, affordance, robotics, or sequence learning.
Why awesome human pose estimation?
This is a collection of papers and resources I curated when learning the ropes in Human Pose estimation. And This is a fork from https://github.com/cbsudux/awesome-human-pose-estimation (thanks for cbsudux) and customized for personal study and sharing. I will be continuously updating this list with the latest papers and resources. If you want some theory on Human Pose Estimation, check out Pose Related_Human_Knowledge
Related Pages:
https://github.com/xinghaochen/awesome-hand-pose-estimation https://github.com/1adrianb/face-alignment
Contributing
If you think I have missed out on something (or) have any suggestions (papers, implementations and other resources), feel free to pull a request
Feedback and contributions are welcome!
Table of Contents
Basics
Papers
2D Pose estimation
- Learning Human Pose Estimation Features with Convolutional Networks - Jain, A., Tompson, J., Andriluka, M., Taylor, G.W., & Bregler, C. (ICLR 2013)
- DeepPose: Human Pose Estimation via Deep Neural Networks - Toshev, A., & Szegedy, C. (CVPR 2014)
- Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation - [CODE] - Tompson, J., Jain, A., LeCun, Y., & Bregler, C. (NIPS 2014)
- MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation - Jain, A., Tompson, J., LeCun, Y., & Bregler, C. (ACCV 2014)
- Efficient Object Localization Using Convolutional Networks - Tompson, J., Goroshin, R., Jain, A., LeCun, Y., & Bregler, C (CVPR 2015)
- Flowing ConvNets for Human Pose Estimation in Videos - [CODE] - Pfister, T., Charles, J., & Zisserman, A. (ICCV 2015)
- Convolutional Pose Machines - [CODE] - Wei, S., Ramakrishna, V., Kanade, T., & Sheikh, Y. (CVPR 2016)
- Human Pose Estimation with Iterative Error Feedback- [CODE] Carreira, J., Agrawal, P., Fragkiadaki, K., & Malik, J. (CVPR 2016)
- DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation - [CODE] - Pishchulin, L., Insafutdinov, E., Tang, S., Andres, B., Andriluka, M., Gehler, P.V., & Schiele, B. (CVPR 2016)
- DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model - [CODE1][CODE2] - Insafutdinov, E., Pishchulin, L., Andres, B., Andriluka, M., & Schiele, B. (ECCV 2016)
- Stacked Hourglass Networks for Human Pose Estimation - [CODE] - Newell, A., Yang, K., & Deng, J. (ECCV 2016)
- Multi-context Attention for Human Pose Estimation - [CODE] - Chu, X., Yang, W., Ouyang, W., Ma, C., Yuille, A.L., & Wang, X. (CVPR 2017)
- Towards Accurate Multi-person Pose Estimation in the Wild - [CODE] - Papandreou, G., Zhu, T., Kanazawa, N., Toshev, A., Tompson, J., Bregler, C., & Murphy, K.P. (CVPR 2017)
- Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields - [CODE] - Cao, Z., Simon, T., Wei, S., & Sheikh, Y. (CVPR 2017)
- Learning Feature Pyramids for Human Pose Estimation - [CODE] - Yang, W., Li, S., Ouyang, W., Li, H., & Wang, X. (ICCV 2017)
- Human Pose Estimation Using Global and Local Normalization - Sun, K., Lan, C., Xing, J., Zeng, W., Liu, D., & Wang, J. (ICCV 2017)
- Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation - Chen, Y., Shen, C., Wei, X., Liu, L., & Yang, J. (ICCV 2017)
- RMPE: Regional Multi-person Pose Estimation - [CODE1][CODE2] - Fang, H., Xie, S., & Lu, C. (ICCV 2017)
- Self Adversarial Training for Human Pose Estimation - [CODE1][CODE2] - Chou, C., Chien, J., & Chen, H. (ArXiv 2017)
- Recurrent Human Pose Estimation - [CODE] - Belagiannis, V., & Zisserman, A. (FG 2017)
- Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation - [CODE] Ning, G., Zhang, Z., & He, Z. (IEEE Transactions on Multimedia 2018)
- Human Pose Estimation with Parsing Induced Learner- Xuecheng Nie, Jiashi Feng, Yiming Zuo, Shuicheng Yan (CVPR 2018)
- LSTM Pose Machines - [CODE] - Yue Luo, Jimmy Ren, Zhouxia Wang, Wenxiu Sun, Jinshan Pan, Jianbo Liu, Jiahao Pang, Liang Lin (CVPR 2018)
- Cascaded Pyramid Network for Multi-Person Pose Estimation - [CODE] - Yilun Chen, Zhicheng Wang, Yuxiang Peng, Zhiqiang Zhang, Gang Yu, Jian Sun (CVPR 2018)
- Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation - [CODE] - Peng, Xi and Tang, Zhiqiang and Yang, Fei and Feris, Rogerio S and Metaxas, Dimitris (CVPR 2018)
- Human Pose Estimation with Parsing Induced Learner - [CODE] - Xuecheng Nie, Jiashi Feng, Yiming Zuo, Shuicheng Yan (CVPR 2018)
- Through-Wall Human Pose Estimation Using Radio Signals - Mingmin Zhao,Tianhong Li, Mohammad Abu Alsheikh, Yonglong Tian, Hang Zhao, Antonio Torralba, Dina Katabi (CVPR 2018)
- Simple Baselines for Human Pose Estimation and Tracking - [CODE] - Bin, Xiao, Haiping Wu, Yichen Wei (ECCV 2018)
- Multi-Scale Structure-Aware Network for Human Pose Estimation - Lipeng Ke, Ming-Ching Chang, Honggang Qi, Siwei Lyu (ECCV 2018)
- Deeply Learned Compositional Models for Human Pose Estimation - [CODE] - Wei Tang, Pei Yu, Ying Wu (ECCV 2018)
- MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network - [CODE] - Muhammed Kocabas, Salih Karagoz, Emre Akbas (ECCV 2018)
- Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose - [CODE] - Osokin, D. (Arxiv 2018)
- Rethinking on Multi-Stage Networks for Human Pose Estimation - Wenbo Li, Zhicheng Wang, Binyi Yin, Qixiang Peng, Yuming Du, Tianzi Xiao, Gang Yu,Hongtao Lu, Yichen Wei, and Jian Sun (Arxiv 2018)
- CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark - [CODE] - Jiefeng Li, Can Wang, Hao Zhu, Yihuan Mao, Hao-Shu Fang, Cewu Lu (CVPR 2019)
- Deep High-Resolution Representation Learning for Human Pose Estimation - [CODE] - [CODE2] - Ke Sun, Bin Xiao, Dong Liu, Jingdong Wang (CVPR 2019)
- Human Pose Estimation with Spatial Contextual Information - Hong Zhang, Hao Ouyang, Shu Liu, Xiaojuan Qi, Xiaoyong Shen, Ruigang Yang, Jiaya Jia (Arxiv 2019)
- PoseFix: Model-agnostic General Human Pose Refinement Network - [CODE] - Moon, Gyeongsik and Chang, Juyong and Lee, Kyoung Mu (CVPR 2019)
- Graphonomy: Universal Human Parsing via Graph Transfer Learning - [CODE] - Ke Gong, Yiming Gao, Xiaodan Liang, Xiaohui Shen, Meng Wang, Liang Lin (CVPR 2019)
- PifPaf: Composite Fields for Human Pose Estimation - [CODE] - Sven Kreiss, Lorenzo Bertoni, Alexandre Alahi (CVPR 2019)
- Person-in-WiFi: Fine-grained Person Perception using WiFi - Fei Wang, Sanping Zhou, Stanislav Panev, Jinsong Han, Dong Huang (arxiv 2019)
- Can WiFi Estimate Person Pose? - Fei Wang, Stanislav Panev, Ziyi Dai, Jinsong Han, Dong Huang (arxiv 2019)
- Learning to Learn Relation for Important People Detection in Still Images - [CODE] - Wei-Hong Li, Fa-Ting Hong, Wei-Shi Zheng (CVPR 2019)
- Efficient Online Multi-Person 2D Pose Tracking with Recurrent Spatio-Temporal Affinity Fields - [CODE] - Yaadhav Raaj, Haroon Idrees, Gines Hidalgo, Yaser Sheikh(CVPR 2019)
- Adaptive NMS: Refining Pedestrian Detection in a Crowd - Songtao Liu, Di Huang, Yunhong Wang (CVPR 2019)
- Multi-Person Pose Estimation with Enhanced Channel-wise and Spatial Information - Kai Su, Dongdong Yu, Zhenqi Xu, Xin Geng, Changhu Wang (CVPR 2019)
- Fast Human Pose Estimation - [CODE] - Feng Zhang, Xiatian Zhu, Mao Ye (CVPR 2019)
- Slim DensePose: Thrifty Learning from Sparse Annotations and Motion Cues - Natalia Neverova, James Thewlis, Rıza Alp Güler, Iasonas Kokkinos, Andrea Vedaldi (CVPR 2019)
- Objects as Points - [CODE] - Xingyi Zhou, Dequan Wang, Philipp Krähenbühl (arxiv 2019)
- Learning Individual Styles of Conversational Gesture - [CODE] - Shiry Ginosar, Amir Bar, Gefen Kohavi, Caroline Chan, Andrew Owens, Jitendra Malik (CVPR 2019)
- Does Learning Specific Features for Related Parts Help Human Pose Estimation? - Wei Tang and Ying Wu (CVPR 2019)
- Visual Person Understanding through Multi-Task and Multi-Dataset Learning - Kilian Pfeiffer, et al (Arxiv 2019)
- Movement science needs different pose tracking algorithms - Nidhi Seethapathi, Shaofei Wang, Rachit Saluja, Gunnar Blohm, Konrad P. Kording (Arxiv 2019)
- Learning to Train with Synthetic Humans - David T. Hoffmann, Dimitrios Tzionas, Micheal J. Black, Siyu Tang (GCPR 2019)
- Falls Prediction Based on Body Keypoints and Seq2Seq Architecture - Minjie Hua, Yibing Nan, Shiguo Lian (Arxiv 2019)
- Cross-Domain Adaptation for Animal Pose Estimation - Jinkun Cao, Hongyang Tang, Hao-Shu Fang, Xiaoyong Shen, Cewu Lu, Yu-Wing Tai (ICCV 2019)
- Pose Neural Fabrics Search - [CODE] - Sen Yang, Wankou Yang, Zhen Cui (Arxiv 2019)
- Anchor Loss: Modulating Loss Scale based on Prediction Difficulty - Serim Ryou, Seong-Gyun Jeong, Pietro Perona (ICCV 2019)
- Single-Network Whole-Body Pose Estimation - [CODE] - Gines Hidalgo, Yaadhav Raaj, Haroon Idrees, Donglai Xiang, Hanbyul Joo, Tomas Simon, Yaser Sheikh (ICCV 2019)
- The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation - Junjie Huang, Zheng Zhu, Feng Guo, Guan Huang (Arxiv 2019)
- DirectPose: Direct End-to-End Multi-Person Pose Estimation - Zhi Tian, Hao Chen, Chunhua Shen (Arxiv 2019)
- The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation - Junjie Huang, Zheng Zhu, Feng Guo, Guan Huang (ICCV 2019)
- TRB: A Novel Triplet Representation for Understanding 2D Human Body - [Data] - Haodong Duan, KwanYee Lin, Sheng Jin, Wentao Liu, Chen Qian, Wanli Ouyang (ICCV 2019)
- Simple and Lightweight Human Pose Estimation - Zhe Zhang, Jie Tang, Gangshan Wu (Arxiv 2019)
- Mixture Dense Regression for Object Detection and Human Pose Estimation - Ali Varamesh, Tinne Tuytelaars (Arxiv 2019)
- 15 Keypoints Is All You Need - Michael Snower, Asim Kadav, Farley Lai, Hans Peter Graf (Arxiv 2019)
- Learning Temporal Pose Estimation from Sparsely Labeled Videos - [CODE] - Gedas Bertasius, Christoph Feichtenhofer, Du Tran, Jianbo Shi, Lorenzo Torresani (NIPS 2019)
- Correlated Uncertainty for Learning DenseCorrespondences from Noisy Labels - Natalia Neverova, David Novotny, Andrea Vedaldi (NIPS 2019)
- Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation -[CODE] - Jia Li, Wen Su, Zengfu Wang (AAAI2020)
- An End-to-End Framework for Unsupervised Pose Estimation of Occluded Pedestrians - Sudip Das, Perla Sai Raj Kishore, Ujjwal Bhattacharya (Arxiv 2020)
- Transferring Dense Pose to Proximal Animal Classes - [CODE] - Artsiom Sanakoyeu, Vasil Khalidov, Maureen S. McCarthy, Andrea Vedaldi, Natalia Neverova (CVPR 2020)
- Peeking into occluded joints: A novel framework for crowd pose estimation - ingteng Qiu, Xuanye Zhang, Yanran Li, Guanbin Li, Xiaojun Wu, Zixiang Xiong, Xiaoguang Han, Shuguang Cui (Arxiv 2020)
- Motion-supervised Co-Part Segmentation - Aliaksandr Siarohin*, Subhankar Roy*, Stéphane Lathuilière, Sergey Tulyakov, Elisa Ricci, Nicu Sebe (Arxiv 2020)
- Detailed 2D-3D Joint Representation for Human-Object Interaction - [CODE] - Yong-Lu Li, Xinpeng Liu, Han Lu, Shiyi Wang, Junqi Liu, Jiefeng Li, Cewu Lu (CVPR 2020)
- Distribution Aware Coordinate Representation for Human Pose Estimation - [CODE] - Feng Zhang, Xiatian Zhu, Hanbin Dai, Mao Ye, Ce Zhu (CVPR 2020)
- Yoga-82: A New Dataset for Fine-grained Classification of Human Poses - [Data] - Manisha Verma, Sudhakar Kumawat, Yuta Nakashima, Shanmuganathan Raman (CVPRW 2020)
- Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos - Rafi Umer, Andreas Doering, Bastian Leibe, Juergen Gall (Arxiv 2020)
- Making DensePose fast and light (Arxiv 2020)
- Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation - Sheng Jin, Wentao Liu, Enze Xie, Wenhai Wang, Chen Qian, Wanli Ouyang, Ping Luo (ECCV 2020)
- Whole-Body Human Pose Estimation in the Wild - [Data] - Sheng Jin, Lumin Xu, Jin Xu, Can Wang, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo (ECCV 2020)
3D Pose estimation
- Reconstruction of Articulated Objects from Point Correspondences in a Single Uncalibrated Image - CJ Taylor. (CVIU 2000)
- Covariance-Scaled Sampling for Monocular 3D Body Tracking - Cristian Sminchisescu and Bill Triggs. (CVPR 2001)
- Improving the Scope of Deformable Model Shape and Motion Estimation - C. Sminchisescu and D. Metaxas and S. Dickinson. (CVPR 2001)
- Recovering 3D Human Posefrom Monocular Images - Ankur Agarwal and Bill Triggs. (PAMI 2006)
- 3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network - Li, S., & Chan, A.B. (ACCV 2014)
- 3D Pictorial Structures for Multiple Human Pose Estimation - Vasileios Belagiannis , Sikandar Amin, Mykhaylo Andriluka,Bernt Schiele, Nassir Navab, and Slobodan Ilic (CVPR 2014)
- 3D Human pose estimation: A review of the literature and analysis of covariates (CVIU 2016)
- Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video - [CODE] - X. Zhou, M. Zhu, G. Pavlakos, S. Leonardos, K.G. Derpanis, K. Daniilidis. (CVPR 2016)
- Structured Prediction of 3D Human Pose with Deep Neural Networks - Tekin, B., Katircioglu, I., Salzmann, M., Lepetit, V., & Fua, P. (BMVC 2016)
- VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera - [CODE] - Mehta, Dushyant et al. (SIGGRAPH 2017)
- Recurrent 3D Pose Sequence Machines - Lin, M., Lin, L., Liang, X., Wang, K., & Cheng, H. (CVPR 2017)
- Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image - Tomè, D., Russell, C., & Agapito, L. (CVPR 2017)
- 3D Human Pose Estimation from a Single Image via Distance Matrix Regression - Francesc Moreno-Noguer. (CVPR 2017)
- 3D Human Pose Estimation = 2D Pose Estimation + Matching - [CODE] - Ching-Hang Chen, Deva Ramanan. (CVPR 2017)
- Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose - [CODE] - Pavlakos, G., Zhou, X., Derpanis, K.G., & Daniilidis, K. (CVPR 2017)
- LCR-Net: Localization-Classification-Regression for Human Pose - [CODE] - Grégory Rogez, Philippe Weinzaepfel, Cordelia Schmid. (CVPR 2017)
- Deep Learning on Lie Groups for Skeleton-based Action Recognition - [CODE] - Zhiwu Huang, Chengde Wan, Thomas Probst, Luc Van Gool. (CVPR 2017)
- Seeing invisible poses: Estimating3d body pose from egocentric video. - Hao Jiang, Kristen Grauman. (CVPR 2017)
- Harvesting Multiple Views for Marker-less 3D Human Pose Annotations - [CODE] - G. Pavlakos, X. Zhou, K. Derpanis, K. Daniilidis. (CVPR 2017)
- Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach - [CODE] - Zhou, X., Huang, Q., Sun, X., Xue, X., & Wei, Y. (ICCV 2017)
- Adversarial Inverse Graphics Networks: Learning 2D-to-3D Lifting and Image-to-Image Translation from Unpaired Supervision - Hsiao-Yu Fish Tung. etal. (ICCV 2017)
- A Simple Yet Effective Baseline for 3d Human Pose Estimation - [CODE] - Martinez, J., Hossain, R., Romero, J., & Little, J.J. (ICCV 2017)
- Sparse Representation for 3D Shape Estimation: A Convex Relaxation Approach - [CODE] - X. Zhou, M. Zhu, S. Leonardos, K. Daniilidis. (PAMI 2017)
- Compositional Human Pose Regression - Sun, X., Shang, J., Liang, S., & Wei, Y. (ICCV 2017)
- Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision - Mehta, D., Rhodin, H., Casas, D., Fua, P., Sotnychenko, O., Xu, W., & Theobalt, C. (3DV 2017)
- 3D Human Pose Estimation in the Wild by Adversarial Learning - Yang, W., Ouyang, W., Wang, X., Ren, J.S., Li, H., & Wang, X. (CVPR 2018)
- Ordinal Depth Supervision for 3D Human Pose Estimation - [CODE] - G. Pavlakos, X. Zhou, K. Daniilidis. (CVPR 2018)
- V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation From a Single Depth Map - [CODE] - Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee. (CVPR 2018)
- DRPose3D: Depth Ranking in 3D Human Pose Estimation - Wang, M., Chen, X., Liu, W., Qian, C., Lin, L., & Ma, L. (IJCAI 2018)
- Human Motion Capture Using a Drone - X. Zhou, S. Liu, G. Pavlakos, V.J. Kumar, K. Daniilidis. (ICRA 2018)
- End-to-end Recovery of Human Shape and Pose - [CODE] - Kanazawa, A., Black, M.J., Jacobs, D.W., & Malik, J. (CVPR 2018)
- Learning to Estimate 3D Human Pose and Shape from a Single Color Image - Pavlakos, G., Zhu, L., Zhou, X., & Daniilidis, K. (CVPR 2018)
- Monocular 3D Pose and Shape Estimation of Multiple People in Natural Scenes - Andrei Zanfir, Elisabeta Marinoiu, Cristian Sminchisescu. (CVPR 2018)
- Dense Human Pose Estimation In The Wild - [CODE] - Guler, R.A., Neverova, N., & Kokkinos, I. (CVPR 2018)
- Learning Monocular 3D Human Pose Estimation from Multi-View Images - Helge Rhodin, Jörg Spörri, Isinsu Katircioglu, Victor Constantin, Frédéric Meyer, Erich Müller, Mathieu Salzmann, Pascal Fua. (CVPR 2018)
- 3D Human Sensing, Action and Emotion Recognition inRobot Assisted Therapy of Children with Autism - Elisabeta Marinoiu, Mihai Zanfir, Vlad Olaru, Cristian Sminchisescu. (CVPR 2018)
- Neural Body Fitting: Unifying Deep Learning and Model-Based Human Pose and Shape Estimation - [CODE] - Omran, Mohamed and Lassner, Christoph and Pons-Moll, Gerard and Gehler, Peter V. and Schiele, Bernt (3DV 2018)
- Learning 3D Human Pose from Structure and Motion - Dabral, R., Mundhada, A., Kusupati, U., Afaque, S., Sharma, A., & Jain, A. (ECCV 2018)
- Unsupervised Learning of View-invariant Action Representations - Junnan Li.etal. (NIPS 2018)
- Deep Network for the Integrated 3D Sensing ofMultiple People in Natural Images - Andrei Zanfir.etal. (NIPS 2018)
- Integral Human Pose Regression - [CODE] - Sun, X., Xiao, B., Liang, S., & Wei, Y. (ECCV 2018)
- Dense Pose Transfer - Neverova, N., Guler, R.A., & Kokkinos, I. (ECCV 2018)
- Deformable Pose Traversal Convolutionfor 3D Action and Gesture Recognition - Junwu Weng.et.al. (ECCV 2018)
- Deep Autoencoder for Combined Human Pose Estimation and Body Model Upscaling - Matthew Trumble, Andrew Gilbert, Adrian Hilton, John Collomosse (ECCV 2018)
- Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation - [CODE] - Rhodin, H., Salzmann, M., & Fua, P. (ECCV 2018)
- Monocap: Monocular human motion capture using a CNN coupled with a geometric prior - [CODE] - X. Zhou, M. Zhu, G. Pavlakos, S. Leonardos, K.G. Derpanis, K. Daniilidis. (TPAMI 2018)
- Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB - [CODE1][CODE2] - Mehta, Dushyant and Sotnychenko, Oleksandr and Mueller, Franziska and Xu, Weipeng and Sridhar, Srinath and Pons-Moll, Gerard and Theobalt, Christian (3DV 2018)
- HUMBI 1.0: HUman Multiview Behavioral Imaging Dataset (Arxiv 2018)
- Explicit Pose Deformation Learning for Tracking Human Poses - Xiao Sun, Chuankang Li, Stephen Lin (Arxiv 2018)
- 3D Human Pose Machines with Self-supervised Learning - [CODE] - Keze Wang, Liang Lin, Chenhan Jiang, Chen Qian, and Pengxu Wei. (TPAMI 2019)
- 3D Human Pose Estimation with 2D Marginal Heatmaps - [CODE] - Aiden Nibali, Zhen He, Stuart Morgan, Luke Prendergast. (WACV 2019)
- Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views - [CODE] - Junting Dong, Wen Jiang, Qixing Huang, Hujun Bao, Xiaowei Zhou. (CVPR 2019)
- Learning the Depths of Moving People by Watching Frozen People - Zhengqi Li, Tali Dekel, Forrester Cole, Richard Tucker, Noah Snavely, Ce Liu, William T. Freeman. (CVPR 2019)
- Monocular Total Capture: Posing Face, Body and Hands in the Wild - [CODE] - Donglai Xiang, Hanbyul Joo, Yaser Sheikh (CVPR 2019)
- RepNet: Weakly Supervised Training of an Adversarial Reprojection Network for 3D Human Pose Estimation - Bastian Wandt, Bodo Rosenhahn (CVPR 2019)
- In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations - Ikhsanul Habibie, Weipeng Xu, Dushyant Mehta, Gerard Pons-Moll, Christian Theobalt (CVPR 2019)
- Semantic Graph Convolutional Networks for 3D Human Pose Regression - [CODE] - Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris N. Metaxas (CVPR 2019)
- ON THE CONTINUITY OF ROTATION REPRESENTATIONS IN NEURAL NETWORKS - Yi Zhou*, Connelly Barnes*, Jingwan Lu, Jimei Yang, Hao Li (CVPR 2019)
- Self-Supervised Learning of 3D Human Pose using Multi-view Geometry - [CODE] - Muhammed Kocabas, Salih Karagoz, Emre Akbas, (CVPR 2019)
- Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network - [CODE] - Chen Li, Gim Hee Lee (CVPR 2019)
- Neural Scene Decomposition for Multi-Person Motion Capture - Helge Rhodin, Victor Constantin, Isinsu Katircioglu, Mathieu Salzmann, Pascal Fua (CVPR 2019)
- Weakly-Supervised Discovery of Geometry-Aware Representationfor 3D Human Pose Estimation - Xipeng Chen, Kwan-Yee Lin, Wentao Liu, Chen Qian, Liang Lin (CVPR 2019)
- IGE-Net: Inverse Graphics Energy Networksfor Human Pose Estimation and Single-View Reconstruction - Dominic Jack etal (CVPR 2019)
- Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking - [CODE] - Saurabh Sharma, Pavan Teja Varigonda, Prashast Bindal, Abhishek Sharma, Arjun Jain (Arxiv 2019)
- Estimating 3D Motion and Forces of Person-Object Interactions from Monocular Video - Zongmian Li, Jiri Sedlar, Justin Carpentier, Ivan Laptev, Nicolas Mansard, Josef Sivic (Arxiv 2019)
- Context-aware Human Motion Prediction - Enric Corona, Albert Pumarola, Guillem Alenyà, Francesc Moreno (Arxiv 2019)
- Unsupervised 3D Pose Estimation with Geometric Self-Supervision - Ching-Hang Chen, Ambrish Tyagi, Amit Agrawal, Dylan Drover, Rohith MV, Stefan Stojanov, James M. Rehg (Arxiv 2019)
- Generalizing Monocular 3D Human Pose Estimation in the Wild - [CODE] - Luyang Wang, Yan Chen, Zhenhua Guo, Keyuan Qian, Mude Lin, Hongsheng Li, Jimmy S. Ren (Arxiv 2019)
- Absolute Human Pose Estimation with Depth Prediction Network - Márton Véges, András Lőrincz (IJCNN 2019)
- You2Me: Inferring Body Pose in Egocentric Video via First and Second Person Interactions - [CODE] - Evonne Ng, Donglai Xiang, Hanbyul Joo, Kristen Grauman (Arxiv 2019)
- Learnable Triangulation of Human Pose - [CODE] - Karim Iskakov, Egor Burkov, Victor Lempitsky, Yury Malkov (ICCV 2019)
- Not All Parts Are Created Equal: 3D Pose Estimation by Modelling Bi-directional Dependencies of Body Parts - Jue Wang, Shaoli Huang, Xinchao Wang, Dacheng Tao (Arxiv 2019)
- MONET: Multiview Semi-supervised Keypoint via Epipolar Divergence - Yasamin Jafarian, Yuan Yao, Hyun Soo Park (Arxiv 2018)
- Feature Boosting Network For 3D Pose Estimation - Jun Liu, Henghui Ding, Amir Shahroudy, Ling-Yu Duan, Xudong Jiang, Gang Wang, Alex C. Kot (TPAMI 2019)
- Ego-Pose Estimation and Forecasting as Real-Time PD Control - [CODE] - Ye Yuan, Kris Kitani (ICCV 2019)
- Understanding Human Context in 3D Scenes by Learning Spatial Affordances with Virtual Skeleton Models - Lasitha Piyathilaka, Sarath Kodagoda (Arxiv 2019)
- XNect: Real-time Multi-person 3D Human Pose Estimation with a Single RGB Camera - [CODE] - Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans-Peter Seidel, Helge Rhodin, Gerard Pons-Moll, Christian Theobalt (Arxiv 2019)
- Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image - [CODE] - [CODE] - Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee (ICCV 2019)
- xR-EgoPose: Egocentric 3D Human Pose from an HMD Camera - [CODE] - Denis Tome, Patrick Peluse, Lourdes Agapito, Hernan Badino (ICCV 2019)
- Semantic Estimation of 3D Body Shape and Pose using Minimal Cameras - Andrew Gilbert, Matthew Trumble, Adrian Hilton, John Collomosse (Arxiv 2019)
- Resolving 3D Human Pose Ambiguities with 3D Scene Constraints - [Data] - [CODE] - Mohamed Hassan, Vasileios Choutas, Dimitrios Tzionas, Michael J. Black (ICCV 2019)
- Distill Knowledge from NRSfM for Weakly Supervised 3D Pose Learning - Chaoyang Wang, Chen Kong, Simon Lucey (ICCV 2019)
- Single-Stage Multi-Person Pose Machines - Xuecheng Nie, Jianfeng Zhang, Shuicheng Yan, Jiashi Feng (ICCV 2019)
- A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image - [CODE] - Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan (ICCV 2019)
- Cross View Fusion for 3D Human Pose Estimation - [CODE] - Haibo Qiu, Chunyu Wang, Jingdong Wang, Naiyan Wang, Wenjun Zeng (ICCV 2019)
- Optimizing Network Structure for 3D Human Pose Estimation - Hai Ci, Chunyu Wang, Xiaoxuan Ma, Yizhou Wang, (ICCV 2019)
- Motion Capture from Pan-Tilt Cameras with Unknown Orientation - Roman Bachmann, Jörg Spörri, Pascal Fua, Helge Rhodin (3DV 2019)
- C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion - David Novotny, Nikhila Ravi, Benjamin Graham, Natalia Neverova, Andrea Vedaldi (ICCV 2019)
- MonoLoco: Monocular 3D Pedestrian Localization and Uncertainty Estimation - [CODE] - Lorenzo Bertoni, Sven Kreiss, Alexandre Alahi (ICCV 2019)
- Multi-Person 3D Human Pose Estimation from Monocular Images - Rishabh Dabral, Nitesh B Gundavarapu, Rahul Mitra, Abhishek Sharma, Ganesh Ramakrishnan, Arjun Jain (3DV 2019)
- Human Synthesis and Scene Compositing - Mihai Zanfir, Elisabeta Oneata, Alin-Ionut Popa, Andrei Zanfir, Cristian Sminchisescu (Arxiv 2019)
- MocapNET: Ensemble of SNN Encoders for 3D Human Pose Estimation in RGB Images - [CODE] - Qammaz, Ammar and Argyros, Antonis A (BMVC 2019)
- Adversarial Attack on Skeleton-based HumanAction Recognition -Jian Liu, Naveed Akhtar, and Ajmal Mian, (Arxiv 2019)
- Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop - [CODE] - Nikos Kolotouros, Georgios Pavlakos, Michael J. Black, Kostas Daniilidis (ICCV 2019)
- DenseRaC: Joint 3D Pose and Shape Estimation by Dense Render-and-Compare - Yuanlu Xu, Song-Chun Zhu, Tony Tung (ICCV 2019)
- Chirality Nets for Human Pose Regression - Raymond A. Yeh, Yuan-Ting Hu, Alexander G. Schwing (NIPS 2019)
- Learning Graph Convolutional Network for Skeleton-based Human Action Recognition by Neural Searching - Wei Peng, Xiaopeng Hong, Haoyu Chen, Guoying Zhao (Arxiv 2020)
- A Neural Network for Detailed Human Depth Estimation from a Single Image - Sicong Tang, Feitong Tan, Kelvin Cheng, Zhaoyang Li, Siyu Zhu, Ping Tan (ICCV 2019)
- HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation - Kun Zhou, Xiaoguang Han, Nianjuan Jiang, Kui Jia, Jiangbo Lu (ICCV 2019)
- Convex Optimisation for Inverse Kinematics - Tarun Yenamandra, Florian Bernard, Jiayi Wang, Franziska Mueller, Christian Theobalt (Arxiv 2019)
- AbsPoseLifter: Absolute 3D Human Pose Lifting Network from a Single Noisy 2D Human Pose - Ju Yong Chang, Gyeongsik Moon, Kyoung Mu Lee (Arxiv 2019)
- SMART: Skeletal Motion Action Recognition aTtack - He Wang, Feixiang He, Zexi Peng, Yongliang Yang, Tianjia Shao, Kun Zhou, David Hogg (Arxiv 2019)
- Markerless Outdoor Human Motion Capture Using Multiple Autonomous Micro Aerial Vehicles - [CODE] - Nitin Saini, Eric Price, Rahul Tallamraju, Raffi Enficiaud, Roman Ludwig, Igor Martinovic, Aamir Ahmad, Michael J. Black (ICCV 2019)
- Exploiting Spatial-temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks - Yujun Cai, etal (ICCV 2019)
- Occlusion-Aware Networks for 3D Human Pose Estimation in Video - Yu Cheng, Bo Yang, Bo Wang, Wending Yan, and Robby T. Tan (ICCV 2019)
- On Boosting Single-Frame 3D Human Pose Estimation via Monocular Videos - Zhi Li,Xuan Wang, Fei Wang, and Peilin Jiang(ICCV 2019)
- Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking - [CODE] - Saurabh Sharma, Pavan Teja Varigonda, Prashast Bindal, Abhishek Sharma, Arjun Jain (ICCV 2019)
- Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates - Diogo C Luvizon, Hedi Tabia, David Picard (Arxiv 2019)
- DeepFuse: An IMU-Aware Network for Real-Time 3D Human Pose Estimation from Multi-View Image - Fuyang Huang, Ailing Zeng, Minhao Liu, Qiuxia Lai, Qiang Xu (WACV 2020)
- Generating 3D People in Scenes without People - Yan Zhang, Mohamed Hassan, Heiko Neumann, Michael J. Black, Siyu Tang (Arxiv 2019)
- Video Motion Capture from the Part Confidence Maps of Multi-Camera Images by Spatiotemporal Filtering Using the Human Skeletal Model - Takuya Ohashi, Yosuke Ikegami, Kazuki Yamamoto, Wataru Takano, Yoshihiko Nakamura (IROS 2018)
- Domes to Drones: Self-Supervised Active Triangulation for 3D Human Pose Reconstruction
- Aleksis Pirinen1, Erik Gärtner and Cristian Sminchisescu (NIPS 2019)
- From Kinematics To Dynamics: Estimating Center of Pressure and Base of Support from Video Frames of Human Motion
- Jesse Scott, Christopher Funk, Bharadwaj Ravichandran, John H. Challis, Robert T. Collins, Yanxi Liu (arxiv 2020)
- ActiveMoCap: Optimized Drone Flight for Active Human Motion Capture - Sena Kiciroglu, Helge Rhodin, Sudipta Sinha, Mathieu Salzmann, Pascal Fua (Arxiv 2019)
- Synergetic Reconstruction from 2D Pose and 3D Motion for Wide-Space Multi-Person Video Motion Capture in the Wild - [CODE] - Takuya Ohashi, Yosuke Ikegami,Yoshihiko Nakamura (Arxiv 2020)
- Deep Reinforcement Learning for Active Human Pose Estimation - Erik Gärtner, Aleksis Pirinen, Cristian Sminchisescu (AAAI 2020)
- Deep NRSfM++: Towards 3D Reconstruction in the Wild - Chaoyang Wang, Chen-Hsuan Lin, Simon Lucey (Arxiv 2020)
- Anatomy-aware 3D Human Pose Estimation in Videos - Tianlang Chen, Chen Fang, Xiaohui Shen, Yiheng Zhu, Zhili Chen, Jiebo Luo (Arxiv 2020)
- PoseNet3D: Unsupervised 3D Human Shape and Pose Estimation - Shashank Tripathi, Siddhant Ranade, Ambrish Tyagi, Amit Agrawal (Arxiv 2020)
- EllipBody: A Light-weight and Part-based Representation for Human Pose and Shape Recovery - Min Wang, Feng Qiu, Wentao Liu, Chen Qian, Xiaowei Zhou, Lizhuang Ma (CVPR 2020)
- Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild - Umar Iqbal, Pavlo Molchanov, Jan Kautz (CVPR 2020)
- Fusing Wearable IMUs with Multi-View Images for Human Pose Estimation: A Geometric Approach - Zhe Zhang, Chunyu Wang, Wenhu Qin, Wenjun Zeng (CVPR 2020)
- MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation - Rongchang Xie, Chunyu Wang, Yizhou Wang (CVPR 2020)
- Optical Non-Line-of-Sight Physics-based 3D Human Pose Estimation - Mariko Isogawa, Ye Yuan, Matthew O'Toole, Kris Kitani (CVPR 2020)
- Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation - Matteo Fabbri, Fabio Lanzi, Simone Calderara, Stefano Alletto, Rita Cucchiara (CVPR 2020)
- Bodies at Rest: 3D Human Pose and Shape Estimation from a Pressure Image using Synthetic Data - [CODE] - Henry M. Clever, Zackory Erickson, Ariel Kapusta, Greg Turk, C. Karen Liu, Charles C. Kemp (CVPR 2020)
- Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled Representation - Edoardo Remelli, Shangchen Han, Sina Honari, Pascal Fua, Robert Wang (CVPR 2020)
- Multi-Person Absolute 3D Human Pose Estimation with Weak Depth Supervision - [Code] - Marton Veges, Andras Lorincz (Arxiv 2020)
- Exemplar Fine-Tuning for 3D Human Pose Fitting Towards In-the-Wild 3D Human Pose Estimation - Hanbyul Joo, Natalia Neverova, Andrea Vedaldi (Arxiv 2020)
- Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image Synthesis - Jogendra Nath Kundu, Siddharth Seth, Varun Jampani, Mugalodi Rakesh, R. Venkatesh Babu, Anirban Chakraborty (CVPR 2020)
- Multimodal and multiview distillation for real-time player detection on a football field - Anthony Cioppa, Adrien Deliège, Noor Ul Huda, Rikke Gade, Marc Van Droogenbroeck, Thomas B. Moeslund (CVPRW 2020)
- End-to-End Estimation of Multi-Person 3D Poses from Multiple Cameras - [code] - Hanyue Tu, Chunyu Wang, Wenjun Zeng (Arxiv 2020)
- Motion Guided 3D Pose Estimation from Videos - Jingbo Wang, Sijie Yan, Yuanjun Xiong, Dahua Lin (Arxiv 2020)
- View Invariant Human Body Detection and Pose Estimation from Multiple Depth Sensors - Walid Bekhtaoui, Ruhan Sa, Brian Teixeira, Vivek Singh, Klaus Kirchberg, Yao-jen Chang, Ankur Kapoor (Arxiv 2020)
- MEBOW: Monocular Estimation of Body Orientation In the Wild - Chenyan Wu, et,al (CVPR 2020)
- Epipolar Transformers - [code] - Yihui He, Rui Yan, Katerina Fragkiadaki, Shoou-I Yu (CVPR 2020)
- Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS - Long Chen, Haizhou Ai, Rui Chen, Zijie Zhuang, Shuang Liu (CVPR 2020)
- Multiview-Consistent Semi-Supervised Learning for 3D Human Pose Estimation - Rahul Mitra, Nitesh B. Gundavarapu, Abhishek Sharma, Arjun Jain (CVPR 2020)
- Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction - [code] - Ruixu Liu, Ju Shen, He Wang, Chen Chen, Sen-ching Cheung, Vijayan Asari (CVPR 2020)
- Cascaded Deep Monocular 3D Human Pose Estimation With Evolutionary Training Data - Shichao Li, Lei Ke, Kevin Pratama, Yu-Wing Tai, Chi-Keung Tang, Kwang-Ting Cheng (CVPR 2020)
- Deep Kinematics Analysis for Monocular 3D Human Pose Estimation - Jingwei Xu et al. (CVPR 2020)
- Three-dimensional Reconstruction of Human Interactions - [code] - Mihai Fieraru et al. (CVPR 2020)
- Coherent Reconstruction of Multiple Humans from a Single Image - [code] - Wen Jiang, Nikos Kolotouros, Georgios Pavlakos , Xiaowei Zhou, Kostas Daniilidis (CVPR 2020)
- Optical Non-Line-of-Sight Physics-based 3D Human Pose Estimation - [code] - Mariko Isogawa, Ye Yuan, Matthew O'Toole, Kris Kitani (CVPR 2020)
- Kinematic-Structure-Preserved Representation for Unsupervised 3D Human Pose Estimation - Jogendra Nath Kundu, Siddharth Seth, Rahul M V, Mugalodi Rakesh, R. Venkatesh Babu, Anirban Chakraborty (AAAI 2020)
- Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation - Jianfeng Zhang, Xuecheng Nie, Jiashi Feng (Arxiv 2020)
- Geometric Pose Affordance: 3D Human Pose with Scene Constraints - [Page] - Zhe Wang, Liyan Chen, Shaurya Rathore, Daeyun Shin, Charless Fowlkes (Arxiv 2019)
- Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation - [Page] - Zhe Wang, Daeyun Shin, Charless C. Fowlkes (Arxiv 2020)
Geometry
- Neural 3D Mesh Renderer - JKato, Hiroharu and Ushiku, Yoshitaka and Harada, Tatsuya (CVPR 2018)
- Learning Two-View Correspondences and Geometry Using Order-Aware Network - Jiahui Zhang, Dawei Sun, Zixin Luo, Anbang Yao, Lei Zhou, Tianwei Shen, Yurong Chen, Long Quan, Hongen Liao (ICCV 2019)
- UprightNet: Geometry-Aware Camera Orientation Estimation from Single Images - Wenqi Xian, Zhengqi Li, Matthew Fisher, Jonathan Eisenmann, Eli Shechtman, Noah Snavely (Arxiv 2019)
- Gravity as a Reference for Estimating a Person's Height from Video - Didier Bieler, Semih Günel, Pascal Fua, Helge Rhodin (ICCV 2019)
- End-to-End Multi-View Fusion for 3D Object Detection in LiDAR Point Clouds - DYin Zhou, Pei Sun, Yu Zhang, Dragomir Anguelov, Jiyang Gao, Tom Ouyang, James Guo, Jiquan Ngiam, Vijay Vasudevan (CoRL 2019)
- EGenerating Human Action Videos by Coupling 3D Game Engines and Probabilistic Graphical Models - César Roberto de Souza, Adrien Gaidon, Yohann Cabon, Naila Murray, Antonio Manuel López (IJCV 2019)
- Unsupervised High-Resolution Depth Learning From Videos With Dual Networks - Junsheng Zhou, Yuwang Wang, Kaihuai Qin, Wenjun Zeng (ICCV 2019)
- Moving Indoor: Unsupervised Video Depth Learning in Challenging Environments - Junsheng Zhou, Yuwang Wang, Kaihuai Qin, Wenjun Zeng (ICCV 2019)
- 6-PACK: Category-level 6D Pose Tracker with Anchor-Based Keypoints -[CODE] - Chen Wang, Roberto Martín-Martín, Danfei Xu, Jun Lv, Cewu Lu, Li Fei-Fei, Silvio Savarese, Yuke Zhu (Arxiv 2019)
- GraphX-Convolution for Point Cloud Deformation in 2D-to-3D Conversion -[CODE] - Anh-Duc Nguyen, Seonghwa Choi, Woojae Kim, Sanghoon Lee (ICCV 2019)
- GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs -[CODE] - Yuan Liu, Zehong Shen, Zhixuan Lin, Sida Peng, Hujun Bao, Xiaowei Zhou (NIPS 2019)
- Conservative Wasserstein Training for Pose Estimation - Xiaofeng Liu, Yang Zou, Tong Che, Peng Ding, Ping Jia, Jane You, Kumar B.V.K (ICCV 2019)
- Tell Me What They're Holding: Weakly-supervised Object Detection with Transferable Knowledge from Human-object Interaction - Daesik Kim, Gyujeong Lee, Jisoo Jeong, Nojun Kwak (AAAI 2020)
- Single-Stage 6D Object Pose Estimation - Yinlin Hu, Pascal Fua, Wei Wang, Mathieu Salzmann (Arxiv 2019)
- GP2C: Geometric Projection Parameter Consensus for Joint 3D Pose and Focal Length Estimation in the Wild - Alexander Grabner, Peter M. Roth, Vincent Lepetit (ICCV 2019)
- SANet: Scene Agnostic Network for Camera Localization - Luwei Yang etal (ICCV 2019)
- ViewSynth: Learning Local Features from Depth using View Synthesis - Jisan Mahmud, Peri Akiva, Rajat Vikram Singh, Spondon Kundu, Kuan-Chuan Peng, Jan-Michael Frahm (Arxiv 2019)
- KeyPose: Multi-view 3D Labeling and Keypoint Estimation for Transparent Objects - Xingyu Liu, Rico Jonschkowski, Anelia Angelova, Kurt Konolige (Arxiv 2019)
- 3D Objectness Estimation via Bottom-up Regret Grouping - Zelin Ye, Yan Hao, Liang Xu, Rui Zhu, Cewu Lu (Arxiv 2019)
- Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation -[CODE] - Hongwei Yi, Zizhuang Wei, Mingyu Ding, Runze Zhang, Yisong Chen, Guoping Wang, Yu-Wing Tai (Arxiv 2019)
- Neural Network Generalization: The impact of camera parameters - HZhenyi Liu, Trisha Lian, Joyce Farrell, Brian Wandell(Arxiv 2019)
- Geometric Capsule Autoencoders for 3D Point Clouds - Nitish Srivastava, Hanlin Goh, Ruslan Salakhutdinov(Arxiv 2019)
- What You See is What You Get: Exploiting Visibility for 3D Object Detection - Peiyun Hu, Jason Ziglar, David Held, Deva Ramanan(Arxiv 2019)
- Car Pose in Context: Accurate Pose Estimation with Ground Plane Constraints - Pengfei Li, Weichao Qiu, Michael Peven, Gregory D. Hager, Alan L. Yuille (Arxiv 2019)
- Quaternion Knowledge Graph Embeddings -[CODE] - Shuai Zhang, Yi Tay, Lina Yao, Qi Liu (NIPS 2019)
- Quaternion Product Units for Deep Learning on 3D Rotation Groups -Xuan Zhang, Shaofei Qin, Yi Xu, Hongteng Xu (arxiv 2019)
- Inferring Distributions Over Depth from a Single Image -Gengshan Yang, Peiyun Hu, Deva Ramanan (IROS 2019)
- A Bayesian 3D Multi-view Multi-object Tracking Filter -Jonah Ong, Ba Tuong Vo, Ba Ngu Vo, Du Yong Kim, Sven Nordholm (TPAMI 2020)
- Learning to Move with Affordance Maps -William Qi, Ravi Teja Mullapudi, Saurabh Gupta, Deva Ramanan (ICLR 2020)
- Depth Estimation by Learning Triangulation and Densification of Sparse Points for Multi-view Stereo -Ayan Sinha, Zak Murez, James Bartolozzi, Vijay Badrinarayanan, Andrew Rabinovich (arxiv 2019)
- Differentiable Volumetric Rendering: Learning Implicit 3D Representations without 3D Supervision -[CODE] - Niemeyer, Michael and Mescheder, Lars and Oechsle, Michael and Geiger, Andreas (CVPR 2020)
- SeqXY2SeqZ: Structure Learning for 3D Shapes by Sequentially Predicting 1D Occupancy Segments From 2D Coordinates -Zhizhong Han, Guanhui Qiao, Yu-Shen Liu, Matthias Zwicker (arxiv 2020)
- Real-Time Camera Pose Estimation for Sports Fields -Leonardo Citraro, Pablo Márquez-Neila, Stefano Savarè, Vivek Jayaram, Charles Dubout, Félix Renaut, Andrés Hasfura, Horesh Ben Shitrit, Pascal Fua (arxiv 2020)
- DO OPTIMIZATION METHODS IN DEEP LEARNING APPLICATIONS MATTER -[CODE] -Buse Melis Ozyildirim, Mariam Kiran (arxiv 2020)
- Occlusion-Aware Depth Estimation with Adaptive Normal Constraints -Xiaoxiao Long, Lingjie Liu, Christian Theobalt, Wenping Wang (arxiv 2020)
- DualConvMesh-Net: Joint Geodesic and Euclidean Convolutions on 3D Meshes -Jonas Schult, Francis Engelmann, Theodora Kontogianni, Bastian Leibe (CVPR 2020)
- Robust Single Rotation Averaging -Seong Hun Lee, Javier Civera (Arxiv 2020)
- Self-Supervised Scene De-occlusion -[CODE] -Xiaohang Zhan, Xingang Pan, Bo Dai, Ziwei Liu, Dahua Lin, Chen Change Loy (CVPR 2020)
- RANSAC-Flow: generic two-stage image alignment -[CODE] -Xi Shen, François Darmon, Alexei A. Efros, Mathieu Aubry (Arxiv 2020)
- Deep Homography Estimation for Dynamic Scenes -[CODE] -Hoang Le, Feng Liu, Shu Zhang, Aseem Agarwala (CVPR 2020)
- Self-Supervised Viewpoint Learning From Image Collections -[CODE] -Siva Karthik Mustikovela, Varun Jampani, Shalini De Mello, Sifei Liu, Umar Iqbal, Carsten Rother, Jan Kautz (CVPR 2020)
- Where Does It End? -- Reasoning About Hidden Surfaces by Object Intersection Constraints -Michael Strecke, Joerg Stueckler (CVPR 2020)
- Leveraging 2D Data to Learn Textured 3D Mesh Generation -Paul Henderson, Vagia Tsiminaki, Christoph H. Lampert (CVPR 2020)
- Image Co-skeletonization via Co-segmentation -Koteswar Rao Jerripothula, Jianfei Cai, Jiangbo Lu, Junsong Yuan (Arxiv 2020)
- On the uncertainty of self-supervised monocular depth estimation -[CODE] -Matteo Poggi, Filippo Aleotti, Fabio Tosi, Stefano Mattoccia (CVPR 2020)
- Focus on defocus: bridging the synthetic to real domain gap for depth estimation -Maxim Maximov, Kevin Galim, Laura Leal-Taixé (CVPR 2020)
- Uncertainty-Aware CNNs for Depth Completion: Uncertainty from Beginning to End -Abdelrahman Eldesokey, Michael Felsberg, Karl Holmquist, Mikael Persson (CVPR 2020)
- Accurate Estimation of Body Height From a Single Depth Image via a Four-Stage Developing Network -[CODE] -Fukun Yin, Shizhe Zhou (CVPR 2020)
- Quaternion Capsule Networks (Arxiv 2020)
Group of people
- SOCIAL LSTM: HUMAN TRAJECTORY PREDICTION IN CROWDED SPACES. - Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, Silvio Savarese. (CVPR 2016)
- Multi-Agent Tensor Fusion for Contextual Trajectory Prediction - Tianyang Zhao, Yifei Xu, Mathew Monfort, Wongun Choi, Chris Baker, Yibiao Zhao, Yizhou Wang, Ying Nian Wu (CVPR 2019)
- Towards Social Artificial Intelligence: Nonverbal Social Signal Prediction in A Triadic Interaction - Hanbyul Joo, Tomas Simon, Mina Cikara, Yaser Sheikh (CVPR 2019)
- Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks - Vineet Kosaraju, Amir Sadeghian, Roberto Martín-Martín, Ian Reid, S. Hamid Rezatofighi, Silvio Savarese (Arxiv 2019)
- To React or not to React: End-to-End Visual Pose Forecasting for Personalized Avatar during Dyadic Conversations - Chaitanya Ahuja, Shugao Ma, Louis-Philippe Morency, Yaser Sheikh (Arxiv 2019)
Person generation
- Activity Forecasting. - [CODE] - Kris M. Kitani, Brian Ziebart, James D. Bagnell and Martial Hebert. (ECCV 2012)
- Action-Reaction: Forecasting the Dynamics of Human Interaction. - De-An Huang and Kris M. Kitani. (ECCV 2014)
- A deep learning framework for character motion synthesis and editing - [CODE] (TOG 2016)
- Binge Watching: Scaling Affordance Learning from Sitcoms. - [CODE] - Xiaolong Wang*, Rohit Girdhar*, and Abhinav Gupta. (CVPR 2017)
- Pose Guided Person Image Generation - [CODE] - Ma, L., Jia, X., Sun, Q., Schiele, B., Tuytelaars, T., & Gool, L.V. (NIPS 2017)
- A Generative Model of People in Clothing - Lassner, C., Pons-Moll, G., & Gehler, P.V. (ICCV 2017)
- First-Person Activity Forecasting with Online Inverse Reinforcement Learning - Nicholas Rhinehart and Kris M. Kitani. (ICCV 2017)
- Synthesizing Images of Humans in Unseen Poses - [CODE] - Guha Balakrishnan, Amy Zhao, Adrian V. Dalca, Fredo Durand, John Guttag. (CVPR 2018)
- A Variational U-Net for Conditional Appearance and Shape Generation - [CODE] - Patrick Esser, Ekaterina Sutter, Björn Ommer. (CVPR 2018)
- Deformable GANs for Pose-based Human Image Generation - [CODE] - Siarohin, A., Sangineto, E., Lathuilière, S., & Sebe, N. (CVPR 2018)
- Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks - [CODE] - Agrim Gupta, Justin Johnson, Fei-Fei Li, Silvio Savarese, Alexandre Alahi. (CVPR 2018)
- QuaterNet: A Quaternion-based Recurrent Model for Human Motion - [CODE] - Dario Pavllo, David Grangier, and Michael Auli. (BMVC 2018)
- Dense Pose Transfer - Neverova, N., Guler, R.A., & Kokkinos, I. (ECCV 2018)
- MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics - Xinchen Yan, Akash Rastogi, Ruben Villegas, Kalyan Sunkavalli, Eli Shechtman, Sunil Hadap, Ersin Yumer, Honglak Lee (ECCV 2018)
- Few-Shot Human Motion Prediction via Meta-Learning - Liang-Yan Gui, Yu-Xiong Wang, Deva Ramanan, and Jos ́e M. F. Moura (ECCV 2018)
- Unsupervised Learning of Object Landmarks through Conditional Image Generation - Tomas Jakab, Ankush Gupta, Hakan Bilen, Andrea Vedaldi (NIPS 2018)
- FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification - [CODE] - Yixiao Ge.etal. (NIPS 2018)
- Soft-Gated Warping-GAN for Pose-Guided Person Image Synthesis - Haoye Dong.etal. (NIPS 2018)
- AUTO-CONDITIONED LSTM NETWORK FOR EXTENDED COMPLEX HUMAN MOTION SYNTHESIS - Yi Zhou*, Zimo Li*, Shuangjio Xiao, Chong He, Zeng Huang, Hao Li . (ICLR 2018)
- Everybody Dance Now - Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros (Arxiv 2018)
- SiCloPe: Silhouette-Based Clothed People (Arxiv 2019)
- Unpaired Pose Guided Human Image Generation - Xu Chen, Jie Song, Otmar Hilliges (Arxiv 2019)
- Peeking into the Future: Predicting Future Person Activities and Locations in Videos - Junwei Liang, Lu Jiang, Juan Carlos Niebles, Alexander Hauptmann, Li Fei-Fei (Arxiv 2019)
- Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments - [CODE] - Xueting Li, SIfei Liu, Kihwan Kim, Xiaolong Wang, Ming-Hsuan Yang, Jan Kautz (CVPR 2019)
- Dense Intrinsic Appearance Flow for Human Pose Transfer - [CODE] - Yining Li, Chen Huang, Chen Change Loy (CVPR 2019)
- Vid2Game: Controllable Characters Extracted from Real-World Videos - Oran Gafni, Lior Wolf, Yaniv Taigman (Arxiv 2019)
- Textured Neural Avatars - [CODE] - Aliaksandra Shysheya, et al (Arxiv 2019)
- Explicit Disentanglement of Appearance and Perspective in Generative Models -Nicki S. Detlefsen,Søren Hauberg (Arxiv 2019)
- Learning Variations in Human Motion via Mix-and-Match Perturbation -Mohammad Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann, Lars Petersson, Stephen Gould, Amirhossein Habibian (Arxiv 2019)
- First Order Motion Model for Image Animation - [CODE] -Stéphane Lathuilière, Sergey Tulyakov, Elisa Ricci and Nicu Sebe (NIPS 2019)
- Adversarial Synthesis of Human Pose from Text -Yifei Zhang, Rania Briq, Julian Tanke, Juergen Gall (Arxiv 2020)
3D Human Mesh
- Video Based Reconstruction of 3D People Models - Thiemo Alldieck, Marcus Magnor, Weipeng Xu, Christian Theobalt, Gerard Pons-Moll, (CVPR 2018)
- End-to-end Recovery of Human Shape and Pose - [CODE] - Kanazawa, A., Black, M.J., Jacobs, D.W., & Malik, J. (CVPR 2018)
- Relighting Humans: Occlusion-Aware Inverse Rendering for Full-Body Human Images - [CODE] - Yoshihiro Kanamori, Yuki Endo. (Siggraph 2018)
- BodyNet: Volumetric Inference of 3D Human Body Shapes - [CODE] - Varol, G., Ceylan, D., Russell, B., Yang, J., Yumer, E., Laptev, I., & Schmid, C. (ECCV 2018)
- Learning to Reconstruct People in Clothing from a Single RGB Camera - Thiemo Alldieck, Marcus Magnor, Bharat Lal Bhatnagar, Christian Theobalt, Gerard Pons-Moll (Arxiv 2019)
- DeepHuman: 3D Human Reconstruction from a Single Image - Zerong Zheng, Tao Yu, Yixuan Wei, Qionghai Dai, Yebin Liu (Arxiv 2019)
- Learning 3D Human Dynamics from Video - [CODE] - Angjoo Kanazawa, Jason Y. Zhang, Panna Felsen, Jitendra Malik (CVPR 2019)
- Detailed Human Shape Estimation from a Single Image by Hierarchical Mesh Deformation - [CODE] - Hao Zhu, Xinxin Zuo, Sen Wang, Xun Cao, Ruigang Yang (CVPR 2019)
- LBS Autoencoder: Self-supervised Fitting of Articulated Meshes to Point Clouds - Chun-Liang Li, Tomas Simon, Jason Saragih, Barnabás Póczos, Yaser Sheikh (CVPR 2019)
- Convolutional Mesh Regression for Single-Image Human Shape Reconstruction - [CODE] - Nikos Kolotouros, Georgios Pavlakos, Kostas Daniilidis (CVPR 2019)
- Expressive Body Capture: 3D Hands, Face, and Body from a Single Image - [CODE] - [CODE] - [CODE] - Georgios Pavlakos, Vasileios Choutas, Nima Ghorbani, Timo Bolkart, Ahmed A. A. Osman, Dimitrios Tzionas, Michael J. Black (CVPR 2019)
- Volumetric Capture of Humans with a Single RGBD Camera viaSemi-Parametric Learning - Rohit Pandey et al. (CVPR 2019)
- DenseBody: Directly Regressing Dense 3D Human Pose and Shape From a Single Color Image - [CODE] - Pengfei Yao, Zheng Fang, Fan Wu, Yao Feng, Jiwei Li (Arxiv 2019)
- Towards 3D Human Shape Recovery Under Clothing - Xin Chen, Anqi Pang, Yu Zhu, Yuwei Li, Xi Luo, Ge Zhang, Peihao Wang, Yingliang Zhang, Shiying Li, Jingyi Yu (Arxiv 2019)
- 3DPeople: Modeling the Geometry of Dressed Humans - [Dataset] - Albert Pumarola, Jordi Sanchez, Gary P. T. Choi, Alberto Sanfeliu, Francesc Moreno-Noguer (ICCV 2019)
- Long-Term Video Generation of Multiple FuturesUsing Human Poses - Naoya Fushishita, Antonio Tejero-de-Pablos, Yusuke Mukuta, Tatsuya Harada (Arxiv 2019)
- PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization - Shunsuke Saito, Zeng Huang, Ryota Natsume, Shigeo Morishima, Angjoo Kanazawa, Hao Li (Arxiv 2019)
- Learning 3D Human Body Embedding - Boyi Jiang, Juyong Zhang, Jianfei Cai, Jianmin Zheng (Arxiv 2019)
- Shape Evasion: Preventing Body Shape Inference of Multi-Stage Approaches - Hosnieh Sattar, Katharina Krombholz, Gerard Pons-Moll, Mario Fritz (Arxiv 2019)
- Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video - Jian Liu, Naveed Akhtar, Ajmal Mian (Arxiv 2019)
- Moulding Humans: Non-parametric 3D Human Shape Estimation from Single Images - [Studio] - Valentin Gabeur, Jean-Sebastien Franco, Xavier Martin, Cordelia Schmid, Gregory Rogez (ICCV 2019)
- Dressing 3D Humans using a Conditional Mesh-VAE-GAN - Qianli Ma, Siyu Tang, Sergi Pujades, Gerard Pons-Moll, Anurag Ranjan, Michael J. Black (Arxiv 2019)
- AMASS: Archive of Motion Capture as Surface Shapes - [CODE] - [CODE] - Mahmood, Naureen and Ghorbani, Nima and F. Troje, Nikolaus and Pons-Moll, Gerard and Black, Michael J. (ICCV 2019)
- Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation - Sun Yu, Ye Yun, Liu Wu, Gao Wenpeng, Fu YiLi, Mei Tao (ICCV 2019)
- Multi-Garment Net: Learning to Dress 3D People from Images - Bharat Lal Bhatnagar, Garvita Tiwari, Christian Theobalt, Gerard Pons-Moll (ICCV 2019)
- Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild - [CODE] - Yu Rong, Ziwei Liu, Cheng Li, Kaidi Cao, Chen Change Loy (ICCV 2019)
- Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images - Junbang Liang, Ming C. Lin (ICCV 2019)
- Estimation of Body Mass Index from Photographs using Deep Convolutional Neural Networks - Adam Pantanowitz, Emmanuel Cohen, Philippe Gradidge, Nigel Crowther, Vered Aharonson, Benjamin Rosman, David M Rubin (arxiv 2019)
- Video Interpolation and Prediction with Unsupervised Landmarks - Kevin J. Shih, Aysegul Dundar, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro (arxiv 2019)
- Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop - [CODE] - Nikos Kolotouros, Georgios Pavlakos, Michael J. Black, Kostas Daniilidis (ICCV 2019)
- DenseRaC: Joint 3D Pose and Shape Estimation by Dense Render-and-Compare - Yuanlu Xu, Song-Chun Zhu, Tony Tung (ICCV 2019)
- Efficient Learning on Point Clouds with Basis Point Sets - [CODE] - Prokudin, Sergey and Lassner, Christoph and Romero, Javier (ICCV 2019)
- TexturePose: Supervising Human Mesh Estimation with Texture Consistency - [CODE] - YGeorgios Pavlakos, Nikos Kolotouros, Kostas Daniilidis (ICCV 2019)
- Towards Robust RGB-D Human Mesh Recovery - Ren Li, Changjiang Cai, Georgios Georgakis, Srikrishna Karanam, Terrence Chen, Ziyan Wu (Arxiv 2019)
- CLOTH3D: Clothed 3D Humans - Hugo Bertiche, Meysam Madadi, Sergio Escalera(Arxiv 2019)
- Learning 3D Human Shape and Pose from Dense Body Parts - Hongwen Zhang Jie Cao Guo Lu Wanli Ouyang Zhenan Sun (Arxiv 2019)
- Learning from Synthetic Animals - Jiteng Mu, Weichao Qiu, Gregory Hager, Alan Yuille (Arxiv 2019)
- Dressing for Diverse Body Shapes - Wei-Lin Hsiao, Kristen Grauman (Arxiv 2019)
- Neural Human Video Rendering: Joint Learning of Dynamic Textures and Rendering-to-Video Translation - Lingjie Liu, Weipeng Xu, Marc Habermann, Michael Zollhoefer, Florian Bernard, Hyeongwoo Kim, Wenping Wang, Christian Theobalt(Arxiv 2020)
- Chained Representation Cycling: Learning to Estimate 3D Human Pose and Shape by Cycling Between Representations - Nadine Rueegg, Christoph Lassner, Michael J. Black, Konrad Schindler (AAAI 2020)
- The Whole Is Greater Than the Sum of Its Nonrigid Parts - Oshri Halimi, Ido Imanuel, Or Litany, Giovanni Trappolini, Emanuele Rodolà, Leonidas Guibas, Ron Kimmel (Arxiv 2020)
- Particle Filter Based Monocular Human Tracking with a 3D Cardbox Model and a Novel Deterministic Resampling Strategy - Ziyuan Liu, Dongheui Lee, Wolfgang Sepp (Arxiv 2020)
- PeelNet: Textured 3D reconstruction of human body using single view RGB image - Sai Sagar Jinka, Rohan Chacko, Avinash Sharma, P. J. Narayanan (Arxiv 2020)
- VIBE: Video Inference for Human Body Pose and Shape Estimation - [CODE] - Muhammed Kocabas, Nikos Athanasiou, Michael J. Black (CVPR 2020)
- Learning Nonparametric Human Mesh Reconstruction from a Single Image without Ground Truth Meshes - Kevin Lin, Lijuan Wang, Ying Jin, Zicheng Liu, Ming-Ting Sun (Arxiv 2020)
- Hierarchical Kinematic Human Mesh Recovery - Georgios Georgakis, Ren Li, Srikrishna Karanam, Terrence Chen, Jana Kosecka, Ziyan Wu (Arxiv 2020)
- Learning to Transfer Texture from Clothing Images to 3D Humans - Aymen Mir, Thiemo Alldieck, Gerard Pons-Moll (CVPR 2020)
- The Virtual Tailor: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style - [CODE] - Chaitanya Patel, Zhouyingcheng Liao, Gerard Pons-Moll (CVPR 2020)
- PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization - [CODE] - Shunsuke Saito, Tomas Simon, Jason Saragih, Hanbyul Joo (CVPR 2020)
- SoftSMPL: Data-driven Modeling of Nonlinear Soft-tissue Dynamics for Parametric Humans - [Page] - Igor Santesteban, Elena Garces, Miguel A. Otaduy, Dan Casas (Eurographics 2020)
- Learning 3D Human Shape and Pose from Dense Body Parts - [CODE] - Zhang, Hongwen and Cao, Jie and Lu, Guo and Ouyang, Wanli and Sun, Zhenan (Arxiv 2020)
- Robust 3D Self-portraits in Seconds -Zhe Li, Tao Yu, Chuanyu Pan, Zerong Zheng, Yebin Liu (CVPR 2020)
- ARCH: Animatable Reconstruction of Clothed Humans -Zeng Huang, Yuanlu Xu, Christoph Lassner, Hao Li, Tony Tung (CVPR 2020)
- MulayCap: Multi-layer Human Performance Capture Using A Monocular Video Camera -Zhaoqi Su, Weilin Wan, Tao Yu, Lingjie Liu, Lu Fang, Wenping Wang, Yebin Liu (Arxiv 2020)
- TetraTSDF: 3D human reconstruction from a single image with a tetrahedral outer shell -Hayato Onizuka, Zehra Hayirci, Diego Thomas, Akihiro Sugimoto, Hideaki Uchiyama, Rin-ichiro Taniguchi (Arxiv 2020)
- Self-Supervised Human Depth Estimation from Monocular Videos -Feitong Tan, Hao Zhu, Zhaopeng Cui, Siyu Zhu, Marc Pollefeys, Ping Tan (CVPR 2020)
- Learning to Dress 3D People in Generative Clothing -Qianli Ma, Jinlong Yang, Anurag Ranjan, Sergi Pujades, Gerard Pons-Moll, Siyu Tang, Michael J. Black (Arxiv 2020)
- IMPLICIT FUNCTIONS IN FEATURE SPACE FOR 3D SHAPE RECONSTRUCTION AND COMPLETION - [CODE] -Julian Chibane, Thiemo Alldieck, Gerard Pons-Moll (CVPR 2020)
- TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style - [CODE] -Chaitanya Patel, Zhouyingcheng Liao, Gerard Pons-Moll (CVPR 2020)
- 3D Human Mesh Regression With Dense Correspondence -Wang Zeng, Wanli Ouyang, Ping Luo, Wentao Liu, Xiaogang Wang (CVPR 2020)
- Sequential 3D Human Pose and Shape Estimation From Point Clouds -Kangkan Wang, Jin Xie, Guofeng Zhang, Lei Liu, Jian Yang (CVPR 2020)
- GHUM & GHUML: Generative 3D Human Shape and Articulated Pose Models -Hongyi Xu, Eduard Gabriel Bazavan, Andrei Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu;(CVPR 2020)
- Object-Occluded Human Shape and Pose Estimation From a Single Color Image -Tianshu Zhang, Buzhen Huang, Yangang Wang (CVPR 2020)
Pose And Physics-Robotics
- Learning Locomotion Skills Using DeepRL: Does the Choice of Action Space Matter? - [CODE] - Xue Bin Peng, Michiel van de Panne (Eurographics 2017)
- DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills - [CODE] - Xue Bin Peng, Pieter Abbeel, Sergey Levine, Michiel van de Panne (SIGGRAPH 2018)
- SFV: Reinforcement Learning of Physical Skills from Videos - [CODE] - Xue Bin Peng, Angjoo Kanazawa, Jitendra Malik, Pieter Abbeel, Sergey Levine (SIGGRAPH Asia 2018)
- Learning to Sit: Synthesizing Human-Chair Interactions via Hierarchical Control - [Video] - Yu-Wei Chao, Jimei Yang, Weifeng Chen, Jia Deng (Arxiv 2018)
- AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos - [CODE] - Laura Smith, Nikita Dhawan, Marvin Zhang, Pieter Abbeel, Sergey Levine (Arxiv 2019)
- pymanoid - [CODE]
Pose and Language-Speech-Reasoning-Semantics
- Your body language may shape who you are (TED 2012)
- Generating Animated Videos of Human Activities from Natural Language Descriptions - Angela S. Lin, Lemeng Wu, Rodolfo Corona, Kevin Tai, Qixing Huang, Raymond J. Mooney (NIPS 2018)
- Neural Sign Language Translation - [CODE] - Necati Cihan Camgoz and Simon Hadfield and Oscar Koller and Hermann Ney and Richard Bowden (CVPR 2018)
- Learning Individual Styles of Conversational Gesture - [CODE] - Shiry Ginosar, Amir Bar, Gefen Kohavi, Caroline Chan, Andrew Owens, Jitendra Malik (CVPR 2019)
- HAKE: Human Activity Knowledge Engine - [CODE] - Yong-Lu Li, Liang Xu, Xijie Huang, Xinpeng Liu, Ze Ma, Mingyang Chen, Shiyi Wang, Hao-Shu Fang, Cewu Lu (Arxiv 2019)
- Shape Evasion: Preventing Body Shape Inference of Multi-Stage Approaches - Hosnieh Sattar, Katharina Krombholz, Gerard Pons-Moll, Mario Fritz (Arxiv 2019)
- Language2Pose: Natural Language Grounded Pose Forecasting - Chaitanya Ahuja, Louis-Philippe Morency (Arxiv 2019)
- Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison - Dongxu Li, Cristian Rodriguez Opazo, Xin Yu, Hongdong Li (WACV 2020)
- Motion Reasoning for Goal-Based Imitation Learning - De-An Huang, Yu-Wei Chao, Chris Paxton, Xinke Deng, Li Fei-Fei, Juan Carlos Niebles, Animesh Garg, Dieter Fox (Arxiv 2019)
- Dancing to Music - [CODE] - Hsin-Ying Lee, Xiaodong Yang, Ming-Yu Liu, Ting-Chun Wang, Yu-Ding Lu, Ming-Hsuan Yang, Jan Kautz (NIPS 2019)
- Skeleton based Zero Shot Action Recognition in Joint Pose-Language Semantic Space - Bhavan Jasani, Afshaan Mazagonwalla (Arxiv 2019)
- Dressing for Diverse Body Shapes - Wei-Lin Hsiao, Kristen Grauman (Arxiv 2019)
- Music-oriented Dance Video Synthesis with Pose Perceptual Loss - Xuanchi Ren, Haoran Li, Zijian Huang, Qifeng Chen (Arxiv 2019)
- Music2Dance: Music-driven Dance Generation using WaveNet - Wenlin Zhuang, Congyi Wang, Siyu Xia, Jinxiang Chai, Yangang Wang (Arxiv 2020)
Pose-and-Action
- RSA: Randomized Simulation as Augmentation for Robust Human Action Recognition - Yi Zhang, Xinyue Wei, Weichao Qiu, Zihao Xiao, Gregory D. Hager, Alan Yuille. (Arxiv 2019)
- Simultaneous Implementation Features Extraction and Recognition Using C3DNetwork for WiFi-based Human Activity Recognition - Yafeng Liu et al. (Arxiv 2019)
- Action Recognition via Pose-Based Graph Convolutional Networks with Intermediate Dense Supervision - Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu (Arxiv 2019)
- Synthetic Humans for Action Recognition from Unseen Viewpoints - Gül Varol, Ivan Laptev, Cordelia Schmid, Andrew Zisserman (Arxiv 2019)
- Action Genome: Actions as Composition of Spatio-temporal Scene Graphs - Jingwei Ji, Ranjay Krishna, Li Fei-Fei, Juan Carlos Niebles (Arxiv 2019)
- Mimetics: Towards Understanding Human Actions Out of Context - Philippe Weinzaepfel, Grégory Rogez (Arxiv 2019)
- SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong - Steven Schwarcz, Peng Xu, David D'Ambrosio, Juhana Kangaspunta, Anelia Angelova, Huong Phan, Navdeep Jaitly (Arxiv 2019)
- Human Motion Anticipation with Symbolic Label - Julian Tanke, Andreas Weber, Juergen Gall (Arxiv 2019)
- Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition - Ziyu Liu, Hongwen Zhang, Zhenghao Chen, Zhiyong Wang, Wanli Ouyang (CVPR 2020)
- SSHFD: Single Shot Human Fall Detection with Occluded Joints Resilience - Umar Asif, Stefan Von Cavallar, Jianbin Tang, Stefan Harre (Arxiv 2020)
- Asynchronous Interaction Aggregation for Action Detection - Jiajun Tang, Jin Xia, Xinzhi Mu, Bo Pang, Cewu Lu (Arxiv 2020)
- 3DV: 3D Dynamic Voxel for Action Recognition in Depth Video - Yancheng Wang, Yang Xiao, Fu Xiong, Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan (CVPR 2020)
Video pose
- Nonrigid Structure from Motion in Trajectory Space - Ijaz Akhter, Yaser Sheikh, Sohaib Khan and Takeo Kanade. (NIPS 2008)
- Human Attributes from 3D Pose Tracking - Leonid Sigal, David J. Fleet, Nikolaus F. Troje, and Micha Livne. (ECCV 2010)
- Pose from Flow and Flow from Pose - Katerina Fragkiadaki, Han Hu and Jianbo Shi . (CVPR 2013)
- Recurrent Network Models for Human Dynamics - Katerina Fragkiadaki, Sergey Levine, Panna Felsen, Jitendra Malik . (ICCV 2015)
- Personalizing Human Video Pose Estimation - James Charles, Tomas Pfister, Derek Magee, David Hogg, Andrew Zisserman . (CVPR 2016)
- On human motion prediction using recurrent neural networks - Julieta Martinez, Michael J. Black, and Javier Romero. (CVPR 2017)
- Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos - Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges. (CVPR 2017)
- Deep Multitask Architecture for Integrated 2D and 3D Human Sensing - [CODE] - Alin-Ionut Popa and Mihai Zanfir and Cristian Sminchisescu. (CVPR 2017)
- Rpan: An end-to-end recurrent pose-attention network for action recognition in videos - Wenbin Du, Yali Wang, Yu Qiao. (ICCV 2017)
- Self-supervised Learning of Motion Capture - [CODE] - Hsiao-Yu Fish Tung, Hsiao-Wei Tung, Ersin Yumer, Katerina Fragkiadaki. (NIPS 2017)
- Detect-and-Track: Efficient Pose Estimation in Videos, - [CODE] - Rohit Girdhar, Georgia Gkioxari, Lorenzo Torresani, Manohar Paluri and Du Tran. (CVPR 2018)
- Neural Kinematic Networks for Unsupervised Motion Retargeting, - [CODE] - Ruben Villegas, Jimei Yang, Duygu Ceylan, Honglak Lee. (CVPR 2018)
- 2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning - [CODE] - Diogo C. Luvizon, David Picard, Hedi Tabia. (CVPR 2018)
- Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition, - [CODE] - Sijie Yan, Yuanjun Xiong, and Dahua Lin. (AAAI 2018)
- QuaterNet: A Quaternion-based Recurrent Model for Human Motion - [CODE] - Dario Pavllo, David Grangier, and Michael Auli. (BMVC 2018)
- Simple Baselines for Human Pose Estimation and Tracking - [CODE] - Bin Xiao, Haiping Wu, Yichen Wei. (ECCV 2018)
- Exploiting temporal information for 3D pose estimation - Mir Rayat Imtiaz Hossain, James J. Little (ECCV 2018)
- Learning 3D Human Pose from Structure and Motion - Dabral, R., Mundhada, A., Kusupati, U., Afaque, S., Sharma, A., & Jain, A. (ECCV 2018)
- Propagating LSTM: 3D Pose Estimation based on Joint Interdependency - Kyoungoh Lee, Inwoong Lee, and Sanghoon Lee. (ECCV 2018)
- Recovering Accurate 3D Human Pose in The Wild Using IMUs and a Moving Camera - Timo von Marcard, Roberto Henschel, Michael J. Black, Bodo Rosenhahn,and Gerard Pons-Moll (ECCV 2018)
- Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World - [CODE] - [CODE] - Matteo Fabbri, Fabio Lanzi, Simone Calderara, Andrea Palazzi, Roberto Vezzani, and Rita Cucchiara (ECCV 2018)
- SFV: Reinforcement Learning of Physical Skills from Videos - Xue Bin Peng, Angjoo Kanazawa, Jitendra Malik, Pieter Abbeel, Sergey Levine. (ACM SIGGRAPH Asia 2018)
- 3D human pose estimation in video with temporal convolutions and semi-supervised training - Dario Pavllo, Christoph Feichtenhofer, David Grangier, Michael Auli. (Arxiv 2018)
- Human Motion Prediction via Learning Local Structure Representations and Temporal Dependencies - [CODE] - Xiao Guo, Jongmoo Choi. (AAAI 2019)
- BiHMP-GAN: Bidirectional 3D Human Motion Prediction GAN - Jogendra Nath Kundu, Maharshi Gor, R. Venkatesh Babu. (AAAI 2019)
- Bio-LSTM: A Biomechanically Inspired Recurrent Neural Network for 3D Pedestrian Pose and Gait Prediction - Xiao Guo, Jongmoo Choi. (Arxiv 2019)
- Multi-person Articulated Tracking with Spatial and Temporal Embeddings - Sheng Jin, Wentao Liu, Wanli Ouyang, Chen Qian (CVPR 2019)
- Learning Character-Agnostic Motion for Motion Retargeting in 2D - [CODE] - Kfir Aberman, Rundi Wu, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or. (SIGGRAPH 2019)
- Exploiting temporal context for 3D human pose estimation in the wild - Anurag Arnab, Carl Doersch, Andrew Zisserman. (CVPR 2019)
- Learning Temporal Pose Estimation from Sparsely-Labeled Videos - Gedas Bertasius, Christoph Feichtenhofer, Du Tran, Jianbo Shi, Lorenzo Torresani. (NIPS 2019)
- Temporal Transformer Networks: Joint Learning of Invariant and Discriminative Time Warping - [CODE] - Suhas Lohit, Qiao Wang, Pavan Turaga. (CVPR 2019)
- Unsupervised Learning of Object Structure and Dynamics from Videos - Matthias Minderer, Chen Sun, Ruben Villegas, Forrester Cole, Kevin Murphy, Honglak Lee (Arxiv 2019)
- VRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction - Hongsong Wang, Jiashi Feng (Arxiv 2019)
- Delving into 3D Action Anticipation from Streaming Videos - Hongsong Wang, Jiashi Feng (Arxiv 2019)
- Sim2real transfer learning for 3D pose estimation: motion to the rescue - Carl Doersch, Andrew Zisserman (Arxiv 2019)
- A-MAL: Automatic Motion Assessment Learning from Properly Performed Motions in 3D Skeleton Videos - Tal Hakim, Ilan Shimshoni (Arxiv 2019)
- Learning Trajectory Dependencies for Human Motion Prediction - [CODE] - Wei Mao, Miaomiao Liu, Mathieu Salzmann, Hongdong Li (ICCV 2019)
- Dynamic Kernel Distillation for Efficient Pose Estimation in Videos - Xuecheng Nie, Yuncheng Li, Linjie Luo, Ning Zhang, Jiashi Feng (ICCV 2019)
- Imitation Learning for Human Pose Prediction - Borui Wang, Ehsan Adeli, Hsu-kuang Chiu, De-An Huang, Juan Carlos Niebles (ICCV 2019)
- Symbiotic Graph Neural Networks for 3D Skeleton-based Human Action Recognition and Motion Prediction - Maosen Li, Siheng Chen, Xu Chen, Ya Zhang, Yanfeng Wang, Qi Tian (Arxiv 2019)
- MultiPath: Multiple Probabilistic Anchor Trajectory Hypotheses for Behavior Prediction - Yuning Chai, Benjamin Sapp, Mayank Bansal, Dragomir Anguelov (CoRL 2019)
- Structured Prediction Helps 3D Human Motion Modelling - [CODE] - Emre Aksan, Manuel Kaufmann, Otmar Hilliges (ICCV 2019)
- Human Motion Prediction via Spatio-Temporal Inpainting - Alejandro Hernandez Ruiz, Juergen Gall, Francesc Moreno-Noguer (ICCV 2019)
- Imitation Learning for Human Pose Prediction - Borui Wang, Ehsan Adeli, Hsu-kuang Chiu, De-An Huang, Juan Carlos Niebles (ICCV 2019)
- Unsupervised learning of object structure and dynamics from videos - [CODE] - Matthias Minderer*, Chen Sun, Ruben Villegas, Forrester Cole, Kevin Murphy, Honglak Lee (NIPS 2019)
- Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction - Abduallah Mohamed, Kun Qian, Mohamed Elhoseiny, Christian Claudel (CVPR 2020)
- TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting - Yang, Zhuoqian and Zhu, Wentao and Wu, Wayne and Qian, Chen and Zhou, Qiang and Zhou, Bolei and Loy, Chen Change (CVPR 2020)
- TITAN: Future Forecast using Action Priors - Srikanth Malla, Behzad Dariush, Chiho Choi (CVPR 2020)
- Long-term Human Motion Prediction with Scene Context - Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, Jitendra Malik (Arxiv 2020)
- Human Motion Transfer from Poses in the Wild - Jian Ren, Menglei Chai, Sergey Tulyakov, Chen Fang, Xiaohui Shen, Jianchao Yang (Arxiv 2020)
- 3D human pose estimation with adaptive receptive fields and dilated temporal convolutions - Michael Shin, Eduardo Castillo, Irene Font Peradejordi, Shobhna Jayaraman (Arxiv 2020)
- TPNet: Trajectory Proposal Network for Motion Prediction - Liangji Fang, Qinhong Jiang, Jianping Shi, Bolei Zhou (Arxiv 2020)
- Generative Tweening: Long-term Inbetweening of 3D Human Motions - Yi Zhou, Jingwan Lu, Connelly Barnes, Jimei Yang, Sitao Xiang, Hao li(Arxiv 2020)
- Skeleton-Aware Networks for Deep Motion Retargeting - [CODE] - Kfir Aberman, Peizhuo Li, Dani Lischinski, Olga Sorkine-Hornung, Daniel Cohen-Or, Baoquan Chen (SIGGRAPH 2020)
- Unpaired Motion Style Transfer from Video to Animation - [CODE] - Kfir Aberman, Yijia Weng, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen (SIGGRAPH 2020)
Real-time pose estimation
- Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields - [CODE] - Cao, Z., Simon, T., Wei, S., & Sheikh, Y. (CVPR 2017)
- VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera - [CODE] - Mehta, Dushyant et al. (SIGGRAPH 2017)
- RMPE: Regional Multi-person Pose Estimation - [CODE1][CODE2] - Fang, H., Xie, S., & Lu, C. (ICCV 2017)
- Dense Human Pose Estimation In The Wild - [CODE] - Guler, R.A., Neverova, N., & Kokkinos, I. (CVPR 2018)
- MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network - [CODE] - Guler, R.A., Neverova, N., & Kokkinos, I. (ECCV 2018)
- Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose - [CODE] - Osokin, D. (Arxiv 2018)
- Extension to 3D pose estimation (based on Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB - Mehta, D., et al.) - [CODE]
- Lightweight 3D Human Pose Estimation Network Training Using Teacher-Student Learning- Dong-Hyun Hwang, Suntae Kim, Nicolas Monet, Hideki Koike, Soonmin Bae (Arxiv 2020)
Hand-Face-landmark
- Hand PointNet: 3D Hand Pose Estimation Using Point Sets - Liuhao Ge, Yujun Cai, Junwu Weng, Junsong Yuan (CVPR 2018)
- Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks - [CODE] - Xuepeng Shi, Shiguang Shan, Meina Kan, Shuzhe Wu, Xilin Chen (CVPR 2018)
- Hand Pose Estimation via Latent 2.5D Heatmap Networks Regression - Umar Iqbal , Pavlo Molchanov, Thomas Breuel, Juergen Gall, Jan Kautz1 (ECCV 2018)
- H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions - Bugra Tekin, Federica Bogo, Marc Pollefeys (CVPR 2019)
- 3D Hand Shape and Pose from Images in the Wild - Adnane Boukhayma, Rodrigo de Bem, Philip H.S. Torr (Arxiv 2019)
- 3D Dense Face Alignment via Graph Convolution Networks - Huawei Wei, Shuang Liang, Yichen Wei (Arxiv 2019)
- Disentangling Pose from Appearance in Monochrome Hand Images - Yikang Li, Chris Twigg, Yuting Ye, Lingling Tao, Xiaogang Wang (Arxiv 2019)
- Single Image 3D Hand Reconstruction with Mesh Convolutions - Dominik Kulon, Haoyang Wang, Riza Alp Güler, Michael Bronstein, Stefanos Zafeiriou (Arxiv 2019)
- Gesture-to-Gesture Translation in the Wild via Category-Independent Conditional Maps - Yahui Liu, Marco De Nadai, Gloria Zen, Nicu Sebe, Bruno Lepri (Arxiv 2019)
- FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images - [CODE] - Christian Zimmermann, Duygu Ceylan, Jimei Yang, Bryan Russell, Max Argus, Thomas Brox (ICCV 2019)
- Early Estimation of User's Intention of Tele-Operation Using Object Affordance and Hand Motion in a Dual First-Person Vision - Motoki Kojima, Jun Miura (Arxiv 2019)
- aligning latent spaces for 3d hand pose estimation - Linlin Yang, Shile Li, Dongheui Lee, Angela Yao (ICCV 2019)
- A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image -Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan (ICCV 2019)
- Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison -Dongxu Li and Cristian Rodriguez Opazo and Xin Yu and Hongdong Li (arxiv 2019)
- Deformation-aware Unpaired Image Translation for Pose Estimation on Laboratory Animals -Siyuan Li, Semih Günel, Mirela Ostrek, Pavan Ramdya, Pascal Fua, Helge Rhodin (arxiv 2020)
- Monocular Real-time Hand Shape and Motion Capture using Multi-modal Data - [CODE] -Yuxiao Zhou, Marc Habermann, Weipeng Xu, Ikhsanul Habibie, Christian Theobalt, Feng Xu (CVPR 2020)
- Balanced Alignment for Face Recognition: A Joint Learning Approach -Huawei Wei, Peng Lu, Yichen Wei (arxiv 2020)
- Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction (arxiv 2020)
- HandVoxNet: Deep Voxel-Based Network for 3D Hand Shape and Pose Estimation from a Single Depth Map -Jameel Malik, et,al (CVPR 2020)
- Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild - [CODE] -Dominik Kulon, Riza Alp Güler, Iasonas Kokkinos, Michael Bronstein, Stefanos Zafeiriou (CVPR 2020)
- Two-hand Global 3D Pose Estimation Using Monocular RGB -Fanqing Lin, Connor Wilhelm, Tony Martinez (arxiv 2020)
- Leveraging Photometric Consistency over Time for Sparsely Supervised Hand-Object Reconstruction -[CODE] -Hasson, Yana and Tekin, Bugra and Bogo, Federica and Laptev, Ivan and Pollefeys, Marc and Schmid, Cordelia (CVPR 2020)
- GanHand: Predicting Human Grasp Affordances in Multi-Object Scenes -Enric Corona, Albert Pumarola, Guillem Alenya, Francesc Moreno-Noguer, Gregory Rogez (CVPR 2020)
- Weakly-Supervised Domain Adaptation via GAN and Mesh Model for Estimating 3D Hand Poses Interacting Objects -Seungryul Baek, Kwang In Kim, Tae-Kyun Kim (CVPR 2020)
- JGR-P2O: Joint Graph Reasoning based Pixel-to-Offset Prediction Network for 3D Hand Pose Estimation from a Single Depth Image (ECCV 2020)
Datasets
2D
3D
Meshes
Benchmarks
2D
3D
Workshops
Blog posts
- Real-time Human Pose Estimation in the Browser with TensorFlow.js
- Deep learning for human pose estimation
- Deep Learning based Human Pose Estimation using OpenCV ( C++ / Python )
- A 2019 guide to Human Pose Estimation with Deep Learning
- Direct 3d Human Pose and Shape Estimation
Popular implementations
PyTorch
- MMPose
- openpifpaf
- pytorch-pose-hg-3d
- pytorch_Realtime_Multi-Person_Pose_Estimation
- AlphaPose
- pytorch-pose
- human-pose-estimation.pytorch
- coco-analyze,with Miss/Jitter/Swap/Inversion
TensorFlow
Torch
Others
Todo
- Add basics
- Add a SOTA ranking
- Add pose good group
- Human Mesh
- Pose & Language
- Popular implementations
License
<a rel="license" href="http://creativecommons.org/licenses/by/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by/4.0/88x31.png" /></a><br />This work is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</a>.