Awesome

Awesome Human Pose Estimation

A collection of resources on human pose related problem: mainly focus on human pose estimation, and will include mesh representation, flow calculation, (inverse) kinematics, affordance, robotics, or sequence learning.

Why awesome human pose estimation?

This is a collection of papers and resources I curated when learning the ropes in Human Pose estimation. And This is a fork from https://github.com/cbsudux/awesome-human-pose-estimation (thanks for cbsudux) and customized for personal study and sharing. I will be continuously updating this list with the latest papers and resources. If you want some theory on Human Pose Estimation, check out Pose Related_Human_Knowledge

https://github.com/xinghaochen/awesome-hand-pose-estimation https://github.com/1adrianb/face-alignment

Contributing

If you think I have missed out on something (or) have any suggestions (papers, implementations and other resources), feel free to pull a request

Feedback and contributions are welcome!

Basics
Papers
Datasets
Benchmarks
Workshops
Blog posts
Popular implementations
- PyTorch
- TensorFlow
- Torch
- Others

Basics

pose_related_human_knowledge

Papers

2D Pose estimation

Learning Human Pose Estimation Features with Convolutional Networks - Jain, A., Tompson, J., Andriluka, M., Taylor, G.W., & Bregler, C. (ICLR 2013)
DeepPose: Human Pose Estimation via Deep Neural Networks - Toshev, A., & Szegedy, C. (CVPR 2014)
Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation - [CODE] - Tompson, J., Jain, A., LeCun, Y., & Bregler, C. (NIPS 2014)
MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation - Jain, A., Tompson, J., LeCun, Y., & Bregler, C. (ACCV 2014)
Efficient Object Localization Using Convolutional Networks - Tompson, J., Goroshin, R., Jain, A., LeCun, Y., & Bregler, C (CVPR 2015)
Flowing ConvNets for Human Pose Estimation in Videos - [CODE] - Pfister, T., Charles, J., & Zisserman, A. (ICCV 2015)
Convolutional Pose Machines - [CODE] - Wei, S., Ramakrishna, V., Kanade, T., & Sheikh, Y. (CVPR 2016)
Human Pose Estimation with Iterative Error Feedback- [CODE] Carreira, J., Agrawal, P., Fragkiadaki, K., & Malik, J. (CVPR 2016)
DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation - [CODE] - Pishchulin, L., Insafutdinov, E., Tang, S., Andres, B., Andriluka, M., Gehler, P.V., & Schiele, B. (CVPR 2016)
DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model - [CODE1][CODE2] - Insafutdinov, E., Pishchulin, L., Andres, B., Andriluka, M., & Schiele, B. (ECCV 2016)
Stacked Hourglass Networks for Human Pose Estimation - [CODE] - Newell, A., Yang, K., & Deng, J. (ECCV 2016)
Multi-context Attention for Human Pose Estimation - [CODE] - Chu, X., Yang, W., Ouyang, W., Ma, C., Yuille, A.L., & Wang, X. (CVPR 2017)
Towards Accurate Multi-person Pose Estimation in the Wild - [CODE] - Papandreou, G., Zhu, T., Kanazawa, N., Toshev, A., Tompson, J., Bregler, C., & Murphy, K.P. (CVPR 2017)
Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields - [CODE] - Cao, Z., Simon, T., Wei, S., & Sheikh, Y. (CVPR 2017)
Learning Feature Pyramids for Human Pose Estimation - [CODE] - Yang, W., Li, S., Ouyang, W., Li, H., & Wang, X. (ICCV 2017)
Human Pose Estimation Using Global and Local Normalization - Sun, K., Lan, C., Xing, J., Zeng, W., Liu, D., & Wang, J. (ICCV 2017)
Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation - Chen, Y., Shen, C., Wei, X., Liu, L., & Yang, J. (ICCV 2017)
RMPE: Regional Multi-person Pose Estimation - [CODE1][CODE2] - Fang, H., Xie, S., & Lu, C. (ICCV 2017)
Self Adversarial Training for Human Pose Estimation - [CODE1][CODE2] - Chou, C., Chien, J., & Chen, H. (ArXiv 2017)
Recurrent Human Pose Estimation - [CODE] - Belagiannis, V., & Zisserman, A. (FG 2017)
Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation - [CODE] Ning, G., Zhang, Z., & He, Z. (IEEE Transactions on Multimedia 2018)
Human Pose Estimation with Parsing Induced Learner- Xuecheng Nie, Jiashi Feng, Yiming Zuo, Shuicheng Yan (CVPR 2018)
LSTM Pose Machines - [CODE] - Yue Luo, Jimmy Ren, Zhouxia Wang, Wenxiu Sun, Jinshan Pan, Jianbo Liu, Jiahao Pang, Liang Lin (CVPR 2018)
Cascaded Pyramid Network for Multi-Person Pose Estimation - [CODE] - Yilun Chen, Zhicheng Wang, Yuxiang Peng, Zhiqiang Zhang, Gang Yu, Jian Sun (CVPR 2018)
Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation - [CODE] - Peng, Xi and Tang, Zhiqiang and Yang, Fei and Feris, Rogerio S and Metaxas, Dimitris (CVPR 2018)
Human Pose Estimation with Parsing Induced Learner - [CODE] - Xuecheng Nie, Jiashi Feng, Yiming Zuo, Shuicheng Yan (CVPR 2018)
Through-Wall Human Pose Estimation Using Radio Signals - Mingmin Zhao,Tianhong Li, Mohammad Abu Alsheikh, Yonglong Tian, Hang Zhao, Antonio Torralba, Dina Katabi (CVPR 2018)
Simple Baselines for Human Pose Estimation and Tracking - [CODE] - Bin, Xiao, Haiping Wu, Yichen Wei (ECCV 2018)
Multi-Scale Structure-Aware Network for Human Pose Estimation - Lipeng Ke, Ming-Ching Chang, Honggang Qi, Siwei Lyu (ECCV 2018)
Deeply Learned Compositional Models for Human Pose Estimation - [CODE] - Wei Tang, Pei Yu, Ying Wu (ECCV 2018)
MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network - [CODE] - Muhammed Kocabas, Salih Karagoz, Emre Akbas (ECCV 2018)
Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose - [CODE] - Osokin, D. (Arxiv 2018)
Rethinking on Multi-Stage Networks for Human Pose Estimation - Wenbo Li, Zhicheng Wang, Binyi Yin, Qixiang Peng, Yuming Du, Tianzi Xiao, Gang Yu,Hongtao Lu, Yichen Wei, and Jian Sun (Arxiv 2018)
CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark - [CODE] - Jiefeng Li, Can Wang, Hao Zhu, Yihuan Mao, Hao-Shu Fang, Cewu Lu (CVPR 2019)
Deep High-Resolution Representation Learning for Human Pose Estimation - [CODE] - [CODE2] - Ke Sun, Bin Xiao, Dong Liu, Jingdong Wang (CVPR 2019)
Human Pose Estimation with Spatial Contextual Information - Hong Zhang, Hao Ouyang, Shu Liu, Xiaojuan Qi, Xiaoyong Shen, Ruigang Yang, Jiaya Jia (Arxiv 2019)
PoseFix: Model-agnostic General Human Pose Refinement Network - [CODE] - Moon, Gyeongsik and Chang, Juyong and Lee, Kyoung Mu (CVPR 2019)
Graphonomy: Universal Human Parsing via Graph Transfer Learning - [CODE] - Ke Gong, Yiming Gao, Xiaodan Liang, Xiaohui Shen, Meng Wang, Liang Lin (CVPR 2019)
PifPaf: Composite Fields for Human Pose Estimation - [CODE] - Sven Kreiss, Lorenzo Bertoni, Alexandre Alahi (CVPR 2019)
Person-in-WiFi: Fine-grained Person Perception using WiFi - Fei Wang, Sanping Zhou, Stanislav Panev, Jinsong Han, Dong Huang (arxiv 2019)
Can WiFi Estimate Person Pose? - Fei Wang, Stanislav Panev, Ziyi Dai, Jinsong Han, Dong Huang (arxiv 2019)
Learning to Learn Relation for Important People Detection in Still Images - [CODE] - Wei-Hong Li, Fa-Ting Hong, Wei-Shi Zheng (CVPR 2019)
Efficient Online Multi-Person 2D Pose Tracking with Recurrent Spatio-Temporal Affinity Fields - [CODE] - Yaadhav Raaj, Haroon Idrees, Gines Hidalgo, Yaser Sheikh(CVPR 2019)
Adaptive NMS: Refining Pedestrian Detection in a Crowd - Songtao Liu, Di Huang, Yunhong Wang (CVPR 2019)
Multi-Person Pose Estimation with Enhanced Channel-wise and Spatial Information - Kai Su, Dongdong Yu, Zhenqi Xu, Xin Geng, Changhu Wang (CVPR 2019)
Fast Human Pose Estimation - [CODE] - Feng Zhang, Xiatian Zhu, Mao Ye (CVPR 2019)
Slim DensePose: Thrifty Learning from Sparse Annotations and Motion Cues - Natalia Neverova, James Thewlis, Rıza Alp Güler, Iasonas Kokkinos, Andrea Vedaldi (CVPR 2019)
Objects as Points - [CODE] - Xingyi Zhou, Dequan Wang, Philipp Krähenbühl (arxiv 2019)
Learning Individual Styles of Conversational Gesture - [CODE] - Shiry Ginosar, Amir Bar, Gefen Kohavi, Caroline Chan, Andrew Owens, Jitendra Malik (CVPR 2019)
Does Learning Specific Features for Related Parts Help Human Pose Estimation? - Wei Tang and Ying Wu (CVPR 2019)
Visual Person Understanding through Multi-Task and Multi-Dataset Learning - Kilian Pfeiffer, et al (Arxiv 2019)
Movement science needs different pose tracking algorithms - Nidhi Seethapathi, Shaofei Wang, Rachit Saluja, Gunnar Blohm, Konrad P. Kording (Arxiv 2019)
Learning to Train with Synthetic Humans - David T. Hoffmann, Dimitrios Tzionas, Micheal J. Black, Siyu Tang (GCPR 2019)
Falls Prediction Based on Body Keypoints and Seq2Seq Architecture - Minjie Hua, Yibing Nan, Shiguo Lian (Arxiv 2019)
Cross-Domain Adaptation for Animal Pose Estimation - Jinkun Cao, Hongyang Tang, Hao-Shu Fang, Xiaoyong Shen, Cewu Lu, Yu-Wing Tai (ICCV 2019)
Pose Neural Fabrics Search - [CODE] - Sen Yang, Wankou Yang, Zhen Cui (Arxiv 2019)
Anchor Loss: Modulating Loss Scale based on Prediction Difficulty - Serim Ryou, Seong-Gyun Jeong, Pietro Perona (ICCV 2019)
Single-Network Whole-Body Pose Estimation - [CODE] - Gines Hidalgo, Yaadhav Raaj, Haroon Idrees, Donglai Xiang, Hanbyul Joo, Tomas Simon, Yaser Sheikh (ICCV 2019)
The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation - Junjie Huang, Zheng Zhu, Feng Guo, Guan Huang (Arxiv 2019)
DirectPose: Direct End-to-End Multi-Person Pose Estimation - Zhi Tian, Hao Chen, Chunhua Shen (Arxiv 2019)
The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation - Junjie Huang, Zheng Zhu, Feng Guo, Guan Huang (ICCV 2019)
TRB: A Novel Triplet Representation for Understanding 2D Human Body - [Data] - Haodong Duan, KwanYee Lin, Sheng Jin, Wentao Liu, Chen Qian, Wanli Ouyang (ICCV 2019)
Simple and Lightweight Human Pose Estimation - Zhe Zhang, Jie Tang, Gangshan Wu (Arxiv 2019)
Mixture Dense Regression for Object Detection and Human Pose Estimation - Ali Varamesh, Tinne Tuytelaars (Arxiv 2019)
15 Keypoints Is All You Need - Michael Snower, Asim Kadav, Farley Lai, Hans Peter Graf (Arxiv 2019)
Learning Temporal Pose Estimation from Sparsely Labeled Videos - [CODE] - Gedas Bertasius, Christoph Feichtenhofer, Du Tran, Jianbo Shi, Lorenzo Torresani (NIPS 2019)
Correlated Uncertainty for Learning DenseCorrespondences from Noisy Labels - Natalia Neverova, David Novotny, Andrea Vedaldi (NIPS 2019)
Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation -[CODE] - Jia Li, Wen Su, Zengfu Wang (AAAI2020)
An End-to-End Framework for Unsupervised Pose Estimation of Occluded Pedestrians - Sudip Das, Perla Sai Raj Kishore, Ujjwal Bhattacharya (Arxiv 2020)
Transferring Dense Pose to Proximal Animal Classes - [CODE] - Artsiom Sanakoyeu, Vasil Khalidov, Maureen S. McCarthy, Andrea Vedaldi, Natalia Neverova (CVPR 2020)
Peeking into occluded joints: A novel framework for crowd pose estimation - ingteng Qiu, Xuanye Zhang, Yanran Li, Guanbin Li, Xiaojun Wu, Zixiang Xiong, Xiaoguang Han, Shuguang Cui (Arxiv 2020)
Motion-supervised Co-Part Segmentation - Aliaksandr Siarohin*, Subhankar Roy*, Stéphane Lathuilière, Sergey Tulyakov, Elisa Ricci, Nicu Sebe (Arxiv 2020)
Detailed 2D-3D Joint Representation for Human-Object Interaction - [CODE] - Yong-Lu Li, Xinpeng Liu, Han Lu, Shiyi Wang, Junqi Liu, Jiefeng Li, Cewu Lu (CVPR 2020)
Distribution Aware Coordinate Representation for Human Pose Estimation - [CODE] - Feng Zhang, Xiatian Zhu, Hanbin Dai, Mao Ye, Ce Zhu (CVPR 2020)
Yoga-82: A New Dataset for Fine-grained Classification of Human Poses - [Data] - Manisha Verma, Sudhakar Kumawat, Yuta Nakashima, Shanmuganathan Raman (CVPRW 2020)
Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos - Rafi Umer, Andreas Doering, Bastian Leibe, Juergen Gall (Arxiv 2020)
Making DensePose fast and light (Arxiv 2020)
Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation - Sheng Jin, Wentao Liu, Enze Xie, Wenhai Wang, Chen Qian, Wanli Ouyang, Ping Luo (ECCV 2020)
Whole-Body Human Pose Estimation in the Wild - [Data] - Sheng Jin, Lumin Xu, Jin Xu, Can Wang, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo (ECCV 2020)

3D Pose estimation

Reconstruction of Articulated Objects from Point Correspondences in a Single Uncalibrated Image - CJ Taylor. (CVIU 2000)
Covariance-Scaled Sampling for Monocular 3D Body Tracking - Cristian Sminchisescu and Bill Triggs. (CVPR 2001)
Improving the Scope of Deformable Model Shape and Motion Estimation - C. Sminchisescu and D. Metaxas and S. Dickinson. (CVPR 2001)
Recovering 3D Human Posefrom Monocular Images - Ankur Agarwal and Bill Triggs. (PAMI 2006)
3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network - Li, S., & Chan, A.B. (ACCV 2014)
3D Pictorial Structures for Multiple Human Pose Estimation - Vasileios Belagiannis , Sikandar Amin, Mykhaylo Andriluka,Bernt Schiele, Nassir Navab, and Slobodan Ilic (CVPR 2014)
3D Human pose estimation: A review of the literature and analysis of covariates (CVIU 2016)
Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video - [CODE] - X. Zhou, M. Zhu, G. Pavlakos, S. Leonardos, K.G. Derpanis, K. Daniilidis. (CVPR 2016)
Structured Prediction of 3D Human Pose with Deep Neural Networks - Tekin, B., Katircioglu, I., Salzmann, M., Lepetit, V., & Fua, P. (BMVC 2016)
VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera - [CODE] - Mehta, Dushyant et al. (SIGGRAPH 2017)
Recurrent 3D Pose Sequence Machines - Lin, M., Lin, L., Liang, X., Wang, K., & Cheng, H. (CVPR 2017)
Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image - Tomè, D., Russell, C., & Agapito, L. (CVPR 2017)
3D Human Pose Estimation from a Single Image via Distance Matrix Regression - Francesc Moreno-Noguer. (CVPR 2017)
3D Human Pose Estimation = 2D Pose Estimation + Matching - [CODE] - Ching-Hang Chen, Deva Ramanan. (CVPR 2017)
Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose - [CODE] - Pavlakos, G., Zhou, X., Derpanis, K.G., & Daniilidis, K. (CVPR 2017)
LCR-Net: Localization-Classification-Regression for Human Pose - [CODE] - Grégory Rogez, Philippe Weinzaepfel, Cordelia Schmid. (CVPR 2017)
Deep Learning on Lie Groups for Skeleton-based Action Recognition - [CODE] - Zhiwu Huang, Chengde Wan, Thomas Probst, Luc Van Gool. (CVPR 2017)
Seeing invisible poses: Estimating3d body pose from egocentric video. - Hao Jiang, Kristen Grauman. (CVPR 2017)
Harvesting Multiple Views for Marker-less 3D Human Pose Annotations - [CODE] - G. Pavlakos, X. Zhou, K. Derpanis, K. Daniilidis. (CVPR 2017)
Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach - [CODE] - Zhou, X., Huang, Q., Sun, X., Xue, X., & Wei, Y. (ICCV 2017)
Adversarial Inverse Graphics Networks: Learning 2D-to-3D Lifting and Image-to-Image Translation from Unpaired Supervision - Hsiao-Yu Fish Tung. etal. (ICCV 2017)
A Simple Yet Effective Baseline for 3d Human Pose Estimation - [CODE] - Martinez, J., Hossain, R., Romero, J., & Little, J.J. (ICCV 2017)
Sparse Representation for 3D Shape Estimation: A Convex Relaxation Approach - [CODE] - X. Zhou, M. Zhu, S. Leonardos, K. Daniilidis. (PAMI 2017)
Compositional Human Pose Regression - Sun, X., Shang, J., Liang, S., & Wei, Y. (ICCV 2017)
Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision - Mehta, D., Rhodin, H., Casas, D., Fua, P., Sotnychenko, O., Xu, W., & Theobalt, C. (3DV 2017)
3D Human Pose Estimation in the Wild by Adversarial Learning - Yang, W., Ouyang, W., Wang, X., Ren, J.S., Li, H., & Wang, X. (CVPR 2018)
Ordinal Depth Supervision for 3D Human Pose Estimation - [CODE] - G. Pavlakos, X. Zhou, K. Daniilidis. (CVPR 2018)
V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation From a Single Depth Map - [CODE] - Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee. (CVPR 2018)
DRPose3D: Depth Ranking in 3D Human Pose Estimation - Wang, M., Chen, X., Liu, W., Qian, C., Lin, L., & Ma, L. (IJCAI 2018)
Human Motion Capture Using a Drone - X. Zhou, S. Liu, G. Pavlakos, V.J. Kumar, K. Daniilidis. (ICRA 2018)
End-to-end Recovery of Human Shape and Pose - [CODE] - Kanazawa, A., Black, M.J., Jacobs, D.W., & Malik, J. (CVPR 2018)
Learning to Estimate 3D Human Pose and Shape from a Single Color Image - Pavlakos, G., Zhu, L., Zhou, X., & Daniilidis, K. (CVPR 2018)
Monocular 3D Pose and Shape Estimation of Multiple People in Natural Scenes - Andrei Zanfir, Elisabeta Marinoiu, Cristian Sminchisescu. (CVPR 2018)
Dense Human Pose Estimation In The Wild - [CODE] - Guler, R.A., Neverova, N., & Kokkinos, I. (CVPR 2018)
Learning Monocular 3D Human Pose Estimation from Multi-View Images - Helge Rhodin, Jörg Spörri, Isinsu Katircioglu, Victor Constantin, Frédéric Meyer, Erich Müller, Mathieu Salzmann, Pascal Fua. (CVPR 2018)
3D Human Sensing, Action and Emotion Recognition inRobot Assisted Therapy of Children with Autism - Elisabeta Marinoiu, Mihai Zanfir, Vlad Olaru, Cristian Sminchisescu. (CVPR 2018)
Neural Body Fitting: Unifying Deep Learning and Model-Based Human Pose and Shape Estimation - [CODE] - Omran, Mohamed and Lassner, Christoph and Pons-Moll, Gerard and Gehler, Peter V. and Schiele, Bernt (3DV 2018)
Learning 3D Human Pose from Structure and Motion - Dabral, R., Mundhada, A., Kusupati, U., Afaque, S., Sharma, A., & Jain, A. (ECCV 2018)
Unsupervised Learning of View-invariant Action Representations - Junnan Li.etal. (NIPS 2018)
Deep Network for the Integrated 3D Sensing ofMultiple People in Natural Images - Andrei Zanfir.etal. (NIPS 2018)
Integral Human Pose Regression - [CODE] - Sun, X., Xiao, B., Liang, S., & Wei, Y. (ECCV 2018)
Dense Pose Transfer - Neverova, N., Guler, R.A., & Kokkinos, I. (ECCV 2018)
Deformable Pose Traversal Convolutionfor 3D Action and Gesture Recognition - Junwu Weng.et.al. (ECCV 2018)
Deep Autoencoder for Combined Human Pose Estimation and Body Model Upscaling - Matthew Trumble, Andrew Gilbert, Adrian Hilton, John Collomosse (ECCV 2018)
Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation - [CODE] - Rhodin, H., Salzmann, M., & Fua, P. (ECCV 2018)
Monocap: Monocular human motion capture using a CNN coupled with a geometric prior - [CODE] - X. Zhou, M. Zhu, G. Pavlakos, S. Leonardos, K.G. Derpanis, K. Daniilidis. (TPAMI 2018)
Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB - [CODE1][CODE2] - Mehta, Dushyant and Sotnychenko, Oleksandr and Mueller, Franziska and Xu, Weipeng and Sridhar, Srinath and Pons-Moll, Gerard and Theobalt, Christian (3DV 2018)
HUMBI 1.0: HUman Multiview Behavioral Imaging Dataset (Arxiv 2018)
Explicit Pose Deformation Learning for Tracking Human Poses - Xiao Sun, Chuankang Li, Stephen Lin (Arxiv 2018)
3D Human Pose Machines with Self-supervised Learning - [CODE] - Keze Wang, Liang Lin, Chenhan Jiang, Chen Qian, and Pengxu Wei. (TPAMI 2019)
3D Human Pose Estimation with 2D Marginal Heatmaps - [CODE] - Aiden Nibali, Zhen He, Stuart Morgan, Luke Prendergast. (WACV 2019)
Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views - [CODE] - Junting Dong, Wen Jiang, Qixing Huang, Hujun Bao, Xiaowei Zhou. (CVPR 2019)
Learning the Depths of Moving People by Watching Frozen People - Zhengqi Li, Tali Dekel, Forrester Cole, Richard Tucker, Noah Snavely, Ce Liu, William T. Freeman. (CVPR 2019)
Monocular Total Capture: Posing Face, Body and Hands in the Wild - [CODE] - Donglai Xiang, Hanbyul Joo, Yaser Sheikh (CVPR 2019)
RepNet: Weakly Supervised Training of an Adversarial Reprojection Network for 3D Human Pose Estimation - Bastian Wandt, Bodo Rosenhahn (CVPR 2019)
In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations - Ikhsanul Habibie, Weipeng Xu, Dushyant Mehta, Gerard Pons-Moll, Christian Theobalt (CVPR 2019)
Semantic Graph Convolutional Networks for 3D Human Pose Regression - [CODE] - Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris N. Metaxas (CVPR 2019)
ON THE CONTINUITY OF ROTATION REPRESENTATIONS IN NEURAL NETWORKS - Yi Zhou*, Connelly Barnes*, Jingwan Lu, Jimei Yang, Hao Li (CVPR 2019)
Self-Supervised Learning of 3D Human Pose using Multi-view Geometry - [CODE] - Muhammed Kocabas, Salih Karagoz, Emre Akbas, (CVPR 2019)
Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network - [CODE] - Chen Li, Gim Hee Lee (CVPR 2019)
Neural Scene Decomposition for Multi-Person Motion Capture - Helge Rhodin, Victor Constantin, Isinsu Katircioglu, Mathieu Salzmann, Pascal Fua (CVPR 2019)
Weakly-Supervised Discovery of Geometry-Aware Representationfor 3D Human Pose Estimation - Xipeng Chen, Kwan-Yee Lin, Wentao Liu, Chen Qian, Liang Lin (CVPR 2019)
IGE-Net: Inverse Graphics Energy Networksfor Human Pose Estimation and Single-View Reconstruction - Dominic Jack etal (CVPR 2019)
Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking - [CODE] - Saurabh Sharma, Pavan Teja Varigonda, Prashast Bindal, Abhishek Sharma, Arjun Jain (Arxiv 2019)
Estimating 3D Motion and Forces of Person-Object Interactions from Monocular Video - Zongmian Li, Jiri Sedlar, Justin Carpentier, Ivan Laptev, Nicolas Mansard, Josef Sivic (Arxiv 2019)
Context-aware Human Motion Prediction - Enric Corona, Albert Pumarola, Guillem Alenyà, Francesc Moreno (Arxiv 2019)
Unsupervised 3D Pose Estimation with Geometric Self-Supervision - Ching-Hang Chen, Ambrish Tyagi, Amit Agrawal, Dylan Drover, Rohith MV, Stefan Stojanov, James M. Rehg (Arxiv 2019)
Generalizing Monocular 3D Human Pose Estimation in the Wild - [CODE] - Luyang Wang, Yan Chen, Zhenhua Guo, Keyuan Qian, Mude Lin, Hongsheng Li, Jimmy S. Ren (Arxiv 2019)
Absolute Human Pose Estimation with Depth Prediction Network - Márton Véges, András Lőrincz (IJCNN 2019)
You2Me: Inferring Body Pose in Egocentric Video via First and Second Person Interactions - [CODE] - Evonne Ng, Donglai Xiang, Hanbyul Joo, Kristen Grauman (Arxiv 2019)
Learnable Triangulation of Human Pose - [CODE] - Karim Iskakov, Egor Burkov, Victor Lempitsky, Yury Malkov (ICCV 2019)
Not All Parts Are Created Equal: 3D Pose Estimation by Modelling Bi-directional Dependencies of Body Parts - Jue Wang, Shaoli Huang, Xinchao Wang, Dacheng Tao (Arxiv 2019)
MONET: Multiview Semi-supervised Keypoint via Epipolar Divergence - Yasamin Jafarian, Yuan Yao, Hyun Soo Park (Arxiv 2018)
Feature Boosting Network For 3D Pose Estimation - Jun Liu, Henghui Ding, Amir Shahroudy, Ling-Yu Duan, Xudong Jiang, Gang Wang, Alex C. Kot (TPAMI 2019)
Ego-Pose Estimation and Forecasting as Real-Time PD Control - [CODE] - Ye Yuan, Kris Kitani (ICCV 2019)
Understanding Human Context in 3D Scenes by Learning Spatial Affordances with Virtual Skeleton Models - Lasitha Piyathilaka, Sarath Kodagoda (Arxiv 2019)
XNect: Real-time Multi-person 3D Human Pose Estimation with a Single RGB Camera - [CODE] - Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans-Peter Seidel, Helge Rhodin, Gerard Pons-Moll, Christian Theobalt (Arxiv 2019)
Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image - [CODE] - [CODE] - Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee (ICCV 2019)
xR-EgoPose: Egocentric 3D Human Pose from an HMD Camera - [CODE] - Denis Tome, Patrick Peluse, Lourdes Agapito, Hernan Badino (ICCV 2019)
Semantic Estimation of 3D Body Shape and Pose using Minimal Cameras - Andrew Gilbert, Matthew Trumble, Adrian Hilton, John Collomosse (Arxiv 2019)
Resolving 3D Human Pose Ambiguities with 3D Scene Constraints - [Data] - [CODE] - Mohamed Hassan, Vasileios Choutas, Dimitrios Tzionas, Michael J. Black (ICCV 2019)
Distill Knowledge from NRSfM for Weakly Supervised 3D Pose Learning - Chaoyang Wang, Chen Kong, Simon Lucey (ICCV 2019)
Single-Stage Multi-Person Pose Machines - Xuecheng Nie, Jianfeng Zhang, Shuicheng Yan, Jiashi Feng (ICCV 2019)
A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image - [CODE] - Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan (ICCV 2019)
Cross View Fusion for 3D Human Pose Estimation - [CODE] - Haibo Qiu, Chunyu Wang, Jingdong Wang, Naiyan Wang, Wenjun Zeng (ICCV 2019)
Optimizing Network Structure for 3D Human Pose Estimation - Hai Ci, Chunyu Wang, Xiaoxuan Ma, Yizhou Wang, (ICCV 2019)
Motion Capture from Pan-Tilt Cameras with Unknown Orientation - Roman Bachmann, Jörg Spörri, Pascal Fua, Helge Rhodin (3DV 2019)
C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion - David Novotny, Nikhila Ravi, Benjamin Graham, Natalia Neverova, Andrea Vedaldi (ICCV 2019)
MonoLoco: Monocular 3D Pedestrian Localization and Uncertainty Estimation - [CODE] - Lorenzo Bertoni, Sven Kreiss, Alexandre Alahi (ICCV 2019)
Multi-Person 3D Human Pose Estimation from Monocular Images - Rishabh Dabral, Nitesh B Gundavarapu, Rahul Mitra, Abhishek Sharma, Ganesh Ramakrishnan, Arjun Jain (3DV 2019)
Human Synthesis and Scene Compositing - Mihai Zanfir, Elisabeta Oneata, Alin-Ionut Popa, Andrei Zanfir, Cristian Sminchisescu (Arxiv 2019)
MocapNET: Ensemble of SNN Encoders for 3D Human Pose Estimation in RGB Images - [CODE] - Qammaz, Ammar and Argyros, Antonis A (BMVC 2019)
Adversarial Attack on Skeleton-based HumanAction Recognition -Jian Liu, Naveed Akhtar, and Ajmal Mian, (Arxiv 2019)
Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop - [CODE] - Nikos Kolotouros, Georgios Pavlakos, Michael J. Black, Kostas Daniilidis (ICCV 2019)
DenseRaC: Joint 3D Pose and Shape Estimation by Dense Render-and-Compare - Yuanlu Xu, Song-Chun Zhu, Tony Tung (ICCV 2019)
Chirality Nets for Human Pose Regression - Raymond A. Yeh, Yuan-Ting Hu, Alexander G. Schwing (NIPS 2019)
Learning Graph Convolutional Network for Skeleton-based Human Action Recognition by Neural Searching - Wei Peng, Xiaopeng Hong, Haoyu Chen, Guoying Zhao (Arxiv 2020)
A Neural Network for Detailed Human Depth Estimation from a Single Image - Sicong Tang, Feitong Tan, Kelvin Cheng, Zhaoyang Li, Siyu Zhu, Ping Tan (ICCV 2019)
HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation - Kun Zhou, Xiaoguang Han, Nianjuan Jiang, Kui Jia, Jiangbo Lu (ICCV 2019)
Convex Optimisation for Inverse Kinematics - Tarun Yenamandra, Florian Bernard, Jiayi Wang, Franziska Mueller, Christian Theobalt (Arxiv 2019)
AbsPoseLifter: Absolute 3D Human Pose Lifting Network from a Single Noisy 2D Human Pose - Ju Yong Chang, Gyeongsik Moon, Kyoung Mu Lee (Arxiv 2019)
SMART: Skeletal Motion Action Recognition aTtack - He Wang, Feixiang He, Zexi Peng, Yongliang Yang, Tianjia Shao, Kun Zhou, David Hogg (Arxiv 2019)
Markerless Outdoor Human Motion Capture Using Multiple Autonomous Micro Aerial Vehicles - [CODE] - Nitin Saini, Eric Price, Rahul Tallamraju, Raffi Enficiaud, Roman Ludwig, Igor Martinovic, Aamir Ahmad, Michael J. Black (ICCV 2019)
Exploiting Spatial-temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks - Yujun Cai, etal (ICCV 2019)
Occlusion-Aware Networks for 3D Human Pose Estimation in Video - Yu Cheng, Bo Yang, Bo Wang, Wending Yan, and Robby T. Tan (ICCV 2019)
On Boosting Single-Frame 3D Human Pose Estimation via Monocular Videos - Zhi Li,Xuan Wang, Fei Wang, and Peilin Jiang(ICCV 2019)
Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking - [CODE] - Saurabh Sharma, Pavan Teja Varigonda, Prashast Bindal, Abhishek Sharma, Arjun Jain (ICCV 2019)
Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates - Diogo C Luvizon, Hedi Tabia, David Picard (Arxiv 2019)
DeepFuse: An IMU-Aware Network for Real-Time 3D Human Pose Estimation from Multi-View Image - Fuyang Huang, Ailing Zeng, Minhao Liu, Qiuxia Lai, Qiang Xu (WACV 2020)
Generating 3D People in Scenes without People - Yan Zhang, Mohamed Hassan, Heiko Neumann, Michael J. Black, Siyu Tang (Arxiv 2019)
Video Motion Capture from the Part Confidence Maps of Multi-Camera Images by Spatiotemporal Filtering Using the Human Skeletal Model - Takuya Ohashi, Yosuke Ikegami, Kazuki Yamamoto, Wataru Takano, Yoshihiko Nakamura (IROS 2018)
Domes to Drones: Self-Supervised Active Triangulation for 3D Human Pose Reconstruction
Aleksis Pirinen1, Erik Gärtner and Cristian Sminchisescu (NIPS 2019)
From Kinematics To Dynamics: Estimating Center of Pressure and Base of Support from Video Frames of Human Motion
Jesse Scott, Christopher Funk, Bharadwaj Ravichandran, John H. Challis, Robert T. Collins, Yanxi Liu (arxiv 2020)
ActiveMoCap: Optimized Drone Flight for Active Human Motion Capture - Sena Kiciroglu, Helge Rhodin, Sudipta Sinha, Mathieu Salzmann, Pascal Fua (Arxiv 2019)
Synergetic Reconstruction from 2D Pose and 3D Motion for Wide-Space Multi-Person Video Motion Capture in the Wild - [CODE] - Takuya Ohashi, Yosuke Ikegami,Yoshihiko Nakamura (Arxiv 2020)
Deep Reinforcement Learning for Active Human Pose Estimation - Erik Gärtner, Aleksis Pirinen, Cristian Sminchisescu (AAAI 2020)
Deep NRSfM++: Towards 3D Reconstruction in the Wild - Chaoyang Wang, Chen-Hsuan Lin, Simon Lucey (Arxiv 2020)
Anatomy-aware 3D Human Pose Estimation in Videos - Tianlang Chen, Chen Fang, Xiaohui Shen, Yiheng Zhu, Zhili Chen, Jiebo Luo (Arxiv 2020)
PoseNet3D: Unsupervised 3D Human Shape and Pose Estimation - Shashank Tripathi, Siddhant Ranade, Ambrish Tyagi, Amit Agrawal (Arxiv 2020)
EllipBody: A Light-weight and Part-based Representation for Human Pose and Shape Recovery - Min Wang, Feng Qiu, Wentao Liu, Chen Qian, Xiaowei Zhou, Lizhuang Ma (CVPR 2020)
Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild - Umar Iqbal, Pavlo Molchanov, Jan Kautz (CVPR 2020)
Fusing Wearable IMUs with Multi-View Images for Human Pose Estimation: A Geometric Approach - Zhe Zhang, Chunyu Wang, Wenhu Qin, Wenjun Zeng (CVPR 2020)
MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation - Rongchang Xie, Chunyu Wang, Yizhou Wang (CVPR 2020)
Optical Non-Line-of-Sight Physics-based 3D Human Pose Estimation - Mariko Isogawa, Ye Yuan, Matthew O'Toole, Kris Kitani (CVPR 2020)
Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation - Matteo Fabbri, Fabio Lanzi, Simone Calderara, Stefano Alletto, Rita Cucchiara (CVPR 2020)
Bodies at Rest: 3D Human Pose and Shape Estimation from a Pressure Image using Synthetic Data - [CODE] - Henry M. Clever, Zackory Erickson, Ariel Kapusta, Greg Turk, C. Karen Liu, Charles C. Kemp (CVPR 2020)
Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled Representation - Edoardo Remelli, Shangchen Han, Sina Honari, Pascal Fua, Robert Wang (CVPR 2020)
Multi-Person Absolute 3D Human Pose Estimation with Weak Depth Supervision - [Code] - Marton Veges, Andras Lorincz (Arxiv 2020)
Exemplar Fine-Tuning for 3D Human Pose Fitting Towards In-the-Wild 3D Human Pose Estimation - Hanbyul Joo, Natalia Neverova, Andrea Vedaldi (Arxiv 2020)
Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image Synthesis - Jogendra Nath Kundu, Siddharth Seth, Varun Jampani, Mugalodi Rakesh, R. Venkatesh Babu, Anirban Chakraborty (CVPR 2020)
Multimodal and multiview distillation for real-time player detection on a football field - Anthony Cioppa, Adrien Deliège, Noor Ul Huda, Rikke Gade, Marc Van Droogenbroeck, Thomas B. Moeslund (CVPRW 2020)
End-to-End Estimation of Multi-Person 3D Poses from Multiple Cameras - [code] - Hanyue Tu, Chunyu Wang, Wenjun Zeng (Arxiv 2020)
Motion Guided 3D Pose Estimation from Videos - Jingbo Wang, Sijie Yan, Yuanjun Xiong, Dahua Lin (Arxiv 2020)
View Invariant Human Body Detection and Pose Estimation from Multiple Depth Sensors - Walid Bekhtaoui, Ruhan Sa, Brian Teixeira, Vivek Singh, Klaus Kirchberg, Yao-jen Chang, Ankur Kapoor (Arxiv 2020)
MEBOW: Monocular Estimation of Body Orientation In the Wild - Chenyan Wu, et,al (CVPR 2020)
Epipolar Transformers - [code] - Yihui He, Rui Yan, Katerina Fragkiadaki, Shoou-I Yu (CVPR 2020)
Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS - Long Chen, Haizhou Ai, Rui Chen, Zijie Zhuang, Shuang Liu (CVPR 2020)
Multiview-Consistent Semi-Supervised Learning for 3D Human Pose Estimation - Rahul Mitra, Nitesh B. Gundavarapu, Abhishek Sharma, Arjun Jain (CVPR 2020)
Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction - [code] - Ruixu Liu, Ju Shen, He Wang, Chen Chen, Sen-ching Cheung, Vijayan Asari (CVPR 2020)
Cascaded Deep Monocular 3D Human Pose Estimation With Evolutionary Training Data - Shichao Li, Lei Ke, Kevin Pratama, Yu-Wing Tai, Chi-Keung Tang, Kwang-Ting Cheng (CVPR 2020)
Deep Kinematics Analysis for Monocular 3D Human Pose Estimation - Jingwei Xu et al. (CVPR 2020)
Three-dimensional Reconstruction of Human Interactions - [code] - Mihai Fieraru et al. (CVPR 2020)
Coherent Reconstruction of Multiple Humans from a Single Image - [code] - Wen Jiang, Nikos Kolotouros, Georgios Pavlakos , Xiaowei Zhou, Kostas Daniilidis (CVPR 2020)
Optical Non-Line-of-Sight Physics-based 3D Human Pose Estimation - [code] - Mariko Isogawa, Ye Yuan, Matthew O'Toole, Kris Kitani (CVPR 2020)
Kinematic-Structure-Preserved Representation for Unsupervised 3D Human Pose Estimation - Jogendra Nath Kundu, Siddharth Seth, Rahul M V, Mugalodi Rakesh, R. Venkatesh Babu, Anirban Chakraborty (AAAI 2020)
Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation - Jianfeng Zhang, Xuecheng Nie, Jiashi Feng (Arxiv 2020)
Geometric Pose Affordance: 3D Human Pose with Scene Constraints - [Page] - Zhe Wang, Liyan Chen, Shaurya Rathore, Daeyun Shin, Charless Fowlkes (Arxiv 2019)
Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation - [Page] - Zhe Wang, Daeyun Shin, Charless C. Fowlkes (Arxiv 2020)

Geometry

Neural 3D Mesh Renderer - JKato, Hiroharu and Ushiku, Yoshitaka and Harada, Tatsuya (CVPR 2018)
Learning Two-View Correspondences and Geometry Using Order-Aware Network - Jiahui Zhang, Dawei Sun, Zixin Luo, Anbang Yao, Lei Zhou, Tianwei Shen, Yurong Chen, Long Quan, Hongen Liao (ICCV 2019)
UprightNet: Geometry-Aware Camera Orientation Estimation from Single Images - Wenqi Xian, Zhengqi Li, Matthew Fisher, Jonathan Eisenmann, Eli Shechtman, Noah Snavely (Arxiv 2019)
Gravity as a Reference for Estimating a Person's Height from Video - Didier Bieler, Semih Günel, Pascal Fua, Helge Rhodin (ICCV 2019)
End-to-End Multi-View Fusion for 3D Object Detection in LiDAR Point Clouds - DYin Zhou, Pei Sun, Yu Zhang, Dragomir Anguelov, Jiyang Gao, Tom Ouyang, James Guo, Jiquan Ngiam, Vijay Vasudevan (CoRL 2019)
EGenerating Human Action Videos by Coupling 3D Game Engines and Probabilistic Graphical Models - César Roberto de Souza, Adrien Gaidon, Yohann Cabon, Naila Murray, Antonio Manuel López (IJCV 2019)
Unsupervised High-Resolution Depth Learning From Videos With Dual Networks - Junsheng Zhou, Yuwang Wang, Kaihuai Qin, Wenjun Zeng (ICCV 2019)
Moving Indoor: Unsupervised Video Depth Learning in Challenging Environments - Junsheng Zhou, Yuwang Wang, Kaihuai Qin, Wenjun Zeng (ICCV 2019)
6-PACK: Category-level 6D Pose Tracker with Anchor-Based Keypoints -[CODE] - Chen Wang, Roberto Martín-Martín, Danfei Xu, Jun Lv, Cewu Lu, Li Fei-Fei, Silvio Savarese, Yuke Zhu (Arxiv 2019)
GraphX-Convolution for Point Cloud Deformation in 2D-to-3D Conversion -[CODE] - Anh-Duc Nguyen, Seonghwa Choi, Woojae Kim, Sanghoon Lee (ICCV 2019)
GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs -[CODE] - Yuan Liu, Zehong Shen, Zhixuan Lin, Sida Peng, Hujun Bao, Xiaowei Zhou (NIPS 2019)
Conservative Wasserstein Training for Pose Estimation - Xiaofeng Liu, Yang Zou, Tong Che, Peng Ding, Ping Jia, Jane You, Kumar B.V.K (ICCV 2019)
Tell Me What They're Holding: Weakly-supervised Object Detection with Transferable Knowledge from Human-object Interaction - Daesik Kim, Gyujeong Lee, Jisoo Jeong, Nojun Kwak (AAAI 2020)
Single-Stage 6D Object Pose Estimation - Yinlin Hu, Pascal Fua, Wei Wang, Mathieu Salzmann (Arxiv 2019)
GP2C: Geometric Projection Parameter Consensus for Joint 3D Pose and Focal Length Estimation in the Wild - Alexander Grabner, Peter M. Roth, Vincent Lepetit (ICCV 2019)
SANet: Scene Agnostic Network for Camera Localization - Luwei Yang etal (ICCV 2019)
ViewSynth: Learning Local Features from Depth using View Synthesis - Jisan Mahmud, Peri Akiva, Rajat Vikram Singh, Spondon Kundu, Kuan-Chuan Peng, Jan-Michael Frahm (Arxiv 2019)
KeyPose: Multi-view 3D Labeling and Keypoint Estimation for Transparent Objects - Xingyu Liu, Rico Jonschkowski, Anelia Angelova, Kurt Konolige (Arxiv 2019)
3D Objectness Estimation via Bottom-up Regret Grouping - Zelin Ye, Yan Hao, Liang Xu, Rui Zhu, Cewu Lu (Arxiv 2019)
Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation -[CODE] - Hongwei Yi, Zizhuang Wei, Mingyu Ding, Runze Zhang, Yisong Chen, Guoping Wang, Yu-Wing Tai (Arxiv 2019)
Neural Network Generalization: The impact of camera parameters - HZhenyi Liu, Trisha Lian, Joyce Farrell, Brian Wandell(Arxiv 2019)
Geometric Capsule Autoencoders for 3D Point Clouds - Nitish Srivastava, Hanlin Goh, Ruslan Salakhutdinov(Arxiv 2019)
What You See is What You Get: Exploiting Visibility for 3D Object Detection - Peiyun Hu, Jason Ziglar, David Held, Deva Ramanan(Arxiv 2019)
Car Pose in Context: Accurate Pose Estimation with Ground Plane Constraints - Pengfei Li, Weichao Qiu, Michael Peven, Gregory D. Hager, Alan L. Yuille (Arxiv 2019)
Quaternion Knowledge Graph Embeddings -[CODE] - Shuai Zhang, Yi Tay, Lina Yao, Qi Liu (NIPS 2019)
Quaternion Product Units for Deep Learning on 3D Rotation Groups -Xuan Zhang, Shaofei Qin, Yi Xu, Hongteng Xu (arxiv 2019)
Inferring Distributions Over Depth from a Single Image -Gengshan Yang, Peiyun Hu, Deva Ramanan (IROS 2019)
A Bayesian 3D Multi-view Multi-object Tracking Filter -Jonah Ong, Ba Tuong Vo, Ba Ngu Vo, Du Yong Kim, Sven Nordholm (TPAMI 2020)
Learning to Move with Affordance Maps -William Qi, Ravi Teja Mullapudi, Saurabh Gupta, Deva Ramanan (ICLR 2020)
Depth Estimation by Learning Triangulation and Densification of Sparse Points for Multi-view Stereo -Ayan Sinha, Zak Murez, James Bartolozzi, Vijay Badrinarayanan, Andrew Rabinovich (arxiv 2019)
Differentiable Volumetric Rendering: Learning Implicit 3D Representations without 3D Supervision -[CODE] - Niemeyer, Michael and Mescheder, Lars and Oechsle, Michael and Geiger, Andreas (CVPR 2020)
SeqXY2SeqZ: Structure Learning for 3D Shapes by Sequentially Predicting 1D Occupancy Segments From 2D Coordinates -Zhizhong Han, Guanhui Qiao, Yu-Shen Liu, Matthias Zwicker (arxiv 2020)
Real-Time Camera Pose Estimation for Sports Fields -Leonardo Citraro, Pablo Márquez-Neila, Stefano Savarè, Vivek Jayaram, Charles Dubout, Félix Renaut, Andrés Hasfura, Horesh Ben Shitrit, Pascal Fua (arxiv 2020)
DO OPTIMIZATION METHODS IN DEEP LEARNING APPLICATIONS MATTER -[CODE] -Buse Melis Ozyildirim, Mariam Kiran (arxiv 2020)
Occlusion-Aware Depth Estimation with Adaptive Normal Constraints -Xiaoxiao Long, Lingjie Liu, Christian Theobalt, Wenping Wang (arxiv 2020)
DualConvMesh-Net: Joint Geodesic and Euclidean Convolutions on 3D Meshes -Jonas Schult, Francis Engelmann, Theodora Kontogianni, Bastian Leibe (CVPR 2020)
Robust Single Rotation Averaging -Seong Hun Lee, Javier Civera (Arxiv 2020)
Self-Supervised Scene De-occlusion -[CODE] -Xiaohang Zhan, Xingang Pan, Bo Dai, Ziwei Liu, Dahua Lin, Chen Change Loy (CVPR 2020)
RANSAC-Flow: generic two-stage image alignment -[CODE] -Xi Shen, François Darmon, Alexei A. Efros, Mathieu Aubry (Arxiv 2020)
Deep Homography Estimation for Dynamic Scenes -[CODE] -Hoang Le, Feng Liu, Shu Zhang, Aseem Agarwala (CVPR 2020)
Self-Supervised Viewpoint Learning From Image Collections -[CODE] -Siva Karthik Mustikovela, Varun Jampani, Shalini De Mello, Sifei Liu, Umar Iqbal, Carsten Rother, Jan Kautz (CVPR 2020)
Where Does It End? -- Reasoning About Hidden Surfaces by Object Intersection Constraints -Michael Strecke, Joerg Stueckler (CVPR 2020)
Leveraging 2D Data to Learn Textured 3D Mesh Generation -Paul Henderson, Vagia Tsiminaki, Christoph H. Lampert (CVPR 2020)
Image Co-skeletonization via Co-segmentation -Koteswar Rao Jerripothula, Jianfei Cai, Jiangbo Lu, Junsong Yuan (Arxiv 2020)
On the uncertainty of self-supervised monocular depth estimation -[CODE] -Matteo Poggi, Filippo Aleotti, Fabio Tosi, Stefano Mattoccia (CVPR 2020)
Focus on defocus: bridging the synthetic to real domain gap for depth estimation -Maxim Maximov, Kevin Galim, Laura Leal-Taixé (CVPR 2020)
Uncertainty-Aware CNNs for Depth Completion: Uncertainty from Beginning to End -Abdelrahman Eldesokey, Michael Felsberg, Karl Holmquist, Mikael Persson (CVPR 2020)
Accurate Estimation of Body Height From a Single Depth Image via a Four-Stage Developing Network -[CODE] -Fukun Yin, Shizhe Zhou (CVPR 2020)
Quaternion Capsule Networks (Arxiv 2020)

Group of people

SOCIAL LSTM: HUMAN TRAJECTORY PREDICTION IN CROWDED SPACES. - Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, Silvio Savarese. (CVPR 2016)
Multi-Agent Tensor Fusion for Contextual Trajectory Prediction - Tianyang Zhao, Yifei Xu, Mathew Monfort, Wongun Choi, Chris Baker, Yibiao Zhao, Yizhou Wang, Ying Nian Wu (CVPR 2019)
Towards Social Artificial Intelligence: Nonverbal Social Signal Prediction in A Triadic Interaction - Hanbyul Joo, Tomas Simon, Mina Cikara, Yaser Sheikh (CVPR 2019)
Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks - Vineet Kosaraju, Amir Sadeghian, Roberto Martín-Martín, Ian Reid, S. Hamid Rezatofighi, Silvio Savarese (Arxiv 2019)
To React or not to React: End-to-End Visual Pose Forecasting for Personalized Avatar during Dyadic Conversations - Chaitanya Ahuja, Shugao Ma, Louis-Philippe Morency, Yaser Sheikh (Arxiv 2019)

Person generation

Activity Forecasting. - [CODE] - Kris M. Kitani, Brian Ziebart, James D. Bagnell and Martial Hebert. (ECCV 2012)
Action-Reaction: Forecasting the Dynamics of Human Interaction. - De-An Huang and Kris M. Kitani. (ECCV 2014)
A deep learning framework for character motion synthesis and editing - [CODE] (TOG 2016)
Binge Watching: Scaling Affordance Learning from Sitcoms. - [CODE] - Xiaolong Wang*, Rohit Girdhar*, and Abhinav Gupta. (CVPR 2017)
Pose Guided Person Image Generation - [CODE] - Ma, L., Jia, X., Sun, Q., Schiele, B., Tuytelaars, T., & Gool, L.V. (NIPS 2017)
A Generative Model of People in Clothing - Lassner, C., Pons-Moll, G., & Gehler, P.V. (ICCV 2017)
First-Person Activity Forecasting with Online Inverse Reinforcement Learning - Nicholas Rhinehart and Kris M. Kitani. (ICCV 2017)
Synthesizing Images of Humans in Unseen Poses - [CODE] - Guha Balakrishnan, Amy Zhao, Adrian V. Dalca, Fredo Durand, John Guttag. (CVPR 2018)
A Variational U-Net for Conditional Appearance and Shape Generation - [CODE] - Patrick Esser, Ekaterina Sutter, Björn Ommer. (CVPR 2018)
Deformable GANs for Pose-based Human Image Generation - [CODE] - Siarohin, A., Sangineto, E., Lathuilière, S., & Sebe, N. (CVPR 2018)
Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks - [CODE] - Agrim Gupta, Justin Johnson, Fei-Fei Li, Silvio Savarese, Alexandre Alahi. (CVPR 2018)
QuaterNet: A Quaternion-based Recurrent Model for Human Motion - [CODE] - Dario Pavllo, David Grangier, and Michael Auli. (BMVC 2018)
Dense Pose Transfer - Neverova, N., Guler, R.A., & Kokkinos, I. (ECCV 2018)
MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics - Xinchen Yan, Akash Rastogi, Ruben Villegas, Kalyan Sunkavalli, Eli Shechtman, Sunil Hadap, Ersin Yumer, Honglak Lee (ECCV 2018)
Few-Shot Human Motion Prediction via Meta-Learning - Liang-Yan Gui, Yu-Xiong Wang, Deva Ramanan, and Jos ́e M. F. Moura (ECCV 2018)
Unsupervised Learning of Object Landmarks through Conditional Image Generation - Tomas Jakab, Ankush Gupta, Hakan Bilen, Andrea Vedaldi (NIPS 2018)
FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification - [CODE] - Yixiao Ge.etal. (NIPS 2018)
Soft-Gated Warping-GAN for Pose-Guided Person Image Synthesis - Haoye Dong.etal. (NIPS 2018)
AUTO-CONDITIONED LSTM NETWORK FOR EXTENDED COMPLEX HUMAN MOTION SYNTHESIS - Yi Zhou*, Zimo Li*, Shuangjio Xiao, Chong He, Zeng Huang, Hao Li . (ICLR 2018)
Everybody Dance Now - Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros (Arxiv 2018)
SiCloPe: Silhouette-Based Clothed People (Arxiv 2019)
Unpaired Pose Guided Human Image Generation - Xu Chen, Jie Song, Otmar Hilliges (Arxiv 2019)
Peeking into the Future: Predicting Future Person Activities and Locations in Videos - Junwei Liang, Lu Jiang, Juan Carlos Niebles, Alexander Hauptmann, Li Fei-Fei (Arxiv 2019)
Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments - [CODE] - Xueting Li, SIfei Liu, Kihwan Kim, Xiaolong Wang, Ming-Hsuan Yang, Jan Kautz (CVPR 2019)
Dense Intrinsic Appearance Flow for Human Pose Transfer - [CODE] - Yining Li, Chen Huang, Chen Change Loy (CVPR 2019)
Vid2Game: Controllable Characters Extracted from Real-World Videos - Oran Gafni, Lior Wolf, Yaniv Taigman (Arxiv 2019)
Textured Neural Avatars - [CODE] - Aliaksandra Shysheya, et al (Arxiv 2019)
Explicit Disentanglement of Appearance and Perspective in Generative Models -Nicki S. Detlefsen,Søren Hauberg (Arxiv 2019)
Learning Variations in Human Motion via Mix-and-Match Perturbation -Mohammad Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann, Lars Petersson, Stephen Gould, Amirhossein Habibian (Arxiv 2019)
First Order Motion Model for Image Animation - [CODE] -Stéphane Lathuilière, Sergey Tulyakov, Elisa Ricci and Nicu Sebe (NIPS 2019)
Adversarial Synthesis of Human Pose from Text -Yifei Zhang, Rania Briq, Julian Tanke, Juergen Gall (Arxiv 2020)

3D Human Mesh

Video Based Reconstruction of 3D People Models - Thiemo Alldieck， Marcus Magnor， Weipeng Xu， Christian Theobalt， Gerard Pons-Moll， (CVPR 2018)
End-to-end Recovery of Human Shape and Pose - [CODE] - Kanazawa, A., Black, M.J., Jacobs, D.W., & Malik, J. (CVPR 2018)
Relighting Humans: Occlusion-Aware Inverse Rendering for Full-Body Human Images - [CODE] - Yoshihiro Kanamori, Yuki Endo. (Siggraph 2018)
BodyNet: Volumetric Inference of 3D Human Body Shapes - [CODE] - Varol, G., Ceylan, D., Russell, B., Yang, J., Yumer, E., Laptev, I., & Schmid, C. (ECCV 2018)
Learning to Reconstruct People in Clothing from a Single RGB Camera - Thiemo Alldieck, Marcus Magnor, Bharat Lal Bhatnagar, Christian Theobalt, Gerard Pons-Moll (Arxiv 2019)
DeepHuman: 3D Human Reconstruction from a Single Image - Zerong Zheng, Tao Yu, Yixuan Wei, Qionghai Dai, Yebin Liu (Arxiv 2019)
Learning 3D Human Dynamics from Video - [CODE] - Angjoo Kanazawa, Jason Y. Zhang, Panna Felsen, Jitendra Malik (CVPR 2019)
Detailed Human Shape Estimation from a Single Image by Hierarchical Mesh Deformation - [CODE] - Hao Zhu, Xinxin Zuo, Sen Wang, Xun Cao, Ruigang Yang (CVPR 2019)
LBS Autoencoder: Self-supervised Fitting of Articulated Meshes to Point Clouds - Chun-Liang Li, Tomas Simon, Jason Saragih, Barnabás Póczos, Yaser Sheikh (CVPR 2019)
Convolutional Mesh Regression for Single-Image Human Shape Reconstruction - [CODE] - Nikos Kolotouros, Georgios Pavlakos, Kostas Daniilidis (CVPR 2019)
Expressive Body Capture: 3D Hands, Face, and Body from a Single Image - [CODE] - [CODE] - [CODE] - Georgios Pavlakos, Vasileios Choutas, Nima Ghorbani, Timo Bolkart, Ahmed A. A. Osman, Dimitrios Tzionas, Michael J. Black (CVPR 2019)
Volumetric Capture of Humans with a Single RGBD Camera viaSemi-Parametric Learning - Rohit Pandey et al. (CVPR 2019)
DenseBody: Directly Regressing Dense 3D Human Pose and Shape From a Single Color Image - [CODE] - Pengfei Yao, Zheng Fang, Fan Wu, Yao Feng, Jiwei Li (Arxiv 2019)
Towards 3D Human Shape Recovery Under Clothing - Xin Chen, Anqi Pang, Yu Zhu, Yuwei Li, Xi Luo, Ge Zhang, Peihao Wang, Yingliang Zhang, Shiying Li, Jingyi Yu (Arxiv 2019)
3DPeople: Modeling the Geometry of Dressed Humans - [Dataset] - Albert Pumarola, Jordi Sanchez, Gary P. T. Choi, Alberto Sanfeliu, Francesc Moreno-Noguer (ICCV 2019)
Long-Term Video Generation of Multiple FuturesUsing Human Poses - Naoya Fushishita, Antonio Tejero-de-Pablos, Yusuke Mukuta, Tatsuya Harada (Arxiv 2019)
PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization - Shunsuke Saito, Zeng Huang, Ryota Natsume, Shigeo Morishima, Angjoo Kanazawa, Hao Li (Arxiv 2019)
Learning 3D Human Body Embedding - Boyi Jiang, Juyong Zhang, Jianfei Cai, Jianmin Zheng (Arxiv 2019)
Shape Evasion: Preventing Body Shape Inference of Multi-Stage Approaches - Hosnieh Sattar, Katharina Krombholz, Gerard Pons-Moll, Mario Fritz (Arxiv 2019)
Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video - Jian Liu, Naveed Akhtar, Ajmal Mian (Arxiv 2019)
Moulding Humans: Non-parametric 3D Human Shape Estimation from Single Images - [Studio] - Valentin Gabeur, Jean-Sebastien Franco, Xavier Martin, Cordelia Schmid, Gregory Rogez (ICCV 2019)
Dressing 3D Humans using a Conditional Mesh-VAE-GAN - Qianli Ma, Siyu Tang, Sergi Pujades, Gerard Pons-Moll, Anurag Ranjan, Michael J. Black (Arxiv 2019)
AMASS: Archive of Motion Capture as Surface Shapes - [CODE] - [CODE] - Mahmood, Naureen and Ghorbani, Nima and F. Troje, Nikolaus and Pons-Moll, Gerard and Black, Michael J. (ICCV 2019)
Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation - Sun Yu, Ye Yun, Liu Wu, Gao Wenpeng, Fu YiLi, Mei Tao (ICCV 2019)
Multi-Garment Net: Learning to Dress 3D People from Images - Bharat Lal Bhatnagar, Garvita Tiwari, Christian Theobalt, Gerard Pons-Moll (ICCV 2019)
Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild - [CODE] - Yu Rong, Ziwei Liu, Cheng Li, Kaidi Cao, Chen Change Loy (ICCV 2019)
Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images - Junbang Liang, Ming C. Lin (ICCV 2019)
Estimation of Body Mass Index from Photographs using Deep Convolutional Neural Networks - Adam Pantanowitz, Emmanuel Cohen, Philippe Gradidge, Nigel Crowther, Vered Aharonson, Benjamin Rosman, David M Rubin (arxiv 2019)
Video Interpolation and Prediction with Unsupervised Landmarks - Kevin J. Shih, Aysegul Dundar, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro (arxiv 2019)
Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop - [CODE] - Nikos Kolotouros, Georgios Pavlakos, Michael J. Black, Kostas Daniilidis (ICCV 2019)
DenseRaC: Joint 3D Pose and Shape Estimation by Dense Render-and-Compare - Yuanlu Xu, Song-Chun Zhu, Tony Tung (ICCV 2019)
Efficient Learning on Point Clouds with Basis Point Sets - [CODE] - Prokudin, Sergey and Lassner, Christoph and Romero, Javier (ICCV 2019)
TexturePose: Supervising Human Mesh Estimation with Texture Consistency - [CODE] - YGeorgios Pavlakos, Nikos Kolotouros, Kostas Daniilidis (ICCV 2019)
Towards Robust RGB-D Human Mesh Recovery - Ren Li, Changjiang Cai, Georgios Georgakis, Srikrishna Karanam, Terrence Chen, Ziyan Wu (Arxiv 2019)
CLOTH3D: Clothed 3D Humans - Hugo Bertiche, Meysam Madadi, Sergio Escalera(Arxiv 2019)
Learning 3D Human Shape and Pose from Dense Body Parts - Hongwen Zhang Jie Cao Guo Lu Wanli Ouyang Zhenan Sun (Arxiv 2019)
Learning from Synthetic Animals - Jiteng Mu, Weichao Qiu, Gregory Hager, Alan Yuille (Arxiv 2019)
Dressing for Diverse Body Shapes - Wei-Lin Hsiao, Kristen Grauman (Arxiv 2019)
Neural Human Video Rendering: Joint Learning of Dynamic Textures and Rendering-to-Video Translation - Lingjie Liu, Weipeng Xu, Marc Habermann, Michael Zollhoefer, Florian Bernard, Hyeongwoo Kim, Wenping Wang, Christian Theobalt(Arxiv 2020)
Chained Representation Cycling: Learning to Estimate 3D Human Pose and Shape by Cycling Between Representations - Nadine Rueegg, Christoph Lassner, Michael J. Black, Konrad Schindler (AAAI 2020)
The Whole Is Greater Than the Sum of Its Nonrigid Parts - Oshri Halimi, Ido Imanuel, Or Litany, Giovanni Trappolini, Emanuele Rodolà, Leonidas Guibas, Ron Kimmel (Arxiv 2020)
Particle Filter Based Monocular Human Tracking with a 3D Cardbox Model and a Novel Deterministic Resampling Strategy - Ziyuan Liu, Dongheui Lee, Wolfgang Sepp (Arxiv 2020)
PeelNet: Textured 3D reconstruction of human body using single view RGB image - Sai Sagar Jinka, Rohan Chacko, Avinash Sharma, P. J. Narayanan (Arxiv 2020)
VIBE: Video Inference for Human Body Pose and Shape Estimation - [CODE] - Muhammed Kocabas, Nikos Athanasiou, Michael J. Black (CVPR 2020)
Learning Nonparametric Human Mesh Reconstruction from a Single Image without Ground Truth Meshes - Kevin Lin, Lijuan Wang, Ying Jin, Zicheng Liu, Ming-Ting Sun (Arxiv 2020)
Hierarchical Kinematic Human Mesh Recovery - Georgios Georgakis, Ren Li, Srikrishna Karanam, Terrence Chen, Jana Kosecka, Ziyan Wu (Arxiv 2020)
Learning to Transfer Texture from Clothing Images to 3D Humans - Aymen Mir, Thiemo Alldieck, Gerard Pons-Moll (CVPR 2020)
The Virtual Tailor: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style - [CODE] - Chaitanya Patel, Zhouyingcheng Liao, Gerard Pons-Moll (CVPR 2020)
PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization - [CODE] - Shunsuke Saito, Tomas Simon, Jason Saragih, Hanbyul Joo (CVPR 2020)
SoftSMPL: Data-driven Modeling of Nonlinear Soft-tissue Dynamics for Parametric Humans - [Page] - Igor Santesteban, Elena Garces, Miguel A. Otaduy, Dan Casas (Eurographics 2020)
Learning 3D Human Shape and Pose from Dense Body Parts - [CODE] - Zhang, Hongwen and Cao, Jie and Lu, Guo and Ouyang, Wanli and Sun, Zhenan (Arxiv 2020)
Robust 3D Self-portraits in Seconds -Zhe Li, Tao Yu, Chuanyu Pan, Zerong Zheng, Yebin Liu (CVPR 2020)
ARCH: Animatable Reconstruction of Clothed Humans -Zeng Huang, Yuanlu Xu, Christoph Lassner, Hao Li, Tony Tung (CVPR 2020)
MulayCap: Multi-layer Human Performance Capture Using A Monocular Video Camera -Zhaoqi Su, Weilin Wan, Tao Yu, Lingjie Liu, Lu Fang, Wenping Wang, Yebin Liu (Arxiv 2020)
TetraTSDF: 3D human reconstruction from a single image with a tetrahedral outer shell -Hayato Onizuka, Zehra Hayirci, Diego Thomas, Akihiro Sugimoto, Hideaki Uchiyama, Rin-ichiro Taniguchi (Arxiv 2020)
Self-Supervised Human Depth Estimation from Monocular Videos -Feitong Tan, Hao Zhu, Zhaopeng Cui, Siyu Zhu, Marc Pollefeys, Ping Tan (CVPR 2020)
Learning to Dress 3D People in Generative Clothing -Qianli Ma, Jinlong Yang, Anurag Ranjan, Sergi Pujades, Gerard Pons-Moll, Siyu Tang, Michael J. Black (Arxiv 2020)
IMPLICIT FUNCTIONS IN FEATURE SPACE FOR 3D SHAPE RECONSTRUCTION AND COMPLETION - [CODE] -Julian Chibane, Thiemo Alldieck, Gerard Pons-Moll (CVPR 2020)
TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style - [CODE] -Chaitanya Patel, Zhouyingcheng Liao, Gerard Pons-Moll (CVPR 2020)
3D Human Mesh Regression With Dense Correspondence -Wang Zeng, Wanli Ouyang, Ping Luo, Wentao Liu, Xiaogang Wang (CVPR 2020)
Sequential 3D Human Pose and Shape Estimation From Point Clouds -Kangkan Wang, Jin Xie, Guofeng Zhang, Lei Liu, Jian Yang (CVPR 2020)
GHUM & GHUML: Generative 3D Human Shape and Articulated Pose Models -Hongyi Xu, Eduard Gabriel Bazavan, Andrei Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu;(CVPR 2020)
Object-Occluded Human Shape and Pose Estimation From a Single Color Image -Tianshu Zhang, Buzhen Huang, Yangang Wang (CVPR 2020)

Pose And Physics-Robotics

Learning Locomotion Skills Using DeepRL: Does the Choice of Action Space Matter? - [CODE] - Xue Bin Peng, Michiel van de Panne (Eurographics 2017)
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills - [CODE] - Xue Bin Peng, Pieter Abbeel, Sergey Levine, Michiel van de Panne (SIGGRAPH 2018)
SFV: Reinforcement Learning of Physical Skills from Videos - [CODE] - Xue Bin Peng, Angjoo Kanazawa, Jitendra Malik, Pieter Abbeel, Sergey Levine (SIGGRAPH Asia 2018)
Learning to Sit: Synthesizing Human-Chair Interactions via Hierarchical Control - [Video] - Yu-Wei Chao, Jimei Yang, Weifeng Chen, Jia Deng (Arxiv 2018)
AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos - [CODE] - Laura Smith, Nikita Dhawan, Marvin Zhang, Pieter Abbeel, Sergey Levine (Arxiv 2019)
pymanoid - [CODE]

Pose and Language-Speech-Reasoning-Semantics

Your body language may shape who you are (TED 2012)
Generating Animated Videos of Human Activities from Natural Language Descriptions - Angela S. Lin, Lemeng Wu, Rodolfo Corona, Kevin Tai, Qixing Huang, Raymond J. Mooney (NIPS 2018)
Neural Sign Language Translation - [CODE] - Necati Cihan Camgoz and Simon Hadfield and Oscar Koller and Hermann Ney and Richard Bowden (CVPR 2018)
Learning Individual Styles of Conversational Gesture - [CODE] - Shiry Ginosar, Amir Bar, Gefen Kohavi, Caroline Chan, Andrew Owens, Jitendra Malik (CVPR 2019)
HAKE: Human Activity Knowledge Engine - [CODE] - Yong-Lu Li, Liang Xu, Xijie Huang, Xinpeng Liu, Ze Ma, Mingyang Chen, Shiyi Wang, Hao-Shu Fang, Cewu Lu (Arxiv 2019)
Shape Evasion: Preventing Body Shape Inference of Multi-Stage Approaches - Hosnieh Sattar, Katharina Krombholz, Gerard Pons-Moll, Mario Fritz (Arxiv 2019)
Language2Pose: Natural Language Grounded Pose Forecasting - Chaitanya Ahuja, Louis-Philippe Morency (Arxiv 2019)
Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison - Dongxu Li, Cristian Rodriguez Opazo, Xin Yu, Hongdong Li (WACV 2020)
Motion Reasoning for Goal-Based Imitation Learning - De-An Huang, Yu-Wei Chao, Chris Paxton, Xinke Deng, Li Fei-Fei, Juan Carlos Niebles, Animesh Garg, Dieter Fox (Arxiv 2019)
Dancing to Music - [CODE] - Hsin-Ying Lee, Xiaodong Yang, Ming-Yu Liu, Ting-Chun Wang, Yu-Ding Lu, Ming-Hsuan Yang, Jan Kautz (NIPS 2019)
Skeleton based Zero Shot Action Recognition in Joint Pose-Language Semantic Space - Bhavan Jasani, Afshaan Mazagonwalla (Arxiv 2019)
Dressing for Diverse Body Shapes - Wei-Lin Hsiao, Kristen Grauman (Arxiv 2019)
Music-oriented Dance Video Synthesis with Pose Perceptual Loss - Xuanchi Ren, Haoran Li, Zijian Huang, Qifeng Chen (Arxiv 2019)
Music2Dance: Music-driven Dance Generation using WaveNet - Wenlin Zhuang, Congyi Wang, Siyu Xia, Jinxiang Chai, Yangang Wang (Arxiv 2020)

Pose-and-Action

RSA: Randomized Simulation as Augmentation for Robust Human Action Recognition - Yi Zhang, Xinyue Wei, Weichao Qiu, Zihao Xiao, Gregory D. Hager, Alan Yuille. (Arxiv 2019)
Simultaneous Implementation Features Extraction and Recognition Using C3DNetwork for WiFi-based Human Activity Recognition - Yafeng Liu et al. (Arxiv 2019)
Action Recognition via Pose-Based Graph Convolutional Networks with Intermediate Dense Supervision - Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu (Arxiv 2019)
Synthetic Humans for Action Recognition from Unseen Viewpoints - Gül Varol, Ivan Laptev, Cordelia Schmid, Andrew Zisserman (Arxiv 2019)
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs - Jingwei Ji, Ranjay Krishna, Li Fei-Fei, Juan Carlos Niebles (Arxiv 2019)
Mimetics: Towards Understanding Human Actions Out of Context - Philippe Weinzaepfel, Grégory Rogez (Arxiv 2019)
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong - Steven Schwarcz, Peng Xu, David D'Ambrosio, Juhana Kangaspunta, Anelia Angelova, Huong Phan, Navdeep Jaitly (Arxiv 2019)
Human Motion Anticipation with Symbolic Label - Julian Tanke, Andreas Weber, Juergen Gall (Arxiv 2019)
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition - Ziyu Liu, Hongwen Zhang, Zhenghao Chen, Zhiyong Wang, Wanli Ouyang (CVPR 2020)
SSHFD: Single Shot Human Fall Detection with Occluded Joints Resilience - Umar Asif, Stefan Von Cavallar, Jianbin Tang, Stefan Harre (Arxiv 2020)
Asynchronous Interaction Aggregation for Action Detection - Jiajun Tang, Jin Xia, Xinzhi Mu, Bo Pang, Cewu Lu (Arxiv 2020)
3DV: 3D Dynamic Voxel for Action Recognition in Depth Video - Yancheng Wang, Yang Xiao, Fu Xiong, Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan (CVPR 2020)

Video pose

Nonrigid Structure from Motion in Trajectory Space - Ijaz Akhter, Yaser Sheikh, Sohaib Khan and Takeo Kanade. (NIPS 2008)
Human Attributes from 3D Pose Tracking - Leonid Sigal, David J. Fleet, Nikolaus F. Troje, and Micha Livne. (ECCV 2010)
Pose from Flow and Flow from Pose - Katerina Fragkiadaki, Han Hu and Jianbo Shi . (CVPR 2013)
Recurrent Network Models for Human Dynamics - Katerina Fragkiadaki, Sergey Levine, Panna Felsen, Jitendra Malik . (ICCV 2015)
Personalizing Human Video Pose Estimation - James Charles, Tomas Pfister, Derek Magee, David Hogg, Andrew Zisserman . (CVPR 2016)
On human motion prediction using recurrent neural networks - Julieta Martinez, Michael J. Black, and Javier Romero. (CVPR 2017)
Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos - Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges. (CVPR 2017)
Deep Multitask Architecture for Integrated 2D and 3D Human Sensing - [CODE] - Alin-Ionut Popa and Mihai Zanfir and Cristian Sminchisescu. (CVPR 2017)
Rpan: An end-to-end recurrent pose-attention network for action recognition in videos - Wenbin Du, Yali Wang, Yu Qiao. (ICCV 2017)
Self-supervised Learning of Motion Capture - [CODE] - Hsiao-Yu Fish Tung, Hsiao-Wei Tung, Ersin Yumer, Katerina Fragkiadaki. (NIPS 2017)
Detect-and-Track: Efficient Pose Estimation in Videos, - [CODE] - Rohit Girdhar, Georgia Gkioxari, Lorenzo Torresani, Manohar Paluri and Du Tran. (CVPR 2018)
Neural Kinematic Networks for Unsupervised Motion Retargeting, - [CODE] - Ruben Villegas, Jimei Yang, Duygu Ceylan, Honglak Lee. (CVPR 2018)
2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning - [CODE] - Diogo C. Luvizon, David Picard, Hedi Tabia. (CVPR 2018)
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition, - [CODE] - Sijie Yan, Yuanjun Xiong, and Dahua Lin. (AAAI 2018)
QuaterNet: A Quaternion-based Recurrent Model for Human Motion - [CODE] - Dario Pavllo, David Grangier, and Michael Auli. (BMVC 2018)
Simple Baselines for Human Pose Estimation and Tracking - [CODE] - Bin Xiao, Haiping Wu, Yichen Wei. (ECCV 2018)
Exploiting temporal information for 3D pose estimation - Mir Rayat Imtiaz Hossain, James J. Little (ECCV 2018)
Learning 3D Human Pose from Structure and Motion - Dabral, R., Mundhada, A., Kusupati, U., Afaque, S., Sharma, A., & Jain, A. (ECCV 2018)
Propagating LSTM: 3D Pose Estimation based on Joint Interdependency - Kyoungoh Lee, Inwoong Lee, and Sanghoon Lee. (ECCV 2018)
Recovering Accurate 3D Human Pose in The Wild Using IMUs and a Moving Camera - Timo von Marcard, Roberto Henschel, Michael J. Black, Bodo Rosenhahn,and Gerard Pons-Moll (ECCV 2018)
Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World - [CODE] - [CODE] - Matteo Fabbri, Fabio Lanzi, Simone Calderara, Andrea Palazzi, Roberto Vezzani, and Rita Cucchiara (ECCV 2018)
SFV: Reinforcement Learning of Physical Skills from Videos - Xue Bin Peng, Angjoo Kanazawa, Jitendra Malik, Pieter Abbeel, Sergey Levine. (ACM SIGGRAPH Asia 2018)
3D human pose estimation in video with temporal convolutions and semi-supervised training - Dario Pavllo, Christoph Feichtenhofer, David Grangier, Michael Auli. (Arxiv 2018)
Human Motion Prediction via Learning Local Structure Representations and Temporal Dependencies - [CODE] - Xiao Guo, Jongmoo Choi. (AAAI 2019)
BiHMP-GAN: Bidirectional 3D Human Motion Prediction GAN - Jogendra Nath Kundu, Maharshi Gor, R. Venkatesh Babu. (AAAI 2019)
Bio-LSTM: A Biomechanically Inspired Recurrent Neural Network for 3D Pedestrian Pose and Gait Prediction - Xiao Guo, Jongmoo Choi. (Arxiv 2019)
Multi-person Articulated Tracking with Spatial and Temporal Embeddings - Sheng Jin, Wentao Liu, Wanli Ouyang, Chen Qian (CVPR 2019)
Learning Character-Agnostic Motion for Motion Retargeting in 2D - [CODE] - Kfir Aberman, Rundi Wu, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or. (SIGGRAPH 2019)
Exploiting temporal context for 3D human pose estimation in the wild - Anurag Arnab, Carl Doersch, Andrew Zisserman. (CVPR 2019)
Learning Temporal Pose Estimation from Sparsely-Labeled Videos - Gedas Bertasius, Christoph Feichtenhofer, Du Tran, Jianbo Shi, Lorenzo Torresani. (NIPS 2019)
Temporal Transformer Networks: Joint Learning of Invariant and Discriminative Time Warping - [CODE] - Suhas Lohit, Qiao Wang, Pavan Turaga. (CVPR 2019)
Unsupervised Learning of Object Structure and Dynamics from Videos - Matthias Minderer, Chen Sun, Ruben Villegas, Forrester Cole, Kevin Murphy, Honglak Lee (Arxiv 2019)
VRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction - Hongsong Wang, Jiashi Feng (Arxiv 2019)
Delving into 3D Action Anticipation from Streaming Videos - Hongsong Wang, Jiashi Feng (Arxiv 2019)
Sim2real transfer learning for 3D pose estimation: motion to the rescue - Carl Doersch, Andrew Zisserman (Arxiv 2019)
A-MAL: Automatic Motion Assessment Learning from Properly Performed Motions in 3D Skeleton Videos - Tal Hakim, Ilan Shimshoni (Arxiv 2019)
Learning Trajectory Dependencies for Human Motion Prediction - [CODE] - Wei Mao, Miaomiao Liu, Mathieu Salzmann, Hongdong Li (ICCV 2019)
Dynamic Kernel Distillation for Efficient Pose Estimation in Videos - Xuecheng Nie, Yuncheng Li, Linjie Luo, Ning Zhang, Jiashi Feng (ICCV 2019)
Imitation Learning for Human Pose Prediction - Borui Wang, Ehsan Adeli, Hsu-kuang Chiu, De-An Huang, Juan Carlos Niebles (ICCV 2019)
Symbiotic Graph Neural Networks for 3D Skeleton-based Human Action Recognition and Motion Prediction - Maosen Li, Siheng Chen, Xu Chen, Ya Zhang, Yanfeng Wang, Qi Tian (Arxiv 2019)
MultiPath: Multiple Probabilistic Anchor Trajectory Hypotheses for Behavior Prediction - Yuning Chai, Benjamin Sapp, Mayank Bansal, Dragomir Anguelov (CoRL 2019)
Structured Prediction Helps 3D Human Motion Modelling - [CODE] - Emre Aksan, Manuel Kaufmann, Otmar Hilliges (ICCV 2019)
Human Motion Prediction via Spatio-Temporal Inpainting - Alejandro Hernandez Ruiz, Juergen Gall, Francesc Moreno-Noguer (ICCV 2019)
Imitation Learning for Human Pose Prediction - Borui Wang, Ehsan Adeli, Hsu-kuang Chiu, De-An Huang, Juan Carlos Niebles (ICCV 2019)
Unsupervised learning of object structure and dynamics from videos - [CODE] - Matthias Minderer*, Chen Sun, Ruben Villegas, Forrester Cole, Kevin Murphy, Honglak Lee (NIPS 2019)
Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction - Abduallah Mohamed, Kun Qian, Mohamed Elhoseiny, Christian Claudel (CVPR 2020)
TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting - Yang, Zhuoqian and Zhu, Wentao and Wu, Wayne and Qian, Chen and Zhou, Qiang and Zhou, Bolei and Loy, Chen Change (CVPR 2020)
TITAN: Future Forecast using Action Priors - Srikanth Malla, Behzad Dariush, Chiho Choi (CVPR 2020)
Long-term Human Motion Prediction with Scene Context - Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, Jitendra Malik (Arxiv 2020)
Human Motion Transfer from Poses in the Wild - Jian Ren, Menglei Chai, Sergey Tulyakov, Chen Fang, Xiaohui Shen, Jianchao Yang (Arxiv 2020)
3D human pose estimation with adaptive receptive fields and dilated temporal convolutions - Michael Shin, Eduardo Castillo, Irene Font Peradejordi, Shobhna Jayaraman (Arxiv 2020)
TPNet: Trajectory Proposal Network for Motion Prediction - Liangji Fang, Qinhong Jiang, Jianping Shi, Bolei Zhou (Arxiv 2020)
Generative Tweening: Long-term Inbetweening of 3D Human Motions - Yi Zhou, Jingwan Lu, Connelly Barnes, Jimei Yang, Sitao Xiang, Hao li(Arxiv 2020)
Skeleton-Aware Networks for Deep Motion Retargeting - [CODE] - Kfir Aberman, Peizhuo Li, Dani Lischinski, Olga Sorkine-Hornung, Daniel Cohen-Or, Baoquan Chen (SIGGRAPH 2020)
Unpaired Motion Style Transfer from Video to Animation - [CODE] - Kfir Aberman, Yijia Weng, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen (SIGGRAPH 2020)

Real-time pose estimation

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields - [CODE] - Cao, Z., Simon, T., Wei, S., & Sheikh, Y. (CVPR 2017)
VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera - [CODE] - Mehta, Dushyant et al. (SIGGRAPH 2017)
RMPE: Regional Multi-person Pose Estimation - [CODE1][CODE2] - Fang, H., Xie, S., & Lu, C. (ICCV 2017)
Dense Human Pose Estimation In The Wild - [CODE] - Guler, R.A., Neverova, N., & Kokkinos, I. (CVPR 2018)
MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network - [CODE] - Guler, R.A., Neverova, N., & Kokkinos, I. (ECCV 2018)
Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose - [CODE] - Osokin, D. (Arxiv 2018)
- Extension to 3D pose estimation (based on Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB - Mehta, D., et al.) - [CODE]
Lightweight 3D Human Pose Estimation Network Training Using Teacher-Student Learning- Dong-Hyun Hwang, Suntae Kim, Nicolas Monet, Hideki Koike, Soonmin Bae (Arxiv 2020)

Hand-Face-landmark

Hand PointNet: 3D Hand Pose Estimation Using Point Sets - Liuhao Ge, Yujun Cai, Junwu Weng, Junsong Yuan (CVPR 2018)
Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks - [CODE] - Xuepeng Shi, Shiguang Shan, Meina Kan, Shuzhe Wu, Xilin Chen (CVPR 2018)
Hand Pose Estimation via Latent 2.5D Heatmap Networks Regression - Umar Iqbal , Pavlo Molchanov, Thomas Breuel, Juergen Gall, Jan Kautz1 (ECCV 2018)
H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions - Bugra Tekin, Federica Bogo, Marc Pollefeys (CVPR 2019)
3D Hand Shape and Pose from Images in the Wild - Adnane Boukhayma, Rodrigo de Bem, Philip H.S. Torr (Arxiv 2019)
3D Dense Face Alignment via Graph Convolution Networks - Huawei Wei, Shuang Liang, Yichen Wei (Arxiv 2019)
Disentangling Pose from Appearance in Monochrome Hand Images - Yikang Li, Chris Twigg, Yuting Ye, Lingling Tao, Xiaogang Wang (Arxiv 2019)
Single Image 3D Hand Reconstruction with Mesh Convolutions - Dominik Kulon, Haoyang Wang, Riza Alp Güler, Michael Bronstein, Stefanos Zafeiriou (Arxiv 2019)
Gesture-to-Gesture Translation in the Wild via Category-Independent Conditional Maps - Yahui Liu, Marco De Nadai, Gloria Zen, Nicu Sebe, Bruno Lepri (Arxiv 2019)
FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images - [CODE] - Christian Zimmermann, Duygu Ceylan, Jimei Yang, Bryan Russell, Max Argus, Thomas Brox (ICCV 2019)
Early Estimation of User's Intention of Tele-Operation Using Object Affordance and Hand Motion in a Dual First-Person Vision - Motoki Kojima, Jun Miura (Arxiv 2019)
aligning latent spaces for 3d hand pose estimation - Linlin Yang, Shile Li, Dongheui Lee, Angela Yao (ICCV 2019)
A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image -Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan (ICCV 2019)
Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison -Dongxu Li and Cristian Rodriguez Opazo and Xin Yu and Hongdong Li (arxiv 2019)
Deformation-aware Unpaired Image Translation for Pose Estimation on Laboratory Animals -Siyuan Li, Semih Günel, Mirela Ostrek, Pavan Ramdya, Pascal Fua, Helge Rhodin (arxiv 2020)
Monocular Real-time Hand Shape and Motion Capture using Multi-modal Data - [CODE] -Yuxiao Zhou, Marc Habermann, Weipeng Xu, Ikhsanul Habibie, Christian Theobalt, Feng Xu (CVPR 2020)
Balanced Alignment for Face Recognition: A Joint Learning Approach -Huawei Wei, Peng Lu, Yichen Wei (arxiv 2020)
Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction (arxiv 2020)
HandVoxNet: Deep Voxel-Based Network for 3D Hand Shape and Pose Estimation from a Single Depth Map -Jameel Malik, et,al (CVPR 2020)
Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild - [CODE] -Dominik Kulon, Riza Alp Güler, Iasonas Kokkinos, Michael Bronstein, Stefanos Zafeiriou (CVPR 2020)
Two-hand Global 3D Pose Estimation Using Monocular RGB -Fanqing Lin, Connor Wilhelm, Tony Martinez (arxiv 2020)
Leveraging Photometric Consistency over Time for Sparsely Supervised Hand-Object Reconstruction -[CODE] -Hasson, Yana and Tekin, Bugra and Bogo, Federica and Laptev, Ivan and Pollefeys, Marc and Schmid, Cordelia (CVPR 2020)
GanHand: Predicting Human Grasp Affordances in Multi-Object Scenes -Enric Corona, Albert Pumarola, Guillem Alenya, Francesc Moreno-Noguer, Gregory Rogez (CVPR 2020)
Weakly-Supervised Domain Adaptation via GAN and Mesh Model for Estimating 3D Hand Poses Interacting Objects -Seungryul Baek, Kwang In Kim, Tae-Kyun Kim (CVPR 2020)
JGR-P2O: Joint Graph Reasoning based Pixel-to-Offset Prediction Network for 3D Hand Pose Estimation from a Single Depth Image (ECCV 2020)

Datasets

2D

3D

Meshes

Surreal

Benchmarks

2D

MPII
COCO

3D

Workshops

Blog posts

Popular implementations

PyTorch

TensorFlow

Torch

Others

Todo

License

<a rel="license" href="http://creativecommons.org/licenses/by/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by/4.0/88x31.png" /></a> This work is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</a>.