Awesome 3D Body Papers

An awesome, curated list of papers about the 3D human body.

:point_right: Note: see the paper list sorted by year or by publication venue.

Table of Contents

Body Model
Body Pose
Naked Body Mesh
Clothed Body Mesh
Human Depth Estimation
Human Motion
Human-Object Interaction
Animation
Cloth/Try-On
Neural Rendering
Dataset

Body Model

SCAPE: Shape Completion and Animation of People. SIGGRAPH, 2005. [Page]

SMPL: A Skinned Multi-Person Linear Model. SIGGRAPH Asia, 2015. [Page] [Code]

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image. CVPR, 2019. [Page] [Code]

SoftSMPL: Data-driven Modeling of Nonlinear Soft-tissue Dynamics for Parametric Humans. Eurographics, 2020. [Page]

Modeling and Estimation of Nonlinear Skin Mechanics for Animated Avatars. Eurographics, 2020. [Page]

STAR: Sparse Trained Articulated Human Body Regressor. ECCV, 2020. [Page] [Code]

SUPR: A Sparse Unified Part-Based Human Representation. ECCV, 2022. [Page] [Code]

BLSM: A Bone-Level Skinned Model of the Human Mesh. ECCV, 2020. [Page]

Joint Optimization for Multi-Person Shape Models from Markerless 3D-Scans. ECCV, 2020. [Code]

GHUM & GHUML: Generative 3D Human Shape and Articulated Pose Models. CVPR (Oral), 2020. [Code]

PanoMan: Sparse Localized Components–based Model for Full Human Motions. ToG, 2021.

BASH: Biomechanical Animated Skinned Human for Visualization of Kinematics and Muscle Activity. GRAPP, 2021. [Code]

SMPLicit: Topology-aware Generative Model for Clothed People. CVPR, 2021. [Page] [Code]

NPMs: Neural Parametric Models for 3D Deformable Shapes. ArXiv, 2021. [Page]

LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human Bodies. 3DV, 2021. [Page] [Code]

LEAP: Learning Articulated Occupancy of People. CVPR, 2021. [Page] [Code]

SCALE: Modeling Clothed Humans with a Surface Codec of Articulated Local Elements. CVPR, 2021. [Page]

Body Pose

MotioNet: 3D Human Motion Reconstruction from Monocular Video with Skeleton Consistency. ToG, 2020. [Page] [Code]

VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera. SIGGRAPH, 2017. [Page] [Code]

XNect: Real-time Multi-person 3D Human Pose Estimation with a Single RGB Camera. SIGGRAPH, 2020. [Page] [Code]

PhysCap: Physically Plausible Monocular 3D Motion Capture in Real Time. SIGGRAPH Asia, 2020. [Page] [Code]

Neural Monocular 3D Human Motion Capture with Physical Awareness. SIGGRAPH, 2021. [Page] [Code]

PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation. CVPR (Oral), 2021. [Page] [Code]

Cascaded Deep Monocular 3D Human Pose Estimation with Evolutionary Training Data. CVPR, 2020. [Code]

PoseLifter: Absolute 3D Human Pose Lifting Network from a Single Noisy 2D Human Pose. ArXiv, 2020. [Code]

SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach. ECCV, 2020. [Code]

Probabilistic Monocular 3D Human Pose Estimation with Normalizing Flows. ICCV, 2021. [Code]

Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation. ICCV, 2021. [Code]

Learnable Triangulation of Human Pose. ICCV (Oral), 2019. [Code]

FLEX: Parameter-free Multi-view 3D Human Motion Reconstruction. ArXiv, 2021. [Page]

Weakly-supervised Cross-view 3D Human Pose Estimation. ArXiv, 2021.

High Fidelity 3D Reconstructions with Limited Physical Views. 3DV, 2021. [Page] [Code]

Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation. CVPR, 2020. [Code]

PandaNet: Anchor-Based Single-Shot Multi-Person 3D Pose Estimation. ArXiv, 2021.

SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation. ECCV, 2020. [Page] [Code]

PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation. WACV, 2021.

Monocular 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks. CVPR, 2021. [Code]

FCPose: Fully Convolutional Multi-Person Pose Estimation with Dynamic Instance-Aware Convolutions. CVPR, 2021. [Code]

End-to-End Estimation of Multi-Person 3D Poses from Multiple Cameras. ECCV (Oral), 2020.

Multi-person 3D Pose Estimation in Crowded Scenes Based on Multi-View Geometry. ArXiv, 2020. [Code]

Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo. CVPR, 2021. [Code]

Direct Multi-view Multi-person 3D Human Pose Estimation. NeurIPS, 2021. [Code]

Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views. CVPR, 2019. [Page] [Code]

Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views. TPAMI, 2021. [Page] [Code]

Temporal Smoothing for 3D Human Pose Estimation and Localization for Occluded People. ArXiv, 2020. [Code]

Attention Mechanism Exploits Temporal Contexts: Real-time 3D Human Pose Reconstruction. CVPR (Oral), 2020. [Code]

3D Human Pose Estimation with Spatial and Temporal Transformers. ArXiv, 2021. [Code]

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation. ArXiv, 2021. [Code]

Skeletor: Skeletal Transformers for Robust Body-Pose Estimation. ArXiv, 2021.

A Graph Attention Spatio-temporal Convolutional Network for 3D Human Pose Estimation in Video. ArXiv, 2020. [Page] [Code]

TriPose: A Weakly-Supervised 3D Human Pose Estimation via Triangulation from Video. ArXiv, 2021.

Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos. TIP, 2021.

Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-Learning. ICCV, 2021. [Code]

MeTRAbs: Metric-Scale Truncation-Robust Heatmaps for Absolute 3D Human Pose Estimation. T-BIOM, 2020. [Page] [Code]

PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective Crop Layers. CVPR, 2021.

Real-time Lower-body Pose Prediction from Sparse Upper-body Tracking Signals. ArXiv, 2021.

Context Modeling in 3D Human Pose Estimation: A Unified Perspective. CVPR, 2021.

CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild. CVPR, 2021.

Invariant Teacher and Equivariant Student for Unsupervised 3D Human Pose Estimation. AAAI, 2021. [Code]

Unsupervised 3D Human Pose Representation with Viewpoint and Pose Disentanglement. ECCV, 2020. [Code]

Neural MoCon: Neural Motion Control for Physically Plausible Human Motion Capture. CVPR, 2022. [Page]

MocapNET: Ensemble of SNN Encoders for 3D Human Pose Estimation in RGB Images. BMVC, 2019. [Code]

DOPE: Distillation Of Part Experts for whole-body 3D pose estimation in the wild. ECCV, 2020. [Code]

Residual Pose: A Decoupled Approach for Depth-based 3D Human Pose Estimation. IROS, 2020. [Code]

PoP-Net: Pose over Parts Network for Multi-Person 3D Pose Estimation from a Depth Image. ArXiv, 2020. [Code]

3D Human Reconstruction in the Wild with Collaborative Aerial Cameras. ArXiv, 2021. [Code]

Naked Body Mesh

Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image. ECCV, 2016. [Page] [Code]

Learning to Estimate 3D Human Pose and Shape from a Single Color Image. CVPR, 2018. [Page]

Neural Body Fitting: Unifying Deep Learning and Model Based Human Pose and Shape Estimation. 3DV (Oral), 2018. [Code]

Appearance Consensus Driven Self-Supervised Human Mesh Recovery. ECCV (Oral), 2020. [Page] [Code]

Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild. ICCV, 2019. [Page] [Code]

Learning 3D Human Shape and Pose from Dense Body Parts. ArXiv, 2019. [Page] [Code]

Heuristic Weakly Supervised 3D Human Pose Estimation in Novel Contexts without Any 3D Pose Ground Truth. ArXiv, 2021.

Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation. ArXiv, 2021.

Full-Body Awareness from Partial Observations. ECCV, 2020. [Page] [Code]

Object-Occluded Human Shape and Pose Estimation from a Single Color Image. CVPR, 2020. [Page] [Code]

PARE: Part Attention Regressor for 3D Human Body Estimation. ArXiv, 2021. [Page]

Occluded Human Mesh Recovery. CVPR, 2022. [Page]

Implicit 3D Human Mesh Recovery using Consistency with Pose and Shape from Unseen-view. CVPR, 2023.

Generative Approach for Probabilistic Human Mesh Recovery using Diffusion Models. ICCV, 2023. [Code]

3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data. NeurIPS, 2020.

Parametric Shape Estimation of Human Body under Wide Clothing. ACM MM, 2020. [Code]

Everybody Is Unique: Towards Unbiased Human Mesh Recovery. ArXiv, 2021.

3D Human Pose, Shape and Texture from Low-Resolution Images and Videos. ArXiv, 2021.

On Self-Contact and Human Pose. CVPR, 2021. [Page]

Probabilistic 3D Human Shape and Pose Estimation from Multiple Unconstrained Images in the Wild. CVPR, 2021.

Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation from Images in the Wild. ICCV, 2021. [Code]

Human Body Model Fitting by Learned Gradient Descent. ECCV, 2020. [Page]

End-to-end Recovery of Human Shape and Pose. CVPR, 2018. [Page] [Code]

Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop. ICCV, 2019. [Page] [Code]

Learning to Regress Bodies from Images using Differentiable Semantic Rendering. ICCV, 2021. [Page]

3D Human Mesh Regression with Dense Correspondence. CVPR, 2020. [Code]

Hierarchical Kinematic Human Mesh Recovery. ECCV, 2020. [Page]

I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image. ECCV, 2020. [Code]

MeshLifter: Weakly Supervised Approach for 3D Human Mesh Reconstruction from a Single 2D Pose Based on Loop Structure. Sensors, 2020. [Code]

Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose. ECCV, 2020. [Code]

PoseNet3D: Learning Temporally Consistent 3D Human Pose via Knowledge Distillation. 3DV, 2020.

Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation. ICCV, 2019. [Code]

Learning 3D Human Shape and Pose from Dense Body Parts. TPAMI, 2020. [Page] [Code]

Exemplar Fine-Tuning for 3D Human Pose Fitting Towards In-the-Wild 3D Human Pose Estimation. ArXiv, 2020. [Code]

HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation. CVPR, 2021. [Page] [Code]

Chasing the Tail in Monocular 3D Human Reconstruction with Prototype Memory. ArXiv, 2020.

Beyond Weak Perspective for Monocular 3D Human Pose Estimation. ArXiv, 2020.

PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop. ICCV (Oral), 2021. [Page] [Code]

KAMA: 3D Keypoint Aware Body Mesh Articulation. ArXiv, 2021.

SimPoE: Simulated Character Control for 3D Human Pose Estimation. CVPR (Oral), 2021. [Page]

SportsCap: Monocular 3D Human Motion Capture and Fine-grained Understanding in Challenging Sports Videos. IJCV, 2021. [Page] [Code]

Reconstructing 3D Human Pose by Watching Humans in the Mirror. CVPR (Oral), 2021. [Page] [Code]

CenterHMR: a Bottom-up Single-shot Method for Multi-person 3D Mesh Recovery from a Single Image. ArXiv, 2020. [Code]

Full-body motion capture for multiple closely interacting persons. CVM, 2020.

Coherent Reconstruction of Multiple Humans from a Single Image. CVPR, 2020. [Page] [Code]

Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image. ICCV, 2019. [Code]

Monocular, One-stage, Regression of Multiple 3D People. ArXiv, 2020. [Code]

Putting People in their Place: Monocular Regression of 3D People in Depth. CVPR, 2022. [Page] [Code]

TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments. CVPR, 2023. [Page] [Code]

GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras. CVPR (Oral), 2022. [Page] [Code]

Scene-Aware 3D Multi-Human Motion Capture. Eurographics, 2023. [Page] [Code]

Body Meshes as Points. CVPR, 2021. [Page] [Code]

Shape-aware Multi-Person Pose Estimation from Multi-View Images. ICCV, 2021. [Page] [Code]

Learning 3D Human Dynamics from Video. CVPR, 2019. [Page] [Code]

VIBE: Video Inference for Human Body Pose and Shape Estimation. CVPR, 2020. [Code]

3D Human Motion Estimation via Motion Compression and Refinement. ACCV (Oral), 2020. [Page] [Code]

Beyond Static Features for Temporally Consistent 3D Human Pose and Shape from a Video. CVPR, 2021. [Page] [Code]

End-to-End Human Pose and Mesh Reconstruction with Transformers. CVPR, 2021. [Code]

Video Inference for Human Mesh Recovery with Vision Transformer. IEEE Face and Gesture, 2023.

FastMETRO: Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers. ECCV, 2022. [Page] [Code]

A Lightweight Graph Transformer Network for Human Mesh Reconstruction from 2D Human Pose. ArXiv, 2021.

THUNDR: Transformer-based 3D HUmaN Reconstruction with Markers. ArXiv, 2021.

Human Mesh Recovery from Multiple Shots. ArXiv, 2020. [Page]

PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos. AAAI, 2021.

Self-Attentive 3D Human Pose and Shape Estimation from Videos. ArXiv, 2021.

Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video. CVPR, 2022. [Page] [Code]

Physics-based Human Motion Estimation and Synthesis from Videos. ICCV, 2021.

HuMoR: 3D Human Motion Model for Robust Pose Estimation. ICCV, 2021. [Page]

Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction. CVPR, 2021. [Page] [Code]

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation. TPAMI, 2022. [Page] [Code]

Out-of-Domain Human Mesh Reconstruction via Bilevel Online Adaptation. CVPR, 2021. [Page] [Code]

Learning Local Recurrent Models for Human Mesh Recovery. ArXiv, 2021.

Probabilistic Modeling for Human Mesh Recovery. ICCV, 2021. [Page] [Code]

Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation. ICCV, 2021. [Code]

Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies. CVPR (Oral), 2018. [Page]

Monocular Total Capture: Posing Face, Body and Hands in the Wild. CVPR (Oral), 2019. [Page] [Code]

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image. CVPR, 2019. [Page] [Code]

FrankMocap: A Fast Monocular 3D Hand and Body Motion Capture by Regression and Integration. ArXiv, 2020. [Page] [Code]

Monocular Expressive Body Regression through Body-Driven Attention. ECCV, 2020. [Page] [Code]

NeuralAnnot: Neural Annotator for in-the-wild Expressive 3D Human Pose and Mesh Training Sets. ArXiv, 2020. [Page]

Pose2Pose: 3D Positional Pose-Guided 3D Rotational Pose Prediction for Expressive 3D Human Pose and Mesh Estimation. ArXiv, 2020. [Page]

Monocular Real-time Full Body Capture with Inter-part Correlations. CVPR, 2021. [Page]

Collaborative Regression of Expressive Bodies using Moderation. ArXiv, 2021. [Page]

One-Stage 3D Whole-Body Mesh Recovery. CVPR, 2023. [Page] [Code]

Binarized 3D Whole-body Human Mesh Recovery. ArXiv, 2023. [Code]

Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras. ICCV, 2021. [Page]

Real-time RGBD-based Extended Body Pose Estimation. WACV, 2021. [Code]

SOMA: Solving Optical Marker-Based MoCap Automatically. ICCV, 2021. [Page]

TransPose: Real-time 3D Human Translation and Pose Estimation with Six Inertial Sensors. SIGGRAPH, 2021. [Page] [Code]

Physical Inertial Poser (PIP): Physics-aware Real-time Human Motion Tracking from Sparse Inertial Sensors. CVPR, 2022. [Page] [Code]

LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds. CVPR, 2022.

Clothed Body Mesh

LiveCap: Real-time Human Performance Capture from Monocular Video. SIGGRAPH, 2019. [Page]

DeepCap: Monocular Human Performance Capture Using Weak Supervision. CVPR (Oral), 2020. [Page]

MonoClothCap: Towards Temporally Coherent Clothing Capture from Monocular RGB Video. 3DV, 2020.

Human Performance Capture from Monocular Video in the Wild. 3DV, 2021. [Page] [Code]

MulayCap: Multi-layer Human Performance Capture Using A Monocular Video Camera. TVCG, 2020. [Page]

ChallenCap: Monocular 3D Capture of Challenging Human Performances using Multi-Modal References. CVPR, 2021.

TightCap: 3D Human Shape Capture with Clothing Tightness Field. ToG, 2021. [Page] [Code]

Deep Physics-aware Inference of Cloth Deformation for Monocular Human Performance Capture. ArXiv, 2020.

Video Based Reconstruction of 3D People Models. CVPR, 2018. [Page]

SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video. CVPR (Oral), 2022. [Page] [Code]

High-Fidelity Human Avatars from a Single RGB Camera. CVPR, 2022. [Page] [Code]

PatchShading: High-Quality Human Reconstruction by PatchWarping and Shading Refinement. ArXiv, 2022.

TotalSelfScan: Learning Full-body Avatars from Self-Portrait Videos of Faces, Hands, and Bodies. NeurIPS, 2022.

AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture. ECCV, 2022. [Page] [Code]

Capturing and Animation of Body and Clothing from Monocular Video. SIGGRAPH Asia, 2022. [Page] [Code]

DoubleFusion: Real-time Capture of Human Performance with Inner Body Shape from a Depth Sensor. CVPR (Oral), 2018. [Page] [Code]

SimulCap: Single-View Human Performance Capture with Cloth Simulation. CVPR, 2019. [Page]

RobustFusion: Human Volumetric Capture with Data-driven Visual Cues using a RGBD Camera. ECCV, 2020.

OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction. CVPR, 2022. [Page] [Code]

NormalGAN: Learning Detailed 3D Human from a Single RGB-D Image. ECCV, 2020. [Page]

Robust 3D Self-portraits in Seconds. CVPR (Oral), 2020. [Page]

TexMesh: Reconstructing Detailed Human Texture and Geometry from RGB-D Video. ECCV, 2020. [Page]

PINA: Learning a Personalized Implicit Neural Avatar from a Single RGB-D Video Sequence. CVPR, 2022. [Page] [Code]

Neural Deformation Graphs for Globally-consistent Non-rigid Reconstruction. CVPR (Oral), 2021. [Page]

Function4D: Real-time Human Volumetric Capture from Very Sparse Consumer RGBD Sensors. CVPR (Oral), 2021. [Page]

POSEFusion: Pose-guided Selective Fusion for Single-view Human Volumetric Capture. CVPR (Oral), 2021. [Page]

DSFN: Dynamic Surface Function Networks for Clothed Human Bodies. ArXiv, 2021. [Page] [Code]

Fast Generation of Realistic Virtual Humans. VRST, 2017. [Page]

Realistic Virtual Humans from Smartphone Videos. VRST, 2020. [Page]

DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras. ArXiv, 2021. [Page]

HDHumans: A Hybrid Approach for High-fidelity Digital Humans. ArXiv, 2022.

Learning to Reconstruct People in Clothing from a Single RGB Camera. CVPR, 2019. [Page] [Code]

SiCloPe: Silhouette-Based Clothed People. CVPR, 2019.

Tex2Shape: Detailed Full Human Body Geometry from a Single Image. ICCV, 2019. [Page] [Code]

Multi-Garment Net: Learning to Dress 3D People from Images. ICCV, 2019. [Page]

Image-Guided Human Reconstruction via Multi-Scale Graph Transformation Networks. TIP, 2021. [Page] [Code]

3DPeople: Modeling the Geometry of Dressed Humans. ICCV, 2019. [Page] [Code]

SIZER: A Dataset and Model for Parsing 3D Clothing and Learning Size Sensitive 3D Clothing. ECCV (Oral), 2020. [Page] [Code]

PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization. ICCV, 2019. [Page] [Code]

PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization. CVPR (Oral), 2020. [Page] [Code]

Geo-PIFu: Geometry and Pixel Aligned Implicit Functions for Single-view Human Reconstruction. NeurIPS, 2020. [Code]

ReFu: Refine and Fuse the Unobserved View for Detail-Preserving Single-Image 3D Human Reconstruction. ACM MM, 2022.

StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision. CVPR, 2021. [Page] [Code]

Total Scale: Face-to-Body Detail Reconstruction from Sparse RGBD Sensors. ArXiv, 2021.

Geometry-aware Two-scale PIFu Representation for Human Reconstruction. NeurIPS, 2022.

ARCH: Animatable Reconstruction of Clothed Humans. CVPR, 2020.

ARCH++: Animation-Ready Clothed Human Reconstruction Revisited. ICCV, 2021.

S3: Neural Shape, Skeleton, and Skinning Fields for 3D Human Modeling. CVPR, 2021.

Detailed Human Avatars from Monocular Video. 3DV, 2018. [Code]

Monocular Real-Time Volumetric Performance Capture. ECCV, 2020. [Page] [Code]

Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion. CVPR, 2020. [Page] [Code]

Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction. ECCV (Oral), 2020. [Page] [Code]

PaMIR: Parametric Model-Conditioned Implicit Representation for Image-based Human Reconstruction. TPAMI, 2020. [Page]

RIN: Textured Human Model Recovery and Imitation with a Single Image. ArXiv, 2020.

3D Human Avatar Digitization from a Single Image. VRCAI, 2019.

Detailed Avatar Recovery from Single Image. TPAMI, 2021.

High-Fidelity Clothed Avatar Reconstruction from a Single Image. CVPR, 2023. [Page] [Code]

SMPLicit: Topology-aware Generative Model for Clothed People. CVPR, 2021. [Page] [Code]

SCANimate: Weakly Supervised Learning of Skinned Clothed Avatar Networks. CVPR (Oral), 2021. [Page] [Code]

ICON: Implicit Clothed humans Obtained from Normals. CVPR, 2022. [Page] [Code]

ECON: Explicit Clothed humans Optimized via Normal integration. CVPR, 2023. [Page] [Code]

Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing. ICCV, 2021. [Page]

Reconstructing NBA Players. ECCV, 2020. [Page] [Code]

Capturing Detailed Deformations of Moving Human Bodies. ArXiv, 2021.

Towards Real-World Category-level Articulation Pose Estimation. CVPR, 2021. [Page]

gDNA: Towards Generative Detailed Neural Avatars. ArXiv, 2022. [Page]

Human Depth Estimation

Learning the Depths of Moving People by Watching Frozen People. CVPR, 2019. [Page] [Code]

A Neural Network for Detailed Human Depth Estimation from a Single Image. ICCV, 2019. [Code]

Self-Supervised Human Depth Estimation from Monocular Videos. CVPR, 2020. [Code]

DressNet: High Fidelity Depth Estimation of Dressed Humans from a Single View Image. ArXiv, 2021.

Learning High Fidelity Depths of Dressed Humans by Watching Social Media Dance Videos. CVPR (Oral), 2021. [Page] [Code]

Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging. CVPR, 2021. [Page] [Code]

Human Motion

3D Semantic Trajectory Reconstruction from 3D Pixel Continuum. CVPR, 2018. [Page]

Task-Generic Hierarchical Human Motion Prior using VAEs. ArXiv, 2021.

Convolutional Autoencoders for Human Motion Infilling. 3DV, 2020.

Robust Motion In-betweening. SIGGRAPH, 2020. [Page]

Single-Shot Motion Completion with Transformer. ArXiv, 2021. [Code]

Learning Compositional Representation for 4D Captures with Neural ODE. CVPR (Oral), 2021. [Page] [Code]

Graph Constrained Data Representation Learning for Human Motion Segmentation. ICCV, 2021.

Predicting 3D Human Dynamics from Video. ICCV, 2019. [Page] [Code]

Long-term Human Motion Prediction with Scene Context. ECCV (Oral), 2020. [Page] [Code]

Adversarial Refinement Network for Human Motion Prediction. ACCV, 2020.

Towards Accurate 3D Human Motion Prediction from Incomplete Observations. CVPR, 2021.

Aggregated Multi-GANs for Controlled 3D Human Motion Prediction. AAAI, 2021. [Code]

Flow-based Autoregressive Structured Prediction of Human Motion. ArXiv, 2021.

TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild. ArXiv, 2021. [Page]

Multi-level Motion Attention for Human Motion Prediction. ArXiv, 2021. [Code]

We are More than Our Joints: Predicting how 3D Bodies Move. CVPR, 2021. [Page]

Improving Human Motion Prediction Through Continual Learning. ArXiv, 2021.

MSR-GCN: Multi-Scale Residual Graph Convolution Networks for Human Motion Prediction. ICCV, 2021. [Code]

Stochastic Scene-Aware Motion Prediction. ICCV, 2021. [Page] [Code]

GIMO: Gaze-Informed Human Motion Prediction in Context. ArXiv, 2022.

Multiscale Spatio-Temporal Graph Neural Networks for 3D Skeleton-Based Motion Prediction. TIP, 2021.

Skeleton-Graph: Long-Term 3D Motion Prediction From 2D Observations Using Deep Spatio-Temporal Graph CNNs. ICCV (Workshop), 2021. [Code]

Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers. ICCV, 2021. [Code]

BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction. ArXiv, 2022. [Page] [Code]

Multi-Person 3D Motion Prediction with Multi-Range Transformers. NeurIPS, 2021. [Page]

Tracking People with 3D Representations. NeurIPS, 2021. [Page] [Code]

Tracking People by Predicting 3D Appearance, Location and Pose. CVPR, 2022. [Page] [Code]

Synthesizing Long-Term 3D Human Motion and Interaction in 3D. CVPR, 2021. [Page] [Code]

GlocalNet: Class-aware Long-term Human Motion Synthesis. WACV, 2021.

A Causal Convolutional Neural Network for Motion Modeling and Synthesis. ArXiv, 2021.

TrajeVAE - Controllable Human Motion Generation from Trajectories. ArXiv, 2021. [Page]

Action-Conditioned 3D Human Motion Synthesis with Transformer VAE. ArXiv, 2021. [Page]

Scene-aware Generative Network for Human Motion Synthesis. CVPR, 2021.

Learning a Family of Motor Skills from a Single Motion Clip. SIGGRAPH, 2021. [Page] [Code]

MUGL: Large Scale Multi Person Conditional Action Generation with Locomotion. WACV, 2022. [Page] [Code]

DualMotion: Global-to-Local Casual Motion Design for Character Animations. ArXiv, 2022.

Character Controllers using Motion VAEs. ToG, 2020. [Page] [Code]

Learn to Dance with AIST++: Music Conditioned 3D Dance Generation. ArXiv, 2021. [Page]

Learning Speech-driven 3D Conversational Gestures from Video. ArXiv, 2021.

DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer. ArXiv, 2021. [Page] [Code]

DanceAnyWay: Synthesizing Mixed-Genre 3D Dance Movements Through Beat Disentanglement. ArXiv, 2023.

Rhythm is a Dancer: Music-Driven Motion Synthesis with Global Structure. ArXiv, 2021.

Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory. CVPR, 2022. [Code]

Human-Object Interaction

Perceiving 3D Human-Object Spatial Arrangements from a Single Image in the Wild. ECCV, 2020. [Page] [Code]

Resolving 3D Human Pose Ambiguities with 3D Scene Constraints. ICCV, 2019. [Page] [Code]

GRAB: A Dataset of Whole-Body Human Grasping of Objects. ECCV, 2020. [Page] [Code]

Gravity-Aware Monocular 3D Human-Object Reconstruction. ICCV, 2021. [Page] [Code]

CHORE: Contact, Human and Object REconstruction from a single RGB image. ECCV, 2022. [Page] [Code]

InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction. GCPR, 2022. [Page] [Code]

BEHAVE: Dataset and Method for Tracking Human Object Interactions. CVPR, 2022. [Page] [Code]

FLEX: Full-Body Grasping Without Full-Body Grasps. ArXiv, 2022. [Page] [Code]

Populating 3D Scenes by Learning Human-Scene Interaction. CVPR, 2021. [Page] [Code]

Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors. CVPR, 2021. [Page]

Holistic 3D Human and Scene Mesh Estimation from Single View Images. CVPR, 2021.

Soft Walks: Real-Time, Two-Ways Interaction between a Character and Loose Grounds. Eurographics, 2021.

RobustFusion: Robust Volumetric Performance Reconstruction under Human-object Interactions from Monocular RGBD Stream. TPAMI, 2021.

Animation

Predicting Animation Skeletons for 3D Articulated Models via Volumetric Nets. 3DV (Oral), 2019. [Page] [Code]

RigNet: Neural Rigging for Articulated Characters. SIGGRAPH, 2020. [Page] [Code]

HeterSkinNet: A Heterogeneous Network for Skin Weights Prediction. I3D, 2021.

Skeleton-Aware Networks for Deep Motion Retargeting. SIGGRAPH, 2020. [Page] [Code]

Contact-Aware Retargeting of Skinned Motion. ICCV, 2021.

Motion Retargetting based on Dilated Convolutions and Skeleton-specific Loss Functions. Eurographics, 2020. [Page] [Code]

Flow Guided Transformable Bottleneck Networks for Motion Retargeting. CVPR, 2021.

Functionality-Driven Musculature Retargeting. CGF, 2020. [Page] [Code]

A Deep Emulator for Secondary Motion of 3D Characters. CVPR (Oral), 2021. [Page]

DeePSD: Automatic Deep Skinning And Pose Space Deformation For 3D Garment Animation. ArXiv, 2020.

UniCon: Universal Neural Controller For Physics-based Character Motion. ArXiv, 2020. [Page]

Learning Skeletal Articulations With Neural Blend Shapes. SIGGRAPH, 2021. [Page] [Code]

Temporal Parameter-free Deep Skinning of Animated Meshes. CGI, 2021. [Page]

Cloth/Try-On

DeepWrinkles: Accurate and Realistic Clothing Modeling. ECCV (Oral), 2018.

Wallpaper Pattern Alignment along Garment Seams. SIGGRAPH, 2019. [Page]

Reflection Symmetry in Textured Sewing Patterns. VMV, 2019. [Page]

Deep Fashion3D: A Dataset and Benchmark for 3D Garment Reconstruction from Single-view Images. ECCV (Oral), 2020. [Page]

REC-MV: REconstructing 3D Dynamic Cloth from Monocular Videos. CVPR, 2023. [Page] [Code]

Garment4D: Garment Reconstruction from Point Cloud Sequences. NeurIPS, 2021. [Page] [Code]

TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style. CVPR (Oral), 2020. [Page] [Code]

Learning-Based Animation of Clothing for Virtual Try-On. Eurographics, 2019. [Page] [Code]

Detail-aware Deep Clothing Animations Infused with Multi-source Attributes. ArXiv, 2021.

Self-Supervised Collision Handling via Generative 3D Garment Models for Virtual Try-On. CVPR, 2021. [Page]

Physically Based Neural Simulator for Garment Animation. ArXiv, 2020.

P-Cloth: Interactive Complex Cloth Simulation on Multi-GPU Systems using Dynamic Matrix Assembly and Pipelined Implicit Integrators. SIGGRAPH Asia, 2020. [Page] [Code]

Neural Cloth Simulation. SIGGRAPH Asia, 2022. [Page] [Code]

N-Cloth: Predicting 3D Cloth Deformation with Mesh-Based Networks. Eurographics, 2022. [Page]

Deep Deformation Detail Synthesis for Thin Shell Models. ArXiv, 2021.

DeepCloth: Neural Garment Representation for Shape and Style Editing. ArXiv, 2020. [Page]

3D Custom Fit Garment Design with Body Movement. ArXiv, 2021.

Dynamic Neural Garments. SIGGRAPH Asia, 2021. [Page] [Code]

Motion Guided Deep Dynamic 3D Garments. SIGGRAPH Asia, 2022. [Page] [Code]

DiffCloth: Differentiable Cloth Simulation with Dry Frictional Contact. ArXiv, 2021.

Example-based Real-time Clothing Synthesis for Virtual Agents. ArXiv, 2021.

BCNet: Learning Body and Cloth Shape from a Single Image. ECCV, 2020. [Code]

3D Clothed Human Reconstruction in the Wild. ECCV, 2022. [Code]

Robust 3D Garment Digitization from Monocular 2D Images for 3D Virtual Try-On Systems. ArXiv, 2021.

DIG: Draping Implicit Garment over the Human Body. ACCV, 2022. [Page] [Code]

Registering Explicit to Implicit: Towards High-Fidelity Garment Mesh Reconstruction from Single Images. CVPR, 2022. [Page] [Code]

PERGAMO: Personalized 3D Garments from Monocular Video. SCA, 2022. [Page] [Code]

Fully Convolutional Graph Neural Networks for Parametric Virtual Try-On. SCA, 2020. [Page]

ULNeF: Untangled Layered Neural Fields for Mix-and-Match Virtual Try-On. NeurIPS, 2022. [Page]

SNUG: Self-Supervised Neural Dynamic Garments. CVPR (Oral), 2022. [Page] [Code]

Neural 3D Clothes Retargeting from a Single Image. ArXiv, 2021.

Neural Rendering

Neural3D: Light-weight Neural Portrait Scanning via Context-aware Correspondence Learning. ACM MM, 2020.

Multi-view Neural Human Rendering. CVPR, 2020. [Page] [Code]

NeuralHumanFVV: Real-Time Neural Volumetric Human Performance Rendering using RGB Cameras. CVPR, 2021.

LookinGood^π: Real-time Person-independent Neural Re-rendering for High-quality Human Performance Capture. ArXiv, 2021.

Few-shot Neural Human Performance Rendering from Sparse RGBD Videos. ArXiv, 2021.

ANR: Articulated Neural Rendering for Virtual Avatars. ArXiv, 2020. [Page]

SMPLpix: Neural Avatars from 3D Human Models. WACV, 2020. [Page] [Code]

Vid2Actor: Free-viewpoint Animatable Person Synthesis from Video in the Wild. ArXiv, 2020. [Page]

InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds. ArXiv, 2022. [Page] [Code]

RANA: Relightable Articulated Neural Avatars. ArXiv, 2022. [Page]

Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans. CVPR, 2021. [Page] [Code]

Efficient Neural Radiance Fields with Learned Depth-Guided Sampling. ArXiv, 2021. [Page]

Neural Actor: Neural Free-view Synthesis of Human Actors with Pose Control. ArXiv, 2021.

StylePeople: A Generative Model of Fullbody Human Avatars. CVPR, 2021. [Page] [Code]

A-NeRF: Surface-free Human 3D Pose Refinement via Neural Rendering. ArXiv, 2021. [Page]

D-NeRF: Neural Radiance Fields for Dynamic Scenes. CVPR, 2021. [Page]

HumanNeRF: Generalizable Neural Human Radiance Field from Sparse Inputs. CVPR, 2022. [Page] [Code]

Neural Articulated Radiance Field. ArXiv, 2021. [Code]

Animatable Neural Radiance Fields for Human Body Modeling. ArXiv, 2021. [Page] [Code]

Editable Free-viewpoint Video Using a Layered Neural Representation. SIGGRAPH, 2021. [Page]

UV Volumes for Real-time Rendering of Editable Free-view Human Performance. ArXiv, 2022. [Page] [Code]

Neural Free-Viewpoint Performance Rendering under Complex Human-object Interactions. ArXiv, 2021.

MoCo-Flow: Neural Motion Consensus Flow for Dynamic Humans in Stationary Monocular Cameras. ArXiv, 2021.

Rotationally-Temporally Consistent Novel-View Synthesis of Human Performance Video. ECCV, 2020. [Code]

Human View Synthesis using a Single Sparse RGB-D Input. ArXiv, 2021. [Page]

Neural Human Performer: Learning Generalizable Radiance Fields for Human Performance Rendering. ArXiv, 2021. [Page]

HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video. ArXiv, 2022. [Page]

Dual-Space NeRF: Learning Animatable Avatars and Scene Lighting in Separate Spaces. 3DV, 2022.

NeuMan: Neural Human Radiance Field from a Single Video. ECCV, 2022. [Code]

Structured Local Radiance Fields for Human Avatar Modeling. CVPR, 2022. [Page]

Animatable Neural Implicit Surfaces for Creating Avatars from Videos. ICCV, 2021. [Page] [Code]

DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Reconstruction and Rendering. CVPR, 2022. [Page]

Human Performance Modeling and Rendering via Neural Animated Mesh. SIGGRAPH Asia, 2022. [Page] [Code]

Dataset

3DPW: Recovering Accurate 3D Human Pose in The Wild Using IMUs and a Moving Camera. ECCV, 2018. [Page]

AMASS: Archive of Motion Capture as Surface Shapes. ICCV, 2019. [Page] [Code]

3DBodyTex: Textured 3D Body Dataset. 3DV, 2018. [Page]

Motion Capture from Internet Videos. ECCV (Oral), 2020. [Page] [Code]

3DPeople: Modeling the Geometry of Dressed Humans. ICCV, 2019. [Page] [Code]

Full-Body Awareness from Partial Observations. ECCV, 2020. [Page] [Code]

Object-Occluded Human Shape and Pose Estimation from a Single Color Image. CVPR, 2020. [Page] [Code]

HUMBI: A Large Multiview Dataset of Human Body Expressions. CVPR, 2020. [Page] [Code]

SMPLy Benchmarking 3D Human Pose Estimation in the Wild. 3DV (Oral), 2020. [Page]

Reconstructing 3D Human Pose by Watching Humans in the Mirror. CVPR (Oral), 2021. [Page] [Code]

HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling. ECCV (Oral), 2022. [Page]

AGORA: Avatars in Geography Optimized for Regression Analysis. CVPR, 2021. [Page]

BABEL: Bodies, Action and Behavior with English Labels. CVPR, 2021. [Page]

BEHAVE: Dataset and Method for Tracking Human Object Interactions. CVPR, 2022. [Page] [Code]
