Awesome
SGP: Self-supervised Geometric Perception
[CVPR 2021 Oral] Self-supervised Geometric Perception https://arxiv.org/abs/2103.03114
Introduction
In short, SGP is, to the best of our knowledge, the first general framework for feature learning in geometric perception without any supervision from ground-truth geometric labels.
SGP runs in an EM fashion. It iteratively performs robust estimation of the geometric models to generate pseudo-labels, and feature learning under the supervision of the noisy pseudo-labels.
<img src="assets/overview.png" alt="overview" width="600"/>We applied SGP to camera pose estimation and point cloud registration, demonstrating performance that is on par or even superior to supervised oracles in large-scale real datasets.
Camera pose estimation
Deep image features like CAPS can be trained with relative pose labels generated by 5pt-RANSAC, bootstraped with the handcrafted SIFT feature. They can be later used in robust relative camera pose estimation.
<div float="left"> <img src="assets/caps-megadepth.png" width="350" /> <img src="assets/caps-scannet.png" width="350" /> </div>Point cloud registration
Deep 3D features like FCGF can be trained with relative pose labels generated by 3pt-RANSAC, bootstraped by the handcrafted FPFH feature. They can be later used in robust point cloud registration.
<div float="left"> <img src="assets/fpfh-3dmatch.png" width="350" /> <img src="assets/fcgf-3dmatch.png" width="350" /> </div>Code
Please see code/
for detailed intructions about how to use the code base.
Citation
@inproceedings{yang2021sgp,
title={Self-supervised Geometric Perception},
author={Yang, Heng and Dong, Wei and Carlone, Luca and Koltun, Vladlen},
booktitle={CVPR},
year={2021}
}