AirDet: Few-Shot Detection without Fine-tuning for Autonomous Exploration
Bowen Li, Chen Wang, Pranay Reddy, Seungchan Kim, and Sebastian Scherer*
European Conference on Computer Vision (ECCV2022)
Project website
Abstract
Few-shot object detection has attracted increasing attention and progressed rapidly in recent years. However, the exhaustive offline fine-tuning stage required by existing methods is time-consuming and significantly hinders their use in online applications, such as autonomous exploration by low-power robots. We find that their major limitation is that the limited but valuable information from a few support images is not fully exploited. To solve this problem, we propose a brand-new architecture, AirDet, and surprisingly find that, by learning class-agnostic relations with the support images in all modules, including the cross-scale object proposal network, shots aggregation module, and localization network, AirDet without fine-tuning achieves comparable or even better results than exhaustively fine-tuned methods, reaching up to 30-40% improvement. We also present solid results from onboard tests on real-world exploration data from the DARPA Subterranean Challenge, which strongly validate the feasibility of AirDet in robotics. To the best of our knowledge, AirDet is the first feasible few-shot detection method for autonomous exploration by low-power robots. The source code and pre-trained models are released.
Overview
We provide the official implementation here to reproduce the results without fine-tuning, using a ResNet-101 backbone, on:
- the COCO-2017 validation set
- the VOC-2012 validation set
Installation
Please create a Python environment with the following packages (a quick version check is sketched after this list):
Python 3.6.9
numpy 1.19.2
detectron2 0.2 (higher versions also work)
CUDA 10.2
PyTorch 1.5.1
Pillow 8.3.1
torchvision 0.6.0
fvcore 0.1.5
cv2 4.5.4
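To confirm your environment matches the list above, you can print the installed versions. This is a minimal sketch, not part of the repo; it skips fvcore, which may not expose `__version__`:

```python
# Minimal version check for the environment listed above.
import numpy, torch, torchvision, cv2, PIL, detectron2

print("numpy      :", numpy.__version__)         # expect 1.19.2
print("torch      :", torch.__version__)         # expect 1.5.1
print("torchvision:", torchvision.__version__)   # expect 0.6.0
print("cv2        :", cv2.__version__)           # expect 4.5.4
print("Pillow     :", PIL.__version__)           # expect 8.3.1
print("detectron2 :", detectron2.__version__)    # expect >= 0.2
print("CUDA (torch build):", torch.version.cuda) # expect 10.2
```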
We also provide the official Docker image (v4) for faster reproduction.
A ROS wrapper is also provided here.
Dataset Preparation
1. Download the official datasets
Expected dataset structure:

```
coco/
  annotations/
    instances_{train,val}2017.json
    person_keypoints_{train,val}2017.json
  {train,val}2017/
VOC2012/
  annotations/
    json files
  JPEGImages/
```
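Before moving on, a quick path check can save a failed run later. This is a minimal sketch; the `datasets/` root mirrors the layout above and is an assumption about where you placed the data:

```python
# Verify the expected dataset layout from the tree above.
from pathlib import Path

root = Path("datasets")  # assumed dataset root, matching step 2 below
expected = [
    "coco/annotations/instances_train2017.json",
    "coco/annotations/instances_val2017.json",
    "coco/train2017",
    "coco/val2017",
    "VOC2012/annotations",
    "VOC2012/JPEGImages",
]
for rel in expected:
    path = root / rel
    print(("OK     " if path.exists() else "MISSING"), path)
```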
2. Generate supports
Download and unzip the support annotations (COCO json files) from MEGA or BaiduNet (pwd: 1134) into:

```
datasets/
  coco/
    new_annotations/
```
Download and unzip the support annotations (VOC json files) from MEGA or BaiduNet (pwd: 1134) into:

```
datasets/
  voc/
    new_annotations/
```
Run the script:

```bash
cd datasets
bash generate_support_data.sh
```
You may modify lines 190, 213, and 269 of 4_gen_support_pool_10_shot.py to generate a different number of shots (the default is 1 shot). You can sanity-check the generated supports with the sketch below.
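After generation, counting annotations per category in the generated COCO-format json files is a quick way to verify the shot setting. This is a sketch; the exact file names under new_annotations/ are whatever generate_support_data.sh produced:

```python
# Count support annotations per category to verify the shot setting.
import json
from collections import Counter
from pathlib import Path

for path in sorted(Path("datasets/coco/new_annotations").glob("*.json")):
    with open(path) as f:
        data = json.load(f)
    # Standard COCO format: each annotation carries a category_id.
    per_class = Counter(a["category_id"] for a in data.get("annotations", []))
    shots = sorted(set(per_class.values()))
    print(f"{path.name}: {len(per_class)} classes, shots per class = {shots}")
```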
Reproduce
Base training
Download the base R-101 model and put it in /output.

Start training:

```bash
bash train.sh
```
We also provide the official trained model: MEGA / BaiduNet (pwd: 1134).
Put the model in /output/R101/.
Inference without fine-tuning
```bash
bash test.sh
```

The results will be written to /log; a sketch for pulling the AP summary lines out of the logs follows.
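To collect the headline numbers without scrolling through the full logs, you can grep the evaluation lines. This is a sketch; the `log` directory mirrors /log above, and the `*.txt` pattern and "copypaste:" marker (printed by detectron2's COCO evaluator) are assumptions about how the logs are written:

```python
# Pull COCO-style evaluation summary lines out of the test logs.
import re
from pathlib import Path

for log_file in sorted(Path("log").rglob("*.txt")):
    for line in log_file.read_text(errors="ignore").splitlines():
        if re.search(r"copypaste:|Average Precision", line):
            print(f"{log_file.name}: {line.strip()}")
```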
Citation
If our work motivates or helps your work, please cite us as:
```
@inproceedings{Li2022ECCV,
  author    = {Li, Bowen and Wang, Chen and Reddy, Pranay and Kim, Seungchan and Scherer, Sebastian},
  title     = {AirDet: Few-Shot Detection without Fine-tuning for Autonomous Exploration},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022}
}
```
Acknowledgement
Our code is built on top of FewX; we express our sincere gratitude to its authors.