Awesome

RepPoints V2: Verification Meets Regression for Object Detection

By Yihong Chen, Zheng Zhang, Yue Cao, Liwei Wang, Stephen Lin, Han Hu.

We provide supported codes and configuration files to reproduce "RepPoints V2: Verification Meets Regression for Object Detection" on COCO object detection and instance segmentation. Besides, this repo also includes improved results for RepPoints V1, Dense RepPoints (V1,V2). Our code is adapted from mmdetection.

Our paper has been accepted to the NeurIPS 2020!

Introduction

Verification and regression are two general methodologies for prediction in neural networks. Each has its own strengths: verification can be easier to infer accurately, and regression is more efficient and applicable to continuous target variables. Hence, it is often beneficial to carefully combine them to take advantage of their benefits. We introduce verification tasks into the localization prediction of RepPoints, producing RepPoints v2.

RepPoints v2 aims for object detection and it achieves 52.1 bbox mAP on COCO test-dev by a single model. Dense RepPoints v2 aims for instance segmentation and it achieves 45.9 bbox mAP and 39.0 mask mAP on COCO test-dev by using a ResNet-50 model.

Main Results

RepPoints V2

ResNe(X)ts:

Model	Multi-scale training	AP (minival)	AP (test-dev)	Link
RepPoints_V2_R_50_FPN_1x	No	40.9	---	Google / Baidu / Log
RepPoints_V2_R_50_FPN_GIoU_1x	No	41.1	41.3	Google / Baidu / Log
RepPoints_V2_R_50_FPN_GIoU_2x	Yes	43.9	44.4	Google / Baidu / Log
RepPoints_V2_R_101_FPN_GIoU_2x	Yes	45.8	46	Google / Baidu / Log
RepPoints_V2_R_101_FPN_dcnv2_GIoU_2x	Yes	47.7	48.1	Google / Baidu / Log
RepPoints_V2_X_101_FPN_GIoU_2x	Yes	47.3	47.8	Google / Baidu / Log
RepPoints_V2_X_101_FPN_dcnv2_GIoU_2x	Yes	49.3	49.4	Google / Baidu / Log

MobileNets:

Model	Multi-scale training	AP (minival)	AP (test-dev)	Link
RepPoints_V2_MNV2_c128_FPN_2x	Yes	36.8	---	Google / Baidu / Log
RepPoints_V2_MNV2_FPN_2x	Yes	39.4	---	Google / Baidu / Log

RepPoints V1

ResNe(X)ts:

Model	Multi-scale training	AP (minival)	AP (test-dev)	Link
RepPoints_V1_R_50_FPN_1x	No	38.8	---	Google / Baidu / Log
RepPoints_V1_R_50_FPN_GIoU_1x	No	39.9	---	Google / Baidu / Log
RepPoints_V1_R_50_FPN_GIoU_2x	Yes	42.7	---	Google / Baidu / Log
RepPoints_V1_R_101_FPN_GIoU_2x	Yes	44.4	---	Google / Baidu / Log
RepPoints_V1_R_101_FPN_dcnv2_GIoU_2x	Yes	46.6	---	Google / Baidu / Log
RepPoints_V1_X_101_FPN_GIoU_2x	Yes	46.3	---	Google / Baidu / Log
RepPoints_V1_X_101_FPN_dcnv2_GIoU_2x	Yes	48.3	---	Google / Baidu / Log

MobileNets:

Model	Multi-scale training	AP (minival)	AP (test-dev)	Link
RepPoints_V1_MNV2_c128_FPN_2x	Yes	35.7	---	Google / Baidu / Log
RepPoints_V1_MNV2_FPN_2x	Yes	37.8	---	Google / Baidu / Log

Dense Reppoints V2

Model	MS training	bbox AP (minival/test-dev)	mask AP (minival/test-dev)	Link
Dense_RepPoints_V2_R_50_FPN_1x	No	40.5/---	34.8/---	Google / Baidu / Log
Dense_RepPoints_V2_R_50_FPN_GIoU_1x	No	41.5/41.6	35.1/35.4	Google / Baidu / Log
Dense_RepPoints_V2_R_50_FPN_GIoU_3x	Yes	45.2/45.9	38.3/39.0	Google / Baidu / Log

Dense Reppoints V1

Model	MS training	bbox AP (minival/test-dev)	mask AP (minival/test-dev)	Link
Dense_RepPoints_V1_R_50_FPN_1x	No	39.9/---	33.7/---	Google / Baidu / Log
Dense_RepPoints_V1_R_50_FPN_GIoU_1x	No	40.9/41.1	34.2/34.5	Google / Baidu / Log
Dense_RepPoints_V1_R_50_FPN_GIoU_3x	Yes	44.5/45.0	37.4/38.0	Google / Baidu / Log

[1] GIoU means using GIoU loss instead of smooth-l1 loss for the regression branch, which we find improves the final performance.
[2] X-101 denotes ResNeXt-101-64x4d.
[3] 1x, 2x, 3x mean the model is trained for 12, 24 and 36 epochs, respectively.
[4] For multi-scale training, the shorter side of images is randomly chosen from [480, 960].
[5] dcnv2 denotes deformable convolutional networks v2.
[6] c128 denotes the model has 128 (instead of 256) channels in towers.
[7] We use syncbn, GIoU loss for the regression branch to train mobilenet v2 models by default.

Citation

@inproceedings{chen2020reppointsv2,
  title={RepPoints V2: Verification Meets Regression for Object Detection},
  author={Chen, Yihong and Zhang, Zheng and Cao, Yue and Wang, Liwei and Lin, Stephen and Hu, Han},
  booktitle={NeurIPS},
  year={2020}
}

@inproceedings{yang2019dense,
  title={Dense reppoints: Representing visual objects with dense point sets},
  author={Yang, Ze and Xu, Yinghao and Xue, Han and Zhang, Zheng and Urtasun, Raquel and Wang, Liwei and Lin, Stephen and Hu, Han},
  booktitle={ECCV},
  year={2020}
}

@inproceedings{yang2019reppoints,
  title={RepPoints: Point Set Representation for Object Detection},
  author={Yang, Ze and Liu, Shaohui and Hu, Han and Wang, Liwei and Lin, Stephen},
  booktitle={The IEEE International Conference on Computer Vision (ICCV)},
  month={Oct},
  year={2019}
}

Installation

Please refer to INSTALL.md for installation and dataset preparation.

Get Started

Please see GETTING_STARTED.md for the basic usage of MMDetection.

Inference

./tools/dist_test.sh configs/reppoints_v2/reppoints_v2_r50_fpn_giou_1x_coco.py work_dirs/reppoints_v2_r50_fpn_giou_1x_coco/epoch_12.pth 8 --eval bbox

Please note that:

If you are using other model, please change the config file and pretrained weights accordingly.

Train

./tools/dist_train.sh configs/reppoints_v2/reppoints_v2_r50_fpn_giou_1x_coco.py 8

Contributing to the project

Any pull requests or issues are welcome.