Awesome

Deformable ConvNets is initially described in an arxiv tech report.

R-FCN is initially described in a NIPS 2016 paper.

Soft-NMS is initially described in an arxiv tech report.

Our goal was to test Soft-NMS with a state-of-the-art detector, so Deformable-R-FCN was trained on 800x1200 size images with 15 anchors. Multi-Scale testing was also added with 6 scales. Union of all boxes at each scale was computed before performing NMS. Please note that the repository does not include the scripts for multi-scale testing as I just cache the boxes for each different scale and do NMS separately. The scales used in multi-scale testing were as follows, [(480, 800), (576,900), (688, 1100), (800,1200), (1200, 1600), (1400, 2000)].

The trained model can be downloaded from here.

	<sub>training data</sub>	<sub>testing data</sub>	<sub>mAP</sub>	<sub>mAP@0.5</sub>	<sub>mAP@0.75</sub>	<sub>mAP@S</sub>	<sub>mAP@M</sub>	<sub>mAP@L</sub>	<sub>Recall</sub>
<sub>Baseline D-R-FCN</sub>	<sub>coco trainval</sub>	<sub>coco test-dev</sub>	35.7	56.8	38.3	15.2	38.8	51.5
<sub>D-R-FCN, ResNet-v1-101, NMS</sub>	<sub>coco trainval</sub>	<sub>coco test-dev</sub>	37.4	59.6	40.2	17.8	40.6	51.4	48.3
<sub>D-R-FCN, ResNet-v1-101, SNMS</sub>	<sub>coco trainval</sub>	<sub>coco test-dev</sub>	38.4	60.1	41.6	18.5	41.6	52.5	53.8
<sub>D-R-FCN, ResNet-v1-101, MST, NMS</sub>	<sub>coco trainval</sub>	<sub>coco test-dev</sub>	39.8	62.4	43.3	22.6	42.3	52.2	52.9
<sub>D-R-FCN, ResNet-v1-101, MST, SNMS</sub>	<sub>coco trainval</sub>	<sub>coco test-dev</sub>	40.9	62.8	45.0	23.3	43.6	53.3	60.4