Home

Awesome

DOTA-C

This repository introduces a few image corruptions for two novel benchmarks to evaluate models' robustness in aerial object detection. All these corruptions are applied to DOTA-v1.0 test set and not used as data augmentation strategies in models' training phase.

Corruptions form ImageNet-C

The first benchmark encompasses 19 prevalent corruptions. For more detailed information, you may refer to the paper on the original corruption package authored by Hendrycks and Dietterich: Benchmarking neural network robustness to common corruptions and perturbations.

Clouds

The second benchmark focuses on cloud-corrupted images—a phenomenon uncommon in natural pictures yet frequent in aerial photography. Process 1 represents "Cloud Self-Subtraction" and process 2 represents "Cloud Addition-to-Scene". The detailed principle of this data processing method can be referred to Cloudy Image Arithmetic: A Cloudy Scene Synthesis Paradigm With an Application to Deep-Learning-Based Thin Cloud Removal.

Models' Results

Note that, unless explicitly stated, the backbone of all models is ResNet-50.

MethodReference$\mathrm{AP}^{\text {clean}}_{50}$mPCrPC (%)$\mathrm{AP}^{\text {clouds}}_{50}$rPC<sub>clouds</sub> (%)
Rotated Faster R-CNNRen et al.73.438.752.758.579.7
RoI TransformerDing et al.76.139.752.160.078.9
Oriented R-CNNXie et al.75.740.453.460.680.1
ReDetHan et al.76.745.659.566.286.3
Rotated RetinaNetLin et al.68.437.154.255.180.6
Rotated FCOSTian et al.71.338.654.257.580.7
R<sup>3</sup>DetYang et al.69.837.653.856.781.2
S<sup>2</sup>A-NetHan et al.73.939.553.459.380.2
RoI Transformer (backbone=ConvNeXt-T)Liu et al.75.047.563.364.586.0
RoI Transformer (backbone=Swin-T)Liu et al.77.543.155.662.881.1
RoI Transformer (backbone=Swin-S)Liu et al.77.144.357.463.382.1
RoI Transformer (backbone=Swin-B)Liu et al.77.744.857.664.883.5
RoI Transformer (backbone=Swin-L)Liu et al.77.647.561.266.785.9
RoI Transformer (augmentation=RandomRotate)76.440.953.661.380.2
RoI Transformer (augmentation=Mosaic)Bochkovskiy et al.74.438.852.259.680.1

Citing

If you make use of the data in DOTA-C, please cite our following paper:

@inproceedings{xia2018dota,
  title={DOTA: A large-scale dataset for object detection in aerial images},
  author={Xia, Gui-Song and Bai, Xiang and Ding, Jian and Zhu, Zhen and Belongie, Serge and Luo, Jiebo and Datcu, Mihai and Pelillo, Marcello and Zhang, Liangpei},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={3974--3983},
  year={2018}
}

@misc{he2023robustness,
      title={On the Robustness of Object Detection Models in Aerial Images}, 
      author={Haodong He and Jian Ding and Gui-Song Xia},
      year={2023},
      eprint={2308.15378},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}