Home

Awesome

TGRS-HRRSD-Dataset: High Resolution Remote Sensing Detection (HRRSD)

I. NOTE: JPEG files are available on BaiduCloud, GoogleDrive, and an ipv6 site bt.byr.cn

Moreover, this dataset is divided as several subsets, image numbers in each subset are 5401 for ‘train’, 5417 for ‘val’, and 10943 for ‘test’. And ‘train-val’ subset is a merge of ‘train’ and ‘val’.

II. Mean and Std

In most current object detection systems, means and std values of datasets are required.

You may refer to:

mean=[123.675, 116.28, 103.53], std=[58.395, 57.12, 57.375]

Moreover, you can compute the values by yourself with file OPT2017/avr_std_detection_sets.py:

$ cd your_HRRSD_path/OPT2017
$ python avr_std_detection_sets.py 500

III. Folders

Labels

Images

Dataset Division

IV. Statistics and Benchmark

Statistics

LabelNameN_TrainN_ValN_TrainvalN_TestN_AllMean Resized Scale /pixelResized Scale Std /pixel
1ship950948189819883886167.44110.37
2bridge11231121224423264570246.10110.53
3ground track field859856171720173734276.50100.65
4storage tank10991092219122154406125.6068.41
5basketball court923920184320333876108.1957.46
6tennis court10431040208322124295102.7138.80
7airplane12261222244824514899113.2167.98
8baseball diamond10071004201120224033231.61117.85
9harbor967964193119533884163.9694.16
10vehicle1188118623742382475641.969.99
11crossroad903901180422194023220.5459.24
12T junction10661065213122894420198.7154.88
13parking lot12411237247824804958122.8554.45

In this table, N_* refers to numbers of objects. 'Train', 'Val', 'Test' are three subsets of the dataset. 'Mean Resized Scale' shows average scale of each category. 'Resized Scale Std' is the standard deviation of category scale.

Benchmark

CategoryYOLO-v2 /%Fast R-CNN /%Fast_R-CNN_r50 + GACL-Net /%Faster R-CNN /%Faster_R-CNN_r50 + GACL-Net /%
Airplane84.683.385.190.890.8
Baseball Diamond62.283.682.686.987.2
Basketball Court41.336.742.147.949.7
Bridge79.075.176.785.585.6
Crossroad43.467.168.788.688.2
Ground Track Field94.490.089.690.690.7
Harbor74.476.078.489.489.7
Parking Lot45.837.539.563.365.3
Ship78.575.074.388.588.5
Storage Tank72.479.880.488.789.2
T Junction46.839.238.875.175.0
Tennis Court67.675.077.080.780.8
Vehicle65.146.150.784.086.9
Mean AP65.866.568.081.582.1

GACL-Net \cite{lu2020gated} is a method proposed to improve object localization performance. Title of this paper is "Gated and Axis-Concentrated Localization Network for Remote Sensing Object Detection". The suffix "r50" denotes using 50 layer Resnet.

V. FAQ

If any question is met, please contanct me with the e-mail: 1153463027@qq.com.

Qestion 1: AP for the "T junction" class is always NAN or 0, why?

Anwser Q1: In some object detection frameworks, there may be a piece of code like "cls_names = lower( cls_names )". This will set class names to lower case, but class names in xml files contain "T junction" where "T" is uppercase. This actually will cause several problems. The solution is using debug sofwares to find the code of changing word cases and correct it. For the dataset, I won't change the "T junction" labels in xmls currently for lacking time.

VI. Citation

If you find HRRSD dataset useful in your research, please consider citing:

@article{zhang2019hierarchical,
  title={Hierarchical and Robust Convolutional Neural Network for Very High-Resolution Remote Sensing Object Detection},
  author={Zhang, Yuanlin and Yuan, Yuan and Feng, Yachuang and Lu, Xiaoqiang},
  journal={IEEE Transactions on Geoscience and Remote Sensing},
  volume={57},
  number={8},
  pages={5535--5548},
  year={2019},
  publisher={IEEE}
}

For more comparative experimental results, please refer to:

@article{lu2020gated,
  title={Gated and Axis-Concentrated Localization Network for Remote Sensing Object Detection},
  author={Lu, Xiaoqiang and Zhang, Yuanlin and Yuan, Yuan and Feng, Yachuang},
  journal={IEEE Transactions on Geoscience and Remote Sensing},
  volume={58},
  number={1},
  pages={179--192},
  year={2020},
  publisher={IEEE}
}