Home

Awesome

Object Detection in Aerial Images Awesome

🔥 A curated list of awesome resources for generic object detection in aerial images.

:heavy_exclamation_mark: Updated at 2024-06.


<!--TOC-->

Contents:


Overview

No.YearPub.TitleLinks
102023GRSMRemote Sensing Object Detection Meets Deep Learning: A Meta-review of Challenges and Advances <br> <sub><sup>Xiangrong Zhang, Tianyang Zhang, Guanchun Wang, Peng Zhu, Xu Tang, Xiuping Jia, Licheng Jiao</sup></sub>Paper/Code
092023PAMITowards Large-Scale Small Object Detection: Survey and Benchmarks <br> <sub><sup>Gong Cheng, Xiang Yuan, Xiwen Yao, Kebing Yan, Qinghua Zeng, Xingxing Xie, Junwei Han</sup></sub>Paper/Data
082023arXivOriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey <br> <sub><sup>Kun Wang, Zi Wang, Zhang Li, Ang Su, Xichao Teng, Minhao Liu, Qifeng Yu</sup></sub>Paper/Code
072021PAMI<span style="white-space:nowrap;">Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges DOTA   </span> <br><sub><sup>Jian Ding, Nan Xue, Gui-Song Xia, Xiang Bai, et al.</sup></sub>Paper/Proj
062020IJCVDeep Learning for Generic Object Detection: A Survey GOD <br><sub><sup>Li Liu, Wanli Ouyang, et al.</sup></sub>Paper/Code
052020JPRSObject detection in optical remote sensing images: A survey and a new benchmark HBB <br><sub><sup>Ke Li, Gong Cheng, et al.</sup></sub>Paper/Data
042019TNNLSObject Detection With Deep Learning: A Review GOD <br><sub><sup>Zhongqiu Zhao, et al.</sup></sub>Paper/Code
032018JPRSA review of accuracy assessment for object-based image analysis: From per-pixel to per-polygon approaches <br><sub><sup>Su Ye, Rahul Rakshit, et al.</sup></sub>Paper/Code
022018CVPRDOTA: A Large-Scale Dataset for Object Detection in Aerial Images DOTA <br><sub><sup>Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, et al.</sup></sub>Paper/Code
012016JPRSA survey on object detection in optical remote sensing images <br><sub><sup>Gong Cheng, Junwei Han</sup></sub>Paper/Code

Oriented Object Detection

<!-- Dataset: DOTA, HRSC2016, ICDAR2015, ICDAR2017 MLT, MSRA-TD500, UCAS-AOD, FDDB, OHD-SJTU, SSDD++, Total-Text. -->

Preprint

YearPub.TitleLinks
2024arXivGOOD: Towards Domain Generalized Orientated Object Detection <br><sub><sup>Qi Bi, Beichen Zhou, Jingjun Yi, Wei Ji, Haolan Zhan, Gui-Song Xia</sup></sub>Paper/Code
2023arXivP2RBox: A Single Point is All You Need for Oriented Object Detection <br><sub><sup>Guangming Cao, Xuehui Yu, Wenwen Yu, Xumeng Han, Xue Yang, Guorong Li, Jianbin Jiao, Zhenjun Han</sup></sub>Paper/Code
2022arXivDAFNe: A One-Stage Anchor-Free Deep Model for Oriented Object Detection <br><sub><sup>Steven Lang, et al.</sup></sub>Paper/Code
<!-- **No.** | **Pub.** | **Title** | **Authors** | **Links** :-: | :-: | :- | :- | :- 2022 | WACV | TricubeNet: 2D Kernel-Based Object Representation for Weakly-Occluded Oriented Object Detection | Beomyoung Kim, Janghyeon Lee, et al. | [Paper](https://arxiv.org/abs/2104.11435)/Code 2022 | TGRS | MRDet: A Multi-Head Network for Accurate Oriented Object Detection in Aerial Images | Ran Qin, Yunhong Wang, et al. | [Paper](https://arxiv.org/pdf/2012.13135.pdf)/Code 2021 | TGRS | CFC-Net: A Critical Feature Capturing Network for Arbitrary-Oriented Object Detection in Remote Sensing Images | Qi Ming, et al. | [Paper](https://arxiv.org/pdf/2101.06849.pdf)/[Code](https://github.com/ming71/CFC-Net) 2021 | TGRS | Align Deep Features for Oriented Object Detection | Jiaming Han, Jian Ding, et al. | [Paper](https://arxiv.org/abs/2008.09397)/[Code](https://github.com/csuhan/s2anet) 2021 | GRSL | Optimization for Oriented Object Detection via Representation Invariance Loss | Qi Ming, Zhiqiang Zhou, et al. | [Paper](https://arxiv.org/abs/2103.11636)/[Code](https://github.com/ming71/RIDet) 2021 | TGRS | Anchor Retouching via Model Interaction for Robust Object Detection in Aerial Images <br><sub><sup>*Dong Liang, Qixiang Geng, et al.*</sup></sub> | [Paper](https://arxiv.org/abs/2112.06701)/[Code](https://github.com/QxGeng/DEA-Net) 2021 | arXiv06 | Oriented Object Detection with Transformer <br><sub><sup>*Teli Ma, et al.*</sup></sub> | [Paper](https://arxiv.org/pdf/2106.03146.pdf)/Code -->

2024

No.Pub.TitleLinks
15JSTARSMTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining <br><sub><sup>Di Wang, Jing Zhang, Minqiang Xu, Lin Liu, Dongsheng Wang, Erzhong Gao, Chengxi Han, Haonan Guo, Bo Du, Dacheng Tao, Liangpei Zhang</sup></sub>Paper/Code
14TPAMILearning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery <br> <sup><sub>Yansheng Li; Junwei Luo; Yongjun Zhang; Yihua Tan; Jin-Gang Yu; Song Bai</sub></sup>Paper/Code
13CVPRWeakly Misalignment-free Adaptive Feature Alignment for UAVs-based Multimodal Object Detection <br> <sup><sub>Chen Chen, Jiahao Qi, Xingyue Liu, Kangcheng Bin, Ruigang Fu, Xikun Hu, Ping Zhong</sub></sup>Paper/Code
12CVPRWildlifeMapper: Aerial Image Analysis for Multi-Species Detection and Identification <br> <sup><sub>Satish Kumar, Bowen Zhang, Chandrakanth Gudavalli, Connor Levenson, Lacey Hughey, et al.</sub></sup>Paper/Code
11CVPRRotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation <br> <sup><sub>Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji</sub></sup>Paper/Code
10CVPRRelational Matching for Weakly Semi-Supervised Oriented Object Detection <br> <sup><sub>Wenhao Wu, Hau-San Wong, Si Wu, Tianyou Zhang</sub></sup>Paper/Code
09CVPRPoint2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision <br> <sup><sub>Yi Yu, Xue Yang, Qingyun Li, Feipeng Da, Jifeng Dai, Yu Qiao, Junchi Yan</sub></sup>Paper/Code
08CVPRPointOBB: Learning Oriented Object Detection via Single Point Supervision <br> <sup><sub>Junwei Luo, Xue Yang, Yi Yu, Qingyun Li, Junchi Yan, Yansheng Li</sub></sup>Paper/Code
07CVPRRethinking Boundary Discontinuity Problem for Oriented Object Detection <br> <sup><sub>Hang Xu, Xinyuan Liu, Haonan Xu, Yike Ma, Zunjie Zhu, Chenggang Yan, Feng Dai</sub></sup>Paper/Code
06CVPRTheoretically Achieving Continuous Representation of Oriented Bounding Boxes <br> <sup><sub>Zikai Xiao, Guo-Ye Yang, Xue Yang, Tai-Jiang Mu, Junchi Yan, Shi-min Hu</sub></sup>Paper/Code
05CVPRPoly Kernel Inception Network for Remote Sensing Detection <br> <sup><sub>Xinhao Cai, Qiuxia Lai, Yuwei Wang, Wenguan Wang, Zeren Sun, Yazhou Yao</sub></sup>Paper/Code
04AAAIFRED: Towards a Full Rotation-Equivariance in Aerial Image Object Detection <br> <sup><sub>Chanho Lee, Jinsu Son, Hyounguk Shon, Yunho Jeon, Junmo Kim</sub></sup>Paper/Code
03AAAISpatial Transform Decoupling for Oriented Object Detection <br><sub><sup>Hongtian Yu, Yunjie Tian, Qixiang Ye, Yunfan Liu</sup></sub>Paper/Code
02IJCVOriented R-CNN and Beyond <br><sub><sup>Xingxing Xie, Gong Cheng, Jiabao Wang, Ke Li, Xiwen Yao & Junwei Han</sup></sub>Paper/Code
01TGRSARS-DETR: Aspect Ratio-Sensitive Detection Transformer for Aerial Oriented Object Detection <br><sub><sup>Ying Zeng, Yushi Chen, Xue Yang, Qingyun Li, Junchi Yan</sup></sub>Paper/Code

2023

No.Pub.TitleLinks
12arXivAdaptive Dense Pseudo Label Selection for Semi-supervised Oriented Object Detection <br><sub><sup>Tong Zhao, Qiang Fang, Shuohao Shi, Xin Xu</sup></sub>Paper/Code
11ICCVAdaptive Rotated Convolution for Rotated Object Detection <br><sub><sup>Yifan Pu, Yiru Wang, Zhuofan Xia, Yizeng Han, Yulin Wang, Weihao Gan, Zidong Wang, Shiji Song, Gao Huang</sup></sub>Paper/Code
10ICCVLarge Selective Kernel Network for Remote Sensing Object Detection <br><sub><sup>Yuxuan Li, Qibin Hou, Zhaohui Zheng, Ming-Ming Cheng, Jian Yang, and Xiang Li</sup></sub>Paper/Code
09TIPSampling Equivariant Self-attention Networks for Object Detection in Aerial Images <br><sub><sup>Guo-Ye Yang; Xiang-Li Li; Zi-Kai Xiao; Tai-Jiang Mu; Ralph R. Martin; Shi-Min Hu</sup></sub>Paper/Code
08PRRoMP-Transformer: Rotational bounding box with Multi-level feature Pyramid Transformer for object detection <br><sub><sup>Joonhyeok Moon, Munsu Jeon, Siheon Jeong, Ki-Yong Oh</sup></sub>Paper/Code
07PAMIDetecting rotated objects as Gaussian distributions and its 3-D generalization <br><sub><sup>Yang, Xue and Zhang, Gefan and Yang, Xiaojiang and Zhou, Yue and Wang, Wentao and Tang, Jin and He, Tao and Yan, Junchi</sup></sub>Paper/Code
06CVPRDynamic Coarse-to-Fine Learning for Oriented Tiny Object Detection <br><sub><sup>Chang Xu, Jian Ding, Jinwang Wang, Chang Xu, Huai Yu, Lei Yu, Gui-Song Xia</sup></sub>Paper/Code
05CVPRSOOD: Towards Semi-Supervised Oriented Object Detection <br><sub><sup>Wei Hua, Dingkang Liang, jingyu li, Xiaolong Liu, Zhikang Zou, Xiaoqing Ye, Xiang Bai</sup></sub>Paper/Code
04CVPRKnowledge Combination to Learn Rotated Detection Without Rotated Annotation <br><sub><sup>Tianyu Zhu, Bryce Ferenczi, Pulak Purkait, Tom Drummond, Hamid Rezatofighi, Anton van den Hengel</sup></sub>Paper/Code
03CVPRPhase-Shifting Coder: Predicting Accurate Orientation in Oriented Object Detection <br><sub><sup>Yi Yu, Feipeng Da</sup></sub>Paper/Code
02ICLRH2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection <br><sub><sup>Yang, Xue and Zhang, Gefan and Li, Wentong and Wang, Xuehui and Zhou, Yue and Yan, Junchi</sup></sub>Paper/Code
01TCSVTAO2-DETR: Arbitrary-Oriented Object Detection Transformer <br><sub><sup>Linhui Dai, Hong Liu, Hao Tang, Zhiwei Wu, Pinhao Song</sup></sub>Paper/Code

2022

No.Pub.TitleLinks
15TGRSAnchor-free Oriented Proposal Generator for Object Detection <br><sub><sup>Gong Cheng, Jiabao Wang, et al.</sup></sub>Paper/Code
14IJCVOn the Arbitrary-Oriented Object Detection: Classification Based Approaches Revisited <br><sub><sup>Xue Yang, Junchi Yan</sup></sub>Paper/Code
13ECCVEAutoDet: Efficient Architecture Search for Object Detection <br><sub><sup>Xiaoxing Wang, Jiale Lin, Junchi Yan, Juanping Zhao, Xiaokang Yang</sup></sub>Paper/Code
12PAMISCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing <br><sub><sup>Xue Yang, Junchi Yan, et al.</sup></sub>Paper/Proj
11TIPGGHL: A General Gaussian Heatmap Label Assignment for Arbitrary-Oriented Object Detection <br><sub><sup>Zhanchao Huang, Wei Li, et al.</sup></sub>Paper/Code
10TIPACE: Anchor-Free Corner Evolution for Real-Time Arbitrarily-Oriented Object Detection <br><sub><sup>Pengwen Dai; Siyuan Yao; Zekun Li; Sanyi Zhang; Xiaochun Cao</sup></sub>Paper/Code
09TCSVTRSDet++: Point-based Modulated Loss for More Accurate Rotated Object Detection <br><sub><sup>Wen Qian, Xue Yang, et al.</sup></sub>Paper/Code
08CVPRInteractive Multi-Class Tiny-Object Detection <br><sub><sup>Chunggi Lee, Seonwook Park, Heon Song, Jeongun Ryu, Sanghoon Kim</sup></sub>Paper/Code
07CVPRWeakly Supervised Rotation-Invariant Aerial Object Detection Network <br><sub><sup>Xiaoxu Feng, Gong Cheng, et al.</sup></sub>Paper/Code
06CVPROSKDet: Towards Orientation-sensitive Keypoint Localization for Rotated Object Detection <br><sub><sup>Dongchen Lu, et al.</sup></sub>Paper/Code
05CVPROriented RepPoints for Aerial Object Detection <br><sub><sup>Wentong Li, Jianke Zhu</sup></sub>Paper/Code
04CVPRCanonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes <br><sub><sup>Yang You, Cewu Lu, et al.</sup></sub>Paper/Code
03AAAIShape-Adaptive Selection and Measurement for Oriented Object Detection <br><sub><sup>Liping Hou, Ke Lu, et al.</sup></sub>Paper/Code<br>MMRotate
02AAAIPolygon-to-Polygon Distance Loss for Rotated Object Detection <br><sub><sup>Yang Yang, Jifeng Chen, et al.</sup></sub>Paper/Code/Slides
01TMMAdaZoom: Towards Scale-Aware Large Scene Object Detection HBB <br><sub><sup>Jingtao Xu, Yali Li, et al.</sup></sub>Paper/Code<br>arXiv

2021

No.Pub.TitleLinks
16PAMIGliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection <br><sub><sup>Yongchao Xu, et al.</sup></sub>Paper/arXiv<br>Code
15TIP<span style="white-space:nowrap;">GSDet: Object Detection in Aerial Images Based on Scale Reasoning </span> <br><sub><sup>Wei Li, Wei Wei, Lei Zhang</sup></sub>Paper/Code
14NeurIPSLearning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence <br><sub><sup>Xue Yang, Junchi Yan, et al.</sup></sub>Paper/Code
13ICCVTowards Rotation Invariance in Object Detection <br><sub><sup>Agastya Kalra, Guy Stoppi, et al.</sup></sub>Paper/Code
12ICCVOriented R-CNN for Object Detection <br><sub><sup>X. Xie, Gong Cheng, et al.</sup></sub>Paper/Code
11MMPolar Ray: A Single-stage Angle-free Detector for Oriented Object Detection in Aerial Images <br><sub><sup>Shuai Liu, Huchuan Lu, et al.</sup></sub>Paper/Code
10ICMLRethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss <br><sub><sup>Xue Yang, Junchi Yan, et al.</sup></sub>arXiv/Code<br>SUPP
09CVPRReDet: A Rotation-equivariant Detector for Aerial Object Detection <br><sub><sup>Jiaming Han, Jian Ding, Nan Xue, Gui-Song Xia</sup></sub>Paper/Code
08CVPRBeyond Bounding-Box: Convex-Hull Feature Adaptation for Oriented and Densely Packed Object Detection <br><sub><sup>Z. Guo, Qixiang Ye, et al.</sup></sub>Paper/Code/<br>Journal
07CVPRDense Label Encoding for Boundary Discontinuity Free Rotation Detection <br><sub><sup>Xue Yang, Junchi Yan, et al.</sup></sub>Paper/Code
06CVPRGAIA: A Transfer Learning System of Object Detection that Fits Your Needs HBB <br><sub><sup>X. Bu, Zhaoxiang Zhang, et al.</sup></sub>Paper/Code
05AAAIDynamic Anchor Learning for Arbitrary-Oriented Object Detection <br><sub><sup>Qi Ming, Zhiqiang Zhou, et al.</sup></sub>Paper/Code
04AAAILearning Modulated Loss for Rotated Object Detection RSDet <br><sub><sup>Wen Qian, Xue Yang, Junchi Yang, et al.</sup></sub>Paper/arXiv<br>Code
03AAAI<span style="white-space:nowrap;">R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object   </span> <br><sub><sup>Xue Yang, Junchi Yang, et al.</sup></sub>Paper/arXiv<br>Py/TF
02WACVOriented object detection in aerial images with box boundary-aware vectors <br><sub><sup>Yi, Jingru and Wu, Pengxiang and Liu, Bo and Huang, Qiaoying and Qu, Hui and Metaxas, Dimitris</sup></sub>Paper/Code
01PRGradient-Aligned Convolution Neural Network rotation invariance <br><sub><sup>You Hao, Ping Hu, et al.</sup></sub>Paper/Code

2020

No.Pub.TitleLinks
10TIPA Global-Local Self-Adaptive Network for Drone-View Object Detection <br><sub><sup>Sutao Deng, et al.</sup></sub>Paper/Code
09TNNLSCRPN-SFNet: A High-Performance Object Detector on Large-Scale Remote Sensing Images <br><sub><sup>Qifeng Lin, et al.</sup></sub>Paper/Code
08JPRSOrientation guided anchoring for geospatial object detection from remote sensing imagery <br><sub><sup>Yongtao Yu, et al.</sup></sub>Paper/Code
07JPRS<span style="white-space:nowrap;">Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images  </span> <br><sub><sup>Kun Fu, et al.</sup></sub>Paper/Code
06JPRSOriented objects as pairs of middle lines <br><sub><sup>Haoran Wei, et al.</sup></sub>Paper/Code
05ECCVArbitrary-Oriented Object Detection with Circular Smooth Label Conf <br> On the Arbitrary-Oriented Object Detection: Classification based Approaches Revisited Journal <br><sub><sup>Xue Yang, Junchi Yan, Tao He</sup></sub>Paper/Code<br>Code2/Journal<br>Proj/Data
04ECCVPIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments <br><sub><sup>Zhiming Chen, Weiyao Lin, et al.</sup></sub>Paper/Code
03ICMECascade Detector With Feature Fusion For Arbitrary-Oriented Objects In Remote Sensing Images <br><sub><sup>Liping Hou, Ke Lu, et al.</sup></sub>Paper/Code
02CVPRForeground-Aware Relation Network for Geospatial Object Segmentation in High Spatial Resolution Remote Sensing Imagery <br><sub><sup>Zhuo Zheng, et al.</sup></sub>Paper/Code
01CVPR<span style="white-space:nowrap;">Dynamic Refinement Network for Oriented and Densely Packed Object Detection </span> <br><sub><sup>X. Pan, Changsheng Xu, et al.</sup></sub>Paper/Code

2019

No.Pub.TitleLinks
09TIPLearning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection <br><sub><sup>Gong Cheng, Junwei Han, et al.</sup></sub>Paper/CVPR16<br>Code
08TGRSCAD-Net: A Context-Aware Detection Network for Objects in Remote Sensing Imagery <br><sub><sup>Gongjie Zhang, Shijian Lu, Wei Zhang</sup></sub>Paper/arXiv<br>Code
07ICCVDelving Into Robust Object Detection From Unmanned Aerial Vehicles: A Deep Nuisance Disentanglement Approach HBB <br><sub><sup>Zhenyu Wu, et al.</sup></sub>Paper/Code
06ICCVSCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects <br><sub><sup>Xue Yang, Junchi Yan, et al.</sup></sub>Paper/Code
05ICCVClustered Object Detection in Aerial Images HBB <br><sub><sup>Fan Yang, Haibin Ling, et al.</sup></sub>Paper/Code
04ICMECropping Region Proposal Network Based Framework for Efficient Object Detection on Large Scale Remote Sensing Images <br><sub><sup>Qifeng Lin, et al.</sup></sub>Paper/Code
03CVPR<span style="white-space:nowrap;">Learning RoI Transformer for Oriented Object Detection in Aerial Images </span> <br><sub><sup>Jian Ding, Nan Xue, et al.</sup></sub>Paper/Code
02CVPRPrecise Detection in Densely Packed Scenes GOD <br><sub><sup>Eran Goldman, Roei Herzig, et al.</sup></sub>Paper/Code
01CVPRTowards Universal Object Detection by Domain Attention <br><sub><sup>Xudong Wang, et al.</sup></sub>Paper/Code

2018

No.Pub.TitleLinks
03TIPRandom Access Memories: A New Paradigm for Target Detection in High Resolution Aerial Remote Sensing Images <br><sub><sup>Zhengxia Zou, Zhenwei Shi</sup></sub>Paper/Code
02CVPRDOTA: A Large-Scale Dataset for Object Detection in Aerial Images <br><sub><sup>Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, et al.</sup></sub>Paper/Code
01CVPR<span style="white-space:nowrap;">ClusterNet: Detecting Small Objects in Large Scenes by Exploiting Spatio-Temporal Information HBB </span> <br><sub><sup>R. Londe, Dong Zhang, Mubarak Shah</sup></sub>Paper/arXiv<br>Code

Before 2018

No.YearPub.TitleLinks
042016CVPRRIFD-CNN: Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection <br><sub><sup>Gong Cheng, Peicheng Zhou, Junwei Han</sup></sub>Paper/Code
032016TGRS<span style="white-space:nowrap;">Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images </span> <br><sub><sup>Gong Cheng, Peicheng Zhou, Junwei Han</sup></sub>Paper/Code1<br>Code2
022016ECCVA Large Contextual Dataset for Classification, Detection and Counting of Cars with Deep Learning [HBB] <br><sub><sup>T. Nathan Mundhenk, et al.</sup></sub>Paper/Code
012015ICCVOriented Object Proposals <br><sub><sup>Shengfeng He, Rynson W. H. Lau</sup></sub>Paper/Code

Instance Segmentation

YearPub.TitleLinks
2024CVPRRotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation <br> <sup><sub>Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji</sub></sup>Paper/Code
2021TCYBSemantic Attention and Scale Complementary Network for Instance Segmentation in Remote Sensing Images <br> <sup><sub>Tianyang Zhang, Licheng Jiao, et al.</sub></sup>Paper/Code
2020CVPRForeground-Aware Relation Network for Geospatial Object Segmentation in High Spatial Resolution Remote Sensing Imagery <br> <sup><sub>Zhuo Zheng, et al.</sub></sup>Paper/Code

Small Object Detection

YearPub.TitleLinks
2024TCSVTSave the Tiny, Save the All: Hierarchical Activation Network for Tiny Object Detection <br> <sub><sup>Guangqian Guo; Pengfei Chen; Xuehui Yu; Zhenjun Han; Qixiang Ye; Shan Gao</sup></sub>Paper/Code
2023arXivTransformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art <br> <sub><sup>Aref Miri Rekavandi, Shima Rashidi, Farid Boussaid, Stephen Hoefs, Emre Akbas, Mohammed Bennamoun</sup></sub>Paper/Code
2023ICCVSmall Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning <br> <sub><sup>Xiang Yuan, Gong Cheng, Kebing Yan, Qinghua Zeng, Junwei Han</sup></sub>Paper/Code
2023PAMITowards Large-Scale Small Object Detection: Survey and Benchmarks <br> <sub><sup>Gong Cheng, Xiang Yuan, Xiwen Yao, Kebing Yan, Qinghua Zeng, Junwei Han</sup></sub>Paper/Data
2022JPRSDetecting tiny objects in aerial images: A normalized Wasserstein distance and a new benchmark <br> <sub><sup>Chang Xu, Jinwang Wang, Wen Yang, Huai Yua, Lei Yua, Gui-SongXi</sup></sub>Paper/Code
2022ECCVRFLA: Gaussian Receptive Field based Label Assignment for Tiny Object Detection <br> <sub><sup>Chang Xu, Jinwang Wang, Wen Yang, Huai Yu, Lei Yu, Gui-Song Xia</sup></sub>Paper/Code
2022CVPRQueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection <br> <sub><sup>Chenhongyi Yang, Zehao Huang, Naiyan Wang</sup></sub>Paper/Code
--------
2021arXiv07A Survey on Deep Domain Adaptation and Tiny Object Detection Challenges, Techniques and Datasets <br> <sub><sup>Muhammed Muzammul, Xi Li</sup></sub>Paper/Code
2021TMMExtended Feature Pyramid Network for Small Object Detection <br> <sub><sup>Chunfang Deng, et al.</sup></sub>Paper/Code
2021TMMCrossNet: Detecting Objects as Crosses <br> <sub><sup>Jiaxu Leng, Xinbo Gao, et al.</sup></sub>Paper/Code
2021CVPRDot distance for tiny object detection in aerial images <br> <sub><sup>Chang Xu, Jinwang Wang, Wen Yang, Lei Yu</sup></sub>Paper/Code
2021CVPRDogfight: Detecting Drones from Drones Videos <br> <sub><sup>M. Ashraf, W. Sultani, Mubarak Shah</sup></sub>Paper/Code
2021WACVEffective Fusion Factor in FPN for Tiny Object Detection <br> <sub><sup>Yuqi Gong, Zhenjun Han, et al.</sup></sub>Paper/Code
---------
2020IJCVMulti-task Generative Adversarial Network for Detecting Small Objects in the Wild <br> <sub><sup>Yongqiang Zhang, Yancheng Bai, et al.</sup></sub>Paper/CVPR18<br>ECCV18/Code
2020TCYBContext-Aware Block Net for Small Object Detection <br> <sub><sup>Lisha Cui, et al.</sup></sub>Paper/Code
2020MMCODAN: Counting-driven Attention Network for Vehicle Detection in Congested Scenes <br> <sub><sup>Wei Li, et al.</sup></sub>Paper/Code
---------
2019arXiv02Augmentation for small object detection <br> <sub><sup>Mate Kisantal, et al.</sup></sub>Paper/Code
2019TCSVTDetecting Small Objects Using a Channel-Aware Deconvolutional Network <br> <sub><sup>Kaiwen Duan, Dawei Du, et al.</sup></sub>Paper/Code
2019TGRSR2 -CNN: Fast Tiny Object Detection in Large-Scale Remote Sensing Images <br> <sub><sup>J. Pang, Jianping Shi, et al.</sup></sub>Paper/Code
2019ICCVBetter to Follow, Follow to Be Better: Towards Precise Supervision of Feature Super-Resolution for Small Object Detection <br> <sub><sup>Junhyug Noh, et al.</sup></sub>Paper/Code
2019ICCVMiss Detection vs. False Alarm: Adversarial Learning for Small Object Segmentation in Infrared Images <br> <sub><sup>Huan Wang, Luping Zhou, Lei Wang</sup></sub>Paper/Code
2019MMSmall and Dense Commodity Object Detection with Multi-Receptive Field Attention <br> <sub><sup>Zhong Ji, Yanwei Pang, et al.</sup></sub>Paper/Code
---------
2018CVPRFinding Tiny Faces in the Wild With Generative Adversarial Network <br> <sub><sup>Yancheng Bai, Bernard Ghanem, et al.</sup></sub>Paper/Code
2017ICCVFocal Loss for Dense Object Detection <br> <sub><sup>Tsung-Yi, Kaiming He, et al.</sup></sub>Paper/Code
2017CVPRPerceptual Generative Adversarial Networks for Small Object Detection <br> <sub><sup>Jianan Li, Tingfa Xu, et al.</sup></sub>Paper/Code

UAV Object Detection

YearPub.TitleAuthorsLinks
2021CVPRDetection, Tracking, and Counting Meets Drones in Crowds: A BenchmarkLongyin Wen, Dawei Du, et al.Paper/Code
------------
2020arXivVision Meets Drones: Past, Present and FuturePengfei Zhu, Dawei Du, et al.Paper/Code
2020IJCVThe Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and BaselineHongyang Yu, Dawei Du, et al.Paper/Proj<br>ECCV18
2020MMMOR-UAV: A Benchmark Dataset and Baselines for Moving Object Recognition in UAV VideosM. Mandal, Lav Kumar, S. VipparthiPaper/Code
2020MM<span style="white-space:nowrap;">Guided Attention Network for Object Detection and Counting on Drones </span><span style="white-space:nowrap;">Y. Cai, Dawei Du, et al. </span>Paper/Code

Dataset

YearNamePaperPub.
2022SODA<font size=2>Towards Large-Scale Small Object Detection: Survey and Benchmarks [OBB]</font>Paper
2021FAIR1M<font size=2>FAIR1M: A Benchmark Dataset for Fine-grained Object Recognition in High-Resolution Remote Sensing Imagery [OBB]</font>arXiv<br>Intro-ch
2021SaRNet<font size=2>SaRNet: A Dataset for Deep Learning Assisted Search and Rescue with Satellite Imagery [HBB]</font>arXiv
2021AI-TOD<font size=2>Tiny Object Detection in Aerial Images [HBB]</font>ICPR
2019DIOR<font size=2>Object detection in optical remote sensing images: A survey and a new benchmark [HBB]</font>ISPRSJ
2019iSAID<font size=2>iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images</font>CVPRW
2019HRRSD<font size=2>Hierarchical and Robust Convolutional Neural Network for Very High-Resolution Remote Sensing Object Detection</font>TGRS
2019VRAI<font size=2>Vehicle Re-identification in Aerial Imagery: Dataset and Approach</font>ICCV
2019ITCVD<font size=2>Vehicle Detection in Aerial Images</font>PERS
2019Aerial<br>Elephant<font size=2>The Aerial Elephant Dataset: A New Public Benchmark for Aerial Object Detection</font>CVPRW
2018DOTA<font size=2>DOTA: A Large-Scale Dataset for Object Detection in Aerial Images</font>CVPR/Kit
2018xView<font size=2>xView: Objects in Context in Overhead Imagery [HBB]</font>arXiv/Kit
2018VisDrone<font size=2>Vision Meets Drones: Past, Present and Future [HBB]</font>arXiv/Data
2018LPODC<font size=2>DAC-SDC Low Power Object Detection Challenge for UAV Applications [HBB]</font>PAMI
2018UAVDT<font size=2>The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking</font>ECCV
2016NWPU<br>VHR-10<font size=2>Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images</font>TGRS
2016RSOD<font size=2>Accurate Object Localization in Remote Sensing Images Based on Convolutional Neural Networks</font>TGRS
2016HRSC<br>2016<font size=2>A High Resolution Optical Satellite Image Dataset for Ship Recognition and Some New Baselines</font>ICPRAM<br>Kaggle
2015VEDAI<font size=2>Vehicle Detection in Aerial Imagery: A small target detection benchmark</font>JVCIR
2014UCAS-AOD<font size=2>Orientation robust object detection in aerial images using deep convolutional neural network</font>ICIP

Appendix

GOD

YearPub.TitleAuthorsLinks
2021arXiv09Progressive Hard-case Mining across Pyramid Levels in Object DetectionBinghong Wu, et al.Paper/Code
2021arXiv04Slender Object Detection: Diagnoses and ImprovementsZhaoyi Wan, Yimin Chen, et al.Paper/Code
2021arXiv01Focal and Efficient IOU Loss for Accurate Bounding Box RegressionYi-Fan Zhang, Liang Wang, et al.Paper/Code
2021NeurIPSYou Only Look at One Sequence: Rethinking Transformer in Vision through Object DetectionYuxin Fang, Xinggang Wang, et al.Paper/Code
2021PAMIScale Normalized Image Pyramids with AutoFocus for Object DetectionBharat Singh, Mahyar Najibi, et al.Paper/Code
2021IJCVCompositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition Under OcclusionAdam Kortylewski, et al.Paper/Code
2021IJCVScale-Aware Domain Adaptive Faster R-CNNY. Chen, D. Dai, Luc Van Gool, et al.Paper/Code
2021IJCVGuided Attention in CNNs for Occluded Pedestrian Detection and Re-identificationS. Zhang, Di Chen, Jian Yang, Bernt SchielePaper/Code
2021ICCVSOTR: Segmenting Objects With TransformersRuohao Guo, Dantong Niu, Liao Qu, Zhenbo LiPaper/Code
2021ICCVDynamic DETR: End-to-End Object Detection With Dynamic AttentionXiyang Dai, Lu Yuan; Lei Zhang, et al.Paper/Code
2021ICCVCrossDet: Crossline Representation for Object DetectionHeqian Qiu, Hongliang Li, et al.Paper/Code
2021CVPRUP-DETR: Unsupervised Pre-training for Object Detection with TransformersZ. Dai, J. Chen, et al.Paper/Code
2021CVPRAdaptive Image Transformer for One-Shot Object DetectionDing-Jie Chen, He-Yen Hsieh, Tyng-Luh LiuPaper/Code
2021CVPRScale-aware Automatic Augmentation for Object DetectionYukang Chen, Jiaya Jia, et al.Paper/Code
2021CVPRSparse R-CNN: End-to-End Object Detection with Learnable ProposalsPeize Sun, Ping Luo, et al.Paper/Code<br>arXiv
2021ICLRDeformable DETR: Deformable Transformers for End-to-End Object DetectionXizhou Zhu, Xiaogang Wang, Jifeng Dai, et al.Paper/Code
2021ICLROn the Universality of Rotation Equivariant Point Cloud NetworksNadav Dym, Haggai MaronPaper/Code
2021AAAIRethinking Object Detection in Retail StoresYuanqiang Cai, Dawei Du, etcPaper/Code
------------
2020NeurIPSRepPoints V2: Verification Meets Regression for Object DetectionYihong Chen, Zheng Zhang, et al.Paper/Code
2020TIPSelf-Supervised Feature Augmentation for Large Image Object DetectionXingjia Pan, et al.Paper/Code
2020ECCVEnd-to-End Object Detection with Transformers DETRCarion N, et al.Paper/Code
2020ECCVLearning Data Augmentation Strategies for Object DetectionBarret Zoph, Quoc V. Le, et al.Paper/Code
2020ECCVLearning to Separate: Detecting Heavily-Occluded Objects in Urban ScenesChenhongyi Yang, et al.Paper/Code
------------
2019IJCVCornerNet: Detecting Objects as Paired KeypointsHei Law, Jia DengPaper/arXiv<br>ECCV18/Code
2019IJCVCorner Detection Using Multi-directional Structure Tensor with Multiple ScalesWeichuan Zhang, Changming SunPaper/Code
2019IJCVHierarchical Attention for Part-Aware Face DetectionShuzhe Wu, Meina Kan, Shiguang Shan, Xilin ChenPaper/Code
2019TIPCombining Faster R-CNN and Model-Driven Clustering for Elongated Object DetectionFen Fang, et al.Paper/Code
2019arXivDeep Learning for 2D and 3D Rotatable Data: An Overview of MethodsLuca Della Libera, Daniel Cremers, et al.Paper/Code
2019ICCVAutoFocus: Efficient Multi-Scale InferenceMahyar Najibi, Bharat Singh, Larry S. DavisPaper/Code
2019ICCVScale-Aware Trident Networks for Object DetectionYanghao Li, Naiyan Wang, et al.Paper/Code
2019ICCVRepPoints: Point Set Representation for Object DetectionZe Yang, Han Hu, et al.Paper/Code
2019CVPRSelf-Supervised Representation Learning by Rotation Feature DecouplingZeyu Feng, Chang Xu, Dacheng TaoPaper/Code
2019ICLRA rotation-equivariant convolutional neural network model of primary visual cortexAlexander S. Ecker, et al.Paper/Code
2019ICLRRotDCF: Decomposition of Convolutional Filters for Rotation-Equivariant Deep NetworksXiuyuan Cheng, et al.Paper/Code
------------
2018PAMIConvolutional Oriented Boundaries: From Image Segmentation to High-Level TasksLuc Van Gool, et al.Paper/Code
2018TIPJoint Hand Detection and Rotation Estimation Using CNNXiaoming Deng, et al.Paper/Code
2018ECCVOcclusion-aware R-CNN: Detecting Pedestrians in a CrowdShifeng Zhang, et al.Paper/Code
2018ECCVSAN: Learning Relationship between Convolutional Features for Multi-Scale Object DetectionYonghyun Kim, et al.Paper/Code
2018NeurIPSSNIPER: Efficient Multi-Scale TrainingBharat Singh, Mahyar Najibi, Larry S. DavisPaper/Code
2018CVPRAn Analysis of Scale Invariance in Object Detection - SNIPBharat Singh, Larry S. DavisPaper/Code
2018ICLR<span style="white-space:nowrap;">Unsupervised Representation Learning by Predicting Image Rotations </span><span style="white-space:nowrap;">S. Gidaris, P. Singh, et al. </span>Paper/Code

Arbitrarily-Oriented Text Detection

YearPub.TitleAuthorsLinks
2021ICCVAdaptive Boundary Proposal Network for Arbitrary Shape Text DetectionShi-Xue Zhang, et al.Paper/Code
2021PAMIPAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped TextWenhai Wang, et al.Paper/Code
2021PAMITowards End-to-End Text Spotting in Natural ScenesPeng Wang, Hui Li, Chunhua ShenPaper/Code
2021PAMIMask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesMinghui Liao, Pengyuan Lyu, et al.Paper/ECCV18
2021IJCVExploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text DetectionYuliang Liu, Chunhua Shen, et al.Paper/Code
2021TIPArbitrarily Shaped Scene Text Detection With a Mask Tightness Text DetectorYuliang Liu, Lianwen Jin, Chuanming FangPaper/Code
2021TIPSLOAN: Scale-Adaptive Orientation Attention Network for Scene Text RecognitionPengwen Dai, Hua Zhang, Xiaochun CaoPaper/Code
2021MMMask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text DetectionXugong Qin, Yu Zhou, et al.Paper/Code
2021CVPRFourier Contour Embedding for Arbitrary-Shaped Text DetectionYiqin Zhu, Lianwen Jin, et al.Paper/Code
2021CVPRProgressive Contour Regression for Arbitrary-Shape Scene Text DetectionPengwen Dai, et al.Paper/Code
2021CVPRMOST: A Multi-Oriented Scene Text Detector with Localization RefinementMinghang He, Xiang Bai, et al.Paper/Code
2021AAAIMANGO: A Mask Attention Guided One-Stage Scene Text SpotterLiang Qiao, Ying Chen, et al.Paper/Code
2021AAAIPGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering NetworkPengfei Wang, et al.Paper/Code
------------
2020TIPASTS: A Unified Framework for Arbitrary Shape Text SpottingJuhua Liu, Zhe Chen, Bo Du, Dacheng TaoPaper/Code
2020TIPText Co-Detection in Multi-View SceneChuan Wang, Huazhu Fu, et al.Paper/Code
2020ECCVCharacter Region Attention For Text SpottingYoungmin Baek, et al.Paper/Code
2020CVPRContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text DetectionYuxin Wang, Zilong Fu, et al.Paper/Code
2020CVPRDeep Relational Reasoning Graph Network for Arbitrary Shape Text DetectionShi-Xue Zhang, et al.Paper/Code
2020AAAIText Perceptron: Towards End-to-End Arbitrary-Shaped Text SpottingLiang Qiao, et al.Paper/Code
2020AAAIAll You Need Is Boundary: Toward Arbitrary-Shaped Text SpottingHao Wang, Xiang Bai, et al.Paper/Code
------------
2019ICCVEfficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation NetworkWenhai Wang, et al.Paper/Code
2019ICCVTextDragon: An End-to-End Framework for Arbitrary Shaped Text SpottingWei Feng, Cheng-Lin Liu, et al.Paper/Code
2019CVPRLearning Shape-Aware Embedding for Scene Text DetectionZhuotao Tian, Jiaya Jia, et al.Paper/Code
2019CVPRArbitrary Shape Scene Text Detection with Adaptive Text Region RepresentationXiaobing Wang, Cheng-Lin Liu, et al.Paper/Code
2019CVPRTowards Robust Curve Text Detection with Conditional Spatial ExpansionZichuan Liu, Guosheng Lin, et al.Paper/Code
2019CVPRShape Robust Text Detection with Progressive Scale Expansion NetworkWenhai Wang, , Tong Lu, et al.Paper/Code
2019CVPRLook More Than Once: An Accurate Detector for Text of Arbitrary ShapesChengquan Zhan, et al.Paper/Code
------------
2018TIPMulti-Oriented and Multi-Lingual Scene Text Detection With Direct RegressionWenhao He, Cheng-Lin Liu, et al.
2018TIPTextBoxes++: A Single-Shot Oriented Scene Text DetectorMinghui Liao, Baoguang She, Xiang Bai
2018TMMArbitrary-Oriented Scene Text Detection via Rotation ProposalsJianqi Ma, et al.Paper/Code
2018ECCVTextSnake: A Flexible Representation for Detecting Text of Arbitrary ShapesShangbang Long, Jiaqiang Ruan, et al.
2018CVPRGeometry-Aware Scene Text Detection with Instance Transformation NetworkFangfang Wang, et al.Paper/Code
2018CVPRRotation-Sensitive Regression for Oriented Scene Text DetectionMinghui Liao, Xiang Bai, et al.
2018CVPRAON: Towards Arbitrarily-Oriented Text Recognition
2018CVPRFOTS: Fast Oriented Text Spotting With a Unified Network
2018CVPRMulti-Oriented Scene Text Detection via Corner Localization and Region Segmentation
------------
2017TIPTracking Based Multi-Orientation Scene Text Detection: A Unified Framework With Dynamic ProgrammingChun Yang, Junchi Yan, et al.
2017ICCVDeep Direct Regression for Multi-Oriented Scene Text Detection
2017ICCVDeep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework<span style="white-space:nowrap;">M. Busta, L. Neumann, Jiri Matas </span>Paper/Code
2017CVPR<span style="white-space:nowrap;">Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection </span>Yuliang Liu, Lianwen JinPaper/Code

Related Resources