Awesome
Object Detection in Aerial Images ![Awesome](https://awesome.re/badge.svg)
🔥 A curated list of awesome resources for generic object detection in aerial images.
:heavy_exclamation_mark: Updated at 2024-06.
<!--TOC-->
Contents:
- Overview
- Oriented Object Detection
- Instance Segmentation
- Small Object Detection
- UAV Object Dectection
- Dataset
- Appendix
Overview
No. | Year | Pub. | Title | Links |
---|---|---|---|---|
10 | 2023 | GRSM | Remote Sensing Object Detection Meets Deep Learning: A Meta-review of Challenges and Advances <br> <sub><sup>Xiangrong Zhang, Tianyang Zhang, Guanchun Wang, Peng Zhu, Xu Tang, Xiuping Jia, Licheng Jiao</sup></sub> | Paper/Code |
09 | 2023 | PAMI | Towards Large-Scale Small Object Detection: Survey and Benchmarks <br> <sub><sup>Gong Cheng, Xiang Yuan, Xiwen Yao, Kebing Yan, Qinghua Zeng, Xingxing Xie, Junwei Han</sup></sub> | Paper/Data |
08 | 2023 | arXiv | Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey <br> <sub><sup>Kun Wang, Zi Wang, Zhang Li, Ang Su, Xichao Teng, Minhao Liu, Qifeng Yu</sup></sub> | Paper/Code |
07 | 2021 | PAMI | <span style="white-space:nowrap;">Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges DOTA    </span> <br><sub><sup>Jian Ding, Nan Xue, Gui-Song Xia, Xiang Bai, et al.</sup></sub> | Paper/Proj |
06 | 2020 | IJCV | Deep Learning for Generic Object Detection: A Survey GOD <br><sub><sup>Li Liu, Wanli Ouyang, et al.</sup></sub> | Paper/Code |
05 | 2020 | JPRS | Object detection in optical remote sensing images: A survey and a new benchmark HBB <br><sub><sup>Ke Li, Gong Cheng, et al.</sup></sub> | Paper/Data |
04 | 2019 | TNNLS | Object Detection With Deep Learning: A Review GOD <br><sub><sup>Zhongqiu Zhao, et al.</sup></sub> | Paper/Code |
03 | 2018 | JPRS | A review of accuracy assessment for object-based image analysis: From per-pixel to per-polygon approaches <br><sub><sup>Su Ye, Rahul Rakshit, et al.</sup></sub> | Paper/Code |
02 | 2018 | CVPR | DOTA: A Large-Scale Dataset for Object Detection in Aerial Images DOTA <br><sub><sup>Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, et al.</sup></sub> | Paper/Code |
01 | 2016 | JPRS | A survey on object detection in optical remote sensing images <br><sub><sup>Gong Cheng, Junwei Han</sup></sub> | Paper/Code |
Oriented Object Detection
<!-- Dataset: DOTA, HRSC2016, ICDAR2015, ICDAR2017 MLT, MSRA-TD500, UCAS-AOD, FDDB, OHD-SJTU, SSDD++, Total-Text. -->Preprint
Year | Pub. | Title | Links |
---|---|---|---|
2024 | arXiv | GOOD: Towards Domain Generalized Orientated Object Detection <br><sub><sup>Qi Bi, Beichen Zhou, Jingjun Yi, Wei Ji, Haolan Zhan, Gui-Song Xia</sup></sub> | Paper/Code |
2023 | arXiv | P2RBox: A Single Point is All You Need for Oriented Object Detection <br><sub><sup>Guangming Cao, Xuehui Yu, Wenwen Yu, Xumeng Han, Xue Yang, Guorong Li, Jianbin Jiao, Zhenjun Han</sup></sub> | Paper/Code |
2022 | arXiv | DAFNe: A One-Stage Anchor-Free Deep Model for Oriented Object Detection <br><sub><sup>Steven Lang, et al.</sup></sub> | Paper/Code |
2024
No. | Pub. | Title | Links |
---|---|---|---|
15 | JSTARS | MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining <br><sub><sup>Di Wang, Jing Zhang, Minqiang Xu, Lin Liu, Dongsheng Wang, Erzhong Gao, Chengxi Han, Haonan Guo, Bo Du, Dacheng Tao, Liangpei Zhang</sup></sub> | Paper/Code |
14 | TPAMI | Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery <br> <sup><sub>Yansheng Li; Junwei Luo; Yongjun Zhang; Yihua Tan; Jin-Gang Yu; Song Bai</sub></sup> | Paper/Code |
13 | CVPR | Weakly Misalignment-free Adaptive Feature Alignment for UAVs-based Multimodal Object Detection <br> <sup><sub>Chen Chen, Jiahao Qi, Xingyue Liu, Kangcheng Bin, Ruigang Fu, Xikun Hu, Ping Zhong</sub></sup> | Paper/Code |
12 | CVPR | WildlifeMapper: Aerial Image Analysis for Multi-Species Detection and Identification <br> <sup><sub>Satish Kumar, Bowen Zhang, Chandrakanth Gudavalli, Connor Levenson, Lacey Hughey, et al.</sub></sup> | Paper/Code |
11 | CVPR | Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation <br> <sup><sub>Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji</sub></sup> | Paper/Code |
10 | CVPR | Relational Matching for Weakly Semi-Supervised Oriented Object Detection <br> <sup><sub>Wenhao Wu, Hau-San Wong, Si Wu, Tianyou Zhang</sub></sup> | Paper/Code |
09 | CVPR | Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision <br> <sup><sub>Yi Yu, Xue Yang, Qingyun Li, Feipeng Da, Jifeng Dai, Yu Qiao, Junchi Yan</sub></sup> | Paper/Code |
08 | CVPR | PointOBB: Learning Oriented Object Detection via Single Point Supervision <br> <sup><sub>Junwei Luo, Xue Yang, Yi Yu, Qingyun Li, Junchi Yan, Yansheng Li</sub></sup> | Paper/Code |
07 | CVPR | Rethinking Boundary Discontinuity Problem for Oriented Object Detection <br> <sup><sub>Hang Xu, Xinyuan Liu, Haonan Xu, Yike Ma, Zunjie Zhu, Chenggang Yan, Feng Dai</sub></sup> | Paper/Code |
06 | CVPR | Theoretically Achieving Continuous Representation of Oriented Bounding Boxes <br> <sup><sub>Zikai Xiao, Guo-Ye Yang, Xue Yang, Tai-Jiang Mu, Junchi Yan, Shi-min Hu</sub></sup> | Paper/Code |
05 | CVPR | Poly Kernel Inception Network for Remote Sensing Detection <br> <sup><sub>Xinhao Cai, Qiuxia Lai, Yuwei Wang, Wenguan Wang, Zeren Sun, Yazhou Yao</sub></sup> | Paper/Code |
04 | AAAI | FRED: Towards a Full Rotation-Equivariance in Aerial Image Object Detection <br> <sup><sub>Chanho Lee, Jinsu Son, Hyounguk Shon, Yunho Jeon, Junmo Kim</sub></sup> | Paper/Code |
03 | AAAI | Spatial Transform Decoupling for Oriented Object Detection <br><sub><sup>Hongtian Yu, Yunjie Tian, Qixiang Ye, Yunfan Liu</sup></sub> | Paper/Code |
02 | IJCV | Oriented R-CNN and Beyond <br><sub><sup>Xingxing Xie, Gong Cheng, Jiabao Wang, Ke Li, Xiwen Yao & Junwei Han</sup></sub> | Paper/Code |
01 | TGRS | ARS-DETR: Aspect Ratio-Sensitive Detection Transformer for Aerial Oriented Object Detection <br><sub><sup>Ying Zeng, Yushi Chen, Xue Yang, Qingyun Li, Junchi Yan</sup></sub> | Paper/Code |
2023
No. | Pub. | Title | Links |
---|---|---|---|
12 | arXiv | Adaptive Dense Pseudo Label Selection for Semi-supervised Oriented Object Detection <br><sub><sup>Tong Zhao, Qiang Fang, Shuohao Shi, Xin Xu</sup></sub> | Paper/Code |
11 | ICCV | Adaptive Rotated Convolution for Rotated Object Detection <br><sub><sup>Yifan Pu, Yiru Wang, Zhuofan Xia, Yizeng Han, Yulin Wang, Weihao Gan, Zidong Wang, Shiji Song, Gao Huang</sup></sub> | Paper/Code |
10 | ICCV | Large Selective Kernel Network for Remote Sensing Object Detection <br><sub><sup>Yuxuan Li, Qibin Hou, Zhaohui Zheng, Ming-Ming Cheng, Jian Yang, and Xiang Li</sup></sub> | Paper/Code |
09 | TIP | Sampling Equivariant Self-attention Networks for Object Detection in Aerial Images <br><sub><sup>Guo-Ye Yang; Xiang-Li Li; Zi-Kai Xiao; Tai-Jiang Mu; Ralph R. Martin; Shi-Min Hu</sup></sub> | Paper/Code |
08 | PR | RoMP-Transformer: Rotational bounding box with Multi-level feature Pyramid Transformer for object detection <br><sub><sup>Joonhyeok Moon, Munsu Jeon, Siheon Jeong, Ki-Yong Oh</sup></sub> | Paper/Code |
07 | PAMI | Detecting rotated objects as Gaussian distributions and its 3-D generalization <br><sub><sup>Yang, Xue and Zhang, Gefan and Yang, Xiaojiang and Zhou, Yue and Wang, Wentao and Tang, Jin and He, Tao and Yan, Junchi</sup></sub> | Paper/Code |
06 | CVPR | Dynamic Coarse-to-Fine Learning for Oriented Tiny Object Detection <br><sub><sup>Chang Xu, Jian Ding, Jinwang Wang, Chang Xu, Huai Yu, Lei Yu, Gui-Song Xia</sup></sub> | Paper/Code |
05 | CVPR | SOOD: Towards Semi-Supervised Oriented Object Detection <br><sub><sup>Wei Hua, Dingkang Liang, jingyu li, Xiaolong Liu, Zhikang Zou, Xiaoqing Ye, Xiang Bai</sup></sub> | Paper/Code |
04 | CVPR | Knowledge Combination to Learn Rotated Detection Without Rotated Annotation <br><sub><sup>Tianyu Zhu, Bryce Ferenczi, Pulak Purkait, Tom Drummond, Hamid Rezatofighi, Anton van den Hengel</sup></sub> | Paper/Code |
03 | CVPR | Phase-Shifting Coder: Predicting Accurate Orientation in Oriented Object Detection <br><sub><sup>Yi Yu, Feipeng Da</sup></sub> | Paper/Code |
02 | ICLR | H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection <br><sub><sup>Yang, Xue and Zhang, Gefan and Li, Wentong and Wang, Xuehui and Zhou, Yue and Yan, Junchi</sup></sub> | Paper/Code |
01 | TCSVT | AO2-DETR: Arbitrary-Oriented Object Detection Transformer <br><sub><sup>Linhui Dai, Hong Liu, Hao Tang, Zhiwei Wu, Pinhao Song</sup></sub> | Paper/Code |
2022
No. | Pub. | Title | Links |
---|---|---|---|
15 | TGRS | Anchor-free Oriented Proposal Generator for Object Detection <br><sub><sup>Gong Cheng, Jiabao Wang, et al.</sup></sub> | Paper/Code |
14 | IJCV | On the Arbitrary-Oriented Object Detection: Classification Based Approaches Revisited <br><sub><sup>Xue Yang, Junchi Yan</sup></sub> | Paper/Code |
13 | ECCV | EAutoDet: Efficient Architecture Search for Object Detection <br><sub><sup>Xiaoxing Wang, Jiale Lin, Junchi Yan, Juanping Zhao, Xiaokang Yang</sup></sub> | Paper/Code |
12 | PAMI | SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing <br><sub><sup>Xue Yang, Junchi Yan, et al.</sup></sub> | Paper/Proj |
11 | TIP | GGHL: A General Gaussian Heatmap Label Assignment for Arbitrary-Oriented Object Detection <br><sub><sup>Zhanchao Huang, Wei Li, et al.</sup></sub> | Paper/Code |
10 | TIP | ACE: Anchor-Free Corner Evolution for Real-Time Arbitrarily-Oriented Object Detection <br><sub><sup>Pengwen Dai; Siyuan Yao; Zekun Li; Sanyi Zhang; Xiaochun Cao</sup></sub> | Paper/Code |
09 | TCSVT | RSDet++: Point-based Modulated Loss for More Accurate Rotated Object Detection <br><sub><sup>Wen Qian, Xue Yang, et al.</sup></sub> | Paper/Code |
08 | CVPR | Interactive Multi-Class Tiny-Object Detection <br><sub><sup>Chunggi Lee, Seonwook Park, Heon Song, Jeongun Ryu, Sanghoon Kim</sup></sub> | Paper/Code |
07 | CVPR | Weakly Supervised Rotation-Invariant Aerial Object Detection Network <br><sub><sup>Xiaoxu Feng, Gong Cheng, et al.</sup></sub> | Paper/Code |
06 | CVPR | OSKDet: Towards Orientation-sensitive Keypoint Localization for Rotated Object Detection <br><sub><sup>Dongchen Lu, et al.</sup></sub> | Paper/Code |
05 | CVPR | Oriented RepPoints for Aerial Object Detection <br><sub><sup>Wentong Li, Jianke Zhu</sup></sub> | Paper/Code |
04 | CVPR | Paper/Code | |
03 | AAAI | Shape-Adaptive Selection and Measurement for Oriented Object Detection <br><sub><sup>Liping Hou, Ke Lu, et al.</sup></sub> | Paper/Code<br>MMRotate |
02 | AAAI | Polygon-to-Polygon Distance Loss for Rotated Object Detection <br><sub><sup>Yang Yang, Jifeng Chen, et al.</sup></sub> | Paper/Code/Slides |
01 | TMM | AdaZoom: Towards Scale-Aware Large Scene Object Detection HBB <br><sub><sup>Jingtao Xu, Yali Li, et al.</sup></sub> | Paper/Code<br>arXiv |
2021
No. | Pub. | Title | Links |
---|---|---|---|
16 | PAMI | Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection <br><sub><sup>Yongchao Xu, et al.</sup></sub> | Paper/arXiv<br>Code |
15 | TIP | <span style="white-space:nowrap;">GSDet: Object Detection in Aerial Images Based on Scale Reasoning </span> <br><sub><sup>Wei Li, Wei Wei, Lei Zhang</sup></sub> | Paper/Code |
14 | NeurIPS | Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence <br><sub><sup>Xue Yang, Junchi Yan, et al.</sup></sub> | Paper/Code |
13 | ICCV | Towards Rotation Invariance in Object Detection <br><sub><sup>Agastya Kalra, Guy Stoppi, et al.</sup></sub> | Paper/Code |
12 | ICCV | Oriented R-CNN for Object Detection <br><sub><sup>X. Xie, Gong Cheng, et al.</sup></sub> | Paper/Code |
11 | MM | Polar Ray: A Single-stage Angle-free Detector for Oriented Object Detection in Aerial Images <br><sub><sup>Shuai Liu, Huchuan Lu, et al.</sup></sub> | Paper/Code |
10 | ICML | Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss <br><sub><sup>Xue Yang, Junchi Yan, et al.</sup></sub> | arXiv/Code<br>SUPP |
09 | CVPR | ReDet: A Rotation-equivariant Detector for Aerial Object Detection <br><sub><sup>Jiaming Han, Jian Ding, Nan Xue, Gui-Song Xia</sup></sub> | Paper/Code |
08 | CVPR | Beyond Bounding-Box: Convex-Hull Feature Adaptation for Oriented and Densely Packed Object Detection <br><sub><sup>Z. Guo, Qixiang Ye, et al.</sup></sub> | Paper/Code/<br>Journal |
07 | CVPR | Dense Label Encoding for Boundary Discontinuity Free Rotation Detection <br><sub><sup>Xue Yang, Junchi Yan, et al.</sup></sub> | Paper/Code |
06 | CVPR | GAIA: A Transfer Learning System of Object Detection that Fits Your Needs HBB <br><sub><sup>X. Bu, Zhaoxiang Zhang, et al.</sup></sub> | Paper/Code |
05 | AAAI | Dynamic Anchor Learning for Arbitrary-Oriented Object Detection <br><sub><sup>Qi Ming, Zhiqiang Zhou, et al.</sup></sub> | Paper/Code |
04 | AAAI | Learning Modulated Loss for Rotated Object Detection RSDet <br><sub><sup>Wen Qian, Xue Yang, Junchi Yang, et al.</sup></sub> | Paper/arXiv<br>Code |
03 | AAAI | <span style="white-space:nowrap;">R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object   </span> <br><sub><sup>Xue Yang, Junchi Yang, et al.</sup></sub> | Paper/arXiv<br>Py/TF |
02 | WACV | Oriented object detection in aerial images with box boundary-aware vectors <br><sub><sup>Yi, Jingru and Wu, Pengxiang and Liu, Bo and Huang, Qiaoying and Qu, Hui and Metaxas, Dimitris</sup></sub> | Paper/Code |
01 | PR | Gradient-Aligned Convolution Neural Network rotation invariance <br><sub><sup>You Hao, Ping Hu, et al.</sup></sub> | Paper/Code |
2020
No. | Pub. | Title | Links |
---|---|---|---|
10 | TIP | A Global-Local Self-Adaptive Network for Drone-View Object Detection <br><sub><sup>Sutao Deng, et al.</sup></sub> | Paper/Code |
09 | TNNLS | CRPN-SFNet: A High-Performance Object Detector on Large-Scale Remote Sensing Images <br><sub><sup>Qifeng Lin, et al.</sup></sub> | Paper/Code |
08 | JPRS | Orientation guided anchoring for geospatial object detection from remote sensing imagery <br><sub><sup>Yongtao Yu, et al.</sup></sub> | Paper/Code |
07 | JPRS | <span style="white-space:nowrap;">Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images  </span> <br><sub><sup>Kun Fu, et al.</sup></sub> | Paper/Code |
06 | JPRS | Oriented objects as pairs of middle lines <br><sub><sup>Haoran Wei, et al.</sup></sub> | Paper/Code |
05 | ECCV | Arbitrary-Oriented Object Detection with Circular Smooth Label Conf <br> On the Arbitrary-Oriented Object Detection: Classification based Approaches Revisited Journal <br><sub><sup>Xue Yang, Junchi Yan, Tao He</sup></sub> | Paper/Code<br>Code2/Journal<br>Proj/Data |
04 | ECCV | PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments <br><sub><sup>Zhiming Chen, Weiyao Lin, et al.</sup></sub> | Paper/Code |
03 | ICME | Cascade Detector With Feature Fusion For Arbitrary-Oriented Objects In Remote Sensing Images <br><sub><sup>Liping Hou, Ke Lu, et al.</sup></sub> | Paper/Code |
02 | CVPR | Foreground-Aware Relation Network for Geospatial Object Segmentation in High Spatial Resolution Remote Sensing Imagery <br><sub><sup>Zhuo Zheng, et al.</sup></sub> | Paper/Code |
01 | CVPR | <span style="white-space:nowrap;">Dynamic Refinement Network for Oriented and Densely Packed Object Detection </span> <br><sub><sup>X. Pan, Changsheng Xu, et al.</sup></sub> | Paper/Code |
2019
No. | Pub. | Title | Links |
---|---|---|---|
09 | TIP | Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection <br><sub><sup>Gong Cheng, Junwei Han, et al.</sup></sub> | Paper/CVPR16<br>Code |
08 | TGRS | CAD-Net: A Context-Aware Detection Network for Objects in Remote Sensing Imagery <br><sub><sup>Gongjie Zhang, Shijian Lu, Wei Zhang</sup></sub> | Paper/arXiv<br>Code |
07 | ICCV | Delving Into Robust Object Detection From Unmanned Aerial Vehicles: A Deep Nuisance Disentanglement Approach HBB <br><sub><sup>Zhenyu Wu, et al.</sup></sub> | Paper/Code |
06 | ICCV | SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects <br><sub><sup>Xue Yang, Junchi Yan, et al.</sup></sub> | Paper/Code |
05 | ICCV | Clustered Object Detection in Aerial Images HBB <br><sub><sup>Fan Yang, Haibin Ling, et al.</sup></sub> | Paper/Code |
04 | ICME | Cropping Region Proposal Network Based Framework for Efficient Object Detection on Large Scale Remote Sensing Images <br><sub><sup>Qifeng Lin, et al.</sup></sub> | Paper/Code |
03 | CVPR | <span style="white-space:nowrap;">Learning RoI Transformer for Oriented Object Detection in Aerial Images </span> <br><sub><sup>Jian Ding, Nan Xue, et al.</sup></sub> | Paper/Code |
02 | CVPR | Precise Detection in Densely Packed Scenes GOD <br><sub><sup>Eran Goldman, Roei Herzig, et al.</sup></sub> | Paper/Code |
01 | CVPR | Towards Universal Object Detection by Domain Attention <br><sub><sup>Xudong Wang, et al.</sup></sub> | Paper/Code |
2018
No. | Pub. | Title | Links |
---|---|---|---|
03 | TIP | Random Access Memories: A New Paradigm for Target Detection in High Resolution Aerial Remote Sensing Images <br><sub><sup>Zhengxia Zou, Zhenwei Shi</sup></sub> | Paper/Code |
02 | CVPR | DOTA: A Large-Scale Dataset for Object Detection in Aerial Images <br><sub><sup>Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, et al.</sup></sub> | Paper/Code |
01 | CVPR | <span style="white-space:nowrap;">ClusterNet: Detecting Small Objects in Large Scenes by Exploiting Spatio-Temporal Information HBB  </span> <br><sub><sup>R. Londe, Dong Zhang, Mubarak Shah</sup></sub> | Paper/arXiv<br>Code |
Before 2018
No. | Year | Pub. | Title | Links |
---|---|---|---|---|
04 | 2016 | CVPR | RIFD-CNN: Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection <br><sub><sup>Gong Cheng, Peicheng Zhou, Junwei Han</sup></sub> | Paper/Code |
03 | 2016 | TGRS | <span style="white-space:nowrap;">Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images </span> <br><sub><sup>Gong Cheng, Peicheng Zhou, Junwei Han</sup></sub> | Paper/Code1<br>Code2 |
02 | 2016 | ECCV | A Large Contextual Dataset for Classification, Detection and Counting of Cars with Deep Learning [HBB] <br><sub><sup>T. Nathan Mundhenk, et al.</sup></sub> | Paper/Code |
01 | 2015 | ICCV | Oriented Object Proposals <br><sub><sup>Shengfeng He, Rynson W. H. Lau</sup></sub> | Paper/Code |
Instance Segmentation
Year | Pub. | Title | Links |
---|---|---|---|
2024 | CVPR | Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation <br> <sup><sub>Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji</sub></sup> | Paper/Code |
2021 | TCYB | Semantic Attention and Scale Complementary Network for Instance Segmentation in Remote Sensing Images <br> <sup><sub>Tianyang Zhang, Licheng Jiao, et al.</sub></sup> | Paper/Code |
2020 | CVPR | Foreground-Aware Relation Network for Geospatial Object Segmentation in High Spatial Resolution Remote Sensing Imagery <br> <sup><sub>Zhuo Zheng, et al.</sub></sup> | Paper/Code |
Small Object Detection
Year | Pub. | Title | Links |
---|---|---|---|
2024 | TCSVT | Save the Tiny, Save the All: Hierarchical Activation Network for Tiny Object Detection <br> <sub><sup>Guangqian Guo; Pengfei Chen; Xuehui Yu; Zhenjun Han; Qixiang Ye; Shan Gao</sup></sub> | Paper/Code |
2023 | arXiv | Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art <br> <sub><sup>Aref Miri Rekavandi, Shima Rashidi, Farid Boussaid, Stephen Hoefs, Emre Akbas, Mohammed Bennamoun</sup></sub> | Paper/Code |
2023 | ICCV | Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning <br> <sub><sup>Xiang Yuan, Gong Cheng, Kebing Yan, Qinghua Zeng, Junwei Han</sup></sub> | Paper/Code |
2023 | PAMI | Towards Large-Scale Small Object Detection: Survey and Benchmarks <br> <sub><sup>Gong Cheng, Xiang Yuan, Xiwen Yao, Kebing Yan, Qinghua Zeng, Junwei Han</sup></sub> | Paper/Data |
2022 | JPRS | Detecting tiny objects in aerial images: A normalized Wasserstein distance and a new benchmark <br> <sub><sup>Chang Xu, Jinwang Wang, Wen Yang, Huai Yua, Lei Yua, Gui-SongXi</sup></sub> | Paper/Code |
2022 | ECCV | RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object Detection <br> <sub><sup>Chang Xu, Jinwang Wang, Wen Yang, Huai Yu, Lei Yu, Gui-Song Xia</sup></sub> | Paper/Code |
2022 | CVPR | QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection <br> <sub><sup>Chenhongyi Yang, Zehao Huang, Naiyan Wang</sup></sub> | Paper/Code |
-- | -- | -- | -- |
2021 | arXiv07 | A Survey on Deep Domain Adaptation and Tiny Object Detection Challenges, Techniques and Datasets <br> <sub><sup>Muhammed Muzammul, Xi Li</sup></sub> | Paper/Code |
2021 | TMM | Extended Feature Pyramid Network for Small Object Detection <br> <sub><sup>Chunfang Deng, et al.</sup></sub> | Paper/Code |
2021 | TMM | CrossNet: Detecting Objects as Crosses <br> <sub><sup>Jiaxu Leng, Xinbo Gao, et al.</sup></sub> | Paper/Code |
2021 | CVPR | Dot distance for tiny object detection in aerial images <br> <sub><sup>Chang Xu, Jinwang Wang, Wen Yang, Lei Yu</sup></sub> | Paper/Code |
2021 | CVPR | Dogfight: Detecting Drones from Drones Videos <br> <sub><sup>M. Ashraf, W. Sultani, Mubarak Shah</sup></sub> | Paper/Code |
2021 | WACV | Effective Fusion Factor in FPN for Tiny Object Detection <br> <sub><sup>Yuqi Gong, Zhenjun Han, et al.</sup></sub> | Paper/Code |
--- | --- | --- | |
2020 | IJCV | Multi-task Generative Adversarial Network for Detecting Small Objects in the Wild <br> <sub><sup>Yongqiang Zhang, Yancheng Bai, et al.</sup></sub> | Paper/CVPR18<br>ECCV18/Code |
2020 | TCYB | Context-Aware Block Net for Small Object Detection <br> <sub><sup>Lisha Cui, et al.</sup></sub> | Paper/Code |
2020 | MM | CODAN: Counting-driven Attention Network for Vehicle Detection in Congested Scenes <br> <sub><sup>Wei Li, et al.</sup></sub> | Paper/Code |
--- | --- | --- | |
2019 | arXiv02 | Augmentation for small object detection <br> <sub><sup>Mate Kisantal, et al.</sup></sub> | Paper/Code |
2019 | TCSVT | Detecting Small Objects Using a Channel-Aware Deconvolutional Network <br> <sub><sup>Kaiwen Duan, Dawei Du, et al.</sup></sub> | Paper/Code |
2019 | TGRS | R2 -CNN: Fast Tiny Object Detection in Large-Scale Remote Sensing Images <br> <sub><sup>J. Pang, Jianping Shi, et al.</sup></sub> | Paper/Code |
2019 | ICCV | Better to Follow, Follow to Be Better: Towards Precise Supervision of Feature Super-Resolution for Small Object Detection <br> <sub><sup>Junhyug Noh, et al.</sup></sub> | Paper/Code |
2019 | ICCV | Miss Detection vs. False Alarm: Adversarial Learning for Small Object Segmentation in Infrared Images <br> <sub><sup>Huan Wang, Luping Zhou, Lei Wang</sup></sub> | Paper/Code |
2019 | MM | Small and Dense Commodity Object Detection with Multi-Receptive Field Attention <br> <sub><sup>Zhong Ji, Yanwei Pang, et al.</sup></sub> | Paper/Code |
--- | --- | --- | |
2018 | CVPR | Finding Tiny Faces in the Wild With Generative Adversarial Network <br> <sub><sup>Yancheng Bai, Bernard Ghanem, et al.</sup></sub> | Paper/Code |
2017 | ICCV | Focal Loss for Dense Object Detection <br> <sub><sup>Tsung-Yi, Kaiming He, et al.</sup></sub> | Paper/Code |
2017 | CVPR | Perceptual Generative Adversarial Networks for Small Object Detection <br> <sub><sup>Jianan Li, Tingfa Xu, et al.</sup></sub> | Paper/Code |
UAV Object Detection
Year | Pub. | Title | Authors | Links |
---|---|---|---|---|
2021 | CVPR | Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark | Longyin Wen, Dawei Du, et al. | Paper/Code |
--- | --- | --- | --- | |
2020 | arXiv | Vision Meets Drones: Past, Present and Future | Pengfei Zhu, Dawei Du, et al. | Paper/Code |
2020 | IJCV | The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline | Hongyang Yu, Dawei Du, et al. | Paper/Proj<br>ECCV18 |
2020 | MM | MOR-UAV: A Benchmark Dataset and Baselines for Moving Object Recognition in UAV Videos | M. Mandal, Lav Kumar, S. Vipparthi | Paper/Code |
2020 | MM | <span style="white-space:nowrap;">Guided Attention Network for Object Detection and Counting on Drones </span> | <span style="white-space:nowrap;">Y. Cai, Dawei Du, et al. </span> | Paper/Code |
Dataset
Year | Name | Paper | Pub. |
---|---|---|---|
2022 | SODA | <font size=2>Towards Large-Scale Small Object Detection: Survey and Benchmarks [OBB] </font> | Paper |
2021 | FAIR1M | <font size=2>FAIR1M: A Benchmark Dataset for Fine-grained Object Recognition in High-Resolution Remote Sensing Imagery [OBB] </font> | arXiv<br>Intro-ch |
2021 | SaRNet | <font size=2>SaRNet: A Dataset for Deep Learning Assisted Search and Rescue with Satellite Imagery [HBB] </font> | arXiv |
2021 | AI-TOD | <font size=2>Tiny Object Detection in Aerial Images [HBB] </font> | ICPR |
2019 | DIOR | <font size=2>Object detection in optical remote sensing images: A survey and a new benchmark [HBB] </font> | ISPRSJ |
2019 | iSAID | <font size=2>iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images</font> | CVPRW |
2019 | HRRSD | <font size=2>Hierarchical and Robust Convolutional Neural Network for Very High-Resolution Remote Sensing Object Detection</font> | TGRS |
2019 | VRAI | <font size=2>Vehicle Re-identification in Aerial Imagery: Dataset and Approach</font> | ICCV |
2019 | ITCVD | <font size=2>Vehicle Detection in Aerial Images</font> | PERS |
2019 | Aerial<br>Elephant | <font size=2>The Aerial Elephant Dataset: A New Public Benchmark for Aerial Object Detection</font> | CVPRW |
2018 | DOTA | <font size=2>DOTA: A Large-Scale Dataset for Object Detection in Aerial Images</font> | CVPR/Kit |
2018 | xView | <font size=2>xView: Objects in Context in Overhead Imagery [HBB] </font> | arXiv/Kit |
2018 | VisDrone | <font size=2>Vision Meets Drones: Past, Present and Future [HBB] </font> | arXiv/Data |
2018 | LPODC | <font size=2>DAC-SDC Low Power Object Detection Challenge for UAV Applications [HBB] </font> | PAMI |
2018 | UAVDT | <font size=2>The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking</font> | ECCV |
2016 | NWPU<br>VHR-10 | <font size=2>Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images</font> | TGRS |
2016 | RSOD | <font size=2>Accurate Object Localization in Remote Sensing Images Based on Convolutional Neural Networks</font> | TGRS |
2016 | HRSC<br>2016 | <font size=2>A High Resolution Optical Satellite Image Dataset for Ship Recognition and Some New Baselines</font> | ICPRAM<br>Kaggle |
2015 | VEDAI | <font size=2>Vehicle Detection in Aerial Imagery: A small target detection benchmark</font> | JVCIR |
2014 | UCAS-AOD | <font size=2>Orientation robust object detection in aerial images using deep convolutional neural network</font> | ICIP |
Appendix
GOD
Year | Pub. | Title | Authors | Links |
---|---|---|---|---|
2021 | arXiv09 | Progressive Hard-case Mining across Pyramid Levels in Object Detection | Binghong Wu, et al. | Paper/Code |
2021 | arXiv04 | Slender Object Detection: Diagnoses and Improvements | Zhaoyi Wan, Yimin Chen, et al. | Paper/Code |
2021 | arXiv01 | Focal and Efficient IOU Loss for Accurate Bounding Box Regression | Yi-Fan Zhang, Liang Wang, et al. | Paper/Code |
2021 | NeurIPS | You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection | Yuxin Fang, Xinggang Wang, et al. | Paper/Code |
2021 | PAMI | Scale Normalized Image Pyramids with AutoFocus for Object Detection | Bharat Singh, Mahyar Najibi, et al. | Paper/Code |
2021 | IJCV | Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition Under Occlusion | Adam Kortylewski, et al. | Paper/Code |
2021 | IJCV | Scale-Aware Domain Adaptive Faster R-CNN | Y. Chen, D. Dai, Luc Van Gool, et al. | Paper/Code |
2021 | IJCV | Guided Attention in CNNs for Occluded Pedestrian Detection and Re-identification | S. Zhang, Di Chen, Jian Yang, Bernt Schiele | Paper/Code |
2021 | ICCV | SOTR: Segmenting Objects With Transformers | Ruohao Guo, Dantong Niu, Liao Qu, Zhenbo Li | Paper/Code |
2021 | ICCV | Dynamic DETR: End-to-End Object Detection With Dynamic Attention | Xiyang Dai, Lu Yuan; Lei Zhang, et al. | Paper/Code |
2021 | ICCV | CrossDet: Crossline Representation for Object Detection | Heqian Qiu, Hongliang Li, et al. | Paper/Code |
2021 | CVPR | UP-DETR: Unsupervised Pre-training for Object Detection with Transformers | Z. Dai, J. Chen, et al. | Paper/Code |
2021 | CVPR | Adaptive Image Transformer for One-Shot Object Detection | Ding-Jie Chen, He-Yen Hsieh, Tyng-Luh Liu | Paper/Code |
2021 | CVPR | Scale-aware Automatic Augmentation for Object Detection | Yukang Chen, Jiaya Jia, et al. | Paper/Code |
2021 | CVPR | Sparse R-CNN: End-to-End Object Detection with Learnable Proposals | Peize Sun, Ping Luo, et al. | Paper/Code<br>arXiv |
2021 | ICLR | Deformable DETR: Deformable Transformers for End-to-End Object Detection | Xizhou Zhu, Xiaogang Wang, Jifeng Dai, et al. | Paper/Code |
2021 | ICLR | On the Universality of Rotation Equivariant Point Cloud Networks | Nadav Dym, Haggai Maron | Paper/Code |
2021 | AAAI | Rethinking Object Detection in Retail Stores | Yuanqiang Cai, Dawei Du, etc | Paper/Code |
--- | --- | --- | --- | |
2020 | NeurIPS | RepPoints V2: Verification Meets Regression for Object Detection | Yihong Chen, Zheng Zhang, et al. | Paper/Code |
2020 | TIP | Self-Supervised Feature Augmentation for Large Image Object Detection | Xingjia Pan, et al. | Paper/Code |
2020 | ECCV | End-to-End Object Detection with Transformers DETR | Carion N, et al. | Paper/Code |
2020 | ECCV | Learning Data Augmentation Strategies for Object Detection | Barret Zoph, Quoc V. Le, et al. | Paper/Code |
2020 | ECCV | Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes | Chenhongyi Yang, et al. | Paper/Code |
--- | --- | --- | --- | |
2019 | IJCV | CornerNet: Detecting Objects as Paired Keypoints | Hei Law, Jia Deng | Paper/arXiv<br>ECCV18/Code |
2019 | IJCV | Corner Detection Using Multi-directional Structure Tensor with Multiple Scales | Weichuan Zhang, Changming Sun | Paper/Code |
2019 | IJCV | Hierarchical Attention for Part-Aware Face Detection | Shuzhe Wu, Meina Kan, Shiguang Shan, Xilin Chen | Paper/Code |
2019 | TIP | Combining Faster R-CNN and Model-Driven Clustering for Elongated Object Detection | Fen Fang, et al. | Paper/Code |
2019 | arXiv | Deep Learning for 2D and 3D Rotatable Data: An Overview of Methods | Luca Della Libera, Daniel Cremers, et al. | Paper/Code |
2019 | ICCV | AutoFocus: Efficient Multi-Scale Inference | Mahyar Najibi, Bharat Singh, Larry S. Davis | Paper/Code |
2019 | ICCV | Scale-Aware Trident Networks for Object Detection | Yanghao Li, Naiyan Wang, et al. | Paper/Code |
2019 | ICCV | RepPoints: Point Set Representation for Object Detection | Ze Yang, Han Hu, et al. | Paper/Code |
2019 | CVPR | Self-Supervised Representation Learning by Rotation Feature Decoupling | Zeyu Feng, Chang Xu, Dacheng Tao | Paper/Code |
2019 | ICLR | A rotation-equivariant convolutional neural network model of primary visual cortex | Alexander S. Ecker, et al. | Paper/Code |
2019 | ICLR | RotDCF: Decomposition of Convolutional Filters for Rotation-Equivariant Deep Networks | Xiuyuan Cheng, et al. | Paper/Code |
--- | --- | --- | --- | |
2018 | PAMI | Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks | Luc Van Gool, et al. | Paper/Code |
2018 | TIP | Joint Hand Detection and Rotation Estimation Using CNN | Xiaoming Deng, et al. | Paper/Code |
2018 | ECCV | Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd | Shifeng Zhang, et al. | Paper/Code |
2018 | ECCV | SAN: Learning Relationship between Convolutional Features for Multi-Scale Object Detection | Yonghyun Kim, et al. | Paper/Code |
2018 | NeurIPS | SNIPER: Efficient Multi-Scale Training | Bharat Singh, Mahyar Najibi, Larry S. Davis | Paper/Code |
2018 | CVPR | An Analysis of Scale Invariance in Object Detection - SNIP | Bharat Singh, Larry S. Davis | Paper/Code |
2018 | ICLR | <span style="white-space:nowrap;">Unsupervised Representation Learning by Predicting Image Rotations </span> | <span style="white-space:nowrap;">S. Gidaris, P. Singh, et al. </span> | Paper/Code |
Arbitrarily-Oriented Text Detection
Year | Pub. | Title | Authors | Links |
---|---|---|---|---|
2021 | ICCV | Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection | Shi-Xue Zhang, et al. | Paper/Code |
2021 | PAMI | PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text | Wenhai Wang, et al. | Paper/Code |
2021 | PAMI | Towards End-to-End Text Spotting in Natural Scenes | Peng Wang, Hui Li, Chunhua Shen | Paper/Code |
2021 | PAMI | Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes | Minghui Liao, Pengyuan Lyu, et al. | Paper/ECCV18 |
2021 | IJCV | Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection | Yuliang Liu, Chunhua Shen, et al. | Paper/Code |
2021 | TIP | Arbitrarily Shaped Scene Text Detection With a Mask Tightness Text Detector | Yuliang Liu, Lianwen Jin, Chuanming Fang | Paper/Code |
2021 | TIP | SLOAN: Scale-Adaptive Orientation Attention Network for Scene Text Recognition | Pengwen Dai, Hua Zhang, Xiaochun Cao | Paper/Code |
2021 | MM | Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection | Xugong Qin, Yu Zhou, et al. | Paper/Code |
2021 | CVPR | Fourier Contour Embedding for Arbitrary-Shaped Text Detection | Yiqin Zhu, Lianwen Jin, et al. | Paper/Code |
2021 | CVPR | Progressive Contour Regression for Arbitrary-Shape Scene Text Detection | Pengwen Dai, et al. | Paper/Code |
2021 | CVPR | MOST: A Multi-Oriented Scene Text Detector with Localization Refinement | Minghang He, Xiang Bai, et al. | Paper/Code |
2021 | AAAI | MANGO: A Mask Attention Guided One-Stage Scene Text Spotter | Liang Qiao, Ying Chen, et al. | Paper/Code |
2021 | AAAI | PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network | Pengfei Wang, et al. | Paper/Code |
--- | --- | --- | --- | |
2020 | TIP | ASTS: A Unified Framework for Arbitrary Shape Text Spotting | Juhua Liu, Zhe Chen, Bo Du, Dacheng Tao | Paper/Code |
2020 | TIP | Text Co-Detection in Multi-View Scene | Chuan Wang, Huazhu Fu, et al. | Paper/Code |
2020 | ECCV | Character Region Attention For Text Spotting | Youngmin Baek, et al. | Paper/Code |
2020 | CVPR | ContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text Detection | Yuxin Wang, Zilong Fu, et al. | Paper/Code |
2020 | CVPR | Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection | Shi-Xue Zhang, et al. | Paper/Code |
2020 | AAAI | Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting | Liang Qiao, et al. | Paper/Code |
2020 | AAAI | All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting | Hao Wang, Xiang Bai, et al. | Paper/Code |
--- | --- | --- | --- | |
2019 | ICCV | Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network | Wenhai Wang, et al. | Paper/Code |
2019 | ICCV | TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting | Wei Feng, Cheng-Lin Liu, et al. | Paper/Code |
2019 | CVPR | Learning Shape-Aware Embedding for Scene Text Detection | Zhuotao Tian, Jiaya Jia, et al. | Paper/Code |
2019 | CVPR | Arbitrary Shape Scene Text Detection with Adaptive Text Region Representation | Xiaobing Wang, Cheng-Lin Liu, et al. | Paper/Code |
2019 | CVPR | Towards Robust Curve Text Detection with Conditional Spatial Expansion | Zichuan Liu, Guosheng Lin, et al. | Paper/Code |
2019 | CVPR | Shape Robust Text Detection with Progressive Scale Expansion Network | Wenhai Wang, , Tong Lu, et al. | Paper/Code |
2019 | CVPR | Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes | Chengquan Zhan, et al. | Paper/Code |
--- | --- | --- | --- | |
2018 | TIP | Multi-Oriented and Multi-Lingual Scene Text Detection With Direct Regression | Wenhao He, Cheng-Lin Liu, et al. | |
2018 | TIP | TextBoxes++: A Single-Shot Oriented Scene Text Detector | Minghui Liao, Baoguang She, Xiang Bai | |
2018 | TMM | Arbitrary-Oriented Scene Text Detection via Rotation Proposals | Jianqi Ma, et al. | Paper/Code |
2018 | ECCV | TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes | Shangbang Long, Jiaqiang Ruan, et al. | |
2018 | CVPR | Geometry-Aware Scene Text Detection with Instance Transformation Network | Fangfang Wang, et al. | Paper/Code |
2018 | CVPR | Rotation-Sensitive Regression for Oriented Scene Text Detection | Minghui Liao, Xiang Bai, et al. | |
2018 | CVPR | AON: Towards Arbitrarily-Oriented Text Recognition | ||
2018 | CVPR | FOTS: Fast Oriented Text Spotting With a Unified Network | ||
2018 | CVPR | Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation | ||
--- | --- | --- | --- | |
2017 | TIP | Tracking Based Multi-Orientation Scene Text Detection: A Unified Framework With Dynamic Programming | Chun Yang, Junchi Yan, et al. | |
2017 | ICCV | Deep Direct Regression for Multi-Oriented Scene Text Detection | ||
2017 | ICCV | Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework | <span style="white-space:nowrap;">M. Busta, L. Neumann, Jiri Matas </span> | Paper/Code |
2017 | CVPR | <span style="white-space:nowrap;">Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection </span> | Yuliang Liu, Lianwen Jin | Paper/Code |