Awesome
DVTOD
Data Collection
- Camera equipment: We use DJI MAVIC 2 Enterprise Advanced drone to acquire all image pairs. (a) Thermal camera: the sensor is an uncooled vanadium oxide microbolometer, image resolution is 640×512, pixel pitch is 12μm, and wavelength range is 8-14μm. (b) Visible camera: the sensor is a 1/2 inch complementary metal oxide semiconductor (CMOS), the effective pixels are 48 million, the viewing angle is 84°, the video resolution is 1920×1080, the equivalent focal length is 24 mm, and the aperture is f/ 2.8.
- Data collection: The dataset contains 2179 visible-thermal image pairs and covers different objects and challenging scenes. As shown in Fig. 1, during the data collection, we set different challenging attributes to simulate the distribution of real data from the perspective of changes in illumination, severe weather, occlusion, and diversity of objects.
Data Annotation
As shown in the top right corner of Fig. 1, when the object is in some challenging scenes, such as special material occlusion, or overexposure scenes, the image quality of the object is extremely poor or even disappears. To handle this situation, we choose the visible modality for annotation in special scenes and the thermal modality for annotation in the rest of the cases. Ultimately, a pair of images share a single label.
Summary
- Visible-thermal image pairs are misaligned images acquired with drone, which provides the basis for researchers to investigate misaligned multispectral object detection methods.
- The acquired image pairs contain more challenging attributes, such as rain, snow, fog, extreme exposure, darkness, and special material occlusion, which are more in line with the real-world data distribution.
- The dataset is of higher quality than other dataset image pairs, with a resolution of 1920×1080 for visible images and 640×512 for thermal images.
- The dataset contains data at different times of the day and during all seasons of the year.
Download the dataset and code
The dataset and code are available at:https://pan.baidu.com/s/1KMHdbwQwwY0dkEy6TkJC6A?pwd=wqnq
Paper
Misaligned Visible-Thermal Object Detection A Drone-based Benchmark and Baseline.pdf
https://ieeexplore.ieee.org/document/10522939
Related Works of UAV RGB-T
UAV-RGB-T-2400 https://github.com/VDT-2048/UAV-RGB-T-2400
https://ieeexplore.ieee.org/document/10315195
Related Survey
RGB-T Image Analysis Technology and Application: A Survey [J]. Engineering Applications of Artificial Intelligence, 2023, 120, 105919.
https://www.sciencedirect.com/science/article/abs/pii/S0952197623001033
2023-RGB-T Image Analysis Technology and Application A Survey.pdf