Home

Awesome

Content Based Image Retrieval for Satellite Data

<a target="_blank" href="https://colab.research.google.com/github/amirafshari/rs-cbir/blob/main/vision-language-search.ipynb"> <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/> </a> <a target="_blank" href="https://colab.research.google.com/github/amirafshari/rs-cbir/blob/main/image-search.ipynb"> <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/> </a>

With the rapid advancement of remote sensing technologies, the availability of large-scale satellite image datasets has grown exponentially. These datasets contain invaluable information for various applications, including environmental monitoring, urban planning, and disaster management. However, searching for specific categories of objects, such as identifying all images which are similar to a sepcific query (image/text) within a dataset of thousands or millions or even billions of samples, presents a significant challenge for human analyst.

Solution

This task, which is overwhelming for human analysts, can be efficiently addressed using vector search techniques. By leveraging deep learning models to transform images into high-dimensional vectors and utilizing various models such as classification, segmentation, etc, we can use their last layer features and employ nearest neighbor search algorithms to quickly and accurately retrieve relevant images based on their content or semantic meaning.

For instance, you find and interesting shape in your dataset and you want to figure out if there is any similar image in your dataset or not? To do so, you can use that image as a search query to find the similar images.

Or in a Vision-Language model, you can search for "red ship floating on the ocean" and use it as the query and the system provides you the appropriate instances of the dataset while there is no metadata available.

Good to know

Results

Visual Search

Query: Church (Image)

jpg

Query: Mountain (Image)

jpg

Query: Parking (Image)

jpg

Query: Baseballfield (Image)

jpg

Vision Language (Multimodal)

Query: Stadium (Text)

jpg