Retrieval-Augmented Open-Vocabulary Object Detection

This is the official implementation of the CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".

Jooyeon Kim*, Eulrang Cho*, Sehyung Kim, Hyunwoo J. Kim.

Department of Computer Science and Engineering, Korea University

Introduction

This repository is organized into multiple branches.

Each of the other branches integrates RALF with an existing open-vocabulary object detection (OVD) model.

Results

COCO

| Model | $\text{AP}^\text{N}_\text{50}$ |
|---|---|
| RALF + OADP | 33.4 |
| RALF + Object-Centric-OVD | 41.3 |

LVIS

| Model | $\text{AP}_\text{r}$ |
|---|---|
| RALF + OADP | 21.9 |
| RALF + DetPro | 21.1 |
| RALF + Object-Centric-OVD | 18.5 |

Citation

@inproceedings{kim2024retrieval,
  title={Retrieval-Augmented Open-Vocabulary Object Detection},
  author={Kim, Jooyeon and Cho, Eulrang and Kim, Sehyung and Kim, Hyunwoo J},
  booktitle={CVPR},
  year={2024}
}

References

This code builds on CLIP, V3Det, GPT-3, OADP, Object-Centric-OVD, and DetPro.