Awesome

DQ-DETR

This is the official pytorch implementation of our AAAI 2023 paper DQ-DETR. Code will be available soon!

Authors: Shilong Liu, Yaoyuan Liang, Feng Li, Shijia Huang, Hao Zhang, Hang Su, Jun Zhu, Lei Zhang

phrase extraction and grounding(PEG)

PEG requires a model to extract phrases from text and locate objects from image simultaneously. As phrase extraction can be regarded as a 1D text segmentation problem, we formulate PEG as a dual detection problem.

tasks_comparison

To evaluate the performance of PEG, we also propose a new metric CMAP (cross-modal average precision), analogous to the AP metric in object detection.

cmap

DQ-DETR

As phrase extraction can be regarded as a 1D text segmentation problem, we formulate PEG as a dual detection problem.

dq-detr

Experiments

refcoco flickr30k