Home

Awesome

DDPN

This project is the implementation of the paper Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding.The network architecture with DDPN for our visual grounding model is illustrated in Figure 1.

<img src="https://github.com/XiangChenchao/DDPN/raw/master/images/DDPN.jpg" alt="Figure 1: The model architecture for our visual grounding model." width="60%"/> <center>Figure 1: The model network architecture for our visual grounding model.</center>

Requirements

Pretrained Models

We release the trained models on four datasets, which achieve slightly better results than that shown in the paper.

DatasetsFlickr30k-EntitiesReferitRefcocoRefcoco+
val72.78%63.77%76.61%64.34%
test73.45%63.27%76.23%64.01%
testA79.99%71.24%
testB72.11%55.55%
  1. Download pretrained models BaiduYun
  2. Unzip the model files in directory './pretrained_model'.

Preprocess

Training

Testing

Citation

If the codes are helpful for your research, please cite

@article{yu2018rethining,
  title={Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding},
  author={Yu, Zhou and Yu, Jun and Xiang, Chenchao and Zhao, Zhou and Tian, Qi and Tao, Dacheng},
  journal={International Joint Conference on Artificial Intelligence (IJCAI)},
  year={2018}
}