Transformer-Based Multi-modal Proposal and Re-Rank for Wikipedia Image-Caption Matching

This repo contains the code for reproducing the results obtained by our team FanDani at the Wikipedia Image/Caption Matching Challenge.

<p align="center"> <img alt="Wiki Image-Caption Matching" src="teaser.png" width="500px"> </p>

Updates

Organization

This repo is organized into two separate modules, as explained in the paper: a multi-modal proposal module, located in the mcprop folder, and a re-rank module, located in the crank folder. Installation and run instructions are in the README.md file inside each folder.
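As a rough illustration of the two-stage idea named in the title (propose candidate captions, then re-rank them), here is a minimal Python sketch. It is not the code in this repo: the function names `propose` and `rerank`, the cosine-similarity proposal step, and the toy scorer are all assumptions made for illustration. The actual models and their interfaces are documented in the module READMEs.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def propose(image_vec, caption_vecs, k):
    """Stage 1 (hypothetical): return the indices of the k captions most similar to the image."""
    order = sorted(
        range(len(caption_vecs)),
        key=lambda i: cosine(image_vec, caption_vecs[i]),
        reverse=True,
    )
    return order[:k]


def rerank(image_vec, candidates, score_pair):
    """Stage 2 (hypothetical): reorder the proposed candidates with a more expensive pairwise scorer."""
    return sorted(candidates, key=lambda i: score_pair(image_vec, i), reverse=True)


# Toy data: one image embedding and four caption embeddings (made-up values).
image = [1.0, 0.0]
captions = [[0.9, 0.1], [0.0, 1.0], [0.7, 0.7], [1.0, 0.05]]

# Stage 1 keeps a small candidate set; stage 2 reorders only those candidates.
candidates = propose(image, captions, k=3)

# Stand-in for a learned pairwise model: a fixed lookup of made-up scores.
toy_scores = {0: 0.9, 2: 0.1, 3: 0.5}
final_ranking = rerank(image, candidates, lambda img, i: toy_scores[i])
```

The point of the split is cost: a cheap similarity search narrows a large caption pool to a few candidates, so a slower pairwise scorer only has to run on that short list.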

Citation

If you find this work useful for your research, please cite our paper:

@article{messina2022transformer,
  title={Transformer-Based Multi-modal Proposal and Re-Rank for Wikipedia Image-Caption Matching},
  author={Messina, Nicola and Coccomini, Davide Alessandro and Esuli, Andrea and Falchi, Fabrizio},
  journal={arXiv e-prints},
  pages={arXiv--2206},
  year={2022}
}