Prevalent: A Pretrained Generic VLN Agent

This repository contains the source code for reproducing the results presented in the paper:

Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training, CVPR 2020 <br> Weituo Hao*, Chunyuan Li*, Xiujun Li, Lawrence Carin, Jianfeng Gao

<p align="center"> <img width="460" height="300" src=pretrain_finetune.png> </p>

Pretrain

<p align="center"> <kbd> <img width="600" height="250" src=pre_training_scheme.png> </kbd> </p>

Our collected triplets can be downloaded here

The pretrained model can be downloaded here

R2R

Please check here for experiment setup
Please check here for PREVALENT application

CVDN

Please check here for experiment setup
Please check here for PREVALENT application

HANNA

Please check here for experiment setup
Please check here for PREVALENT application

Citation

If you use this code for your research, please cite our paper:

@inproceedings{hao2020prevalent,
  title={Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training},
  author={Hao, Weituo and Li, Chunyuan and Li, Xiujun and Carin, Lawrence and Gao, Jianfeng},
  booktitle={Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2020}
}