vqa-detection

Object detection-based Visual Question Answering

Dependencies

To run this project, you need Python 3.6.*, PyTorch, Visdom, OpenCV, NumPy and Matplotlib installed. We recommend installing the Anaconda Python distribution and using conda to install the dependencies, as follows:

conda install pytorch torchvision cuda80 -c soumith
conda install opencv -c conda-forge
conda install matplotlib numpy
conda install aria2 -c bioconda
pip install -U visual-genome
pip install -U visdom
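
After installation, a quick check can confirm that the Python packages are visible to the interpreter. The following is a minimal sketch, not part of the repository: the helper `missing_modules` is hypothetical, and the import names in `REQUIRED_MODULES` (for example `cv2` for OpenCV and `visual_genome` for the visual-genome package) are assumptions about how each package is exposed.

```python
import importlib.util

# Import names assumed to correspond to the packages installed above.
# These are assumptions: adjust them if a package uses a different import name.
REQUIRED_MODULES = [
    "torch",
    "torchvision",
    "cv2",            # OpenCV
    "numpy",
    "matplotlib",
    "visdom",
    "visual_genome",  # assumed import name of the visual-genome package
]


def missing_modules(names):
    """Return the top-level module names that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules: " + ", ".join(missing))
    else:
        print("All dependencies found.")
```

The check only locates the modules without importing them, so it does not verify versions or CUDA support.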

Dataset download

You must download the Visual Genome dataset, as well as the train/val/test split used for our experiments. For this, we provide the download_dataset.sh bash script, which takes care of the required downloads.