Awesome
vqa-detection
Object detection-based Visual Question Answering
Project status
Dependencies
To execute this, you must have Python 3.6.*, PyTorch, Visdom OpenCV, Numpy and Matplotlib installed, to accomplish this, we recommend installing the Anaconda Python distribution and use conda to install the dependencies, as it follows:
conda install pytorch torchvision cuda80 -c soumith
conda install opencv -c conda-forge
conda install matplotlib numpy
conda install aria2 -c bioconda
pip install -U visual-genome
pip install -U visdom
Dataset download
You must download the Visual Genome dataset, as well the train/val/test split used for our experiments. For this, we provide the download_dataset.sh
bash script, it will take care of the downloads required.