Awesome
Disentangling Visual Embeddings for Attributes and Objects (OADis)
This repository provides dataset splits and code for Paper:
Disentangling Visual Embeddings for Attributes and Objects, CVPR 2022
Nirat Saini, Khoi Pham, Abhinav Shrivastava
VAW-CZSL Dataset
We provide compositional splits for Generalized CZSL, following prior works:
The dataset and splits can be downloaded from: VAW-CZSL. This folder has a jupyter notebook vaw_dataset_orig.ipynb
, and folder named compositional-split-natural
. The folder also has metadata
file which splits image ids for each split.
compositional-split-natural
: lists attribute-object pairs for each split [training, validation and testing]. Images folfder has all relevant images used in VAW-CZSL dataset.vaw_dataset_orig.ipynb
explains the steps for creation of splits, more details can also be found in Supplementary material. This file build the dataset splits from scratch.
For building split files and metedata files from scratch, you need
- The VAW-dataset from the website: VAW.
- Some images are part of Visual Genome, and can be downloaded from the official website.
Code Instructions:
Pre-requisites:
- Update the path for dataset images and log file in the config/*.yml files.
- Download and dump the pre-trained models from here to a folder named
saved_models
- The compositional splits for MIT-states and UT-Zappos can be downloaded from: https://www.senthilpurushwalkam.com/publication/compositional/compositional_split_natural.tar.gz
To run OADis for MIT-States Dataset:
Training:
python train.py --cfg config/mit-states.yml
Testing:
python test.py --cfg config/mit-states.yml --load mit_final.pth
Similar instructions can be used for other datasets: UT-Zappos and VAW-CZSL. The code works well, and is tested for:
Pytorch - 1.6.0+cu92
Python - 3.6.12
tensorboardx - v2.4
For more qualitative results and details, refer to the Project Page
For questions and queries, feel free to reach out to Nirat.
Citation
Please cite our CVPR 2022 paper if you use the this repo for OADis.
@InProceedings{Saini_2022_CVPR,
author = {Saini, Nirat and Pham, Khoi and Shrivastava, Abhinav},
title = {Disentangling Visual Embeddings for Attributes and Objects},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2022},
pages = {13658-13667}
}