SLViT

Code for the IJCAI 2023 paper 'SLViT: Scale-Wise Language-Guided Vision Transformer for Referring Image Segmentation'.

Datasets

Follow the instructions in the ./refer directory to set up the subdirectories and download the annotations. This directory contains a clone of the refer public API, excluding two unnecessary data files.

Download the images from COCO. Use the first download link, "2014 Train images", and extract the downloaded train_2014.zip file to ./refer/data/images/mscoco/images.
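The extraction step can also be scripted. The following is a minimal sketch, not part of this repository: the helper name extract_archive is hypothetical, and it assumes the archive has already been downloaded and that the script is run from the repository root.

```python
import zipfile
from pathlib import Path


def extract_archive(zip_path, dest_dir):
    """Extract every member of zip_path into dest_dir and return the member names.

    Hypothetical helper for illustration; it is not part of the SLViT code.
    """
    dest = Path(dest_dir)
    # Create the target directory (and any missing parents) if needed.
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
        return archive.namelist()


# Example call with the paths given above (assumes the zip is in the
# current directory):
# extract_archive("train_2014.zip", "./refer/data/images/mscoco/images")
```

The archive is extracted as-is, so any top-level folder inside the zip is preserved under the target directory.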

Environment

python 3.7.0
pytorch 1.7.1
torchvision 0.8.2
torchaudio 0.7.2
mmcv-full 1.3.12
mmsegmentation 0.17.0
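To confirm that an existing environment matches the versions listed above, a small check like the one below may help. It is a sketch rather than part of this repository: the function names are hypothetical, it relies on each package exposing a __version__ attribute, and it ignores local build tags such as the "+cu110" suffix some PyTorch builds report. The Python version itself is not checked.

```python
import importlib

# Versions listed above, keyed by importable module name.
# mmcv-full is imported as `mmcv`, and mmsegmentation as `mmseg`.
EXPECTED = {
    "torch": "1.7.1",
    "torchvision": "0.8.2",
    "torchaudio": "0.7.2",
    "mmcv": "1.3.12",
    "mmseg": "0.17.0",
}


def installed_version(module_name):
    """Return the module's __version__ without a local build tag,
    or None if the module cannot be imported or has no __version__."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    version = getattr(module, "__version__", None)
    return version.split("+")[0] if version else None


def find_mismatches(expected, installed):
    """Return {name: (expected, found)} for every entry that differs."""
    return {
        name: (want, installed.get(name))
        for name, want in expected.items()
        if installed.get(name) != want
    }


def check_environment():
    """Compare the installed packages against EXPECTED."""
    installed = {name: installed_version(name) for name in EXPECTED}
    return find_mismatches(EXPECTED, installed)
```

Calling check_environment() returns an empty dict when every listed package matches; otherwise each entry maps a module name to the expected and found versions, with None meaning the package could not be imported.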

Reference

LAVT
SegNeXt