Home

Awesome

Fully Convolutional Instance-aware Semantic Segmentation

The major contributors of this repository include Haozhi Qi, Yi Li, Guodong Zhang, Haochen Zhang, Jifeng Dai, and Yichen Wei.

Introduction

FCIS is a fully convolutional end-to-end solution for instance segmentation, which won the first place in COCO segmentation challenge 2016.

FCIS is initially described in a CVPR 2017 spotlight paper. It is worth noticing that:

Resources

  1. Visual results on the first 5k images from COCO test set of our COCO 2016 challenge entry: OneDrive.
  2. Slides in ImageNet ILSVRC and COCO workshop 2016: OneDrive.
<img src='data/readme_img.png' width='800'>

Disclaimer

This is an official implementation for Fully Convolutional Instance-aware Semantic Segmentation (FCIS) based on MXNet. It is worth noticing that:

License

© Microsoft, 2017. Licensed under an MIT license.

Citing FCIS

If you find FCIS useful in your research, please consider citing:

@inproceedings{li2016fully,
  Author = {Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji and Yichen Wei}
  Title = {Fully Convolutional Instance-aware Semantic Segmentation},
  Conference = {CVPR},
  year = {2017}
}

Main Results

training datatesting datamAP^r@0.5mAP^r@0.7time
FCIS, ResNet-v1-101VOC 2012 trainVOC 2012 val66.051.90.23s
<sub>training data</sub><sub>testing data</sub><sub>mAP^r</sub><sub>mAP^r@0.5</sub><sub>mAP^r@0.75</sub><sub>mAP^r@S</sub><sub>mAP^r@M</sub><sub>mAP^r@L</sub>
<sub>FCIS, ResNet-v1-101, OHEM </sub><sub>coco trainval35k</sub><sub>coco minival</sub>29.250.829.77.931.451.1
<sub>FCIS, ResNet-v1-101, OHEM </sub><sub>coco trainval35k</sub><sub>coco test-dev</sub>29.651.430.28.031.049.7

Running time is counted on a single Maxwell Titan X GPU (mini-batch size is 1 in inference).

Requirements: Software

  1. MXNet from the offical repository. We tested our code on MXNet@(commit 62ecb60). Due to the rapid development of MXNet, it is recommended to checkout this version if you encounter any issues. We may maintain this repository periodically if MXNet adds important feature in future release.

  2. Python packages might missing: cython, opencv-python >= 3.2.0, easydict. If pip is set up on your system, those packages should be able to be fetched and installed by running

    pip install Cython
    pip install opencv-python==3.2.0.6
    pip install easydict==1.6
    pip install hickle
    
  3. For Windows users, Visual Studio 2015 is needed to compile cython module.

Requirements: Hardware

Any NVIDIA GPUs with at least 5GB memory should be OK

Installation

  1. Clone the FCIS repository, and we'll call the directory that you cloned FCIS as ${FCIS_ROOT}.
git clone https://github.com/msracver/FCIS.git
  1. For Windows users, run cmd .\init.bat. For Linux user, run sh ./init.sh. The scripts will build cython module automatically and create some folders.

  2. Install MXNet:

    Note: The MXNet's Custom Op cannot execute parallelly using multi-gpus after this PR. We strongly suggest the user rollback to version MXNet@(commit 998378a) for training (following Section 3.2 - 3.6).

    Quick start

    3.1 Install MXNet and all dependencies by

    pip install -r requirements.txt
    

    If there is no other error message, MXNet should be installed successfully.

    Build from source (alternative way)

    3.2 Clone MXNet and checkout to MXNet@(commit 998378a) by

    git clone --recursive https://github.com/dmlc/mxnet.git
    git checkout 998378a
    git submodule init
    git submodule update
    

    3.3 Copy channel operators in $(FCIS_ROOT)/fcis/operator_cxx to $(YOUR_MXNET_FOLDER)/src/operator/contrib by

    cp -r $(FCIS_ROOT)/fcis/operator_cxx/channel_operator* $(MXNET_ROOT)/src/operator/contrib/
    

    3.4 Compile MXNet

    cd ${MXNET_ROOT}
    make -j $(nproc) USE_OPENCV=1 USE_BLAS=openblas USE_CUDA=1 USE_CUDA_PATH=/usr/local/cuda USE_CUDNN=1
    

    3.5 Install the MXNet Python binding by

    Note: If you will actively switch between different versions of MXNet, please follow 3.5 instead of 3.4

    cd python
    sudo python setup.py install
    

    3.6 For advanced users, you may put your Python packge into ./external/mxnet/$(YOUR_MXNET_PACKAGE), and modify MXNET_VERSION in ./experiments/fcis/cfgs/*.yaml to $(YOUR_MXNET_PACKAGE). Thus you can switch among different versions of MXNet quickly.

Demo

  1. To run the demo with our trained model (on COCO trainval35k), please download the model manually from OneDrive (Chinese users can also get it from BaiduYun with code tmd4), and put it under folder model/.

    Make sure it looks like this:

    ./model/fcis_coco-0000.params
    
  2. Run

    python ./fcis/demo.py
    

Preparation for Training & Testing

  1. Please download VOC 2012 dataset with additional annotations from SBD. Move inst, cls, img folders to VOCdevit and make sure it looks like this:

    Please use the train&val split in this repo, which follows the protocal of SDS.

    .data/VOCdevkit/VOCSDS/img/
    .data/VOCdevkit/VOCSDS/inst/
    .data/VOCdevkit/VOCSDS/cls/
    
  2. Please download COCO dataset and annotations for the 5k image minival subset and val2014 minus minival (val35k). Make sure it looks like this:

    .data/coco/
    .data/coco/annotations/instances_valminusminival2014.json
    .data/coco/annotations/instances_minival2014.json
    
  3. Please download ImageNet-pretrained ResNet-v1-101 model manually from OneDrive, and put it under folder ./model. Make sure it looks like this:

    ./model/pretrained_model/resnet_v1_101-0000.params
    

Usage

  1. All of our experiment settings (GPU #, dataset, etc.) are kept in yaml config files at folder ./experiments/fcis/cfgs.
  2. Two config files have been provided so far: FCIS@COCO with OHEM and FCIS@VOC without OHEM. We use 8 and 4 GPUs to train models on COCO and on VOC, respectively.
  3. To perform experiments, run the python scripts with the corresponding config file as input. For example, to train and test FCIS on COCO with ResNet-v1-101, use the following command
    python experiments/fcis/fcis_end2end_train_test.py --cfg experiments/fcis/cfgs/resnet_v1_101_coco_fcis_end2end_ohem.yaml
    
    A cache folder would be created automatically to save the model and the log under output/fcis/coco/ or output/fcis/voc/.
  4. Please find more details in config files and in our code.

Misc.

Code has been tested under: