Scene Understanding Challenge for Autonomous Navigation in Unstructured Environments
Code for working with the dataset used in the Scene Understanding Challenge for Autonomous Navigation in Unstructured Environments. For details on getting the dataset and for updates, see:
- https://cvit.iiit.ac.in/autonue2021/
- https://cvit.iiit.ac.in/autonue2019/
- http://cvit.iiit.ac.in/autonue2018/
- http://cvit.iiit.ac.in/scene-understanding-challenge-2018/
AutoNUE 2021 (Domain Adaptation and Semantic Segmentation)
This part of the repository covers the datasets for the domain adaptation and segmentation challenges of AutoNUE 2021, a CVPR workshop. For more details, please visit https://cvit.iiit.ac.in/autonue2021/challenge. For the segmentation challenge, please skip the "Source datasets" section below.
Source datasets:
Participants are requested to download the datasets from the original websites, listed below for easy reference:
- https://www.mapillary.com/dataset/vistas?pKey=q0GhQpk20wJm1ba1mfwJmw
- https://bdd-data.berkeley.edu/ (you might have to click on the "Advanced" tab and then on "proceed to bdd-data.berkeley.edu")
- https://download.visinf.tu-darmstadt.de/data/from_games/
- https://www.cityscapes-dataset.com/examples/#fine-annotations (only the fine annotations are to be used)
After downloading all the source datasets, move them to the folder ./domain_adaptation/source/datasets/. The folder structure should be as follows:
datasets
|--mapillary-vistas-dataset_public_v1.1/
| |--training/
| | |--images/
| | |--labels/
| |--validation/
| | |--images/
| | |--labels/
| |--testing/
| | |--images/
|--bdd100k/
| |--seg/
| | |--images/
| | | |--train/
| | | |--val/
| | | |--test/
| | |--labels/
| | | |--train/
| | | |--val/
|--gta/
| |--images/
| |--labels/
|--cityscapes/
| |--gtFine/
| | |--train/
| | |--val/
| | |--test/
| |--leftImg8bit/
| | |--train/
| | |--val/
| | |--test/
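To sanity-check the layout before running the preparation script, a minimal sketch along these lines can help (the folder names are the ones listed above; adjust if your dataset versions differ):

import os

root = "domain_adaptation/source/datasets"
required = [
    "mapillary-vistas-dataset_public_v1.1/training/images",
    "bdd100k/seg/images/train",
    "gta/images",
    "cityscapes/gtFine/train",
]
for rel in required:
    path = os.path.join(root, rel)
    # Report which of the expected dataset folders are in place.
    print(("OK      " if os.path.isdir(path) else "MISSING ") + path)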
Run the following commands:
pip3 install -r requirements.txt
chmod +x domain_adaptation/source/prep_all.sh
./domain_adaptation/source/prep_all.sh
This will create a folder "domain_adaptation/source/source_datasets_dir/" containing the images and annotations of the source dataset to be used for any of the domain adaptation challenges.
Target datasets:
To use the scripts, first add helpers/ to $PYTHONPATH:
export PYTHONPATH="${PYTHONPATH}:helpers/"
Dataset Structure
The structure is similar to the cityscapes dataset. That is:
gtFine/{split}/{drive_no}/{img_id}_gtFine_polygons.json for ground truths
leftImg8bit/{split}/{drive_no}/{img_id}_leftImg8bit.png for image frames
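For orientation, a minimal sketch that pairs each image frame with its ground-truth polygon file (the dataset root below is a placeholder):

import glob
import os

root = "path/to/anue"  # placeholder dataset root
for img in glob.glob(os.path.join(root, "leftImg8bit", "*", "*", "*_leftImg8bit.png")):
    # Swap the directory name, then the filename suffix, to get the ground-truth path.
    gt = img.replace("leftImg8bit", "gtFine", 1).replace("_leftImg8bit.png", "_gtFine_polygons.json")
    if os.path.exists(gt):
        print(img, "->", gt)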
Semantic Segmentation
Furthermore, for training, label masks need to be generated as described below, resulting in the following files:
gtFine/{split}/{drive_no}/{img_id}_gtFine_labellevel3Ids.png
gtFine/{split}/{drive_no}/{img_id}_gtFine_instancelevel3Ids.png
Labels
See helpers/anue_labels.py
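helpers/anue_labels.py is in the spirit of the cityscapes labels.py. Assuming it likewise exposes a labels list whose entries carry the id fields named in this README (an assumption; check the file for the exact attribute names), lookup tables can be built like this:

from anue_labels import labels  # requires helpers/ on $PYTHONPATH

# Attribute names (name, level3Id, ...) are assumed to mirror the
# cityscapes labels.py convention; verify against anue_labels.py.
level3id_to_name = {label.level3Id: label.name for label in labels}
print(level3id_to_name)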
Generate Label Masks (for training/evaluation) (Semantic/Instance/Panoptic Segmentation)
python preperation/createLabels.py --datadir $ANUE --id-type $IDTYPE --color [True|False] --instance [True|False] --num-workers $C
- ANUE is the path to the AutoNUE dataset
- IDTYPE can be id, csId, csTrainId, level3Id, level2Id, level1Id.
- color True generates the color masks
- instance True generates the instance masks with the id given by IDTYPE
- panoptic True generates panoptic masks in the format similar to COCO. See the modified evaluation scripts here: https://github.com/AutoNUE/panopticapi
- C is the number of threads to run in parallel
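For example, a single invocation combining these flags to generate level3Id masks together with color and instance masks using 4 workers (values are illustrative):

python preperation/createLabels.py --datadir $ANUE --id-type level3Id --color True --instance True --num-workers 4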
For the supervised domain adaptation and semantic segmentation tasks, the masks should be generated using IDTYPE of level3Id and used for training models (similar to trainId in cityscapes). This can be done by the command:
python preperation/createLabels.py --datadir $ANUE --id-type level3Id --num-workers $C
The following commands generate the target labels for the other domain adaptation tasks:
python3 preperation/createLabels.py --datadir $ANUE --id-type level3Id --num-workers $C --semisup_da True
python3 preperation/createLabels.py --datadir $ANUE --id-type level3Id --num-workers $C --weaksup_da True
python3 preperation/createLabels.py --datadir $ANUE --id-type level3Id --num-workers $C --unsup_da True
The bounding box labels for weakly supervised domain adaptation can be downloaded from here: https://github.com/AutoNUE/public-code/tree/master/domain_adaptation/target/weakly-supervised
The generated files:
- _gtFine_labellevel3Ids.png will be used for semantic segmentation
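As a quick check of the generated masks, a minimal sketch (assuming numpy and Pillow are installed; the path below is a placeholder):

import numpy as np
from PIL import Image

# Placeholder path; point this at one of the generated level3Id masks.
mask = np.array(Image.open("gtFine/train/<drive_no>/<img_id>_gtFine_labellevel3Ids.png"))
print("level3Ids present:", np.unique(mask))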
AutoNUE 2019
To use the scripts, first add helpers/ to $PYTHONPATH:
export PYTHONPATH="${PYTHONPATH}:helpers/"
The code has been tested with Python 3.6.4.
Dataset Structure
The structure is similar to the cityscapes dataset. That is:
gtFine/{split}/{drive_no}/{img_id}_gtFine_polygons.json for ground truths
leftImg8bit/{split}/{drive_no}/{img_id}_leftImg8bit.png for image frames
Semantic Segmentation and Instance Segmentation
Furthermore, for training, label masks need to be generated as described below, resulting in the following files:
gtFine/{split}/{drive_no}/{img_id}_gtFine_labellevel3Ids.png
gtFine/{split}/{drive_no}/{img_id}_gtFine_instancelevel3Ids.png
Panoptic Challenge
Furthermore, for training, panoptic masks need to be generated as described below, resulting in the following files:
gtFine/{split}_panoptic/{drive_no}_{img_id}_gtFine_panopticlevel3Ids.png
gtFine/{split}_panoptic.json
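Since the panoptic masks follow a COCO-like format, the json can be inspected with a minimal sketch like the one below (the keys used are the standard COCO panoptic ones, which is an assumption here; verify against the generated file):

import json

# Assumes panoptic masks were generated for the train split.
with open("gtFine/train_panoptic.json") as f:
    data = json.load(f)
for ann in data["annotations"][:3]:
    print(ann["file_name"], "-", len(ann["segments_info"]), "segments")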
Detection
The structure is broadly similar to the Pascal VOC dataset:
- JPEGImages/<capture_category>/<drive sequence>/<>.jpg for images
- Annotations/<capture_category>/<drive sequence>/<>.xml for Annotations
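A minimal sketch for reading one annotation file, assuming the standard Pascal VOC fields (object/name and bndbox; verify against an actual file; the path is a placeholder):

import xml.etree.ElementTree as ET

# Placeholder path; point this at one of the Annotations/.../*.xml files.
root = ET.parse("path/to/annotation.xml").getroot()
for obj in root.iter("object"):
    name = obj.findtext("name")
    box = obj.find("bndbox")
    xmin, ymin, xmax, ymax = (int(float(box.findtext(k))) for k in ("xmin", "ymin", "xmax", "ymax"))
    print(name, xmin, ymin, xmax, ymax)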
Labels
See helpers/anue_labels.py
Generate Label Masks (for training/evaluation) (Semantic/Instance/Panoptic Segmentation)
python preperation/createLabels.py --datadir $ANUE --id-type $IDTYPE --color [True|False] --instance [True|False] --num-workers $C
- ANUE is the path to the AutoNUE dataset
- IDTYPE can be id, csId, csTrainId, level3Id, level2Id, level1Id.
- color True generates the color masks
- instance True generates the instance masks with the id given by IDTYPE
- panoptic True generates panoptic masks in the format similar to COCO. See the modified evaluation scripts here: https://github.com/AutoNUE/panopticapi
- C is the number of threads to run in parallel
For the semantic segmentation challenge, masks should be generated using IDTYPE of level3Id and used for training models (similar to trainId in cityscapes). This can be done by the command:
python preperation/createLabels.py --datadir $ANUE --id-type level3Id --num-workers $C
For the instance segmentation challenge, instance masks should be generated by the following command:
python preperation/createLabels.py --datadir $ANUE --id-type id --num-workers $C
The generated files:
- _gtFine_labellevel3Ids.png will be used for semantic segmentation
- _gtFine_instanceids.png will be used for instance segmentation
- _gtFine_panopticlevel3Ids.png will be used for panoptic segmentation, under the folder gtFine/{split}_panoptic together with gtFine/{split}_panoptic.json
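The instance masks appear to follow the cityscapes encoding, where a pixel belonging to a "thing" instance stores label_id * 1000 + instance_index, while "stuff" pixels store the plain label id. A minimal decoding sketch under that assumption (the path is a placeholder; requires numpy and Pillow):

import numpy as np
from PIL import Image

# Placeholder path; point this at a generated instance mask.
inst = np.array(Image.open("gtFine/train/<drive_no>/<img_id>_gtFine_instanceids.png"))
label_ids = np.where(inst >= 1000, inst // 1000, inst)      # assumed cityscapes-style encoding
instance_index = np.where(inst >= 1000, inst % 1000, 0)
print("label ids present:", np.unique(label_ids))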
Detection
We use a subset of the labels from helpers/anue_labels.py:
- person (level3Id: 4, trainable: True)
- rider (level3Id: 5, trainable: True)
- car (level3Id: 9, trainable: True)
- truck (level3Id: 10, trainable: True)
- bus (level3Id: 11, trainable: True)
- motorcycle (level3Id: 6, trainable: True)
- bicycle (level3Id: 7, trainable: True)
- autorickshaw (level3Id: 8, trainable: True)
- animal (level3Id: 4, trainable: True)
- traffic light (level3Id: 18, trainable: True)
- traffic sign (level3Id: 19, trainable: True)
- vehicle fallback (level3Id: 12, trainable: False)
- caravan (level3Id: 12, trainable: False)
- trailer (level3Id: 12, trainable: False)
- train (level3Id: 12, trainable: False)
Note: we train based on level3Ids, using only the labels marked trainable, and report accuracies on them.
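For convenience, the trainable classes above as a lookup in a short sketch (names and ids copied from the list; note that person and animal share level3Id 4):

# Trainable detection labels and their level3Ids, as listed above.
DET_CLASSES = {
    "person": 4, "animal": 4, "rider": 5, "motorcycle": 6, "bicycle": 7,
    "autorickshaw": 8, "car": 9, "truck": 10, "bus": 11,
    "traffic light": 18, "traffic sign": 19,
}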
Viewer
First generate label masks as described above. To view the ground truth / prediction masks at different levels of the hierarchy, use:
python viewer/viewer.py --datadir $ANUE
- ANUE is the folder path to the dataset, or to prediction masks with a file/folder structure similar to the dataset.
TODO: Make the color map more sensible.
Evaluation
Semantic Segmentation
First generate label masks with level3Ids as described above. Then run:
python evaluate/evaluate_mIoU.py --gts $GT --preds $PRED --num-workers $C
- GT is the folder path of ground truths containing <drive_no>/<img_no>_gtFine_labellevel3Ids.png
- PRED is the folder paths of predictions with the same folder structure and file names.
- C is the number of threads to run in parallel
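For reference, the score reported is the standard mean IoU over classes. A minimal, self-contained sketch of that computation (an illustration, not the challenge script; assumes gt and pred are numpy arrays of level3Ids, with ignored pixels set to values >= num_classes):

import numpy as np

def mean_iou(gt, pred, num_classes):
    """Mean IoU over classes, computed from a confusion matrix."""
    valid = (gt < num_classes) & (pred < num_classes)  # drop ignored pixels
    cm = np.bincount(num_classes * gt[valid].astype(np.int64) + pred[valid],
                     minlength=num_classes ** 2).reshape(num_classes, num_classes)
    inter = np.diag(cm)
    union = cm.sum(axis=0) + cm.sum(axis=1) - inter
    ious = inter[union > 0] / union[union > 0]  # skip classes absent from both
    return ious.mean()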
Constrained Semantic Segmentation
First generate label masks with level1Ids as described above. Then run:
python evaluate/idd_lite_evaluate_mIoU.py --gts $GT --preds $PRED --num-workers $C
- GT is the folder path of ground truths containing <drive_no>/<img_no>_gtFine_labellevel1Ids.png
- PRED is the folder paths of predictions with the same folder structure and file names.
- C is the number of threads to run in parallel
Instance Segmentation
First generate instance label masks with IDTYPE=id, as described above. Then run:
python evaluate/evaluate_instance_segmentation.py --gts $GT --preds $PRED
- GT is the folder path of ground truths containing <drive_no>/<img_no>_gtFine_instanceids.png
- PRED is the folder path of predictions with the same folder structure and file names. The prediction format is the same as in the cityscapes dataset: a .txt file where each line is of the form "<instance_mask_png> <label id> <conf score>". Note that IDTYPE=id is used by this evaluation code.
- C is the number of threads to run in parallel
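A minimal sketch of writing one such prediction file in the format just described (all names and values are hypothetical):

# Each line: <instance_mask_png> <label id> <conf score>; label ids use IDTYPE=id.
lines = ["masks/024541_000.png 26 0.98"]  # hypothetical mask file, label id, confidence
with open("024541_pred.txt", "w") as f:
    f.write("\n".join(lines) + "\n")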
Panoptic Segmentation
Please use https://github.com/AutoNUE/panopticapi
Detection
python evaluate/evaluate_detection.py --gts $GT --preds $PRED
- GT is the folder path of ground truths containing Annotations/<capture_category>/<drive sequence>/<>.xml
- PRED is the folder path of predictions, with outputs generated in the format idd_det_<image_set>_<level3Id>.txt. Here image_set can be one of {train,val,test}, and a file for every trainable level3Id has to be present.
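The per-line format inside these files is not spelled out here; assuming the Pascal VOC submission convention used by py-faster-rcnn (one detection per line: image identifier, confidence, then box corners), a prediction file might be written like this (a sketch; verify against evaluate_detection.py):

# One file per (image_set, level3Id) pair, e.g. cars (level3Id 9) on val.
# Assumed line format (Pascal VOC convention): <image id> <confidence> <xmin> <ymin> <xmax> <ymax>
with open("idd_det_val_9.txt", "w") as f:
    f.write("frame0001 0.91 120 85 640 410\n")  # hypothetical detection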
Acknowledgement
Some of the code was adapted from:
- the cityscapes scripts: https://github.com/mcordts/cityscapesScripts/
- py-faster-rcnn: https://github.com/rbgirshick/py-faster-rcnn
- panopticapi: https://github.com/cocodataset/panopticapi