Home

Awesome

QD-IMD: Quick Draw Irregular Mask Dataset

Masked CelebA

Inpainting is an important computer vision task, where goal is to restore masked parts of an image. E.g., inpainting can help in erasing from photo something unwanted like passing bystander or your ex-partner.

Many recent approaches focus on rectangular shaped holes, often assumed to be center in the image. These limitations are absolutely not practical, because we often need to erase something with irregular form. That's why we need dataset with masks of irregular forms.

Guilin Liu et al. in their recent paper proposed such dataset, where source of irregular patterns were the results of occlusion/dis-occlusion mask estimation method between two consecutive frames for videos. Paper showed good results in inpainting, but we think their dataset has some weaknesses:

NVidia Irregular Mask Dataset [1]:

NVidia Irregular Mask Dataset examples

Our Irregular Mask Dataset (QD-IMD):

Quick Draw Irregular Mask Dataset examples

We decided to fight these problems and generated QD-IMD (Quick Draw Irregular Mask Dataset).

How it's generated

Our dataset is based on Quick Draw dataset (a collection of 50 million human drawings). Our hypothesis is that combination of strokes drawn by human hand is a good source of patterns for irregular masks. Here are steps for generating a single mask (default values are given in square brackets):

  1. Randomly choose number of strokes for mask [~N(4, 2)]
  2. Randomly sample strokes from Quick Draw dataset
  3. For each stroke choose width (px) [~Uniform(5, 15)] and draw it on canvas
  4. Sample upscale rate [~Uniform(1.0, 1.5)] and upscale canvas correspondingly
  5. Make central crop of target shape [(512, 512)]
  6. Binarize resulting mask

Note: all parameters can changed for a specific task

Download dataset

Dataset can be downloaded from Dropbox of Yandex.Disk.

Reproduce dataset

First clone this repository:

git clone https://github.com/karfly/qd-imd
cd qd-imd

Then download Quick Draw (simplified). It's located on Google Cloud here. You can install gsutil following these instructions and execute command:

mkdir quickdraw_simplified
gsutil -m cp -R gs://quickdraw_dataset/full/simplified/ quickdraw_simplified

(!) Note: Quick Draw (simplified) weight is ~22Gb. If it's not suitable for you, just download part of the dataset, and everything will work correctly

To reproduce dataset or generate masks with other parameters you'll need Python 3 with install packages specified in requirements.txt. You can use pip:

pip install -r requirements.txt

Then run:

python generate_dataset.py

To get detailed information about parameters execute:

python generate_dataset.py --help

Future work

References