<img src="figures/gem (3).png" width="30"> GEM: Glass-Segmentor

Jing Hao, Moyun Liu, Jinrong Yang, Kuo Feng Hung.

This repository is the official implementation of [GEM: Boost Simple Network for Glass Surface Segmentation via Vision Foundation Models](https://arxiv.org/abs/2401.15282). Our code is based on [MaskDINO](https://github.com/IDEA-Research/MaskDINO).


## Features

<img src="https://github.com/isbrycee/GEM-Glass-Segmentor/blob/main/figures/framework.jpg" width="900px">

## Installation

See Mask DINO.
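Since this code builds on MaskDINO, environment setup follows its install guide. Below is a minimal sketch assuming MaskDINO's layout applies to this repo; the Python/PyTorch versions and the CUDA-ops path are assumptions taken from MaskDINO's instructions, not verified here:

```bash
# Create an environment and install PyTorch (match the CUDA version to your system)
conda create -n gem python=3.8 -y
conda activate gem
pip install torch torchvision

# Install detectron2 from source
git clone https://github.com/facebookresearch/detectron2.git
pip install -e detectron2

# Clone this repo and install its dependencies
git clone https://github.com/isbrycee/GEM-Glass-Segmentor.git
cd GEM-Glass-Segmentor
pip install -r requirements.txt

# Compile the deformable-attention CUDA ops (path assumed from MaskDINO's layout)
cd maskdino/modeling/pixel_decoder/ops
sh make.sh
```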

## Getting Started


See Inference Demo with Pre-trained Model.

See Results.

See Preparing Datasets for GEM.

See Getting Started.


## Results

<table><tbody> <!-- START TABLE --> <!-- TABLE HEADER --> <th valign="bottom">Model</th> <th valign="bottom">Pre-trained dataset</th> <th valign="bottom">IoU</th> <th valign="bottom">F_β</th> <th valign="bottom">MAE</th> <th valign="bottom">BER</th> <th valign="bottom">FPS</th> <th valign="bottom">download</th> <tr><td align="left">GEM-Tiny | <a href="configs/gsd-s/semantic-segmentation/gem_sam_tiny_bs32_iter1w_steplr.yaml">config</a></td> <td align="center"><a href="https://pan.baidu.com/s/1GnMCKw2Rvt9BoUUvSIWPRw?pwd=jpzb">S-GSD-1x</a></td> <td align="center">0.755</td> <td align="center">0.852</td> <td align="center">0.038</td> <td align="center">8.39</td> <td align="center">16.09</td> <td align="center"><a href="https://pan.baidu.com/s/1i0i0Q-ewYX4zWMuTuqMGpg?pwd=tr14">BaiduDisk</a></td> </tr> <tr><td align="left">GEM-Tiny | <a href="configs/gsd-s/semantic-segmentation/gem_sam_tiny_bs32_iter1w_steplr.yaml">config</a></td> <td align="center">S-GSD-5x</td> <td align="center">0.757</td> <td align="center">0.855</td> <td align="center">0.035</td> <td align="center">8.54</td> <td align="center">16.09</td> <td align="center"><a href="https://pan.baidu.com/s/126FOm3G_FKBHAOv0kuuYyg?pwd=aehh">BaiduDisk</a></td> </tr> <tr><td align="left">GEM-Tiny | <a href="configs/gsd-s/semantic-segmentation/gem_sam_tiny_bs32_iter1w_steplr.yaml">config</a></td> <td align="center">S-GSD-10x</td> <td align="center">0.764</td> <td align="center">0.866</td> <td align="center">0.034</td> <td align="center">8.62</td> <td align="center">16.09</td> <td align="center"><a href="https://pan.baidu.com/s/1U5xkf0mhJejgnptq8JqBeQ?pwd=skac">BaiduDisk</a></td> </tr> <tr><td align="left">GEM-Tiny | <a href="configs/gsd-s/semantic-segmentation/gem_sam_tiny_bs32_iter1w_steplr.yaml">config</a></td> <td align="center">S-GSD-20x</td> <td align="center">0.770</td> <td align="center">0.865</td> <td align="center">0.032</td> <td align="center">8.21</td> <td align="center">16.09</td> <td align="center"><a href="https://pan.baidu.com/s/1Wo0iOulD6Qd-RLzB-webcg?pwd=5c8j">BaiduDisk</a></td> </tr> <tr><td align="left">GEM-Base | <a href="configs/gsd-s/semantic-segmentation/gem_sam_base_bs16_iter2w_steplr.yaml">config</a></td> <td align="center"><a href="https://pan.baidu.com/s/1GnMCKw2Rvt9BoUUvSIWPRw?pwd=jpzb">S-GSD-1x</a></td> <td align="center">0.766</td> <td align="center">0.873</td> <td align="center">0.031</td> <td align="center">9.44</td> <td align="center">11.55</td> <td align="center"><a href="https://pan.baidu.com/s/1-UzfguaIPL-y3VJpITQi7w?pwd=ucgt">BaiduDisk</a></td> </tr> <tr><td align="left">GEM-Base | <a href="configs/gsd-s/semantic-segmentation/gem_sam_base_bs16_iter2w_steplr.yaml">config</a></td> <td align="center">S-GSD-5x</td> <td align="center">0.769</td> <td align="center">0.858</td> <td align="center">0.032</td> <td align="center">8.16</td> <td align="center">11.55</td> <td align="center"><a href="https://pan.baidu.com/s/1CBAvC77hZa9tqTo_v3RFCA?pwd=i6ui">BaiduDisk</a></td> </tr> <tr><td align="left">GEM-Base | <a href="configs/gsd-s/semantic-segmentation/gem_sam_base_bs16_iter2w_steplr.yaml">config</a></td> <td align="center">S-GSD-10x</td> <td align="center">0.774</td> <td align="center">0.868</td> <td align="center">0.032</td> <td align="center">8.56</td> <td align="center">11.55</td> <td align="center"><a href="https://pan.baidu.com/s/1VAxVmVXb_zphZng0_FMnVg?pwd=6cv8">BaiduDisk</a></td> </tr> <tr><td align="left">GEM-Base | <a 
href="configs/gsd-s/semantic-segmentation/gem_sam_base_bs16_iter2w_steplr.yaml">config</a></td> <td align="center">S-GSD-20x</td> <td align="center">0.774</td> <td align="center">0.865</td> <td align="center">0.029</td> <td align="center">8.35</td> <td align="center">11.55</td> <td align="center"><a href="https://pan.baidu.com/s/1g3swWzY4C6AYYGq2cZVRVQ?pwd=xj3e">BaiduDisk</a></td> </tr> </tbody></table>

[07/15/2024] All pre-trained models can also be downloaded from Hugging Face.

[07/18/2024] The S-GSD-1x dataset can also be downloaded from Hugging Face.

The download links for S-GSD-5x are below; the dataset is divided into three parts:

Part1: https://pan.baidu.com/s/1CL0x8s1LXdIIoOjcr5wirw?pwd=2ff4

Part2: https://pan.baidu.com/s/18uCDKFmzy7vSmWm1IF4JFQ?pwd=9jqb

Part3: https://pan.baidu.com/s/1Z8pePl9Ps3QtrZB7WQ7aoQ?pwd=8fku

S-GSD-10x and S-GSD-20x are not released because of their large disk footprint. If you would like these two large-scale datasets, feel free to contact me at isjinghao@gmail.com.

<img src="https://github.com/isbrycee/GEM-Glass-Segmentor/blob/main/figures/Qualitative_comparisons.jpg" width="700px">
<img src="https://github.com/isbrycee/GEM-Glass-Segmentor/blob/main/figures/synthetic_Data.jpg" width="700px">

## Getting Started

The pre-trained model checkpoints listed in the tables above can be downloaded from the corresponding links.

If your dataset files are not under this repo, either set `export DETECTRON2_DATASETS=/path/to/your/data` or use a symbolic link (`ln -s`) to link the dataset into this repo before running the commands below.
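For example (the `datasets/` directory and the `GSD-S` name follow the detectron2/MaskDINO convention and this repo's config paths; adjust them to your actual layout):

```bash
# Option 1: point detectron2 at the directory that contains your datasets
export DETECTRON2_DATASETS=/path/to/your/data

# Option 2: symlink your dataset into this repo's datasets/ directory
ln -s /path/to/your/data/GSD-S datasets/GSD-S
```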

### Evaluate our pretrained models
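A typical detectron2-style evaluation command, assuming this repo keeps MaskDINO's `train_net.py` entry point (the config path comes from the table above; the checkpoint path is a placeholder):

```bash
# Evaluate a downloaded GEM-Tiny checkpoint on GSD-S
python train_net.py --eval-only --num-gpus 1 \
  --config-file configs/gsd-s/semantic-segmentation/gem_sam_tiny_bs32_iter1w_steplr.yaml \
  MODEL.WEIGHTS /path/to/gem_tiny.pth
```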

### Train GEM to reproduce results
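Likewise, a sketch of a training launch under the same `train_net.py` assumption (the batch size and iteration schedule come from the config; adjust `--num-gpus` to your hardware and the output path to taste):

```bash
# Train GEM-Base on GSD-S from the released config
python train_net.py --num-gpus 4 \
  --config-file configs/gsd-s/semantic-segmentation/gem_sam_base_bs16_iter2w_steplr.yaml \
  OUTPUT_DIR ./output/gem_base_gsd-s
```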

You can also refer to Getting Started with Detectron2 for full usage.


<a name="CitingMaskDINO"></a>Citing GEM

If you find our work helpful for your research, please consider citing the following BibTeX entry.

@article{hao2024gem,
  title={GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis},
  author={Hao, Jing and Liu, Moyun and Hung, Kuo Feng},
  journal={arXiv preprint arXiv:2401.15282},
  year={2024}
}

If you find the code useful, please also consider the following BibTeX entry.

@misc{li2022mask,
      title={Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation}, 
      author={Feng Li and Hao Zhang and Huaizhe xu and Shilong Liu and Lei Zhang and Lionel M. Ni and Heung-Yeung Shum},
      year={2022},
      eprint={2206.02777},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

## Acknowledgement

Many thanks to these excellent open-source projects: [MaskDINO](https://github.com/IDEA-Research/MaskDINO) and [Segment Anything](https://github.com/facebookresearch/segment-anything).