Awesome

<h1 align="center">Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation</h1> ICCV, 2023 <a href="https://lxtgh.github.io/">Xiangtai Li</a> · <a href="https://yuanhaobo.me/">Haobo Yuan</a> · <a href="https://zhangwenwei.cn/">Wenwei Zhang</a> · <a href="https://sites.google.com/view/guangliangcheng">Guangliang Cheng</a> <a href="https://oceanpang.github.io/">Jiangmiao Pang</a> . <a href="https://www.mmlab-ntu.com/person/ccloy/">Chen Change Loy*</a> <a href='https://arxiv.org/pdf/2303.12782'> <img src='https://img.shields.io/badge/Paper-PDF-green?style=flat&logo=arXiv&logoColor=green' alt='arXiv PDF'> </a> <a href='' style='padding-left: 0.5rem;'> <img src='https://img.shields.io/badge/Project-Page-blue?style=flat&logo=Google%20chrome&logoColor=blue' alt='Project Page'> </a>

Universal Video Segmentation Model For VSS, VPS, and VIS

avatar

News !!

Update Swin Large Config For Tube-Link VPS on VIP-Seg.
Code released!!

[Paper] [CODE]

Features

$\color{#2F6EBA}{Universal\ Video\ Segmentation\ Model}$

A new framework unifies online video segmentation methods and near online video segmentation methods.
A new unified solution for three video segmentation tasks: VSS, VIS, and VPS.

$\color{#2F6EBA}{Explore\ the\ Cross-Tube\ Relation}$

The first video segmentation method that explores the cross-tube relation.
Proposed Tube-wise matching performs better than Frame-wise matching.

$\color{#2F6EBA}{Strong\ Performance}$

Achieves strong performance on VIS, VSS, and VPS datasets (five datasets) in one unified architecture.
Achieves even better performance than those specific architectures.

Visualization Results

[VIS] Youtube-VIS 2019

vis_demo_1

vis_demo_2

</details>

[VPS] VIP-Seg

vps_demo_1

vps_demo_2

</details>

[VSS] VSPW

vss_demo

</details>

[VPS] KITTI-STEP

vps_demo_3

</details>

Citation

If you think both Tube-Link and its codebase are useful for your research, please consider referring Tube-Link:


@inproceedings{li2023tube,
  title={Tube-link: A flexible cross tube baseline for universal video segmentation},
  author={Li, Xiangtai and Yuan, Haobo and Zhang, Wenwei and Cheng, Guangliang and Pang, Jiangmiao and Loy, Chen Change},
  booktitle={ICCV},
  year={2023}
}

@inproceedings{li2022videoknet,
  title={Video k-net: A simple, strong, and unified baseline for video segmentation},
  author={Li, Xiangtai and Zhang, Wenwei and Pang, Jiangmiao and Chen, Kai and Cheng, Guangliang and Tong, Yunhai and Loy, Chen Change},
  booktitle={CVPR},
  year={2022}
}

License

MIT

Awesome

News !!

Features

$\color{#2F6EBA}{Universal\ Video\ Segmentation\ Model}$

$\color{#2F6EBA}{Explore\ the\ Cross-Tube\ Relation}$

$\color{#2F6EBA}{Strong\ Performance}$

Dataset

Install

Training, Evaluation, and Models

Visualization Results

[VIS] Youtube-VIS 2019

[VPS] VIP-Seg

[VSS] VSPW

[VPS] KITTI-STEP

Citation

License