Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2

🚨 News

The paper is available on arXiv: https://arxiv.org/abs/2408.01648

🧐 Installation & Usage

Open the provided .ipynb notebook and run it directly.
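For readers who want the same flow outside the notebook, here is a minimal sketch of first-frame prompting followed by propagation with the SAM 2 video predictor. It is not taken from this repository's notebook. It follows the public SAM 2 example, and it assumes the `sam2` package, `torch`, and `numpy` are installed. The names `sorted_frame_names` and `segment_tool` are hypothetical, the config and checkpoint paths are placeholders you must supply, and the assumption that frames are stored as JPEG files with integer names (for example `00000.jpg`) reflects how the SAM 2 frame loader is understood to work.

```python
import os


def sorted_frame_names(frame_dir):
    """Return the JPEG frame names in frame_dir, ordered by integer file stem."""
    names = [
        n for n in os.listdir(frame_dir)
        if os.path.splitext(n)[1].lower() in (".jpg", ".jpeg")
    ]
    # Assumption: each frame is named after its frame index, e.g. 00000.jpg.
    return sorted(names, key=lambda n: int(os.path.splitext(n)[0]))


def segment_tool(frame_dir, points, labels, model_cfg, checkpoint, obj_id=1):
    """Prompt the first frame with clicks, then propagate the mask through the video.

    points: list of [x, y] pixel coordinates on the first frame.
    labels: list of 1 (foreground click) or 0 (background click), one per point.
    model_cfg, checkpoint: placeholder paths to a SAM 2 config and weights file.
    """
    # Imported lazily so the helper above works without torch or sam2 installed.
    import numpy as np
    import torch
    from sam2.build_sam import build_sam2_video_predictor

    predictor = build_sam2_video_predictor(model_cfg, checkpoint)
    masks = {}
    with torch.inference_mode():
        state = predictor.init_state(video_path=frame_dir)
        # Newer SAM 2 releases also expose add_new_points_or_box.
        predictor.add_new_points(
            inference_state=state,
            frame_idx=0,
            obj_id=obj_id,
            points=np.array(points, dtype=np.float32),
            labels=np.array(labels, dtype=np.int32),
        )
        for frame_idx, obj_ids, mask_logits in predictor.propagate_in_video(state):
            # Threshold the logits of the first (only) object into a binary mask.
            masks[frame_idx] = (mask_logits[0] > 0.0).cpu().numpy()
    return masks
```

A call might look like `segment_tool("frames/", [[300, 200]], [1], model_cfg, checkpoint)`, where the click coordinates are made up for illustration. The predictor is built on a CUDA device by default, so a GPU is assumed unless you pass a different device to `build_sam2_video_predictor`.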

🎯 Example Cases

EndoNeRF Dataset

| Prompt (first frame) | Video segmentation |
| --- | --- |
| <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/cutting_prompt.png" width="360" alt="Result"/></div> | <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/cutting_output.gif" width="360" alt="Result"/></div> |
| <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/pulling_prompt.png" width="360" alt="Result"/></div> | <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/pulling_output.gif" width="360" alt="Result"/></div> |

EndoVis'17 Segmentation Dataset

| Prompt (first frame) | Video segmentation |
| --- | --- |
| <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/endovis17_1_prompt.png" width="360" alt="Result"/></div> | <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/endovis17_1.gif" width="360" alt="Result"/></div> |
| <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/endovis17_3_prompt.png" width="360" alt="Result"/></div> | <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/endovis17_3.gif" width="360" alt="Result"/></div> |

SurgToolLoc Dataset

| Prompt (first frame) | Video segmentation |
| --- | --- |
| <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/surgtoolloc22_prompt.png" width="360" alt="Result"/></div> | <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/surgtoolloc22.gif" width="360" alt="Result"/></div> |

Cochlear Implant Dataset (we focus mainly on the insertion and drilling tools)

| Prompt (first frame) | Video segmentation |
| --- | --- |
| <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/driling_vumc_prompt.png" width="360" alt="Result"/></div> | <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/driling_vumc.gif" width="360" alt="Result"/></div> |
| <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/insertion_vumc_prompt.png" width="360" alt="Result"/></div> | <div align=center><img src="https://github.com/AngeLouCN/SAM-2_Surgical_Video/blob/main/result/insertion_vumc.gif" width="360" alt="Result"/></div> |

📝 Cite

@misc{lou2024zeroshotsurgicaltoolsegmentation,
     title={Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2}, 
     author={Ange Lou and Yamin Li and Yike Zhang and Robert F. Labadie and Jack Noble},
     year={2024},
     eprint={2408.01648},
     archivePrefix={arXiv},
     primaryClass={eess.IV},
     url={https://arxiv.org/abs/2408.01648}, 
}