Home

Awesome

<div align="center"> <samp> <h2> Revisiting Distillation for Continual Learning on Visual Question Localized-Answering in Robotic Surgery </h1> <h4> Long Bai*, Mobarakol Islam*, Hongliang Ren </h3> <h3> Medical Image Computing and Computer Assisted Intervention (MICCAI) 2023 </h2> </samp>
[arXiv][Paper]

</div>

If you find our code or paper useful, please cite the paper as

@inproceedings{bai2023revisiting,
  title={Revisiting Distillation for Continual Learning on Visual Question Localized-Answering in Robotic Surgery},
  author={Bai, Long and Islam, Mobarakol and Ren, Hongliang},
  booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
  pages={68--78},
  year={2023},
  organization={Springer}
}

Abstract

The visual-question answering (VQA) system can serve as a knowledgeable assistant in surgical education. Nevertheless, various similar surgical instruments and actions can further confuse learners. To tackle this, we introduce an extended visual-question localized-answering (VQLA) system which can provide the answer with localization, to further help the learners understand the surgical scene. However, deep neural networks (DNNs) suffer from catastrophic forgetting when learning new knowledge. Specifically, when DNNs learn on incremental classes or tasks, their performance on old tasks drops dramatically. Furthermore, due to medical data privacy and licensing issues, it is often difficult to access old data when updating continual learning (CL) models. Therefore, we develop a non-exemplar continual surgical VQLA framework, to explore and balance the rigidity-plasticity trade-off of DNNs in a sequential learning paradigm. We revisit the distillation loss in CL tasks, and propose rigidity-plasticity-aware distillation (RP-Dist) and self-calibrated heterogeneous distillation (SH-Dist) to preserve the old knowledge. The weight aligning (WA) technique is also integrated to adjust the weight bias between old and new tasks. We further establish a CL framework on three public surgical datasets in the context of surgical settings that consist of overlapping classes between old and new surgical VQLA tasks. With extensive experiments, we demonstrate that our proposed method excellently reconciles learning and forgetting on the continual surgical VQLA over conventional CL methods.


Environment

Directory Setup

<!---------------------------------------------------------------------------------------------------------------->

In this project, we implement our method using the Pytorch library, the structure is as follows:


Run training