Awesome
U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation
:pushpin: This is an official PyTorch implementation of U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation
<!-- [[`Project Page`](https://yes-u-kan.github.io/)] [[`arXiv`](https://arxiv.org/abs/2406.02918)] [[`BibTeX`](#citation)] --!> <p align="center"> <img src="./assets/logo_1.png" alt="" width="120" height="120"> </p> <!-- > [**U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation**](https://arxiv.org/abs/2406.02918)<br> > [Chenxin Li](https://xggnet.github.io/)<sup>1\*</sup>, [Xinyu Liu](https://xinyuliu-jeffrey.github.io/)<sup>1\*</sup>, [Wuyang Li](https://wymancv.github.io/wuyang.github.io/)<sup>1\*</sup>, [Cheng Wang](https://scholar.google.com/citations?user=AM7gvyUAAAAJ&hl=en)<sup>1\*</sup>, [Hengyu Liu](https://liuhengyu321.github.io/)<sup>1</sup>, [Yifan Liu](https://yifliu3.github.io/)<sup>1</sup>, [Chen Zhen](https://franciszchen.github.io/)<sup>2</sup>, [Yixuan Yuan](https://www.ee.cuhk.edu.hk/~yxyuan/people/people.htm)<sup>1β</sup><br> <sup>1</sup>The Chinese Univerisity of Hong Kong, <sup>2</sup>Centre for Artificial Intelligence and Robotics, Hong Kong --!> We explore the untapped potential of Kolmogorov-Anold Network (aka. KAN) in improving backbones for vision tasks. We investigate, modify and re-design the established U-Net pipeline by integrating the dedicated KAN layers on the tokenized intermediate representation, termed U-KAN. Rigorous medical image segmentation benchmarks verify the superiority of U-KAN by higher accuracy even with less computation cost. We further delved into the potential of U-KAN as an alternative U-Net noise predictor in diffusion models, demonstrating its applicability in generating task-oriented model architectures. These endeavours unveil valuable insights and sheds light on the prospect that with U-KAN, you can make strong backbone for medical image segmentation and generation. <div align="center"> <img width="100%" alt="UKAN overview" src="assets/framework-1.jpg"/> </div> ## π°News **[2024.6]** Some modifications are done in Seg_UKAN for better performance reproduction. The previous code can be quickly updated by replacing the contents of train.py and archs.py with the new ones. **[2024.6]** Model checkpoints and training logs are released! **[2024.6]** Code and paper of U-KAN are released! ## π‘Key Features - The first effort to incorporate the advantage of emerging KAN to improve established U-Net pipeline to be more **accurate, efficient and interpretable**. - A Segmentation U-KAN with **tokenized KAN block to effectively steer the KAN operators** to be compatible with the exiting convolution-based designs. - A Diffusion U-KAN as an **improved noise predictor** demonstrates its potential in backboning generative tasks and broader vision settings. ## π Setup ```bash git clone https://github.com/CUHK-AIM-Group/U-KAN.git cd U-KAN conda create -n ukan python=3.10 conda activate ukan cd Seg_UKAN && pip install -r requirements.txt ``` **Tips A**: We test the framework using pytorch=1.13.0, and the CUDA compile version=11.6. Other versions should be also fine but not totally ensured. ## πData Preparation **BUSI**: The dataset can be found [here](https://www.kaggle.com/datasets/aryashah2k/breast-ultrasound-images-dataset). **GLAS**: The dataset can be found [here](https://websignon.warwick.ac.uk/origin/slogin?shire=https%3A%2F%2Fwarwick.ac.uk%2Fsitebuilder2%2Fshire-read&providerId=urn%3Awarwick.ac.uk%3Asitebuilder2%3Aread%3Aservice&target=https%3A%2F%2Fwarwick.ac.uk%2Ffac%2Fcross_fac%2Ftia%2Fdata%2Fglascontest&status=notloggedin). <!-- You can directly use the [processed GLAS data]() without further data processing. -->CVC-ClinicDB: The dataset can be found here.
<!-- You can directly use the [processed CVC-ClinicDB data]() without further data processing. -->We also provide all the pre-processed dataset without requiring any further data processing. You can directly download and put them into the data dir.
The resulted file structure is as follows.
Seg_UKAN
βββ inputs
β βββ busi
β βββ images
β βββ malignant (1).png
| βββ ...
| βββ masks
β βββ 0
β βββ malignant (1)_mask.png
| βββ ...
β βββ GLAS
β βββ images
β βββ 0.png
| βββ ...
| βββ masks
β βββ 0
β βββ 0.png
| βββ ...
β βββ CVC-ClinicDB
β βββ images
β βββ 0.png
| βββ ...
| βββ masks
β βββ 0
β βββ 0.png
| βββ ...
πEvaluating Segmentation U-KAN
You can directly evaluate U-KAN from the checkpoint model. Here is an example for quick usage for using our pre-trained models in Segmentation Model Zoo:
- Download the pre-trained weights and put them to
{args.output_dir}/{args.name}/model.pth
- Run the following scripts to
cd Seg_UKAN
python val.py --name ${dataset}_UKAN --output_dir [YOUR_OUTPUT_DIR]
β³Training Segmentation U-KAN
You can simply train U-KAN on a single GPU by specifing the dataset name --dataset
and input size --input_size
.
cd Seg_UKAN
python train.py --arch UKAN --dataset {dataset} --input_w {input_size} --input_h {input_size} --name {dataset}_UKAN --data_dir [YOUR_DATA_DIR]
For example, train U-KAN with the resolution of 256x256 with a single GPU on the BUSI dataset in the inputs
dir:
cd Seg_UKAN
python train.py --arch UKAN --dataset busi --input_w 256 --input_h 256 --name busi_UKAN --data_dir ./inputs
Please see Seg_UKAN/scripts.sh for more details. Note that the resolution of glas is 512x512, differing with other datasets (256x256).
πͺSegmentation Model Zoo
We provide all the pre-trained model checkpoints Here is an overview of the released performance&checkpoints. Note that results on a single run and the reported average results in the paper differ.
Method | Dataset | IoU | F1 | Checkpoints |
---|---|---|---|---|
Seg U-KAN | BUSI | 65.26 | 78.75 | Link |
Seg U-KAN | GLAS | 87.51 | 93.33 | Link |
Seg U-KAN | CVC-ClinicDB | 85.61 | 92.19 | Link |
The parameter ``--no_kan'' denotes the baseline model that is replaced the KAN layers with MLP layers. Please note: it is reasonable to find occasional inconsistencies in performance, and the average results over multiple runs can reveal a more obvious trend.
Method | Layer Type | IoU | F1 | Checkpoints |
---|---|---|---|---|
Seg U-KAN (--no_kan) | MLP Layer | 63.49 | 77.07 | Link |
Seg U-KAN | KAN Layer | 65.26 | 78.75 | Link |
πMedical Image Generation with Diffusion U-KAN
Please refer to Diffusion_UKAN
πTODO List
- Release code for Seg U-KAN.
- Release code for Diffusion U-KAN.
- Upload the pretrained checkpoints.
πAcknowledgements
Greatly appreciate the tremendous effort for the following projects!
<!-- ## πCitation If you find this work helpful for your project,please consider citing the following paper: ``` @article{li2024ukan, title={U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation}, author={Li, Chenxin and Liu, Xinyu and Li, Wuyang and Wang, Cheng and Liu, Hengyu and Yuan, Yixuan}, journal={arXiv preprint arXiv:2406.02918}, year={2024} ''' } --!>