LHU-Net: A Light Hybrid U-Net for Cost-Efficient High-Performance Volumetric Medical Image Segmentation
This repository contains the official implementation of LHU-Net. Our paper, "LHU-Net: A Light Hybrid U-Net for Cost-Efficient High-Performance Volumetric Medical Image Segmentation," addresses the growing complexity in medical image segmentation models, focusing on balancing computational efficiency with segmentation accuracy.
Yousef Sadegheih, Afshin Bozorgpour, Pratibha Kumari, Reza Azad, and Dorit Merhof
Abstract
With the rise of Transformer architectures in medical image analysis, particularly in medical image segmentation, a multitude of hybrid models have emerged that merge the strengths of Convolutional Neural Networks (CNNs) and Transformers. These hybrid models have achieved notable success by significantly improving segmentation accuracy. Yet this progress often comes at the cost of increased model complexity, both in parameters and computational demand. Moreover, many of these models fail to consider the crucial interplay between spatial and channel features, which could further refine and improve segmentation outcomes. To address this, we introduce LHU-Net, a Light Hybrid U-Net architecture optimized for volumetric medical image segmentation. LHU-Net is meticulously designed to prioritize spatial feature analysis in its initial layers before shifting focus to channel-based features in its deeper layers, ensuring a comprehensive feature extraction process. Rigorous evaluation across five benchmark datasets (Synapse, LA, Pancreas, ACDC, and BraTS 2018) underscores LHU-Net's superior performance, showcasing its dual capacity for efficiency and accuracy. Notably, LHU-Net sets new performance benchmarks, such as attaining a Dice score of 92.65 on the ACDC dataset while reducing parameters by 85% and quartering the computational load compared to existing state-of-the-art models. Achieved without any reliance on pre-training, additional data, or model ensembling, LHU-Net's effectiveness is further evidenced by its state-of-the-art performance across all evaluated datasets using fewer than 11 million parameters. This achievement highlights that balancing computational efficiency with high accuracy in medical image segmentation is feasible.
Key Contributions
- Efficient Hybrid Attention Selection: Introduces a strategic deployment of specialized attention mechanisms within Transformers, enabling nuanced feature extraction tailored to the demands of medical image segmentation.
- Benchmark Setting Efficiency: Achieves high-performance segmentation with significantly reduced computational resources, demonstrating an optimal balance between model complexity and computational efficiency.
- Versatile Superiority: Showcases unparalleled versatility and state-of-the-art performance across multiple datasets, highlighting its robustness and potential as a universal solution for medical image segmentation.
Model Architecture
LHU-Net leverages a hierarchical U-Net encoder-decoder structure optimized for 3D medical image segmentation. The architecture integrates convolutional-based blocks with hybrid attention mechanisms, capturing both local features and non-local dependencies effectively.
<p align="center"> <img width="900" alt="LHU-Net Architecture" src="figures/network_architecture.jpg"> </p> <!-- ![LHU-Net Architecture](figures/network_architecture.jpg) -->For a detailed explanation of each component, please refer to our paper.
Datasets
Our experiments were conducted on five benchmark datasets:
- Synapse
- ACDC
- LA
- NIH Pancreas dataset (CT-82)
- BraTS Challenge 2018
LHU-Net demonstrated exceptional performance across these datasets, significantly outperforming existing state-of-the-art models in terms of efficiency and accuracy.
Results
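The tables below report performance primarily as Dice scores (DSC). For reference, here is a minimal NumPy sketch of the metric (illustrative only; not the repository's evaluation code):

```python
import numpy as np

def dice_score(pred: np.ndarray, target: np.ndarray, eps: float = 1e-8) -> float:
    """Dice = 2|A ∩ B| / (|A| + |B|) for binary masks."""
    pred, target = pred.astype(bool), target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    return 2.0 * intersection / (pred.sum() + target.sum() + eps)

# Two overlapping toy 3D masks: Dice = 2*16 / (32 + 32) = 0.5
a = np.zeros((4, 4, 4), dtype=bool); a[:2] = True
b = np.zeros((4, 4, 4), dtype=bool); b[1:3] = True
print(round(dice_score(a, b), 4))  # 0.5
```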
<p align="center"> <img width="700" alt="synapse_table" src="figures/synapse_table.png"> </p> <p align="center"> <img width="700" alt="ACDC_table" src="figures/acdc_table.png"> </p> <!-- ![synapse_table](figures/synapse_table.png) ![ACDC_table](figures/acdc_table.png) -->Getting Started
This section provides instructions on how to run LHU-Net for your segmentation tasks.
Requirements
- Ubuntu 16.04 or higher
- CUDA 12.x
- Python v3.7 or higher
- PyTorch v2.x
- PyTorch Lightning v2.x
- Hardware:
  - GPU with 8 GB of memory or more
  - For our experiments, we used one GPU (A100, 80 GB)
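You can verify that your environment meets these requirements with a quick check (standard PyTorch calls only):

```python
import torch

print(torch.__version__)          # expect 2.x
print(torch.cuda.is_available())  # requires a CUDA 12.x build and a matching driver
if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    print(f"{props.name}: {props.total_memory / 1024**3:.0f} GB")  # >= 8 GB recommended
```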
Install the required packages:

```bash
pip install -r requirements.txt
```
Install the 3D deformable convolutions:

```bash
cd dcn/
pip install .
```
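Afterwards you can sanity-check the build. Note that the import name below is an assumption based on the `dcn/` directory and may differ for your install:

```python
import torch
import dcn  # assumed module name from the dcn/ folder; adjust if the package installs differently

print(torch.version.cuda)  # the extension must be compiled against this CUDA version
```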
Training, Inference, Pre-trained Weights, and Visualizations
Instructions on how to train LHU-Net and perform inference on your dataset:
For detailed instructions for Synapse and ACDC datasets, please refer to the README.
For detailed instructions for BraTS 2018, LA, and Pancreas datasets, please refer to the README.
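As a rough illustration of inspecting a downloaded checkpoint, here is a short sketch. The path and key layout are assumptions (PyTorch Lightning checkpoints conventionally nest weights under `state_dict`); consult the dataset-specific READMEs for the actual format:

```python
import torch

# Hypothetical checkpoint path, for illustration only.
ckpt = torch.load("checkpoints/lhunet_acdc.ckpt", map_location="cpu")
state_dict = ckpt.get("state_dict", ckpt)  # fall back if it is a bare state_dict
print(f"{len(state_dict)} parameter tensors")
```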
Acknowledgments
This repository is built upon nnFormer, nnU-Net, UNETR++, MCF, D3D, and D-LKA. We thank the authors for their code repositories.
Citation
If you find this work useful for your research, please cite:
```bibtex
@article{sadegheih2024lhunet,
  title={LHU-Net: A Light Hybrid U-Net for Cost-Efficient High-Performance Volumetric Medical Image Segmentation},
  author={Sadegheih, Yousef and Bozorgpour, Afshin and Kumari, Pratibha and Azad, Reza and Merhof, Dorit},
  year={2024}
}
```