A-Eval [Paper]

<a href="https://arxiv.org/abs/2309.03906"> <img src="https://img.shields.io/badge/eess.IV-2309.03906-b31b1b?logo=arxiv&logoColor=red"> </a>

<div align="center"> <img src="assets/fig_overview.png"> </div>

🌟 Highlights

📚 Datasets

We train models on the official training sets of FLARE22, AMOS, WORD, and TotalSegmentator, and evaluate them on their official <span style="color:red; font-size: larger;">validation sets</span> as well as on BTCV's official training set.

Note: While these datasets do have test sets, FLARE22, AMOS, and BTCV do not make their test labels publicly available. Therefore, for consistent evaluation, we use validation sets instead of test sets in A-Eval, regardless of label availability.

| Dataset | Modality | # Train | # Test | # Organs | Region |
| --- | --- | --- | --- | --- | --- |
| FLARE22 | CT | 50 labeled <br> 2000 unlabeled | 50 | 13 | North American <br> European |
| AMOS | CT & MR | 200 CT <br> 40 MR | 100 CT <br> 20 MR | 15 | Asian |
| WORD | CT | 100 | 20 | 16 | Asian |
| TotalSegmentator | CT | 1082 | 57 | 104 | European |
| BTCV | CT | - | 30 | 13 | North American |
| A-Eval Totals | CT & MR | 1432 labeled CT <br> 2000 unlabeled CT <br> 40 MR | 257 CT <br> 20 MR | 8 | North American <br> European <br> Asian |

To ensure a meaningful and fair comparison across datasets, we evaluate the models' performance based on a set of eight organ classes shared by all five datasets. We unify these labels using an overlapped label system. The corresponding code for label systems and label conversion can be found in the repository: `label_systems.py` and `convert_label_2_overlap_label.py`.

| Organ Class | FLARE22 | AMOS | WORD | TotalSegmentator | BTCV | A-Eval |
| --- | --- | --- | --- | --- | --- | --- |
| Liver | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Kidney Right | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Kidney Left | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Spleen | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Pancreas | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Aorta | ✓ | ✓ | ✗ | ✓ | ✓ | ✗ |
| Inferior Vena Cava | ✓ | ✓ | ✗ | ✓ | ✓ | ✗ |
| Adrenal Gland Right | ✓ | ✓ | ✗ | ✓ | ✓ | ✗ |
| Adrenal Gland Left | ✓ | ✓ | ✗ | ✓ | ✓ | ✗ |
| Gallbladder | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Esophagus | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Stomach | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Duodenum | ✓ | ✓ | ✓ | ✓ | ✗ | ✗ |
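
The label unification described above can be pictured as a lookup-table remap from a dataset's own label IDs to the shared (overlapped) IDs. The snippet below is only a minimal sketch: the integer IDs, the dictionary names, and the helper functions are hypothetical and chosen for illustration. The actual label systems are defined in `label_systems.py`, and the actual conversion is done by `convert_label_2_overlap_label.py`.

```python
import numpy as np

# Hypothetical label maps (organ name -> integer ID), for illustration only.
# The real IDs used by each dataset are defined in label_systems.py.
SOURCE_LABELS = {"liver": 1, "kidney_right": 2, "spleen": 3, "pancreas": 4, "aorta": 5}
OVERLAP_LABELS = {"liver": 1, "kidney_right": 2, "kidney_left": 3, "spleen": 4, "pancreas": 5}


def build_lookup(source, target):
    """Build an array where lut[source_id] gives the overlapped ID (0 = background)."""
    lut = np.zeros(max(source.values()) + 1, dtype=np.uint8)
    for name, src_id in source.items():
        # Organs that are not in the shared label system fall back to background.
        lut[src_id] = target.get(name, 0)
    return lut


def to_overlap(mask, lut):
    """Remap an integer segmentation mask to the overlapped label system."""
    return lut[mask]


lut = build_lookup(SOURCE_LABELS, OVERLAP_LABELS)
mask = np.array([[0, 1, 3], [4, 5, 2]])
converted = to_overlap(mask, lut)
```

In this toy example the aorta (not part of the shared classes) is mapped to background, while the remaining organs receive their shared IDs. The same remap would be applied to both predictions and ground-truth labels before computing metrics.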

🏆 Results

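💡 DSC

| Train/Test | FLARE22 | AMOS CT | WORD | TotalSeg | BTCV | CT Mean | AMOS MR | All Mean |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| FLARE22 w/o PL | 89.20 | 76.53 | 85.94 | 74.06 | 86.11 | 82.37 | 24.77 | 72.77 |
| FLARE22 w/ PL | 91.98 | 87.53 | 87.15 | 85.55 | 87.35 | 87.91 | 42.74 | 80.38 |
| AMOS CT | 89.14 | 93.02 | 89.01 | 86.39 | 86.84 | 88.88 | 70.08 | 85.75 |
| AMOS MR | 61.47 | 73.97 | 45.30 | 48.08 | 77.60 | 61.28 | 91.73 | 66.36 |
| AMOS CT+MR | 89.81 | 93.24 | 89.36 | 88.42 | 87.66 | 89.70 | 92.72 | 90.20 |
| WORD | 86.86 | 87.53 | 90.92 | 80.58 | 84.69 | 86.12 | 27.38 | 76.33 |
| TotalSeg | 90.32 | 89.65 | 86.30 | 95.12 | 87.73 | 89.82 | 38.72 | 81.31 |
| Joint Train | 91.98 | 92.42 | 88.88 | 93.87 | 88.90 | 91.21 | 90.87 | 91.15 |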
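
The two summary columns are not defined explicitly here. The reported numbers are consistent with "CT Mean" being the unweighted average over the five CT test sets and "All Mean" being the unweighted average over all six test sets (the five CT sets plus AMOS MR). The short check below reproduces the "FLARE22 w/o PL" row under that assumption; the variable names are illustrative, and the authors may have averaged unrounded scores instead.

```python
# DSC values copied from the "FLARE22 w/o PL" row of the table above.
ct_scores = {
    "FLARE22": 89.20,
    "AMOS CT": 76.53,
    "WORD": 85.94,
    "TotalSeg": 74.06,
    "BTCV": 86.11,
}
mr_score = 24.77  # AMOS MR

# Assumption: both summary columns are plain unweighted means over test sets.
ct_mean = sum(ct_scores.values()) / len(ct_scores)
all_mean = (sum(ct_scores.values()) + mr_score) / (len(ct_scores) + 1)

print(f"CT Mean:  {ct_mean:.2f}")
print(f"All Mean: {all_mean:.2f}")
```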

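💡 NSD

| Train/Test | FLARE22 | AMOS CT | WORD | TotalSeg | BTCV | CT Mean | AMOS MR | All Mean |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| FLARE22 w/o PL | 90.19 | 80.25 | 90.76 | 76.56 | 89.28 | 85.41 | 23.96 | 75.17 |
| FLARE22 w/ PL | 93.46 | 90.92 | 92.01 | 88.29 | 90.94 | 91.12 | 44.19 | 83.30 |
| AMOS CT | 89.49 | 96.47 | 94.82 | 89.28 | 91.65 | 92.34 | 72.92 | 89.11 |
| AMOS MR | 59.97 | 48.69 | 43.93 | 48.09 | 61.61 | 52.26 | 95.22 | 59.42 |
| AMOS CT+MR | 90.46 | 96.80 | 95.18 | 91.36 | 92.53 | 93.27 | 96.58 | 93.82 |
| WORD | 88.73 | 92.34 | 95.75 | 83.47 | 88.74 | 89.81 | 30.75 | 79.96 |
| TotalSeg | 91.96 | 94.02 | 92.46 | 97.33 | 92.72 | 93.70 | 40.44 | 84.82 |
| Joint Train | 93.58 | 96.46 | 95.28 | 96.10 | 93.80 | 95.04 | 95.28 | 95.08 |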

💡 Visualization

<div align="center"> <img src="assets/visualization.png"> </div>

🎫 License

This project is released under the Apache 2.0 license.

🙏 Acknowledgement

👋 Hiring & Global Collaboration