Awesome-Multimodal-Applications-In-Medical-Imaging

This repository collects resources on applications of multimodal learning in medical imaging, including papers on <b>large language models (LLMs)</b>. Papers involving LLMs are shown in bold.

Contributing

Feel free to open a pull request or email me to add links or to discuss this area. Please use the following Markdown format for new entries:

- [**Name of Conference or Journal + Year**] Paper Name. [[pdf]](link) [[code]](link)
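As a convenience, the entry format above can be generated with a small helper. This is only a sketch: the function name `format_entry`, its parameters, and the `example.org` URLs are hypothetical and not part of this repository, and omitting the `[[code]]` link when no code is available is an assumption.

```python
from typing import Optional


def format_entry(venue: str, year: int, title: str,
                 pdf_url: str, code_url: Optional[str] = None) -> str:
    """Build one list entry in the Markdown format shown above.

    Hypothetical helper for illustration; not part of this repository.
    """
    # Drop a trailing period so the entry does not end in "..".
    clean_title = title.strip().rstrip(".")
    line = f"- [**{venue} {year}**] {clean_title}. [[pdf]]({pdf_url})"
    # Assumption: the code link is left out when no code is available.
    if code_url:
        line += f" [[code]]({code_url})"
    return line


if __name__ == "__main__":
    # Placeholder values; replace with the real venue, title, and links.
    print(format_entry("EMNLP", 2024, "Paper Name",
                       "https://example.org/paper.pdf",
                       "https://example.org/code"))
```

Paste the printed line into the matching year section of the relevant list.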

News

Citation

@article{xia2024cares,
  title={CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models},
  author={Xia, Peng and Chen, Ze and Tian, Juanxi and Gong, Yangrui and Hou, Ruibo and Xu, Yue and Wu, Zhenbang and Fan, Zhiyuan and Zhou, Yiyang and Zhu, Kangyu and others},
  journal={arXiv preprint arXiv:2406.06007},
  year={2024}
}

@inproceedings{xia2024rule,
  title={RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models},
  author={Xia, Peng and Zhu, Kangyu and Li, Haoran and Zhu, Hongtu and Li, Yun and Li, Gang and Zhang, Linjun and Yao, Huaxiu},
  booktitle={Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing},
  pages={1081--1093},
  year={2024}
}

@article{xia2024mmed,
  title={MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models},
  author={Xia, Peng and Zhu, Kangyu and Li, Haoran and Wang, Tianze and Shi, Weijia and Wang, Sheng and Zhang, Linjun and Zou, James and Yao, Huaxiu},
  journal={arXiv preprint arXiv:2410.13085},
  year={2024}
}

Overview


Data Source

Image-Caption Datasets

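| dataset | domain | image | text | source | language |
| --- | --- | --- | --- | --- | --- |
| ROCO | multiple | 87K | 87K | research papers | En |
| MedICaT | multiple | 217K | 217K | research papers | En |
| PMC-OA | multiple | 1.6M | 1.6M | research papers | En |
| ChiMed-VL | multiple | 580K | 580K | research papers | En/zh |
| FFA-IR | fundus | 1M | 10K | medical reports | En/zh |
| PadChest | cxr | 160K | 109K | medical reports | Sp |
| MIMIC-CXR | cxr | 377K | 227K | medical reports | En |
| OpenPath | histology | 208K | 208K | social media | En |
| Quilt-1M | histology | 1M | 1M | research papers<br>social media | En |
| Harvard-FairVLMed | fundus | 10K | 10K | medical reports | En |
| MedTrinity-25M | multiple | 25M | 25M | research papers<br>social media | En |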

Visual Question Answering Datasets

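| dataset | domain | image | QA Items | language |
| --- | --- | --- | --- | --- |
| VQA-RAD | radiology | 315 | 3k | En |
| SLAKE | radiology | 642 | 14k | En/zh |
| Path-VQA | histology | 5k | 32k | En |
| VQA-Med | radiology | 4.5k | 5.5k | En |
| PMC-VQA | multiple | 149k | 227k | En |
| OmniMedVQA | multiple | 118k | 128k | En |
| ProbMed | radiology | 6k | 57k | En |
| PubMedVision | multiple | 914k | 1.3M | En |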

Survey


Medical Report Generation

2018

2019

2020

2021

2022

2023

2024


Medical Visual Question Answering

2020

2021

2022

2023

2024


Medical Vision-Language Model

2022

2023

2024


🎉 Contribution

Contributing to this paper list

⭐ Join us in improving this repository! If you know of any important works we've missed, please contribute. Your efforts are highly valued!

Contributors

<a href="https://github.com/richard-peng-xia/awesome-multimodal-in-medical-imaging/graphs/contributors"> <img src="https://contrib.rocks/image?repo=richard-peng-xia/awesome-multimodal-in-medical-imaging" /> </a>

✨ Star History Chart