Home

Awesome

<h1 align="center"> <img align="center" width="450" src="./images/logo.png" alt="..."> </h1> <h4 align="center">Improving Medical Entity Linking with Semantic Type Prediction</h4> <p align="center"> <a href="https://arxiv.org/abs/2005.00460"><img src="http://img.shields.io/badge/Paper-PDF-red.svg"></a> <a href="https://medtype.github.io"><img src="http://img.shields.io/badge/Demo-Live-green.svg"></a> <a href="https://github.com/svjan5/medtype/blob/master/LICENSE"> <img src="https://img.shields.io/badge/License-Apache%202.0-blue.svg"> </a> </p> <h2 align="center"> What is MedType? </h2>

MedType is a BERT-based entity disambiguation module which can be incorporated with an any existing medical entity linker for enhancing its performance. For a given input text, MedType takes in the set of identified mentions along with their list of candidate concepts as input. Then, for each mention MedType predicts its semantic type based on its context in the text. The identified semantic type is utilized to disambiguate extracted mentions by filtering the candidate concepts. The figure below summarizes the entire process. The results demonstrate that MedType achieves state-of-the-art performance for medical entity linking task. Please refer to the paper for more details.

<img align="center" src="./images/overview.png" alt="..."> <h1 align="center"> Contents </h1>

We make the following resources available in this repository:

<h2 align="center"> Datasets </h2>

We present two new, automatically-created datasets (available on Google Drive):

Datasets statistics:

Datasets#Docs#Sents#Mentions#Unq Concepts
NCBI7927,6456,8171,638
Bio CDR1,50014,16628,5599,149
Sharecorpus43127,24617,8091,719
MedMentions4,39242,602352,49634,724
WikiMed393,61811,331,3211,067,08357,739
PubMedDS13,197,430127,670,59057,943,35444,881

Formatting information:

<h2 align="center"> Citation </h2>

Please consider citing our paper if you use this code in your work.

@ARTICLE{medtype2020,
       author = {{Vashishth}, Shikhar and {Joshi}, Rishabh and {Newman-Griffis}, Denis and
         {Dutt}, Ritam and {Rose}, Carolyn},
        title = "{MedType: Improving Medical Entity Linking with Semantic Type Prediction}",
      journal = {arXiv e-prints},
     keywords = {Computer Science - Computation and Language},
         year = 2020,
        month = may,
          eid = {arXiv:2005.00460},
        pages = {arXiv:2005.00460},
archivePrefix = {arXiv},
       eprint = {2005.00460},
 primaryClass = {cs.CL},
       adsurl = {https://ui.adsabs.harvard.edu/abs/2020arXiv200500460V},
      adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}

For any clarification, comments, or suggestions please create an issue or contact Shikhar.

Acknowledgements:

This work was funded in part by NSF grants IIS 1917668 IIS 1822831, Dow Chemical and UPMC Enterprises/Abridge, and the National Library of Medicine of the National Institutes of Health under award number T15 LM007059.