Home

Awesome

LMPred: Predicting Antimicrobial Peptides Using Pre-Trained Language Models and Deep Learning

Now published in Oxford Academic - Bioinformatics Advances

https://academic.oup.com/bioinformaticsadvances/article/2/1/vbac021/6561563

Abstract

Antimicrobial peptides (AMPs) are increasingly being used in the development of new therapeutic drugs, in areas such as cancer therapy and hypertension. Additionally, they are seen as an alternative to antibiotics due to the increasing occurrence of bacterial resistance. Wetlaboratory experimental identification, however, is both time consuming and costly, so in-silico models are now commonly used in order to screen new AMP candidates. This paper proposes a novel approach of creating model inputs; using pre-trained language models to produce contextualized embeddings representing the amino acids within each peptide sequence, before a convolutional neural network is then trained as the classifier. The optimal model was validated on two datasets, being one previously used in AMP prediction research, and an independent dataset, created by this paper. Predictive accuracies of 93.33% and 88.26% were achieved respectively, outperforming all previous state-of-the-art classification models.

Files Uploaded:

Language Models - Citation

External Research:

July 2020: ProtTrans: Towards Cracking the Language of Life’s Code Through Self-Supervised Learning. preprint, Bioinformatics. Elnaggar, A., M. Heinzinger, C. Dallago, G. Rehawi, Y. Wang, L. Jones, T. Gibbs, T. Feher, C. Angerer, M. Steinegger, D. Bhowmik, and B. Rost (2020, July). https://www.biorxiv.org/content/10.1101/2020.07.12.199554v1.full.pdf

Update: July 2021: ProtTrans: Towards Cracking the Language of Life's Code Through Self-Supervised Deep Learning and High Performance Computing Elnaggar, Ahmed and Heinzinger, Michael and Dallago, Christian and Rehawi, Ghalia and Yu, Wang and Jones, Llion and Gibbs, Tom and Feher, Tamas and Angerer, Christoph and Steinegger, Martin and Bhowmik, Debsindhu and Rost, Burkhard https://ieeexplore.ieee.org/abstract/document/9477085

ProtTrans Github: https://github.com/agemagician/ProtTrans