Awesome
Data-Balanced Transformer for Accelerated LNPs Screening in mRNA Delivery
TransLNP
TransMA is an explainable LNPs transfection efficiency prediction model, which adopts a multi-modal molecular structure fusion architecture.
An overview of TransMA can be seen below:
Getting Started
Installation
Set up conda environment and clone the github repo. Uni-Core is needed, please install it first. unicore-0.0.1+ CU117torch2.0.0-CP39-CP39-linux_x86_64-whl
is recommended.
# create a new environment
$ conda create --name TransMA python=3.9 -y
$ conda activate TransMA
# install requirements
$ pip install torch==2.0.0 torchvision==0.15.1 torchaudio==2.0.1
$ pip install -r requirements.txt
#Download pretrained weights and finetune weights
$ git clone https://github.com/wklix/TransMA.git
$ cd ./TransMA
$ wget https://github.com/wklix/TransLNP/releases/download/v1.0/mol_pre_no_h_220816.pt
$ mv *.pt weights/
# The trained models using scaffold and cliff data splitting methods from the Hela and RAW 264.7 cell line datasets are available at `https://github.com/wklix/TransMA/releases/download/v1.0/`.
#For example:
$ wget https://github.com/wklix/TransMA/releases/download/v1.0/raw_cliff.zip
Training
The LNPs transfection efficiency dataset which from AGILE is derived from Hela and RAW 264.7 cell lines. You can choose between two data splitting methods: cliff and scaffold. The corresponding data is located in ./dataset
.
# ./dataset/Hela/cliff/train.csv is the path of Hela cell lines dataset under cliff data splitting method.
$ python train_model.py
Test
Run test.py
to observe the predictive results on the test set. We provide models trained under cliff and scaffold data splitting in the Hela and RAW 264.7 cell line datasets. The models can be found at https://github.com/wklix/TransMA/releases/download/v1.0/
.
$ python test.py