Home

Awesome

Idpgan

This is the idpGAN repository. IdpGAN is a machine-learning based conformational ensemble generator for coarse grained (CG) models of intrinsically disordered proteins (IDPs).

Details of idpGAN are described in https://doi.org/10.1038/s41467-023-36443-x.

How to run

To use idpGAN (implemented in PyTorch), you can run a Jupyter notebook illustrating idpGAN functionalities.

The notebook shows how to use the generator neural network of idpGAN to generate 3D structures of CG IDPs.

There are two ways in which you can run the notebook.

Colab version

This is the easiest way. This option allows you to run the notebook remotely. Just reach the notebook at: idpGAN Colab notebook.

Running the initial cells will automatically install all the dependencies (please note that the installation may require a few minutes).

NOTE: make sure to use a GPU runtime to largely speed up the idpGAN conformation generation process. If you use the default runtime (running on CPU), the process could take several minutes. To use a GPU runtime:

Running locally

You can also run the notebook on your machine. What you need to do is:

Datasets and files for the idpGAN version trained on a CG-based protein model

We trained an idpGAN version on conformations extracted from MD simulations performed using a CG-based protein model developed in our research group. In the data directory of this repository, we have the following files with information on the training, validation and test sets of this version of idpGAN:

We also have the following files, that allow you to run a demo of the generatative model on a small dataset:

Files for the idpGAN version trained on a ABSINTH implicit solvent simulations

We also trained an idpGAN version on Cα traces extracted from all-atom simulations performed using the ABSINTH implicit solvent model, which was found to reproduce with good accuracy the experimental behavior of several IDPs. ABSINTH simulations can be performed using the CAMPARI software package. In the data directory of this repository, we have the following files for this version of idpGAN:

System requirements

You can run the idpGAN notebook remotely on Colab. You do not need to install any software on your system, you only need to first login on your Google account.

Otherwise you can run the idpGAN notebook (and library) locally on your machine. Any Python (version >= 3.8) with the above requirements should work well on all major operating systems (Windows, Mac, Linux). We developed and tested the code on Linux, Python (3.8.10) and PyTorch (1.7.1).

In all cases, if you plan to generate conformational ensembles with large number of snapshots (> 5000), we suggest to use PyTorch GPU support to largely speed up computational times.

References

Contact

FeigLab, mfeiglab@gmail.com

Michigan State University