Home

Awesome

A kinetic ensemble of the Alzheimer's Aβ peptide

This repository contains the full code and some small example data to reproduce our results on the kinetic ensemble of amyloid-β 42. See also the original implementation of the constrained VAMPNets.

Reproducibility information

The analysis was performed on a single Google compute engine instance using 12 vCPUs, 78 GB memory and 1x NVIDIA Tesla V100 GPU. The instance used the c3-deeplearning-tf-ent-2-1-cu100-20200131 image with CUDA 10.0 and tensorflow 2.1, based on Debian 10. The original environment for this machine is provided in env-tf-original.yml. Training a single network takes approximately 2 hours on this architecture.

Dataset

The full dataset can be found on Zenodo. The size of the full dataset is around 45 GB and includes the following directories:

Notebooks

The easiest way to try out the notebooks is by using conda. We include several environment specifications: env-tf.yml specifies the environment to be used for running the neural network, env-analysis.yml specifies the packages needed for the analysis and plotting of the results, and env-msmbuilder.yml specifies the environment for use with the msm-classic.ipynb notebook for the conventional Markov models. Because the installation of tensorflow is mostly highly specific to your machine, we strongly recommend following the official installation instructions. To create the environments, run conda env create -f env-analysis.yml and activate the new environment with conda activate analysis. You will also need to install a tensorflow 2.* compatible version of vamptools for training.