Awesome
<span style="font-size:larger;">DeePMD-kit Manual</span>
Table of contents
About DeePMD-kit
DeePMD-kit is a package written in Python/C++, designed to minimize the effort required to build deep learning-based model of interatomic potential energy and force field and to perform molecular dynamics (MD). This brings new hopes to addressing the accuracy-versus-efficiency dilemma in molecular simulations. Applications of DeePMD-kit span from finite molecules to extended systems and from metallic systems to chemically bonded systems.
For more information, check the documentation.
Highlights in DeePMD-kit v2.0
- Model compression. Accelerate the efficiency of model inference 4-15 times.
- New descriptors. Including
se_e2_r
andse_e3
. - Hybridization of descriptors. Hybrid descriptor constructed from the concatenation of several descriptors.
- Atom type embedding. Enable atom-type embedding to decline training complexity and refine performance.
- Training and inference of the dipole (vector) and polarizability (matrix).
- Split of training and validation dataset.
- Optimized training on GPUs.
Highlighted features
- interfaced with TensorFlow, one of the most popular deep learning frameworks, making the training process highly automatic and efficient, in addition, Tensorboard can be used to visualize training procedures.
- interfaced with high-performance classical MD and quantum (path-integral) MD packages, i.e., LAMMPS and i-PI, respectively.
- implements the Deep Potential series models, which have been successfully applied to finite and extended systems including organic molecules, metals, semiconductors, insulators, etc.
- implements MPI and GPU supports, making it highly efficient for high-performance parallel and distributed computing.
- highly modularized, easy to adapt to different descriptors for deep learning-based potential energy models.
License and credits
The project DeePMD-kit is licensed under GNU LGPLv3.0. If you use this code in any future publications, please cite the following publications for general purpose:
- Han Wang, Linfeng Zhang, Jiequn Han, and Weinan E. "DeePMD-kit: A deep learning package for many-body potential energy representation and molecular dynamics." Computer Physics Communications 228 (2018): 178-184.
- Jinzhe Zeng, Duo Zhang, Denghui Lu, Pinghui Mo, Zeyu Li, Yixiao Chen, Marián Rynik, Li'ang Huang, Ziyao Li, Shaochen Shi, Yingze Wang, Haotian Ye, Ping Tuo, Jiabin Yang, Ye Ding, Yifan Li, Davide Tisi, Qiyu Zeng, Han Bao, Yu Xia, Jiameng Huang, Koki Muraoka, Yibo Wang, Junhan Chang, Fengbo Yuan, Sigbjørn Løland Bore, Chun Cai, Yinnian Lin, Bo Wang, Jiayan Xu, Jia-Xin Zhu, Chenxing Luo, Yuzhi Zhang, Rhys E. A. Goodall, Wenshuo Liang, Anurag Kumar Singh, Sikai Yao, Jingchao Zhang, Renata Wentzcovitch, Jiequn Han, Jie Liu, Weile Jia, Darrin M. York, Weinan E, Roberto Car, Linfeng Zhang, Han Wang. "DeePMD-kit v2: A software package for deep potential models." J. Chem. Phys. 159 (2023): 054801.
In addition, please follow the bib file to cite the methods you used.
Deep Potential in a nutshell
The goal of Deep Potential is to employ deep learning techniques and realize an inter-atomic potential energy model that is general, accurate, computationally efficient and scalable. The key component is to respect the extensive and symmetry-invariant properties of a potential energy model by assigning a local reference frame and a local environment to each atom. Each environment contains a finite number of atoms, whose local coordinates are arranged in a symmetry-preserving way. These local coordinates are then transformed, through a sub-network, to so-called atomic energy. Summing up all the atomic energies gives the potential energy of the system.
The initial proof of concept is in the Deep Potential paper, which employed an approach that was devised to train the neural network model with the potential energy only. With typical ab initio molecular dynamics (AIMD) datasets this is insufficient to reproduce the trajectories. The Deep Potential Molecular Dynamics (DeePMD) model overcomes this limitation. In addition, the learning process in DeePMD improves significantly over the Deep Potential method thanks to the introduction of a flexible family of loss functions. The NN potential constructed in this way reproduces accurately the AIMD trajectories, both classical and quantum (path integral), in extended and finite systems, at a cost that scales linearly with system size and is always several orders of magnitude lower than that of equivalent AIMD simulations.
Although highly efficient, the original Deep Potential model satisfies the extensive and symmetry-invariant properties of a potential energy model at the price of introducing discontinuities in the model. This has negligible influence on a trajectory from canonical sampling but might not be sufficient for calculations of dynamical and mechanical properties. These points motivated us to develop the Deep Potential-Smooth Edition (DeepPot-SE) model, which replaces the non-smooth local frame with a smooth and adaptive embedding network. DeepPot-SE shows great ability in modeling many kinds of systems that are of interest in the fields of physics, chemistry, biology, and materials science.
In addition to building up potential energy models, DeePMD-kit can also be used to build up coarse-grained models. In these models, the quantity that we want to parameterize is the free energy, or the coarse-grained potential, of the coarse-grained particles. See the DeePCG paper for more details.
See our latest paper for details of all features.
Download and install
Please follow our GitHub webpage to download the latest released version and development version.
DeePMD-kit offers multiple installation methods. It is recommended to use easy methods like offline packages, conda and docker.
One may manually install DeePMD-kit by following the instructions on installing the Python interface and installing the C++ interface. The C++ interface is necessary when using DeePMD-kit with LAMMPS, i-PI or GROMACS.
Use DeePMD-kit
A quick start on using DeePMD-kit can be found here.
A full document on options in the training input script is available.
Advanced
- Installation
- Data
- Model
- Overall
- Descriptor
"se_e2_a"
- Descriptor
"se_e2_r"
- Descriptor
"se_e3"
- Descriptor
"se_atten"
- Descriptor
"se_atten_v2"
- Descriptor
"hybrid"
- Descriptor
sel
- Fit energy
- Fit spin energy
- Fit
tensor
likeDipole
andPolarizability
- Fit electronic density of states (DOS)
- Train a Deep Potential model using
type embedding
approach - Deep potential long-range
- Deep Potential - Range Correction (DPRc)
- Linear model
- Interpolation or combination with a pairwise potential
- Training
- Freeze and Compress
- Test
- Inference
- Integrate with third-party packages
- Use NVNMD
Code structure
The code is organized as follows:
data/raw
: tools manipulating the raw data files.examples
: examples.deepmd
: DeePMD-kit python modules.source/api_cc
: source code of DeePMD-kit C++ API.source/ipi
: source code of i-PI client.source/lib
: source code of DeePMD-kit library.source/lmp
: source code of Lammps module.source/gmx
: source code of Gromacs plugin.source/op
: TensorFlow op implementation. working with the library.
Troubleshooting
- Model compatibility
- Installation
- The temperature undulates violently during the early stages of MD
- MD: cannot run LAMMPS after installing a new version of DeePMD-kit
- Do we need to set rcut < half boxsize?
- How to set sel?
- How to control the parallelism of a job?
- How to tune Fitting/embedding-net size?
- Why does a model have low precision?
Contributing
See DeePMD-kit Contributing Guide to become a contributor! 🤓