Running and analyzing massively parallel molecular simulations

15 October 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Protein conformational landscapes contain the functionally relevant information useful for understanding biological processes. Mapping out conformational landscapes provides valuable insights into protein behaviors and biological phenomena, and has relevance to therapeutic design. While experimental structural biology (X-ray, Cryo-EM, NMR) can provide high resolution structures, they struggle to provide information about the full conformational landscapes of biomolecules. Molecular dynamics (MD) simulations are a powerful tool for exploring these landscapes at atomic-scale resolution. However, inferring functionally relevant information, such as the full conformational pathway of long-timescale processes, the impact of mutations on binding, or allosteric coupling between residues across long distances, requires too extensive sampling that a single MD simulation may not achieve. This sampling limitation can be circumvented by generating datasets of parallel molecular simulations, a powerful approach to sample long-timescale events and study complex biological phenomena. Here, we discuss recent advances and present a practical guide to generating massively parallel molecular dynamics datasets. We start by detailing the practical considerations prior to generating a dataset, spanning from storage needs to the timescales addressed by the dataset, as well as modern simulation engines. Subsequently, we discuss how to analyze thee datasets to build unified models of conformational space, including future insights to be made possible by distributed simulation architectures.

Keywords

Molecular dynamics
Distributed Computing
Allostery
Markov state models
Statistical modeling

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.