Published February 12, 2021 | Version v1
Journal article Open

OpenAWSEM with Open3SPN2: A fast, flexible, and accessible framework for large-scale coarse-grained biomolecular simulations

Description

We present OpenAWSEM and Open3SPN2, new cross-compatible implementations of coarse-grained models for protein (AWSEM) and DNA (3SPN2) molecular dynamics simulations within the OpenMM framework. These new implementations retain the chemical accuracy and intrinsic efficiency of the original models while adding GPU acceleration and the ease of forcefield modification provided by OpenMM's Custom Forces software framework. By utilizing GPUs, we achieve around a 30-fold speedup in protein and protein-DNA simulations over the existing LAMMPS-based implementations running on a single CPU core. We showcase the benefits of OpenMM's Custom Forces framework by devising and implementing two new potentials that allow us to address important aspects of protein folding and structure prediction and by testing the ability of the combined OpenAWSEM and Open3SPN2 to model protein-DNA binding. The first potential is used to describe the changes in effective interactions that occur as a protein becomes partially buried in a membrane. We also introduced an interaction to describe proteins with multiple disulfide bonds. Using simple pairwise disulfide bonding terms results in unphysical clustering of cysteine residues, posing a problem when simulating the folding of proteins with many cysteines. We now can computationally reproduce Anfinsen's early Nobel prize winning experiments by using OpenMM's Custom Forces framework to introduce a multi-body disulfide bonding term that prevents unphysical clustering. Our protein-DNA simulations show that the binding landscape is funneled towards structures that are quite similar to those found using experiments. In summary, this paper provides a simulation tool for the molecular biophysics community that is both easy to use and sufficiently efficient to simulate large proteins and large protein-DNA systems that are central to many cellular processes. These codes should facilitate the interplay between molecular simulations and cellular studies, which have been hampered by the large mismatch between the time and length scales accessible to molecular simulations and those relevant to cell biology.

Data availability

All relevant data are within the manuscript and its Supporting information files. All codes can be found in GitHub: https://github.com/npschafer/openawsem, and Open3SPN2, and https://github.com/cabb99/open3spn2.

Files

journal.pcbi.1008308.pdf

Files (7.1 MB)

Name Size Download all
Article
md5:7c94e6e39c43bd49eb30be6f601c6ebe
3.1 MB Preview Download
Supporting information
md5:41fea345d4472555ba197b1c799561c9
4.0 MB Preview Download

Additional details

Identifiers

DOI
10.1371/journal.pcbi.1008308
Other
oai:uchicago.tind.io:5980

Funding

Center for Theoretical Biological Physics
National Science Foundation
PHY- 2019745
Rice University
D. R. Bullard-Welch Chair
National Science Foundation
BIO/MCB 1818328

UChicago Information

Division(s)
Pritzker School of Molecular Engineering