Published November 29, 2018 | Version v1
Journal article Open

Coevolving residues inform protein dynamics profiles and disease susceptibility of nSNVs

  • 1. Arizona State University
  • 2. University of Chicago

Description

The conformational dynamics of proteins is rarely used in methodologies used to predict the impact of genetic mutations due to the paucity of three-dimensional protein structures as compared to the vast number of available sequences. Until now a three-dimensional (3D) structure has been required to predict the conformational dynamics of a protein. We introduce an approach that estimates the conformational dynamics of a protein, without relying on structural information. This de novo approach utilizes coevolving residues identified from a multiple sequence alignment (MSA) using Potts models. These coevolving residues are used as contacts in a Gaussian network model (GNM) to obtain protein dynamics. B-factors calculated using sequence-based GNM (Seq-GNM) are in agreement with crystallographic B-factors as well as theoretical B-factors from the original GNM that utilizes the 3D structure. Moreover, we demonstrate the ability of the calculated B-factors from the Seq-GNM approach to discriminate genomic variants according to their phenotypes for a wide range of proteins. These results suggest that protein dynamics can be approximated based on sequence information alone, making it possible to assess the phenotypes of nSNVs in cases where a 3D structure is unknown. We hope this work will promote the use of dynamics information in genetic disease prediction at scale by circumventing the need for 3D structures.

Data availability

All relevant data are within the paper and its Supporting Information files.

Files

journal.pcbi.1006626.pdf

Files (3.9 MB)

Name Size Download all
Article
md5:f5f90e12b0dcbda7c9ff2c9edc9adb7f
3.9 MB Preview Download
Tables
md5:1cf61f5454c0083e969213f4850bf8d6
42.9 kB Preview Download

Additional details

Identifiers

DOI
10.1371/journal.pcbi.1006626
Other
oai:uchicago.tind.io:6372

Funding

NSF-MCB
1715591
RCS
Scialog Fellow Award
Gordon & Betty Moore Foundation

UChicago Information

Division(s)
Harris School of Public Policy Studies
Department(s)
Harris School of Public Policy Studies Research Publications
Center(s) or Institute(s)
Data Science Institute