Published February 29, 2012 | Version v1
Journal article Open

On Docking, Scoring and Assessing Protein-DNA Complexes in a Rigid-Body Framework

  • 1. University of Chicago

Description

We consider the identification of interacting protein-nucleic acid partners using the rigid body docking method FTdock, which is systematic and exhaustive in the exploration of docking conformations. The accuracy of rigid body docking methods is tested using known protein-DNA complexes for which the docked and undocked structures are both available. Additional tests with large decoy sets probe the efficacy of two published statistically derived scoring functions that contain a huge number of parameters. In contrast, we demonstrate that state-of-the-art machine learning techniques can enormously reduce the number of parameters required, thereby identifying the relevant docking features using a miniscule fraction of the number of parameters in the prior works. The present machine learning study considers a 300 dimensional vector (dependent on only 15 parameters), termed the Chemical Context Profile (CCP), where each dimension reflects a specific type of protein amino acid-nucleic acid base interaction. The CCP is designed to capture the chemical complementarities of the interface and is well suited for machine learning techniques. Our objective function is the Chemical Context Discrepancy (CCD), which is defined as the angle between the native system's CCP vector and the decoy's vector and which serves as a substitute for the more commonly used root mean squared deviation (RMSD). We demonstrate that the CCP provides a useful scoring function when certain dimensions are properly weighted. Finally, we explore how the amino acids on a protein's surface can help guide DNA binding, first through long-range interactions, followed by direct contacts, according to specific preferences for either the major or minor grooves of the DNA.

Files

journal.pone.0032647.pdf

Files (1.5 MB)

Name Size Download all
Article
md5:7d6beda891c241f845b47e692dad9921
808.0 kB Preview Download
md5:b7c5f9e6a2faf1c4c9f5ee2cd0ed37bd
696.6 kB Preview Download

Additional details

Identifiers

DOI
10.1371/journal.pone.0032647
Other
oai:uchicago.tind.io:10670

Funding

National Institutes of Health
GM57880
National Science Foundation
CHE-1111918
National Institutes of Health
S10 RR029030-01
National Science Foundation
OCI-1007115

UChicago Information

Division(s)
Biological Sciences Division
Department(s)
Biochemistry and Molecular Biology, Chemistry, Organismal Biology and Anatomy
Center(s) or Institute(s)
Institute for Biophysical Dynamics