Published August 18, 2023 | Version v1
Journal article Open

A Hybrid Structure-Based Machine Learning Approach for Predicting Kinase Inhibition by Small Molecules

  • 1. Harvard University
  • 2. Novartis Institutes for Biomedical Research
  • 3. University of Chicago
  • 4. Columbia University

Description

Kinases have been the focus of drug discovery programs for three decades leading to over 70 therapeutic kinase inhibitors and biophysical affinity measurements for over 130,000 kinase-compound pairs. Nonetheless, the precise target spectrum for many kinases remains only partly understood. In this study, we describe a computational approach to unlocking qualitative and quantitative kinome-wide binding measurements for structure-based machine learning. Our study has three components: (i) a Kinase Inhibitor Complex (KinCo) data set comprising in silico predicted kinase structures paired with experimental binding constants, (ii) a machine learning loss function that integrates qualitative and quantitative data for model training, and (iii) a structure-based machine learning model trained on KinCo. We show that our approach outperforms methods trained on crystal structures alone in predicting binary and quantitative kinase-compound interaction affinities; relative to structure-free methods, our approach also captures known kinase biochemistry and more successfully generalizes to distant kinase sequences and compound scaffolds.

Data availability

The code required to generate the docked structures and to train the models is deposited in https://github.com/labsyspharm/KinCo [DOI: 10.5281/zenodo.7703409]. The kinase-compound affinities used for training are available under https://github.com/labsyspharm/KinCo/tree/main/resources. Selected docked poses of all kinase-compound pairs in KinCo are available for interactive viewing on the Rshiny app https://lsp.connect.hms.harvard.edu/ikinco/. Due to the size of the files (∼2TB), all docked structures and homology models in KinCo are available for download via Globus. Instructions for accessing the data can be found on https://lsp.connect.hms.harvard.edu/ikinco/ and at https://github.com/labsyspharm/KinCo [DOI: 10.5281/zenodo.7703409].

Files

liu-et-al-2023-a-hybrid-structure-based-machine-learning-approach-for-predicting-kinase-inhibition-by-small-molecules.pdf

Additional details

Identifiers

DOI
10.1021/acs.jcim.3c00347
Other
oai:uchicago.tind.io:13472

Funding

DARPA
PANACEA
National Institutes of Health
U24-DK116204

UChicago Information

Division(s)
Pritzker School of Molecular Engineering