Published December 2, 2025 | Version v1
Journal article

Cartesian equivariant representations for learning and understanding molecular orbitals

  • 1. University of Chicago
  • 2. University of California, Berkeley

Description

Qualitative and quantitative orbital properties such as bonding/antibonding character, localization, and orbital energies are critical to how chemists understand reactivity, catalysis, and excited-state behavior. Despite this, representations of orbitals in deep learning models have been very underdeveloped relative to representations of molecular geometries and Hamiltonians. Here, we apply state-of-the-art equivariant deep learning architectures to the task of assigning global labels to orbitals, namely energies characterizations, given the molecular coefficients from Hartree–Fock or density functional theory. The architecture we have developed, the Cartesian Equivariant Orbital Network (CEONET), shows how molecular orbital coefficients are readily featurized as equivariant node features common to all graph-based machine-learned potentials. We find that CEONET performs well at predicting difficult quantitative labels such as the orbital energy and orbital entropy. Furthermore, we find that the CEONET representation provides an intuitive latent space for differentiating orbital character for the qualitative assignment of e.g. bonding or antibonding character. In addition to providing a useful representation for further integrating deep learning with electronic structure theory, we expect CEONET to be useful for automatizing and interpreting the results of advanced electronic structure methods such as complete active space self-consistent field theory. In particular, the ability of CEONET to infer multireference character via the orbital entropy paves the way toward the machine-learned selection of active spaces.

Data availability

Code has been deposited to https://github.com/GagliardiGroup/CEONet (83). Data has been deposited to https://doi.org/10.5281/zenodo.16934624 (84).

Additional details

Identifiers

DOI
10.1073/pnas.2510235122
Other
oai:uchicago.tind.io:16662

Funding

Office of Basic Energy Sciences
DE-SC0023383

UChicago Information

Division(s)
Physical Sciences Division, Pritzker School of Molecular Engineering
Department(s)
Chemistry, Computer Science
Center(s) or Institute(s)
Chicago Center for Theoretical Chemistry, James Franck Institute