Combining Force Fields and Neural Networks for an Accurate Representation of Chemically Diverse Molecular Interactions. Journal of the American Chemical Society Illarionov, A., Sakipov, S., Pereyaslavets, L., Kurnikov, I. V., Kamath, G., Butin, O., Voronina, E., Ivahnenko, I., Leontyev, I., Nawrocki, G., Darkhovskiy, M., Olevanov, M., Cherniavskyi, Y. K., Lock, C., Greenslade, S., Sankaranarayanan, S. K., Kurnikova, M. G., Potoff, J., Kornberg, R. D., Levitt, M., Fain, B. 2023

Abstract

A key goal of molecular modeling is the accurate reproduction of the true quantum mechanical potential energy of arbitrary molecular ensembles with a tractable classical approximation. The challenges are that analytical expressions found in general purpose force fields struggle to faithfully represent the intermolecular quantum potential energy surface at close distances and in strong interaction regimes; that the more accurate neural network approximations do not capture crucial physics concepts, e.g., nonadditive inductive contributions and application of electric fields; and that the ultra-accurate narrowly targeted models have difficulty generalizing to the entire chemical space. We therefore designed a hybrid wide-coverage intermolecular interaction model consisting of an analytically polarizable force field combined with a short-range neural network correction for the total intermolecular interaction energy. Here, we describe the methodology and apply the model to accurately determine the properties of water, the free energy of solvation of neutral and charged molecules, and the binding free energy of ligands to proteins. The correction is subtyped for distinct chemical species to match the underlying force field, to segment and reduce the amount of quantum training data, and to increase accuracy and computational speed. For the systems considered, the hybrid ab initio parametrized Hamiltonian reproduces the two-body dimer quantum mechanics (QM) energies to within 0.03 kcal/mol and the nonadditive many-molecule contributions to within 2%. Simulations of molecular systems using this interaction model run at speeds of several nanoseconds per day.

View details for DOI 10.1021/jacs.3c07628

View details for PubMedID 37856313