Bayesian-inference-driven model parametrization and model selection for 2CLJQ fluid models Journal Article


Authors: Madin, O. C.; Boothroyd, S.; Messerly, R. A.; Fass, J.; Chodera, J. D.; Shirts, M. R.
Article Title: Bayesian-inference-driven model parametrization and model selection for 2CLJQ fluid models
Abstract: A high level of physical detail in a molecular model improves its ability to perform high accuracy simulations but can also significantly affect its complexity and computational cost. In some situations, it is worthwhile to add complexity to a model to capture properties of interest; in others, additional complexity is unnecessary and can make simulations computationally infeasible. In this work, we demonstrate the use of Bayesian inference for molecular model selection, using Monte Carlo sampling techniques accelerated with surrogate modeling to evaluate the Bayes factor evidence for different levels of complexity in the two-centered Lennard-Jones + quadrupole (2CLJQ) fluid model. Examining three nested levels of model complexity, we demonstrate that the use of variable quadrupole and bond length parameters in this model framework is justified only for some chemistries. Through this process, we also get detailed information about the distributions and correlation of parameter values, enabling improved parametrization and parameter analysis. We also show how the choice of parameter priors, which encode previous model knowledge, can have substantial effects on the selection of models, penalizing careless introduction of additional complexity. We detail the computational techniques used in this analysis, providing a roadmap for future applications of molecular model selection via Bayesian inference and surrogate modeling. © 2022 American Chemical Society
Keywords: molecular model; computational chemistry; property; monte carlo methods; computational methods; bayesian inference; computational costs; article; inference engines; bayesian networks; model selection; molecular modeling; high-accuracy; bond length; complexity costs; fluid modeling; model parametrization; quadrupoles; surrogate modeling
Journal Title: Journal of Chemical Information and Modeling
Volume: 62
Issue: 4
ISSN: 1549-9596
Publisher: American Chemical Society  
Date Published: 2022-02-28
Start Page: 874
End Page: 889
Language: English
DOI: 10.1021/acs.jcim.1c00829
PUBMED: 35129974
PROVIDER: scopus
PMCID: PMC9217127
DOI/URL:
Notes: Article -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. John Damon Chodera
    118 Chodera