{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,11]],"date-time":"2025-11-11T15:55:19Z","timestamp":1762876519383,"version":"3.37.3"},"reference-count":31,"publisher":"IOP Publishing","issue":"4","license":[{"start":{"date-parts":[[2024,10,7]],"date-time":"2024-10-07T00:00:00Z","timestamp":1728259200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2024,10,7]],"date-time":"2024-10-07T00:00:00Z","timestamp":1728259200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/iopscience.iop.org\/info\/page\/text-and-data-mining"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"crossref","award":["ZA 1175\/3-1"],"award-info":[{"award-number":["ZA 1175\/3-1"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["iopscience.iop.org"],"crossmark-restriction":false},"short-container-title":["Mach. Learn.: Sci. Technol."],"published-print":{"date-parts":[[2024,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Multifidelity machine learning (MFML) for quantum chemical properties has seen strong development in the recent years. The method has been shown to reduce the cost of generating training data for high-accuracy low-cost ML models. In such a set-up, the ML models are trained on molecular geometries and some property of interest computed at various computational chemistry accuracies, or fidelities. These are then combined in training the MFML models. In some multifidelity models, the training data is required to be nested, that is the same molecular geometries are included to calculate the property across all the fidelities. In these multifidelity models, the requirement of a nested configuration restricts the kind of sampling that can be performed while selection training samples at different fidelities. This work assesses the use of non-nested training data for two of these multifidelity methods, namely MFML and optimized MFML (o-MFML). The assessment is carried out for the prediction of ground state energies and first vertical excitation energies of a diverse collection of molecules of the CheMFi dataset. Results indicate that the MFML method still requires a nested structure of training data across the fidelities. However, the o-MFML method shows promising results for non-nested multifidelity training data with model errors comparable to the nested configurations.<\/jats:p>","DOI":"10.1088\/2632-2153\/ad7f25","type":"journal-article","created":{"date-parts":[[2024,9,24]],"date-time":"2024-09-24T23:01:40Z","timestamp":1727218900000},"page":"045005","update-policy":"https:\/\/doi.org\/10.1088\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Assessing non-nested configurations of multifidelity machine learning for quantum-chemical properties"],"prefix":"10.1088","volume":"5","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6218-5053","authenticated-orcid":true,"given":"Vivin","family":"Vinod","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7028-6580","authenticated-orcid":true,"given":"Peter","family":"Zaspel","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"266","published-online":{"date-parts":[[2024,10,7]]},"reference":[{"key":"mlstad7f25bib1","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-020-00460-5","article-title":"Molecular representations in AI-driven drug discovery: a review and practical guide","volume":"12","author":"David","year":"2020","journal-title":"J. Cheminf."},{"key":"mlstad7f25bib2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-017-0089-0","article-title":"A survey on heterogeneous transfer learning","volume":"4","author":"Day","year":"2017","journal-title":"J. Big Data"},{"key":"mlstad7f25bib3","doi-asserted-by":"publisher","first-page":"388","DOI":"10.1038\/s41570-021-00278-1","article-title":"Molecular excited states through a machine learning lens","volume":"5","author":"Dral","year":"2021","journal-title":"Nat. Rev. Chem."},{"key":"mlstad7f25bib4","doi-asserted-by":"publisher","DOI":"10.1063\/5.0006498","article-title":"Hierarchical machine learning of potential energy surfaces","volume":"152","author":"Dral","year":"2020","journal-title":"J. Chem. Phys."},{"key":"mlstad7f25bib5","doi-asserted-by":"publisher","DOI":"10.1063\/5.0201681","article-title":"Multitask methods for predicting molecular properties from heterogeneous data","volume":"161","author":"Fisher","year":"2024","journal-title":"J. Chem. Phys."},{"key":"mlstad7f25bib6","doi-asserted-by":"publisher","first-page":"128","DOI":"10.1016\/j.jcp.2013.06.013","article-title":"Combination technique based k-th moment analysis of elliptic problems with random diffusion","volume":"252","author":"Harbrecht","year":"2013","journal-title":"J. Comput. Phys."},{"key":"mlstad7f25bib7","first-page":"pp 143","article-title":"Recent developments in the theory and application of the sparse grid combination technique","author":"Hegland","year":"2016"},{"key":"mlstad7f25bib8","doi-asserted-by":"publisher","first-page":"945","DOI":"10.1038\/s41557-020-0527-z","article-title":"Quantum machine learning using atom-in-molecule-based fragments selected on the fly","volume":"12","author":"Huang","year":"2020","journal-title":"Nat. Chem."},{"article-title":"Charge and exciton transfer simulations using machine-learned hamiltonians","author":"Kr\u00e4mer","key":"mlstad7f25bib9","doi-asserted-by":"crossref","DOI":"10.1021\/acs.jctc.0c00246"},{"key":"mlstad7f25bib10","doi-asserted-by":"publisher","first-page":"1134","DOI":"10.1109\/TPAMI.2013.167","article-title":"Learning with augmented features for supervised and semi-supervised heterogeneous domain adaptation","volume":"36","author":"Li","year":"2013","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"mlstad7f25bib11","doi-asserted-by":"publisher","first-page":"10187","DOI":"10.1021\/acs.chemrev.0c00665","article-title":"Carrington. Neural network potential energy surfaces for small molecules and reactions","volume":"121","author":"Manzhos","year":"2021","journal-title":"Chem. Rev."},{"key":"mlstad7f25bib12","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcp.2019.109020","article-title":"A composite neural network that learns from multi-fidelity data: application to function approximation and inverse pde problems","volume":"401","author":"Meng","year":"2020","journal-title":"J. Comput. Phys."},{"key":"mlstad7f25bib13","doi-asserted-by":"crossref","DOI":"10.1016\/j.addma.2024.104440","article-title":"Multi-fidelity surrogate with heterogeneous input spaces for modeling melt pools in laser-directed energy deposition","author":"Menon","year":"2024"},{"key":"mlstad7f25bib14","doi-asserted-by":"publisher","first-page":"4694","DOI":"10.1038\/s41598-022-08413-8","article-title":"Agents for sequential learning using multiple-fidelity data","volume":"12","author":"Palizhati","year":"2022","journal-title":"Sci. Rep."},{"key":"mlstad7f25bib15","doi-asserted-by":"publisher","DOI":"10.1016\/j.commatsci.2019.109286","article-title":"A multi-fidelity information-fusion approach to machine learn and predict polymer bandgap","volume":"172","author":"Patra","year":"2020","journal-title":"Comput. Mat. Sci."},{"key":"mlstad7f25bib16","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/srep02810","article-title":"Accelerating materials property predictions using machine learning","volume":"3","author":"Pilania","year":"2013","journal-title":"Sci. Rep."},{"key":"mlstad7f25bib17","doi-asserted-by":"publisher","first-page":"2087","DOI":"10.1021\/acs.jctc.5b00099","article-title":"Big data meets quantum chemistry approximations: the \u0394-Machine Learning approach","volume":"11","author":"Ramakrishnan","year":"2015","journal-title":"J. Chem. Theory Comput."},{"key":"mlstad7f25bib18","doi-asserted-by":"crossref","DOI":"10.1088\/2632-2153\/ad7ad5","article-title":"Multi-fidelity Gaussian process surrogate modeling for regression problems in physics","author":"Ravi","year":"2024"},{"key":"mlstad7f25bib19","doi-asserted-by":"publisher","first-page":"544","DOI":"10.1093\/imanum\/drs004","article-title":"Analysis of linear difference schemes in the sparse grid combination technique","volume":"33","author":"Reisinger","year":"2013","journal-title":"IMA J. Numer. Anal."},{"key":"mlstad7f25bib20","doi-asserted-by":"publisher","first-page":"05830\u20131 \u2013 05830","DOI":"10.1103\/PhysRevLett.108.058301","article-title":"Fast and accurate modeling of molecular atomization energies with machine learning","volume":"108","author":"Rupp","year":"2012","journal-title":"Phys. Rev. Lett."},{"article-title":"Multi-fidelity learning with heterogeneous domains","year":"2019","author":"Sarkar","key":"mlstad7f25bib21"},{"key":"mlstad7f25bib22","first-page":"pp 1049","article-title":"Transfer learning on heterogenous feature spaces via spectral transformation","author":"Shi","year":"2010"},{"key":"mlstad7f25bib23","doi-asserted-by":"publisher","DOI":"10.1088\/2632-2153\/ad2cef","article-title":"Optimized multifidelity machine learning for quantum chemistry","volume":"5","author":"Vinod","year":"2024","journal-title":"Mach. Learn.: Sci. Technol."},{"key":"mlstad7f25bib24","doi-asserted-by":"publisher","first-page":"7658","DOI":"10.1021\/acs.jctc.3c00882","article-title":"Multifidelity machine learning for molecular excitation energies","volume":"19","author":"Vinod","year":"2023","journal-title":"J. Chem. Theory Comput."},{"key":"mlstad7f25bib25","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.12734761","article-title":"CheMFi: a multifidelity dataset of quantum chemical properties of diverse molecules","author":"Vinod","year":"2024"},{"article-title":"CheMFi: a multifidelity dataset of quantum chemical properties of diverse molecules","year":"2024","author":"Vinod","key":"mlstad7f25bib26"},{"key":"mlstad7f25bib27","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-016-0043-6","article-title":"A survey of transfer learning","volume":"3","author":"Weiss","year":"2016","journal-title":"J. Big Data"},{"key":"mlstad7f25bib28","doi-asserted-by":"publisher","DOI":"10.1063\/5.0047760","article-title":"Perspective on integrating machine learning into computational chemistry and materials science","volume":"154","author":"Westermayr","year":"2021","journal-title":"J. Chem. Phys."},{"key":"mlstad7f25bib29","doi-asserted-by":"publisher","first-page":"9873","DOI":"10.1021\/acs.chemrev.0c00749","article-title":"Machine learning for electronically excited states of molecules","volume":"121","author":"Westermayr","year":"2020","journal-title":"Chem. Rev."},{"key":"mlstad7f25bib30","doi-asserted-by":"publisher","first-page":"1546","DOI":"10.1021\/acs.jctc.8b00832","article-title":"Boosting quantum machine learning models with a multilevel combination technique: Pople diagrams revisited","volume":"15","author":"Zaspel","year":"2019","journal-title":"J. Chem. Theory Comput."},{"key":"mlstad7f25bib31","first-page":"pp 1304","article-title":"Heterogeneous transfer learning for image classification","volume":"vol 25","author":"Zhu","year":"2011"}],"container-title":["Machine Learning: Science and Technology"],"original-title":[],"link":[{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ad7f25","content-type":"text\/html","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ad7f25\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ad7f25","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ad7f25\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ad7f25\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ad7f25\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ad7f25\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"similarity-checking"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ad7f25\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,7]],"date-time":"2024-10-07T12:08:08Z","timestamp":1728302888000},"score":1,"resource":{"primary":{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ad7f25"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,7]]},"references-count":31,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2024,10,7]]},"published-print":{"date-parts":[[2024,12,1]]}},"URL":"https:\/\/doi.org\/10.1088\/2632-2153\/ad7f25","relation":{},"ISSN":["2632-2153"],"issn-type":[{"type":"electronic","value":"2632-2153"}],"subject":[],"published":{"date-parts":[[2024,10,7]]},"assertion":[{"value":"Assessing non-nested configurations of multifidelity machine learning for quantum-chemical properties","name":"article_title","label":"Article Title"},{"value":"Machine Learning: Science and Technology","name":"journal_title","label":"Journal Title"},{"value":"paper","name":"article_type","label":"Article Type"},{"value":"\u00a9 2024 The Author(s). Published by IOP Publishing Ltd","name":"copyright_information","label":"Copyright Information"},{"value":"2024-07-24","name":"date_received","label":"Date Received","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2024-09-24","name":"date_accepted","label":"Date Accepted","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2024-10-07","name":"date_epub","label":"Online publication date","group":{"name":"publication_dates","label":"Publication dates"}}]}}