{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,8]],"date-time":"2026-05-08T22:46:26Z","timestamp":1778280386798,"version":"3.51.4"},"reference-count":39,"publisher":"IOP Publishing","issue":"3","license":[{"start":{"date-parts":[[2021,5,12]],"date-time":"2021-05-12T00:00:00Z","timestamp":1620777600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,5,12]],"date-time":"2021-05-12T00:00:00Z","timestamp":1620777600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/iopscience.iop.org\/info\/page\/text-and-data-mining"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"crossref","award":["EXC 2075 - 390740016"],"award-info":[{"award-number":["EXC 2075 - 390740016"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100004350","name":"Studienstiftung des deutschen Volkes","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100004350","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["iopscience.iop.org"],"crossmark-restriction":false},"short-container-title":["Mach. Learn.: Sci. Technol."],"published-print":{"date-parts":[[2021,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Machine learning has been proven to have the potential to bridge the gap between the accuracy of <jats:italic>ab initio<\/jats:italic> methods and the efficiency of empirical force fields. Neural networks are one of the most frequently used approaches to construct high-dimensional potential energy surfaces. Unfortunately, they lack an inherent uncertainty estimation which is necessary for efficient and automated sampling through the chemical and conformational space to find extrapolative configurations. The identification of the latter is needed for the construction of transferable and uniformly accurate potential energy surfaces. In this paper, we propose an active learning approach that uses the estimated model\u2019s output variance derived in the framework of the optimal experimental design. This method has several advantages compared to the established active learning approaches, e.g. Query-by-Committee, Monte Carlo dropout, feature and latent distances, in terms of the predictive power and computational efficiency. We have shown that the application of the proposed active learning scheme leads to transferable and uniformly accurate potential energy surfaces constructed using only a small fraction of data points. Additionally, it is possible to define a natural threshold value for the proposed uncertainty metric which offers the possibility to generate highly informative training data on-the-fly.<\/jats:p>","DOI":"10.1088\/2632-2153\/abe294","type":"journal-article","created":{"date-parts":[[2021,2,3]],"date-time":"2021-02-03T07:07:01Z","timestamp":1612336021000},"page":"035009","update-policy":"https:\/\/doi.org\/10.1088\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Exploration of transferable and uniformly accurate neural network interatomic potentials using optimal experimental design"],"prefix":"10.1088","volume":"2","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9940-8548","authenticated-orcid":false,"given":"Viktor","family":"Zaverkin","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6178-7669","authenticated-orcid":false,"given":"Johannes","family":"K\u00e4stner","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"266","published-online":{"date-parts":[[2021,5,12]]},"reference":[{"key":"mlstabe294bib1","doi-asserted-by":"publisher","first-page":"712","DOI":"10.1002\/prot.21123","volume":"65","author":"Hornak","year":"2006","journal-title":"Proteins"},{"key":"mlstabe294bib2","doi-asserted-by":"publisher","first-page":"671","DOI":"10.1002\/jcc.21367","volume":"31","author":"Vanommeslaeghe","year":"2010","journal-title":"J. Comput. Chem."},{"key":"mlstabe294bib3","doi-asserted-by":"publisher","first-page":"490","DOI":"10.1002\/(SICI)1096-987X(199604)17:5\/6490::AID-JCC13.0.CO;2-P","volume":"17","author":"Halgren","year":"1996","journal-title":"J. Comput. Chem."},{"key":"mlstabe294bib4","doi-asserted-by":"publisher","first-page":"1584","DOI":"10.1002\/jcc.20082","volume":"25","author":"Mackerell Jr","year":"2004","journal-title":"J. Comput. Chem."},{"key":"mlstabe294bib5","doi-asserted-by":"publisher","first-page":"2336","DOI":"10.1021\/acs.jpclett.9b03664","volume":"11","author":"Dral","year":"2020","journal-title":"J. Phys. Chem. Lett."},{"key":"mlstabe294bib6","doi-asserted-by":"publisher","first-page":"5410","DOI":"10.1021\/acs.jctc.0c00347","volume":"16","author":"Zaverkin","year":"2020","journal-title":"J. Chem. Theory Comput."},{"key":"mlstabe294bib7","doi-asserted-by":"publisher","first-page":"1373","DOI":"10.1093\/mnras\/staa2891","volume":"499","author":"Molpeceres","year":"2020","journal-title":"Mon. Not. R. Astron. Soc."},{"key":"mlstabe294bib8","article-title":"Active learning literature survey Computer Sciences","author":"Settles","year":"2009"},{"key":"mlstabe294bib9","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1038\/s41524-020-0283-z","volume":"6","author":"Vandermause","year":"2020","journal-title":"npj Comput. Mater."},{"key":"mlstabe294bib10","doi-asserted-by":"publisher","first-page":"823","DOI":"10.1080\/00268976.2017.1407460","volume":"116","author":"Guan","year":"2018","journal-title":"Mol. Phys."},{"key":"mlstabe294bib11","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.114.096405","volume":"114","author":"Li","year":"2015","journal-title":"Phys. Rev. Lett."},{"key":"mlstabe294bib12","article-title":"On-the-fly machine learning of quantum mechanical forces and its potential applications for large scale molecular dynamics","author":"Li","year":"2014"},{"key":"mlstabe294bib13","doi-asserted-by":"publisher","first-page":"1351","DOI":"10.1021\/acs.jpclett.7b00038","volume":"8","author":"Browning","year":"2017","journal-title":"J. Phys. Chem. Lett."},{"key":"mlstabe294bib14","doi-asserted-by":"publisher","first-page":"945","DOI":"10.1038\/s41557-020-0527-z","volume":"12","author":"Huang","year":"2020","journal-title":"Nat. Chem."},{"key":"mlstabe294bib15","doi-asserted-by":"publisher","DOI":"10.1063\/1.5023802","volume":"148","author":"Smith","year":"2018","journal-title":"J. Chem. Phys."},{"key":"mlstabe294bib16","doi-asserted-by":"publisher","first-page":"6924","DOI":"10.1039\/C7SC02267K","volume":"8","author":"Gastegger","year":"2017","journal-title":"Chem. Sci."},{"key":"mlstabe294bib17","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevMaterials.3.023804","volume":"3","author":"Zhang","year":"2019","journal-title":"Phys. Rev. Mater."},{"key":"mlstabe294bib18","doi-asserted-by":"publisher","first-page":"88","DOI":"10.1021\/acs.jctc.9b00805","volume":"16","author":"Schran","year":"2020","journal-title":"J. Chem. Theory Comput."},{"key":"mlstabe294bib19","first-page":"pp 1050","article-title":"Dropout as a Bayesian approximation: representing model uncertainty in deep learning","volume":"vol 48,s","author":"Gal","year":"2016"},{"key":"mlstabe294bib20","doi-asserted-by":"publisher","first-page":"7913","DOI":"10.1039\/C9SC02298H","volume":"10","author":"Janet","year":"2019","journal-title":"Chem. Sci."},{"key":"mlstabe294bib21","doi-asserted-by":"publisher","first-page":"8939","DOI":"10.1021\/acs.jpca.7b08750","volume":"121","author":"Janet","year":"2017","journal-title":"J. Phys. Chem. A"},{"key":"mlstabe294bib22","doi-asserted-by":"publisher","first-page":"13973","DOI":"10.1021\/acs.iecr.8b04015","volume":"57","author":"Nandy","year":"2018","journal-title":"Ind. Eng. Chem. Res."},{"key":"mlstabe294bib23","doi-asserted-by":"publisher","first-page":"1071","DOI":"10.1016\/0893-6080(95)00137-9","volume":"9","author":"Cohn","year":"1996","journal-title":"Neural Netw."},{"key":"mlstabe294bib24","doi-asserted-by":"publisher","first-page":"590","DOI":"10.1162\/neco.1992.4.4.590","volume":"4","author":"MacKay","year":"1992","journal-title":"Neural Comput."},{"key":"mlstabe294bib25","author":"Fedorov","year":"1972"},{"key":"mlstabe294bib26","doi-asserted-by":"publisher","DOI":"10.1063\/1.5005095","volume":"148","author":"Gubaev","year":"2018","journal-title":"J. Chem. Phys."},{"key":"mlstabe294bib27","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1016\/j.commatsci.2017.08.031","volume":"140","author":"Podryabinkin","year":"2017","journal-title":"Comput. Mater. Sci."},{"key":"mlstabe294bib28","doi-asserted-by":"publisher","first-page":"2864","DOI":"10.1021\/ci300415d","volume":"52","author":"Ruddigkeit","year":"2012","journal-title":"J. Chem. Inf. Model."},{"key":"mlstabe294bib29","doi-asserted-by":"publisher","DOI":"10.1038\/sdata.2014.22","volume":"1","author":"Ramakrishnan","year":"2014","journal-title":"Sci. Data"},{"key":"mlstabe294bib30","author":"Reddi","year":"2019"},{"key":"mlstabe294bib31","article-title":"TensorFlow: large-scale machine learning on heterogeneous systems software available from tensorflow.org","author":"Abadi","year":"2015"},{"key":"mlstabe294bib32","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1162\/neco.1992.4.1.1","volume":"4","author":"Geman","year":"1992","journal-title":"Neural Comput."},{"key":"mlstabe294bib33","doi-asserted-by":"publisher","first-page":"3865","DOI":"10.1103\/PhysRevLett.77.3865","volume":"77","author":"Perdew","year":"1996","journal-title":"Phys. Rev. Lett."},{"key":"mlstabe294bib34","doi-asserted-by":"publisher","DOI":"10.1063\/1.3382344","volume":"132","author":"Grimme","year":"2010","journal-title":"J. Chem. Phys."},{"key":"mlstabe294bib35","doi-asserted-by":"publisher","first-page":"1456","DOI":"10.1002\/jcc.21759","volume":"32","author":"Grimme","year":"2011","journal-title":"J. Comput. Chem."},{"key":"mlstabe294bib36","doi-asserted-by":"publisher","first-page":"1223","DOI":"10.1063\/1.476673","volume":"109","author":"Rassolov","year":"1998","journal-title":"J. Chem. Phys."},{"key":"mlstabe294bib37","author":"Prechelt","year":"2012"},{"key":"mlstabe294bib38","article-title":"N-ASW: molecular dynamics data (v1)","author":"Molpeceres","year":"2020"},{"key":"mlstabe294bib39","doi-asserted-by":"publisher","DOI":"10.1063\/1.4927476","volume":"143","author":"Grimme","year":"2015","journal-title":"J. Chem. Phys."}],"container-title":["Machine Learning: Science and Technology"],"original-title":[],"link":[{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe294","content-type":"text\/html","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe294\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe294","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe294\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe294\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe294\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe294\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"similarity-checking"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe294\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,12,13]],"date-time":"2021-12-13T15:46:08Z","timestamp":1639410368000},"score":1,"resource":{"primary":{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe294"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,12]]},"references-count":39,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2021,5,12]]},"published-print":{"date-parts":[[2021,9,1]]}},"URL":"https:\/\/doi.org\/10.1088\/2632-2153\/abe294","relation":{},"ISSN":["2632-2153"],"issn-type":[{"value":"2632-2153","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,12]]},"assertion":[{"value":"Exploration of transferable and uniformly accurate neural network interatomic potentials using optimal experimental design","name":"article_title","label":"Article Title"},{"value":"Machine Learning: Science and Technology","name":"journal_title","label":"Journal Title"},{"value":"paper","name":"article_type","label":"Article Type"},{"value":"\u00a9 2021 The Author(s). Published by IOP Publishing Ltd","name":"copyright_information","label":"Copyright Information"},{"value":"2020-09-09","name":"date_received","label":"Date Received","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2021-02-02","name":"date_accepted","label":"Date Accepted","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2021-05-12","name":"date_epub","label":"Online publication date","group":{"name":"publication_dates","label":"Publication dates"}}]}}