{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,15]],"date-time":"2025-10-15T18:02:55Z","timestamp":1760551375433,"version":"3.37.3"},"reference-count":74,"publisher":"IOP Publishing","issue":"3","license":[{"start":{"date-parts":[[2021,4,22]],"date-time":"2021-04-22T00:00:00Z","timestamp":1619049600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,4,22]],"date-time":"2021-04-22T00:00:00Z","timestamp":1619049600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/iopscience.iop.org\/info\/page\/text-and-data-mining"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"crossref","award":["Be3264\/10-1"],"award-info":[{"award-number":["Be3264\/10-1"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["iopscience.iop.org"],"crossmark-restriction":false},"short-container-title":["Mach. Learn.: Sci. Technol."],"published-print":{"date-parts":[[2021,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>In recent years the development of machine learning potentials (MLPs) has become a very active field of research. Numerous approaches have been proposed, which allow one to perform extended simulations of large systems at a small fraction of the computational costs of electronic structure calculations. The key to the success of modern MLPs is the close-to first principles quality description of the atomic interactions. This accuracy is reached by using very flexible functional forms in combination with high-level reference data from electronic structure calculations. These data sets can include up to hundreds of thousands of structures covering millions of atomic environments to ensure that all relevant features of the potential energy surface are well represented. The handling of such large data sets is nowadays becoming one of the main challenges in the construction of MLPs. In this paper we present a method, the bin-and-hash (BAH) algorithm, to overcome this problem by enabling the efficient identification and comparison of large numbers of multidimensional vectors. Such vectors emerge in multiple contexts in the construction of MLPs. Examples are the comparison of local atomic environments to identify and avoid unnecessary redundant information in the reference data sets that is costly in terms of both the electronic structure calculations as well as the training process, the assessment of the quality of the descriptors used as structural fingerprints in many types of MLPs, and the detection of possibly unreliable data points. The BAH algorithm is illustrated for the example of high-dimensional neural network potentials using atom-centered symmetry functions for the geometrical description of the atomic environments, but the method is general and can be combined with any current type of MLP.<\/jats:p>","DOI":"10.1088\/2632-2153\/abe663","type":"journal-article","created":{"date-parts":[[2021,2,15]],"date-time":"2021-02-15T22:32:08Z","timestamp":1613428328000},"page":"037001","update-policy":"https:\/\/doi.org\/10.1088\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["A bin and hash method for analyzing reference data and descriptors in machine learning potentials"],"prefix":"10.1088","volume":"2","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8427-0221","authenticated-orcid":false,"given":"Mart\u00edn Leandro","family":"Paleico","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1220-1542","authenticated-orcid":false,"given":"J\u00f6rg","family":"Behler","sequence":"additional","affiliation":[]}],"member":"266","published-online":{"date-parts":[[2021,4,22]]},"reference":[{"key":"mlstabe663bib1","doi-asserted-by":"publisher","DOI":"10.1063\/1.4966192","article-title":"Perspective: machine learning potentials for atomistic simulations","volume":"145","author":"Behler","year":"2016","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib2","doi-asserted-by":"publisher","first-page":"511","DOI":"10.1021\/acs.jpcc.6b10908","article-title":"Machine learning force fields: construction, validation and outlook","volume":"121","author":"Botu","year":"2017","journal-title":"J. Phys. Chem. C"},{"key":"mlstabe663bib3","doi-asserted-by":"publisher","DOI":"10.1002\/adma.201902765","article-title":"Machine learning interatomic potentials as emerging tools for materials science","volume":"31","author":"Deringer","year":"2019","journal-title":"Adv. Mater."},{"key":"mlstabe663bib4","doi-asserted-by":"publisher","first-page":"B864","DOI":"10.1103\/PhysRev.136.B864","article-title":"Inhomogeneous electron gas","volume":"136","author":"Hohenberg","year":"1964","journal-title":"Phys. Rev."},{"key":"mlstabe663bib5","doi-asserted-by":"publisher","first-page":"A1133","DOI":"10.1103\/PhysRev.140.A1133","article-title":"Self-consistent equations including exchange and correlation effects","volume":"140","author":"Kohn","year":"1965","journal-title":"Phys. Rev."},{"key":"mlstabe663bib6","doi-asserted-by":"publisher","first-page":"4129","DOI":"10.1063\/1.469597","article-title":"Neural network models of potential energy surfaces","volume":"103","author":"Blank","year":"1995","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib7","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.98.146401","article-title":"Generalized neural-network representation of high-dimensional potential-energy surfaces","volume":"98","author":"Behler","year":"2007","journal-title":"Phys. Rev. Lett."},{"key":"mlstabe663bib8","doi-asserted-by":"publisher","DOI":"10.1063\/1.4817187","article-title":"Permutation invariant polynomial neural network approach to fitting potential energy surfaces","volume":"139","author":"Jiang","year":"2013","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib9","doi-asserted-by":"publisher","first-page":"210","DOI":"10.1016\/j.cplett.2004.07.076","article-title":"Representing high-dimensional potential-energy surfaces for reactions at surfaces by neural networks","volume":"395","author":"Lorenz","year":"2004","journal-title":"Chem. Phys. Lett."},{"key":"mlstabe663bib10","doi-asserted-by":"publisher","DOI":"10.1063\/1.3021471","article-title":"Using neural networks, optimized coordinates and high-dimensional model representations to obtain a vinyl bromide potential surface","volume":"129","author":"Manzhos","year":"2008","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib11","doi-asserted-by":"publisher","first-page":"3678","DOI":"10.1021\/acs.jctc.9b00181","article-title":"Physnet: a neural network for predicting energies, forces, dipole moments and partial charges","volume":"15","author":"Unke","year":"2019","journal-title":"J. Chem. Theory Comput."},{"key":"mlstabe663bib12","doi-asserted-by":"publisher","DOI":"10.1063\/1.5019779","article-title":"Schnet\u2014a deep learning architecture for molecules and materials","volume":"148","author":"Sch\u00fctt","year":"2018","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib13","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.120.143001","article-title":"Deep potential molecular dynamics: a scalable model with the accuracy of quantum mechanics","volume":"120","author":"Zhang","year":"2018","journal-title":"Phys. Rev. Lett."},{"key":"mlstabe663bib14","doi-asserted-by":"publisher","first-page":"3192","DOI":"10.1039\/C6SC05720A","article-title":"ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost","volume":"8","author":"Smith","year":"2017","journal-title":"Chem. Sci."},{"key":"mlstabe663bib15","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.104.136403","article-title":"Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons","volume":"104","author":"Bart\u00f3k","year":"2010","journal-title":"Phys. Rev. Lett."},{"key":"mlstabe663bib16","doi-asserted-by":"publisher","first-page":"1051","DOI":"10.1002\/qua.24927","article-title":"Gaussian approximation potentials: a brief tutorial introduction","volume":"115","author":"Bart\u00f3k","year":"2015","journal-title":"Int. J. Quant. Chem."},{"key":"mlstabe663bib17","doi-asserted-by":"publisher","first-page":"1153","DOI":"10.1137\/15M1054183","article-title":"Moment tensor potentials: a class of systematically improvable interatomic potentials","volume":"14","author":"Shapeev","year":"2016","journal-title":"Multiscale Model. Simul."},{"key":"mlstabe663bib18","doi-asserted-by":"publisher","first-page":"316","DOI":"10.1016\/j.jcp.2014.12.018","article-title":"Spectral neighbor analysis method for automated generation of quantum-accurate interatomic potentials","volume":"285","author":"Thompson","year":"2015","journal-title":"J. Comput. Phys."},{"key":"mlstabe663bib19","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.98.144102","article-title":"Electronic structure based descriptor for characterizing local atomic environments","volume":"98","author":"Jenke","year":"2018","journal-title":"Phys. Rev. B"},{"key":"mlstabe663bib20","doi-asserted-by":"publisher","DOI":"10.1039\/C1CP00051A","article-title":"Support vector machine regression (LS-SVM)-an alternative to artificial neural networks (ANNS) for the analysis of quantum chemistry data?","volume":"13","author":"Balabin","year":"2011","journal-title":"Phys. Chem. Chem. Phys."},{"key":"mlstabe663bib21","doi-asserted-by":"publisher","first-page":"6924","DOI":"10.1039\/C7SC02267K","article-title":"Machine learning molecular dynamics for the simulation of infrared spectra","volume":"8","author":"Gastegger","year":"2017","journal-title":"Chem. Sci."},{"key":"mlstabe663bib22","doi-asserted-by":"publisher","first-page":"1435","DOI":"10.1021\/ct800166r","article-title":"Beyond point charges: dynamic polarization from neural net predicted multipole moments","volume":"4","author":"Darley","year":"2008","journal-title":"J. Chem. Theor. Comput."},{"key":"mlstabe663bib23","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1186\/s13321-018-0296-5","article-title":"Machine learning for the prediction of molecular dipole moments obtained by density functional theory","volume":"10","author":"Pereira","year":"2018","journal-title":"J. Cheminformatics"},{"key":"mlstabe663bib24","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.83.153101","article-title":"High-dimensional neural-network potentials for multicomponent systems: applications to zinc oxide","volume":"83","author":"Artrith","year":"2011","journal-title":"Phys. Rev. B"},{"key":"mlstabe663bib25","doi-asserted-by":"publisher","DOI":"10.1063\/1.3682557","article-title":"A neural network potential-energy surface for the water dimer based on environment-dependent atomic energies and charges","volume":"136","author":"Morawietz","year":"2012","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib26","doi-asserted-by":"publisher","first-page":"2261","DOI":"10.1039\/C7SC04934J","article-title":"The tensor Mol-0.1 model chemistry: a neural network augmented with long-range physics","volume":"9","author":"Yao","year":"2018","journal-title":"Chem. Sci"},{"key":"mlstabe663bib27","doi-asserted-by":"publisher","first-page":"3225","DOI":"10.1021\/acs.jctc.5b00301","article-title":"Transferable atomic multipole machine learning models for small organic molecules","volume":"11","author":"Bereau","year":"2015","journal-title":"J. Chem. Theory Comput."},{"key":"mlstabe663bib28","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.95.104105","article-title":"High accuracy and transferability of a neural network potential through charge equilibration for calcium fluoride","volume":"95","author":"Faraji","year":"2017","journal-title":"Phys. Rev. B"},{"key":"mlstabe663bib29","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.93.115104","article-title":"Prediction model of band gap for inorganic compounds by combination of density functional theory calculations and machine learning techniques","volume":"93","author":"Lee","year":"2016","journal-title":"Phys. Rev. B"},{"key":"mlstabe663bib30","doi-asserted-by":"publisher","first-page":"156","DOI":"10.1016\/j.commatsci.2016.12.004","article-title":"Multi-fidelity machine learning models for accurate bandgap predictions of solids","volume":"129","author":"Pilania","year":"2017","journal-title":"Comput. Mater. Sci."},{"key":"mlstabe663bib31","doi-asserted-by":"crossref","DOI":"10.1063\/5.0021452","article-title":"Predicting oxidation and spin states by high-dimensional neural networks: applications to lithium manganese oxide spinels","author":"Eckhoff","year":"2020"},{"key":"mlstabe663bib32","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.108.058301","article-title":"Fast and accurate modeling of molecular atomization energies with machine learning","volume":"108","author":"Rupp","year":"2012","journal-title":"Phys. Rev. Lett."},{"key":"mlstabe663bib33","doi-asserted-by":"publisher","first-page":"10502","DOI":"10.1021\/jp1012973","article-title":"Efficient sampling of atomic configurational spaces","volume":"114","author":"P\u00e1rtay","year":"2010","journal-title":"J. Phys. Chem. B"},{"key":"mlstabe663bib34","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.97.195424","article-title":"Neural-network-enhanced evolutionary algorithm applied to supported metal nanoparticles","volume":"97","author":"Kolsbjerg","year":"2018","journal-title":"Phys. Rev. B"},{"key":"mlstabe663bib35","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41524-019-0181-4","article-title":"Genetic algorithms for computational materials discovery accelerated by machine learning","volume":"5","author":"Jennings","year":"2019","journal-title":"npj Comput. Mater."},{"key":"mlstabe663bib36","doi-asserted-by":"publisher","first-page":"13023","DOI":"10.1073\/pnas.1108486108","article-title":"Simplifying the representation of complex free-energy landscapes using sketch-map","volume":"108","author":"Ceriotti","year":"2011","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"mlstabe663bib37","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1186\/s13321-017-0192-4","article-title":"Mapping and classifying molecules from a high-throughput structural database","volume":"9","author":"De","year":"2017","journal-title":"J. Cheminformatics"},{"key":"mlstabe663bib38","doi-asserted-by":"publisher","DOI":"10.1063\/1.4828704","article-title":"Metrics for measuring distances in configuration spaces","volume":"139","author":"Sadeghi","year":"2013","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib39","doi-asserted-by":"publisher","DOI":"10.1063\/1.4940026","article-title":"A fingerprint based metric for measuring similarities of crystalline structures","volume":"144","author":"Zhu","year":"2016","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib40","doi-asserted-by":"publisher","first-page":"13754","DOI":"10.1039\/C6CP00415F","article-title":"Comparing molecules and solids across structural and alchemical space","volume":"18","author":"De","year":"2016","journal-title":"Phys. Chem. Chem. Phys."},{"year":"2020","author":"Parsaeifard","article-title":"An assessment of the structural resolution of various fingerprints commonly used in machine learning","key":"mlstabe663bib41"},{"key":"mlstabe663bib42","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1007\/s13218-015-0381-0","article-title":"Beyond manual tuning of hyperparameters","volume":"29","author":"Hutter","year":"2015","journal-title":"K\u00fcnstl Intell."},{"key":"mlstabe663bib43","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1007\/s13721-016-0125-6","article-title":"A review of automatic selection methods for machine learning algorithms and hyper-parameter values","volume":"5","author":"Luo","year":"2016","journal-title":"Netw. Model. Anal. Health Inform. Bioinform."},{"year":"2017","author":"Klein","first-page":"pp 528","key":"mlstabe663bib44"},{"key":"mlstabe663bib45","doi-asserted-by":"publisher","DOI":"10.1063\/1.5019667","article-title":"wACSF\u2014weighted atom-centered symmetry functions as descriptors in machine learning potentials","volume":"148","author":"Gastegger","year":"2018","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib46","doi-asserted-by":"publisher","first-page":"1351","DOI":"10.1021\/acs.jpclett.7b00038","article-title":"Genetic optimization of training sets for improved machine learning models of molecular properties","volume":"8","author":"Browning","year":"2017","journal-title":"J. Phys. Chem. Lett."},{"key":"mlstabe663bib47","doi-asserted-by":"publisher","DOI":"10.1063\/1.5024611","article-title":"Automatic selection of atomic fingerprints and reference configurations for machine-learning potentials","volume":"148","author":"Imbalzano","year":"2018","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib48","doi-asserted-by":"publisher","first-page":"12828","DOI":"10.1002\/anie.201703114","article-title":"First principles neural network potentials for reactive simulations of large molecular and condensed systems","volume":"56","author":"Behler","year":"2017","journal-title":"Angew. Chem., Int. Ed."},{"key":"mlstabe663bib49","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.87.184115","article-title":"On representing chemical environments","volume":"87","author":"Bart\u00f3k","year":"2013","journal-title":"Phys. Rev. B"},{"key":"mlstabe663bib50","doi-asserted-by":"publisher","first-page":"2991","DOI":"10.1021\/acs.jctc.8b00110","article-title":"Many-body descriptors for predicting molecular properties with machine learning: analysis of pairwise and three-body interactions in molecules","volume":"14","author":"Pronobis","year":"2018","journal-title":"J. Chem. Theory Comput."},{"key":"mlstabe663bib51","doi-asserted-by":"publisher","DOI":"10.1063\/1.4983392","article-title":"Spherical harmonics based descriptor for neural network potentials: structure and dynamics of Au147 nanocluster","volume":"146","author":"Jindal","year":"2017","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib52","doi-asserted-by":"publisher","DOI":"10.1063\/1.5086167","article-title":"A novel approach to describe chemical environments in high-dimensional neural network potentials","volume":"150","author":"Kocer","year":"2019","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib53","doi-asserted-by":"publisher","DOI":"10.1063\/1.5020710","article-title":"Alchemical and structural distribution based representation for universal quantum machine learning","volume":"148","author":"Faber","year":"2018","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib54","doi-asserted-by":"publisher","DOI":"10.1063\/1.3553717","article-title":"Atom-centered symmetry functions for constructing high-dimensional neural network potentials","volume":"134","author":"Behler","year":"2011","journal-title":"J. Chem. Phys."},{"key":"mlstabe663bib55","doi-asserted-by":"publisher","DOI":"10.1088\/0953-8984\/26\/18\/183001","article-title":"Representing potential energy surfaces by high-dimensional neural network potentials","volume":"26","author":"Behler","year":"2014","journal-title":"J. Phys.: Condens. Matter"},{"key":"mlstabe663bib56","doi-asserted-by":"publisher","first-page":"1032","DOI":"10.1002\/qua.24890","article-title":"Constructing high-dimensional neural network potentials: a tutorial review","volume":"115","author":"Behler","year":"2015","journal-title":"Int. J. Quantum Chem."},{"key":"mlstabe663bib57","first-page":"pp 287","article-title":"Query by committee","author":"Seung","year":"1992"},{"key":"mlstabe663bib58","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.85.045439","article-title":"High-dimensional neural network potentials for metal surfaces: aprototype study for copper","volume":"85","author":"Artrith","year":"2012","journal-title":"Phys. Rev. B"},{"key":"mlstabe663bib59","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1016\/j.commatsci.2017.08.031","article-title":"Active learning of linearly parametrized interatomic potentials","volume":"140","author":"Podryabinkin","year":"2017","journal-title":"Comp. Mater. Sci."},{"key":"mlstabe663bib60","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevMaterials.3.023804","article-title":"Active learning of uniformly accurate interatomic potentials for materials simulation","volume":"3","author":"Zhang","year":"2019","journal-title":"Phys. Rev. Mater."},{"key":"mlstabe663bib61","doi-asserted-by":"publisher","first-page":"88","DOI":"10.1021\/acs.jctc.9b00805","article-title":"Automated fitting of neural network potentials at coupled cluster accuracy: protonated water clusters as testing ground","volume":"16","author":"Schran","year":"2020","journal-title":"J. Chem. Theory Comput."},{"year":"2009","author":"Cormen","key":"mlstabe663bib62"},{"key":"mlstabe663bib63","doi-asserted-by":"publisher","first-page":"509","DOI":"10.1145\/361002.361007","article-title":"Multidimensional binary search trees used for associative searching","volume":"18","author":"Bentley","year":"1975","journal-title":"Commun. ACM"},{"key":"mlstabe663bib64","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1080\/14786440109462720","article-title":"On lines and planes of closest fit to systems of points in space","volume":"2","author":"Pearson","year":"1901","journal-title":"London, Edinburgh Dublin Phil. Mag. J. Sci."},{"key":"mlstabe663bib65","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1037\/h0071325","article-title":"Analysis of a complex of statistical variables into principal components","volume":"24","author":"Hotelling","year":"1933","journal-title":"J. Educ. Psychol."},{"year":"2002","author":"Frenkel","key":"mlstabe663bib66"},{"year":"2001","author":"Aggarwal","first-page":"420","key":"mlstabe663bib67"},{"key":"mlstabe663bib68"},{"author":"","article-title":"5. Data structures","key":"mlstabe663bib69"},{"key":"mlstabe663bib70"},{"key":"mlstabe663bib71","doi-asserted-by":"publisher","first-page":"12682","DOI":"10.1021\/acs.jpcc.0c00559","article-title":"Properties of \u03b1-Brass nanoparticles. 1. Neural network potential energy surface","volume":"124","author":"Weinreich","year":"2020","journal-title":"J. Phys. Chem. C"},{"key":"mlstabe663bib72","doi-asserted-by":"publisher","first-page":"288","DOI":"10.1103\/PhysRevLett.75.288","article-title":"Molecular geometry optimization with a genetic algorithm","volume":"75","author":"Deaven","year":"1995","journal-title":"Phys. Rev. Lett."},{"key":"mlstabe663bib73","doi-asserted-by":"publisher","first-page":"9911","DOI":"10.1063\/1.1724816","article-title":"Minima hopping: an efficient search method for the global minimum of the potential energy surface of complex molecular systems","volume":"120","author":"Goedecker","year":"2004","journal-title":"J. Chem. Phys."},{"year":"2007","author":"Voter","first-page":"pp 1","key":"mlstabe663bib74"}],"container-title":["Machine Learning: Science and Technology"],"original-title":[],"link":[{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe663","content-type":"text\/html","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe663\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe663","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe663\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe663\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe663\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe663\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"similarity-checking"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe663\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,1,6]],"date-time":"2022-01-06T13:42:36Z","timestamp":1641476556000},"score":1,"resource":{"primary":{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/abe663"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,22]]},"references-count":74,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2021,4,22]]},"published-print":{"date-parts":[[2021,9,1]]}},"URL":"https:\/\/doi.org\/10.1088\/2632-2153\/abe663","relation":{},"ISSN":["2632-2153"],"issn-type":[{"type":"electronic","value":"2632-2153"}],"subject":[],"published":{"date-parts":[[2021,4,22]]},"assertion":[{"value":"A bin and hash method for analyzing reference data and descriptors in machine learning potentials","name":"article_title","label":"Article Title"},{"value":"Machine Learning: Science and Technology","name":"journal_title","label":"Journal Title"},{"value":"paper","name":"article_type","label":"Article Type"},{"value":"\u00a9 2021 The Author(s). Published by IOP Publishing Ltd","name":"copyright_information","label":"Copyright Information"},{"value":"2020-12-08","name":"date_received","label":"Date Received","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2021-02-15","name":"date_accepted","label":"Date Accepted","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2021-04-22","name":"date_epub","label":"Online publication date","group":{"name":"publication_dates","label":"Publication dates"}}]}}