{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,3]],"date-time":"2025-10-03T09:01:14Z","timestamp":1759482074349,"version":"3.44.0"},"reference-count":41,"publisher":"IOP Publishing","issue":"3","license":[{"start":{"date-parts":[[2025,8,19]],"date-time":"2025-08-19T00:00:00Z","timestamp":1755561600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2025,8,19]],"date-time":"2025-08-19T00:00:00Z","timestamp":1755561600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/iopscience.iop.org\/info\/page\/text-and-data-mining"}],"funder":[{"name":"Abdul Kalam Technology Innovation National Fellowship","award":["INAE\/SA\/4784"],"award-info":[{"award-number":["INAE\/SA\/4784"]}]},{"name":"LANL\/LDRD Program","award":["20240740PRD1"],"award-info":[{"award-number":["20240740PRD1"]}]}],"content-domain":{"domain":["iopscience.iop.org"],"crossmark-restriction":false},"short-container-title":["Mach. Learn.: Sci. Technol."],"published-print":{"date-parts":[[2025,9,30]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>We propose a physics-based regularization technique for function learning, inspired by statistical mechanics. By drawing an analogy between optimizing the parameters of an interpolator and minimizing the energy of a system, we introduce corrections that impose constraints on the lower-order moments of the data distribution. This minimizes the discrepancy between the discrete and continuum representations of the data, in turn allowing to access more favorable energy landscapes, thus improving the accuracy of the interpolator. Our approach improves performance in both interpolation and regression tasks, even in high-dimensional spaces. Unlike traditional methods, it does not require empirical parameter tuning, making it particularly effective for handling noisy data. We also show that thanks to its local nature, the method offers computational and memory efficiency advantages over Radial Basis Function interpolators, especially for large datasets.<\/jats:p>","DOI":"10.1088\/2632-2153\/adf93a","type":"journal-article","created":{"date-parts":[[2025,8,7]],"date-time":"2025-08-07T22:53:43Z","timestamp":1754607223000},"page":"035035","update-policy":"https:\/\/doi.org\/10.1088\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["A kinetic-based regularization method for data science applications"],"prefix":"10.1088","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9666-8392","authenticated-orcid":true,"given":"Abhisek","family":"Ganguly","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8367-6596","authenticated-orcid":true,"given":"Alessandro","family":"Gabbana","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8293-9529","authenticated-orcid":false,"given":"Vybhav","family":"Rao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3070-3079","authenticated-orcid":false,"given":"Sauro","family":"Succi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6938-233X","authenticated-orcid":false,"given":"Santosh","family":"Ansumali","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"266","published-online":{"date-parts":[[2025,8,19]]},"reference":[{"key":"mlstadf93abib1","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1016\/j.bdr.2015.04.001","article-title":"Efficient machine learning for big data: a review","volume":"2","author":"Al-Jarrah","year":"2015","journal-title":"Big Data Res."},{"key":"mlstadf93abib2","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"key":"mlstadf93abib3","doi-asserted-by":"publisher","first-page":"7776","DOI":"10.1109\/ACCESS.2017.2696365","article-title":"Machine learning with big data: challenges and approaches","volume":"5","author":"L\u2019Heureux","year":"2017","journal-title":"IEEE Access"},{"key":"mlstadf93abib4","doi-asserted-by":"publisher","first-page":"2554","DOI":"10.1073\/pnas.79.8.2554","article-title":"Neural networks and physical systems with emergent collective computational abilities","volume":"79","author":"Hopfield","year":"1982","journal-title":"Proc. Natl Acad. Sci."},{"key":"mlstadf93abib5","doi-asserted-by":"publisher","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with alphafold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"article-title":"LLaMA: open and efficient foundation language models","year":"2023","author":"Touvron","key":"mlstadf93abib6"},{"year":"2000","author":"Pearl","key":"mlstadf93abib7"},{"key":"mlstadf93abib8","doi-asserted-by":"publisher","first-page":"612","DOI":"10.1109\/JPROC.2021.3058954","article-title":"Toward causal representation learning","volume":"109","author":"Sch\u00f6lkopf","year":"2021","journal-title":"Proc. IEEE"},{"key":"mlstadf93abib9","doi-asserted-by":"publisher","first-page":"55","DOI":"10.2307\/1271436","article-title":"Ridge regression: biased estimation for nonorthogonal problems","volume":"12","author":"Hoerl","year":"1970","journal-title":"Technometrics"},{"key":"mlstadf93abib10","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J. R. Stat. Soc. B"},{"year":"2006","author":"Rasmussen","key":"mlstadf93abib11","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/3206.001.0001"},{"key":"mlstadf93abib12","doi-asserted-by":"publisher","first-page":"2318","DOI":"10.1109\/TKDE.2017.2720168","article-title":"Theory-guided data science: a new paradigm for scientific discovery from data","volume":"29","author":"Karpatne","year":"2017","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"mlstadf93abib13","doi-asserted-by":"publisher","first-page":"686","DOI":"10.1016\/j.jcp.2018.10.045","article-title":"Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations","volume":"378","author":"Raissi","year":"2019","journal-title":"J. Comput. Phys."},{"article-title":"Geometric deep learning: grids, groups, graphs, geodesics, and gauges","year":"2021","author":"Bronstein","key":"mlstadf93abib14"},{"article-title":"Integrating physics-based modeling with machine learning: a survey","year":"2020","author":"Willard","key":"mlstadf93abib15"},{"article-title":"Physics-informed machine learning: a survey on problems, methods and applications","year":"2022","author":"Hao","key":"mlstadf93abib16"},{"key":"mlstadf93abib17","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1023\/A:1019188517934","article-title":"General foundation of high dimensional model representations","volume":"25","author":"Rabitz","year":"1999","journal-title":"J. Math. Chem."},{"key":"mlstadf93abib18","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.126.098302","article-title":"Enforcing analytic constraints in neural networks emulating physical systems","volume":"126","author":"Beucler","year":"2021","journal-title":"Phys. Rev. Lett."},{"year":"2018","author":"Succi","key":"mlstadf93abib19","doi-asserted-by":"publisher","DOI":"10.1093\/oso\/9780199592357.001.0001)"},{"year":"1994","author":"Bishop","key":"mlstadf93abib20"},{"key":"mlstadf93abib21","doi-asserted-by":"publisher","first-page":"606","DOI":"10.1162\/neco.1995.7.3.606","article-title":"Regularization in the selection of radial basis function centers","volume":"7","author":"Orr","year":"1995","journal-title":"Neural Comput."},{"year":"1996","author":"Orr","key":"mlstadf93abib22"},{"year":"2016","author":"Goodfellow","key":"mlstadf93abib23"},{"key":"mlstadf93abib24","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1103\/PhysRevLett.81.6","article-title":"Maximum entropy principle for lattice kinetic equations","volume":"81","author":"Karlin","year":"1998","journal-title":"Phys. Rev. Lett."},{"key":"mlstadf93abib25","doi-asserted-by":"publisher","first-page":"798","DOI":"10.1209\/epl\/i2003-00496-6","article-title":"Minimal entropic kinetic models for hydrodynamics","volume":"63","author":"Ansumali","year":"2003","journal-title":"EuroPhys. Lett."},{"year":"1999","author":"Nocedal","key":"mlstadf93abib26","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-40065-5)"},{"key":"mlstadf93abib27","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum likelihood from incomplete data via the em algorithm","volume":"39","author":"Dempster","year":"1977","journal-title":"J. R. Stat. Soc. B"},{"key":"mlstadf93abib28","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1090\/S0025-5718-1982-0637296-4","article-title":"Scattered data interpolation: tests of some methods","volume":"38","author":"Franke","year":"1982","journal-title":"Math. Comput."},{"key":"mlstadf93abib29","doi-asserted-by":"publisher","first-page":"1469","DOI":"10.1016\/j.ijheatfluidflow.2008.05.002","article-title":"Lattice Boltzmann model for melting with natural convection","volume":"29","author":"Huber","year":"2008","journal-title":"Int. J. Heat Fluid Flow"},{"key":"mlstadf93abib30","doi-asserted-by":"publisher","DOI":"10.1016\/j.jocs.2023.101977","article-title":"Probabilistic optimal interpolation for data assimilation between machine learning model predictions and real time observations","volume":"67","author":"Wei","year":"2023","journal-title":"J. Comput. Sci."},{"key":"mlstadf93abib31","doi-asserted-by":"publisher","first-page":"5418","DOI":"10.1109\/TIM.2020.2966310","article-title":"Window selection of the Savitzky\u2014Golay filters for signal recovery from noisy measurements","volume":"69","author":"Sadeghi","year":"2020","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"mlstadf93abib32","first-page":"299","article-title":"Optimal delaunay triangulations","volume":"22","author":"Chen","year":"2004","journal-title":"J. Comput. Math."},{"key":"mlstadf93abib33","first-page":"pp 515","article-title":"Finite element stiffness matrices for analysis of plates in bending","volume":"vol 1","author":"Clough","year":"1965"},{"article-title":"Climate data store (cds)","year":"2024","author":"Copernicus Climate Change Service (C3S)","key":"mlstadf93abib34"},{"key":"mlstadf93abib35","doi-asserted-by":"publisher","first-page":"4810","DOI":"10.1118\/1.3213517","article-title":"Noise injection for training artificial neural networks: a comparison with weight decay and early stopping","volume":"36","author":"Zur","year":"2009","journal-title":"Med. Phys."},{"key":"mlstadf93abib36","doi-asserted-by":"publisher","first-page":"916","DOI":"10.1016\/j.egyr.2022.05.265","article-title":"Noise-intensification data augmented machine learning for day-ahead wind power forecast","volume":"8","author":"Chen","year":"2022","journal-title":"Energy Rep."},{"key":"mlstadf93abib37","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1016\/S0377-0427(02)00869-5","article-title":"Constructing smoothing functions in smoothed particle hydrodynamics with applications","volume":"155","author":"Liu","year":"2003","journal-title":"J. Comput. Appl. Math."},{"key":"mlstadf93abib38","doi-asserted-by":"publisher","DOI":"10.1016\/j.compfluid.2023.106122","article-title":"Study of the convergence of the meshless lattice Boltzmann method in Taylor\u2013Green, annular channel and a porous medium flows","volume":"269","author":"Strzelczyk","year":"2024","journal-title":"Comput. Fluids"},{"key":"mlstadf93abib39","doi-asserted-by":"publisher","first-page":"325","DOI":"10.1002\/cpa.3160020402","article-title":"Note on n-dimensional hermite polynomials","volume":"2","author":"Grad","year":"1949","journal-title":"Commun. Pure Appl. Math."},{"article-title":"Additive gaussian processes","year":"2011","author":"Duvenaud","key":"mlstadf93abib40"},{"key":"mlstadf93abib41","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1007\/s10910-022-01407-x","article-title":"Optimization of hyperparameters of Gaussian process regression with the help of a low-order high-dimensional model representation: application to a potential energy surface","volume":"61","author":"Manzhos","year":"2023","journal-title":"J. Math. Chem."}],"container-title":["Machine Learning: Science and Technology"],"original-title":[],"link":[{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adf93a","content-type":"text\/html","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adf93a\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adf93a","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adf93a\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adf93a\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adf93a\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adf93a\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"similarity-checking"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adf93a\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,19]],"date-time":"2025-08-19T09:29:30Z","timestamp":1755595770000},"score":1,"resource":{"primary":{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adf93a"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,19]]},"references-count":41,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2025,8,19]]},"published-print":{"date-parts":[[2025,9,30]]}},"URL":"https:\/\/doi.org\/10.1088\/2632-2153\/adf93a","relation":{},"ISSN":["2632-2153"],"issn-type":[{"type":"electronic","value":"2632-2153"}],"subject":[],"published":{"date-parts":[[2025,8,19]]},"assertion":[{"value":"A kinetic-based regularization method for data science applications","name":"article_title","label":"Article Title"},{"value":"Machine Learning: Science and Technology","name":"journal_title","label":"Journal Title"},{"value":"paper","name":"article_type","label":"Article Type"},{"value":"\u00a9 2025 The Author(s). Published by IOP Publishing Ltd","name":"copyright_information","label":"Copyright Information"},{"value":"2025-03-06","name":"date_received","label":"Date Received","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2025-08-07","name":"date_accepted","label":"Date Accepted","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2025-08-19","name":"date_epub","label":"Online publication date","group":{"name":"publication_dates","label":"Publication dates"}}]}}