{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T01:25:38Z","timestamp":1772933138907,"version":"3.50.1"},"reference-count":31,"publisher":"IOP Publishing","issue":"1","license":[{"start":{"date-parts":[[2025,2,20]],"date-time":"2025-02-20T00:00:00Z","timestamp":1740009600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2025,2,20]],"date-time":"2025-02-20T00:00:00Z","timestamp":1740009600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/iopscience.iop.org\/info\/page\/text-and-data-mining"}],"funder":[{"DOI":"10.13039\/501100018809","name":"Berlin Center for Machine Learning","doi-asserted-by":"crossref","award":["033KI129"],"award-info":[{"award-number":["033KI129"]}],"id":[{"id":"10.13039\/501100018809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"crossref","award":["287321140"],"award-info":[{"award-number":["287321140"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["iopscience.iop.org"],"crossmark-restriction":false},"short-container-title":["Mach. Learn.: Sci. Technol."],"published-print":{"date-parts":[[2025,3,31]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Machine learning has become increasingly important in materials design, yet traditional single-task learning (STL) models fail to fully exploit the potential of available data in scenarios involving multiple targets and incomplete datasets. While STL models overlook the inherent correlations between target properties, this study showcases how multi-task learning (MTL) effectively leverages these correlations. 
Therefore, the performance of MTL methods compared to STL is evaluated across five datasets, covering twelve prediction tasks and incorporating different types and levels of data sparsity. Our findings reveal that MTL significantly outperforms STL, particularly in sparse data scenarios, with up to 15% prediction improvements across all tasks. Moreover, MTL methods utilizing regressor chains with automated machine learning tools achieve superior performance compared to those based on neural networks, with minimal training effort required. This work advances data efficiency in data-driven materials design, establishing MTL as a potent tool for simultaneous learning and predicting multiple material properties.<\/jats:p>","DOI":"10.1088\/2632-2153\/adae53","type":"journal-article","created":{"date-parts":[[2025,1,24]],"date-time":"2025-01-24T22:56:31Z","timestamp":1737759391000},"page":"015045","update-policy":"https:\/\/doi.org\/10.1088\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Leveraging multi-task learning regressor chains for small and sparse tabular data in materials design"],"prefix":"10.1088","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9998-1907","authenticated-orcid":true,"given":"Felix","family":"Conrad","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7540-4235","authenticated-orcid":false,"given":"Hajo","family":"Wiemer","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9258-5178","authenticated-orcid":false,"given":"Steffen","family":"Ihlenfeldt","sequence":"additional","affiliation":[]}],"member":"266","published-online":{"date-parts":[[2025,2,20]]},"reference":[{"key":"mlstadae53bib1","doi-asserted-by":"publisher","first-page":"338","DOI":"10.1002\/inf2.12028","article-title":"Machine learning in materials 
science","volume":"1","author":"Wei","year":"2019","journal-title":"InfoMat"},{"key":"mlstadae53bib2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41524-023-01000-z","article-title":"Small data machine learning in materials science","volume":"9","author":"Xu","year":"2023","journal-title":"npj Comput. Mater."},{"key":"mlstadae53bib3","doi-asserted-by":"publisher","DOI":"10.1016\/j.patter.2021.100238","article-title":"Polymer informatics with multi-task learning","volume":"2","author":"Kuenneth","year":"2021","journal-title":"Patterns"},{"key":"mlstadae53bib4","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-022-23327-1","article-title":"Benchmarking AutoML for regression tasks on small tabular data in materials design","volume":"12","author":"Conrad","year":"2022","journal-title":"Sci. Rep."},{"key":"mlstadae53bib5","doi-asserted-by":"publisher","DOI":"10.1016\/j.compstruct.2021.113917","article-title":"Data-driven machine learning approach for exploring and assessing mechanical properties of carbon nanotube-reinforced cement composites","volume":"267","author":"Huang","year":"2021","journal-title":"Compos. Struct."},{"key":"mlstadae53bib6","doi-asserted-by":"publisher","DOI":"10.1088\/2515-7639\/ab077b","article-title":"Simultaneous learning of several materials properties from incomplete databases with multi-task SISSO","volume":"2","author":"Ouyang","year":"2019","journal-title":"J. Phys. Mater."},{"key":"mlstadae53bib7","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41524-022-00929-x","article-title":"Towards overcoming data scarcity in materials science: unifying models and datasets with a mixture of experts framework","volume":"8","author":"Chang","year":"2022","journal-title":"npj Comput. 
Mater."},{"key":"mlstadae53bib8","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3220007","article-title":"Modeling task relationships in multi-task learning with multi-gate mixture-of-experts","volume":"1930\u20139","author":"Ma","year":"2018"},{"key":"mlstadae53bib9","doi-asserted-by":"publisher","first-page":"4748","DOI":"10.3390\/app9224748","article-title":"Multi-task learning for multi-dimensional regression: application to luminescence sensing","volume":"9","author":"Michelucci","year":"2019","journal-title":"Appl. Sci."},{"key":"mlstadae53bib10","article-title":"One model to learn them all","author":"Kaiser"},{"key":"mlstadae53bib11","doi-asserted-by":"publisher","DOI":"10.1016\/j.jag.2021.102657","article-title":"Multi-target regressor chains with repetitive permutation scheme for characterization of built environments with remote sensing","volume":"106","author":"Gei\u00df","year":"2022","journal-title":"Int. J. Appl. Earth Obs. Geoinf."},{"key":"mlstadae53bib12","doi-asserted-by":"publisher","DOI":"10.1088\/2632-2153\/ad513a","article-title":"Symbolic regression as a feature engineering method for machine and deep learning regression tasks","volume":"5","author":"Shmuel","year":"2024","journal-title":"Mach. Learn. Sci. Technol."},{"key":"mlstadae53bib13","doi-asserted-by":"publisher","DOI":"10.1016\/j.compstruc.2022.106774","article-title":"A novel self-adversarial training scheme for enhanced robustness of inelastic constitutive descriptions by neural networks","volume":"265","author":"St\u00f6cker","year":"2022","journal-title":"Comput. Struct."},{"key":"mlstadae53bib14","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41524-020-00406-3","article-title":"Benchmarking materials property prediction methods: the matbench test set and automatminer reference algorithm","volume":"6","author":"Dunn","year":"2020","journal-title":"npj Comput. 
Mater."},{"key":"mlstadae53bib15","doi-asserted-by":"publisher","DOI":"10.1063\/5.0171540","article-title":"Gradient boosted and statistical feature selection workflow for materials property predictions","volume":"159","author":"Jung","year":"2023","journal-title":"J. Chem. Phys."},{"key":"mlstadae53bib16","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41524-022-00734-6","article-title":"Recent advances and applications of deep learning methods in materials science","volume":"8","author":"Choudhary","year":"2022","journal-title":"npj Comput. Mater."},{"key":"mlstadae53bib17","doi-asserted-by":"publisher","first-page":"2385","DOI":"10.1145\/3292500.3330703","article-title":"Irnet: a general purpose deep residual regression framework for materials discovery","author":"Jha","year":"2019"},{"key":"mlstadae53bib18","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1016\/j.commatsci.2018.12.056","article-title":"A predicting model for properties of steel using the industrial big data based on machine learning","volume":"160","author":"Guo","year":"2019","journal-title":"Comput. Mater. Sci."},{"key":"mlstadae53bib19","doi-asserted-by":"publisher","first-page":"2873","DOI":"10.1007\/s11661-021-06279-5","article-title":"Prediction of mechanical properties of wrought aluminium alloys using feature engineering assisted machine learning approach","volume":"52","author":"Hu","year":"2021","journal-title":"Metall. Mater. Trans. A"},{"key":"mlstadae53bib20","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1007\/s10845-012-0682-1","article-title":"Bead geometry prediction for robotic GMAW-based rapid manufacturing through a neural network and a second-order regression analysis","volume":"25","author":"Xiong","year":"2014","journal-title":"J. Intell. 
Manuf."},{"key":"mlstadae53bib21","doi-asserted-by":"publisher","DOI":"10.1016\/j.compstruct.2021.114328","article-title":"Machine learning and materials informatics approaches for evaluating the interfacial properties of fiber-reinforced composites","volume":"273","author":"Yin","year":"2021","journal-title":"Compos. Struct."},{"key":"mlstadae53bib22","doi-asserted-by":"publisher","first-page":"2962","DOI":"10.5555\/2969442.2969547","article-title":"Efficient and robust automated machine learning","author":"Feurer","year":"2015"},{"key":"mlstadae53bib23","article-title":"Mljar: state-of-the-art automated machine learning framework for tabular data. version 0.10.3","author":"P\u0142o\u0144ska","year":"2021"},{"key":"mlstadae53bib24","article-title":"H2O AutoML: scalable automatic machine learning","author":"LeDell","year":"2020"},{"key":"mlstadae53bib25","doi-asserted-by":"publisher","first-page":"785","DOI":"10.1145\/2939672.2939785","article-title":"XGBoost: a scalable tree boosting system","author":"Chen","year":"2016"},{"key":"mlstadae53bib26","doi-asserted-by":"publisher","first-page":"1937","DOI":"10.1007\/s10462-020-09896-5","article-title":"A comparative analysis of gradient boosting algorithms","volume":"54","author":"Bent\u00e9jac","year":"2021","journal-title":"Artif. Intell. 
Rev."},{"key":"mlstadae53bib27","article-title":"A comprehensive benchmark of machine and deep learning across diverse tabular datasets","author":"Shmuel","year":"2024"},{"key":"mlstadae53bib28","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-18318-8_47","article-title":"Impact of data sampling on performance and robustness of machine learning models in production engineering","volume":"vol 463\u201372","author":"Conrad","year":"2023"},{"key":"mlstadae53bib29","article-title":"The effect of random seeds for data splitting on recommendation accuracy","author":"Wegmeth","year":"2023"},{"key":"mlstadae53bib30","doi-asserted-by":"publisher","first-page":"917","DOI":"10.1007\/s10618-019-00619-1","article-title":"Deep learning for time series classification: a review","volume":"33","author":"Ismail Fawaz","year":"2019","journal-title":"Data Min. Knowl. Discovery"},{"key":"mlstadae53bib31","first-page":"pp 507","article-title":"Why do tree-based models still outperform deep learning on typical tabular data?","volume":"vol 35","author":"Grinsztajn","year":"2022"}],"container-title":["Machine Learning: Science and 
Technology"],"original-title":[],"link":[{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adae53","content-type":"text\/html","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adae53\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adae53","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adae53\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adae53\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adae53\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adae53\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"similarity-checking"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adae53\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,20]],"date-time":"2025-02-20T05:51:10Z","timestamp":1740030670000},"score":1,"resource":{"primary":{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/adae53"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,20]]},"references-count":31,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,2,20]]},"published-print":{"date-parts":[[2025,3,31]]}},"URL":"https:\/\/doi.org\/10.1088\/2632-2153\/adae53","relation":{},"ISSN":["2632-2153"],"issn-type":[{"value":"2632-2153","type":"el
ectronic"}],"subject":[],"published":{"date-parts":[[2025,2,20]]},"assertion":[{"value":"Leveraging multi-task learning regressor chains for small and sparse tabular data in materials design","name":"article_title","label":"Article Title"},{"value":"Machine Learning: Science and Technology","name":"journal_title","label":"Journal Title"},{"value":"paper","name":"article_type","label":"Article Type"},{"value":"\u00a9 2025 The Author(s). Published by IOP Publishing Ltd","name":"copyright_information","label":"Copyright Information"},{"value":"2024-10-25","name":"date_received","label":"Date Received","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2025-01-24","name":"date_accepted","label":"Date Accepted","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2025-02-20","name":"date_epub","label":"Online publication date","group":{"name":"publication_dates","label":"Publication dates"}}]}}