{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,5]],"date-time":"2025-04-05T04:07:12Z","timestamp":1743826032175,"version":"3.40.3"},"publisher-location":"Cham","reference-count":41,"publisher":"Springer Nature Switzerland","isbn-type":[{"value":"9783031787355","type":"print"},{"value":"9783031787362","type":"electronic"}],"license":[{"start":{"date-parts":[[2025,1,1]],"date-time":"2025-01-01T00:00:00Z","timestamp":1735689600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"},{"start":{"date-parts":[[2025,1,1]],"date-time":"2025-01-01T00:00:00Z","timestamp":1735689600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025]]},"DOI":"10.1007\/978-3-031-78736-2_6","type":"book-chapter","created":{"date-parts":[[2025,4,4]],"date-time":"2025-04-04T10:07:39Z","timestamp":1743761259000},"page":"117-138","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Optimising Materials Properties with Minimal Data: Lessons from Vanadium Catalyst Modelling"],"prefix":"10.1007","author":[{"given":"Jos\u00e9","family":"Ferraz-Caetano","sequence":"first","affiliation":[]},{"given":"Filipe","family":"Teixeira","sequence":"additional","affiliation":[]},{"given":"M. Nat\u00e1lia D. S.","family":"Cordeiro","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,4,3]]},"reference":[{"issue":"1","key":"6_CR1","doi-asserted-by":"publisher","first-page":"2312848","DOI":"10.1080\/17518253.2024.2312848","volume":"17","author":"K Venkatesan","year":"2024","unstructured":"Venkatesan K, Sundarababu J, Anandan SS (2024) The recent developments of green and sustainable chemistry in multidimensional way: current trends and challenges. Green Chem Lett Rev 17(1):2312848. https:\/\/doi.org\/10.1080\/17518253.2024.2312848","journal-title":"Green Chem Lett Rev"},{"key":"6_CR2","doi-asserted-by":"publisher","unstructured":"Wei J, Chu X, Sun X-Y, Xu K, Deng H-X, Chen J, Wei Z, Lei M (2019) Machine learning in materials science. InfoMat 1(3):338\u2013358. https:\/\/doi.org\/10.1002\/inf2.12028","DOI":"10.1002\/inf2.12028"},{"issue":"6","key":"6_CR3","doi-asserted-by":"publisher","first-page":"3089","DOI":"10.1021\/acs.chemrev.2c00798","volume":"123","author":"CJ Taylor","year":"2023","unstructured":"Taylor CJ, Pomberger A, Felton KC, Grainger R, Barecka M, Chamberlain TW, Bourne RA, Johnson CN, Lapkin AA (2023) A brief introduction to chemical reaction optimization. Chem Rev 123(6):3089\u20133126. https:\/\/doi.org\/10.1021\/acs.chemrev.2c00798","journal-title":"Chem Rev"},{"issue":"24","key":"6_CR4","doi-asserted-by":"publisher","first-page":"8226","DOI":"10.1039\/C4CS00210E","volume":"43","author":"E Roduner","year":"2014","unstructured":"Roduner E (2014) Understanding catalysis. Chem Soc Rev 43(24):8226\u20138239. https:\/\/doi.org\/10.1039\/C4CS00210E","journal-title":"Chem Soc Rev"},{"issue":"8","key":"6_CR5","doi-asserted-by":"publisher","first-page":"596","DOI":"10.1557\/mrs.2016.164","volume":"41","author":"SR Kalidindi","year":"2016","unstructured":"Kalidindi SR, Brough DB, Li S, Cecen A, Blekh AL, Congo FYP, Campbell C (2016) Role of materials data science and informatics in accelerated materials innovation. MRS Bull 41(8):596\u2013602. https:\/\/doi.org\/10.1557\/mrs.2016.164","journal-title":"MRS Bull"},{"issue":"4","key":"6_CR6","doi-asserted-by":"publisher","DOI":"10.1088\/2515-7639\/ab291e","volume":"2","author":"SL Brunton","year":"2019","unstructured":"Brunton SL, Kutz JN (2019) Methods for data-driven multiscale model discovery for materials. J Phys Mater 2(4):044002. https:\/\/doi.org\/10.1088\/2515-7639\/ab291e","journal-title":"J Phys Mater"},{"issue":"2","key":"6_CR7","doi-asserted-by":"publisher","first-page":"293","DOI":"10.1093\/nsr\/nwt032","volume":"1","author":"J Fan","year":"2014","unstructured":"Fan J, Han F, Liu H (2014) Challenges of big data analysis. Natl Sci Rev 1(2):293\u2013314. https:\/\/doi.org\/10.1093\/nsr\/nwt032","journal-title":"Natl Sci Rev"},{"issue":"5","key":"6_CR8","doi-asserted-by":"publisher","DOI":"10.1063\/1.4946894","volume":"4","author":"A Agrawal","year":"2016","unstructured":"Agrawal A, Choudhary A (2016) Perspective: materials informatics and big data: realization of the \u201cfourth paradigm\u201d of science in materials science. APL Mater 4(5):053208. https:\/\/doi.org\/10.1063\/1.4946894","journal-title":"APL Mater"},{"issue":"13","key":"6_CR9","doi-asserted-by":"publisher","first-page":"8736","DOI":"10.1021\/acs.chemrev.3c00189","volume":"123","author":"B Dou","year":"2023","unstructured":"Dou B, Zhu Z, Merkurjev E, Ke L, Chen L, Jiang J, Zhu Y, Liu J, Zhang B, Wei G-W (2023) Machine learning methods for small data challenges in molecular science. Chem Rev 123(13):8736\u20138780. https:\/\/doi.org\/10.1021\/acs.chemrev.3c00189","journal-title":"Chem Rev"},{"issue":"26","key":"6_CR10","doi-asserted-by":"publisher","DOI":"10.1002\/anie.202219070","volume":"62","author":"H Shalit Peleg","year":"2023","unstructured":"Shalit Peleg H, Milo A (2023) Small data can play a big role in chemical discovery. Angew Chem Int Ed 62(26):e202219070. https:\/\/doi.org\/10.1002\/anie.202219070","journal-title":"Angew Chem Int Ed"},{"issue":"1","key":"6_CR11","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1038\/s41524-023-01000-z","volume":"9","author":"P Xu","year":"2023","unstructured":"Xu P, Ji X, Li M, Lu W (2023) Small data machine learning in materials science. NPJ Comput Mater 9(1):42. https:\/\/doi.org\/10.1038\/s41524-023-01000-z","journal-title":"NPJ Comput Mater"},{"key":"6_CR12","doi-asserted-by":"publisher","DOI":"10.1016\/j.ces.2022.117469","volume":"252","author":"A Thebelt","year":"2022","unstructured":"Thebelt A, Wiebe J, Kronqvist J, Tsay C, Misener R (2022) Maximizing information from chemical engineering data sets: applications to machine learning. Chem Eng Sci 252:117469. https:\/\/doi.org\/10.1016\/j.ces.2022.117469","journal-title":"Chem Eng Sci"},{"issue":"2","key":"6_CR13","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/1168\/2\/022022","volume":"1168","author":"X Ying","year":"2019","unstructured":"Ying X (2019) An overview of overfitting and its solutions. J Phys Conf Ser 1168(2):022022. https:\/\/doi.org\/10.1088\/1742-6596\/1168\/2\/022022","journal-title":"J Phys Conf Ser"},{"issue":"13","key":"6_CR14","doi-asserted-by":"publisher","first-page":"7886","DOI":"10.1021\/acscatal.2c01741","volume":"12","author":"DM Lustosa","year":"2022","unstructured":"Lustosa DM, Milo A (2022) Mechanistic inference from statistical models at different data-size regimes. ACS Catal 12(13):7886\u20137906. https:\/\/doi.org\/10.1021\/acscatal.2c01741","journal-title":"ACS Catal"},{"issue":"5","key":"6_CR15","doi-asserted-by":"publisher","first-page":"1446","DOI":"10.1039\/D1SC06515G","volume":"13","author":"M Wen","year":"2022","unstructured":"Wen M, Blau SM, Xie X, Dwaraknath S, Persson KA (2022) Improving machine learning performance on small chemical reaction data with unsupervised contrastive pretraining. Chem Sci 13(5):1446\u20131458. https:\/\/doi.org\/10.1039\/D1SC06515G","journal-title":"Chem Sci"},{"issue":"1","key":"6_CR16","doi-asserted-by":"publisher","first-page":"5505","DOI":"10.1038\/s41467-020-19267-x","volume":"11","author":"S Stocker","year":"2020","unstructured":"Stocker S, Cs\u00e1nyi G, Reuter K, Margraf JT (2020) Machine learning in chemical reaction space. Nat Commun 11(1):5505. https:\/\/doi.org\/10.1038\/s41467-020-19267-x","journal-title":"Nat Commun"},{"key":"6_CR17","doi-asserted-by":"publisher","unstructured":"Takahashi K, Takahashi L (2022) Data in materials and catalysts informatics. In: Machine learning in materials informatics: methods and applications, vol. 1416. American Chemical Society, pp 239\u2013246. https:\/\/doi.org\/10.1021\/bk-2022-1416.ch010","DOI":"10.1021\/bk-2022-1416.ch010"},{"issue":"7","key":"6_CR18","doi-asserted-by":"publisher","first-page":"1415","DOI":"10.1039\/D0QO01636E","volume":"8","author":"Y Zhang","year":"2021","unstructured":"Zhang Y, Wang L, Wang X, Zhang C, Ge J, Tang J, Su A, Duan H (2021) Data augmentation and transfer learning strategies for reaction prediction in low chemical data regimes. Organ Chem Front 8(7):1415\u20131423. https:\/\/doi.org\/10.1039\/D0QO01636E","journal-title":"Organ Chem Front"},{"issue":"1","key":"6_CR19","doi-asserted-by":"publisher","first-page":"4874","DOI":"10.1038\/s41467-020-18671-7","volume":"11","author":"G Pesciullesi","year":"2020","unstructured":"Pesciullesi G, Schwaller P, Laino T, Reymond J-L (2020) Transfer learning enables the molecular transformer to predict regio- and stereoselective reactions on carbohydrates. Nat Commun 11(1):4874. https:\/\/doi.org\/10.1038\/s41467-020-18671-7","journal-title":"Nat Commun"},{"issue":"12","key":"6_CR20","doi-asserted-by":"publisher","first-page":"5592","DOI":"10.1021\/acsapm.0c00921","volume":"2","author":"AL Liu","year":"2020","unstructured":"Liu AL, Venkatesh R, McBride M, Reichmanis E, Meredith JC, Grover MA (2020) Small data machine learning: classification and prediction of poly(ethylene terephthalate) stabilizers using molecular descriptors. ACS Appl Polym Mater 2(12):5592\u20135601. https:\/\/doi.org\/10.1021\/acsapm.0c00921","journal-title":"ACS Appl Polym Mater"},{"issue":"1","key":"6_CR21","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1038\/s42004-023-01086-y","volume":"7","author":"T Taniike","year":"2024","unstructured":"Taniike T, Fujiwara A, Nakanowatari S, Garc\u00eda-Escobar F, Takahashi K (2024) Automatic feature engineering for catalyst design using small data without prior knowledge of target catalysis. Commun Chem 7(1):11. https:\/\/doi.org\/10.1038\/s42004-023-01086-y","journal-title":"Commun Chem"},{"issue":"16","key":"6_CR22","doi-asserted-by":"publisher","first-page":"2222","DOI":"10.1039\/D2CC05938J","volume":"59","author":"K Takahashi","year":"2023","unstructured":"Takahashi K, Ohyama J, Nishimura S, Fujima J, Takahashi L, Uno T, Taniike T (2023) Catalysts informatics: paradigm shift towards data-driven catalyst design. Chem Commun 59(16):2222\u20132238. https:\/\/doi.org\/10.1039\/D2CC05938J","journal-title":"Chem Commun"},{"issue":"10","key":"6_CR23","doi-asserted-by":"publisher","first-page":"2410","DOI":"10.1246\/bcsj.20210253","volume":"94","author":"Y Oaki","year":"2021","unstructured":"Oaki Y, Igarashi Y (2021) Materials informatics for 2D materials combined with sparse modeling and chemical perspective: toward small-data-driven chemistry and materials science. Bull Chem Soc Jpn 94(10):2410\u20132422. https:\/\/doi.org\/10.1246\/bcsj.20210253","journal-title":"Bull Chem Soc Jpn"},{"key":"6_CR24","doi-asserted-by":"publisher","unstructured":"Kalthoff SF, Sandfort F, K\u00fchnemund M, Sch\u00e4fer FR, Kuchen H, Glorius F (2022) Machine learning for chemical reactivity: the importance of failed experiments. Angew Chem Int Ed 61(29):e202204647. https:\/\/doi.org\/10.1002\/anie.202204647","DOI":"10.1002\/anie.202204647"},{"issue":"2","key":"6_CR25","doi-asserted-by":"publisher","first-page":"108","DOI":"10.1038\/s41929-023-00920-9","volume":"6","author":"T Taniike","year":"2023","unstructured":"Taniike T, Takahashi K (2023) The value of negative results in data-driven catalysis research. Nat Catal 6(2):108\u2013111. https:\/\/doi.org\/10.1038\/s41929-023-00920-9","journal-title":"Nat Catal"},{"key":"6_CR26","doi-asserted-by":"publisher","unstructured":"Weissman SA, Anderson NG (2015) Design of Experiments (DoE) and process optimization. a review of recent publications. Organ Proc Res Dev 19(11):1605\u20131633. https:\/\/doi.org\/10.1021\/op500169m","DOI":"10.1021\/op500169m"},{"issue":"7844","key":"6_CR27","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1038\/s41586-021-03213-y","volume":"590","author":"BJ Shields","year":"2021","unstructured":"Shields BJ, Stevens J, Li J, Parasram M, Damani F, Alvarado JIM, Janey JM, Adams RP, Doyle AG (2021) Bayesian reaction optimization as a tool for chemical synthesis. Nature 590(7844):89\u201396. https:\/\/doi.org\/10.1038\/s41586-021-03213-y","journal-title":"Nature"},{"issue":"11","key":"6_CR28","doi-asserted-by":"publisher","DOI":"10.1016\/j.xcrp.2020.100247","volume":"1","author":"D Reker","year":"2020","unstructured":"Reker D, Hoyt EA, Bernardes GJL, Rodrigues T (2020) Adaptive optimization of chemical reactions with minimal experimental information. Cell Rep Phys Sci 1(11):100247. https:\/\/doi.org\/10.1016\/j.xcrp.2020.100247","journal-title":"Cell Rep Phys Sci"},{"issue":"22","key":"6_CR29","doi-asserted-by":"publisher","first-page":"2301020","DOI":"10.1002\/advs.202301020","volume":"10","author":"L-H Mou","year":"2023","unstructured":"Mou L-H, Han T, Smith PES, Sharman E, Jiang J (2023) Machine learning descriptors for data-driven catalysis study. Adv Sci 10(22):2301020. https:\/\/doi.org\/10.1002\/advs.202301020","journal-title":"Adv Sci"},{"key":"6_CR30","doi-asserted-by":"publisher","unstructured":"Ferraz-Caetano J, Teixeira F, Cordeiro MN (2023) Systematic development of vanadium catalysts for sustainable epoxidation of small alkenes and allylic alcohols. Int J Mol Sci 24(15). https:\/\/doi.org\/10.3390\/ijms241512299","DOI":"10.3390\/ijms241512299"},{"issue":"19","key":"6_CR31","doi-asserted-by":"publisher","first-page":"2165","DOI":"10.1016\/j.ccr.2011.03.006","volume":"255","author":"V Conte","year":"2011","unstructured":"Conte V, Coletti A, Floris B, Licini G, Zonta C (2011) Mechanistic aspects of vanadium catalysed oxidations with peroxides. Coord Chem Rev 255(19):2165\u20132177. https:\/\/doi.org\/10.1016\/j.ccr.2011.03.006","journal-title":"Coord Chem Rev"},{"issue":"4","key":"6_CR32","doi-asserted-by":"publisher","first-page":"2128","DOI":"10.1021\/acs.chemrev.8b00245","volume":"119","author":"RR Langeslay","year":"2019","unstructured":"Langeslay RR, Kaphan DM, Marshall CL, Stair PC, Sattelberger AP, Delferro M (2019) Catalytic applications of vanadium: a mechanistic perspective. Chem Rev 119(4):2128\u20132191. https:\/\/doi.org\/10.1021\/acs.chemrev.8b00245","journal-title":"Chem Rev"},{"issue":"1","key":"6_CR33","doi-asserted-by":"publisher","first-page":"295","DOI":"10.1021\/ja00756a062","volume":"94","author":"KB Sharpless","year":"1972","unstructured":"Sharpless KB, Townsend JM, Williams DR (1972) Mechanism of epoxidation of olefins by covalent peroxides of molybdenum(VI). J Am Chem Soc 94(1):295\u2013296. https:\/\/doi.org\/10.1021\/ja00756a062","journal-title":"J Am Chem Soc"},{"issue":"2","key":"6_CR34","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1007\/s00163-002-0026-9","volume":"14","author":"DD Frey","year":"2003","unstructured":"Frey DD, Engelhardt F, Greitzer EM (2003) A role for \u201cone-factor-at-a-time\u201d experimentation in parameter design. Res Eng Design 14(2):65\u201374. https:\/\/doi.org\/10.1007\/s00163-002-0026-9","journal-title":"Res Eng Design"},{"issue":"1","key":"6_CR35","doi-asserted-by":"publisher","first-page":"112","DOI":"10.1038\/s42004-021-00550-x","volume":"4","author":"M Christensen","year":"2021","unstructured":"Christensen M, Yunker LPE, Adedeji F, H\u00e4se F, Roch LM, Gensch T, dos Passos Gomes G, Zepel T, Sigman MS, Aspuru-Guzik A, Hein JE (2021) Data-science driven autonomous process optimization. Commun Chem 4(1):112. https:\/\/doi.org\/10.1038\/s42004-021-00550-x","journal-title":"Commun Chem"},{"key":"6_CR36","doi-asserted-by":"publisher","unstructured":"Cova TFGG, Pais AACC (2019) Deep learning for deep chemistry: optimizing the prediction of chemical patterns [review]. Front Chem 7. https:\/\/doi.org\/10.3389\/fchem.2019.00809","DOI":"10.3389\/fchem.2019.00809"},{"issue":"4","key":"6_CR37","doi-asserted-by":"publisher","first-page":"463","DOI":"10.1007\/s10708-014-9601-7","volume":"80","author":"R Kitchin","year":"2015","unstructured":"Kitchin R, Lauriault TP (2015) Small data in the era of big data. GeoJournal 80(4):463\u2013475. https:\/\/doi.org\/10.1007\/s10708-014-9601-7","journal-title":"GeoJournal"},{"issue":"12","key":"6_CR38","doi-asserted-by":"publisher","first-page":"5097","DOI":"10.1039\/D3NJ05784D","volume":"48","author":"J Ferraz-Caetano","year":"2024","unstructured":"Ferraz-Caetano J, Teixeira F, Cordeiro MNDS (2024) Navigating epoxidation complexity: building a data science toolbox to design vanadium catalysts. New J Chem 48(12):5097\u20135100. https:\/\/doi.org\/10.1039\/D3NJ05784D","journal-title":"New J Chem"},{"key":"6_CR39","unstructured":"Landrum G (2023) RDKit: Open-source cheminformatics 2022_09_4 (Q3 2022). http:\/\/www.rdkit.org\/. Accessed 18 Jan 2023"},{"issue":"7","key":"6_CR40","doi-asserted-by":"publisher","first-page":"2250","DOI":"10.1021\/acs.jcim.3c00544","volume":"64","author":"J Ferraz-Caetano","year":"2024","unstructured":"Ferraz-Caetano J, Teixeira F, Cordeiro MNDS (2024) Explainable supervised machine learning model to predict solvation gibbs energy. J Chem Inf Model 64(7):2250\u20132262. https:\/\/doi.org\/10.1021\/acs.jcim.3c00544","journal-title":"J Chem Inf Model"},{"key":"6_CR41","doi-asserted-by":"publisher","DOI":"10.1016\/j.chemosphere.2024.142257","volume":"359","author":"J Ferraz-Caetano","year":"2024","unstructured":"Ferraz-Caetano J, Teixeira F, Cordeiro MNDS (2024) Data-driven, explainable machine learning model for predicting volatile organic compounds\u2019 standard vaporization enthalpy. Chemosphere 359:142257. https:\/\/doi.org\/10.1016\/j.chemosphere.2024.142257","journal-title":"Chemosphere"}],"container-title":["Challenges and Advances in Computational Chemistry and Physics","Materials Informatics I"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-031-78736-2_6","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,4]],"date-time":"2025-04-04T10:10:08Z","timestamp":1743761408000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-031-78736-2_6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025]]},"ISBN":["9783031787355","9783031787362"],"references-count":41,"URL":"https:\/\/doi.org\/10.1007\/978-3-031-78736-2_6","relation":{},"ISSN":["2542-4491","2542-4483"],"issn-type":[{"value":"2542-4491","type":"print"},{"value":"2542-4483","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025]]},"assertion":[{"value":"3 April 2025","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declaration of Competing Interest"}}]}}