{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,25]],"date-time":"2025-09-25T13:29:33Z","timestamp":1758806973325,"version":"3.38.0"},"reference-count":63,"publisher":"SAGE Publications","issue":"1-2","license":[{"start":{"date-parts":[[2019,4,24]],"date-time":"2019-04-24T00:00:00Z","timestamp":1556064000000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Data Science"],"published-print":{"date-parts":[[2019,11,25]]},"abstract":"<jats:p> The sharing of scientific and scholarly data has been increasingly promoted over the last decade, leading to open repositories in many different scientific domains. However, data sharing and open data are not final goals in themselves, the real benefit is in data reuse, which allows leveraging investments in research and enables large-scale data-driven research progress. Focusing on reuse, this paper discusses the design of an integrated framework to automatically take advantage of large amounts of scientific data extracted from the literature to support research, and in particular scientific model development. Scientific models reproduce and predict complex phenomena and their development is a rather challenging task, within which scientific experiments have a key role in their continuous validation. Starting from the combustion kinetics domain, this paper discusses a set of use cases and a first prototype for such a framework which leads to a set of new requirements and an architecture that can be generalized to other domains. The paper analyzes the needs, the challenges and the research directions for such a framework, in particular those related to data management, automatic scientific model validation, data aggregation and data analysis, to leverage large amounts of published scientific data for new knowledge extraction. <\/jats:p>","DOI":"10.3233\/ds-190017","type":"journal-article","created":{"date-parts":[[2019,4,26]],"date-time":"2019-04-26T15:40:19Z","timestamp":1556293219000},"page":"245-273","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":6,"title":["Towards a scientific data framework to support scientific model development"],"prefix":"10.1177","volume":"2","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3305-9220","authenticated-orcid":false,"given":"Gabriele","family":"Scalia","sequence":"first","affiliation":[{"name":"Department of Electronics, Information and Bioengineering, Politecnico di Milano, Piazza Leonardo da Vinci 32, 20131 Milano, Italy. E-mail:\u00a0"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3106-0236","authenticated-orcid":false,"given":"Matteo","family":"Pelucchi","sequence":"additional","affiliation":[{"name":"Department of Chemistry, Materials, and Chemical Engineering Giulio Natta, Politecnico di Milano, Piazza Leonardo da Vinci 32, 20131 Milano, Italy. E-mail:\u00a0"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4339-7872","authenticated-orcid":false,"given":"Alessandro","family":"Stagni","sequence":"additional","affiliation":[{"name":"Department of Chemistry, Materials, and Chemical Engineering Giulio Natta, Politecnico di Milano, Piazza Leonardo da Vinci 32, 20131 Milano, Italy. E-mail:\u00a0"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5653-0180","authenticated-orcid":false,"given":"Alberto","family":"Cuoci","sequence":"additional","affiliation":[{"name":"Department of Chemistry, Materials, and Chemical Engineering Giulio Natta, Politecnico di Milano, Piazza Leonardo da Vinci 32, 20131 Milano, Italy. E-mail:\u00a0"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8382-7342","authenticated-orcid":false,"given":"Tiziano","family":"Faravelli","sequence":"additional","affiliation":[{"name":"Department of Chemistry, Materials, and Chemical Engineering Giulio Natta, Politecnico di Milano, Piazza Leonardo da Vinci 32, 20131 Milano, Italy. E-mail:\u00a0"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2034-9774","authenticated-orcid":false,"given":"Barbara","family":"Pernici","sequence":"additional","affiliation":[{"name":"Department of Electronics, Information and Bioengineering, Politecnico di Milano, Piazza Leonardo da Vinci 32, 20131 Milano, Italy. E-mail:\u00a0"}]}],"member":"179","published-online":{"date-parts":[[2019,4,24]]},"reference":[{"doi-asserted-by":"publisher","key":"ref001","DOI":"10.1007\/s11837-018-3079-6"},{"doi-asserted-by":"publisher","key":"ref002","DOI":"10.1109\/TKDE.2014.2330822"},{"doi-asserted-by":"publisher","key":"ref003","DOI":"10.1109\/SOCA.2016.15"},{"doi-asserted-by":"publisher","key":"ref004","DOI":"10.1016\/j.future.2018.07.014"},{"doi-asserted-by":"publisher","key":"ref005","DOI":"10.1016\/j.future.2017.05.041"},{"doi-asserted-by":"publisher","key":"ref006","DOI":"10.1145\/2957324"},{"doi-asserted-by":"publisher","key":"ref007","DOI":"10.1016\/j.combustflame.2016.03.019"},{"doi-asserted-by":"publisher","key":"ref008","DOI":"10.7551\/mitpress\/9963.001.0001"},{"doi-asserted-by":"publisher","key":"ref009","DOI":"10.3233\/SW-160217"},{"doi-asserted-by":"publisher","key":"ref010","DOI":"10.1145\/3216122.3216124"},{"doi-asserted-by":"publisher","key":"ref011","DOI":"10.1021\/acs.jctc.8b00701"},{"doi-asserted-by":"publisher","key":"ref012","DOI":"10.1145\/2588555.2610520"},{"doi-asserted-by":"publisher","key":"ref013","DOI":"10.1145\/2882903.2912574"},{"unstructured":"A.\u00a0Cohan and N.\u00a0Goharian, Scientific article summarization using citation-context and article\u2019s discourse structure, arXiv preprint, arXiv:1704.06619, 2017.","key":"ref014"},{"doi-asserted-by":"publisher","key":"ref015","DOI":"10.1021\/acscentsci.7b00064"},{"doi-asserted-by":"publisher","key":"ref016","DOI":"10.1021\/acs.accounts.8b00087"},{"doi-asserted-by":"publisher","key":"ref017","DOI":"10.1016\/j.cpc.2015.02.014"},{"doi-asserted-by":"publisher","key":"ref018","DOI":"10.3233\/ISU-160805"},{"doi-asserted-by":"publisher","key":"ref019","DOI":"10.1177\/1094342017704893"},{"doi-asserted-by":"publisher","key":"ref020","DOI":"10.1007\/s41060-016-0039-5"},{"doi-asserted-by":"publisher","key":"ref021","DOI":"10.1091\/mbc.E13-12-0756"},{"doi-asserted-by":"publisher","key":"ref022","DOI":"10.1002\/asi.22652"},{"doi-asserted-by":"publisher","key":"ref023","DOI":"10.1016\/j.proci.2006.08.121"},{"doi-asserted-by":"publisher","key":"ref024","DOI":"10.1007\/978-3-319-53637-8_3"},{"doi-asserted-by":"publisher","key":"ref025","DOI":"10.1039\/C7CP07777G"},{"doi-asserted-by":"publisher","key":"ref026","DOI":"10.1109\/ISCC-C.2013.57"},{"doi-asserted-by":"publisher","key":"ref027","DOI":"10.1016\/j.proci.2018.07.023"},{"doi-asserted-by":"publisher","key":"ref028","DOI":"10.1016\/j.bdr.2015.01.005"},{"doi-asserted-by":"publisher","key":"ref029","DOI":"10.1016\/j.proci.2018.07.113"},{"doi-asserted-by":"publisher","key":"ref030","DOI":"10.1016\/j.jss.2017.01.001"},{"doi-asserted-by":"publisher","key":"ref031","DOI":"10.1145\/2850413"},{"doi-asserted-by":"publisher","key":"ref032","DOI":"10.1145\/3012429"},{"doi-asserted-by":"publisher","key":"ref033","DOI":"10.1016\/j.acalib.2014.06.011"},{"doi-asserted-by":"publisher","key":"ref034","DOI":"10.1002\/asi.21339"},{"doi-asserted-by":"publisher","key":"ref035","DOI":"10.1136\/amiajnl-2014-002974"},{"doi-asserted-by":"publisher","key":"ref036","DOI":"10.1016\/j.combustflame.2014.03.006"},{"doi-asserted-by":"publisher","key":"ref037","DOI":"10.1016\/j.combustflame.2014.12.001"},{"doi-asserted-by":"publisher","key":"ref038","DOI":"10.1007\/978-3-642-41335-3_29"},{"doi-asserted-by":"publisher","key":"ref039","DOI":"10.5334\/dsj-2017-008"},{"doi-asserted-by":"publisher","key":"ref040","DOI":"10.1145\/2858036.2858543"},{"doi-asserted-by":"publisher","key":"ref041","DOI":"10.1016\/j.proci.2018.06.104"},{"doi-asserted-by":"publisher","key":"ref042","DOI":"10.1007\/978-3-319-68204-4_19"},{"doi-asserted-by":"publisher","key":"ref043","DOI":"10.5281\/zenodo.2632008"},{"unstructured":"A.\u00a0Rigamonti, Automatic modeling system: A database based infrastructure to develop, validate and evaluate scientific models. An application to combustion kinetic models, Graduation thesis, Politecnico di Milano, 2017. https:\/\/www.politesi.polimi.it\/bitstream\/10589\/133895\/3\/2017-Aprile-Rigamonti.pdf.","key":"ref044"},{"doi-asserted-by":"publisher","key":"ref045","DOI":"10.1016\/j.websem.2016.01.001"},{"doi-asserted-by":"publisher","key":"ref046","DOI":"10.1002\/cpe.4041"},{"doi-asserted-by":"publisher","key":"ref047","DOI":"10.1038\/nrg3394"},{"doi-asserted-by":"publisher","key":"ref048","DOI":"10.5281\/zenodo.2629359"},{"doi-asserted-by":"publisher","key":"ref049","DOI":"10.1007\/978-3-030-01379-0_10"},{"doi-asserted-by":"publisher","key":"ref050","DOI":"10.14778\/2977797.2977806"},{"doi-asserted-by":"publisher","key":"ref051","DOI":"10.1109\/ICDE.2007.367935"},{"doi-asserted-by":"publisher","key":"ref052","DOI":"10.1145\/2623330.2623667"},{"doi-asserted-by":"publisher","key":"ref053","DOI":"10.1016\/j.combustflame.2015.10.013"},{"unstructured":"T.\u00a0Varga, T.\u00a0Tur\u00e1nyi, E.\u00a0Czinki, T.\u00a0Furtenbacher and A.\u00a0Cs\u00e1sz\u00e1r, Respecth: A joint reaction kinetics, spectroscopy, and thermochemistry information system, in: Proceedings of the 7th European Combustion Meeting, Vol.\u00a030, 2015, pp.\u00a01\u20135. http:\/\/www.ecm2015.hu\/papers\/P1-04.pdf.","key":"ref054"},{"doi-asserted-by":"publisher","key":"ref055","DOI":"10.1016\/S0010-2180(02)00552-7"},{"doi-asserted-by":"publisher","key":"ref056","DOI":"10.1016\/j.pecs.2014.10.002"},{"doi-asserted-by":"publisher","key":"ref057","DOI":"10.1109\/BigDataCongress.2015.116"},{"doi-asserted-by":"publisher","key":"ref058","DOI":"10.1002\/kin.21142"},{"doi-asserted-by":"publisher","key":"ref059","DOI":"10.1038\/sdata.2016.18"},{"doi-asserted-by":"publisher","key":"ref060","DOI":"10.1109\/TBDATA.2016.2641460"},{"doi-asserted-by":"publisher","key":"ref061","DOI":"10.1186\/2041-1480-6-4"},{"doi-asserted-by":"publisher","key":"ref062","DOI":"10.1007\/978-3-319-26190-4_37"},{"doi-asserted-by":"publisher","key":"ref063","DOI":"10.1177\/1476127017697510"}],"container-title":["Data Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/DS-190017","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/DS-190017","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/DS-190017","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,11]],"date-time":"2025-03-11T10:20:31Z","timestamp":1741688431000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/DS-190017"}},"subtitle":["New requirements emerging from the combustion kinetics domain"],"short-title":[],"issued":{"date-parts":[[2019,4,24]]},"references-count":63,"journal-issue":{"issue":"1-2","published-print":{"date-parts":[[2019,11,25]]}},"alternative-id":["10.3233\/DS-190017"],"URL":"https:\/\/doi.org\/10.3233\/ds-190017","relation":{},"ISSN":["2451-8484","2451-8492"],"issn-type":[{"type":"print","value":"2451-8484"},{"type":"electronic","value":"2451-8492"}],"subject":[],"published":{"date-parts":[[2019,4,24]]}}}