{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T12:15:38Z","timestamp":1773317738440,"version":"3.50.1"},"reference-count":44,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2023,4,6]],"date-time":"2023-04-06T00:00:00Z","timestamp":1680739200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/pages\/standard-publication-reuse-rights"}],"funder":[{"name":"European Union\u2019s Horizon 2020 research and innovation program"},{"DOI":"10.13039\/100010663","name":"European Research Council","doi-asserted-by":"publisher","award":["714437"],"award-info":[{"award-number":["714437"]}],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,8,31]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Archival research is a complicated task that involves several diverse activities for the extraction of evidence and knowledge from a set of archival documents. The involved activities are usually unconnected, in terms of data connection and flow, making difficult their recursive revision and execution, as well as the inspection of provenance information at data element level. This article proposes a workflow model for holistic data management in archival research: from transcribing and documenting a set of archival documents, to curating the transcribed data, integrating it to a rich semantic network (knowledge graph), and then exploring the integrated data quantitatively. The workflow is provenance-aware, highly recursive and focuses on semantic interoperability, aiming at the production of sustainable data of high value and long-term validity. We provide implementation details for each step of the workflow and present its application in maritime history research. We also discuss relevant quality aspects and lessons learned from its application in a real context.<\/jats:p>","DOI":"10.1093\/llc\/fqad018","type":"journal-article","created":{"date-parts":[[2023,4,6]],"date-time":"2023-04-06T21:28:40Z","timestamp":1680816520000},"page":"1049-1066","source":"Crossref","is-referenced-by-count":4,"title":["A workflow model for holistic data management and semantic interoperability in quantitative archival research"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2788-526X","authenticated-orcid":false,"given":"Pavlos","family":"Fafalios","sequence":"first","affiliation":[{"name":"Information Systems Laboratory, FORTH-ICS , Heraklion, Greece"}]},{"given":"Yannis","family":"Marketakis","sequence":"additional","affiliation":[{"name":"Information Systems Laboratory, FORTH-ICS , Heraklion, Greece"}]},{"given":"Anastasia","family":"Axaridou","sequence":"additional","affiliation":[{"name":"Information Systems Laboratory, FORTH-ICS , Heraklion, Greece"}]},{"given":"Yannis","family":"Tzitzikas","sequence":"additional","affiliation":[{"name":"Information Systems Laboratory, FORTH-ICS , Heraklion, Greece"},{"name":"Computer Science Department, University of Crete , Heraklion, Greece"}]},{"given":"Martin","family":"Doerr","sequence":"additional","affiliation":[{"name":"Information Systems Laboratory, FORTH-ICS , Heraklion, Greece"}]}],"member":"286","published-online":{"date-parts":[[2023,4,6]]},"reference":[{"key":"2023083111393717500_fqad018-B1","first-page":"1","article-title":"A survey of RDF stores & SPARQL engines for querying knowledge graphs","author":"Ali","year":"2021","journal-title":"The VLDB Journal"},{"key":"2023083111393717500_fqad018-B2","volume-title":"A Semantic Web Primer","author":"Antoniou","year":"2004"},{"issue":"2","key":"2023083111393717500_fqad018-B3","doi-asserted-by":"crossref","first-page":"279","DOI":"10.3233\/SW-200416","article-title":"A challenge for historical research: making data FAIR using a collaborative ontology management environment (OntoME)","volume":"12","author":"Beretta","year":"2021","journal-title":"Semantic Web"},{"key":"2023083111393717500_fqad018-B4","first-page":"2","author":"Calvanese","year":"1998"},{"issue":"4","key":"2023083111393717500_fqad018-B5","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1016\/j.websem.2005.09.001","article-title":"Named graphs","volume":"3","author":"Carroll","year":"2005","journal-title":"Journal of Web Semantics"},{"key":"2023083111393717500_fqad018-B6","volume-title":"Evidential Reasoning in Archaeology","author":"Chapman","year":"2018"},{"issue":"6","key":"2023083111393717500_fqad018-B7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3418896","article-title":"An overview of end-to-end entity resolution for big data","volume":"53","author":"Christophides","year":"2020","journal-title":"ACM Computing Surveys (CSUR)"},{"issue":"2","key":"2023083111393717500_fqad018-B8","doi-asserted-by":"crossref","first-page":"464","DOI":"10.1177\/0843871420924240","article-title":"Seafaring lives at the crossroads of Mediterranean maritime history","volume":"32","author":"Delis","year":"2020","journal-title":"International Journal of Maritime History"},{"key":"2023083111393717500_fqad018-B9","author":"Dimou","year":"2014"},{"issue":"3","key":"2023083111393717500_fqad018-B10","first-page":"75","article-title":"The CIDOC conceptual reference module: an ontological approach to semantic interoperability of metadata","volume":"24","author":"Doerr","year":"2003","journal-title":"AI Magazine"},{"key":"2023083111393717500_fqad018-B11","first-page":"682","author":"Fafalios","year":"2021"},{"issue":"4","key":"2023083111393717500_fqad018-B12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3461460","article-title":"FAST CAT: collaborative data entry and curation for semantic interoperability in digital humanities","volume":"14","author":"Fafalios","year":"2021","journal-title":"Journal on Computing and Cultural Heritage (JOCCH)"},{"key":"2023083111393717500_fqad018-B13","first-page":"2969","author":"Gurajada","year":"2019"},{"issue":"1","key":"2023083111393717500_fqad018-B14","doi-asserted-by":"crossref","first-page":"498","DOI":"10.3390\/encyclopedia2010032","article-title":"Data quality\u2014concepts and problems","volume":"2","author":"Hassenstein","year":"2022","journal-title":"Encyclopedia"},{"key":"2023083111393717500_fqad018-B15","first-page":"1","volume-title":"Archival Science","author":"Hawkins","year":"2021"},{"issue":"1","key":"2023083111393717500_fqad018-B16","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-3-031-79432-2","article-title":"Linked data: Evolving the web into a global data space","volume":"1","author":"Heath","year":"2011","journal-title":"Synthesis Lectures on the Semantic Web: Theory and Technology"},{"issue":"1","key":"2023083111393717500_fqad018-B17","doi-asserted-by":"crossref","first-page":"187","DOI":"10.3233\/SW-190386","article-title":"Using the Semantic Web in digital humanities: shift from data publishing to data-analysis and serendipitous knowledge discovery","volume":"11","author":"Hyv\u00f6nen","year":"2020","journal-title":"Semantic Web"},{"key":"2023083111393717500_fqad018-B18","author":"Hyv\u00f6nen","year":"2020"},{"key":"2023083111393717500_fqad018-B19","first-page":"226","author":"Hyv\u00f6nen","year":"2014"},{"key":"2023083111393717500_fqad018-B20","doi-asserted-by":"crossref","first-page":"101814","DOI":"10.1016\/j.is.2021.101814","article-title":"Keyword search over schema-less RDF datasets by SPARQL query compilation","volume":"102","author":"Izquierdo","year":"2021","journal-title":"Information Systems"},{"key":"2023083111393717500_fqad018-B21","first-page":"121","author":"Kadilierakis","year":"2020"},{"key":"2023083111393717500_fqad018-B22","first-page":"19","author":"Kahle","year":"2017"},{"key":"2023083111393717500_fqad018-B23","author":"Kritsotakis","year":"2018"},{"issue":"4","key":"2023083111393717500_fqad018-B24","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1007\/s00799-016-0179-1","article-title":"X3ML mapping framework for information integration in cultural heritage and beyond","volume":"18","author":"Marketakis","year":"2017","journal-title":"International Journal on Digital Libraries"},{"issue":"3","key":"2023083111393717500_fqad018-B25","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1504\/IJMSO.2021.123044","article-title":"A workflow for supporting the evolution requirements of rdf-based semantic warehouses","volume":"15","author":"Marketakis","year":"2021","journal-title":"International Journal of Metadata, Semantics and Ontologies"},{"key":"2023083111393717500_fqad018-B26","first-page":"116","author":"Mendes","year":"2012"},{"issue":"6","key":"2023083111393717500_fqad018-B27","doi-asserted-by":"crossref","first-page":"539","DOI":"10.3233\/SW-140158","article-title":"Semantic technologies for historical research: a survey","volume":"6","author":"Mero\u00f1o-Pe\u00f1uela","year":"2015","journal-title":"Semantic Web"},{"issue":"3","key":"2023083111393717500_fqad018-B28","doi-asserted-by":"crossref","first-page":"22","DOI":"10.3390\/bdcc4030022","article-title":"Keyword search over RDF: is a single perspective enough?","volume":"4","author":"Nikas","year":"2020","journal-title":"Big Data and Cognitive Computing"},{"key":"2023083111393717500_fqad018-B29","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1002\/9781118680605.ch18","volume-title":"A New Companion to Digital Humanities","author":"Oldman","year":"2015"},{"key":"2023083111393717500_fqad018-B30","first-page":"325","volume-title":"International Semantic Web Conference","author":"Oldman","year":"2018"},{"issue":"1","key":"2023083111393717500_fqad018-B31","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1145\/309844.309849","article-title":"Semantic interoperability in global information systems","volume":"28","author":"Ouksel","year":"1999","journal-title":"ACM Sigmod Record"},{"issue":"28","key":"2023083111393717500_fqad018-B32","doi-asserted-by":"crossref","first-page":"60","DOI":"10.51829\/Drassana.28.649","article-title":"Digitizing, curating and visualizing archival sources of maritime history: the case of ship logbooks of the nineteenth and twentieth centuries","author":"Petrakis","year":"2020","journal-title":"Drassana: Revista del Museu Mar\u00edtim"},{"issue":"4","key":"2023083111393717500_fqad018-B33","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1145\/505248.506010","article-title":"Data quality assessment","volume":"45","author":"Pipino","year":"2002","journal-title":"Communications of the ACM"},{"key":"2023083111393717500_fqad018-B34","first-page":"495","author":"Roussakis","year":"2015"},{"key":"2023083111393717500_fqad018-B35","first-page":"1017","volume-title":"ECAI 2012","author":"Scholz","year":"2012"},{"key":"2023083111393717500_fqad018-B36","first-page":"43","author":"Stefanidis","year":"2014"},{"key":"2023083111393717500_fqad018-B37","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511487385","volume-title":"Making Prehistory: Historical Science and the Scientific Realism Debate","author":"Turner","year":"2007"},{"issue":"3","key":"2023083111393717500_fqad018-B38","doi-asserted-by":"crossref","first-page":"1612","DOI":"10.3390\/heritage5030084","article-title":"CIDOC-CRM and machine learning: a survey and future research","volume":"5","author":"Tzitzikas","year":"2022","journal-title":"Heritage"},{"key":"2023083111393717500_fqad018-B39","doi-asserted-by":"crossref","first-page":"805","DOI":"10.1002\/9781405164061.ch35","volume-title":"The Blackwell Companion to Organizations.","author":"Ventresca","year":"2017"},{"key":"2023083111393717500_fqad018-B40","doi-asserted-by":"crossref","first-page":"428","DOI":"10.1016\/j.jbusres.2017.12.043","article-title":"Open science now: A systematic literature review for an integrated definition","volume":"88","author":"Vicente-Saez","year":"2018","journal-title":"Journal of Business Research"},{"key":"2023083111393717500_fqad018-B41","author":"Volz","year":"2009"},{"issue":"4","key":"2023083111393717500_fqad018-B42","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1080\/07421222.1996.11518099","article-title":"Beyond accuracy: what data quality means to data consumers","volume":"12","author":"Wang","year":"1996","journal-title":"Journal of Management Information Systems"},{"key":"2023083111393717500_fqad018-B43","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1016\/j.future.2022.05.014","article-title":"A survey of human-in-the-loop for machine learning","volume":"135","author":"Wu","year":"2022","journal-title":"Future Generation Computer Systems"},{"issue":"1","key":"2023083111393717500_fqad018-B44","doi-asserted-by":"crossref","first-page":"63","DOI":"10.3233\/SW-150175","article-title":"Quality assessment for linked data: a survey","volume":"7","author":"Zaveri","year":"2016","journal-title":"Semantic Web"}],"container-title":["Digital Scholarship in the Humanities"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/dsh\/article-pdf\/38\/3\/1049\/51309477\/fqad018.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/dsh\/article-pdf\/38\/3\/1049\/51309477\/fqad018.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,31]],"date-time":"2023-08-31T11:43:03Z","timestamp":1693482183000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/dsh\/article\/38\/3\/1049\/7110232"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,6]]},"references-count":44,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2023,4,6]]},"published-print":{"date-parts":[[2023,8,31]]}},"URL":"https:\/\/doi.org\/10.1093\/llc\/fqad018","relation":{},"ISSN":["2055-7671","2055-768X"],"issn-type":[{"value":"2055-7671","type":"print"},{"value":"2055-768X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,9,1]]},"published":{"date-parts":[[2023,4,6]]}}}