{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T08:25:01Z","timestamp":1774081501832,"version":"3.50.1"},"reference-count":38,"publisher":"Emerald","issue":"7","license":[{"start":{"date-parts":[[2022,2,9]],"date-time":"2022-02-09T00:00:00Z","timestamp":1644364800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JD"],"published-print":{"date-parts":[[2022,12,19]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>Budgeting data curation tasks in research projects is difficult. In this paper, we investigate the time spent on data curation, more specifically on cleaning and documenting quantitative data for data sharing. We develop recommendations on cost factors in research data management.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>We make use of a pilot study conducted at the GESIS Data Archive for the Social Sciences in Germany between December 2016 and September 2017. During this period, data curators at GESIS - Leibniz Institute for the Social Sciences documented their working hours while cleaning and documenting data from ten quantitative survey studies. We analyse recorded times and discuss with the data curators involved in this work to identify and examine important cost factors in data curation, that is aspects that increase hours spent and factors that lead to a reduction of their work.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>We identify two major drivers of time spent on data curation: The size of the data and personal information contained in the data. Learning effects can occur when data are similar, that is when they contain same variables. Important interdependencies exist between individual tasks in data curation and in connection with certain data characteristics.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>The different tasks of data curation, time spent on them and interdependencies between individual steps in curation have so far not been analysed.<\/jats:p><\/jats:sec>","DOI":"10.1108\/jd-08-2021-0167","type":"journal-article","created":{"date-parts":[[2022,2,8]],"date-time":"2022-02-08T05:18:52Z","timestamp":1644297532000},"page":"282-304","source":"Crossref","is-referenced-by-count":16,"title":["Measuring the time spent on\u00a0data curation"],"prefix":"10.1108","volume":"78","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0574-9275","authenticated-orcid":false,"given":"Anja","family":"Perry","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2784-6968","authenticated-orcid":false,"given":"Sebastian","family":"Netscher","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2022,2,9]]},"reference":[{"key":"key2023030313583695200_ref001","unstructured":"4TU.ResearchData, TU Delft (2020), \u201cData management costing tool\u201d, available at: https:\/\/zingtree.com\/host.php?style=buttons&tree_id=511095771&persist_names=Restart&persist_node_ids=1&start_node=1&start_tree=511095771 (accessed 27 August 2021)."},{"key":"key2023030313583695200_ref002","unstructured":"Beagrie, C. (2017), \u201cCESSDA SaW costs factsheet\u201d, doi: 10.18448\/16.0003."},{"key":"key2023030313583695200_ref003","unstructured":"Beagrie, N., Chruszcz, J. and Lavoie, B. (2008), \u201cKeeping research data safe - a cost model and guidance for UK universities\u201d, Final report, available at: https:\/\/www.webarchive.org.uk\/wayback\/archive\/20140615221657\/http:\/\/www.jisc.ac.uk\/media\/documents\/publications\/keepingresearchdatasafe0408.pdf (accessed 27 August 2021)."},{"key":"key2023030313583695200_ref004","unstructured":"Beagrie, N., Lavoie, B. and Woollard, M. (2010), \u201cKeeping research data safe 2\u201d, Final report, available at: https:\/\/www.webarchive.org.uk\/wayback\/archive\/20140615221405\/http:\/www.jisc.ac.uk\/media\/documents\/publications\/reports\/2010\/keepingresearchdatasafe2.pdf (accessed 27 August 2021)."},{"key":"key2023030313583695200_ref005","unstructured":"Bertelmann, R., Gebauer, P., Hasler, T., Kirchner, I., Peters-Kottig, W., Razum, M., Recker, A., Ulbricht, D. and van Gasselt, S. (2014), \u201cEinstieg ins Forschungsdatenmanagement in den Geowissenschaften\u201d, Potsdam, available at: https:\/\/gfzpublic.gfz-potsdam.de\/rest\/items\/item_749901_8\/component\/file_749904\/content (accessed 27 August 2021)."},{"key":"key2023030313583695200_ref006","doi-asserted-by":"publisher","article-title":"Handlungsempfehlungen zu Forschungsdatenmanagement und -infrastruktur an Hochschulstandorten","year":"2019","DOI":"10.25625\/PAYCKB"},{"key":"key2023030313583695200_ref008","volume-title":"Managing and Sharing Research Data: A Guide to Good Practice","year":"2014"},{"key":"key2023030313583695200_ref009","unstructured":"DDI Alliance (2021a), \u201cDocument, discover and interoperate - the website of the DDI alliance\u201d, available at: https:\/\/ddialliance.org\/ (accessed 6 August 2021)."},{"key":"key2023030313583695200_ref010","article-title":"DDI codebook 2.5","author":"DDI Alliance","year":"2021"},{"key":"key2023030313583695200_ref011","unstructured":"Donaldson, M. and Ensberg, V. (2018), \u201cHow to ensure that the costs of data management activities are budgeted in grant proposals?\u201d, Open Working, Blog, available at: https:\/\/openworking.wordpress.com\/2018\/03\/09\/how-to-ensure-that-the-costs-of-data-management-activities-are-budgeted-in-grant-proposals\/ (accessed 8 January 2021)."},{"key":"key2023030313583695200_ref012","unstructured":"European Commission (2019), \u201cH2020 programme. AGA - annotated model grant agreement\u201d, 26 June, available at: https:\/\/ec.europa.eu\/research\/participants\/data\/ref\/h2020\/grants_manual\/amga\/h2020-amga_en.pdf (accessed 27 August 2021)."},{"key":"key2023030313583695200_ref013","unstructured":"European Parliament and Council of the European Union (2018), \u201cGeneral data protection regulation\u00a02016\/678\u201d, available at: https:\/\/eur-lex.europa.eu\/eli\/reg\/2016\/679\/oj (accessed 27 August 2021)."},{"key":"key2023030313583695200_ref014","unstructured":"European Research Council (2019), \u201cOpen research data and data management plans - information for ERC grantees\u201d, European Commission, available at: https:\/\/erc.europa.eu\/sites\/default\/files\/document\/file\/ERC_info_document-Open_Research_Data_and_Data_Management_Plans.pdf (accessed 27 August 2021)."},{"key":"key2023030313583695200_ref015","unstructured":"German Research Foundation (2021), \u201cHandling of research data\u201d, Handling of Research Data - Information on the Resources Available, available at: https:\/\/www.dfg.de\/en\/research_funding\/principles_dfg_funding\/research_data\/resources_available\/index.html (accessed 5 August 2021)."},{"issue":"6","key":"key2023030313583695200_ref016","doi-asserted-by":"crossref","first-page":"1318","DOI":"10.1108\/JD-02-2018-0024","article-title":"Digital curation: the development of a discipline within information science","volume":"74","year":"2018","journal-title":"Journal of Documentation"},{"key":"key2023030313583695200_ref017","unstructured":"ICPSR (2020), \u201cICPSR curation levels\u201d, available at: https:\/\/www.icpsr.umich.edu\/files\/datamanagement\/icpsr-curation-levels.pdf (accessed 12 November 2021)."},{"key":"key2023030313583695200_ref018","doi-asserted-by":"publisher","DOI":"10.1093\/database\/baw110","article-title":"How much does curation cost?","volume":"2016","year":"2016","journal-title":"Database"},{"key":"key2023030313583695200_ref019","doi-asserted-by":"publisher","article-title":"Organisation und Struktur, DFG-Projekt RADIESCHEN - Rahmenbedingungen einer disziplin\u00fcbergreifenden Forschungsdateninfrastruktur","year":"2013","DOI":"10.2312\/RADIESCHEN_005"},{"key":"key2023030313583695200_ref020","doi-asserted-by":"publisher","volume-title":"da|ra Metadata Schema - Documentation for the Publication and Citation of Social and Economic Data","year":"2017","DOI":"10.4232\/10.mdsdoc.4.0"},{"issue":"2","key":"key2023030313583695200_ref021","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1108\/JD-02-2014-0026","article-title":"Data literacy: in search of a name and identity","volume":"71","year":"2015","journal-title":"Journal of Documentation"},{"issue":"1","key":"key2023030313583695200_ref022","doi-asserted-by":"publisher","first-page":"2347","DOI":"10.7710\/2162-3309.2347","article-title":"Conceptualizing data curation activities within two academic libraries","volume":"8","year":"2020","journal-title":"Journal of Librarianship and Scholarly Communication"},{"issue":"3","key":"key2023030313583695200_ref023","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0173987","article-title":"Practices of research data curation in institutional repositories: a qualitative view from repository staff","volume":"12","year":"2017","journal-title":"PLoS ONE"},{"key":"key2023030313583695200_ref024","unstructured":"L'Hours, H., Kejser, U.B., Johansen, K.H.E., Thirifays, A., Wang, D., Strodl, S., Ashley, K., Davidson, J., McCann, P., Krupp, J. and Grindley, N. (2014), \u201cD3.2 cost concept model and gateway specification\u201d, Final report, Colchester, available at: https:\/\/www.4cproject.eu\/documents\/D3.2%20Cost%20Concept%20Model%20and%20Gateway%20Specification.pdf (accessed 27 August 2021)."},{"issue":"7796","key":"key2023030313583695200_ref025","doi-asserted-by":"publisher","first-page":"491","DOI":"10.1038\/d41586-020-00505-7","article-title":"Invest 5% of research funds in ensuring data are reusable","volume":"578","year":"2020","journal-title":"Nature"},{"key":"key2023030313583695200_ref007","volume-title":"Life Cycle Decisions for Biomedical Data: The Challenge of Forecasting Costs","author":"National Academies of Sciences, Engineering and Medicine","year":"2020"},{"key":"key2023030313583695200_ref026","doi-asserted-by":"publisher","DOI":"10.17226\/18590","volume-title":"Preparing the Workforce for Digital Curation","author":"National Research Council","year":"2015"},{"issue":"4","key":"key2023030313583695200_ref027","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1007\/s00799-012-0092-1","article-title":"An activity-based costing model for long-term preservation and dissemination of digital research data: the case of DANS","volume":"12","year":"2012","journal-title":"International Journal on Digital Libraries"},{"issue":"5","key":"key2023030313583695200_ref028","doi-asserted-by":"publisher","first-page":"961","DOI":"10.1108\/JD-10-2015-0123","article-title":"The conceptual landscape of digital curation","volume":"72","year":"2016","journal-title":"Journal of Documentation"},{"key":"key2023030313583695200_ref029","doi-asserted-by":"publisher","year":"2019","DOI":"10.25625\/NTRUKA\/KD3XHY"},{"key":"key2023030313583695200_ref030","unstructured":"Service-Team Forschungsdaten der Uni Hannover und der TIB (2018), \u201cWie lassen sich die Kosten f\u00fcr das Forschungsdatenmanagement absch\u00e4tzen?\u201d, December 2018, available at: https:\/\/www.fdm.uni-hannover.de\/fileadmin\/fdm\/Dokumente\/200727_KalkulationFDMKosten.pdf (accessed 27 August 2021)."},{"key":"key2023030313583695200_ref031","unstructured":"Thirifays, A., Sisu, D., Davidson, J., Haage, K., Faria, L., Grootveld, M., Stokes, P. and Middleton, S. (2014), \u201cD3.3 curation costs Exchange framework, collaboration to clarify the costs of curation\u201d, Final report, available at: https:\/\/www.4cproject.eu\/documents\/4C%20-%20D3%203%20-%20Curation%20Costs%20Exchange%20Framework%20-%2031%20Oct%202014%20-V1.0.pdf (accessed 27 August 2021)."},{"key":"key2023030313583695200_ref032","first-page":"1","article-title":"Data management and the curation continuum: how the Monash experience is informing repository relationships","year":"2017"},{"issue":"1","key":"key2023030313583695200_ref033","doi-asserted-by":"publisher","first-page":"87","DOI":"10.2218\/ijdc.v14i1.643","article-title":"Updating the data curation continuum","volume":"14","year":"2019","journal-title":"International Journal of Digital Curation"},{"key":"key2023030313583695200_ref034","unstructured":"UK Data Service (2015), \u201cUK data service - data management costing tool and checklist\u201d, UK Data Archive and University of Essex, available at: https:\/\/ukdataservice.ac.uk\/media\/622368\/costingtool.pdf (accessed 27 August 2021)."},{"key":"key2023030313583695200_ref035","unstructured":"UK Research and Innovation (2015), \u201cGuidance on best practice in the management of research data\u201d, available at: https:\/\/www.ukri.org\/wp-content\/uploads\/2020\/10\/UKRI-020920-GuidanceBestPracticeManagementResearchData.pdf (accessed 12 November 2021)."},{"key":"key2023030313583695200_ref036","unstructured":"Utrecht University (n.d.), \u201cCosts of data management - research data management support\u201d, available at: https:\/\/www.uu.nl\/en\/research\/research-data-management\/guides\/costs-of-data-management (accessed 27 August 2021)."},{"issue":"1","key":"key2023030313583695200_ref037","doi-asserted-by":"publisher","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR Guiding Principles for scientific data management and stewardship","volume":"3","year":"2016","journal-title":"Scientific Data"},{"key":"key2023030313583695200_ref038","first-page":"159","article-title":"Dokumentation von Umfragedaten in L\u00e4nder vergleichender Perspektive mithilfe des ZA Dataset Documentation Managers (DSDM)","volume":"59","year":"2006","journal-title":"ZA-Information\/Zentralarchiv F\u00fcr Empirische Sozialforschung"}],"container-title":["Journal of Documentation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-08-2021-0167\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-08-2021-0167\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:34:48Z","timestamp":1753396488000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/jd\/article\/78\/7\/282-304\/431929"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,9]]},"references-count":38,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2022,2,9]]},"published-print":{"date-parts":[[2022,12,19]]}},"alternative-id":["10.1108\/JD-08-2021-0167"],"URL":"https:\/\/doi.org\/10.1108\/jd-08-2021-0167","relation":{},"ISSN":["0022-0418"],"issn-type":[{"value":"0022-0418","type":"print"}],"subject":[],"published":{"date-parts":[[2022,2,9]]}}}