{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,5]],"date-time":"2026-04-05T21:49:14Z","timestamp":1775425754180,"version":"3.50.1"},"reference-count":105,"publisher":"Emerald","issue":"7","license":[{"start":{"date-parts":[[2024,4,18]],"date-time":"2024-04-18T00:00:00Z","timestamp":1713398400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JD"],"published-print":{"date-parts":[[2024,12,16]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>This paper focuses on image-to-text manuscript processing through Handwritten Text Recognition (HTR), a Machine Learning (ML) approach enabled by Artificial Intelligence (AI). With HTR now achieving high levels of accuracy, we consider its potential impact on our near-future information environment and knowledge of the past.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>In undertaking a more constructivist analysis, we identified gaps in the current literature through a Grounded Theory Method (GTM). This guided an iterative process of concept mapping through writing sprints in workshop settings. We identified, explored and confirmed themes through group discussion and a further interrogation of relevant literature, until reaching saturation.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>Catalogued as part of our GTM, 120 published texts underpin this paper. We found that HTR facilitates accurate transcription and dataset cleaning, while facilitating access to a variety of historical material. HTR contributes to a virtuous cycle of dataset production and can inform the development of online cataloguing. However, current limitations include dependency on digitisation pipelines, potential archival history omission and entrenchment of bias. We also cite near-future HTR considerations. These include encouraging open access, integrating advanced AI processes and metadata extraction; legal and moral issues surrounding copyright and data ethics; crediting individuals\u2019 transcription contributions and HTR\u2019s environmental costs.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>Our research produces a set of best practice recommendations for researchers, data providers and memory institutions, surrounding HTR use. This forms an initial, though not comprehensive, blueprint for directing future HTR research. In pursuing this, the narrative that HTR\u2019s speed and efficiency will simply transform scholarship in archives is deconstructed.<\/jats:p><\/jats:sec>","DOI":"10.1108\/jd-09-2023-0183","type":"journal-article","created":{"date-parts":[[2024,5,1]],"date-time":"2024-05-01T13:28:09Z","timestamp":1714570089000},"page":"148-167","source":"Crossref","is-referenced-by-count":10,"title":["The implications of handwritten text recognition for accessing the\u00a0past at scale"],"prefix":"10.1108","volume":"80","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4577-6596","authenticated-orcid":false,"given":"Joseph","family":"Nockels","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1044-509X","authenticated-orcid":false,"given":"Paul","family":"Gooding","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6496-3197","authenticated-orcid":false,"given":"Melissa","family":"Terras","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2024,4,18]]},"reference":[{"key":"key2024121103340212700_ref001","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18352\/lq.10371","article-title":"Transparency, provenance and collections as data: the national library of Scotland's data foundry","volume":"31","year":"2021","journal-title":"LIBER Quarterly"},{"issue":"9","key":"key2024121103340212700_ref002","doi-asserted-by":"publisher","first-page":"141","DOI":"10.48550\/arXiv.2008.05373","article-title":"Attention-based fully gated cnn-bgru for Russian handwritten text","volume":"6","year":"2020","journal-title":"Journal of Imaging"},{"issue":"7900","key":"key2024121103340212700_ref003","doi-asserted-by":"publisher","first-page":"280","DOI":"10.1038\/s41586-022-04448-z","article-title":"Restoring and attributing ancient texts using deep neural networks","volume":"603","year":"2022","journal-title":"Nature"},{"issue":"1","key":"key2024121103340212700_ref004","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1080\/14626268.2013.767276","article-title":"Speculative design: crafting the speculation","volume":"24","year":"2013","journal-title":"Digital Creativity"},{"issue":"1","key":"key2024121103340212700_ref005","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1177\/20539517231173901","article-title":"Stepping back from AI and Data for Good - current trends and ways forward","volume":"10","year":"2023","journal-title":"Big Data and Society"},{"key":"key2024121103340212700_ref006","doi-asserted-by":"publisher","first-page":"266","DOI":"10.1145\/3083671.3083700","article-title":"Infrastructures of the imagination: community design for speculative urban technologies","year":"2017"},{"key":"key2024121103340212700_ref007","doi-asserted-by":"publisher","first-page":"610","DOI":"10.1145\/3442188.3445922","article-title":"On the dangers of stochastic parrots: can Language Models Be too big?","year":"2021"},{"key":"key2024121103340212700_ref008","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1109\/DAS.2014.40","article-title":"The A2iA Arabic handwritten text recognition system at the open HaRT2013","year":"2014"},{"issue":"2","key":"key2024121103340212700_ref009","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1177\/0165551520950246","article-title":"Reusing digital collections from GLAM institutions","volume":"48","year":"2020","journal-title":"Journal of Information Science"},{"key":"key2024121103340212700_ref010","volume-title":"Constructing Grounded Theory: A Practical Guide through Qualitative Analysis","year":"2006"},{"key":"key2024121103340212700_ref011","unstructured":"Chew, C. (2021), \u201cRecording. Non-discriminatory library cataloguing practices for sound and moving image\u201d, available at: https:\/\/scotlands-sounds.nls.uk\/index.php\/2021\/10\/08\/non-discriminatory-library-cataloguing-practices-for-sound-and-moving-image\/"},{"issue":"1","key":"key2024121103340212700_ref012","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1561\/115.00000005","article-title":"Turning history into data: data collection, measurement and inference in HPE","volume":"1","year":"2021","journal-title":"Journal of Historical Political Economy"},{"key":"key2024121103340212700_ref013","volume-title":"Blog post. Is Google Good for History?","year":"2010"},{"issue":"1","key":"key2024121103340212700_ref014","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1515\/pdtc-2013-0003","article-title":"Preserving imperfection: assessing the incidence of digital imaging error in HathiTrust","volume":"42","year":"2013","journal-title":"Digital Technology and Culture"},{"issue":"1","key":"key2024121103340212700_ref015","doi-asserted-by":"publisher","first-page":"188","DOI":"10.1353\/bh.2017.0006","article-title":"\u2018Q i-jtb the raven\u2019: taking dirty OCR seriously","volume":"20","year":"2017","journal-title":"Book History"},{"key":"key2024121103340212700_ref016","unstructured":"Cordell, R. (2020), \u201cMachine learning + libraries\u201d, Library of Congress, available at: https:\/\/labs.loc.gov\/work\/experiments\/newspaper-navigator\/"},{"key":"key2024121103340212700_ref017","unstructured":"Cottuli, M. (2022), \u201cNew handwriting experiences come to Windows 10 Insider build 16215 for PC\u201d, OnMSFT, 28 December, 2022, available at: https:\/\/www.onmsft.com\/news\/new-handwriting-experiences-come-to-windows-10-insider-build-16215-for-pc\/"},{"issue":"1","key":"key2024121103340212700_ref018","first-page":"1","article-title":"Waiting for the ghost train: strategies for managing electronic personal records before it is too late","volume":"1","year":"1999","journal-title":"Archival Issues"},{"issue":"4","key":"key2024121103340212700_ref019","doi-asserted-by":"publisher","first-page":"690","DOI":"10.1111\/j.1527-2001.2011.01220.x","article-title":"Climate change, vulnerability, and responsibility","volume":"26","year":"2011","journal-title":"Hypatia"},{"key":"key2024121103340212700_ref020","volume-title":"Data Feminism","year":"2020"},{"key":"key2024121103340212700_ref021","doi-asserted-by":"publisher","volume-title":"A Researcher Guide to Writing a Climate Justice Oriented Data Management Plan","year":"2022","DOI":"10.5281\/zenodo.6451499"},{"issue":"12","key":"key2024121103340212700_ref022","doi-asserted-by":"publisher","first-page":"2427","DOI":"10.1002\/asi.23330","article-title":"User conceptions of trustworthiness for digital archival documents","volume":"66","year":"2015","journal-title":"Journal of the Association for Information Science and Technology"},{"issue":"6","key":"key2024121103340212700_ref023","first-page":"1","article-title":"Blood at the root","volume":"8","year":"2021","journal-title":"Journal of Contemporary Archival Studies"},{"key":"key2024121103340212700_ref024","unstructured":"Egger, A. (2021), \u201cTranskribus projects at the Vienna city library\u201d, in READ-COOP Success Stories, available at: https:\/\/readcoop.eu\/success-stories\/vienna"},{"key":"key2024121103340212700_ref025","first-page":"1","article-title":"Historical newspaper user interfaces: a review","year":"2019"},{"issue":"6","key":"key2024121103340212700_ref026","doi-asserted-by":"publisher","DOI":"10.16995\/dscn.12","article-title":"Chapter 12: evaluating digital remediations of women's manuscripts","volume":"6","year":"2016","journal-title":"Digital Studies\/Le champ num\u00e9rique, Beyond Accessibility: Textual Studies in the Twenty-First Century"},{"key":"key2024121103340212700_ref027","unstructured":"Ewing, E.T., Gad, S., Hausman, B.L., Kerr, K., Pencek, B. and Ramakrishnan, N. (2014), \u201cBlog post. Mining coverage of the flu: big data's insights into an epidemic\u201d, Perspectives on History (AHA), available at: https:\/\/www.historians.org\/publications-and-directories\/perspectives-on-history\/january-2014\/mining-coverage-of-the-flu-big-datas-insights-into-an-epidemic"},{"issue":"7","key":"key2024121103340212700_ref028","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1108\/jd-11-2021-0221","article-title":"Gender influences in Digital Humanities co-authorship networks","volume":"78","year":"2022","journal-title":"Journal of Documentation"},{"issue":"2","key":"key2024121103340212700_ref029","doi-asserted-by":"publisher","first-page":"1","DOI":"10.17169\/fqs-5.2.607","article-title":"Remodelling grounded theory. Forum qualitative sozialforschung\/forum","volume":"5","year":"2004","journal-title":"Qualitative Social Research"},{"key":"key2024121103340212700_ref030","volume-title":"The Discovery of Grounded Theory Strategies for Qualitative Research","year":"1967"},{"key":"key2024121103340212700_ref097","volume-title":"Historic Newspapers in the Digital Age: \u2018Search All about it\u2019","year":"2017"},{"key":"key2024121103340212700_ref099","doi-asserted-by":"crossref","unstructured":"Gooding, P. (2023), \u201cInformational abundance and material absence in the digitised early modern press: the case for contextual digitisation\u201d, in Brownlees, N. (Ed.), The Edinburgh History of the British and Irish Press, Edinburgh University Press, Edinburgh, Beginnings and Consolidation 1640-1800, Vol.\u00a01, pp. 586-598.","DOI":"10.3366\/edinburgh\/9781474499170.003.0028"},{"key":"key2024121103340212700_ref031","volume-title":"Exploring Big Historical Data, the Historian's Macroscope","year":"2016"},{"key":"key2024121103340212700_ref032","volume-title":"The History Manifesto","year":"2014"},{"key":"key2024121103340212700_ref033","doi-asserted-by":"crossref","unstructured":"Hanson, A. (2017), \u201cNegative case analysis\u201d, in Allen, M. (Ed.), The International Encyclopaedia of Communication Research Methods, Wiley & Son, New York, pp.\u00a01-3.","DOI":"10.1002\/9781118901731.iecrm0165"},{"key":"key2024121103340212700_ref034","first-page":"215","article-title":"Combining human and machine transcriptions on the zooniverse platform","year":"2018"},{"issue":"9","key":"key2024121103340212700_ref035","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1007\/s10502-020-09332-1","article-title":"Of global reach yet of situated contexts: an examination of the implicit and explicit selection criteria that shape digital archives of historical newspapers","volume":"20","year":"2020","journal-title":"Archival Science"},{"key":"key2024121103340212700_ref036","unstructured":"Havens, L. (2020), \u201cBlog post. Exploring collections as data with jupyter notebooks\u201d, National Library of Scotland Data Foundry, available at:, https:\/\/data.nls.uk\/project\/exploring-collections-as-data-with-juypter-notebooks"},{"key":"key2024121103340212700_ref037","unstructured":"Havens, L., Alex, B. and Terras, M. (forthcoming 2024), \u201cConfronting gender biases in heritage catalogues: a natural language processing approach to revisiting descriptive metadata\u201d, in Ashton, J. (Ed.), The Routledge Handbook on Heritage and Gender, Routledge, London."},{"key":"key2024121103340212700_ref038","doi-asserted-by":"crossref","unstructured":"Hodel, T. (2022), \u201cSupervised and unsupervised: approaches to machine learning for textual entities\u201d, in Jaillant, L. (Ed.), Archives, Access and Artificial Intelligence, Bielefeld University Press, Bielefeld, pp.\u00a0157-178.","DOI":"10.1515\/9783839455845-007"},{"key":"key2024121103340212700_ref039","unstructured":"Kaukonen, M. (2021), \u201cImproved text recognition for Finnish historical newspapers with transkribus\u201d, READ-COOP Success Stories, available at: https:\/\/readcoop.eu\/success-stories\/improved-text-recognition-for-finnish-historical-newspapers-with-transkribus\/"},{"key":"key2024121103340212700_ref040","doi-asserted-by":"publisher","volume-title":"Teaching History in the Digital Age","year":"2013","DOI":"10.3998\/dh.12146032.0001.001"},{"key":"key2024121103340212700_ref041","volume-title":"Trading Zones of Digital History","year":"2021"},{"key":"key2024121103340212700_ref042","volume-title":"In from the Cold: an Assessment of the Scope of \u2018Orphan Works\u2019 and its Impact on the Delivery of Services to the Public","year":"2009"},{"key":"key2024121103340212700_ref043","volume-title":"Content Analysis: an Introduction to its Methodology","year":"2004"},{"key":"key2024121103340212700_ref044","article-title":"Transkribus and IIIF: beneficial possibilities between image sharing and handwritten text recognition frameworks","volume-title":"IIIF Conference","year":"2019"},{"key":"key2024121103340212700_ref045","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1910.09700","article-title":"Quantifying the carbon emissions of machine learning","year":"2019","journal-title":"arXiv. Preprint."},{"key":"key2024121103340212700_ref046","volume-title":"Privacy and the Past: Research, Law, Archives, Ethics","year":"2016"},{"issue":"1","key":"key2024121103340212700_ref047","doi-asserted-by":"publisher","first-page":"72","DOI":"10.3366\/jvc.2005.10.1.72","article-title":"Googling the victorians","volume":"10","year":"2005","journal-title":"Journal of Victorian Culture"},{"key":"key2024121103340212700_ref048","unstructured":"Lincoln, M. (2017), \u201cWays of forgetting: the librarian, the historian, and the machine\u201d, in National Forum Position Statements. Always Already Computational: Library Collections as Data National Forum, available at: https:\/\/collectionsasdata.github.io\/aac_positionstatements.pdf"},{"key":"key2024121103340212700_ref049","unstructured":"Marche, S. (2022), \u201c\u2018Our Mission is Crucial\u2019: meet the warrior librarians of Ukraine\u201d, The Guardian, available at: https:\/\/www.theguardian.com\/books\/2022\/dec\/04\/our-mission-is-crucial-meet-the-warrior-librarians-of-ukraine"},{"key":"key2024121103340212700_ref050","article-title":"The FACTS of technology-assisted sensitivity review","year":"2019"},{"key":"key2024121103340212700_ref051","doi-asserted-by":"crossref","unstructured":"McNeill, J.R. (2016), \u201cHistorians, superhistory, and climate change\u201d, in Jarrick, A., Myrdal, J. and Wallenberg Bondesson, M. (Eds), Methods in World History, A Critical Approach, Nordic Academic Press, Lund, pp.\u00a019-43.","DOI":"10.21525\/kriterium.2.b"},{"key":"key2024121103340212700_ref052","first-page":"1","article-title":"The reconfiguration of the archive as data to Be mined","volume":"1","year":"2016","journal-title":"The Journal of Association of Canadian Archivists"},{"key":"key2024121103340212700_ref053","first-page":"1","article-title":"Transkribus for archives or how artificial intelligence is revolutionizing access to historical documents","year":"2023","journal-title":"Deep-L. Pre-print."},{"issue":"50","key":"key2024121103340212700_ref054","first-page":"965","article-title":"Transforming scholarship in the archives through handwriting text recognition, Transkribus as a case study","volume":"75","year":"2019","journal-title":"Journal of Documentation"},{"key":"key2024121103340212700_ref055","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1080\/13688804.2012.752963","article-title":"The digital turn: exploring the methodological possibilities of digital newspaper archives","volume":"19","year":"2013","journal-title":"Media History"},{"key":"key2024121103340212700_ref100","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10502-022-09397-0","article-title":"Understanding the application of handwritten text recognition technology in heritage contexts: a systematic review of Transkribus in published research","volume":"22","year":"2022","journal-title":"Archival Science"},{"key":"key2024121103340212700_ref056","volume-title":"Named Entities for Computational Linguistics","year":"2016"},{"issue":"21-23","key":"key2024121103340212700_ref057","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s11042-021-11399-6","article-title":"Handwritten Kazakh and Russian (hkr) database for text recognition","volume":"80","year":"2021","journal-title":"Multimedia Tools and Applications"},{"issue":"22","key":"key2024121103340212700_ref058","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1086\/710062","article-title":"The crying child: on colonial archives, digitization, and ethics of care in the cultural commons","volume":"61","year":"2020","journal-title":"Current Anthropology"},{"issue":"1","key":"key2024121103340212700_ref059","first-page":"1","article-title":"Sagas in handwritten and printed books in 19th century Iceland","volume":"11","year":"2004","journal-title":"Sagas and Societies"},{"key":"key2024121103340212700_ref060","volume-title":"The Theory and Craft of Digital Preservation","year":"2018"},{"key":"key2024121103340212700_ref061","volume-title":"On a Collections as Data Imperative","year":"2017"},{"key":"key2024121103340212700_ref062","unstructured":"Padilla, T., Allen, L., Frost, H., Potvin, S., Russey Roke, E. and Varner, S. (2019), \u201cFinal report - always already computational: collections as data\u201d, available at: https:\/\/zenodo.org\/record\/3152935#.X6WOf-LPzIU"},{"issue":"1","key":"key2024121103340212700_ref063","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1177\/0022526620985073","article-title":"The carbon footprint of a scientific community: a survey of the historians of mobility and their normalized yet abundant reliance on air travel","volume":"42","year":"2021","journal-title":"The Journal of Transport History"},{"issue":"1","key":"key2024121103340212700_ref064","doi-asserted-by":"publisher","first-page":"56","DOI":"10.15845\/noril.v13i1.3782","article-title":"Teaching information literacy in the humanities: engaging students with primary sources and cultural heritage material","volume":"13","year":"2022","journal-title":"Nordic Journal of Information Literacy in Higher Education"},{"issue":"2","key":"key2024121103340212700_ref065","doi-asserted-by":"publisher","first-page":"377","DOI":"10.1093\/ahr\/121.2.377","article-title":"The transnational and text-searchable: digitized sources and the shadows they cast","volume":"121","year":"2016","journal-title":"The American History Review"},{"key":"key2024121103340212700_ref066","article-title":"Transkribus daily report","author":"READ-COOP","year":"2023"},{"issue":"22","key":"key2024121103340212700_ref067","doi-asserted-by":"publisher","first-page":"4853","DOI":"10.48550\/arXiv.1909.04032","article-title":"OCR4all - an open-source tool providing a (semi-) automatic OCR workflow for historical printings","volume":"9","year":"2019","journal-title":"Applied Sciences"},{"key":"key2024121103340212700_ref068","doi-asserted-by":"crossref","unstructured":"Romein, C.A., Hodel, T., Gordijn, F., Zundert, J.J.V., Chagu\u00e9, A., Lange, M.V., Jensen, H.S., Stauder, A., Purcell, J., Terras, M., Heuvel, P., van den, Keijzer, C., Rabus, A., Sitaram, C., Bhatia, A., Depuydt, K., Afolabi-Adeolu, M.A., Anikina, A., Bastianello, E., Benzinger, L.V., Bosse, A., Brown, D., Charlton, A., Dannevig, A.N., Gelder, K.V., Go, S.C.P.J., Goh, M.J.C., Gstrein, S., Hasan, S., Heide, S.V.D., Hindermann, M., Huff, D., Huysman, I., Idris, A., Keijzer, L., Kemper, S., Koenders, S., Kuijpers, E., R\u00f8nsig Larsen, L., Lepa, S., Link, T.O., Nispen, A., van, Nockels, J., Noort, L.M.V., Oosterhuis, J.J., Popken, V., Estrella Puertollano, M., Puusaag, J.J., Sheta, A., Stoop, L., Strutzenbladh, E., Sijs, N.V.D., Spek, J.P.V.D., Trouw, B.B., Van Synghel, G., Vu\u010dkovi\u0107, V., Wilbrink, H., Weiss, S., Wrisley, D.J. and Zweistra, R. (2024), \u201cExploring data provenance in handwritten text recognition infrastructure: sharing and reusing ground truth data, referencing models, and acknowledging contributions. Starting the conversation on how we could get it done.\u201d Journal of Data Mining and Digital Humanities. Special Issue: Historical Documents and automatic text recognition, pp. 1-26. doi: 10.46298\/jdmdh.10403.","DOI":"10.46298\/jdmdh.10403"},{"issue":"3","key":"key2024121103340212700_ref069","doi-asserted-by":"publisher","first-page":"735","DOI":"10.1086\/ahr\/108.3.735","article-title":"Scarcity or abundance? Preserving the past in a Digital Era. The American historical review","volume":"108","year":"2003","journal-title":"The American Historical Review"},{"issue":"3","key":"key2024121103340212700_ref070","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3487917","article-title":"Speculative design as a collaborative practice: ameliorating the consequences of illiteracy through digital touch","volume":"29","year":"2022","journal-title":"ACM Transactions on. Computer-Human Interaction"},{"key":"key2024121103340212700_ref071","volume-title":"History of OCR, Optical Character Recognition","year":"1982"},{"key":"key2024121103340212700_ref072","doi-asserted-by":"crossref","unstructured":"Schomaker, L. (2019), \u201cLifelong learning for text retrieval and recognition in historical handwritten document collections\u201d, in Fischer, A., Liwicki, M. and Ingold, R. (Eds), Handwritten Historical Document Analysis, Recognition and Retrieval \u2013 State of the Art and Future Trends, World Scientific, London, pp.\u00a0221-248.","DOI":"10.1142\/9789811203244_0012"},{"key":"key2024121103340212700_ref073","volume-title":"Copyright and E-Learning: A Guide for Practitioners","year":"2016","edition":"2nd ed."},{"key":"key2024121103340212700_ref074","volume-title":"The Social Life of Information","year":"2000"},{"key":"key2024121103340212700_ref075","first-page":"495","article-title":"Automated metadata extraction: challenges and opportunities","year":"2022"},{"key":"key2024121103340212700_ref076","unstructured":"Smith, J. (2021), \u201cBlog post, Palladium: appraisal and sensitivity review of the Carcanet email archive\u201d, John Rylands Research Institute and Library, available at: https:\/\/rylandscollections.com\/2021\/05\/28\/palladium-appraisal-and-sensitivity-review-of-the-carcanet-email-archive\/"},{"key":"key2024121103340212700_ref077","article-title":"Recording. The next generation of Transkribus","year":"2022"},{"key":"key2024121103340212700_ref078","article-title":"Invitation: ChatGPT and transkribus - members meeting","year":"2023"},{"issue":"5","key":"key2024121103340212700_ref079","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1007\/BF02435632","article-title":"Colonial archives and the Arts of governance","volume":"2","year":"2002","journal-title":"Archival Science"},{"key":"key2024121103340212700_ref080","volume-title":"Along the Archival Grain","year":"2008"},{"key":"key2024121103340212700_ref081","unstructured":"Strauss, T., Weidemann, M. and Labahn, R. (2017), \u201cD7.11 Language Models - improving transcriptions by external language resource\u201d, Innsbruck: Recognition and Enrichment of Archival Documents (READ), available at: https:\/\/readcoop.eu\/wp-content\/uploads\/2017\/12\/D7.11_final.pdf"},{"key":"key2024121103340212700_ref082","article-title":"Energy and policy considerations for deep learning in NLP","year":"2019"},{"key":"key2024121103340212700_ref106","year":"2023"},{"issue":"7-8","key":"key2024121103340212700_ref083","doi-asserted-by":"publisher","DOI":"10.1045\/july2009-munoz","article-title":"Measuring mass text digitization quality and usefulness. Lessons learned from assessing the OCR accuracy of the British library's 19th century online newspaper archive","volume":"15","year":"2009","journal-title":"D-Lib Magazine"},{"key":"key2024121103340212700_ref098","doi-asserted-by":"crossref","unstructured":"Terras, M. (2022), \u201cInviting AI into the archives: the reception of handwritten recognition technology into historical manuscript transcription\u201d, in Jaillaint, L. (Ed.), Archives, Access and Artificial Intelligence: Working with Born-Digital and Digitized Archival Collections, Bielefeld University Press, Bielefeld, pp. 179-204.","DOI":"10.1515\/9783839455845-008"},{"key":"key2024121103340212700_ref101","first-page":"1","article-title":"On automating standardised editions: the affordances of handwritten text recognition platforms for scholarly editing","year":"2024","journal-title":"Scholarly Editing"},{"key":"key2024121103340212700_ref084","doi-asserted-by":"crossref","unstructured":"Thomas, W.G. III (2004), \u201cComputing and the historical imagination\u201d, in Schreibman, S., Siemens, R. and Unsworth, J. (Eds), A Companion to the Digital Humanities, Wiley & Sons, New York, pp.\u00a056-68.","DOI":"10.1111\/b.9781405103213.2004.00008.x"},{"key":"key2024121103340212700_ref085","volume-title":"The Politics of Mass Digitization","year":"2019"},{"key":"key2024121103340212700_ref086","doi-asserted-by":"publisher","first-page":"1","DOI":"10.48550\/arXiv.2110.04075","article-title":"KOHTD: Kazakh offline handwritten text dataset. Signal processing","volume":"108","year":"2022","journal-title":"Image Communication"},{"issue":"2","key":"key2024121103340212700_ref087","doi-asserted-by":"publisher","first-page":"176","DOI":"10.17723\/aarc.65.2.920w65g3217706l1","article-title":"A comparison of Jenkinson and Schellenberg on appraisal","volume":"65","year":"2002","journal-title":"The American Archivist"},{"key":"key2024121103340212700_ref088","unstructured":"Turkel, W.J., Kee, K. and Roberts, S. (2012), \u201cA method for navigating the infinite archive\u201d, in Weller, T. (Ed.), History in the Digital Age, Routledge, London, pp.\u00a057-72."},{"key":"key2024121103340212700_ref089","volume-title":"Cataloguing Culture: Legacies of Colonialism in Museum Documentation","year":"2020"},{"issue":"2","key":"key2024121103340212700_ref090","first-page":"1","article-title":"A genealogy of distant reading","volume":"11","year":"2017","journal-title":"Digital Humanities Quarterly"},{"key":"key2024121103340212700_ref091","unstructured":"Unsworth, J. and Tupman, C. (2016), \u201cInterview with John Unsworth, April 2011, carried out and transcribed by Charlotte Tupman\u201d, in Deegan, M. and McCarty, W. (Eds), Collaborative Research in the Digital Humanities, Routledge, London, pp.\u00a0231-240."},{"key":"key2024121103340212700_ref092","doi-asserted-by":"publisher","article-title":"Data augmentation and text recognition on Khmer historical manuscripts","year":"2020","DOI":"10.1109\/ICFHR2020.2020.00024\\"},{"key":"key2024121103340212700_ref093","volume-title":"Emotion Imprints of War: A Computer Assisted Analysis of Emotions in Dutch Parliamentary Debates, 1945-1989","year":"2023"},{"key":"key2024121103340212700_ref094","doi-asserted-by":"crossref","unstructured":"Vu, M.T., Le, V.L. and Beurton-Aimar, M. (2021), \u201cIHR-NomDB: the old degraded Vietnamese handwritten script archive database\u201d, in Elisa, B., Wen, G., Steffan, B. and Yong, M. (Eds), Document Analysis and Recognition - ICDAR 2021, Lecture Notes in Computer Science, Springer International Publishing, Cham, pp.\u00a085-99.","DOI":"10.1007\/978-3-030-86334-0_6"},{"issue":"1","key":"key2024121103340212700_ref095","first-page":"1","article-title":"Generous interfaces for digital cultural collections","volume":"9","year":"2015","journal-title":"Digital Humanities Quarterly"},{"issue":"1","key":"key2024121103340212700_ref096","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR Guiding Principles for scientific data management and stewardship","volume":"3","year":"2016","journal-title":"Scientific Data"},{"issue":"15","key":"key2024121103340212700_ref102","doi-asserted-by":"publisher","first-page":"332","DOI":"10.1353\/bh.2015.0007","article-title":"The history of archives: the state of the discipline","volume":"18","year":"2015","journal-title":"Book History"},{"issue":"2","key":"key2024121103340212700_ref103","doi-asserted-by":"publisher","first-page":"830","DOI":"10.1093\/llc\/fqac050\/6702047","article-title":"Digital history and the politics of digitization","volume":"38","year":"2019","journal-title":"Digital Scholarship in the Humanities"},{"key":"key2024121103340212700_ref105","unstructured":"GitHub (2023), \u201cText recognition for zooniverse\u201d, available at: https:\/\/github.com\/danhan52\/text_recognition"}],"container-title":["Journal of Documentation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-09-2023-0183\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-09-2023-0183\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:35:04Z","timestamp":1753396504000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/jd\/article\/80\/7\/148-167\/1235679"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,18]]},"references-count":105,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2024,4,18]]},"published-print":{"date-parts":[[2024,12,16]]}},"alternative-id":["10.1108\/JD-09-2023-0183"],"URL":"https:\/\/doi.org\/10.1108\/jd-09-2023-0183","relation":{},"ISSN":["0022-0418"],"issn-type":[{"value":"0022-0418","type":"print"}],"subject":[],"published":{"date-parts":[[2024,4,18]]}}}