{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:10:18Z","timestamp":1750219818709,"version":"3.41.0"},"reference-count":47,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2023,3,31]],"date-time":"2023-03-31T00:00:00Z","timestamp":1680220800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Comput. Cult. Herit."],"published-print":{"date-parts":[[2023,3,31]]},"abstract":"<jats:p>The Resolutions of the Dutch States General (1576\u20131796) is an archive covering over two centuries of decision making and consists of a heterogeneous series of handwritten and printed documents. The archive, which has recently been digitised, is a rich source for historical research. However, owing to the archive\u2019s heterogeneity and dispersion of information, historians and other researchers find it hard to use the archive for their research.<\/jats:p>\n          <jats:p>In this article, we describe how we deal with the challenges of structuring and connecting the information in this archive. We focus on identifying the existing structural elements, to turn the archive from a set of pages into a set of meeting dates and individual resolutions, with rich metadata for each resolution. To deal with the challenges of historical language change, spelling variation, and text recognition mistakes, we exploit the repetitive nature of the language of the resolutions and use fuzzy string searching to identify structural elements by the formulaic expressions that signal their boundaries. We also discuss and provide an analysis of the value of extracting different types of entities from the text and argue that the choice of which types of entities to focus on should be made based on how they support relevant research questions and methods. In the resolutions, we choose to prioritise person qualifications such as profession, legal status, or title, over person names. Qualifications allow users to select certain groups of people and to meaningfully combine with other layers of metadata, whereas person names lack contextual information to disambiguate them, making it unclear which and how many persons are referred to by selecting a specific person name. We show how our methodology results in a computational platform that allows users to explore and analyse the archive through many connected layers of metadata.<\/jats:p>","DOI":"10.1145\/3575864","type":"journal-article","created":{"date-parts":[[2023,3,16]],"date-time":"2023-03-16T12:13:14Z","timestamp":1678968794000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["The Value of Preexisting Structures for Digital Access: Modelling the Resolutions of the Dutch States General"],"prefix":"10.1145","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0301-2029","authenticated-orcid":false,"given":"Marijn","family":"Koolen","sequence":"first","affiliation":[{"name":"Huygens Institute for the History of the Netherlands, Netherlands"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6951-8014","authenticated-orcid":false,"given":"Rik","family":"Hoekstra","sequence":"additional","affiliation":[{"name":"Huygens Institute for the History of the Netherlands, Netherlands"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0518-2136","authenticated-orcid":false,"given":"Joris","family":"Oddens","sequence":"additional","affiliation":[{"name":"Huygens Institute for the History of the Netherlands, Netherlands"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2896-9986","authenticated-orcid":false,"given":"Ronald","family":"Sluijter","sequence":"additional","affiliation":[{"name":"Huygens Institute for the History of the Netherlands, Netherlands"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6535-2849","authenticated-orcid":false,"given":"Rutger","family":"Van Koert","sequence":"additional","affiliation":[{"name":"KNAW Humanities Cluster - Department of Digital Infrastructure, Netherlands"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3588-6094","authenticated-orcid":false,"given":"Gijsjan","family":"Brouwer","sequence":"additional","affiliation":[{"name":"KNAW Humanities Cluster - Department of Digital Infrastructure, Netherlands"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8402-1831","authenticated-orcid":false,"given":"Hennie","family":"Brugman","sequence":"additional","affiliation":[{"name":"KNAW Humanities Cluster - Department of Digital Infrastructure, Netherlands"}]}],"member":"320","published-online":{"date-parts":[[2023,6]]},"reference":[{"key":"e_1_3_2_2_2","article-title":"The Getty end-user online searching project in the humanities; Report No. 6: Overview and conclusions","volume":"57","author":"Bates Marcia J.","year":"1996","unstructured":"Marcia J. Bates. 1996. The Getty end-user online searching project in the humanities; Report No. 6: Overview and conclusions. Coll. Res. Libraries 57 (1996).","journal-title":"Coll. Res. Libraries"},{"key":"e_1_3_2_3_2","first-page":"1","volume-title":"Proceedings of the Workshop on Language Technology for Cultural Heritage Data (LaTeCH\u201907).","author":"Borin Lars","year":"2007","unstructured":"Lars Borin, Dimitrios Kokkinakis, and Leif-J\u00f6ran Olsson. 2007. Naming the past: Named entity and animacy recognition in 19th Century Swedish literature. In Proceedings of the Workshop on Language Technology for Cultural Heritage Data (LaTeCH\u201907).1\u20138."},{"key":"e_1_3_2_4_2","first-page":"1","volume-title":"Proceedings of the Conference and Labs of the Evaluation Forum (CLEF\u201920)","volume":"2696","author":"Boros Emanuela","year":"2020","unstructured":"Emanuela Boros, Elvys Linhares Pontes, Luis Adri\u00e1n Cabrera-Diego, Ahmed Hamdi, Jos\u00e9 Moreno, Nicolas Sid\u00e8re, and Antoine Doucet. 2020. Robust named entity recognition and linking on historical multilingual documents. In Proceedings of the Conference and Labs of the Evaluation Forum (CLEF\u201920), Vol. 2696. CEUR-WS Working Notes, 1\u201317."},{"key":"e_1_3_2_5_2","volume-title":"Social Research Methods","author":"Bryman Alan","year":"2016","unstructured":"Alan Bryman. 2016. Social Research Methods. Oxford University Press, Oxford, UK."},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1108\/JD-09-2017-0133"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/JCDL.2017.7991582"},{"key":"e_1_3_2_8_2","doi-asserted-by":"crossref","unstructured":"Eunsol Choi Omer Levy Yejin Choi and Luke Zettlemoyer. 2018. Ultra-fine entity typing. Retrieved from https:\/\/arXiv:1807.04905.","DOI":"10.18653\/v1\/P18-1009"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2019.00245"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.3389\/fdigh.2019.00004"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1002\/asi.24399"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.5555\/2632841"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.17723\/aarc.66.1.l375uj047224737n"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1086\/lq.72.4.40039793"},{"key":"e_1_3_2_15_2","unstructured":"Maud Ehrmann Ahmed Hamdi Elvys Linhares Pontes Matteo Romanello and Antoine Doucet. 2021. Named entity recognition and classification on historical documents: A survey. Retrieved from https:\/\/arXiv:2109.11406."},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.18352\/bmgn-lchr.9350"},{"key":"e_1_3_2_17_2","unstructured":"Dan Gillick Nevena Lazic Kuzman Ganchev Jesse Kirchner and David Huynh. 2014. Context-dependent fine-grained entity type tagging. Retrieved from https:\/\/arXiv:1412.1820."},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1108\/JD-10-2014-0149"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-63438-X_2"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1086\/383353"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1093\/llc\/fqz024"},{"key":"e_1_3_2_22_2","unstructured":"Rik Hoekstra. 2017. The griffiers and the keeping of information in the Resolutions of the States General of the United Dutch Provinces 1576\u20131796. Retrieved from https:\/\/www.researchgate.net\/publication\/352679752."},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1080\/01615440.2018.1484676"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.5555\/1814490.1814506"},{"volume-title":"Resoluti\u00ebn der Staten-Generaal 1576\u20131625","author":"Japikse N.","key":"e_1_3_2_25_2","unstructured":"N. Japikseet al. 1915\u20131994. Resoluti\u00ebn der Staten-Generaal 1576\u20131625. Nijhof, Den Haag."},{"key":"e_1_3_2_26_2","first-page":"54","volume-title":"Schetsboek digitale onderzoek-omgeving en dienstverlening: Van vraag naar experiment","author":"Jeurgens Charles","year":"2016","unstructured":"Charles Jeurgens. 2016. Schurende systemen: Seriearchieven in de digitale wereld. In Schetsboek digitale onderzoek-omgeving en dienstverlening: Van vraag naar experiment, H. Berende, K. van der Heiden, T. Thomassen, C. Jeurgens, C. van der Ven, and H. de Man (Eds.). Stichting Archiefpublicaties, \u2019s-Gravenhage, 54\u201361."},{"key":"e_1_3_2_27_2","volume-title":"New Drugs for the Dutch Republic: The Commodification of Fever Remedies in the Netherlands (c. 1650\u20131800)","author":"Klein Wouter","year":"2018","unstructured":"Wouter Klein. 2018. New Drugs for the Dutch Republic: The Commodification of Fever Remedies in the Netherlands (c. 1650\u20131800). Ph.D. Dissertation. Utrecht University."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1203"},{"issue":"4","key":"e_1_3_2_29_2","article-title":"Mining embodied emotions: A comparative analysis of bodily emotion expressions in dutch theatre texts 1600\u20131800","volume":"11","author":"Leemans I. B.","year":"2017","unstructured":"I. B. Leemans, E. Maks, J. M. van der Zwaan, H. M. E. P. Kuijpers, and Kristine Steenbergh. 2017. Mining embodied emotions: A comparative analysis of bodily emotion expressions in dutch theatre texts 1600\u20131800. Dig. Human. Quart. 11, 4 (2017).","journal-title":"Dig. Human. Quart."},{"key":"e_1_3_2_30_2","volume-title":"Proceedings of the 26th AAAI Conference on Artificial Intelligence","author":"Ling Xiao","year":"2012","unstructured":"Xiao Ling and Daniel S. Weld. 2012. Fine-grained entity recognition. In Proceedings of the 26th AAAI Conference on Artificial Intelligence."},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.3233\/SW-140158"},{"key":"e_1_3_2_32_2","first-page":"159","volume-title":"Proceedings of the 2nd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature","author":"Opitz Juri","year":"2018","unstructured":"Juri Opitz, Leo Born, and Vivi Nastase. 2018. Induction of a large-scale knowledge graph from the Regesta Imperii. In Proceedings of the 2nd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature. 159\u2013168."},{"issue":"4","key":"e_1_3_2_33_2","first-page":"78","article-title":"Digital historical research: Context, concepts and the need for reflection","volume":"128","author":"Piersma Hinke","year":"2013","unstructured":"Hinke Piersma and Kees Ribbens. 2013. Digital historical research: Context, concepts and the need for reflection. BMGN-Low Count. Hist. Rev. 128, 4 (2013), 78\u2013102.","journal-title":"BMGN-Low Count. Hist. Rev."},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.5555\/3019244"},{"issue":"1","key":"e_1_3_2_35_2","first-page":"8","article-title":"Historical models and serial sources","volume":"4","author":"Piotrowski Michael","year":"2019","unstructured":"Michael Piotrowski. 2019. Historical models and serial sources. J. Eur. Period. Stud. 4, 1 (2019), 8\u201318.","journal-title":"J. Eur. Period. Stud."},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2019.00054"},{"key":"e_1_3_2_37_2","unstructured":"Lorenzo Quir\u00f3s. 2017. P2PaLA: Page to PAGE Layout Analysis toolkit. Retrieved from https:\/\/github.com\/lquirosd\/P2PaLA."},{"key":"e_1_3_2_38_2","volume-title":"De griffie van hare hoog mogenden: Bijdrage tot de skennis van het archief van de Staten-Generaal der Vereenigde Nederlanden","author":"Riemsdijk Theodorus Helenus Franciscus","year":"1885","unstructured":"Theodorus Helenus Franciscus Riemsdijk. 1885. De griffie van hare hoog mogenden: Bijdrage tot de skennis van het archief van de Staten-Generaal der Vereenigde Nederlanden. M. Nijhoff."},{"key":"e_1_3_2_39_2","first-page":"410","volume-title":"Proceedings of the Konferenz zur Verarbeitung nat\u00fcrlicher Sprache\/Conference on Natural Language Processing (KONVENS\u201912)","author":"Rodriquez Kepa Joseba","year":"2012","unstructured":"Kepa Joseba Rodriquez, Mike Bryant, Tobias Blanke, and Magdalena Luszczynska. 2012. Comparison of named entity recognition tools for raw OCR text. In Proceedings of the Konferenz zur Verarbeitung nat\u00fcrlicher Sprache\/Conference on Natural Language Processing (KONVENS\u201912). 410\u2013414."},{"key":"e_1_3_2_40_2","article-title":"The datafication of early modern ordinances","volume":"2","author":"Romein C. Annemieke","year":"2020","unstructured":"C. Annemieke Romein, Michel de Gruijter, and Sara Floor Veldhoen. 2020. The datafication of early modern ordinances. DH Benelux J. 2 (2020).","journal-title":"DH Benelux J."},{"key":"e_1_3_2_41_2","volume-title":"Proceedings of the Conference and Labs of the Evaluation Forum (CLEF\u201920)","author":"Su\u00e1rez Pedro Javier Ortiz","year":"2020","unstructured":"Pedro Javier Ortiz Su\u00e1rez, Yoann Dupont, Ga\u00ebl Lejeune, and Tian Tian. 2020. SinNer@Clef-Hipe2020: Sinful adaptation of SotA models for named entity recognition in French and German. In Proceedings of the Conference and Labs of the Evaluation Forum (CLEF\u201920)."},{"key":"e_1_3_2_42_2","volume-title":"Onderzoeksgids: Instrumenten van de macht: de Staten-Generaal en hun archieven 1576\u20131796 (Band 1)","author":"Thomassen Theo","year":"2019","unstructured":"Theo Thomassen. 2019. Onderzoeksgids: Instrumenten van de macht: de Staten-Generaal en hun archieven 1576\u20131796 (Band 1). Sidestone Press. 426 pages."},{"key":"e_1_3_2_43_2","volume-title":"Onderzoeksgids: Instrumenten van de macht: de Staten-Generaal en hun archieven 1576\u20131796 (Band 2)","author":"Thomassen Theo","year":"2019","unstructured":"Theo Thomassen. 2019. Onderzoeksgids: Instrumenten van de macht: de Staten-Generaal en hun archieven 1576\u20131796 (Band 2). Sidestone Press. 426 pages."},{"key":"e_1_3_2_44_2","volume-title":"Proceedings of the Conference and Labs of the Evaluation Forum (CLEF\u201920)","author":"Todorov Konstantin","year":"2020","unstructured":"Konstantin Todorov and Giovanni Colavizza. 2020. Transfer learning for named entity recognition in historical corpora. In Proceedings of the Conference and Labs of the Evaluation Forum (CLEF\u201920)."},{"key":"e_1_3_2_45_2","volume-title":"Recordkeeping Informatics for a Networked Age","author":"Upward Frank","year":"2018","unstructured":"Frank Upward, Barbara Reed, Gillian Oliver, and Joanne Evans. 2018. Recordkeeping Informatics for a Networked Age. Monash University."},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.5220\/0009169004840496"},{"key":"e_1_3_2_47_2","volume-title":"Proceedings of the Workshop on Ontology Learning","author":"Vargas-Vera Maria","year":"2001","unstructured":"Maria Vargas-Vera, John Domingue, Yannis Kalfoglou, Enrico Motta, and Simon Buckingham Shum. 2001. Template driven information extraction for populating ontologies. In Proceedings of the Workshop on Ontology Learning."},{"key":"e_1_3_2_48_2","first-page":"1361","volume-title":"Proceedings of the International Conference on Computational Linguistics (COLING\u201912)","author":"Yosef Mohamed Amir","year":"2012","unstructured":"Mohamed Amir Yosef, Sandro Bauer, Johannes Hoffart, Marc Spaniol, and Gerhard Weikum. 2012. Hyena: Hierarchical type classification for entity names. In Proceedings of the International Conference on Computational Linguistics (COLING\u201912). 1361\u20131370."}],"container-title":["Journal on Computing and Cultural Heritage"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3575864","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3575864","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:46:12Z","timestamp":1750178772000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3575864"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,31]]},"references-count":47,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,3,31]]}},"alternative-id":["10.1145\/3575864"],"URL":"https:\/\/doi.org\/10.1145\/3575864","relation":{},"ISSN":["1556-4673","1556-4711"],"issn-type":[{"type":"print","value":"1556-4673"},{"type":"electronic","value":"1556-4711"}],"subject":[],"published":{"date-parts":[[2023,3,31]]},"assertion":[{"value":"2021-11-08","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-09-09","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-06-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}