{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:32:12Z","timestamp":1750221132910,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":18,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,5,8]],"date-time":"2019-05-08T00:00:00Z","timestamp":1557273600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"publisher","award":["409784275"],"award-info":[{"award-number":["409784275"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,5,8]]},"DOI":"10.1145\/3322905.3322916","type":"proceedings-article","created":{"date-parts":[[2019,10,23]],"date-time":"2019-10-23T15:44:57Z","timestamp":1571845497000},"page":"3-8","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Labelling OCR Ground Truth for Usage in Repositories"],"prefix":"10.1145","author":[{"given":"Matthias","family":"Boenig","sequence":"first","affiliation":[{"name":"Berlin-Brandenburg Academy of Sciences and Humanities, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Konstantin","family":"Baierer","sequence":"additional","affiliation":[{"name":"Berlin State Library - Prussian Cultural Heritage, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Volker","family":"Hartmann","sequence":"additional","affiliation":[{"name":"Karlsruhe Institute of Technology (KIT), Steinbuch Centre for Computing, Karlsruhe, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Maria","family":"Federbusch","sequence":"additional","affiliation":[{"name":"Berlin State Library - Prussian Cultural Heritage, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Clemens","family":"Neudecker","sequence":"additional","affiliation":[{"name":"Berlin State Library - Prussian Cultural Heritage, Berlin, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2019,5,8]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"[n. d.]. The BagIt File Packaging Format (V1.0). https:\/\/tools.ietf.org\/html\/draft-kunze-bagit-16. Accessed: 2019-01-13.  [n. d.]. The BagIt File Packaging Format (V1.0). https:\/\/tools.ietf.org\/html\/draft-kunze-bagit-16. Accessed: 2019-01-13."},{"key":"e_1_3_2_1_2_1","unstructured":"[n. d.]. Metadata Encoding Transmission Standard (METS). http:\/\/www.loc.gov\/standards\/mets\/. Accessed: 2019-01-13.  [n. d.]. Metadata Encoding Transmission Standard (METS). http:\/\/www.loc.gov\/standards\/mets\/. Accessed: 2019-01-13."},{"key":"e_1_3_2_1_3_1","unstructured":"[n. d.]. Metadata Object Description Schema (MODS). http:\/\/www.loc.gov\/standards\/mods\/. Accessed: 2019-01-13.  [n. d.]. Metadata Object Description Schema (MODS). http:\/\/www.loc.gov\/standards\/mods\/. Accessed: 2019-01-13."},{"key":"e_1_3_2_1_4_1","unstructured":"[n. d.]. OCRD-ZIP. https:\/\/ocr-d.github.io\/ocrd_zip. Accessed: 2019-01-13.  [n. d.]. OCRD-ZIP. https:\/\/ocr-d.github.io\/ocrd_zip. Accessed: 2019-01-13."},{"key":"e_1_3_2_1_5_1","unstructured":"[n. d.]. The online repository: Europeana Newspapers Project Dataset (ENP) . https:\/\/www.primaresearch.org\/repository\/index\/ENP. Accessed: 2019-01-13.  [n. d.]. The online repository: Europeana Newspapers Project Dataset (ENP) . https:\/\/www.primaresearch.org\/repository\/index\/ENP. Accessed: 2019-01-13."},{"key":"e_1_3_2_1_6_1","unstructured":"[n. d.]. Richtlinien zur Transkription f\u00fcr Ground Truth. https:\/\/ocr-d.github.io\/gt\/\/trans_documentation\/index.html. Accessed: 2019-01-13.  [n. d.]. Richtlinien zur Transkription f\u00fcr Ground Truth. https:\/\/ocr-d.github.io\/gt\/\/trans_documentation\/index.html. Accessed: 2019-01-13."},{"key":"e_1_3_2_1_7_1","unstructured":"Andy Boyko J Kunze J Littman L Madden and B Vargas. 2011. The bagit file packaging format (v0. 97). Washington DC (2011).  Andy Boyko J Kunze J Littman L Madden and B Vargas. 2011. The bagit file packaging format (v0. 97). Washington DC (2011)."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/DAS.2018.46"},{"volume-title":"Aletheia - An Advanced Document Layout and Text Ground-Truthing System for Production Environments. In 2011 International Conference on Document Analysis and Recognition. 48--52","year":"2011","author":"Clausner C.","key":"e_1_3_2_1_9_1"},{"key":"e_1_3_2_1_10_1","unstructured":"Polzin Christian Federbusch Maria and Thomas St\u00e4cker. 2014. Volltext via OCR - M\u00f6glichkeiten und Grenzen. Beitr\u00e4ge aus der Staatsbibliothek zu Berlin - Preu\u00dfischer Kulturbesitz Vol. 43. Staatsbibliothek zu Berlin - Preu\u00dfischer Kulturbesitz. http:\/\/staatsbibliothek-berlin.de\/fileadmin\/user_upload\/zentrale_Seiten\/historische_drucke\/pdf\/SBB_OCR_STUDIE_WEBVERSION_Final.pdf  Polzin Christian Federbusch Maria and Thomas St\u00e4cker. 2014. Volltext via OCR - M\u00f6glichkeiten und Grenzen. Beitr\u00e4ge aus der Staatsbibliothek zu Berlin - Preu\u00dfischer Kulturbesitz Vol. 43. Staatsbibliothek zu Berlin - Preu\u00dfischer Kulturbesitz. http:\/\/staatsbibliothek-berlin.de\/fileadmin\/user_upload\/zentrale_Seiten\/historische_drucke\/pdf\/SBB_OCR_STUDIE_WEBVERSION_Final.pdf"},{"key":"e_1_3_2_1_11_1","unstructured":"Thomas Jejkal Alexander Vondrous Andreas Kopmann Rainer Stotzka and Volker Hartmann. 2014. KIT Data Manager: The Repository Architecture Enabling Cross-Disciplinary Research. Karlsruhe 9--11.  Thomas Jejkal Alexander Vondrous Andreas Kopmann Rainer Stotzka and Volker Hartmann. 2014. KIT Data Manager: The Repository Architecture Enabling Cross-Disciplinary Research. Karlsruhe 9--11."},{"volume-title":"1st International Workshop on Open Services and Tools for Document Analysis, 14th IAPR International Conference on Document Analysis and Recognition, OST@ICDAR","year":"2017","author":"Kahle Philip","key":"e_1_3_2_1_12_1"},{"key":"e_1_3_2_1_13_1","unstructured":"Sebastian Meyer. [n. d.].  Sebastian Meyer. [n. d.]."},{"key":"e_1_3_2_1_14_1","unstructured":"[n.d.]. 1500. Historia. Mathis Hupfuff. http:\/\/resolver.staatsbibliothek-berlin.de\/SBB0000A94200000000  [n.d.]. 1500. Historia. Mathis Hupfuff. http:\/\/resolver.staatsbibliothek-berlin.de\/SBB0000A94200000000"},{"volume-title":"The PAGE (Page Analysis and Ground-Truth Elements) Format Framework. In 2010 20th International Conference on Pattern Recognition. 257--260","year":"2010","author":"Pletschacher S.","key":"e_1_3_2_1_15_1"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Ajinkya Prabhune Rainer Stotzka Vaibhav Sakharkar J\u00fcrgen W. Hesser and Michael Gertz. 2018. MetaStore: an adaptive metadata management framework for heterogeneous metadata models. Distributed and parallel databases 36 1 (2018) 153--194. https:\/\/doi.org\/10.1007\/s10619-017-7210-4  Ajinkya Prabhune Rainer Stotzka Vaibhav Sakharkar J\u00fcrgen W. Hesser and Michael Gertz. 2018. MetaStore: an adaptive metadata management framework for heterogeneous metadata models. Distributed and parallel databases 36 1 (2018) 153--194. https:\/\/doi.org\/10.1007\/s10619-017-7210-4","DOI":"10.1007\/s10619-017-7210-4"},{"key":"e_1_3_2_1_17_1","unstructured":"David Smith and Ryan Cordell. 2018. A Research Agenda for Historical and Multilingual Optical Character Recognition. Mathis Hupfuff. http:\/\/hdl.handle.net\/2047\/D20297452  David Smith and Ryan Cordell. 2018. A Research Agenda for Historical and Multilingual Optical Character Recognition. Mathis Hupfuff. http:\/\/hdl.handle.net\/2047\/D20297452"},{"key":"e_1_3_2_1_18_1","unstructured":"Christoph Stollwerk. 2016. Machbarkeitsstudie zu Einsatzm\u00f6glichkeiten von OCR-Software im Bereich \"Alter Drucke\" zur Vorbereitung einer vollst\u00e4ndigen Digitalisierung deutscher Druckerzeugnisse zwischen 1500 und 1930. DARIAH-DE working papers Vol. 16. GOEDOC Dokumenten- und Publikationsserver der Georg-August-Universit\u00e4t G\u00f6ttingen. http:\/\/nbn-resolving.de\/urn:nbn:de:gbv:7-dariah-2016-2-8  Christoph Stollwerk. 2016. Machbarkeitsstudie zu Einsatzm\u00f6glichkeiten von OCR-Software im Bereich \"Alter Drucke\" zur Vorbereitung einer vollst\u00e4ndigen Digitalisierung deutscher Druckerzeugnisse zwischen 1500 und 1930. DARIAH-DE working papers Vol. 16. GOEDOC Dokumenten- und Publikationsserver der Georg-August-Universit\u00e4t G\u00f6ttingen. http:\/\/nbn-resolving.de\/urn:nbn:de:gbv:7-dariah-2016-2-8"}],"event":{"name":"DATeCH2019: 3rd International Conference on Digital Access to Textual Cultural Heritage","acronym":"DATeCH2019","location":"Brussels Belgium"},"container-title":["Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3322905.3322916","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3322905.3322916","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T01:02:26Z","timestamp":1750208546000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3322905.3322916"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,5,8]]},"references-count":18,"alternative-id":["10.1145\/3322905.3322916","10.1145\/3322905"],"URL":"https:\/\/doi.org\/10.1145\/3322905.3322916","relation":{},"subject":[],"published":{"date-parts":[[2019,5,8]]},"assertion":[{"value":"2019-05-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}