{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,21]],"date-time":"2025-12-21T07:11:43Z","timestamp":1766301103308,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":28,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,9,20]],"date-time":"2019-09-20T00:00:00Z","timestamp":1568937600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"publisher","award":["274863866"],"award-info":[{"award-number":["274863866"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,9,20]]},"DOI":"10.1145\/3352631.3352638","type":"proceedings-article","created":{"date-parts":[[2019,10,18]],"date-time":"2019-10-18T12:57:15Z","timestamp":1571403435000},"page":"25-30","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["okralact - a multi-engine Open Source OCR training system"],"prefix":"10.1145","author":[{"given":"Konstantin","family":"Baierer","sequence":"first","affiliation":[{"name":"Staatsbibliothek zu Berlin, Preu\u00dfischer Kulturbesitz"}]},{"given":"Rui","family":"Dong","sequence":"additional","affiliation":[{"name":"Khoury College of Computer Sciences, Northeastern University"}]},{"given":"Clemens","family":"Neudecker","sequence":"additional","affiliation":[{"name":"Staatsbibliothek zu Berlin, Preu\u00dfischer Kulturbesitz"}]}],"member":"320","published-online":{"date-parts":[[2019,9,20]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3322905.3322916"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-7908-2604-3_16"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.5555\/3104322.3104338"},{"volume-title":"Document Recognition and Retrieval XV","author":"Breuel Thomas M.","key":"e_1_3_2_1_4_1","unstructured":"Thomas M. Breuel . 2008. The OCRopus open source OCR system . In Document Recognition and Retrieval XV , Vol. 6815 . Society of Photo-Optical Instrumentation Engineers (SPIE) , WA , USA, 15. https:\/\/doi.org\/10.1117\/12.783598 10.1117\/12.783598 Thomas M. Breuel. 2008. The OCRopus open source OCR system. In Document Recognition and Retrieval XV, Vol. 6815. Society of Photo-Optical Instrumentation Engineers (SPIE), WA, USA, 15. https:\/\/doi.org\/10.1117\/12.783598"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2017.12"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2013.140"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2011.19"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143891"},{"key":"e_1_3_2_1_9_1","unstructured":"Marcin Heli\u0144ski Mi\u0142Sosz Kmieciak and Tomasz Parko\u0142Sa. 2012. Report on the comparison of Tesseract and ABBYY FineReader OCR engines. http:\/\/lib.psnc.pl\/publication\/428  Marcin Heli\u0144ski Mi\u0142Sosz Kmieciak and Tomasz Parko\u0142Sa. 2012. Report on the comparison of Tesseract and ABBYY FineReader OCR engines. http:\/\/lib.psnc.pl\/publication\/428"},{"key":"e_1_3_2_1_10_1","unstructured":"Geoffrey Hinton Nitish Srivastava and Kevin Swersky. 2012. Neural networks for machine learning lecture 6a overview of mini-batch gradient descent.  Geoffrey Hinton Nitish Srivastava and Kevin Swersky. 2012. Neural networks for machine learning lecture 6a overview of mini-batch gradient descent."},{"key":"e_1_3_2_1_11_1","volume-title":"Kingma and Jimmy Ba","author":"Diederik","year":"2015","unstructured":"Diederik P. Kingma and Jimmy Ba . 2015 . Adam : A Method for Stochastic Optimization. CoRR abs\/1412.6980 (2015), 15. https:\/\/arxiv.org\/abs\/1412.6980 Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. CoRR abs\/1412.6980 (2015), 15. https:\/\/arxiv.org\/abs\/1412.6980"},{"key":"e_1_3_2_1_12_1","unstructured":"Alex Krizhevsky Ilya Sutskever and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. NIPS CA USA 1097--1105.  Alex Krizhevsky Ilya Sutskever and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. NIPS CA USA 1097--1105."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"crossref","unstructured":"John Kunze Justin Littman Elizabeth Madden John Scancella and Chris Adams. 2018. The BagIt File Packaging Format (V1.0). https:\/\/tools.ietf.org\/html\/rfc8493. Accessed: 2019-06-09.  John Kunze Justin Littman Elizabeth Madden John Scancella and Chris Adams. 2018. The BagIt File Packaging Format (V1.0). https:\/\/tools.ietf.org\/html\/rfc8493. Accessed: 2019-06-09.","DOI":"10.17487\/RFC8493"},{"key":"e_1_3_2_1_14_1","first-page":"189","article-title":"Navigating the storm: IMPACT, eMOP, and agile steering standards","volume":"32","author":"Mandell Laura C.","year":"2017","unstructured":"Laura C. Mandell , Clemens Neudecker , Apostolos Antonacopoulos , Elizabeth Grumbach , Loretta Auvil , Matthew J. Christy , Jacob A. Heil , and Todd Samuelson . 2017 . Navigating the storm: IMPACT, eMOP, and agile steering standards . Digital Scholarship in the Humanities 32 , 1 (2017), 189 -- 194 . https:\/\/doi.org\/10.1093\/llc\/fqv062 10.1093\/llc Laura C. Mandell, Clemens Neudecker, Apostolos Antonacopoulos, Elizabeth Grumbach, Loretta Auvil, Matthew J. Christy, Jacob A. Heil, and Todd Samuelson. 2017. Navigating the storm: IMPACT, eMOP, and agile steering standards. Digital Scholarship in the Humanities 32, 1 (2017), 189--194. https:\/\/doi.org\/10.1093\/llc\/fqv062","journal-title":"Digital Scholarship in the Humanities"},{"volume-title":"Eleventh annual conference of the international speech communication association","author":"Mikolov Tom\u00e1\u0161","key":"e_1_3_2_1_15_1","unstructured":"Tom\u00e1\u0161 Mikolov , Martin Karafi\u00e1t , Luk\u00e1\u0161 Burget , Jan \u010cernocky , and Sanjeev Khudanpur . 2010. Recurrent neural network based language model . In Eleventh annual conference of the international speech communication association . ISCA , Baixas, France , 1045--1048. Tom\u00e1\u0161 Mikolov, Martin Karafi\u00e1t, Luk\u00e1\u0161 Burget, Jan \u010cernocky, and Sanjeev Khudanpur. 2010. Recurrent neural network based language model. In Eleventh annual conference of the international speech communication association. ISCA, Baixas, France, 1045--1048."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1017\/S0020743817000964"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3322905.3322917"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0893-6080(98)00116-6"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/DAS.2018.30"},{"key":"e_1_3_2_1_20_1","volume-title":"Frank Robert Jenkins, and Thomas A. Nartker","author":"Rice Stephen V.","year":"1995","unstructured":"Stephen V. Rice , Frank Robert Jenkins, and Thomas A. Nartker . 1995 . The Fourth Annual Test of OCR Accuracy . Stephen V. Rice, Frank Robert Jenkins, and Thomas A. Nartker. 1995. The Fourth Annual Test of OCR Accuracy."},{"key":"e_1_3_2_1_21_1","volume-title":"Sarah Bowen Savant, and Benjamin Kiessling.","author":"Romanov Maxim","year":"2017","unstructured":"Maxim Romanov , Matthew Thomas Miller , Sarah Bowen Savant, and Benjamin Kiessling. 2017 . Important New Developments in Arabographic Optical Character Recognition (OCR). CoRR abs\/1703.09550 (2017), 1--11. arXiv:1703.09550 http:\/\/arxiv.org\/abs\/1703.09550 Maxim Romanov, Matthew Thomas Miller, Sarah Bowen Savant, and Benjamin Kiessling. 2017. Important New Developments in Arabographic Optical Character Recognition (OCR). CoRR abs\/1703.09550 (2017), 1--11. arXiv:1703.09550 http:\/\/arxiv.org\/abs\/1703.09550"},{"key":"e_1_3_2_1_22_1","volume-title":"An Overview of the Tesseract OCR Engine. In Ninth International Conference on Document Analysis and Recognition (ICDAR 2007","volume":"2","author":"Smith Ray","year":"2007","unstructured":"Ray Smith . 2007 . An Overview of the Tesseract OCR Engine. In Ninth International Conference on Document Analysis and Recognition (ICDAR 2007 ), Vol. 2 . IEEE, New York, NY, USA, 629--633. https:\/\/doi.org\/10.1109\/ICDAR. 2007.4376991 10.1109\/ICDAR.2007.4376991 Ray Smith. 2007. An Overview of the Tesseract OCR Engine. In Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), Vol. 2. IEEE, New York, NY, USA, 629--633. https:\/\/doi.org\/10.1109\/ICDAR.2007.4376991"},{"key":"e_1_3_2_1_23_1","unstructured":"Ray Smith. 2016. Tesseract blends old and new OCR technology.  Ray Smith. 2016. Tesseract blends old and new OCR technology."},{"volume-title":"Ocrocis - A high accuracy OCR method to convert early printings into digital text. Tutorial","author":"Springmann Uwe","key":"e_1_3_2_1_24_1","unstructured":"Uwe Springmann . 2015. Ocrocis - A high accuracy OCR method to convert early printings into digital text. Tutorial . Center for Information and Language Processing (CIS) . Uwe Springmann. 2015. Ocrocis - A high accuracy OCR method to convert early printings into digital text. Tutorial. Center for Information and Language Processing (CIS)."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2505377.2505394"},{"key":"e_1_3_2_1_26_1","volume-title":"Transfer Learning for OCRopus Model Training on Early Printed Books. 027.7 Zeitschrift f\u00fcr Bibliothekskultur \/ Journal for Library Culture 5, 1","author":"Christoph Wick Christian Reul","year":"2017","unstructured":"Christian Reul und Christoph Wick und Uwe Springmann und Frank Puppe . 2017. Transfer Learning for OCRopus Model Training on Early Printed Books. 027.7 Zeitschrift f\u00fcr Bibliothekskultur \/ Journal for Library Culture 5, 1 ( 2017 ), 38--51. https:\/\/doi.org\/10.12685\/027.7-5-1-169 10.12685\/027.7-5-1-169 Christian Reul und Christoph Wick und Uwe Springmann und Frank Puppe. 2017. Transfer Learning for OCRopus Model Training on Early Printed Books. 027.7 Zeitschrift f\u00fcr Bibliothekskultur \/ Journal for Library Culture 5, 1 (2017), 38--51. https:\/\/doi.org\/10.12685\/027.7-5-1-169"},{"key":"e_1_3_2_1_27_1","volume-title":"Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition. CoRR abs\/1807.02004","author":"Wick Christoph","year":"2018","unstructured":"Christoph Wick , Christian Reul , and Frank Puppe . 2018. Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition. CoRR abs\/1807.02004 ( 2018 ), 1--12. arXiv:1807.02004 http:\/\/arxiv.org\/abs\/1807.02004 Christoph Wick, Christian Reul, and Frank Puppe. 2018. Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition. CoRR abs\/1807.02004 (2018), 1--12. arXiv:1807.02004 http:\/\/arxiv.org\/abs\/1807.02004"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"crossref","first-page":"79","DOI":"10.21248\/jlcl.33.2018.219","article-title":"Comparison of OCR Accuracy on Early Printed Books using the Open Source Engines Calamari and OCRopus","volume":"33","author":"Wick Christoph","year":"2018","unstructured":"Christoph Wick , Christian Reul , and Frank Puppe . 2018 . Comparison of OCR Accuracy on Early Printed Books using the Open Source Engines Calamari and OCRopus . JLCL 33 , 1 (2018), 79 -- 96 . Christoph Wick, Christian Reul, and Frank Puppe. 2018. Comparison of OCR Accuracy on Early Printed Books using the Open Source Engines Calamari and OCRopus. JLCL 33, 1 (2018), 79--96.","journal-title":"JLCL"}],"event":{"name":"HIP '19: The 5th International Workshop on Historical Document Imaging and Processing","sponsor":["FamilySearch FamilySearch"],"location":"Sydney NSW Australia","acronym":"HIP '19"},"container-title":["Proceedings of the 5th International Workshop on Historical Document Imaging and Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3352631.3352638","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3352631.3352638","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:25:30Z","timestamp":1750206330000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3352631.3352638"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,9,20]]},"references-count":28,"alternative-id":["10.1145\/3352631.3352638","10.1145\/3352631"],"URL":"https:\/\/doi.org\/10.1145\/3352631.3352638","relation":{},"subject":[],"published":{"date-parts":[[2019,9,20]]},"assertion":[{"value":"2019-09-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}