{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,13]],"date-time":"2026-05-13T20:52:19Z","timestamp":1778705539669,"version":"3.51.4"},"reference-count":119,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2020,10,16]],"date-time":"2020-10-16T00:00:00Z","timestamp":1602806400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Imaging"],"abstract":"<jats:p>Nowadays, deep learning methods are employed in a broad range of research fields. The analysis and recognition of historical documents, as we survey in this work, is not an exception. Our study analyzes the papers published in the last few years on this topic from different perspectives: we first provide a pragmatic definition of historical documents from the point of view of the research in the area, then we look at the various sub-tasks addressed in this research. Guided by these tasks, we go through the different input-output relations that are expected from the used deep learning approaches and therefore we accordingly describe the most used models. We also discuss research datasets published in the field and their applications. This analysis shows that the latest research is a leap forward since it is not the simple use of recently proposed algorithms to previous problems, but novel tasks and novel applications of state of the art methods are now considered. Rather than just providing a conclusive picture of the current research in the topic we lastly suggest some potential future trends that can represent a stimulus for innovative research directions.<\/jats:p>","DOI":"10.3390\/jimaging6100110","type":"journal-article","created":{"date-parts":[[2020,10,16]],"date-time":"2020-10-16T08:56:48Z","timestamp":1602838608000},"page":"110","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":95,"title":["Deep Learning for Historical Document Analysis and Recognition\u2014A Survey"],"prefix":"10.3390","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3268-553X","authenticated-orcid":false,"given":"Francesco","family":"Lombardi","sequence":"first","affiliation":[{"name":"Department of Information Engineering (DINFO), School of Engineering, Universit\u00e0 degli Studi di Firenze, 50139 Florence, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6702-2277","authenticated-orcid":false,"given":"Simone","family":"Marinai","sequence":"additional","affiliation":[{"name":"Department of Information Engineering (DINFO), School of Engineering, Universit\u00e0 degli Studi di Firenze, 50139 Florence, Italy"}]}],"member":"1968","published-online":{"date-parts":[[2020,10,16]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-3-540-76280-5_1","article-title":"Introduction to Document Analysis and Recognition","volume":"Volume 90","author":"Marinai","year":"2008","journal-title":"Machine Learning in Document Analysis and Recognition"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1109\/TPAMI.2005.4","article-title":"Artificial Neural Networks for Document Analysis and Recognition","volume":"27","author":"Marinai","year":"2005","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_3","unstructured":"Rumelhart, D.E., McClelland, J.L., and PDP Research Group (1986). Learning Internal Representations by Error Propagation. Parallel Distributed Processing: Explorations in the Microstructure of Cognition, MIT Press."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based learning applied to document recognition","volume":"86","author":"Lecun","year":"1998","journal-title":"Proc. IEEE"},{"key":"ref_5","unstructured":"Sellen, A.J., and Harper, R.H. (2003). The Myth of the Paperless Office, MIT Press."},{"key":"ref_6","unstructured":"(2004). 1st International Workshop on Document Image Analysis for Libraries (DIAL 2004), IEEE Computer Society."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Kesiman, M.W.A., Burie, J.C., Wibawantara, G.N.M.A., Sunarya, I.M.G., and Ogier, J.M. (2016, January 23\u201326). AMADI_LontarSet: The First Handwritten Balinese Palm Leaf Manuscripts Dataset. Proceedings of the 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), Shenzhen, China.","DOI":"10.1109\/ICFHR.2016.0042"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Suryani, M., Paulus, E., Hadi, S., Darsa, U.A., and Burie, J.C. (2017, January 9\u201315). The handwritten sundanese palm leaf manuscript dataset from 15th century. Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.135"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Valy, D., Verleysen, M., Chhun, S., and Burie, J.C. (2017, January 10\u201311). A new Khmer Palm leaf manuscript dataset for document analysis and recognition: SleukRith set. Proceedings of the 4th International Workshop on Historical Document Imaging and Processing, Kyoto, Japan.","DOI":"10.1145\/3151509.3151510"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Stromer, D., Christlein, V., Maier, A., Zippert, P., Helmecke, E., Hausotte, T., and Huang, X. (2018, January 24\u201327). Non-destructive Digitization of Soiled Historical Chinese Bamboo Scrolls. Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, DAS, Vienna, Austria.","DOI":"10.1109\/DAS.2018.37"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Mohammed, H., Marthot-Santaniello, I., and Margner, V. (2019, January 20\u201325). GRK-Papyri: A Dataset of Greek Handwriting on Papyri for the Task of Writer Identification. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00121"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Alaasam, R., Kurar, B., and El-Sana, J. (2019, January 20\u201325). Layout Analysis on Challenging Historical Arabic Manuscripts using Siamese Network. Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR), Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00123"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Clanuwat, T., Lamb, A., and Kitamoto, A. (2019, January 20\u201325). KuroNet: Pre-Modern Japanese Kuzushiji Character Recognition with Deep Learning. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00103"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Prusty, A., Aitha, S., Trivedi, A., and Sarvadevabhatla, S.R.K. (2019, January 20\u201325). Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00164"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Hamid, A., Bibi, M., Moetesum, M., and Siddiqi, I. (2019, January 20\u201325). Deep Learning Based Approach for Historical Manuscript Dating. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00159"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Toselli, A., Romero, V., Sanchez, J.A., and Vidal, E. (2019, January 20\u201325). Making Two Vast Historical Manuscript Collections Searchable and Extracting Meaningful Textual Features Through Large-Scale Probabilistic Indexing. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00026"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Trivedi, A., and Sarvadevabhatla, S.R.K. (2019, January 20\u201325). HInDoLA: A Unified Cloud-Based Platform for Annotation, Visualization and Machine Learning-Based Layout Analysis of Historical Manuscripts. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDARW.2019.10035"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Yin, X., Aldarrab, N., Megyesi, B., and Knight, K. (2019, January 20\u201325). Decipherment of Historical Manuscript Images. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00022"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Christlein, V., Nicolaou, A., Seuret, M., Stutzmann, D., and Maier, A. (2019, January 20\u201325). ICDAR 2019 Competition on Image Retrieval for Historical Handwritten Documents. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00242"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Watanabe, K., Takahashi, S., Kamaya, Y., Yamada, M., Mekada, Y., Hasegawa, J., and Miyazaki, S. (2019, January 20\u201325). Japanese Character Segmentation for Historical Handwritten Official Documents Using Fully Convolutional Networks. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00154"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1016\/j.patrec.2020.02.016","article-title":"Text alignment in early printed books combining deep learning and dynamic programming","volume":"133","author":"Ziran","year":"2020","journal-title":"Pattern Recognit. Lett."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Neji, H., Nogueras-Iso, J., Lacasta, J., Ben Halima, M., and Alimi, A. (2019, January 20\u201325). Adversarial Autoencoders for Denoising Digitized Historical Documents: The Use Case of Incunabula. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDARW.2019.50112"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Gr\u00fcning, T., Labahn, R., Diem, M., Kleber, F., and Fiel, S. (2018, January 24\u201327). Read-bad: A new dataset and evaluation scheme for baseline detection in archival documents. Proceedings of the 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), Vienna, Austria.","DOI":"10.1109\/DAS.2018.38"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Saini, R., Dobson, D., Morrey, J., Liwicki, M., and Liwicki, F. (2019, January 20\u201325). ICDAR 2019 Historical Document Reading Challenge on Large Structured Chinese Family Records. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00241"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Clinchant, S., Dejean, H., Meunier, J.L., Lang, E., and Kleber, F. (2018, January 24\u201327). Comparing Machine Learning Approaches for Table Recognition in Historical Register Books. Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, DAS, Vienna, Austria.","DOI":"10.1109\/DAS.2018.44"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1016\/j.patrec.2017.10.023","article-title":"Deep neural networks for record counting in historical handwritten documents","volume":"119","author":"Capobianco","year":"2019","journal-title":"Pattern Recognit. Lett."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Weinman, J., Chen, Z., Gafford, B., Gifford, N., Lamsal, A., and Niehus-Staab, L. (2019, January 20\u201325). Deep Neural Networks for Text Detection and Recognition in Historical Maps. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00149"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1016\/j.patrec.2019.11.025","article-title":"An end-to-end deep learning system for medieval writer identification","volume":"129","author":"Cilia","year":"2020","journal-title":"Pattern Recognit. Lett."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.C. (2018, January 18\u201322). Mobilenetv2: Inverted residuals and linear bottlenecks. Proceedings of the IEEE conference on computer vision and pattern recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Adak, C., Marinai, S., Chaudhuri, B.B., and Blumenstein, M. (2018, January 24\u201327). Offline Bengali Writer Verification by PDF-CNN and Siamese Net. Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, DAS 2018, Vienna, Austria.","DOI":"10.1109\/DAS.2018.33"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"He, S., Sammara, P., Burgers, J., and Schomaker, L. (2014, January 1\u20134). Towards Style-Based Dating of Historical Documents. Proceedings of the 14th International Conference on Frontiers in Handwriting Recognition, ICFHR 2014, Crete, Greece.","DOI":"10.1109\/ICFHR.2014.52"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1109\/TPAMI.2007.1009","article-title":"Text-Independent Writer Identification and Verification Using Textural and Allographic Features","volume":"29","author":"Bulacu","year":"2007","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"162","DOI":"10.1016\/j.patcog.2011.07.005","article-title":"Writer identification using directional ink-trace width measurements","volume":"45","author":"Brink","year":"2012","journal-title":"Pattern Recognit."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"5252","DOI":"10.1109\/TIP.2016.2602078","article-title":"A Multiple-Label Guided Clustering Algorithm for Historical Document Dating and Localization","volume":"25","author":"He","year":"2016","journal-title":"IEEE Trans. Image Process."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Hamid, A., Bibi, M., Siddiqi, I., and Moetesum, M. (2018, January 17\u201319). Historical Manuscript Dating using Textural Measures. Proceedings of the 2018 International Conference on Frontiers of Information Technology, FIT 2018, Islamabad, Pakistan.","DOI":"10.1109\/FIT.2018.00048"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Studer, L., Alberti, M., Pondenkandath, V., Goktepe, P., Kolonko, T., Fischer, A., Liwicki, M., and Ingold, R. (2019, January 20\u201325). A Comprehensive Study of ImageNet Pre-Training for Historical Document Image Analysis. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00120"},{"key":"ref_37","first-page":"368","article-title":"Exploring Digital Libraries with Document Image Retrieval","volume":"Volume 4675","author":"Fuhr","year":"2007","journal-title":"Proceedings of the Research and Advanced Technology for Digital Libraries, 11th European Conference, ECDL 2007"},{"key":"ref_38","unstructured":"Cesarini, F., Lastri, M., Marinai, S., and Soda, G. (2001, January 10\u201313). Encoding of Modified X-Y Trees for Document Classification. Proceedings of the 6th International Conference on Document Analysis and Recognition (ICDAR 2001), Seattle, WA, USA."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Afzal, M.Z., Capobianco, S., Malik, M.I., Marinai, S., Breuel, T.M., Dengel, A., and Liwicki, M. (2015, January 23\u201326). Deepdocclassifier: Document classification with deep Convolutional Neural Network. Proceedings of the 13th International Conference on Document Analysis and Recognition, ICDAR 2015, Nancy, France.","DOI":"10.1109\/ICDAR.2015.7333933"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Santoro, A., De Stefano, C., and Marcelli, A. (2017, January 7\u201315). Assisted Transcription of Historical Documents by Keyword Spotting: A Performance Model. Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.162"},{"key":"ref_41","unstructured":"S\u00e1nchez, J.A. (2020, October 15). Bentham Dataset R0. Available online: https:\/\/zenodo.org\/record\/44519."},{"key":"ref_42","unstructured":"Nguyen, K., Nguyen, C., Hotta, S., and Nakagawa, M. (2019, January 20\u201325). A Character Attention Generative Adversarial Network for Degraded Historical Document Restoration. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-Net: Convolutional Networks for Biomedical Image Segmentation. arXiv.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_44","first-page":"23","article-title":"Qumran Letter Restoration by Rotation and Reflection Modified PixelCNN","volume":"1","author":"Uzan","year":"2017","journal-title":"Iapr Int. Conf. Doc. Anal. Recognit."},{"key":"ref_45","unstructured":"Salimans, T., Karpathy, A., Chen, X., and Kingma, D.P. (2017). PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications. arXiv."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Tensmeyer, C., and Martinez, T. (2017, January 9\u201315). Document image binarization with fully convolutional neural networks. Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.25"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"400","DOI":"10.1016\/B978-0-444-53859-8.00016-3","article-title":"Chapter 16 - Learning Algorithms for Document Layout Analysis","volume":"Volume 31","author":"Rao","year":"2013","journal-title":"Handbook of Statistics"},{"key":"ref_48","unstructured":"Lafferty, J., McCallum, A., and Pereira, F.C. (July, January 28). Conditional random fields: Probabilistic models for segmenting and labeling sequence data. Proceedings of the Eighteenth International Conference on Machine Learning (ICML), Williams College, Williamstown, MA, USA."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Riba, P., Dutta, A., Goldmann, L., Forn\u00e9s, A., Terrades, O.R., and Llad\u00f3s, J. (2019, January 20\u201325). Table Detection in Invoice Documents by Graph Neural Networks. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00028"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Tensmeyer, C., Morariu, V.I., Price, B.L., Cohen, S., and Martinez, T.R. (2019, January 20\u201325). Deep Splitting and Merging for Table Structure Decomposition. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00027"},{"key":"ref_51","first-page":"68","article-title":"Text Line Extraction in Handwritten Historical Documents","volume":"Volume 733","author":"Grana","year":"2017","journal-title":"Proceedings of the Digital Libraries and Archives\u201413th Italian Research Conference on Digital Libraries, IRCDL 2017"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Chen, K., Seuret, M., Hennebert, J., and Ingold, R. (2017, January 9\u201315). Convolutional neural networks for page segmentation of historical document images. Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.161"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Pastor-Pellicer, J., Afzal, M.Z., Liwicki, M., and Castro-Bleda, M.J. (2016, January 11\u201314). Complete system for text line extraction using convolutional neural networks and watershed transform. Proceedings of the 2016 12th IAPR Workshop on Document Analysis Systems (DAS), Santorini, Greece.","DOI":"10.1109\/DAS.2016.58"},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1007\/s10032-018-0304-3","article-title":"Fully convolutional network with dilated convolutions for handwritten text line segmentation","volume":"21","author":"Renton","year":"2018","journal-title":"IJDAR"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Bukhari, S.S., Breuel, T.M., Asi, A., and El-Sana, J. (2012, January 18\u201320). Layout analysis for arabic historical document images using machine learning. Proceedings of the 2012 International Conference on Frontiers in Handwriting Recognition, Bari, Italy.","DOI":"10.1109\/ICFHR.2012.227"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Barakat, B.K., and El-Sana, J. (2018, January 12\u201314). Binarization Free Layout Analysis for Arabic Historical Documents Using Fully Convolutional Networks. Proceedings of the 2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR), London, UK.","DOI":"10.1109\/ASAR.2018.8480333"},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Fink, M., Layer, T., Mackenbrock, G., and Sprinzl, M. (2018, January 24\u201327). Baseline Detection in Historical Documents Using Convolutional U-Nets. Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, DAS, Vienna, Austria.","DOI":"10.1109\/DAS.2018.34"},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1007\/s10032-019-00332-1","article-title":"A two-stage method for text line detection in historical documents","volume":"22","author":"Leifert","year":"2019","journal-title":"IJDAR"},{"key":"ref_59","first-page":"395","article-title":"Historical Handwritten Document Segmentation by Using a Weighted Loss","volume":"Volume 11081","author":"Pancioni","year":"2018","journal-title":"Proceedings of the Artificial Neural Networks in Pattern Recognition\u20148th IAPR TC3 Workshop, ANNPR 2018"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1007\/s10032-018-0295-0","article-title":"Attribute CNNs for word spotting in handwritten documents","volume":"21","author":"Sudholt","year":"2018","journal-title":"IJDAR"},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1109\/TPAMI.2011.113","article-title":"A Novel Word Spotting Method Based on Recurrent Neural Networks","volume":"34","author":"Frinken","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_62","doi-asserted-by":"crossref","first-page":"1658","DOI":"10.1016\/j.patcog.2012.11.024","article-title":"The ESPOSALLES database: An ancient marriage license corpus for off-line handwriting recognition","volume":"46","author":"Romero","year":"2013","journal-title":"Pattern Recognit."},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1007\/s10032-019-00336-x","article-title":"HWNet v2: An efficient word image representation for handwritten documents","volume":"22","author":"Krishnan","year":"2019","journal-title":"IJDAR"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Weinman, J. (2017, January 9\u201315). Geographic and Style Models for Historical Map Alignment and Toponym Recognition. Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.160"},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1007\/s10032-019-00334-z","article-title":"Total-Text: Toward orientation robustness in scene text detection","volume":"23","author":"Chan","year":"2020","journal-title":"IJDAR"},{"key":"ref_66","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1007\/s10032-019-00320-5","article-title":"Scene text detection and recognition with advances in deep learning: A survey","volume":"22","author":"Liu","year":"2019","journal-title":"IJDAR"},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Cai, J., Peng, L., Tang, Y., Liu, C., and Li, P. (2019, January 20\u201325). TH-GAN: Generative Adversarial Network Based Transfer Learning for Historical Chinese Character Recognition. Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR), Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00037"},{"key":"ref_68","doi-asserted-by":"crossref","first-page":"134","DOI":"10.1016\/j.patrec.2020.05.026","article-title":"An attention-based row-column encoder-decoder model for text recognition in Japanese historical documents","volume":"136","author":"Ly","year":"2020","journal-title":"Pattern Recognit. Lett."},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Nagai, A. (2019, January 20\u201325). On the Improvement of Recognizing Single-Line Strings of Japanese Historical Cursive. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00105"},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Chammas, E., Mokbel, C., and Likforman-Sulem, L. (2018, January 24\u201327). Handwriting Recognition of Historical Documents with Few Labeled Data. Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, DAS, Vienna, Austria.","DOI":"10.1109\/DAS.2018.15"},{"key":"ref_71","unstructured":"Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning, MIT Press."},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Karpinski, R., and Bela\u00efd, A. (2019, January 20\u201325). Semi-Synthetic Data Augmentation of Scanned Historical Documents. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00051"},{"key":"ref_73","doi-asserted-by":"crossref","unstructured":"Capobianco, S., and Marinai, S. (2017). DocEmul: A Toolkit to Generate Structured Historical Documents. arXiv.","DOI":"10.1109\/ICDAR.2017.196"},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Xu, Y., He, W., Yin, F., and Liu, C. (2017, January 9\u201315). Page Segmentation for Historical Handwritten Documents Using Fully Convolutional Networks. Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.94"},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Asi, A., Cohen, R., Kedem, K., El-Sana, J., and Dinstein, I. (2014, January 1\u20134). A Coarse-to-Fine Approach for Layout Analysis of Ancient Manuscripts. Proceedings of the International Conference on Frontiers in Handwriting Recognition, ICFHR, Crete Island, Greece.","DOI":"10.1109\/ICFHR.2014.31"},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Wick, C., and Puppe, F. (2018, January 24\u201327). Fully convolutional neural networks for page segmentation of historical document images. Proceedings of the 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), Vienna, Austria.","DOI":"10.1109\/DAS.2018.39"},{"key":"ref_77","doi-asserted-by":"crossref","unstructured":"Mechi, O., Mehri, M., Ingold, R., and Essoukri Ben Amara, N. (2019, January 20\u201325). Text Line Segmentation in Historical Document Images Using an Adaptive U-Net Architecture. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00066"},{"key":"ref_78","doi-asserted-by":"crossref","unstructured":"Romain, K., and Abdel, B. (2019, January 22\u201325). Semi-Supervised Learning through Adversary Networks for Baseline Detection. Proceedings of the 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW), Sydney, Australia.","DOI":"10.1109\/ICDARW.2019.40093"},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"Stauffer, M., Fischer, A., and Riesen, K. (2018, January 24\u201327). Graph-Based Keyword Spotting in Historical Documents Using Context-Aware Hausdorff Edit Distance. Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, DAS, Vienna, Austria.","DOI":"10.1109\/DAS.2018.31"},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Yang, H., Jin, L., and Sun, J. (2018, January 5\u20138). Recognition of chinese text in historical documents with page-level annotations. Proceedings of the 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), Niagara Falls, NY, USA.","DOI":"10.1109\/ICFHR-2018.2018.00043"},{"key":"ref_81","doi-asserted-by":"crossref","unstructured":"Ly, N.T., Nguyen, C.T., and Nakagawa, M. (2018, January 5\u20138). Training an End-to-End Model for Offline Handwritten Japanese Text Recognition by Generated Synthetic Patterns. Proceedings of the 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), Niagara Falls, NY, USA.","DOI":"10.1109\/ICFHR-2018.2018.00022"},{"key":"ref_82","doi-asserted-by":"crossref","unstructured":"Ly, N.T., Nguyen, C.T., and Nakagawa, M. (2019, January 20\u201325). An attention-based end-to-end model for multiple text lines recognition in japanese historical documents. Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR), Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00106"},{"key":"ref_83","unstructured":"Bluche, T. (2016, January 5\u201310). Joint line segmentation and transcription for end-to-end handwritten paragraph recognition. Proceedings of the Advances in Neural Information Processing Systems, Barcelona, Spain."},{"key":"ref_84","doi-asserted-by":"crossref","unstructured":"Pondenkandath, V., Alberti, M., Diatta, M., Ingold, R., and Liwicki, M. (2019, January 22\u201325). Historical Document Synthesis with Generative Adversarial Networks. Proceedings of the Second International Workshop on Machine Learning, WML@ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDARW.2019.40096"},{"key":"ref_85","unstructured":"Bromley, J., Guyon, I., LeCun, Y., S\u00e4ckinger, E., and Shah, R. (December, January 28). Signature verification using a \u201csiamese\u201d time delay neural network. Proceedings of the Advances in Neural Information Processing Systems, Denver, CO, USA."},{"key":"ref_86","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2014). Fully Convolutional Networks for Semantic Segmentation. arXiv.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_87","doi-asserted-by":"crossref","first-page":"3212","DOI":"10.1109\/TNNLS.2018.2876865","article-title":"Object Detection With Deep Learning: A Review","volume":"30","author":"Zhao","year":"2019","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_88","unstructured":"Baldi, P. (2011, January 7). Autoencoders, Unsupervised Learning and Deep Architectures. Proceedings of the 2011 International Conference on Unsupervised and Transfer Learning Workshop, Bellevue, WA, USA."},{"key":"ref_89","doi-asserted-by":"crossref","unstructured":"Wei, H., Seuret, M., Chen, K., Fischer, A., Liwicki, M., and Ingold, R. (2015, January 22). Selecting autoencoder features for layout analysis of historical documents. Proceedings of the 3rd International Workshop on Historical Document Imaging and Processing, Gammarth, Tunisia.","DOI":"10.1145\/2809544.2809548"},{"key":"ref_90","doi-asserted-by":"crossref","unstructured":"Chen, K., Seuret, M., Liwicki, M., Hennebert, J., and Ingold, R. (2015, January 23\u201326). Page segmentation of historical document images with convolutional autoencoders. Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, Tunisia.","DOI":"10.1109\/ICDAR.2015.7333914"},{"key":"ref_91","unstructured":"Liu, P., Qiu, X., and Huang, X. (2016). Recurrent Neural Network for Text Classification with Multi-Task Learning. arXiv."},{"key":"ref_92","doi-asserted-by":"crossref","first-page":"855","DOI":"10.1109\/TPAMI.2008.137","article-title":"A novel connectionist system for unconstrained handwriting recognition","volume":"31","author":"Graves","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_93","doi-asserted-by":"crossref","unstructured":"Mart\u00ednek, J., Lenc, L., Kr\u00e1l, P., Nicolaou, A., and Christlein, V. (2019, January 20\u201325). Hybrid Training Data for Historical Text OCR. Proceedings of the 15th International Conference on Document Analysis and Recognition (ICDAR 2019), Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00096"},{"key":"ref_94","doi-asserted-by":"crossref","unstructured":"Messina, R., and Louradour, J. (2015, January 23\u201326). Segmentation-free handwritten Chinese text recognition with LSTM-RNN. Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, Tunisia.","DOI":"10.1109\/ICDAR.2015.7333746"},{"key":"ref_95","doi-asserted-by":"crossref","unstructured":"Moysset, B., Kermorvant, C., and Wolf, C. (2017, January 9\u201315). Full-page text recognition: Learning where to start and when to stop. Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.147"},{"key":"ref_96","unstructured":"Finn, C., Christiano, P., Abbeel, P., and Levine, S. (2016). A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models. arXiv."},{"key":"ref_97","unstructured":"Chang, J., Gu, Y., Zhang, Y., and Wang, Y.F. (2018, January 3\u20136). Chinese Handwriting Imitation with Hierarchical Generative Adversarial Network. Proceedings of the BMVC, Newcastle, UK."},{"key":"ref_98","doi-asserted-by":"crossref","unstructured":"Chang, B., Zhang, Q., Pan, S., and Meng, L. (2018, January 12\u201315). Generating handwritten chinese characters using cyclegan. Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA.","DOI":"10.1109\/WACV.2018.00028"},{"key":"ref_99","doi-asserted-by":"crossref","unstructured":"Zhu, J.Y., Park, T., Isola, P., and Efros, A.A. (2017). Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. arXiv.","DOI":"10.1109\/ICCV.2017.244"},{"key":"ref_100","doi-asserted-by":"crossref","unstructured":"Simistira, F., Seuret, M., Eichenberger, N., Garz, A., Liwicki, M., and Ingold, R. (2016, January 23\u201326). Diva-hisdb: A precisely annotated large dataset of challenging medieval manuscripts. Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), Shenzhen, China.","DOI":"10.1109\/ICFHR.2016.0093"},{"key":"ref_101","doi-asserted-by":"crossref","unstructured":"Fischer, A., Frinken, V., Forn\u00e9s, A., and Bunke, H. (2011, January 24\u201327). Transcription Alignment of Latin Manuscripts Using Hidden Markov Models. Proceedings of the 2011 Workshop on Historical Document Imaging and Processing, New York, NY, USA.","DOI":"10.1145\/2037342.2037348"},{"key":"ref_102","doi-asserted-by":"crossref","unstructured":"Fischer, A., Wuthrich, M., Liwicki, M., Frinken, V., Bunke, H., Viehhauser, G., and Stolz, M. (2009, January 9\u201312). Automatic Transcription of Handwritten Medieval Documents. Proceedings of the 2009 15th International Conference on Virtual Systems and Multimedia, Vienna. Austria.","DOI":"10.1109\/VSMM.2009.26"},{"key":"ref_103","doi-asserted-by":"crossref","unstructured":"Kassis, M., Abdalhaleem, A., Droby, A., Alaasam, R., and El-Sana, J. (2017, January 3\u20135). Vml-hd: The historical arabic documents dataset for recognition systems. Proceedings of the 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR), Nancy, France.","DOI":"10.1109\/ASAR.2017.8067751"},{"key":"ref_104","doi-asserted-by":"crossref","unstructured":"Kurar Barakat, B., El-Sana, J., and Rabaev, I. (2019, January 20\u201325). The Pinkas Dataset. Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR), Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00122"},{"key":"ref_105","unstructured":"Clanuwat, T., Bober-Irizar, M., Kitamoto, A., Lamb, A., Yamamoto, K., and Ha, D. (2018). Deep learning for classical japanese literature. arXiv."},{"key":"ref_106","doi-asserted-by":"crossref","unstructured":"Haji\u010d, J., and Pecina, P. (2017, January 9\u201315). The MUSCIMA++ dataset for handwritten optical music recognition. Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.16"},{"key":"ref_107","unstructured":"Wallach, H., Larochelle, H., Beygelzimer, A., d\u2019 Alche-Buc, F., Fox, E., and Garnett, R. (2019). PyTorch: An Imperative Style, High-Performance Deep Learning Library. Advances in Neural Information Processing Systems 32, Curran Associates, Inc."},{"key":"ref_108","unstructured":"Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G.S., Davis, A., Dean, J., and Devin, M. (2020, October 15). TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. Available online: www.tensorflow.org."},{"key":"ref_109","unstructured":"Chollet, F. (2020, October 15). Keras. Available online: https:\/\/github.com\/fchollet\/keras."},{"key":"ref_110","doi-asserted-by":"crossref","unstructured":"Kiessling, B., Tissot, R., Stokes, P., and Stoekl Ben Ezra, D. (2019, January 22\u201325). eScriptorium: An Open Source Platform for Historical Document Analysis. Proceedings of the 2nd International Workshop on Open Services and Tools for Document Analysis, OST@ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDARW.2019.10032"},{"key":"ref_111","unstructured":"Pettersson, E., and Megyesi, B. (2018, January 7\u20139). The histcorp collection of historical corpora and resources. Proceedings of the DHN 2018 The Third Conference on Digital Humanities in the Nordic Countries, Helsinki, Finland."},{"key":"ref_112","doi-asserted-by":"crossref","unstructured":"Alberti, M., Bouillon, M., Ingold, R., and Liwicki, M. (2017, January 9\u201315). Open Evaluation Tool for Layout Analysis of Document Images. Proceedings of the 1st International Workshop on Open Services and Tools for Document Analysis, 14th IAPR International Conference on Document Analysis and Recognition, OST@ICDAR 2017, Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.311"},{"key":"ref_113","doi-asserted-by":"crossref","unstructured":"Antonacopoulos, A., Clausner, C., Papadopoulos, C., and Pletschacher, S. (2013, January 35\u201328). ICDAR 2013 Competition on Historical Newspaper Layout Analysis (HNLA 2013). Proceedings of the 2013 12th International Conference on Document Analysis and Recognition, Washington, DC, USA.","DOI":"10.1109\/ICDAR.2013.293"},{"key":"ref_114","doi-asserted-by":"crossref","unstructured":"Simistira, F., Bouillon, M., Seuret, M., W\u00fcrsch, M., Alberti, M., Ingold, R., and Liwicki, M. (2017, January 9\u201315). Icdar2017 competition on layout analysis for challenging medieval manuscripts. Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, Japan.","DOI":"10.1109\/ICDAR.2017.223"},{"key":"ref_115","doi-asserted-by":"crossref","unstructured":"Mehri, M., Heroux, P., Mullot, R., Moreux, J.P., Co\u00fcasnon, B., and Barrett, B. (2019, January 20\u201325). ICDAR2019 Competition on Historical Book Analysis\u2014HBA2019. Proceedings of the 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia.","DOI":"10.1109\/ICDAR.2019.00239"},{"key":"ref_116","doi-asserted-by":"crossref","unstructured":"Gaur, S., Sonkar, S., and Roy, P.P. (2015, January 23\u201326). Generation of synthetic training data for handwritten Indic script recognition. Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR), Tunis, Tunisia.","DOI":"10.1109\/ICDAR.2015.7333810"},{"key":"ref_117","unstructured":"Frinken, V., Barrett, B., Manmatha, R., and M\u00e4rgner, V. (2013, January 24). Generation of learning samples for historical handwriting recognition using image degradation. Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing, HIP@ICDAR 2013, Washington, DC, USA."},{"key":"ref_118","doi-asserted-by":"crossref","unstructured":"Journet, N., Visani, M., Mansencal, B., Van-Cuong, K., and Billy, A. (2017). DocCreator: A New Software for Creating Synthetic Ground-Truthed Document Images. J. Imaging, 3.","DOI":"10.3390\/jimaging3040062"},{"key":"ref_119","doi-asserted-by":"crossref","unstructured":"Chen, C.H. (2020). Deep learning for historical document analysis. Handbook of Pattern Recognition and Computer Vision, World Scientific.","DOI":"10.1142\/11573"}],"container-title":["Journal of Imaging"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2313-433X\/6\/10\/110\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:22:34Z","timestamp":1760178154000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2313-433X\/6\/10\/110"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,16]]},"references-count":119,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2020,10]]}},"alternative-id":["jimaging6100110"],"URL":"https:\/\/doi.org\/10.3390\/jimaging6100110","relation":{},"ISSN":["2313-433X"],"issn-type":[{"value":"2313-433X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,10,16]]}}}