{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,15]],"date-time":"2025-10-15T00:40:35Z","timestamp":1760488835065,"version":"build-2065373602"},"publisher-location":"New York, NY, USA","reference-count":69,"publisher":"ACM","funder":[{"DOI":"10.13039\/501100003359","name":"Generalitat Valenciana","doi-asserted-by":"publisher","award":["CIBEFP\/2023\/170 and ACIF\/2021\/436"],"award-info":[{"award-number":["CIBEFP\/2023\/170 and ACIF\/2021\/436"]}],"id":[{"id":"10.13039\/501100003359","id-type":"DOI","asserted-by":"publisher"}]},{"name":"MCIN\/AEI\/10.13039\/501100011033 and by ERDF EU A way of making Europe","award":["PID2021-124719OB-I00"],"award-info":[{"award-number":["PID2021-124719OB-I00"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,8,27]]},"DOI":"10.1145\/3704268.3742699","type":"proceedings-article","created":{"date-parts":[[2025,8,27]],"date-time":"2025-08-27T16:44:47Z","timestamp":1756313087000},"page":"1-9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Improving Lightweight Named Entity Recognition in Handwritten Documents by Predicting Pyramidal Histograms of Characters"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2301-6673","authenticated-orcid":false,"given":"David","family":"Villanova-Aparisi","sequence":"first","affiliation":[{"name":"PRHLT Research Center, Universitat Polit\u00e8cnica de Val\u00e8ncia, Valencia, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6139-2891","authenticated-orcid":false,"given":"Carlos D.","family":"Mart\u00ednez-Hinarejos","sequence":"additional","affiliation":[{"name":"PRHLT Research Center, Universitat Polit\u00e8cnica de Val\u00e8ncia, Valencia, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1721-5732","authenticated-orcid":false,"given":"Ver\u00f3nica","family":"Romero","sequence":"additional","affiliation":[{"name":"Departament d'Inform\u00e0tica, Universitat de Val\u00e8ncia, Valencia, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1833-7440","authenticated-orcid":false,"given":"Mois\u00e9s","family":"Pastor-Gadea","sequence":"additional","affiliation":[{"name":"PRHLT Research Center, Universitat Polit\u00e8cnica de Val\u00e8ncia, Valencia, Spain"}]}],"member":"320","published-online":{"date-parts":[[2025,8,27]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"N. Abadie E. Carlinet J. Chazalon and B. Dum\u00e9nieu. 2022. A benchmark of named entity recognition approaches in historical documents application to 19th century french directories. In Document Analysis Systems. Seiichi Uchida Elisa Barney and V\u00e9ronique Eglin (Eds.) Springer Cham 445--460. ISBN: 978-3-031-06555-2.","DOI":"10.1007\/978-3-031-06555-2_30"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","unstructured":"Chandranath Adak Bidyut B. Chaudhuri and Michael Blumenstein. 2016. Named entity recognition from unstructured handwritten document images. In DAS 375--380. doi:10.1109\/DAS.2016.15.","DOI":"10.1109\/DAS.2016.15"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.3390\/jimaging10010018"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.130"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01557"},{"key":"e_1_3_2_1_6_1","volume-title":"The Twelfth International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=fUtxNAKpdV.","author":"Blecher Lukas","year":"2024","unstructured":"Lukas Blecher, Guillem Cucurull, Thomas Scialom, and Robert Stojnic. 2024. Nougat: neural optical understanding for academic documents. In The Twelfth International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=fUtxNAKpdV."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Emanuela Boro\u015f Ver\u00f3nica Romero Martin Maarand Kate\u0159ina Zenklov\u00e1 Jitka K\u0159e\u010dkov\u00e1 Enrique Vidal Dominique Stutzmann and Christopher Kermorvant. 2020. A comparison of sequential and combined approaches for named entity recognition in a corpus of handwritten medieval charters. In ICFHR 79--84.","DOI":"10.1109\/ICFHR2020.2020.00025"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"crossref","unstructured":"Manuel Carbonell Mauricio Villegas Alicia Forn\u00e9s and Josep Llad\u00f3s. 2018. Joint recognition of handwritten text and named entities with a neural end-to-end model. In DAS 399--404.","DOI":"10.1109\/DAS.2018.52"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2020.106649"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-06555-2"},{"key":"e_1_3_2_1_11_1","volume-title":"End-to-end information extraction in handwritten documents: understanding paris marriage records from 1880 to","author":"Constum Thomas","year":"1940","unstructured":"Thomas Constum, Lucas Preel, Th\u00e9o Larcher, Thierry Paquet, Pierrick Tranouez, and Sandra Bree. 2024. End-to-end information extraction in handwritten documents: understanding paris marriage records from 1880 to 1940. In Document Analysis and Recognition - ICDAR 2024. Elisa H. Barney Smith, Marcus Liwicki, and Liangrui Peng, (Eds.) Springer Nature Switzerland, Cham, 195--214. isbn: 978-3-031-70543-4."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Thomas Constum Pierrick Tranouez and Thierry Paquet. 2025. Daniel: a fast document attention network for information extraction and labelling of handwritten documents. International Journal on Document Analysis and Recognition (IJDAR) 1--23.","DOI":"10.1007\/s10032-024-00511-9"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/tpami.2023.3235826"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-41685-9_12"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.520"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-25069-9_19"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1423"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2023.03.020"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3604931"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3604931"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2017.227"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","unstructured":"S. Ghannay A. Caubri\u00e8re Y. Est\u00e8ve N. Camelin E. Simonnet A. Laurent and E. Morin. 2018. End-to-end named entity and semantic concept extraction from speech. In IEEE SLT 692--699. doi:10.1109\/SLT.2018.8639513.","DOI":"10.1109\/SLT.2018.8639513"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143891"},{"key":"e_1_3_2_1_24_1","first-page":"11","volume-title":"Proceedings of the Sixth Workshop on Statistical Machine Translation. ACL","author":"Heafield Kenneth","year":"2011","unstructured":"Kenneth Heafield. 2011. KenLM: faster and smaller language model queries. In Proceedings of the Sixth Workshop on Statistical Machine Translation. ACL, Edinburgh, Scotland, (July 2011), 187--197. https:\/\/www.aclweb.org\/anthology\/W11-2123."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-024-09646-6"},{"key":"e_1_3_2_1_26_1","volume-title":"Jan.","author":"Jocher Glenn","year":"2023","unstructured":"[SW] Glenn Jocher, Jing Qiu, and Ayush Chaurasia, Ultralytics YOLO version 8.0.0, Jan. 2023. URL: https:\/\/github.com\/ultralytics\/ultralytics."},{"key":"e_1_3_2_1_27_1","unstructured":"Stig Johansson Geoffrey Leech and Helen Goodluck. 1978. Manual of information to accompany the lancaster-oslo-bergen corpus of british english for use with digital computers. (1978). http:\/\/korpus.uib.no\/icame\/manuals\/LOB\/INDEX.HTM."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1216"},{"volume-title":"Springer","author":"Kang Lei","key":"e_1_3_2_1_29_1","unstructured":"Lei Kang, J. Ignacio Toledo, Pau Riba, Mauricio Villegas, Alicia Forn\u00e9s, and Mar\u00e7al Rusi\u00f1ol. 2019. Convolve, attend and spell: an attention-based sequence-to-sequence model for handwritten word recognition. In Pattern Recognition. Thomas Brox, Andr\u00e9s Bruhn, and Mario Fritz, (Eds.) Springer International Publishing, Cham, 459--472. isbn: 978-3-030-12939-2."},{"key":"e_1_3_2_1_30_1","unstructured":"Geewook Kim et al. 2022. Ocr-free document understanding transformer. In Computer Vision - ECCV 2022. Shai Avidan Gabriel Brostow Moustapha Ciss\u00e9 Giovanni Maria Farinella and Tal Hassner (Eds.) Springer Nature Switzerland Cham 498--517. isbn: 978-3-031-19815-1."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btg1023"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1995.479394"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-1030"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i11.26538"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.45"},{"key":"e_1_3_2_1_36_1","unstructured":"Yinhan Liu et al. 2019. Roberta: a robustly optimized bert pretraining approach. (2019). https:\/\/arxiv.org\/abs\/1907.11692 arXiv: 1907.11692 [cs.CL]."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.coling-main.334"},{"volume-title":"A comprehensive comparison of open-source libraries for handwritten text recognition in norwegian","author":"Maarand Martin","key":"e_1_3_2_1_38_1","unstructured":"Martin Maarand, Yngvil Beyer, Andre K\u00e5sen, Knut T. Fosseide, and Christopher Kermorvant. 2022. A comprehensive comparison of open-source libraries for handwritten text recognition in norwegian. In Document Analysis Systems. Seiichi Uchida, Elisa Barney, and V\u00e9ronique Eglin, (Eds.) Springer, Cham, 399--413. ISBN: 978-3-031-06555-2."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1007\/s100320200071"},{"key":"e_1_3_2_1_40_1","unstructured":"Blanche Miret and Christopher Kermorvant. 2021. Nerval: a python library for named-entity recognition evaluation on noisy texts. https:\/\/gitlab.teklia.com\/ner\/nerval. (2021)."},{"volume-title":"Acomprehensive study of open-source libraries for named entity recognition on handwritten historical documents","author":"Monroc Claire Bizon","key":"e_1_3_2_1_41_1","unstructured":"Claire Bizon Monroc, Blanche Miret, Marie-Laurence Bonhomme, and Christopher Kermorvant. 2022. Acomprehensive study of open-source libraries for named entity recognition on handwritten historical documents. In Document Analysis Systems. Seiichi Uchida, Elisa Barney, and V\u00e9ronique Eglin, (Eds.) Springer, Cham, 429--444. ISBN: 978-3-031-06555-2."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-70536-6_11"},{"key":"e_1_3_2_1_43_1","volume-title":"Joan Andreu S\u00e1nchez, and Jos\u00e9 Miguel Bened\u00ed","author":"Parres Daniel","year":"2024","unstructured":"Daniel Parres, Dan Anitei, Roberto Paredes, Joan Andreu S\u00e1nchez, and Jos\u00e9 Miguel Bened\u00ed. 2024. Speed-up pre-trained vision encoder-decoder transformers by leveraging lightweight mixer layers for text recognition. In Document Analysis Systems. Giorgos Sfikas and George Retsinas, (Eds.) Springer Nature Switzerland, Cham, 277--294. ISBN: 978-3-031-70442-0."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-41685-9_16"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.100"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2017.20"},{"key":"e_1_3_2_1_47_1","first-page":"95","volume-title":"Third Workshop on Very Large Corpora, 82--94","author":"Ramshaw Lance","year":"1995","unstructured":"Lance Ramshaw and Mitch Marcus. 1995. Text chunking using transformation-based learning. In Third Workshop on Very Large Corpora, 82--94. https:\/\/aclanthology.org\/W95-0107."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.91"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2012.11.024"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","unstructured":"Ver\u00f3nica Romero Alicia Forn\u00e9s Enrique Vidal and Joan Andreu S\u00e1nchez. 2016. Using the mggi methodology for category-based language modeling in handwritten marriage licenses books. In ICFHR 331--336. doi:10.1109\/ICFHR.2016.0069.","DOI":"10.1109\/ICFHR.2016.0069"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-30754-7_11"},{"key":"e_1_3_2_1_52_1","volume-title":"Proceedings of the 15th International Conference on Natural Language Processing","author":"Rowtula Vijay","year":"2018","unstructured":"Vijay Rowtula and Praveen Krishnan. 2018. POS tagging and named entity recognition on handwritten documents. In Proceedings of the 15th International Conference on Natural Language Processing. Gurpreet Singh Lehal, Dipti Misra Sharma, and Rajeev Sangal, (Eds.) NLP Association of India, International Institute of Information Technology, Hyderabad, India, (Dec. 2018), 82--86. https:\/\/aclanthology.org\/2018.icon-1.11\/."},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2019.05.025"},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/78.650093"},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1317"},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1527"},{"key":"e_1_3_2_1_57_1","unstructured":"Hanna Suominen et al. 2013. Overview of the share\/clef ehealth evaluation lab 2013. In Information Access Evaluation. Multilinguality Multimodality and Visualization. Pamela Forner Henning M\u00fcller Roberto Paredes Paolo Rosso and Benno Stein (Eds.) Springer Berlin Heidelberg Berlin Heidelberg 212--231. ISBN: 978-3-642-40802-1."},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-41682-8_26"},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-70552-6_10"},{"key":"e_1_3_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-06555-2_43"},{"key":"e_1_3_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-70549-6_23"},{"key":"e_1_3_2_1_62_1","volume-title":"Tjong Kim Sang and Fien De Meulder","author":"Erik","year":"2003","unstructured":"Erik F. Tjong Kim Sang and Fien De Meulder. 2003. Introduction to the CoNLL-2003 shared task: language-independent named entity recognition. In HLT-NAACL, 142--147. https:\/\/aclanthology.org\/W03-0419."},{"key":"e_1_3_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2018.08.020"},{"key":"e_1_3_2_1_64_1","volume-title":"ICDAR","author":"T\u00fcselmann Oliver","year":"2021","unstructured":"Oliver T\u00fcselmann, Fabian Wolf, and Gernot A. Fink. 2021. Are end-to-end systems really necessary for ner on handwritten document images? In ICDAR 2021. Josep Llad\u00f3s, Daniel Lopresti, and Seiichi Uchida, (Eds.) Springer, Cham, 808--822. ISBN: 978-3-030-86331-9."},{"key":"e_1_3_2_1_65_1","volume-title":"Proceedings of NIPS'17","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of NIPS'17. Curran Associates Inc., Long Beach, California, USA, 6000--6010. ISBN: 9781510860964."},{"key":"e_1_3_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-41676-7_15"},{"key":"e_1_3_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-41682-8_1"},{"key":"e_1_3_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-70536-6_12"},{"key":"e_1_3_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/321796.321811"}],"event":{"name":"DocEng '25: ACM Symposium on Document Engineering 2025","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"],"location":"Nottingham United Kingdom","acronym":"DocEng '25"},"container-title":["Proceedings of the 2025 ACM Symposium on Document Engineering"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3704268.3742699","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,14]],"date-time":"2025-10-14T18:26:28Z","timestamp":1760466388000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3704268.3742699"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,27]]},"references-count":69,"alternative-id":["10.1145\/3704268.3742699","10.1145\/3704268"],"URL":"https:\/\/doi.org\/10.1145\/3704268.3742699","relation":{},"subject":[],"published":{"date-parts":[[2025,8,27]]},"assertion":[{"value":"2025-08-27","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}