{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T11:24:41Z","timestamp":1772969081520,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":27,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,11,2]],"date-time":"2021-11-02T00:00:00Z","timestamp":1635811200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,11,2]]},"DOI":"10.1145\/3486187.3490206","type":"proceedings-article","created":{"date-parts":[[2021,11,2]],"date-time":"2021-11-02T10:06:15Z","timestamp":1635847575000},"page":"13-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Spatial Named Entity Recognition in Literary Texts"],"prefix":"10.1145","author":[{"given":"Caroline","family":"Koudoro-Parfait","sequence":"first","affiliation":[{"name":"Sorbonne Universit\u00e9, Paris, France"}]},{"given":"Ga\u00ebl","family":"Lejeune","sequence":"additional","affiliation":[{"name":"Sorbonne Universit\u00e9, Paris, France"}]},{"given":"Glenn","family":"Roe","sequence":"additional","affiliation":[{"name":"Sorbonne Universit\u00e9, Paris, France"}]}],"member":"320","published-online":{"date-parts":[[2021,11,2]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"(2021). COST Action Distant Reading for European Literary History.  (2021). COST Action Distant Reading for European Literary History."},{"key":"e_1_3_2_1_2_1","volume-title":"11th Conference on Natural Language Processing, KONVENS 2012, Empirical Methods in Natural Language Processing","volume":"5","author":"Alex B.","year":"2012","unstructured":"Alex , B. , Grover , C. , Klein , E. , and Tobin , R . ( 2012 ). Digitised historical text: Does it have to be mediocre? In Jancsary, J., editor , 11th Conference on Natural Language Processing, KONVENS 2012, Empirical Methods in Natural Language Processing , Vienna, Austria , September 19-21, 2012, volume 5 of Scientific series of the \u00d6GAI, pages 401--409. \u00d6GAI, Wien, \u00d3sterreich. Alex, B., Grover, C., Klein, E., and Tobin, R. (2012). Digitised historical text: Does it have to be mediocre? In Jancsary, J., editor, 11th Conference on Natural Language Processing, KONVENS 2012, Empirical Methods in Natural Language Processing, Vienna, Austria, September 19-21, 2012, volume 5 of Scientific series of the \u00d6GAI, pages 401--409. \u00d6GAI, Wien, \u00d3sterreich."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.conll-1.35"},{"key":"e_1_3_2_1_4_1","first-page":"14","volume-title":"R.","author":"Buscaldi D.","year":"2020","unstructured":"Buscaldi , D. , Felhi , G. , Ghoul , D. , Le Roux , J. , Lejeune , G. , and Zhang , X . ( 2020 ). Calcul de similarit\u00e9 entre phrases: quelles mesures et quels descripteurs ? In Cardon , R. , Grabar, N., Grouin, C., and Hamon, T., editors, 6e conf\u00e9rence conjointe Journ\u00e9es d'\u00c9tudes sur la Parole (JEP, 33e \u00e9dition), Traitement Automatique des Langues Naturelles (TALN, 27e \u00e9dition), Rencontre des \u00c9tudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (R\u00c9CITAL, 22e \u00e9dition). Atelier D\u00c9fi Fouille de Textes, pages 14 -- 25 , Nancy, France. ATALA. Buscaldi, D., Felhi, G., Ghoul, D., Le Roux, J., Lejeune, G., and Zhang, X. (2020). Calcul de similarit\u00e9 entre phrases: quelles mesures et quels descripteurs ? In Cardon, R., Grabar, N., Grouin, C., and Hamon, T., editors, 6e conf\u00e9rence conjointe Journ\u00e9es d'\u00c9tudes sur la Parole (JEP, 33e \u00e9dition), Traitement Automatique des Langues Naturelles (TALN, 27e \u00e9dition), Rencontre des \u00c9tudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (R\u00c9CITAL, 22e \u00e9dition). Atelier D\u00c9fi Fouille de Textes, pages 14--25, Nancy, France. ATALA."},{"key":"e_1_3_2_1_5_1","volume-title":"2017 ACM\/IEEE Joint Conference on Digital Libraries (JCDL), 2017 ACM\/IEEE Joint Conference on Digital Libraries (JCDL)","author":"Chiron G.","unstructured":"Chiron , G. , Doucet , A. , Coustaty , M. , Visani , M. , and Moreux , J . -P. (2017). Impact of OCR errors on the use of digital libraries Towards a better access to information . In 2017 ACM\/IEEE Joint Conference on Digital Libraries (JCDL), 2017 ACM\/IEEE Joint Conference on Digital Libraries (JCDL) , Toronto, Canada. IEEE. Chiron, G., Doucet, A., Coustaty, M., Visani, M., and Moreux, J.-P. (2017). Impact of OCR errors on the use of digital libraries Towards a better access to information. In 2017 ACM\/IEEE Joint Conference on Digital Libraries (JCDL), 2017 ACM\/IEEE Joint Conference on Digital Libraries (JCDL), Toronto, Canada. IEEE."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-019-00347-8"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58219-7_21"},{"key":"e_1_3_2_1_8_1","volume-title":"OCR17: Ground Truth and Models for 17th c. French Prints (and hopefully more). working paper or preprint","author":"Gabay S.","year":"2020","unstructured":"Gabay , S. , Cl\u00e9rice , T. , and Reul , C . ( 2020 ). OCR17: Ground Truth and Models for 17th c. French Prints (and hopefully more). working paper or preprint . Gabay, S., Cl\u00e9rice, T., and Reul, C. (2020). OCR17: Ground Truth and Models for 17th c. French Prints (and hopefully more). working paper or preprint."},{"key":"e_1_3_2_1_9_1","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"29","author":"Gupta A.","year":"2015","unstructured":"Gupta , A. , Gutierrez-Osuna , R. , Christy , M. , Capitanu , B. , Auvil , L. , Grumbach , L. , Furuta , R. , and Mandell , L . ( 2015 ). Automatic assessment of ocr quality in historical documents . In Proceedings of the AAAI Conference on Artificial Intelligence , volume 29 . Gupta, A., Gutierrez-Osuna, R., Christy, M., Capitanu, B., Auvil, L., Grumbach, L., Furuta, R., and Mandell, L. (2015). Automatic assessment of ocr quality in historical documents. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 29."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-54956-5_7"},{"key":"e_1_3_2_1_11_1","volume-title":"spaCy: Industrial-strength Natural Language Processing in Python","author":"Honnibal M.","year":"2020","unstructured":"Honnibal , M. , Montani , I. , Van Landeghem , S. , and Boyd , A . ( 2020 ). spaCy: Industrial-strength Natural Language Processing in Python . Honnibal, M., Montani, I., Van Landeghem, S., and Boyd, A. (2020). spaCy: Industrial-strength Natural Language Processing in Python."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-64452-9_3"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.18352\/lq.10322"},{"key":"e_1_3_2_1_14_1","volume-title":"Old content and modern tools - searching named entities in a finnish ocred historical newspaper collection 1771--1910. CoRR, abs\/1611.02839","author":"Kettunen K.","year":"2016","unstructured":"Kettunen , K. , M\u00e4kel\u00e4 , E. , Ruokolainen , T. , Kuokkala , J. , and L\u00f6fberg , L . ( 2016 ). Old content and modern tools - searching named entities in a finnish ocred historical newspaper collection 1771--1910. CoRR, abs\/1611.02839 . Kettunen, K., M\u00e4kel\u00e4, E., Ruokolainen, T., Kuokkala, J., and L\u00f6fberg, L. (2016). Old content and modern tools - searching named entities in a finnish ocred historical newspaper collection 1771--1910. CoRR, abs\/1611.02839."},{"key":"e_1_3_2_1_15_1","first-page":"19","volume-title":"2019 International Conference on Document Analysis and Recognition Workshops (ICDARW)","volume":"2","author":"Kiessling B.","unstructured":"Kiessling , B. , Tissot , R. , Stokes , P. , and Ezra , D. S. B. (2019). escriptorium: An open source platform for historical document analysis . In 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW) , volume 2 , pages 19 -- 19 . IEEE. Kiessling, B., Tissot, R., Stokes, P., and Ezra, D. S. B. (2019). escriptorium: An open source platform for historical document analysis. In 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW), volume 2, pages 19--19. IEEE."},{"key":"e_1_3_2_1_16_1","volume-title":"A new proposal for evaluating web page cleaning tools. Computacion y Sistemas, 22(4):1249--1258","author":"Lejeune G.","year":"2018","unstructured":"Lejeune , G. and Zhu , L . ( 2018 ). A new proposal for evaluating web page cleaning tools. Computacion y Sistemas, 22(4):1249--1258 . Lejeune, G. and Zhu, L. (2018). A new proposal for evaluating web page cleaning tools. Computacion y Sistemas, 22(4):1249--1258."},{"key":"e_1_3_2_1_17_1","volume-title":"Impact of OCR Quality on Named Entity Linking. In International Conference on Asia-Pacific Digital Libraries","author":"Linhares Pontes E.","year":"2019","unstructured":"Linhares Pontes , E. , Hamdi , A. , Sid\u00e8re , N. , and Doucet , A . ( 2019 ). Impact of OCR Quality on Named Entity Linking. In International Conference on Asia-Pacific Digital Libraries 2019, Kuala Lumpur, Malaysia. Linhares Pontes, E., Hamdi, A., Sid\u00e8re, N., and Doucet, A. (2019). Impact of OCR Quality on Named Entity Linking. In International Conference on Asia-Pacific Digital Libraries 2019, Kuala Lumpur, Malaysia."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-009-0094-8"},{"key":"e_1_3_2_1_19_1","volume-title":"Proceedings of the 4th Workshop on Natural Language for Artificial Intelligence (NL4AI), 19th International Conference of the Italian Association for Artificial Intelligence","author":"Nguyen N. K.","year":"2020","unstructured":"Nguyen , N. K. , Boros , E. , Lejeune , G. , and Doucet , A . ( 2020 ). Impact analysis of document digitization on event extraction . In Proceedings of the 4th Workshop on Natural Language for Artificial Intelligence (NL4AI), 19th International Conference of the Italian Association for Artificial Intelligence , page to appear, Roma, Italy. -. Nguyen, N. K., Boros, E., Lejeune, G., and Doucet, A. (2020). Impact analysis of document digitization on event extraction. In Proceedings of the 4th Workshop on Natural Language for Artificial Intelligence (NL4AI), 19th International Conference of the Italian Association for Artificial Intelligence, page to appear, Roma, Italy. -."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6379"},{"key":"e_1_3_2_1_21_1","volume-title":"Unicase-rethinking casing in language models. arXiv preprint arXiv:2010.11936","author":"Powalski R.","year":"2020","unstructured":"Powalski , R. and Stanislawek , T . ( 2020 ). Unicase-rethinking casing in language models. arXiv preprint arXiv:2010.11936 . Powalski, R. and Stanislawek, T. (2020). Unicase-rethinking casing in language models. arXiv preprint arXiv:2010.11936."},{"key":"e_1_3_2_1_22_1","volume-title":"Stanza: A python natural language processing toolkit for many human languages","author":"Qi P.","year":"2020","unstructured":"Qi , P. , Zhang , Y. , Zhang , Y. , Bolton , J. , and Manning , C. D . ( 2020 ). Stanza: A python natural language processing toolkit for many human languages . Qi, P., Zhang, Y., Zhang, Y., Bolton, J., and Manning, C. D. (2020). Stanza: A python natural language processing toolkit for many human languages."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2007.4376991"},{"key":"e_1_3_2_1_24_1","volume-title":"CLEF 2020 Working Notes. Working Notes of CLEF 2020-Conference and Labs of the Evaluation Forum.","author":"Su\u00e1rez P. J. O.","year":"2020","unstructured":"Su\u00e1rez , P. J. O. , Dupont , Y. , Lejeune , G. , and Tian , T . ( 2020 ). Sinner\u00a9 clef-hipe2020: Sinful adaptation of sota models for named entity recognition in french and german . In CLEF 2020 Working Notes. Working Notes of CLEF 2020-Conference and Labs of the Evaluation Forum. Su\u00e1rez, P. J. O., Dupont, Y., Lejeune, G., and Tian, T. (2020). Sinner\u00a9 clef-hipe2020: Sinful adaptation of sota models for named entity recognition in french and german. In CLEF 2020 Working Notes. Working Notes of CLEF 2020-Conference and Labs of the Evaluation Forum."},{"key":"e_1_3_2_1_25_1","series-title":"Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics","first-page":"252","volume-title":"Impact analysis of OCR quality on research tasks in digital archives","author":"Traub M.","year":"2015","unstructured":"Traub , M. and Van Ossenbruggen , J. H. L. ( 2015 ). Impact analysis of OCR quality on research tasks in digital archives . In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics , pages pp. 252 -- 263 . Traub, M. and Van Ossenbruggen, J. H. L. (2015). Impact analysis of OCR quality on research tasks in digital archives. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, pages pp. 252--263."},{"key":"e_1_3_2_1_26_1","volume-title":"Assessing the Impact of OCR Quality on Downstream NLP Tasks. In In Proceedings of the 12th International Conference on Agents and Artificial Intelligence -","volume":"484","author":"van Strien D.","year":"2020","unstructured":"van Strien , D. , Beelen , K. , Ardanuy , M. , Hosseini , K. , McGillivray , B. , and Colavizza , G . ( 2020 ). Assessing the Impact of OCR Quality on Downstream NLP Tasks. In In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: ARTIDIGH, pages 484 - 496. van Strien, D., Beelen, K., Ardanuy, M., Hosseini, K., McGillivray, B., and Colavizza, G. (2020). Assessing the Impact of OCR Quality on Downstream NLP Tasks. In In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: ARTIDIGH, pages 484 - 496."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.36181\/digitalia-00015"}],"event":{"name":"SIGSPATIAL '21: 29th International Conference on Advances in Geographic Information Systems","location":"Beijing China","acronym":"SIGSPATIAL '21","sponsor":["SIGSPATIAL ACM Special Interest Group on Spatial Information"]},"container-title":["Proceedings of the 5th ACM SIGSPATIAL International Workshop on Geospatial Humanities"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3486187.3490206","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3486187.3490206","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:12:07Z","timestamp":1750191127000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3486187.3490206"}},"subtitle":["What is the Influence of OCR Noise?"],"short-title":[],"issued":{"date-parts":[[2021,11,2]]},"references-count":27,"alternative-id":["10.1145\/3486187.3490206","10.1145\/3486187"],"URL":"https:\/\/doi.org\/10.1145\/3486187.3490206","relation":{},"subject":[],"published":{"date-parts":[[2021,11,2]]},"assertion":[{"value":"2021-11-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}