{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T22:25:03Z","timestamp":1759962303827,"version":"3.37.3"},"reference-count":34,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2021,2,27]],"date-time":"2021-02-27T00:00:00Z","timestamp":1614384000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,2,27]],"date-time":"2021-02-27T00:00:00Z","timestamp":1614384000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100000781","name":"European Research Council","doi-asserted-by":"publisher","award":["679528"],"award-info":[{"award-number":["679528"]}],"id":[{"id":"10.13039\/501100000781","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Lang Resources &amp; Evaluation"],"published-print":{"date-parts":[[2021,6]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Medieval documents are a rich source of historical data. Performing named-entity recognition (NER) on this genre of texts can provide us with valuable historical evidence. However, traditional NER categories and schemes are usually designed with modern documents in mind (i.e. journalistic text) and the general-domain NER annotation schemes fail to capture the nature of medieval entities. 
In this paper we explore the challenges of performing named-entity annotation on a corpus of Spanish medieval documents: we discuss the mismatches that arise when applying traditional NER categories to a corpus of Spanish medieval documents and we propose a novel humanist-friendly TEI-compliant annotation scheme and guidelines intended to capture the particular nature of medieval entities.<\/jats:p>","DOI":"10.1007\/s10579-020-09516-2","type":"journal-article","created":{"date-parts":[[2021,2,27]],"date-time":"2021-02-27T12:02:56Z","timestamp":1614427376000},"page":"525-549","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["TEI-friendly annotation scheme for medieval named entities: a case on a Spanish medieval corpus"],"prefix":"10.1007","volume":"55","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5952-6902","authenticated-orcid":false,"given":"Elena","family":"\u00c1lvarez-Mellado","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mar\u00eda Luisa","family":"D\u00edez-Platas","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pablo","family":"Ruiz-Fabo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Helena","family":"Berm\u00fadez","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Salvador","family":"Ros","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Elena","family":"Gonz\u00e1lez-Blanco","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,2,27]]},"reference":[{"issue":"4","key":"9516_CR1","doi-asserted-by":"publisher","first-page":"555","DOI":"10.1162\/coli.07-034-R2","volume":"34","author":"R 
Artstein","year":"2008","unstructured":"Artstein, R., & Poesio, M. (2008). Inter-coder agreement for computational linguistics. Computational Linguistics, 34(4), 555\u2013596.","journal-title":"Computational Linguistics"},{"key":"9516_CR2","unstructured":"Bayerl, P. S., L\u00fcngen, H., Gut, U., Paul, K. I. (2003). Methodology for reliable schema development and evaluation of manual annotations. In Proceedings of the Workshop on Knowledge Markup and Semantic Annotation at the Second International Conference on Knowledge Capture (K-CAP 2003), pp. 17\u201323."},{"issue":"2","key":"9516_CR3","first-page":"249","volume":"22","author":"J Carletta","year":"1996","unstructured":"Carletta, J. (1996). Assessing agreement on classification tasks: The kappa statistic. Computational Linguistics, 22(2), 249\u2013254.","journal-title":"Computational Linguistics"},{"key":"9516_CR4","unstructured":"Chinchor, N. A. (1998). Proceedings of the Seventh Message Understanding Conference (MUC-7) named entity task definition. In Proceedings of the Seventh Message Understanding Conference (MUC-7), 21 pages, Fairfax, VA. Version 3.5, http:\/\/www.itl.nist.gov\/iaui\/894.02\/related_projects\/muc\/."},{"issue":"2","key":"9516_CR5","doi-asserted-by":"publisher","first-page":"307","DOI":"10.1007\/s10579-013-9255-y","volume":"48","author":"B Desmet","year":"2014","unstructured":"Desmet, B., & Hoste, V. (2014). Fine-grained Dutch named entity recognition. Language Resources and Evaluation, 48(2), 307\u2013343.","journal-title":"Language Resources and Evaluation"},{"key":"9516_CR6","doi-asserted-by":"crossref","unstructured":"D\u00edez Platas, M.L., Ros Mu\u00f1oz, S., Gonz\u00e1lez-Blanco, E., Ruiz Fabo, P., \u00c1lvarez Mellado, E. (2020). Medieval Spanish (12th\u201315th centuries) named entity recognition and attribute annotation system based on contextual information. 
Journal of the Association for Information Science and Technology.","DOI":"10.1002\/asi.24399"},{"key":"9516_CR7","unstructured":"D\u00edez Platas, M.L., Tobarra, L., Ros Mu\u00f1oz, S., Gonz\u00e1lez-Blanco Garc\u00eda, E., Robles-G\u00f3mez, A., Caminero, A., Rio Riande, G. d. (2017). Hispanic medieval tagger (hismetag): una aplicaci\u00f3n web para el etiquetado de entidades en textos medievales. http:\/\/doi.org\/10.5281\/zenodo.1123416 [Accessed 29\/05\/2019]."},{"key":"9516_CR8","doi-asserted-by":"crossref","unstructured":"Fort, K., Ehrmann, M., Nazarenko, A. (2009). Towards a methodology for named entities annotation. In Proceedings of the Third Linguistic Annotation Workshop, ACL-IJCNLP \u201909, pages 142\u2013145, Stroudsburg, PA, USA. Association for Computational Linguistics.","DOI":"10.3115\/1698381.1698406"},{"key":"9516_CR9","doi-asserted-by":"crossref","unstructured":"Frontini, F., Brando, C., Riguet, M., Jacquot, C., Jolivet, V. (2016). Annotation of toponyms in TEI digital literary editions and linking to the web of data. MATLIT: Materialidades da Literatura, 4(2):49\u201375.","DOI":"10.14195\/2182-8830_4-2_3"},{"key":"9516_CR10","doi-asserted-by":"crossref","unstructured":"Grishman, R., Sundheim, B. (1996). Message understanding conference-6: A brief history. In Proceedings of the 16th Conference on Computational Linguistics - Volume 1, COLING \u201996, pages 466\u2013471, Stroudsburg, PA, USA. Association for Computational Linguistics.","DOI":"10.3115\/992628.992709"},{"issue":"1","key":"9516_CR11","first-page":"13","volume":"22","author":"E Hovy","year":"2010","unstructured":"Hovy, E., & Lavid, J. (2010). Towards a \u2018science\u2019 of corpus annotation: a new methodological challenge for corpus linguistics. 
International Journal of Translation, 22(1), 13\u201336.","journal-title":"International Journal of Translation"},{"issue":"3\u20134","key":"9516_CR12","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1017\/S135132490400350X","volume":"10","author":"N Ide","year":"2004","unstructured":"Ide, N., & Romary, L. (2004). International standard for a linguistic annotation framework. Natural Language Engineering, 10(3\u20134), 211\u2013225.","journal-title":"Natural Language Engineering"},{"key":"9516_CR13","doi-asserted-by":"crossref","unstructured":"Isaksen, L., Simon, R., Barker, E.T., de Soto Ca\u00f1amares, P. (2014). Pelagios and the emerging graph of ancient world data. In Proceedings of the 2014 ACM Conference on Web Science, WebSci \u201914, pp. 197\u2013201, New York, NY, USA. ACM.","DOI":"10.1145\/2615569.2615693"},{"key":"9516_CR14","first-page":"5","volume":"4","author":"FG Jover","year":"2015","unstructured":"Jover, F. G. (2015). La biblioteca digital de textos del espa\u00f1ol antiguo (bidtea). Scriptum digital. Revista de Corpus Diacr\u00f2nics i Edici\u00f3 Digital en Lleng\u00fces iberorom\u00e0niques, 4, 5\u201336.","journal-title":"Revista de Corpus Diacr\u00f2nics i Edici\u00f3 Digital en Lleng\u00fces iberorom\u00e0niques"},{"key":"9516_CR15","volume-title":"Content analysis: An introduction to methodology","author":"K Krippendorff","year":"1980","unstructured":"Krippendorff, K. (1980). Content analysis: An introduction to methodology. Beverly Hills, CA: Sage."},{"issue":"1","key":"9516_CR16","doi-asserted-by":"publisher","first-page":"159","DOI":"10.2307\/2529310","volume":"33","author":"J Landis","year":"1977","unstructured":"Landis, J., & Koch, G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33(1), 159\u2013174.","journal-title":"Biometrics"},{"key":"9516_CR17","unstructured":"Linguistic Data Consortium (2005). 
ACE (Automatic Content Extraction) English annotation guidelines for entities."},{"key":"9516_CR18","first-page":"3","volume":"782","author":"H Maraoui","year":"2018","unstructured":"Maraoui, H., Haddar, K., & Romary, L. (2018). Encoding prototype of al-hadith al-shareef in TEI. CoRR, 782, 3\u201326.","journal-title":"CoRR"},{"key":"9516_CR19","unstructured":"Markert, K., Nissim, M., Place, B. (2002). Towards a corpus annotated for metonymies: the case of location names. In Proc. of the 3rd International Conference on Language Resources and Evaluation; Las Palmas, Canary Islands, pp. 1385\u20131392."},{"key":"9516_CR20","doi-asserted-by":"crossref","unstructured":"Murray, J. (2017). Family life in the middle ages. https:\/\/doi.org\/10.1093\/obo\/9780195396584-0236 [Accessed 29\/05\/2019].","DOI":"10.1093\/obo\/9780195396584-0236"},{"issue":"1","key":"9516_CR21","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1075\/li.30.1.03nad","volume":"30","author":"D Nadeau","year":"2007","unstructured":"Nadeau, D., & Sekine, S. (2007). A survey of named entity recognition and classification. Linguisticae Investigationes, 30(1), 3\u201326.","journal-title":"Linguisticae Investigationes"},{"key":"9516_CR22","unstructured":"Pettersson, E., Megyesi, B., Tiedemann, J. (2013). An SMT approach to automatic annotation of historical text. In Proceedings of the workshop on computational historical linguistics at NODALIDA 2013; May 22-24; 2013; Oslo; Norway. NEALT Proceedings Series 18, number 087, pp. 54\u201369. Link\u00f6ping University Electronic Press."},{"key":"9516_CR23","doi-asserted-by":"publisher","DOI":"10.4324\/9781315577227","volume-title":"Digital scholarly editing: Theories, models and methods","author":"E Pierazzo","year":"2016","unstructured":"Pierazzo, E. (2016). Digital scholarly editing: Theories, models and methods. London: Routledge."},{"key":"9516_CR24","doi-asserted-by":"crossref","unstructured":"Piotrowski, M. (2012). 
Natural Language Processing for Historical Texts. Synthesis Lectures on Human Language Technologies: Morgan & Claypool Publishers.","DOI":"10.2200\/S00436ED1V01Y201207HLT017"},{"key":"9516_CR25","unstructured":"Plank, B. (2016). What to do about non-standard (or non-canonical) language in NLP. CoRR, abs\/1608.07836."},{"key":"9516_CR26","unstructured":"Poibeau, T. (2006). Dealing with metonymic readings of named entities. In Proceedings of the Annual Meeting of the Cognitive Science Society, volume\u00a028."},{"key":"9516_CR27","doi-asserted-by":"crossref","unstructured":"Poibeau, T., Kosseim, L. (2001). Proper name extraction from non-journalistic texts. Computational Linguistics in the Netherlands, 144\u2013157.","DOI":"10.1163\/9789004333901_011"},{"key":"9516_CR28","unstructured":"Pustejovsky, J., Stubbs, A. (2012). Natural Language Annotation for Machine Learning: A guide to corpus-building for applications. O\u2019Reilly Media, Inc."},{"key":"9516_CR29","unstructured":"S\u00e1nchez-Marco, C., Boleda, G., Padr\u00f3, L. (2011). Extending the tool, or how to annotate historical language varieties. In Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, LaTeCH \u201911, pp. 1\u20139, Stroudsburg, PA, USA. Association for Computational Linguistics."},{"key":"9516_CR30","unstructured":"Sekine, S. (2003). Sekine\u2019s Extended Named Entity Hierarchy. Retrieved April 14, 2018, from http:\/\/nlp.cs.nyu.edu\/ene\/."},{"key":"9516_CR31","unstructured":"Sekine, S., Sudo, K., Nobata, C. (2002). Extended named entity hierarchy. In Proceedings of the Third International Conference on Language Resources and Evaluation (LREC-2002); Las Palmas, Canary Islands."},{"issue":"1","key":"9516_CR32","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1007\/s10579-011-9164-x","volume":"46","author":"M Stede","year":"2012","unstructured":"Stede, M., & Huang, C.-R. (2012). 
Inter-operability and reusability: The science of annotation. Language Resources and Evaluation, 46(1), 91\u201394.","journal-title":"Language Resources and Evaluation"},{"key":"9516_CR33","unstructured":"Text Encoding Initiative Consortium. (2008). TEI P5: Guidelines for electronic text encoding and interchange. Retrieved April 14, 2018, from http:\/\/www.tei-c.org\/Guidelines\/P5\/."},{"key":"9516_CR34","doi-asserted-by":"crossref","unstructured":"Tjong Kim Sang, E. F., De Meulder, F. (2003). Introduction to the conll-2003 shared task: Language-independent named entity recognition. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003-Volume 4, pp. 142\u2013147. Association for Computational Linguistics.","DOI":"10.3115\/1119176.1119195"}],"container-title":["Language Resources and Evaluation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-020-09516-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10579-020-09516-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-020-09516-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,5,24]],"date-time":"2021-05-24T12:19:23Z","timestamp":1621858763000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10579-020-09516-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,27]]},"references-count":34,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2021,6]]}},"alternative-id":["9516"],"URL":"https:\/\/doi.org\/10.1007\/s10579-020-09516-2","relation":{},"ISSN":["1574-020X","1574-0218"],"issn-type":[{"type":"print","value":"1574-020X"},{"type":"electronic"
,"value":"1574-0218"}],"subject":[],"published":{"date-parts":[[2021,2,27]]},"assertion":[{"value":"28 October 2020","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 February 2021","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}