{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T12:44:31Z","timestamp":1770986671999,"version":"3.50.1"},"reference-count":51,"publisher":"Cambridge University Press (CUP)","issue":"3","license":[{"start":{"date-parts":[[2016,7,12]],"date-time":"2016-07-12T00:00:00Z","timestamp":1468281600000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2017,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This paper presents the improvement process of a mention detector for Basque. The system is rule-based and takes into account the characteristics of mentions in Basque. A classification of error types is proposed based on the errors that occur during mention detection. A deep error analysis distinguishing error types and causes is presented and improvements are proposed. At the final stage, the system obtains an F-measure of 74.57% under the Exact Matching protocol and of 80.57% under Lenient Matching. We also show the performance of the mention detector with gold standard data as input, in order to omit errors caused by the previous stages of linguistic processing. In this scenario, we obtain an F-measure of 85.89% with Strict Matching and of 89.06% with Lenient Matching, i.e., a difference of 11.32 and 8.49 percentage points, respectively. Finally, how improvements in mention detection affect coreference resolution is analysed.<\/jats:p>","DOI":"10.1017\/s1351324916000206","type":"journal-article","created":{"date-parts":[[2016,7,12]],"date-time":"2016-07-12T07:09:30Z","timestamp":1468307370000},"page":"351-384","source":"Crossref","is-referenced-by-count":3,"title":["Improving mention detection for Basque based on a deep error analysis"],"prefix":"10.1017","volume":"23","author":[{"given":"ANDER","family":"SORALUZE","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"OLATZ","family":"ARREGI","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"XABIER","family":"ARREGI","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"ARANTZA","family":"D\u00cdAZ DE ILARRAZA","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"56","published-online":{"date-parts":[[2016,7,12]]},"reference":[{"key":"S1351324916000206_ref001","first-page":"1","article-title":"Methodology and steps towards the construction of EPEC, a corpus of written Basque tagged at morphological and syntactic levels for the automatic processing","volume":"56","author":"Aduriz","year":"2006","journal-title":"Language and Computers"},{"key":"S1351324916000206_ref036","first-page":"1","volume-title":"Proceedings of the 16th Conference on Computational Natural Language Learning (CoNLL 2012)","author":"Pradhan","year":"2012"},{"key":"S1351324916000206_ref006","first-page":"198","volume-title":"II Jornadas de Tratamiento y Recuperaci\u00f3n de Informaci\u00f3n (JOTRI 2003)","author":"Alegria","year":"2003"},{"key":"S1351324916000206_ref027","first-page":"335","volume-title":"Proceedings of the 6th Message Understanding Conference (MUC-6)","year":"1995"},{"key":"S1351324916000206_ref031","first-page":"2408","volume-title":"Proceedings of the International Conference on Language Resources and Evaluation (LREC 2008)","author":"Nguyen","year":"2008"},{"key":"S1351324916000206_ref018","first-page":"102","volume-title":"Proceedings of the 15th Conference on Computational Natural Language Learning: Shared Task (CoNLL 2011)","author":"Kummerfeld","year":"2011"},{"key":"S1351324916000206_ref017","volume-title":"Advanced Approaches to Intelligent Information and Database Systems","author":"Kope\u0107","year":"2014"},{"key":"S1351324916000206_ref047","first-page":"100","volume-title":"Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval 2010)","author":"Uryupina","year":"2010"},{"key":"S1351324916000206_ref038","first-page":"30","volume-title":"Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014)","author":"Pradhan","year":"2014"},{"key":"S1351324916000206_ref013","doi-asserted-by":"publisher","DOI":"10.3115\/1220575.1220623"},{"key":"S1351324916000206_ref040","doi-asserted-by":"publisher","DOI":"10.1017\/S135132491000029X"},{"key":"S1351324916000206_ref050","first-page":"9","volume-title":"Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies","author":"Versley","year":"2008"},{"key":"S1351324916000206_ref028","volume-title":"Proceedings of the 7th Message Understanding Conference (MUC-7)","year":"1998"},{"key":"S1351324916000206_ref043","doi-asserted-by":"publisher","DOI":"10.1162\/089120101753342653"},{"key":"S1351324916000206_ref016","first-page":"423","volume-title":"Proceedings of the 41st Annual Meeting on Association for Computational Linguistics (ACL \u201903)","author":"Klein","year":"2003"},{"key":"S1351324916000206_ref007","unstructured":"Arrieta B. 2010. Azaleko sintaxiaren tratamendua ikasketa automatikoko tekniken bidez: euskarako kateen eta perpausen identifikazioa eta bere erabilera koma-zuzentzaile batean. PhD Thesis, Computer Languages and Systems, University of the Basque Country, Donostia-San Sebasti\u00e1n, Spain."},{"key":"S1351324916000206_ref003","first-page":"48","volume-title":"ACL Workshop on Multiword Expressions (MWE \u201904)","author":"Alegria","year":"2004"},{"key":"S1351324916000206_ref029","unstructured":"NIST 2008. Automatic Content Extraction 2008 Evaluation Plan (ACE08)."},{"key":"S1351324916000206_ref010","first-page":"104","volume-title":"Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval 2010)","author":"Broscheit","year":"2010"},{"key":"S1351324916000206_ref023","doi-asserted-by":"publisher","DOI":"10.3115\/1220575.1220579"},{"key":"S1351324916000206_ref002","first-page":"1","volume-title":"Inquiries into the lexicon-syntax relations in Basque","author":"Aduriz","year":"2003"},{"key":"S1351324916000206_ref037","first-page":"1","volume-title":"Proceedings of the 15th Conference on Computational Natural Language Learning: Shared Task (CoNLL 2011)","author":"Pradhan","year":"2011"},{"key":"S1351324916000206_ref035","doi-asserted-by":"publisher","DOI":"10.1109\/ICSC.2007.83"},{"key":"S1351324916000206_ref004","first-page":"1","volume-title":"Implementation and Application of Automata","author":"Alegria","year":"2002"},{"key":"S1351324916000206_ref005","doi-asserted-by":"publisher","DOI":"10.1093\/llc\/11.4.193"},{"key":"S1351324916000206_ref008","doi-asserted-by":"publisher","DOI":"10.1162\/coli.07-034-R2"},{"key":"S1351324916000206_ref009","first-page":"563","volume-title":"Proceedings of the 1st International Conference on Language Resources and Evaluation Workshop on Linguistics Coreference","author":"Bagga","year":"1998"},{"key":"S1351324916000206_ref034","volume-title":"Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC\u201908)","author":"Orasan","year":"2008"},{"key":"S1351324916000206_ref022","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00152"},{"key":"S1351324916000206_ref011","first-page":"40","volume-title":"Proceedings of the 15th Conference on Computational Natural Language Learning: Shared Task (CoNLL 2011)","author":"Chang","year":"2011"},{"key":"S1351324916000206_ref024","first-page":"313","article-title":"Building a large annotated corpus of english: the Penn treebank","volume":"19","author":"Marcus","year":"1993","journal-title":"Computational Linguistics"},{"key":"S1351324916000206_ref012","first-page":"837","volume-title":"Proceedings of Language Resources and Evaluation Conference (LREC 2004)","author":"Doddington","year":"2004"},{"key":"S1351324916000206_ref014","first-page":"29","volume-title":"Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2009)","author":"Hulden","year":"2009"},{"key":"S1351324916000206_ref015","doi-asserted-by":"publisher","DOI":"10.1515\/9783110882629"},{"key":"S1351324916000206_ref044","first-page":"128","volume-title":"KONVENS 2012, The 11th Conference on Natural Language Processing","author":"Soraluze","year":"2012"},{"key":"S1351324916000206_ref019","first-page":"265","volume-title":"Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013)","author":"Kummerfeld","year":"2011"},{"key":"S1351324916000206_ref020","unstructured":"Laka I. 1996. A brief grammar of Euskara, the Basque language. http:\/\/www.ehu.es\/grammar. University of the Basque Country."},{"key":"S1351324916000206_ref021","first-page":"1824","volume-title":"Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers","author":"Lalitha","year":"2014"},{"key":"S1351324916000206_ref045","first-page":"656","volume-title":"Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP","author":"Stoyanov","year":"2009"},{"key":"S1351324916000206_ref025","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-012-9194-z"},{"key":"S1351324916000206_ref026","volume-title":"Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC\u201908)","author":"Mih\u00e1ltz","year":"2008"},{"key":"S1351324916000206_ref030","first-page":"276","volume-title":"Proceedings of the SIGDIAL 2009 Conference","author":"Ng\u1ee5y","year":"2009"},{"key":"S1351324916000206_ref032","first-page":"167","volume-title":"Proceedings of the 5th Language and Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics","author":"Ogrodniczuk","year":"2011"},{"key":"S1351324916000206_ref049","doi-asserted-by":"publisher","DOI":"10.3115\/1072399.1072405"},{"key":"S1351324916000206_ref033","first-page":"27","volume-title":"Proceedings of the Workshop on Detecting Structure in Scholarly Discourse (ACL\u201912)","author":"Ohta","year":"2008"},{"key":"S1351324916000206_ref039","first-page":"1423","volume-title":"Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL \u201910)","author":"Recasens","year":"2010"},{"key":"S1351324916000206_ref041","first-page":"1","volume-title":"Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval 2010)","author":"Recasens","year":"2010"},{"key":"S1351324916000206_ref042","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-009-9108-x"},{"key":"S1351324916000206_ref046","volume-title":"Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008)","author":"Uryupina","year":"2008"},{"key":"S1351324916000206_ref048","first-page":"100","volume-title":"Proceedings of the 6th International Joint Conference on Natural Language Processing","author":"Uryupina","year":"2013"},{"key":"S1351324916000206_ref051","first-page":"96","volume-title":"Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval 2010)","author":"Zhekova","year":"2010"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324916000206","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,4,17]],"date-time":"2019-04-17T23:01:51Z","timestamp":1555542111000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324916000206\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,7,12]]},"references-count":51,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2017,5]]}},"alternative-id":["S1351324916000206"],"URL":"https:\/\/doi.org\/10.1017\/s1351324916000206","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"value":"1351-3249","type":"print"},{"value":"1469-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,7,12]]}}}