{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T01:03:56Z","timestamp":1775178236664,"version":"3.50.1"},"reference-count":56,"publisher":"MIT Press","license":[{"start":{"date-parts":[[2021,10,12]],"date-time":"2021-10-12T00:00:00Z","timestamp":1633996800000},"content-version":"vor","delay-in-days":284,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,10,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>We take a step towards addressing the under-representation of the African continent in NLP research by bringing together different stakeholders to create the first large, publicly available, high-quality dataset for named entity recognition (NER) in ten African languages. We detail the characteristics of these languages to help researchers and practitioners better understand the challenges they pose for NER tasks. We analyze our datasets and conduct an extensive empirical evaluation of state-of-the-art methods across both supervised and transfer learning settings. 
Finally, we release the data, code, and models to inspire future research on African NLP.<\/jats:p>","DOI":"10.1162\/tacl_a_00416","type":"journal-article","created":{"date-parts":[[2021,10,17]],"date-time":"2021-10-17T23:03:50Z","timestamp":1634511830000},"page":"1116-1131","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":58,"title":["MasakhaNER: Named Entity Recognition for African Languages"],"prefix":"10.1162","volume":"9","author":[{"given":"David Ifeoluwa","family":"Adelani","sequence":"first","affiliation":[{"name":"Spoken Language Systems Group (LSV), Saarland University, Germany"},{"name":"Masakhane NLP"}]},{"given":"Jade","family":"Abbott","sequence":"additional","affiliation":[{"name":"Retro Rabbit, South Africa"},{"name":"Masakhane NLP"}]},{"given":"Graham","family":"Neubig","sequence":"additional","affiliation":[{"name":"Language Technologies Institute, Carnegie Mellon University, United States"}]},{"given":"Daniel","family":"D\u2019souza","sequence":"additional","affiliation":[{"name":"ProQuest, United States"},{"name":"Masakhane NLP"}]},{"given":"Julia","family":"Kreutzer","sequence":"additional","affiliation":[{"name":"Google Research, Canada"},{"name":"Masakhane NLP"}]},{"given":"Constantine","family":"Lignos","sequence":"additional","affiliation":[{"name":"Brandeis University, United States"},{"name":"Masakhane NLP"}]},{"given":"Chester","family":"Palen-Michel","sequence":"additional","affiliation":[{"name":"Brandeis University, United States"},{"name":"Masakhane NLP"}]},{"given":"Happy","family":"Buzaaba","sequence":"additional","affiliation":[{"name":"Graduate School of Systems and Information Engineering, University of Tsukuba, Japan"},{"name":"Masakhane NLP"}]},{"given":"Shruti","family":"Rijhwani","sequence":"additional","affiliation":[{"name":"Language Technologies Institute, Carnegie Mellon University, United 
States"}]},{"given":"Sebastian","family":"Ruder","sequence":"additional","affiliation":[{"name":"DeepMind, United Kingdom"}]},{"given":"Stephen","family":"Mayhew","sequence":"additional","affiliation":[{"name":"Duolingo, United States"}]},{"given":"Israel Abebe","family":"Azime","sequence":"additional","affiliation":[{"name":"African Institute for Mathematical Sciences (AIMS-AMMI), Ethiopia"},{"name":"Masakhane NLP"}]},{"given":"Shamsuddeen H.","family":"Muhammad","sequence":"additional","affiliation":[{"name":"University of Porto, Nigeria"},{"name":"Bayero University, Kano, Nigeria"},{"name":"Masakhane NLP"}]},{"given":"Chris Chinenye","family":"Emezue","sequence":"additional","affiliation":[{"name":"Technical University of Munich, Germany"},{"name":"Masakhane NLP"}]},{"given":"Joyce","family":"Nakatumba-Nabende","sequence":"additional","affiliation":[{"name":"Makerere University, Kampala, Uganda"},{"name":"Masakhane NLP"}]},{"given":"Perez","family":"Ogayo","sequence":"additional","affiliation":[{"name":"African Leadership University, Rwanda"},{"name":"Masakhane NLP"}]},{"given":"Aremu","family":"Anuoluwapo","sequence":"additional","affiliation":[{"name":"University of Lagos, Nigeria"},{"name":"Masakhane NLP"}]},{"given":"Catherine","family":"Gitau","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Derguene","family":"Mbaye","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Jesujoba","family":"Alabi","sequence":"additional","affiliation":[{"name":"Max Planck Institute for Informatics, Germany"},{"name":"Masakhane NLP"}]},{"given":"Seid Muhie","family":"Yimam","sequence":"additional","affiliation":[{"name":"LT Group, Universit\u00e4t Hamburg, Germany"}]},{"given":"Tajuddeen Rabiu","family":"Gwadabe","sequence":"additional","affiliation":[{"name":"University of Chinese Academy of Science, China"},{"name":"Masakhane 
NLP"}]},{"given":"Ignatius","family":"Ezeani","sequence":"additional","affiliation":[{"name":"Lancaster University, United Kingdom"},{"name":"Masakhane NLP"}]},{"given":"Rubungo Andre","family":"Niyongabo","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China, China"},{"name":"Masakhane NLP"}]},{"given":"Jonathan","family":"Mukiibi","sequence":"additional","affiliation":[{"name":"Makerere University, Kampala, Uganda"}]},{"given":"Verrah","family":"Otiende","sequence":"additional","affiliation":[{"name":"United States International University - Africa (USIU-A), Kenya"},{"name":"Masakhane NLP"}]},{"given":"Iroro","family":"Orife","sequence":"additional","affiliation":[{"name":"Niger-Volta LTI"},{"name":"Masakhane NLP"}]},{"given":"Davis","family":"David","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Samba","family":"Ngom","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Tosin","family":"Adewumi","sequence":"additional","affiliation":[{"name":"Luleo University of Technology, Sweden"},{"name":"Masakhane NLP"}]},{"given":"Paul","family":"Rayson","sequence":"additional","affiliation":[{"name":"Lancaster University, United Kingdom"}]},{"given":"Mofetoluwa","family":"Adeyemi","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Gerald","family":"Muriuki","sequence":"additional","affiliation":[{"name":"Makerere University, Kampala, Uganda"}]},{"given":"Emmanuel","family":"Anebi","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Chiamaka","family":"Chukwuneke","sequence":"additional","affiliation":[{"name":"Lancaster University, United Kingdom"}]},{"given":"Nkiruka","family":"Odu","sequence":"additional","affiliation":[{"name":"African University of Science and Technology, Abuja, Nigeria"}]},{"given":"Eric Peter","family":"Wairagala","sequence":"additional","affiliation":[{"name":"Makerere University, Kampala, 
Uganda"}]},{"given":"Samuel","family":"Oyerinde","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Clemencia","family":"Siro","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Tobius Saul","family":"Bateesa","sequence":"additional","affiliation":[{"name":"Makerere University, Kampala, Uganda"}]},{"given":"Temilola","family":"Oloyede","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Yvonne","family":"Wambui","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Victor","family":"Akinode","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Deborah","family":"Nabagereka","sequence":"additional","affiliation":[{"name":"Makerere University, Kampala, Uganda"}]},{"given":"Maurice","family":"Katusiime","sequence":"additional","affiliation":[{"name":"Makerere University, Kampala, Uganda"}]},{"given":"Ayodele","family":"Awokoya","sequence":"additional","affiliation":[{"name":"University of Ibadan, Nigeria"},{"name":"Masakhane NLP"}]},{"given":"Mouhamadane","family":"MBOUP","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Dibora","family":"Gebreyohannes","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Henok","family":"Tilaye","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Kelechi","family":"Nwaike","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Degaga","family":"Wolde","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Abdoulaye","family":"Faye","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Blessing","family":"Sibanda","sequence":"additional","affiliation":[{"name":"Namibia University of Science and Technology, Namibia"},{"name":"Masakhane NLP"}]},{"given":"Orevaoghene","family":"Ahia","sequence":"additional","affiliation":[{"name":"Instadeep, Nigeria"},{"name":"Masakhane 
NLP"}]},{"given":"Bonaventure F. P.","family":"Dossou","sequence":"additional","affiliation":[{"name":"Jacobs University Bremen, Germany"},{"name":"Masakhane NLP"}]},{"given":"Kelechi","family":"Ogueji","sequence":"additional","affiliation":[{"name":"University of Waterloo, Canada"},{"name":"Masakhane NLP"}]},{"given":"Thierno Ibrahima","family":"DIOP","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Abdoulaye","family":"Diallo","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Adewale","family":"Akinfaderin","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Tendai","family":"Marengereke","sequence":"additional","affiliation":[{"name":"Masakhane NLP"}]},{"given":"Salomey","family":"Osei","sequence":"additional","affiliation":[{"name":"African Institute for Mathematical Sciences (AIMS-AMMI), Ethiopia"},{"name":"Masakhane NLP"}]}],"member":"281","published-online":{"date-parts":[[2021,10,7]]},"reference":[{"key":"2021101221392144100_bib1","article-title":"MENYO-20k: A multi-domain english-yor\u00f9b\u00e1 corpus for machine translation and domain adaptation","author":"Adelani","year":"2021","journal-title":"ArXiv"},{"key":"2021101221392144100_bib2","doi-asserted-by":"crossref","first-page":"pages 3204\u2013pages 3210","DOI":"10.18653\/v1\/P19-1310","article-title":"JW300: A wide-coverage parallel corpus for low-resource languages","volume-title":"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics","author":"Agi\u0107","year":"2019"},{"key":"2021101221392144100_bib3","first-page":"2754","article-title":"Massive vs. 
curated embeddings for low-resourced languages: The case of Yor\u00f9b\u00e1 and Twi","volume-title":"Proceedings of the 12th Language Resources and Evaluation Conference","author":"Alabi","year":"2020"},{"key":"2021101221392144100_bib4","first-page":"pages 2524\u2013pages 2531","article-title":"NoSta-D named entity annotation for German: Guidelines and dataset","volume-title":"Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC\u201914)","author":"Benikova","year":"2014"},{"key":"2021101221392144100_bib5","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1162\/tacl_a_00051","article-title":"Enriching word vectors with subword information","volume":"5","author":"Bojanowski","year":"2017","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"2021101221392144100_bib6","article-title":"The geographic diversity of NLP conferences","author":"Caines","year":"2019"},{"key":"2021101221392144100_bib7","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1162\/tacl_a_00104","article-title":"Named entity recognition with bidirectional LSTM-CNNs","volume":"4","author":"Chiu","year":"2016","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"2021101221392144100_bib8","doi-asserted-by":"crossref","first-page":"pages 8440\u2013pages 8451","DOI":"10.18653\/v1\/2020.acl-main.747","article-title":"Unsupervised cross-lingual representation learning at scale","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"Conneau","year":"2020"},{"key":"2021101221392144100_bib9","first-page":"8","article-title":"Unsupervised induction of Dholuo word classes using maximum entropy learning","volume-title":"Proceedings of the First International Computer Science and ICT Conference","author":"Pauw","year":"2007"},{"key":"2021101221392144100_bib10","article-title":"BERT: Pre-training of deep bidirectional 
transformers for language understanding","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Devlin","year":"2019"},{"key":"2021101221392144100_bib11","article-title":"Ethnologue: Languages of the World","author":"Eberhard","year":"2020","edition":"23rd edition"},{"key":"2021101221392144100_bib12","first-page":"3344","article-title":"Government domain named entity recognition for South African languages","volume-title":"Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC\u201916)","author":"Eiselen","year":"2016"},{"key":"2021101221392144100_bib13","doi-asserted-by":"crossref","first-page":"5960","DOI":"10.18653\/v1\/2020.emnlp-main.480","article-title":"CCAligned: A massive collection of cross-lingual web- document pairs","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)","author":"El-Kishky","year":"2020"},{"key":"2021101221392144100_bib14","volume-title":"Elements of Modern Igbo Grammar - a descriptive approach","author":"Emenanjo","year":"1978"},{"key":"2021101221392144100_bib15","doi-asserted-by":"publisher","DOI":"10.1037\/h0031619","article-title":"Igbo- english machine translation: An evaluation benchmark","author":"Ezeani","year":"2020","journal-title":"ArXiv"},{"issue":"5","key":"2021101221392144100_bib16","doi-asserted-by":"crossref","first-page":"378","DOI":"10.1037\/h0031619","article-title":"Measuring nominal scale agreement among many raters.","volume":"76","author":"Fleiss","year":"1971","journal-title":"Psychological Bulletin"},{"key":"2021101221392144100_bib17","first-page":"6058","article-title":"Interpretable multi-dataset evaluation for named entity recognition","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing 
(EMNLP)","author":"Jinlan","year":"2020"},{"key":"2021101221392144100_bib18","article-title":"Official gazette number 41 bis of 13\/10\/2014","author":"Government","year":"2014"},{"key":"2021101221392144100_bib19","doi-asserted-by":"crossref","first-page":"8342","DOI":"10.18653\/v1\/2020.acl-main.740","article-title":"Don\u2019t stop pretraining: Adapt language models to domains and tasks","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"Gururangan","year":"2020"},{"key":"2021101221392144100_bib20","doi-asserted-by":"crossref","first-page":"2580","DOI":"10.18653\/v1\/2020.emnlp-main.204","article-title":"Transfer learning and distant supervision for multilingual transformer models: A study on African languages","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Hedderich","year":"2020"},{"key":"2021101221392144100_bib21","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/P18-1031","article-title":"Universal language model fine-tuning for text classification","volume-title":"Proceedings of ACL 2018","author":"Howard","year":"2018"},{"key":"2021101221392144100_bib22","article-title":"XTREME: A massively multilingual multi-task benchmark for evaluating cross-lingual generalization","volume-title":"Proceedings of ICML 2020","author":"Junjie","year":"2020"},{"key":"2021101221392144100_bib23","article-title":"Bidirectional LSTM-CRF models for sequence tagging","author":"Huang","year":"2015","journal-title":"ArXiv"},{"key":"2021101221392144100_bib24","first-page":"282","article-title":"Conditional random fields: Probabilistic models for segmenting and labeling sequence data","volume-title":"Proceedings of the Eighteenth International Conference on Machine Learning","author":"Lafferty","year":"2001"},{"key":"2021101221392144100_bib25","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/N16-1030","article-title":"Neural architectures for 
named entity recognition","volume-title":"Proceedings of NAACL- HLT 2016","author":"Lample","year":"2016"},{"key":"2021101221392144100_bib26","doi-asserted-by":"crossref","first-page":"4483","DOI":"10.18653\/v1\/2020.emnlp-main.363","article-title":"From zero to hero: On the limitations of zero-shot language transfer with multilingual Transformers","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Lauscher","year":"2020"},{"key":"2021101221392144100_bib27","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18653\/v1\/P18-4001","article-title":"Platforms for non-speakers annotating names in any language","volume-title":"Proceedings of ACL 2018, System Demonstrations","author":"Lin","year":"2018"},{"key":"2021101221392144100_bib28","article-title":"Roberta: A robustly optimized bert pretraining approach","author":"Liu","year":"2019"},{"key":"2021101221392144100_bib29","doi-asserted-by":"crossref","first-page":"1064","DOI":"10.18653\/v1\/P16-1101","article-title":"End-to-end sequence labeling via bi-directional LSTM- CNNs-CRF","volume-title":"Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"Ma","year":"2016"},{"key":"2021101221392144100_bib30","article-title":"A focus on neural machine translation for African languages","author":"Martinus","year":"2019","journal-title":"arXiv preprint arXiv:1906.05685"},{"key":"2021101221392144100_bib31","article-title":"T\u00e9ereb Injiil: La Bible Wolof \u2013 Ancien Testament","author":"","year":"2020"},{"key":"2021101221392144100_bib32","first-page":"3111","article-title":"Distributed representations of words and phrases and their compositionality","volume-title":"Advances in Neural Information Processing 
Systems","author":"Mikolov","year":"2013"},{"key":"2021101221392144100_bib33","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/2020.findings-emnlp.195","article-title":"Participatory research for low-resourced machine translation: A case study in African languages","volume-title":"Findings of the Association for Computational Linguistics: EMNLP 2020","author":"Nekoto","year":"2020"},{"key":"2021101221392144100_bib34","article-title":"Dynet: The dynamic neural network toolkit","author":"Neubig","year":"2017","journal-title":"ArXiv"},{"key":"2021101221392144100_bib35","doi-asserted-by":"crossref","first-page":"5507","DOI":"10.18653\/v1\/2020.coling-main.480","article-title":"KINNEWS and KIRNEWS: Benchmarking cross-lingual text classification for Kinyarwanda and Kirundi","volume-title":"Proceedings of the 28th International Conference on Computational Linguistics","author":"Niyongabo","year":"2020"},{"issue":"2","key":"2021101221392144100_bib36","doi-asserted-by":"crossref","first-page":"167","DOI":"10.17533\/udea.ikala.10748","article-title":"Grammaticalization in Nigerian Pidgin","volume":"17","author":"Mensah","year":"2012","journal-title":"\u00cdkala, revista de lenguaje y cultura"},{"issue":"12","key":"2021101221392144100_bib37","article-title":"Perspectives and problems of codifying Nigerian Pidgin English orthography","volume":"3","author":"Ojarikre","year":"2013","journal-title":"Perspectives"},{"key":"2021101221392144100_bib38","article-title":"Serial verb construction in Nigerian Pidgin","author":"Onovbiona","year":"2012"},{"key":"2021101221392144100_bib39","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1007\/978-3-319-45510-5_24","article-title":"Predicting morphologically-complex unknown words in Igbo","volume-title":"Text, Speech, and Dialogue","author":"Onyenwe","year":"2016"},{"key":"2021101221392144100_bib40","doi-asserted-by":"crossref","first-page":"1946","DOI":"10.18653\/v1\/P17-1178","article-title":"Cross-lingual name tagging and 
linking for 282 languages","volume-title":"Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"Pan","year":"2017"},{"key":"2021101221392144100_bib41","doi-asserted-by":"crossref","first-page":"pages 1532\u2013pages 1543","DOI":"10.3115\/v1\/D14-1162","article-title":"GloVe: Global vectors for word representation","volume-title":"Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Pennington","year":"2014"},{"key":"2021101221392144100_bib42","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/2020.emnlp-main.617","article-title":"MAD-X: An Adapter- based Framework for Multi-task Cross-lingual Transfer","volume-title":"Proceedings of EMNLP 2020","author":"Pfeiffer","year":"2020"},{"key":"2021101221392144100_bib43","article-title":"Unks everywhere: Adapting multilingual language models to new scripts","author":"Pfeiffer","year":"2020","journal-title":"arXiv preprint arXiv:2012.15562"},{"key":"2021101221392144100_bib44","doi-asserted-by":"crossref","first-page":"147","DOI":"10.3115\/1596374.1596399","article-title":"Design challenges and misconceptions in named entity recognition","volume-title":"Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL-2009)","author":"Ratinov","year":"2009"},{"key":"2021101221392144100_bib45","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/D19-1410","article-title":"Sentence-bert: Sentence embeddings using Siamese bert-networks","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing","author":"Reimers","year":"2019"},{"key":"2021101221392144100_bib46","doi-asserted-by":"crossref","first-page":"8118","DOI":"10.18653\/v1\/2020.acl-main.722","article-title":"Soft gazetteers for low-resource named entity recognition","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational 
Linguistics","author":"Rijhwani","year":"2020"},{"key":"2021101221392144100_bib47","article-title":"Introduction to the conll-2003 shared task: Language-independent named entity recognition","volume-title":"Proceedings of CoNLL 2003","author":"Sang","year":"2003"},{"key":"2021101221392144100_bib48","article-title":"Proceedings of the IJCNLP-08 workshop on named entity recognition for south and south east Asian languages","author":"Sangal","year":"2008"},{"key":"2021101221392144100_bib49","doi-asserted-by":"publisher","first-page":"469","DOI":"10.1162\/COLI_a_00178","article-title":"A survey of Arabic named entity recognition and classification","volume":"40","author":"Shaalan","year":"2014","journal-title":"Computational Linguistics"},{"key":"2021101221392144100_bib50","first-page":"3273","article-title":"LORELEI language packs: Data, tools, and resources for technology development in low resource languages","volume-title":"Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC\u201916)","author":"Strassel","year":"2016"},{"key":"2021101221392144100_bib51","first-page":"pages 2214\u2013pages 2218","article-title":"Parallel data, tools and interfaces in OPUS","volume-title":"Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC\u201912)","author":"Tiedemann","year":"2012"},{"key":"2021101221392144100_bib52","doi-asserted-by":"crossref","DOI":"10.3115\/1118853.1118877","article-title":"Introduction to the CoNLL-2002 shared task: Language- independent named entity recognition","volume-title":"COLING-02: The 6th Conference on Natural Language Learning 2002 (CoNLL-2002)","author":"Tjong Kim Sang","year":"2002"},{"key":"2021101221392144100_bib53","doi-asserted-by":"crossref","first-page":"142","DOI":"10.3115\/1119176.1119195","article-title":"Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition","volume-title":"Proceedings of the Seventh Conference on 
Natural Language Learning at HLT-NAACL 2003","author":"Tjong Kim Sang","year":"2003"},{"key":"2021101221392144100_bib54","article-title":"Huggingface\u2019s transformers: State-of-the-art natural language processing","author":"Wolf","year":"2019","journal-title":"ArXiv"},{"key":"2021101221392144100_bib55","first-page":"2145","article-title":"A survey on recent advances in named entity recognition from deep learning models","volume-title":"Proceedings of the 27th International Conference on Computational Linguistics","author":"Yadav","year":"2018"},{"key":"2021101221392144100_bib56","doi-asserted-by":"crossref","first-page":"6442","DOI":"10.18653\/v1\/2020.emnlp-main.523","article-title":"LUKE: Deep contextualized entity representations with entity-aware self-attention","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Yamada","year":"2020"}],"container-title":["Transactions of the Association for Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/direct.mit.edu\/tacl\/article-pdf\/doi\/10.1162\/tacl_a_00416\/1966201\/tacl_a_00416.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/direct.mit.edu\/tacl\/article-pdf\/doi\/10.1162\/tacl_a_00416\/1966201\/tacl_a_00416.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,11]],"date-time":"2023-11-11T09:01:49Z","timestamp":1699693309000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/tacl\/article\/doi\/10.1162\/tacl_a_00416\/107614\/MasakhaNER-Named-Entity-Recognition-for-African"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021]]},"references-count":56,"URL":"https:\/\/doi.org\/10.1162\/tacl_a_00416","relation":{},"ISSN":["2307-387X"],"issn-type":[{"value":"2307-387X","type":"electronic"}],"subject":[],"published-other":{"date-p
arts":[[2021]]},"published":{"date-parts":[[2021]]}}}