{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T06:53:43Z","timestamp":1777704823075,"version":"3.51.4"},"reference-count":28,"publisher":"SAGE Publications","issue":"2","license":[{"start":{"date-parts":[[2020,7,18]],"date-time":"2020-07-18T00:00:00Z","timestamp":1595030400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2020,8,31]]},"abstract":"<jats:p>This work presents an experimental study on the task of Named Entity Recognition (NER) for a narrow domain in Spanish language. This study considers two approaches commonly used in this kind of problem, namely, a Conditional Random Fields (CRF) model and Recurrent Neural Network (RNN). For the latter, we employed a bidirectional Long Short-Term Memory with ELMO\u2019s pre-trained word embeddings for Spanish. The comparison between the probabilistic model and the deep learning model was carried out in two collections, the Spanish dataset from CoNLL-2002 considering four classes under the IOB tagging schema, and a Mexican Spanish news dataset with seventeen classes under IOBES schema. The paper presents an analysis about the scalability, robustness, and common errors of both models. This analysis indicates in general that the BiLSTM-ELMo model is more suitable than the CRF model when there is \u201cenough\u201d training data, and also that it is more scalable, as its performance was not significantly affected in the incremental experiments (by adding one class at a time). On the other hand, results indicate that the CRF model is more adequate for scenarios having small training datasets and many classes.<\/jats:p>","DOI":"10.3233\/jifs-179868","type":"journal-article","created":{"date-parts":[[2020,7,28]],"date-time":"2020-07-28T15:02:26Z","timestamp":1595948546000},"page":"2015-2025","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":1,"title":["Probabilistic vs deep learning based approaches for narrow domain NER in Spanish"],"prefix":"10.1177","volume":"39","author":[{"given":"Orlando","family":"Ramos-Flores","sequence":"first","affiliation":[{"name":"Facultad de Ciencias de la Computaci\u00f3n, Benem\u00e9rita Universidad Aut\u00f3noma de Puebla, Puebla, M\u00e9xico"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Pinto","sequence":"additional","affiliation":[{"name":"Facultad de Ciencias de la Computaci\u00f3n, Benem\u00e9rita Universidad Aut\u00f3noma de Puebla, Puebla, M\u00e9xico"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Manuel","family":"Montes-y-G\u00f3mez","sequence":"additional","affiliation":[{"name":"Coordinaci\u00f3n de Ciencias Computacionales, Instituto Nacional de Astrof\u00edsica, \u00d3ptica y Electr\u00f3nica, Santa Mar\u00eda Tonantzintla, Puebla, M\u00e9xico"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andr\u00e9s","family":"V\u00e1zquez","sequence":"additional","affiliation":[{"name":"Facultad de Ciencias de la Computaci\u00f3n, Benem\u00e9rita Universidad Aut\u00f3noma de Puebla, Puebla, M\u00e9xico"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2020,7,18]]},"reference":[{"key":"e_1_3_1_2_2","unstructured":"AbachaA.B. and ZweigenbaumP. Medical entity recognition: A comparison of semantic and statistical methods In Proceedings of BioNLP 2011 Workshop BioNLP \u201911 pages 56\u201364 (2011). ISBN 978-1-932432-91-6."},{"key":"e_1_3_1_3_2","doi-asserted-by":"crossref","unstructured":"AsaharaM. and MatsumotoY. Japanese named entity extraction with redundant morphological analysis In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology volume 1 of NAACL \u201903 pages 8\u201315 (2003).","DOI":"10.3115\/1073445.1073447"},{"key":"e_1_3_1_4_2","doi-asserted-by":"crossref","unstructured":"CarrerasX. M\u00e0rquezL. and Padr\u00f3L. Named entity extraction using adaboost In Proceedings of the 6th Conference on Natural Language Learning volume 20 of COLING-02 pages 1\u20134 Stroudsburg PA USA (2002). Association for Computational Linguistics.","DOI":"10.3115\/1118853.1118857"},{"key":"e_1_3_1_5_2","unstructured":"CheW. LiuY. WangY. ZhengB. and LiuT. Towards better UD parsing: Deep contextualized word embeddings ensemble and treebank concatenation In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies pages 55\u201364 Brussels Belgium October (2018). Association for Computational Linguistics."},{"key":"e_1_3_1_6_2","doi-asserted-by":"crossref","unstructured":"ChesneyS. JacquetG. SteinbergerR. and PiskorskiJ. Multiword entity classification in a highly multilingual environment In Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017) pages 11\u201320 (2017).","DOI":"10.18653\/v1\/W17-1702"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00104"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2013.03.002"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cosrev.2018.06.001"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2005.06.042"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","unstructured":"GravesA. JaitlyN. and MohamedA. Hybrid speech recognition with deep bidirectional lstm In 2013 IEEE Workshop on Automatic Speech Recognition and Understanding pages 273\u2013278 (2013). doi:10.1109\/ASRU.2013.6707742","DOI":"10.1109\/ASRU.2013.6707742"},{"key":"e_1_3_1_12_2","doi-asserted-by":"crossref","unstructured":"GrishmanR. and SundheimB. Message understanding conference-6: A brief history In Proceedings of the 16th Conference on Computational Linguistics volume 1 of COLING \u201996 pages 466\u2013471 Stroudsburg PA USA 1996. Association for Computational Linguistics.","DOI":"10.3115\/992628.992709"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_1_14_2","unstructured":"HuangZ. XuW. and YuK. Bidirectional lstm-crf models for sequence tagging CoRR abs\/1508.01991 (2015)."},{"key":"e_1_3_1_15_2","doi-asserted-by":"crossref","unstructured":"KudoT. and MatsumotoY. Chunking with support vector machines In Proceedings of the Second Meeting of the North American Chapter of the Association for Computational Linguistics on Language Technologies NAACL \u201901 pages 1\u20138 (2001).","DOI":"10.3115\/1073336.1073361"},{"key":"e_1_3_1_16_2","unstructured":"LaffertyJ.D. McCallumA. and PereiraF.C.N. Conditional random fields: Probabilistic models for segmenting and labeling sequence data In Proceedings of the Eighteenth International Conference on Machine Learning ICML \u201901 pages 282\u2013289 San Francisco CA USA 2001. Morgan Kaufmann Publishers Inc. ISBN 1-55860-778-1."},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1989.1.4.541"},{"key":"e_1_3_1_18_2","doi-asserted-by":"crossref","unstructured":"LeeC. HwangY.-G. OhH.-J. LimS. HeoJ. LeeC.-H. KimH.-J. WangJ.-H. and JangM.-G. Fine-grained named entity recognition using conditional random fields for question answering In H.T. Ng M.-K. Leong M.-Y. Kan and D. Ji editors Information Retrieval Technology pages 581\u2013587 Berlin Heidelberg (2006). Springer Berlin Heidelberg. ISBN 978-3-540-46237-8.","DOI":"10.1007\/11880592_49"},{"key":"e_1_3_1_19_2","doi-asserted-by":"crossref","unstructured":"MaX. and HovyE.H. End-to-end sequence labeling via bidirectional lstm-cnns-crf CoRR abs\/1603.01354 2016.","DOI":"10.18653\/v1\/P16-1101"},{"key":"e_1_3_1_20_2","unstructured":"McCallumA. FreitagD. and PereiraF.C.N. Maximum entropy markov models for information extraction and segmentation In Proceedings of the Seventeenth International Conference on Machine Learning ICML \u201900 pages 591\u2013598 (2000). ISBN 1-55860-707-2."},{"key":"e_1_3_1_21_2","doi-asserted-by":"crossref","unstructured":"\u00d6zg\u00fcrA. \u00d6zg\u00fcrL. and G\u00fcng\u00f6rT. Text categorization with class-based and corpus-based keyword selection In Computer and Information Sciences - ISCIS 2005 pages 606\u2013615 Berlin Heidelberg (2005). Springer Berlin Heidelberg. ISBN 978-3-540-32085-2.","DOI":"10.1007\/11569596_63"},{"key":"e_1_3_1_22_2","unstructured":"PetersM.E. NeumannM. IyyerM. GardnerM. ClarkC. LeeK. and ZettlemoyerL. Deep contextualized word representations In Proceedings of NAACL-HLT (2018)."},{"key":"e_1_3_1_23_2","doi-asserted-by":"crossref","unstructured":"SasadaT. MoriS. KawaharaT. and YamakataY. Named entity recognizer trainable from partially annotated data In K. Hasida and A. Purwarianti editors Computational Linguistics pages 148\u2013160 Singapore (2016). Springer Singapore. ISBN 978-981-10-0515-2.","DOI":"10.1007\/978-981-10-0515-2_11"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/78.650093"},{"key":"e_1_3_1_25_2","doi-asserted-by":"crossref","unstructured":"SchwarzenbergR. HennigL. and HemsenH. In-memory distributed training of linear-chain conditional random fields with an application to fine-grained named entity recognition In G. Rehm and T. Declerck editors Language Technologies for the Challenges of the Digital Age pages 155\u2013167 Cham (2018). Springer International Publishing. ISBN 978-3-319-73706-5.","DOI":"10.1007\/978-3-319-73706-5_13"},{"key":"e_1_3_1_26_2","doi-asserted-by":"crossref","unstructured":"ShaF. and PereiraF. Shallow parsing with conditional random fields. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1 NAACL\u201903 pages 134\u2013141 (2003).","DOI":"10.3115\/1073445.1073473"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1561\/2200000013"},{"key":"e_1_3_1_28_2","doi-asserted-by":"crossref","unstructured":"Todorovi\u0107B.T. Ran\u010di\u0107S.R. and Mulali\u0107E.H. Context Hidden Markov Model for Named Entity Recognition pages 447\u2013460. Springer NewYork NewYork NY (2011). ISBN 978-1-4419-6594-3.","DOI":"10.1007\/978-1-4419-6594-3_30"},{"key":"e_1_3_1_29_2","doi-asserted-by":"crossref","unstructured":"UchimotoK. MaQ. MurataM. OzakuH. and IsaharaH. Named entity extraction based on a maximum entropy model and transformation rules In Proceedings of the 38th Annual Meeting on Association for Computational Linguistics ACL\u201900 pages 326\u2013335 (2000).","DOI":"10.3115\/1075218.1075260"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179868","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-179868","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179868","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:42:06Z","timestamp":1777455726000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-179868"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,18]]},"references-count":28,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,8,31]]}},"alternative-id":["10.3233\/JIFS-179868"],"URL":"https:\/\/doi.org\/10.3233\/jifs-179868","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,7,18]]}}}