{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:24:01Z","timestamp":1750220641813,"version":"3.41.0"},"reference-count":29,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2020,6,7]],"date-time":"2020-06-07T00:00:00Z","timestamp":1591488000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2020,7,31]]},"abstract":"<jats:p>Named Entity Recognition (NER) is a basic prerequisite of using Natural Language Processing (NLP) for information retrieval. Arabic NER is especially challenging as the language is morphologically rich and has short vowels with no capitalisation convention. This article presents a novel rule-based approach that uses linguistic grammar-based techniques to extract Arabic composite names from Arabic text. Our approach uniquely exploits the genitive Arabic grammar rules; in particular, the rules regarding the identification of definite nouns (\u0645\u0639\u0631\u0641\u0629) and indefinite nouns (\u0646\u0643\u0631\u0629) to support the process of extracting composite names. Based on domain knowledge and Arabic Genitive Rules (AGR), the developed approach formalises a set of syntactical rules and linguistic patterns that initially use genitive patterns to classify definiteness within phrases and then extracts proper composite names from the unstructured text. The developed novel approach does not place any constraints on the length of the Arabic composite name and our initial experimentation demonstrated high recall and precision results when the NER algorithm was applied to a financial domain corpus.<\/jats:p>","DOI":"10.1145\/3382187","type":"journal-article","created":{"date-parts":[[2020,6,7]],"date-time":"2020-06-07T22:05:01Z","timestamp":1591567501000},"page":"1-16","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Extracting Arabic Composite Names Using Genitive Principles of Arabic Grammar"],"prefix":"10.1145","volume":"19","author":[{"given":"Hussein","family":"Khalil","sequence":"first","affiliation":[{"name":"School of Science 8 Technology, Nottingham Trent University"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8781-2658","authenticated-orcid":false,"given":"Taha","family":"Osman","sequence":"additional","affiliation":[{"name":"School of Science 8 Technology, Nottingham Trent University"}]},{"given":"Mohammed","family":"Miltan","sequence":"additional","affiliation":[{"name":"Arabic Department, Faculty of Arts, Misurata University"}]}],"member":"320","published-online":{"date-parts":[[2020,6,7]]},"reference":[{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.3390\/fi10120123"},{"key":"e_1_2_1_3_1","doi-asserted-by":"crossref","unstructured":"M. Alruily A. Ayesh and H. Zedan. 2014. Crime profiling for the Arabic language using computational linguistic techniques. Information Processing 8 Management 50 315--341.  M. Alruily A. Ayesh and H. Zedan. 2014. Crime profiling for the Arabic language using computational linguistic techniques. Information Processing 8 Management 50 315--341.","DOI":"10.1016\/j.ipm.2013.09.001"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2009.2019927"},{"key":"e_1_2_1_5_1","doi-asserted-by":"crossref","unstructured":"C. Bizer T. Heath and T. Berners-Lee. 2011. Linked data: The story so far. In Semantic Services Interoperability and Web Applications: Emerging Concepts. IGI Global 205--227.  C. Bizer T. Heath and T. Berners-Lee. 2011. Linked data: The story so far. In Semantic Services Interoperability and Web Applications: Emerging Concepts. IGI Global 205--227.","DOI":"10.4018\/978-1-60960-593-3.ch008"},{"key":"e_1_2_1_6_1","volume-title":"Buckwalter Arabic morphological analyzer version 2.0, linguistic data consortium (LDC) catalog No LDC2004L02","author":"Buckwalter T.","year":"2019","unstructured":"T. Buckwalter . 2004. Buckwalter Arabic morphological analyzer version 2.0, linguistic data consortium (LDC) catalog No LDC2004L02 . 2019 . T. Buckwalter. 2004. Buckwalter Arabic morphological analyzer version 2.0, linguistic data consortium (LDC) catalog No LDC2004L02. 2019."},{"key":"e_1_2_1_7_1","first-page":"53","article-title":"A rule based persons names Arabic extraction system","volume":"11","author":"Elsebai A.","year":"2009","unstructured":"A. Elsebai , F. Meziane , and F. Z. Belkredim . 2009 . A rule based persons names Arabic extraction system . Communications of the IBIMA 11 , 53 -- 59 . A. Elsebai, F. Meziane, and F. Z. Belkredim. 2009. A rule based persons names Arabic extraction system. Communications of the IBIMA 11, 53--59.","journal-title":"Communications of the IBIMA"},{"volume-title":"Proceedings of the 23rd International Conference on Computational Linguistics. Association for Computational Linguistics, 394--402","author":"Green S.","key":"e_1_2_1_8_1","unstructured":"S. Green and C. D. Manning . 2010. Better Arabic parsing: Baselines, evaluations, and analysis . In Proceedings of the 23rd International Conference on Computational Linguistics. Association for Computational Linguistics, 394--402 . S. Green and C. D. Manning. 2010. Better Arabic parsing: Baselines, evaluations, and analysis. In Proceedings of the 23rd International Conference on Computational Linguistics. Association for Computational Linguistics, 394--402."},{"volume-title":"Proceedings of the International Conference on Applied Computing (IADIS). 23--27","author":"Harmain H. M.","key":"e_1_2_1_9_1","unstructured":"H. M. Harmain , H. El Khatib , and A. Lakas . 2004. Arabic text mining . In Proceedings of the International Conference on Applied Computing (IADIS). 23--27 . H. M. Harmain, H. El Khatib, and A. Lakas. 2004. Arabic text mining. In Proceedings of the International Conference on Applied Computing (IADIS). 23--27."},{"volume-title":"Theory of language and information: A mathematical approach","author":"Harris Z.","key":"e_1_2_1_10_1","unstructured":"Z. Harris . 1991. Theory of language and information: A mathematical approach . Oxford University Press UK. Z. Harris. 1991. Theory of language and information: A mathematical approach. Oxford University Press UK."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.3923\/itj.2005.32.37"},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","unstructured":"H. Khalil and T. Osman. 2014. Challenges in information retrieval from unstructured Arabic data. In UKSim. 456--461.  H. Khalil and T. Osman. 2014. Challenges in information retrieval from unstructured Arabic data. In UKSim. 456--461.","DOI":"10.1109\/UKSim.2014.115"},{"key":"e_1_2_1_13_1","unstructured":"Maknaz. 2018. Maknaz - Expanded Arabic Thesaurus. Retrieved from http:\/\/maknaz.org\/.  Maknaz. 2018. Maknaz - Expanded Arabic Thesaurus. Retrieved from http:\/\/maknaz.org\/."},{"key":"e_1_2_1_14_1","doi-asserted-by":"crossref","unstructured":"N. Omar and Q. Al-tashi. 2018. Arabic nested noun compound extraction based on linguistic features and statistical measures. GEMA Online\u00ae Journal of Language Studies 18.  N. Omar and Q. Al-tashi. 2018. Arabic nested noun compound extraction based on linguistic features and statistical measures. GEMA Online\u00ae Journal of Language Studies 18.","DOI":"10.17576\/gema-2018-1802-07"},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of COLING","author":"Oudah M.","year":"2012","unstructured":"M. Oudah and K. Shaalan . 2012. A pipeline Arabic named entity recognition using a hybrid approach . In Proceedings of COLING 2012 . 2159--2176. M. Oudah and K. Shaalan. 2012. A pipeline Arabic named entity recognition using a hybrid approach. In Proceedings of COLING 2012. 2159--2176."},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of the 2nd Student Research Workshop associated with RANLP","author":"Rabiee H. S.","year":"2011","unstructured":"H. S. Rabiee . 2011 . Adapting standard open-source resources to tagging a morphologically rich language: A case study with Arabic . In Proceedings of the 2nd Student Research Workshop associated with RANLP 2011. 127--132. H. S. Rabiee. 2011. Adapting standard open-source resources to tagging a morphologically rich language: A case study with Arabic. In Proceedings of the 2nd Student Research Workshop associated with RANLP 2011. 127--132."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2016.2607201"},{"key":"e_1_2_1_18_1","doi-asserted-by":"crossref","unstructured":"M. Rodrigues and A. Teixeira. 2015. Advanced Applications of Natural Language Processing for Performing Information Extraction. Springer.  M. Rodrigues and A. Teixeira. 2015. Advanced Applications of Natural Language Processing for Performing Information Extraction. Springer.","DOI":"10.1007\/978-3-319-15563-0"},{"volume-title":"Proceedings of the 6th ArchEng International Symposiums, EEECS.","author":"Saad M. K.","key":"e_1_2_1_19_1","unstructured":"M. K. Saad and W. Ashour . 2010. OSAC: Open source Arabic corpora . In Proceedings of the 6th ArchEng International Symposiums, EEECS. M. K. Saad and W. Ashour. 2010. OSAC: Open source Arabic corpora. In Proceedings of the 6th ArchEng International Symposiums, EEECS."},{"key":"e_1_2_1_20_1","volume-title":"RANER: RDI framework for Arabic named entity recognition. International Journal of Engineering 8 Technology 8, 1.11","author":"Sayed A. M.","year":"2019","unstructured":"A. M. Sayed , S. Abdou , M. Rashwan , and H. Al-Barhamtoshy . 2019 . RANER: RDI framework for Arabic named entity recognition. International Journal of Engineering 8 Technology 8, 1.11 (2009), 161--164. A. M. Sayed, S. Abdou, M. Rashwan, and H. Al-Barhamtoshy. 2019. RANER: RDI framework for Arabic named entity recognition. International Journal of Engineering 8 Technology 8, 1.11 (2009), 161--164."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.5555\/1572678.1572692"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00178"},{"key":"e_1_2_1_23_1","doi-asserted-by":"crossref","unstructured":"K. Shaalan and H. Raza. 2008. Arabic named entity recognition from diverse text types. In Advances in Natural Language Processing. Springer 440--451.  K. Shaalan and H. Raza. 2008. Arabic named entity recognition from diverse text types. In Advances in Natural Language Processing. Springer 440--451.","DOI":"10.1007\/978-3-540-85287-2_42"},{"volume-title":"Proceedings of the 2007 Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources. Association for Computational Linguistics, 17--24","author":"Shaalan K.","key":"e_1_2_1_24_1","unstructured":"K. Shaalan and H. Raza . 2007. Person name entity recognition for Arabic . In Proceedings of the 2007 Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources. Association for Computational Linguistics, 17--24 . K. Shaalan and H. Raza. 2007. Person name entity recognition for Arabic. In Proceedings of the 2007 Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources. Association for Computational Linguistics, 17--24."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/1067114.1067116"},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the International Multiconference on Computer Science and Information Technology","author":"Traboulsi H.","year":"2009","unstructured":"H. Traboulsi . 2009 . Arabic named entity extraction: A local grammar-based approach . In Proceedings of the International Multiconference on Computer Science and Information Technology , 2009 (IMCSIT'09). IEEE, 139--143. H. Traboulsi. 2009. Arabic named entity extraction: A local grammar-based approach. In Proceedings of the International Multiconference on Computer Science and Information Technology, 2009 (IMCSIT'09). IEEE, 139--143."},{"volume-title":"Proceedings of the 27th International Conference on Computational Linguistics. 2145--2158","author":"Yadav V.","key":"e_1_2_1_27_1","unstructured":"V. Yadav and S. Bethard . 2018. A survey on recent advances in named entity recognition from deep learning models . In Proceedings of the 27th International Conference on Computational Linguistics. 2145--2158 . V. Yadav and S. Bethard. 2018. A survey on recent advances in named entity recognition from deep learning models. In Proceedings of the 27th International Conference on Computational Linguistics. 2145--2158."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2090176.2090178"},{"volume-title":"Proceedings of the 2010 International Conference on Machine and Web Intelligence (ICMWI). IEEE 473--475","author":"Zaidi S.","key":"e_1_2_1_29_1","unstructured":"S. Zaidi , M. T. Laskri , and A. Abdelali . 2010. Arabic collocations extraction using gate . In Proceedings of the 2010 International Conference on Machine and Web Intelligence (ICMWI). IEEE 473--475 . S. Zaidi, M. T. Laskri, and A. Abdelali. 2010. Arabic collocations extraction using gate. In Proceedings of the 2010 International Conference on Machine and Web Intelligence (ICMWI). IEEE 473--475."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.13053\/rcs-70-1-7"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3382187","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3382187","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:02:08Z","timestamp":1750197728000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3382187"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,7]]},"references-count":29,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2020,7,31]]}},"alternative-id":["10.1145\/3382187"],"URL":"https:\/\/doi.org\/10.1145\/3382187","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"type":"print","value":"2375-4699"},{"type":"electronic","value":"2375-4702"}],"subject":[],"published":{"date-parts":[[2020,6,7]]},"assertion":[{"value":"2018-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-01-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-06-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}