{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,15]],"date-time":"2026-05-15T10:41:51Z","timestamp":1778841711440,"version":"3.51.4"},"reference-count":101,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2010,12,1]],"date-time":"2010-12-01T00:00:00Z","timestamp":1291161600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Transactions on Asian Language Information Processing"],"published-print":{"date-parts":[[2010,12]]},"abstract":"<jats:p>There has been an increase in the amount of multilingual text on the Internet due to the proliferation of news sources and blogs. The Urdu language, in particular, has experienced explosive growth on the Web. Text mining for information discovery, which includes tasks such as identifying topics, relationships and events, and sentiment analysis, requires sophisticated natural language processing (NLP). NLP systems begin with modules such as word segmentation, part-of-speech tagging, and morphological analysis and progress to modules such as shallow parsing and named entity tagging. While there have been considerable advances in developing such comprehensive NLP systems for English, the work for Urdu is still in its infancy. The tasks of interest in Urdu NLP includes analyzing data sources such as blogs and comments to news articles to provide insight into social and human behavior. All of this requires a robust NLP system. The objective of this work is to develop an NLP infrastructure for Urdu that is customizable and capable of providing basic analysis on which more advanced information extraction tools can be built. 
This system assimilates resources from various online sources to facilitate improved named entity tagging and Urdu-to-English transliteration. The annotated data required to train the learning models used here is acquired by standardizing the currently limited resources available for Urdu. Techniques such as bootstrap learning and resource sharing from a syntactically similar language, Hindi, are explored to augment the available annotated Urdu data. Each of the new Urdu text processing modules has been integrated into a general text-mining platform. The evaluations performed demonstrate that the accuracies have either met or exceeded the state of the art.<\/jats:p>","DOI":"10.1145\/1838751.1838754","type":"journal-article","created":{"date-parts":[[2010,12,20]],"date-time":"2010-12-20T15:55:04Z","timestamp":1292860504000},"page":"1-43","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":33,"title":["An Information-Extraction System for Urdu---A Resource-Poor Language"],"prefix":"10.1145","volume":"9","author":[{"given":"Smruthi","family":"Mukund","sequence":"first","affiliation":[{"name":"State University of New York at Buffalo"}]},{"given":"Rohini","family":"Srihari","sequence":"additional","affiliation":[{"name":"State University of New York at Buffalo"}]},{"given":"Erik","family":"Peterson","sequence":"additional","affiliation":[{"name":"Janya, Inc."}]}],"member":"320","published-online":{"date-parts":[[2010,12]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Ace. 2005. Project specifications. ACE data overview. Ace . 2005. Project specifications. ACE data overview."},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the Conference of the European Chapter of ACL Workshop on New Text-Wikis and Blogs and Other Dynamic Text Sources (ACL\u201906)","author":"Adafre S. F.","unstructured":"Adafre , S. F. and Rijke , M . 2006. Finding similar sentences across multiple languages in Wikipedia . 
In Proceedings of the Conference of the European Chapter of ACL Workshop on New Text-Wikis and Blogs and Other Dynamic Text Sources (ACL\u201906) . Adafre, S. F. and Rijke, M. 2006. Finding similar sentences across multiple languages in Wikipedia. In Proceedings of the Conference of the European Chapter of ACL Workshop on New Text-Wikis and Blogs and Other Dynamic Text Sources (ACL\u201906)."},{"key":"e_1_2_1_3_1","first-page":"445","article-title":"Improving part-of-speech tagging accuracy for Croatian by morphological analysis","volume":"32","author":"Agi","year":"2008","unstructured":"Agi , \u017d., Dovedan , Z. , and Tadi , M. 2008 . Improving part-of-speech tagging accuracy for Croatian by morphological analysis . Informatica 32 , 445 -- 451 . Agi, \u017d., Dovedan, Z., and Tadi, M. 2008. Improving part-of-speech tagging accuracy for Croatian by morphological analysis. Informatica 32, 445--451.","journal-title":"Informatica"},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of World Academy of Science, Engineering and Technology (WASET\u201907)","volume":"26","author":"Ahmad Z.","unstructured":"Ahmad , Z. , Orakzai , J. K. , Shamsher , I. , and Adnan , A . 2007. Urdu nastaleeq character recognition . In Proceedings of World Academy of Science, Engineering and Technology (WASET\u201907) . Vol. 26 . Ahmad, Z., Orakzai, J. K., Shamsher, I., and Adnan, A. 2007. Urdu nastaleeq character recognition. In Proceedings of World Academy of Science, Engineering and Technology (WASET\u201907). Vol. 26."},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the Conference of Language and Technology (CLT\u201909)","author":"Ahmed T.","year":"2009","unstructured":"Ahmed , T. 2009 . Roman to Urdu transliteration using word list . In Proceedings of the Conference of Language and Technology (CLT\u201909) . Ahmed, T. 2009. Roman to Urdu transliteration using word list. 
In Proceedings of the Conference of Language and Technology (CLT\u201909)."},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the Text Retrieval Conference (TREC\u201904)","author":"Ahn D.","unstructured":"Ahn , D. , Jijkoun , V. , Mishne , G. , Muller , K. , Rijke , M. , and Schlobach , S . 2004. Using Wikipedia at the TREC QA track . In Proceedings of the Text Retrieval Conference (TREC\u201904) . Ahn, D., Jijkoun, V., Mishne, G., Muller, K., Rijke, M., and Schlobach, S. 2004. Using Wikipedia at the TREC QA track. In Proceedings of the Text Retrieval Conference (TREC\u201904)."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118637.1118642"},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the Conference of Language and Technology (CLT\u201909)","author":"Ali A. R.","unstructured":"Ali , A. R. and Ijaz , M . 2009. English to Urdu transliteration system . In Proceedings of the Conference of Language and Technology (CLT\u201909) . Ali, A. R. and Ijaz, M. 2009. English to Urdu transliteration system. In Proceedings of the Conference of Language and Technology (CLT\u201909)."},{"key":"e_1_2_1_9_1","unstructured":"Aroonmanakul W. 2002. Collocation and Thai word segmentation. In Proceedings of International Workshop on Spanish Language Processing and Language Technologies and International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (SNLP-COCOSDA\u201907). Aroonmanakul W. 2002. Collocation and Thai word segmentation. In Proceedings of International Workshop on Spanish Language Processing and Language Technologies and International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (SNLP-COCOSDA\u201907) ."},{"key":"e_1_2_1_10_1","volume-title":"Proceedings of the Workshop on Shallow Parsing for South Asian Languages (IJCAI\u201907)","author":"Avinesh P.","unstructured":"Avinesh , P. and Karthik , G . 2007. 
Part of speech tagging and chunking using conditional random fields and transformation-based learning . In Proceedings of the Workshop on Shallow Parsing for South Asian Languages (IJCAI\u201907) . 21--24. Avinesh, P. and Karthik, G. 2007. Part of speech tagging and chunking using conditional random fields and transformation-based learning. In Proceedings of the Workshop on Shallow Parsing for South Asian Languages (IJCAI\u201907). 21--24."},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of the NLPAI Machine Learning Competition.","author":"Awasthi P.","unstructured":"Awasthi , P. , Rao , D. , and Ravindran , B . 2006. Part of speech tagging and chunking with HMM and CRF . In Proceedings of the NLPAI Machine Learning Competition. Awasthi, P., Rao, D., and Ravindran, B. 2006. Part of speech tagging and chunking with HMM and CRF. In Proceedings of the NLPAI Machine Learning Competition."},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of Workshop on Machine Translation and other Language Technology Tools (EAMT\/EACL\u201903)","author":"Babych B.","unstructured":"Babych , B. and Hartley , A . 2003. Improving machine translation quality with automatic named entity recognition . In Proceedings of Workshop on Machine Translation and other Language Technology Tools (EAMT\/EACL\u201903) . 1--8. Babych, B. and Hartley, A. 2003. Improving machine translation quality with automatic named entity recognition. In Proceedings of Workshop on Machine Translation and other Language Technology Tools (EAMT\/EACL\u201903). 1--8."},{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL\u201908)","author":"Baidsy F.","unstructured":"Baidsy , F. , Hirschberg , J. , and Filatova , E . 2008. An unsupervised approach to biography production using Wikipedia . In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL\u201908) . Baidsy, F., Hirschberg, J., and Filatova, E. 
2008. An unsupervised approach to biography production using Wikipedia. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL\u201908)."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118759.1118761"},{"key":"e_1_2_1_15_1","volume-title":"AnnCorra: Annotating corpora guidelines for POS and chunk annotation for Indian language. Tech. rep","author":"Bharati A.","unstructured":"Bharati , A. , Sharma , D. M. , Bai , L. , Sangal , R. , and IIIT, H. 2006. AnnCorra: Annotating corpora guidelines for POS and chunk annotation for Indian language. Tech. rep . Language Technologies Research Centre , IIIT , Hyderabad. Bharati, A., Sharma, D. M., Bai, L., Sangal, R., and IIIT, H. 2006. AnnCorra: Annotating corpora guidelines for POS and chunk annotation for Indian language. Tech. rep. Language Technologies Research Centre, IIIT, Hyderabad."},{"key":"e_1_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Bhatt R. Narasimhan B. Palmer M. Rambow O. Sharma D. M. and Xia F. 2009. A multi-representational and multi-layered Treebank for Hindi\/Urdu. In Proceedings of the 3rd Linguistic Annotation Workshop (The LAW III) in conjunction with the Association for Computational Linguistics\/International Joint Conference on Natural Language Processing (ACL\/IJCNLP\u201909). Bhatt R. Narasimhan B. Palmer M. Rambow O. Sharma D. M. and Xia F. 2009. A multi-representational and multi-layered Treebank for Hindi\/Urdu. 
In Proceedings of the 3rd Linguistic Annotation Workshop (The LAW III) in conjunction with the Association for Computational Linguistics\/International Joint Conference on Natural Language Processing (ACL\/IJCNLP\u201909) .","DOI":"10.3115\/1698381.1698417"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007558221122"},{"key":"e_1_2_1_18_1","volume-title":"Proceedings of the Finite-State Methods and Natural Language Processing 6th International Workshop (FSMNLP\u201907)","author":"B\u00f6gel T.","unstructured":"B\u00f6gel , T. , Butt , M. , Hautli , A. , and Sulger , S . 2007. Developing finite state morphological analyzer for Hindi and Urdu . In Proceedings of the Finite-State Methods and Natural Language Processing 6th International Workshop (FSMNLP\u201907) . B\u00f6gel, T., Butt, M., Hautli, A., and Sulger, S. 2007. Developing finite state morphological analyzer for Hindi and Urdu. In Proceedings of the Finite-State Methods and Natural Language Processing 6th International Workshop (FSMNLP\u201907)."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.3115\/974147.974178"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.3115\/1075527.1075553"},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC\u201906)","author":"Buscaldi D.","unstructured":"Buscaldi , D. and Rosso , P . 2006. Mining knowledge from Wikipedia for the question answering task . In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC\u201906) . Buscaldi, D. and Rosso, P. 2006. Mining knowledge from Wikipedia for the question answering task. 
In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC\u201906)."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118759.1118762"},{"key":"e_1_2_1_24_1","volume-title":"Studies in Natural Language and Linguistic Theory Series","volume":"61","author":"Butt M.","unstructured":"Butt , M. and King , T. H . 2004. The status of case. In V. Dayal, and A. Mahajan, Eds. Clause Structure in South Asian Languages , Studies in Natural Language and Linguistic Theory Series , vol. 61 . Butt, M. and King, T. H. 2004. The status of case. In V. Dayal, and A. Mahajan, Eds. Clause Structure in South Asian Languages, Studies in Natural Language and Linguistic Theory Series, vol. 61."},{"key":"e_1_2_1_25_1","unstructured":"Center for Research in Urdu Language Processing (CRULP). 2007. Urdu Component Development Project Urdu Normalization Application v1.0. Center for Research in Urdu Language Processing (CRULP) . 2007. Urdu Component Development Project Urdu Normalization Application v1.0."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the 7th Workshop on Asian Language Resources (ACL-IJCNLP\u201909)","author":"Choi K. S.","unstructured":"Choi , K. S. , Isahara , H. , Kanzaki , K. , Kim , H. , Pak , S. M. , and Sun , M . 2009. Word segmentation standard in Chinese, Japanese, and Korean . In Proceedings of the 7th Workshop on Asian Language Resources (ACL-IJCNLP\u201909) . 179--186. Choi, K. S., Isahara, H., Kanzaki, K., Kim, H., Pak, S. M., and Sun, M. 2009. Word segmentation standard in Chinese, Japanese, and Korean. In Proceedings of the 7th Workshop on Asian Language Resources (ACL-IJCNLP\u201909). 179--186."},{"key":"e_1_2_1_27_1","volume-title":"Proceedings of the Conference on Natural Language Learning (CoNLL\u201999)","author":"Daelemans W.","unstructured":"Daelemans , W. , Buchholz , S. , and Veenstra , J . 1999. Memory-based shallow parsing . 
In Proceedings of the Conference on Natural Language Learning (CoNLL\u201999) . 53--60. Daelemans, W., Buchholz, S., and Veenstra, J. 1999. Memory-based shallow parsing. In Proceedings of the Conference on Natural Language Learning (CoNLL\u201999). 53--60."},{"key":"e_1_2_1_28_1","volume-title":"Proceedings of the Association of Computational Linguistics (ACL\u201907)","author":"Dandapat S.","unstructured":"Dandapat , S. , Sarkar , S. , and Basu , A . 2007. Automatic part-of-speech tagging for Bengali: An approach for morphologically rich languages in a poor resource scenario . In Proceedings of the Association of Computational Linguistics (ACL\u201907) . 221--224. Dandapat, S., Sarkar, S., and Basu, A. 2007. Automatic part-of-speech tagging for Bengali: An approach for morphologically rich languages in a poor resource scenario. In Proceedings of the Association of Computational Linguistics (ACL\u201907). 221--224."},{"key":"e_1_2_1_29_1","volume-title":"Typology of word and automatic word segmentation in Urdu text corpus","author":"Durrani N.","unstructured":"Durrani , N. 2007. Typology of word and automatic word segmentation in Urdu text corpus . National University of Computer and Emerging Sciences , Lahore, Pakistan . Durrani, N. 2007. Typology of word and automatic word segmentation in Urdu text corpus. National University of Computer and Emerging Sciences, Lahore, Pakistan."},{"key":"e_1_2_1_30_1","volume-title":"Proceedings of the 11th Annual Conference of Human Language Technology Conference\/North American Chapter of the Association for Computational Linguistics (HLT-NAACL\u201910)","author":"Durrani N.","unstructured":"Durrani , N. and Hussain , S . 2010. Urdu word segmentation . In Proceedings of the 11th Annual Conference of Human Language Technology Conference\/North American Chapter of the Association for Computational Linguistics (HLT-NAACL\u201910) . Durrani, N. and Hussain, S. 2010. Urdu word segmentation. 
In Proceedings of the 11th Annual Conference of Human Language Technology Conference\/North American Chapter of the Association for Computational Linguistics (HLT-NAACL\u201910)."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.5555\/1781034.1781108"},{"key":"e_1_2_1_32_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.33011\/lilt.v2i.1203","article-title":"A conditional random field approach for named entity recognition in Bengali and Hindi","volume":"2","author":"Ekbal A.","year":"2009","unstructured":"Ekbal , A. and Bandyopadhyay , S. 2009 . A conditional random field approach for named entity recognition in Bengali and Hindi . Linguist. Issues Lang. Technol. 2 , 1 . Ekbal, A. and Bandyopadhyay, S. 2009. A conditional random field approach for named entity recognition in Bengali and Hindi. Linguist. Issues Lang. Technol. 2, 1.","journal-title":"Linguist. Issues Lang. Technol."},{"key":"e_1_2_1_33_1","article-title":"Named entity recognition using support vector machine: A language independent approach. Int","author":"Ekbal A.","year":"2010","unstructured":"Ekbal , A. and Bandyopadhyay , S. 2010 . Named entity recognition using support vector machine: A language independent approach. Int . J. Elec. Comput. Syst. Eng. 4. Ekbal, A. and Bandyopadhyay, S. 2010. Named entity recognition using support vector machine: A language independent approach. Int. J. Elec. Comput. Syst. Eng. 4.","journal-title":"J. Elec. Comput. Syst. Eng. 4."},{"key":"e_1_2_1_34_1","volume-title":"Languages of the World","author":"Ethnologue","year":"2002","unstructured":"Ethnologue : Languages of the World , 14 th Ed. 2002 . http:\/\/www.ethnologue.com\/show_country.asp?name=Pakistan (accessed 3\/03). Ethnologue: Languages of the World, 14th Ed. 2002. 
http:\/\/www.ethnologue.com\/show_country.asp?name=Pakistan (accessed 3\/03).","edition":"14"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.5555\/1117822.1455626"},{"key":"e_1_2_1_36_1","volume-title":"Proceedings of the Workshop on Computational Lexicography and Multimedia Dictionaries (COMLEX\u201900)","author":"Farmakiotou D.","unstructured":"Farmakiotou , D. , Karkaletsis , V. , Koutsias , J. , Sigletos , G. , Spyropoulos , C. D. , and Stamatopoulos , P . 2000. Rule-based named entity recognition for Greek financial texts . In Proceedings of the Workshop on Computational Lexicography and Multimedia Dictionaries (COMLEX\u201900) . 75--78. Farmakiotou, D., Karkaletsis, V., Koutsias, J., Sigletos, G., Spyropoulos, C. D., and Stamatopoulos, P. 2000. Rule-based named entity recognition for Greek financial texts. In Proceedings of the Workshop on Computational Lexicography and Multimedia Dictionaries (COMLEX\u201900). 75--78."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1162\/089120105775299177"},{"key":"e_1_2_1_38_1","volume-title":"G., Sanchez-Martinez, F., Scalco, M. A.","author":"Garrido-Alenda A.","year":"2004","unstructured":"Garrido-Alenda , A. , Gilabert-Zarco , P. , P\u00e9rez-Ortiz , J. A. , Pertusa-Ib\u00e1\u00f1ez , A. , Ram rez-Sanchez , G., Sanchez-Martinez, F., Scalco, M. A. , and Forcada, M. L. 2004 . Shallow parsing for Portuguese-Spanish machine translation. In A. Branco, A. Mendes, and R. Ribeiro Eds., Language Technology for Portuguese: Shallow Processing Tools and Resources , 135--144. Garrido-Alenda, A., Gilabert-Zarco, P., P\u00e9rez-Ortiz, J. A., Pertusa-Ib\u00e1\u00f1ez, A., Ram rez-Sanchez, G., Sanchez-Martinez, F., Scalco, M. A., and Forcada, M. L. 2004. Shallow parsing for Portuguese-Spanish machine translation. In A. Branco, A. Mendes, and R. 
Ribeiro Eds., Language Technology for Portuguese: Shallow Processing Tools and Resources, 135--144."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.29403\/LI.6.1.7"},{"key":"e_1_2_1_40_1","unstructured":"Haq A. M. 1987. Amjuman-e-Taraqqi Urdu (Hindi). Haq A. M. 1987. Amjuman-e-Taraqqi Urdu (Hindi)."},{"key":"e_1_2_1_41_1","volume-title":"Developing a tagset for automated part-of-speech tagging in Urdu. Department of Linguistics and Modern English Language","author":"Hardie A.","unstructured":"Hardie , A. 2003. Developing a tagset for automated part-of-speech tagging in Urdu. Department of Linguistics and Modern English Language , University of Lancaster. Hardie, A. 2003. Developing a tagset for automated part-of-speech tagging in Urdu. Department of Linguistics and Modern English Language, University of Lancaster."},{"key":"e_1_2_1_42_1","volume-title":"Urdu morphology, orthography and lexicon extraction. Master\u2019s Thesis. Department of Computing Science","author":"Humayoun M.","unstructured":"Humayoun , M. 2006. Urdu morphology, orthography and lexicon extraction. Master\u2019s Thesis. Department of Computing Science , Chalmers University of Technology . Humayoun, M. 2006. Urdu morphology, orthography and lexicon extraction. Master\u2019s Thesis. Department of Computing Science, Chalmers University of Technology."},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the 2nd Workshop on Computational Approaches to Arabic Script-based Languages (CAASL\u201907)","author":"Humayoun M.","unstructured":"Humayoun , M. , Hammarstr\u00f6m , H. , and Ranta , A . 2007. Urdu morphology, orthography and lexicon extraction . In Proceedings of the 2nd Workshop on Computational Approaches to Arabic Script-based Languages (CAASL\u201907) . Humayoun, M., Hammarstr\u00f6m, H., and Ranta, A. 2007. Urdu morphology, orthography and lexicon extraction. 
In Proceedings of the 2nd Workshop on Computational Approaches to Arabic Script-based Languages (CAASL\u201907)."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.5555\/1621804.1621823"},{"key":"e_1_2_1_45_1","volume-title":"Proceedings of the 6th Workshop on Asian Language Resources (IJCNLP\u201908)","author":"Hussain S.","year":"2008","unstructured":"Hussain , S. 2008 . Resources for Urdu language processing . In Proceedings of the 6th Workshop on Asian Language Resources (IJCNLP\u201908) . Hussain, S. 2008. Resources for Urdu language processing. In Proceedings of the 6th Workshop on Asian Language Resources (IJCNLP\u201908)."},{"key":"e_1_2_1_46_1","volume-title":"Proceedings of IEEE International Multi-Topic Conference (INMIC\u201901)","author":"Hussain","unstructured":"Hussain and Afzal, M . 2001. Urdu computing standards: Urdu Zabta Takhti (UZT 1.01) . In Proceedings of IEEE International Multi-Topic Conference (INMIC\u201901) . Hussain and Afzal, M. 2001. Urdu computing standards: Urdu Zabta Takhti (UZT 1.01). In Proceedings of IEEE International Multi-Topic Conference (INMIC\u201901)."},{"key":"e_1_2_1_47_1","volume-title":"Proceedings of a National Policy Conference on Higher Education Act, Title VI and Fullbright-Hays Program.","author":"Janus L.","year":"1997","unstructured":"Janus , L. 1997 . Less commonly taught languages of emerging importance: Major issues, cost problems, and their national implications . In Proceedings of a National Policy Conference on Higher Education Act, Title VI and Fullbright-Hays Program. Janus, L. 1997. Less commonly taught languages of emerging importance: Major issues, cost problems, and their national implications. In Proceedings of a National Policy Conference on Higher Education Act, Title VI and Fullbright-Hays Program."},{"key":"e_1_2_1_48_1","volume-title":"Taraqqi Urdu Bureau","author":"Javed I.","unstructured":"Javed , I. 1981. Taraqqi Urdu Bureau , New Delhi . Javed, I. 1981. 
Taraqqi Urdu Bureau, New Delhi."},{"key":"e_1_2_1_49_1","unstructured":"Jiang J. and Conrath W. 1997. Semantic similarity based on corpus statistics and lexical taxonomy. Jiang J. and Conrath W. 1997. Semantic similarity based on corpus statistics and lexical taxonomy ."},{"key":"e_1_2_1_50_1","doi-asserted-by":"crossref","unstructured":"Kanaan G. Al-Shalabi R. and Sawalha M. 2005. Improving Arabic information retrieval systems using part of speech tagging. Inform. Technol. J. 4. 32--37. Kanaan G. Al-Shalabi R. and Sawalha M. 2005. Improving Arabic information retrieval systems using part of speech tagging. Inform. Technol. J. 4 . 32--37.","DOI":"10.3923\/itj.2005.32.37"},{"key":"e_1_2_1_51_1","unstructured":"Karimpour R. Ghorbani A. Pishdad A. Mohtarami M. AleAhmad A. Amiri H. and Oroumchian F. 2008. Using part of speech tagging in Persian information retrieval. CLEF Campaign. Karimpour R. Ghorbani A. Pishdad A. Mohtarami M. AleAhmad A. Amiri H. and Oroumchian F. 2008. Using part of speech tagging in Persian information retrieval. CLEF Campaign."},{"key":"e_1_2_1_52_1","unstructured":"Kashani M. Popowich F. and Sadat F. 2007. Automatic transliteration of proper nouns from Arabic to English. The Challenge of Arabic for NLP\/MT 76--84. Kashani M. Popowich F. and Sadat F. 2007. Automatic transliteration of proper nouns from Arabic to English. The Challenge of Arabic for NLP\/MT 76--84."},{"key":"e_1_2_1_53_1","volume-title":"two scripts: The Hindi movement in 19th century North India. Annual Urdu Studies 10","author":"King C. K.","unstructured":"King , C. K. 1995. One language , two scripts: The Hindi movement in 19th century North India. Annual Urdu Studies 10 . King, C. K. 1995.One language, two scripts: The Hindi movement in 19th century North India. Annual Urdu Studies 10."},{"key":"e_1_2_1_54_1","volume-title":"Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL\u201907)","author":"Ko J.","unstructured":"Ko , J. 
, Mitamura , T. , and Nyberg , E . 2007. Language-independent probabilistic answer ranking for multilingual question answering . In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL\u201907) . Ko, J., Mitamura, T., and Nyberg, E. 2007. Language-independent probabilistic answer ranking for multilingual question answering. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL\u201907)."},{"key":"e_1_2_1_55_1","volume-title":"Proceedings of the 18th International Conference on Machine Learning (ML\u201901)","author":"Lafferty J. D.","year":"2001","unstructured":"Lafferty , J. D. , McCallum , A. , Pereira , C. N. 2001 . Conditional random fields: Probabilistic models for segmenting and labeling sequence data . In Proceedings of the 18th International Conference on Machine Learning (ML\u201901) . 282--289. Lafferty, J. D., McCallum, A., Pereira, C. N. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the 18th International Conference on Machine Learning (ML\u201901). 282--289."},{"key":"e_1_2_1_56_1","doi-asserted-by":"crossref","unstructured":"Leech G. and Wilson A. 1999. Standards for tagsets. Edited version of EAGLES recommendations for the morphosyntactic annotation of corpora. In van Halteren H. Ed. Syntactic Wordclass Tagging Kluwer Academic Publishers Dordrecht. Leech G. and Wilson A. 1999. Standards for tagsets. Edited version of EAGLES recommendations for the morphosyntactic annotation of corpora. In van Halteren H. Ed. Syntactic Wordclass Tagging Kluwer Academic Publishers Dordrecht.","DOI":"10.1007\/978-94-015-9273-4_5"},{"key":"e_1_2_1_57_1","volume-title":"Proceedings of the IEEE International Conference on Information Reuse and Integration (CIRI\u201902)","author":"Liao P. P.","unstructured":"Liao , P. P. , Liu , Y. , and Chen , L . 2006. Hybrid Chinese text chunking . 
In Proceedings of the IEEE International Conference on Information Reuse and Integration (CIRI\u201902) . 561--566. Liao, P. P., Liu, Y., and Chen, L. 2006. Hybrid Chinese text chunking. In Proceedings of the IEEE International Conference on Information Reuse and Integration (CIRI\u201902). 561--566."},{"key":"e_1_2_1_58_1","volume-title":"Proceedings of the NEMLAR Conference on Arabic Language Resources and Tools (ALRT\u201904)","author":"Maamouri M.","unstructured":"Maamouri , M. , Bies , A. , Buckwalter , T. , and Mekki , M . 2004. The Penn arabic treebank: Building a large-scale annotated arabic corpus . In Proceedings of the NEMLAR Conference on Arabic Language Resources and Tools (ALRT\u201904) . 102--109. Maamouri, M., Bies, A., Buckwalter, T., and Mekki, M. 2004. The Penn arabic treebank: Building a large-scale annotated arabic corpus. In Proceedings of the NEMLAR Conference on Arabic Language Resources and Tools (ALRT\u201904). 102--109."},{"key":"e_1_2_1_59_1","first-page":"313","article-title":"Building a large annotated corpus of English: The Penn treebank","volume":"19","author":"Marcus M. P.","year":"1994","unstructured":"Marcus , M. P. , Santorini , B. , and Marcinkiewicz , M. A. 1994 . Building a large annotated corpus of English: The Penn treebank . Comput. Linguist. 19 , 313 -- 330 . Marcus, M. P., Santorini, B., and Marcinkiewicz, M. A. 1994. Building a large annotated corpus of English: The Penn treebank. Comput. Linguist. 19, 313--330.","journal-title":"Comput. Linguist."},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.3115\/1119176.1119206"},{"key":"e_1_2_1_61_1","volume-title":"Proceedings of the Natural Language Processing Pacific Rim Symposium (NLP\u201997)","author":"Meknavin S.","unstructured":"Meknavin , S. , Charoenpornsawat , P. , and Kijsirikul , B . 1997. Feature-based Thai word segmentation . In Proceedings of the Natural Language Processing Pacific Rim Symposium (NLP\u201997) . 
Meknavin, S., Charoenpornsawat, P., and Kijsirikul, B. 1997. Feature-based Thai word segmentation. In Proceedings of the Natural Language Processing Pacific Rim Symposium (NLP\u201997)."},{"key":"e_1_2_1_62_1","volume-title":"Proceedings of the 1st International Conference on Intelligent Human Computer Interaction (IHCI\u201909)","author":"Mishra N.","unstructured":"Mishra , N. , Yadav , S. , and Siddiqui , T. J . 2009. An unsupervised approach to Hindi word sense disambiguation . In Proceedings of the 1st International Conference on Intelligent Human Computer Interaction (IHCI\u201909) . 327--335. Mishra, N., Yadav, S., and Siddiqui, T. J. 2009. An unsupervised approach to Hindi word sense disambiguation. In Proceedings of the 1st International Conference on Intelligent Human Computer Interaction (IHCI\u201909). 327--335."},{"key":"e_1_2_1_63_1","unstructured":"Mitra M. and Majumder P. 2008. Forum for information retrieval evaluation (for Indian languages). Cross Language Evaluation Forum. Indian Statistical Institute Calcutta. Mitra M. and Majumder P. 2008. Forum for information retrieval evaluation (for Indian languages). Cross Language Evaluation Forum. Indian Statistical Institute Calcutta."},{"key":"e_1_2_1_64_1","volume-title":"Proceedings of the Conference on Text REtrieval (TREC\u201902)","author":"Moldovan D.","year":"2002","unstructured":"Moldovan , D. , Harabagiu , S. , Girju , R. , Morarescu , P. , Lacatusu , F. , Novischi , A. , Badulescu , A. , Bolohan , O. 2002 . LCC tools for question answering . In Proceedings of the Conference on Text REtrieval (TREC\u201902) . Moldovan, D., Harabagiu, S., Girju, R., Morarescu, P., Lacatusu, F., Novischi, A., Badulescu, A., Bolohan, O. 2002. LCC tools for question answering. In Proceedings of the Conference on Text REtrieval (TREC\u201902)."},{"key":"e_1_2_1_65_1","volume-title":"Proceedings of the 7th Workshop on Asian Language Resources (ACL-IJCNLP\u201909)","author":"Muaz A.","unstructured":"Muaz , A. 
, Ali , A. , and Hussain , S . 2009. Analysis and development of Urdu POS tagged corpus . In Proceedings of the 7th Workshop on Asian Language Resources (ACL-IJCNLP\u201909) . 24--31. Muaz, A., Ali, A., and Hussain, S. 2009. Analysis and development of Urdu POS tagged corpus. In Proceedings of the 7th Workshop on Asian Language Resources (ACL-IJCNLP\u201909). 24--31."},{"key":"e_1_2_1_66_1","unstructured":"Muaz A. and Khan A. N. The morphosyntactic behavior of \u201dWala\u201d in Urdu language. Muaz A. and Khan A. N. The morphosyntactic behavior of \u201dWala\u201d in Urdu language ."},{"key":"e_1_2_1_67_1","volume-title":"Proceedings of the 7th International Conference on Natural Language Processing (ICNLP\u201909)","author":"Mukund S.","unstructured":"Mukund , S. , Peterson , E. , and Srihari , R. K . 2009. Context aware transliterations of Urdu names . In Proceedings of the 7th International Conference on Natural Language Processing (ICNLP\u201909) . Mukund, S., Peterson, E., and Srihari, R. K. 2009. Context aware transliterations of Urdu names. In Proceedings of the 7th International Conference on Natural Language Processing (ICNLP\u201909)."},{"key":"e_1_2_1_68_1","volume-title":"Proceedings of the 3rd International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies (CLIAWS3\u201909)","author":"Mukund S.","unstructured":"Mukund , S. and Srihari , R. K . 2009. NE tagging for Urdu based on bootstrap POS learning . In Proceedings of the 3rd International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies (CLIAWS3\u201909) . Mukund, S. and Srihari, R. K. 2009. NE tagging for Urdu based on bootstrap POS learning. 
In Proceedings of the 3rd International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies (CLIAWS3\u201909)."},{"key":"e_1_2_1_69_1","volume-title":"Proceedings of the 8th International Conference on Machine Learning and Cybernetics (ICMLC\u201909)","author":"Nicholls C.","unstructured":"Nicholls , C. , and Song , F . 2009. Improving sentiment analysis with part-of-speech weighting . In Proceedings of the 8th International Conference on Machine Learning and Cybernetics (ICMLC\u201909) . Nicholls, C., and Song, F. 2009. Improving sentiment analysis with part-of-speech weighting. In Proceedings of the 8th International Conference on Machine Learning and Cybernetics (ICMLC\u201909)."},{"key":"e_1_2_1_70_1","first-page":"1","article-title":"Semantic role labeling for Tamil documents","volume":"1","author":"Pandian S. L.","year":"2009","unstructured":"Pandian , S. L. and Geetha , T. V. 2009 . Semantic role labeling for Tamil documents . Int. J. Recent Trends Eng. 1 , 1 . Pandian, S. L. and Geetha, T. V. 2009. Semantic role labeling for Tamil documents. Int. J. Recent Trends Eng. 1, 1.","journal-title":"Int. J. Recent Trends Eng."},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.3115\/1075812.1075875"},{"key":"e_1_2_1_72_1","unstructured":"Platts T. 1967. A Grammar of the Hindustani or Urdu Language. Munshiram Manoharlal Delhi. Platts T. 1967. A Grammar of the Hindustani or Urdu Language . Munshiram Manoharlal Delhi."},{"key":"e_1_2_1_74_1","volume-title":"Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI\u201905)","author":"Punyakanok V.","unstructured":"Punyakanok , V. , Roth , D. , and Yih , W. T . 2005. The necessity of syntactic parsing for semantic role labeling . In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI\u201905) . 1117. Punyakanok, V., Roth, D., and Yih, W. T. 2005. 
The necessity of syntactic parsing for semantic role labeling. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI\u201905). 1117."},{"key":"e_1_2_1_75_1","volume-title":"Proceedings of the 3rd Workshop on Very Large Corpora (VLC\u201995)","author":"Ramshaw L. A.","unstructured":"Ramshaw , L. A. and Marcus , M. P . 1995. Text chunking using transformation-based learning . In Proceedings of the 3rd Workshop on Very Large Corpora (VLC\u201995) . 82--94. Ramshaw, L. A. and Marcus, M. P. 1995. Text chunking using transformation-based learning. In Proceedings of the 3rd Workshop on Very Large Corpora (VLC\u201995). 82--94."},{"key":"e_1_2_1_76_1","volume-title":"Proceedings of the Empirical Methods in Natural Language Processing Conference (EMNLP\u201996)","author":"Ratnaparkhi A.","year":"1996","unstructured":"Ratnaparkhi , A. 1996 . A maximum entropy model for part-of-speech tagger . In Proceedings of the Empirical Methods in Natural Language Processing Conference (EMNLP\u201996) . Ratnaparkhi, A. 1996. A maximum entropy model for part-of-speech tagger. In Proceedings of the Empirical Methods in Natural Language Processing Conference (EMNLP\u201996)."},{"key":"e_1_2_1_77_1","volume-title":"Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL\u201908)","author":"Richman A.","unstructured":"Richman , A. and Schone , P . 2008. Mining Wiki resources for multilingual named entity recognition . In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL\u201908) . Richman, A. and Schone, P. 2008. Mining Wiki resources for multilingual named entity recognition. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL\u201908)."},{"key":"e_1_2_1_78_1","volume-title":"Proceedings of the Engineering, Sciences and Technology Student Conference (SCONEST\u201904)","author":"Rizvi J. S.","unstructured":"Rizvi , J. S. , Husssain , M. 
, and Qaiser , N . 2004. Language oriented parsing through morphologically closed word classes in Urdu . In Proceedings of the Engineering, Sciences and Technology Student Conference (SCONEST\u201904) . 19--24. Rizvi, J. S., Husssain, M., and Qaiser, N. 2004. Language oriented parsing through morphologically closed word classes in Urdu. In Proceedings of the Engineering, Sciences and Technology Student Conference (SCONEST\u201904). 19--24."},{"key":"e_1_2_1_79_1","first-page":"203","article-title":"Some notes on Hindi and Urdu","volume":"11","author":"Russell R.","year":"1996","unstructured":"Russell , R. 1996 . Some notes on Hindi and Urdu . Annual Urdu Studies 11 , 203 -- 208 . Russell, R. 1996. Some notes on Hindi and Urdu. Annual Urdu Studies 11, 203--208.","journal-title":"Annual Urdu Studies"},{"key":"e_1_2_1_80_1","volume-title":"Proceedings of the Conference on Human Language Translations (HLT\u201908)","author":"Saha S. K.","unstructured":"Saha , S. K. , Mitra , P. , and Sarkar , S . 2008. Word clustering and word selection based feature reduction for MaxEnt based Hindi NER . In Proceedings of the Conference on Human Language Translations (HLT\u201908) . 488--95. Saha, S. K., Mitra, P., and Sarkar, S. 2008. Word clustering and word selection based feature reduction for MaxEnt based Hindi NER. In Proceedings of the Conference on Human Language Translations (HLT\u201908). 488--95."},{"key":"e_1_2_1_81_1","volume-title":"Statistical part of speech tagger for Urdu. Unpublished Master\u2019s Thesis","author":"Sajjad H.","unstructured":"Sajjad , H. 2007. Statistical part of speech tagger for Urdu. Unpublished Master\u2019s Thesis . National University of Computer and Emerging Sciences . Lahore, Pakistan. Sajjad, H. 2007. Statistical part of speech tagger for Urdu. Unpublished Master\u2019s Thesis. National University of Computer and Emerging Sciences. 
Lahore, Pakistan."},{"key":"e_1_2_1_82_1","volume-title":"Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL\u201909)","author":"Sajjad H.","unstructured":"Sajjad , H. and Schmid , H . 2009. Tagging Urdu text with parts of speech: A tagger comparison . In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL\u201909) . 692--700. Sajjad, H. and Schmid, H. 2009. Tagging Urdu text with parts of speech: A tagger comparison. In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL\u201909). 692--700."},{"key":"e_1_2_1_83_1","volume-title":"Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP\u201905)","author":"Samy D.","unstructured":"Samy , D. , Moreno , A. , and Guirao , M . 2005. A proposal for an Arabic named entity tagger leveraging a parallel corpus . In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP\u201905) . Samy, D., Moreno, A., and Guirao, M. 2005. A proposal for an Arabic named entity tagger leveraging a parallel corpus. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP\u201905)."},{"key":"e_1_2_1_84_1","volume-title":"Part-of-Speech Tagging Guidelines for the Penn Treebank Project","author":"Santorini B.","unstructured":"Santorini , B. 1990. Part-of-Speech Tagging Guidelines for the Penn Treebank Project 3rd Ed. University of Pennsylvania . Santorini, B. 1990. Part-of-Speech Tagging Guidelines for the Penn Treebank Project 3rd Ed. University of Pennsylvania.","edition":"3"},{"key":"e_1_2_1_85_1","volume-title":"Probabilistic Part-of-Speech Tagging Using Decision Trees","author":"Schmid H.","unstructured":"Schmid , H. 1994. Probabilistic Part-of-Speech Tagging Using Decision Trees . 
Institut f\u00fcr Maschinelle Sprachverarbeitung, Universit\u00e4t Stuttgart, Germany . Schmid, H. 1994. Probabilistic Part-of-Speech Tagging Using Decision Trees. Institut f\u00fcr Maschinelle Sprachverarbeitung, Universit\u00e4t Stuttgart, Germany."},{"key":"e_1_2_1_86_1","volume-title":"Urdu: An Essential Grammar","author":"Schmidt R.","year":"1999","unstructured":"Schmidt , R. 1999 . Urdu: An Essential Grammar . Routledge , London . Schmidt, R. 1999. Urdu: An Essential Grammar. Routledge, London."},{"key":"e_1_2_1_87_1","volume-title":"Proceedings of the International Conference on Universal Knowledge and Language (ICUKL\u201902)","author":"Shah C.","unstructured":"Shah , C. and Bhattacharyya , P . 2002. A study for evaluating the importance of various parts of speech (POS) for information retrieval (IR) . In Proceedings of the International Conference on Universal Knowledge and Language (ICUKL\u201902) . Shah, C. and Bhattacharyya, P. 2002. A study for evaluating the importance of various parts of speech (POS) for information retrieval (IR). In Proceedings of the International Conference on Universal Knowledge and Language (ICUKL\u201902)."},{"key":"e_1_2_1_88_1","unstructured":"Shamsfard M. and Mousavi M. S. 2008. Thematic role extraction using shallow parsing. Int. J. Computat. Intell. Shamsfard M. and Mousavi M. S. 2008. Thematic role extraction using shallow parsing. Int. J. Computat. Intell."},{"key":"e_1_2_1_89_1","volume-title":"Jaame-ul-Qwaid - \u00c7 Head of the Department of Urdu","author":"Siddiqi A.","unstructured":"Siddiqi , A. 1971. Jaame-ul-Qwaid - \u00c7 Head of the Department of Urdu , Karachi University , Markazi Urdu Board, Lahore. Siddiqi, A. 1971. 
Jaame-ul-Qwaid - \u00c7 Head of the Department of Urdu, Karachi University, Markazi Urdu Board, Lahore."},{"key":"e_1_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.5555\/1641976.1641985"},{"key":"e_1_2_1_91_1","volume-title":"Proceedings of Eurolan Doctoral Consortium (EUROLAN\u201907)","author":"Singh A. K.","unstructured":"Singh , A. K. and Surana , H . 2007. Using a single framework for computational modeling of linguistic similarity for solving many NLP problems . In Proceedings of Eurolan Doctoral Consortium (EUROLAN\u201907) . Singh, A. K. and Surana, H. 2007. Using a single framework for computational modeling of linguistic similarity for solving many NLP problems. In Proceedings of Eurolan Doctoral Consortium (EUROLAN\u201907)."},{"key":"e_1_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.5555\/1698239.1698247"},{"key":"e_1_2_1_93_1","volume-title":"Proceedings of the 19th International Conference on Computational Linguistics (COLING).","author":"Sproat R.","unstructured":"Sproat , R. and Shih , C . 2002. Corpus-based methods in Chinese morphology and phonology . In Proceedings of the 19th International Conference on Computational Linguistics (COLING). Sproat, R. and Shih, C. 2002. Corpus-based methods in Chinese morphology and phonology. In Proceedings of the 19th International Conference on Computational Linguistics (COLING)."},{"key":"e_1_2_1_94_1","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324906004116"},{"key":"e_1_2_1_95_1","volume-title":"Proceedings of the COLING\/ACL Workshop on Computational Approaches to Semitic Languages (CAASL\u201998)","author":"Stalls B. G.","unstructured":"Stalls , B. G. and Knight , K . 1998. Translating names and technical terms in Arabic text . In Proceedings of the COLING\/ACL Workshop on Computational Approaches to Semitic Languages (CAASL\u201998) . Stalls, B. G. and Knight, K. 1998. Translating names and technical terms in Arabic text. 
In Proceedings of the COLING\/ACL Workshop on Computational Approaches to Semitic Languages (CAASL\u201998)."},{"key":"e_1_2_1_96_1","unstructured":"Stanislav M. 2003. Statistical approach to the debate on Urdu and Hindi (student paper). Annual Urdu Studies 18. Stanislav M. 2003. Statistical approach to the debate on Urdu and Hindi (student paper). Annual Urdu Studies 18 ."},{"key":"e_1_2_1_97_1","volume-title":"Proceedings of the 3rd Workshop on Statistical Machine Translation (SMT\u201908)","author":"Stymne S.","unstructured":"Stymne , S. , Holmqvist , M. , and Ahrenberg , L . 2008. Effects of morphological analysis in translation between German and English . In Proceedings of the 3rd Workshop on Statistical Machine Translation (SMT\u201908) , 135--138. Stymne, S., Holmqvist, M., and Ahrenberg, L. 2008. Effects of morphological analysis in translation between German and English. In Proceedings of the 3rd Workshop on Statistical Machine Translation (SMT\u201908), 135--138."},{"key":"e_1_2_1_98_1","volume-title":"Application Programming Interface","author":"Urdu Component Development Project","year":"2007","unstructured":"Urdu Component Development Project , Application Programming Interface 2007 . Normalization Utility, Center for Research in Urdu Language Processing, National University of Computer and Emerging Sciences , Lahore, Pakistan . Urdu Component Development Project, Application Programming Interface 2007. Normalization Utility, Center for Research in Urdu Language Processing, National University of Computer and Emerging Sciences, Lahore, Pakistan."},{"key":"e_1_2_1_99_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242729"},{"key":"e_1_2_1_100_1","unstructured":"Wilks Y. and Stevenson M. 1996. The grammar of sense: Is word-sense tagging much more than part-of-speech tagging? Wilks Y. and Stevenson M. 1996. 
The grammar of sense: Is word-sense tagging much more than part-of-speech tagging?"},{"key":"e_1_2_1_101_1","first-page":"29","article-title":"Chinese word segmentation as character tagging","volume":"8","author":"Xue N. W.","year":"2003","unstructured":"Xue , N. W. 2003 . Chinese word segmentation as character tagging . Comput. Linguist. Chinese Lang. Proc. 8 , 1, 29 -- 48 . Xue, N. W. 2003. Chinese word segmentation as character tagging. Comput. Linguist. Chinese Lang. Proc. 8, 1, 29--48.","journal-title":"Comput. Linguist. Chinese Lang. Proc."},{"key":"e_1_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.1017\/S135132490400364X"},{"key":"e_1_2_1_103_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073083.1073163"}],"container-title":["ACM Transactions on Asian Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1838751.1838754","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1838751.1838754","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T11:39:49Z","timestamp":1750246789000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1838751.1838754"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,12]]},"references-count":101,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2010,12]]}},"alternative-id":["10.1145\/1838751.1838754"],"URL":"https:\/\/doi.org\/10.1145\/1838751.1838754","relation":{},"ISSN":["1530-0226","1558-3430"],"issn-type":[{"value":"1530-0226","type":"print"},{"value":"1558-3430","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,12]]},"assertion":[{"value":"2009-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication 
History"}},{"value":"2010-04-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2010-12-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}