{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T17:57:31Z","timestamp":1770746251657,"version":"3.49.0"},"reference-count":68,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2021,11,3]],"date-time":"2021-11-03T00:00:00Z","timestamp":1635897600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2022,3,31]]},"abstract":"<jats:p>During the last two decades, sentiment analysis, also known as opinion mining, has become one of the most explored research areas in Natural Language Processing (NLP) and data mining. Sentiment analysis focuses on the sentiments or opinions of consumers expressed over social media or different web sites. Due to exposure on the Internet, sentiment analysis has attracted vast numbers of researchers over the globe. A large amount of research has been conducted in English, Chinese, and other languages used worldwide. However, Roman Urdu has been neglected despite being the third most used language for communication in the world, covering millions of users around the globe. Although some techniques have been proposed for sentiment analysis in Roman Urdu, these techniques are limited to a specific domain or developed incorrectly due to the unavailability of language resources available for Roman Urdu. Therefore, in this article, we are proposing an unsupervised approach for sentiment analysis in Roman Urdu. First, the proposed model normalizes the text to overcome spelling variations of different words. After normalizing text, we have used Roman Urdu and English opinion lexicons to correctly identify users\u2019 opinions from the text. We have also incorporated negation terms and stemming to assign polarities to each extracted opinion. Furthermore, our model assigns a score to each sentence on the basis of the polarities of extracted opinions and classifies each sentence as positive, negative, or neutral. In order to verify our approach, we have conducted experiments on two publicly available datasets for Roman Urdu and compared our approach with the existing model. Results have demonstrated that our approach outperforms existing models for sentiment analysis tasks in Roman Urdu. Furthermore, our approach does not suffer from domain dependency.<\/jats:p>","DOI":"10.1145\/3474119","type":"journal-article","created":{"date-parts":[[2021,11,3]],"date-time":"2021-11-03T15:03:32Z","timestamp":1635951812000},"page":"1-16","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["An Unsupervised Approach for Sentiment Analysis on Social Media Short Text Classification in Roman Urdu"],"prefix":"10.1145","volume":"21","author":[{"given":"Toqir A.","family":"Rana","sequence":"first","affiliation":[{"name":"Department of Computer Science &amp; IT, The University of Lahore, Lahore, Pakistan and School of Computer Sciences, Universiti Sains Malaysia, Penang, Malaysia"}]},{"given":"Kiran","family":"Shahzadi","sequence":"additional","affiliation":[{"name":"Department of Software Engineering, The University of Lahore, Lahore, Pakistan"}]},{"given":"Tauseef","family":"Rana","sequence":"additional","affiliation":[{"name":"Department of Computer Software Engineering, MCS, National University of Sciences and Technology (NUST), Islamabad, Pakistan"}]},{"given":"Ahsan","family":"Arshad","sequence":"additional","affiliation":[{"name":"Department of Computer Science &amp; IT, The University of Lahore, Lahore, Pakistan"}]},{"given":"Mohammad","family":"Tubishat","sequence":"additional","affiliation":[{"name":"School of Information Technology, Skyline University College, Sharjah, United Arab Emirates"}]}],"member":"320","published-online":{"date-parts":[[2021,11,3]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.2994950"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1145\/1838002.1838025"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2017.2787798"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3039548"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1111\/exsy.12397"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/3300050"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2018.07.006"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jksuci.2015.11.003"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-019-04525-y"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.5121\/cseij.2014.4601"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2019.01.202"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-019-04297-5"},{"issue":"1","key":"e_1_3_2_14_2","first-page":"21","article-title":"Opinion within opinion: Segmentation approach for Urdu sentiment analysis","volume":"15","author":"Hassan Muhammad","year":"2018","unstructured":"Muhammad Hassan and Muhammad Shoaib. 2018. Opinion within opinion: Segmentation approach for Urdu sentiment analysis. International Arab Journal of Information Technology 15, 1 (2018), 21\u201328.","journal-title":"International Arab Journal of Information Technology"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.5555\/2390374.2390383"},{"key":"e_1_3_2_16_2","volume-title":"Proceedings of the 6th International Conference on Language and Technology, Lahore, Pakistan","author":"Jabbar Abdul","year":"2016","unstructured":"Abdul Jabbar, Sajid Iqbal, and Muhammad Usman Ghani Khan. 2016. Analysis and development of resources for Urdu text stemming. In Proceedings of the 6th International Conference on Language and Technology, Lahore, Pakistan. 1\u20137."},{"key":"e_1_3_2_17_2","first-page":"164","volume-title":"ESSEM@ AI* IA, Citeseer","author":"Javed Iqra","year":"2013","unstructured":"Iqra Javed and Hammad Afzal. 2013. Opinion analysis of Bi-lingual event data from social networks. In ESSEM@ AI* IA, Citeseer, 164\u2013172."},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00371-020-01927-0"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324920000285"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.14569\/IJACSA.2018.090981"},{"issue":"1","key":"e_1_3_2_21_2","article-title":"Pattern and semantic analysis to improve unsupervised techniques for opinion target identification","volume":"43","author":"Khan Khairullah","year":"2016","unstructured":"Khairullah Khan, Ashraf Ullah, and Baharum Baharudin. 2016. Pattern and semantic analysis to improve unsupervised techniques for opinion target identification. Kuwait Journal of Science 43, 1 (2016), 129\u2013149.","journal-title":"Kuwait Journal of Science"},{"key":"e_1_3_2_22_2","first-page":"630","volume-title":"Future of Information and Communication Conference","author":"Khan Moin","year":"2018","unstructured":"Moin Khan and Kamran Malik. 2018. Sentiment classification of customer's reviews about automobiles in Roman Urdu. In Future of Information and Communication Conference, Singapore, Springer, 630\u2013640."},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1007\/s13278-016-0381-6"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.102141"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-017-9607-x"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-019-09727-2"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/RICE.2018.8509045"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11063-018-9913-6"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11831-019-09332-0"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2020.102233"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.asej.2014.04.011"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1145\/3329709"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2020.102211"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1111\/exsy.12317"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.tele.2018.08.003"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001418510011"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.5555\/2390374.2390375"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1145\/1838751.1838754"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12046-019-1126-9"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-019-03897-5"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-020-05018-z"},{"key":"e_1_3_2_42_2","article-title":"Generating an emotion ontology for Roman Urdu text","volume":"7","author":"Nargis Gule Zulf","year":"2016","unstructured":"Gule Zulf Nargis and Noreen Jamil. 2016. Generating an emotion ontology for Roman Urdu text. International Journal of Computational Linguistics Research 7, (2016), 83\u201391.","journal-title":"International Journal of Computational Linguistics Research"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-23943-5_16"},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12559-017-9470-8"},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.102084"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.32604\/iasc.2021.018572"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.5614\/itbj.ict.res.appl.2016.10.1.6"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10489-020-01817-x"},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-016-9472-z"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICAICTA.2016.7803101"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2017.07.047"},{"key":"e_1_3_2_52_2","first-page":"317","volume-title":"International Conference on Computing and Information Technology","author":"Rana Toqir A.","year":"2017","unstructured":"Toqir A. Rana and Yu-N. Cheah. 2017. Improving aspect extraction using aspect frequency and semantic similarity-based approach for aspect-based sentiment analysis. In International Conference on Computing and Information Technology, Bangkok, Thailand, Springer, 317\u2013326."},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1166\/asl.2018.10752"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.1177\/0165551518808195"},{"key":"e_1_3_2_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/CITA.2015.7349820"},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2015.06.015"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/INTECH.2016.7845095"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2018.08.044"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2015.2485209"},{"issue":"12","key":"e_1_3_2_60_2","first-page":"213","article-title":"Lexical normalization of Roman Urdu text","volume":"17","author":"Sharf Zareen","year":"2017","unstructured":"Zareen Sharf and Saif Ur Rahman. 2017. Lexical normalization of Roman Urdu text. International Journal of Computer Science and Network Security 17, 12 (2017), 213\u2013221.","journal-title":"International Journal of Computer Science and Network Security"},{"issue":"1","key":"e_1_3_2_61_2","first-page":"252","article-title":"A comparison and analysis of name matching algorithms","volume":"4","author":"Snae Chakkrit","year":"2007","unstructured":"Chakkrit Snae. 2007. A comparison and analysis of name matching algorithms. International Journal of Applied Science, Engineering and Technology 4, 1 (2007), 252\u2013257.","journal-title":"International Journal of Applied Science, Engineering and Technology"},{"key":"e_1_3_2_62_2","volume-title":"PACIS","volume":"96","author":"Sohail Omayya","year":"2018","unstructured":"Omayya Sohail, Inam Elahi, Ahsan Ijaz, Asim Karim, and Faisal Kamiran. 2018. Text classification in an under-resourced language via lexical normalization and feature pooling. In PACIS, Yokohama, Japan. 96."},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2020.105572"},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.5555\/1927149.1927155"},{"key":"e_1_3_2_65_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-25324-9_33"},{"key":"e_1_3_2_66_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2019.112834"},{"key":"e_1_3_2_67_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2909919"},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2018.12.004"},{"key":"e_1_3_2_69_2","doi-asserted-by":"publisher","DOI":"10.1002\/widm.1253"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3474119","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3474119","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:35Z","timestamp":1750191455000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3474119"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,3]]},"references-count":68,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,3,31]]}},"alternative-id":["10.1145\/3474119"],"URL":"https:\/\/doi.org\/10.1145\/3474119","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,11,3]]},"assertion":[{"value":"2021-01-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-07-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-11-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}