{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T16:35:46Z","timestamp":1781109346928,"version":"3.54.1"},"reference-count":27,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2019,6,20]],"date-time":"2019-06-20T00:00:00Z","timestamp":1560988800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2019,6,30]]},"abstract":"<jats:p>Sentiment classification is a popular text mining task in which textual content (e.g., a message) is assigned a polarity label (typically positive or negative) reflecting the sentiment expressed in it. Sentiment classification is used widely in applications like customer feedback analysis where robustness and correctness of results are critical. In this article, we highlight that prediction accuracy alone is not sufficient for assessing the performance of a sentiment classifier; it is also important that the classifier is not biased toward positive or negative polarity, thus distorting the distribution of positive and negative messages in the predictions. We propose a measure, called Polarity Bias Rate, for quantifying this bias in a sentiment classifier. Second, we present two methods for removing this bias in the predictions of unsupervised and supervised sentiment classifiers. Our first method, called Bias-Aware Thresholding (BAT), shifts the decision boundary to control the bias in the predictions. Motivated from cost-sensitive learning, BAT is easily applicable to both lexicon-based unsupervised and supervised classifiers. Our second method, called Balanced Logistic Regression (BLR) introduces a bias-remover constraint into the standard logistic regression model. BLR is an automatic bias-free supervised sentiment classifier.<\/jats:p>\n          <jats:p>We evaluate our methods extensively on seven real-world datasets. The experiments involve two lexicon-based and two supervised sentiment classifiers and include evaluation on multiple train-test data sizes. The results show that bias is controlled effectively in predictions. Furthermore, prediction accuracy is also increased in many cases, thus enhancing the robustness of sentiment classification.<\/jats:p>","DOI":"10.1145\/3328795","type":"journal-article","created":{"date-parts":[[2019,6,20]],"date-time":"2019-06-20T12:18:56Z","timestamp":1561033136000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":22,"title":["Balancing Prediction Errors for Robust Sentiment Classification"],"prefix":"10.1145","volume":"13","author":[{"given":"Mohsin","family":"Iqbal","sequence":"first","affiliation":[{"name":"Information Technology University of the Punjab, Lahore, Pakistan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Asim","family":"Karim","sequence":"additional","affiliation":[{"name":"Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Faisal","family":"Kamiran","sequence":"additional","affiliation":[{"name":"Information Technology University of the Punjab, Lahore, Pakistan"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2019,6,20]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media.","author":"Bollen Johan","year":"2011"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-010-0190-x"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1613\/jair.953"},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the International Conference on Learning Representations.","author":"Edwards Harrison"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI\u201901)","author":"Elkan Charles","year":"2001"},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the 30th International Conference on Neural Information Processing Systems. 2423--2431","author":"Goh Gabriel","year":"2016"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2512938.2512951"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2008.239"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2695664.2695759"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2012.45"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2017.09.064"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2011.83"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/2063576.2063994"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2020408.2020488"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/2002472.2002491"},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of IEEE International Conference on Systems, Man, and Cybernetics (SMC\u201912)","author":"Mountassir A."},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the ESWC2011 Workshop on \u2018Making Sense of Microposts\u2019: Big Things Come in Small Packages. 93--98","author":"Nielsen F.","year":"2011"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.3115\/1218955.1218990"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1401959"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2009.03.002"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2007.04.009"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCB.2008.2002909"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1177\/0261927X09351676"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.21662"},{"key":"e_1_2_1_25_1","first-page":"12","article-title":"Sentiment in short strength detection informal text","volume":"61","author":"Thelwall Mike","year":"2010","journal-title":"Journal of the Association for Information Science and Technology"},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the 20th International Conference on Artificial Intelligence and Statistics. 962--970","author":"Zafar Muhammad Bilal"},{"key":"e_1_2_1_27_1","volume-title":"Proceedings of the 2nd Workshop on Fairness, Accountability, and Transparency in Machine Learning.","author":"Zliobaite Indre","year":"2015"}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3328795","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3328795","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:33:03Z","timestamp":1750199583000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3328795"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,6,20]]},"references-count":27,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2019,6,30]]}},"alternative-id":["10.1145\/3328795"],"URL":"https:\/\/doi.org\/10.1145\/3328795","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,6,20]]},"assertion":[{"value":"2016-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-04-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-06-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}