{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,4]],"date-time":"2026-02-04T01:07:06Z","timestamp":1770167226929,"version":"3.49.0"},"reference-count":46,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2020,3,16]],"date-time":"2020-03-16T00:00:00Z","timestamp":1584316800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2020,5,29]]},"abstract":"<jats:p>\u00a0With the proliferation of social media and mobile technology, huge amount of unstructured data is posted daily online. Consequently, sentiment analysis has gained increasing importance as a tool to understand the opinions of certain groups of people on contemporary political, cultural, social or commercial issues. Unlike western languages, the research on sentiment analysis for dialectical Arabic language is still in its early stages with several challenges to be addressed. The main goal of this study is twofold. First, it compares the performance of core machine learning algorithms for detecting the polarity in imbalanced Arabic tweet datasets using neural word embedding as a feature extractor rather than hand-crafted or traditional features. Second, it examines the impact of using various oversampling techniques to handle the highly-imbalanced nature of the sentiment data. Intensive empirical analysis of nine machine learning methods and six oversampling methods has been conducted and the results have been discussed in terms of a wide range of performance measures.<\/jats:p>","DOI":"10.3233\/jifs-179703","type":"journal-article","created":{"date-parts":[[2020,3,17]],"date-time":"2020-03-17T15:17:34Z","timestamp":1584458254000},"page":"6211-6222","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":9,"title":["Empirical study on imbalanced learning of Arabic sentiment polarity with neural word embedding"],"prefix":"10.1177","volume":"38","author":[{"given":"El-Sayed M.","family":"El-Alfy","sequence":"first","affiliation":[{"name":"Information and Computer Science Department, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia"}]},{"given":"Sadam","family":"Al-Azani","sequence":"additional","affiliation":[{"name":"Information and Computer Science Department, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia"}]}],"member":"179","published-online":{"date-parts":[[2020,3,16]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-02145-9"},{"key":"e_1_3_2_3_2","first-page":"1","article-title":"Fundamentals of Sentiment Analysis and Its Applications","author":"Farhadloo M.","year":"2016","unstructured":"FarhadlooM. and RollandE., Fundamentals of Sentiment Analysis and Its Applications, in: Sentiment Analysis and Ontology Engineering, 2016, pp. 1\u201324.","journal-title":"Sentiment Analysis and Ontology Engineering"},{"key":"e_1_3_2_4_2","doi-asserted-by":"crossref","unstructured":"HuM. and LiuB. Mining and summarizing customer reviews in: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2004 pp. 168\u2013177.","DOI":"10.1145\/1014052.1014073"},{"key":"e_1_3_2_5_2","doi-asserted-by":"crossref","unstructured":"AlmC.O. RothD. and SproatR. Emotions from text: machine learning for text-based emotion prediction in: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing Association for Computational Linguistics 2005 pp. 579\u2013586.","DOI":"10.3115\/1220575.1220648"},{"key":"e_1_3_2_6_2","doi-asserted-by":"crossref","unstructured":"YuH. and HatzivassiloglouV. Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences in: Proceedings of the International Conference on Empirical Methods in Natural Language Processing Association for Computational Linguistics 2003 pp. 129\u2013136.","DOI":"10.3115\/1119355.1119372"},{"key":"e_1_3_2_7_2","unstructured":"CarvalhoP. SarmentoL. TeixeiraJ. and SilvaM.J. Liars and saviors in a sentiment annotated corpus of comments to political debates in: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies 2011 pp. 564\u2013568."},{"key":"e_1_3_2_8_2","doi-asserted-by":"crossref","unstructured":"LloydL. KechagiasD. and SkienaS. Lydia: A system for large-scale news analysis in: International Symposium on String Processing and Information Retrieval 2005 pp. 161\u2013166.","DOI":"10.1007\/11575832_18"},{"key":"e_1_3_2_9_2","first-page":"353","article-title":"Summarizing Emails with Conversational Cohesion and Subjectivity","volume":"8","author":"Carenini G.","year":"2008","unstructured":"CareniniG., NgR.T. and ZhouX., Summarizing Emails with Conversational Cohesion and Subjectivity, in: Association for Computational Linguistics8 (2008), 353\u2013361.","journal-title":"Association for Computational Linguistics"},{"key":"e_1_3_2_10_2","doi-asserted-by":"crossref","unstructured":"HailongZ. WenyanG. and BoJ. Machine learning and lexicon based methods for sentiment classification: A survey in: IEEE Web Information System and Application Conference (WISA) 2014 pp. 262\u2013265.","DOI":"10.1109\/WISA.2014.55"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2015.06.015"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2016.10.004"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2012.12.084"},{"key":"e_1_3_2_14_2","doi-asserted-by":"crossref","unstructured":"ParlarT. \u00d6zelS.A. and SongF. Interactions between term weighting and feature selection methods on the sentiment analysis of Turkish reviews in: International Conference on Intelligent Text Processing and Computational Linguistics 2016 pp. 335\u2013346.","DOI":"10.1007\/978-3-319-75487-1_26"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12559-017-9513-1"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.3233\/IFS-151574"},{"key":"e_1_3_2_17_2","doi-asserted-by":"crossref","unstructured":"Al ShboulB. Al-AyyoubM. and JararwehY. Multi-way sentiment classification of arabic reviews in: Proceedings of the 6th IEEE International Conference on Information and Communication Systems (ICICS) 2015 pp. 206\u2013211.","DOI":"10.1109\/IACS.2015.7103228"},{"issue":"1","key":"e_1_3_2_18_2","first-page":"15","article-title":"Data and Text Mining Techniques for Classifying Arabic Tweet Polarity","volume":"14","author":"Brahimi B.","year":"2016","unstructured":"BrahimiB., TouahriaM. and TariA., Data and Text Mining Techniques for Classifying Arabic Tweet Polarity, Journal of Digital Information Management14(1) (2016), 15.","journal-title":"Journal of Digital Information Management"},{"key":"e_1_3_2_19_2","first-page":"429","author":"Omar N.","year":"2014","unstructured":"OmarN., AlbaredM., Al-MoslmiT. and Al-ShabiA., A comparative study of feature selection and machine learning algorithms for Arabic sentiment classification, in: Asia Information Retrieval Symposium, 2014, pp. 429\u2013443.","journal-title":"A comparative study of feature selection and machine learning algorithms for Arabic sentiment classification"},{"key":"e_1_3_2_20_2","doi-asserted-by":"crossref","unstructured":"Rabab\u2019ahA.M. Al-AyyoubM. JararwehY. and Al-KabiM.N. Evaluating SentiStrength for Arabic Sentiment Analysis in: Proceedings of the 7th IEEE International Conference on Computer Science and Information Technology (CSIT) 2016 pp. 1\u20136.","DOI":"10.1109\/CSIT.2016.7549458"},{"key":"e_1_3_2_21_2","unstructured":"RefaeeE. and RieserV. An Arabic Twitter Corpus for Subjectivity and Sentiment Analysis in: Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC) 2014 pp. 2268\u20132273."},{"key":"e_1_3_2_22_2","doi-asserted-by":"crossref","unstructured":"NabilM. AlyM. and AtiyaA.F. Astd: Arabic sentiment tweets dataset in: Proceedings of the International Conference on Empirical Methods in Natural Language Processing 2015 pp. 2515\u20132519.","DOI":"10.18653\/v1\/D15-1299"},{"key":"e_1_3_2_23_2","doi-asserted-by":"crossref","unstructured":"ElSaharH. and El-BeltagyS.R. Building large Arabic multi-domain resources for sentiment analysis in: International Conference on Intelligent Text Processing and Computational Linguistics 2015 pp. 23\u201334.","DOI":"10.1007\/978-3-319-18117-2_2"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.4018\/IJITWE.2016070103"},{"key":"e_1_3_2_25_2","doi-asserted-by":"crossref","unstructured":"RefaeeE. and RieserV. iLab-Edinburgh at SemEval-2016 Task 7: A Hybrid Approach for Determining Sentiment Intensity of Arabic Twitter Phrases in: Proceedings of the 10th International Workshop on Semantic Evaluation SemEval-2016 SemEval\u201916 San Diego California 2016.","DOI":"10.18653\/v1\/S16-1077"},{"key":"e_1_3_2_26_2","doi-asserted-by":"crossref","unstructured":"AltowayanA. and TaoL. Word Embeddings for Arabic Sentiment Analysis in: IEEE International Conference on Big Data 2016.","DOI":"10.1109\/BigData.2016.7841054"},{"key":"e_1_3_2_27_2","doi-asserted-by":"crossref","unstructured":"Al-AzaniS. and El-AlfyE.-S.M. Hybrid deep learning for sentiment polarity determination of arabic microblogs in: International Conference on Neural Information Processing 2017 pp. 491\u2013500.","DOI":"10.1007\/978-3-319-70096-0_51"},{"key":"e_1_3_2_28_2","unstructured":"AMikolovT. ChenK. CorradoG. and DeanJ. Efficient estimation of word representations in vector space in: Proceedings of Workshop at International Conference on Learning Representations 2013."},{"key":"e_1_3_2_29_2","unstructured":"MikolovT. SutskeverI. ChenK. CorradoG.S. and DeanJ. Distributed representations of words and phrases and their compositionality in: Advances in Neural Information Processing Systems 2013 pp. 3111\u20133119."},{"key":"e_1_3_2_30_2","unstructured":"DahouA. XiongS. ZhouJ. HaddoudM.H. and DuanP. Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification in: Proc. 26th International Conference on Computational Linguistics 2016 pp. 2418\u20132427."},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCC.2011.2161285"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.953"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2008.239"},{"key":"e_1_3_2_34_2","doi-asserted-by":"crossref","unstructured":"HanH. WangW.-Y. and MaoB.-H. Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning in: International Conference on Intelligent Computing Springer 2005 pp. 878\u2013887.","DOI":"10.1007\/11538059_91"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1504\/IJKESDP.2011.039875"},{"key":"e_1_3_2_36_2","doi-asserted-by":"crossref","unstructured":"HeH. BaiY. GarciaE.A. and LiS. ADASYN: Adaptive synthetic sampling approach for imbalanced learning in: IEEE International Joint Conference on Neural Networks 2008 pp. 1322\u20131328.","DOI":"10.1109\/IJCNN.2008.4633969"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1002\/asmb.537"},{"key":"e_1_3_2_38_2","doi-asserted-by":"crossref","unstructured":"DietterichT.G. Ensemble methods in machine learning in: International Workshop on Multiple Classifier Systems Springer 2000 pp. 1\u201315.","DOI":"10.1007\/3-540-45014-9_1"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/34.58871"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_3_2_41_2","doi-asserted-by":"crossref","unstructured":"SalamehM. MohammadS. and KiritchenkoS. Sentiment after translation: A case-study on arabic social media posts in: Proceedings of Conference of North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2015 pp. 767\u2013777.","DOI":"10.3115\/v1\/N15-1078"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.4787"},{"key":"e_1_3_2_43_2","first-page":"2825","article-title":"Scikit-learn: Machine learning in Python","volume":"12","author":"Pedregosa F.","year":"2011","unstructured":"PedregosaF., VaroquauxG., GramfortA., MichelV., ThirionB., GriselO., BlondelM., PrettenhoferP., WeissR., DubourgV., et al., Scikit-learn: Machine learning in Python, Journal of Machine Learning Research12(Oct) (2011), 2825\u20132830.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_44_2","unstructured":"LemaitreG. NogueiraF. AridasC.K. Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning Computing Research Repository (CoRR): 1609.06570 (2016)."},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2005.10.010"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1145\/1007730.1007735"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0118432"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179703","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-179703","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179703","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,3]],"date-time":"2026-02-03T12:57:44Z","timestamp":1770123464000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-179703"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,16]]},"references-count":46,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2020,5,29]]}},"alternative-id":["10.3233\/JIFS-179703"],"URL":"https:\/\/doi.org\/10.3233\/jifs-179703","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,3,16]]}}}