{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T17:49:10Z","timestamp":1770745750035,"version":"3.49.0"},"reference-count":30,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2022,10,9]],"date-time":"2022-10-09T00:00:00Z","timestamp":1665273600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>With the increasing popularity of Twitter as both a social media platform and a data source for companies, decision makers, advertisers, and even researchers alike, data have been so massive that manual labeling is no longer feasible. This research uses a semi-supervised approach to sentiment analysis of both English and Tagalog tweets using a base classifier. In this study involving the Philippines, where social media played a central role in the campaign of both candidates, the tweets during the widely contested race between the son of the Philippines\u2019 former President and Dictator, and the outgoing Vice President of the Philippines were used. Using Natural Language Processing techniques, these tweets were annotated, processed, and trained to classify both English and Tagalog tweets into three polarities: positive, neutral, and negative. Through the Self-Training with Multinomial Na\u00efve Bayes as base classifier with 30% unlabeled data, the results yielded an accuracy of 84.83%, which outweighs other studies using Twitter data from the Philippines.<\/jats:p>","DOI":"10.3390\/info13100484","type":"journal-article","created":{"date-parts":[[2022,10,9]],"date-time":"2022-10-09T20:49:31Z","timestamp":1665348571000},"page":"484","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":25,"title":["A Semi-Supervised Approach to Sentiment Analysis of Tweets during the 2022 Philippine Presidential Election"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2738-4932","authenticated-orcid":false,"given":"Julio Jerison E.","family":"Macrohon","sequence":"first","affiliation":[{"name":"Department of Information Engineering, I-Shou University, Kaohsiung City 84001, Taiwan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6905-8727","authenticated-orcid":false,"given":"Charlyn Nayve","family":"Villavicencio","sequence":"additional","affiliation":[{"name":"Department of Information Engineering, I-Shou University, Kaohsiung City 84001, Taiwan"},{"name":"College of Information and Communications Technology, Bulacan State University, Bulacan 3000, Philippines"}]},{"given":"X. Alphonse","family":"Inbaraj","sequence":"additional","affiliation":[{"name":"Department of Information Engineering, I-Shou University, Kaohsiung City 84001, Taiwan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0026-8333","authenticated-orcid":false,"given":"Jyh-Horng","family":"Jeng","sequence":"additional","affiliation":[{"name":"Department of Information Engineering, I-Shou University, Kaohsiung City 84001, Taiwan"}]}],"member":"1968","published-online":{"date-parts":[[2022,10,9]]},"reference":[{"key":"ref_1","unstructured":"Official Gazette of the Republic of the Philippines (2022, June 10). The Constitution of the Republic of the Philippines, Available online: https:\/\/www.officialgazette.gov.ph\/constitutions\/1987-constitution\/."},{"key":"ref_2","unstructured":"Palatino, M. (2022, June 10). Why the Fight for the Philippines Vice Presidency Matters. The Diplomat, 3 December 2021. Available online: https:\/\/thediplomat.com\/2021\/12\/why-the-fight-for-the-philippines-vice-presidency-matters\/."},{"key":"ref_3","unstructured":"Baclig, C.E. (2022, June 10). TIMELINE: The 4-Year Robredo-Marcos POLL case, INQUIRER.NET, 16 February 2021. Available online: https:\/\/newsinfo.inquirer.net\/1396547\/the-4-year-robredo-marcos-vice-presidential-case."},{"key":"ref_4","unstructured":"DW (2022, June 10). Philippines: Marcos Jr. Wins Presidential Election Landslide, 9 May 2022. Available online: https:\/\/www.dw.com\/en\/philippines-marcos-jr-wins-presidential-election-landslide\/a-61727645."},{"key":"ref_5","unstructured":"Mendoza, D.J. (2022, June 10). Reflections on the Philippine Presidential Race, 9 May 2022. Available online: https:\/\/www.bworldonline.com\/opinion\/2022\/05\/09\/447235\/reflections-on-the-philippine-presidential-race\/."},{"key":"ref_6","unstructured":"StatCounter (2022, July 23). Social Media Stats Philippines, June 2022. Available online: https:\/\/gs.statcounter.com\/social-media-stats\/all\/philippines."},{"key":"ref_7","unstructured":"Mateo, J. (2022, July 23). Philippines Still World\u2019s Social Media Capital\u2013sudy. The Philippine Star, 3 February 2018. Available online: https:\/\/www.philstar.com\/headlines\/2018\/02\/03\/1784052\/philippines-still-worlds-social-media-capital-study\/amp\/."},{"key":"ref_8","unstructured":"Investopedia (2022, June 10). Web 2.0 and Web 3.0, 21 May 2022. Available online: https:\/\/www.investopedia.com\/web-20-web-30-5208698."},{"key":"ref_9","unstructured":"Hubspot (2022, June 10). What Is Twitter and How Does It Work? 29 January 2019. Available online: https:\/\/blog.hubspot.com\/marketing\/what-is-twitter."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.procs.2018.08.183","article-title":"On predicting elections with hybrid topic based sentiment analysis of tweets","volume":"135","author":"Bansala","year":"2018","journal-title":"Procedia Comput. Sci."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1016\/j.procs.2020.06.038","article-title":"Sentimental Analysis of Twitter Data with respect to General Elections in India","volume":"173","author":"Sharma","year":"2020","journal-title":"Procedia Comput. Sci."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"101847","DOI":"10.1109\/ACCESS.2021.3097492","article-title":"Sentiment Analysis of Political Tweets From the 2019 Spanish Elections","volume":"9","year":"2021","journal-title":"IEEE Access"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Yaqub, U., Malik, M.A., and Zaman, S. (2020, January 5\u20137). Sentiment Analysis of Russian IRA Troll Messages on Twitter during US Presidential Elections of 2016. Proceedings of the 7th International Conference on Behavioural and Social Computing (BESC), Bournemouth, UK.","DOI":"10.1109\/BESC51023.2020.9348287"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1504\/IJWBC.2019.098693","article-title":"Lexicon-based Twitter sentiment analysis for vote share prediction using emoji and N-gram features","volume":"15","author":"Bansal","year":"2019","journal-title":"Int. J. Web Based Commun."},{"key":"ref_15","unstructured":"Praciano, B.J.G., da Costa, J.P.C.L., Maranh\u00e3o, J.P.A., de Mendon\u00e7a, F.L.L., Junior, R.T.d., and Prettz, J.B. (2018, January 17\u201320). Spatio-Temporal Trend Analysis of the Brazilian Elections based on Twitter Data. Proceedings of the IEEE International Conference on Data Mining Workshops, Singapore."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"585","DOI":"10.3390\/ijgi10090585","article-title":"#AllforJan: How Twitter Users in Europe Reacted to the Murder of J\u00e1n Kuciak\u2014Revealing Spatiotemporal Patterns through Sentiment Analysis and Topic Modeling","volume":"10","author":"Resch","year":"2021","journal-title":"Int. J. Geo-Inf."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Kramer, J., and Gordon, C. (2014, January 23\u201324). Improvement of a Naive Bayes Sentiment Classifier Using MRS-Based Features. Proceedings of the Third Joint Conference on Lexical and Computational Semantics, Dublin, Ireland.","DOI":"10.3115\/v1\/S14-1003"},{"key":"ref_18","unstructured":"Herrera, M., Aich, A., and Parde, N. (2022, January 20\u201325). TweetTaglish: A Dataset for Investigating Tagalog-English Code-Switching. Proceedings of the 13th Language Resources and Evaluation Conference, Palais du Pharo, Marseille, France."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Villavicencio, C., Macrohon, J.J., Inbaraj, X.A., Jeng, J.-H., and Hsieh, J.-G. (2021). Twitter Sentiment Analysis towards COVID-19 Vaccines in the Philippines Using Na\u00efve Bayes. Information, 12.","DOI":"10.3390\/info12050204"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Samonte, M.J.C., Garcia, J.M.R., Lucero, V.J.L., and Santos, S.C.B. (2017, January 24\u201329). Sentiment and opinion analysis on Twitter about local airlines. Proceedings of the ICCIP \u201917: Proceedings of the 3rd International Conference on Communication and Information Processing, Tokyo, Japan.","DOI":"10.1145\/3162957.3163029"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"408","DOI":"10.30534\/ijatcse\/2020\/6491.32020","article-title":"Philippine Twitter Sentiments during COVID-19 Pandemic using Multinomial Na\u00efve-Bayes","volume":"9","author":"Delizo","year":"2020","journal-title":"Int. J. Adv. Trends Comput. Sci. Eng."},{"key":"ref_22","unstructured":"Lutkevich, B. (2021). Natural Language Processing (NLP), TechTarget. Available online: https:\/\/www.techtarget.com\/searchenterpriseai\/definition\/natural-language-processing-NLP."},{"key":"ref_23","unstructured":"Harshith (2022, June 10). Text Preprocessing in Natural Language Processing, Towards Data Science, 21 November 2019. Available online: https:\/\/towardsdatascience.com\/text-preprocessing-in-natural-language-processing-using-python-6113ff5decd8."},{"key":"ref_24","unstructured":"StackOverflow (2022, June 10). How nltk.TweetTokenizer Different from nltk.word_tokenize?. Available online: https:\/\/stackoverflow.com\/questions\/61919670\/how-nltk-tweettokenizer-different-from-nltk-word-tokenize."},{"key":"ref_25","unstructured":"Beri, A. (2022, June 10). Stemming vs. Lemmatization, Towards Data Science, 14 May 2020. Available online: https:\/\/towardsdatascience.com\/stemming-vs-lemmatization-2daddabcb221."},{"key":"ref_26","unstructured":"Borisov, O. (2022, June 10). Text Generation Using N-Gram Model, Towards Data Science, 28 October 2020. Available online: https:\/\/towardsdatascience.com\/text-generation-using-n-gram-model-8d12d9802aa0."},{"key":"ref_27","unstructured":"Clarke, M. (2022, June 10). How to Use CountVectorizer for n-Gram Analysis, Practical Data Science, 24 December 2021. Available online: https:\/\/practicaldatascience.co.uk\/machine-learning\/how-to-use-count-vectorization-for-n-gram-analysis."},{"key":"ref_28","unstructured":"Sci-Kit Learn (2022, June 10). Sklearn.feature_extraction.text.TfidfTransformer. Available online: https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.feature_extraction.text.TfidfTransformer.html."},{"key":"ref_29","unstructured":"Sci-Kit Learn (2022, June 10). 1.9. Naive Bayes. Available online: https:\/\/scikit-learn.org\/stable\/modules\/naive_bayes.html#multinomial-naive-bayes."},{"key":"ref_30","unstructured":"Standford University (2009). Naive Bayes Text Classification, Cambridge University Press. Available online: https:\/\/nlp.stanford.edu\/IR-book\/html\/htmledition\/naive-bayes-text-classification-1.html."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/13\/10\/484\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:48:38Z","timestamp":1760143718000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/13\/10\/484"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,9]]},"references-count":30,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2022,10]]}},"alternative-id":["info13100484"],"URL":"https:\/\/doi.org\/10.3390\/info13100484","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,10,9]]}}}