{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,5]],"date-time":"2026-01-05T11:05:14Z","timestamp":1767611114156,"version":"build-2065373602"},"reference-count":55,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2024,6,10]],"date-time":"2024-06-10T00:00:00Z","timestamp":1717977600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"FONDECYT","award":["1200525","UIDB\/00013\/2020","UIDP\/00013\/2020"],"award-info":[{"award-number":["1200525","UIDB\/00013\/2020","UIDP\/00013\/2020"]}]},{"name":"Portuguese funds through the CMAT\u2013Research Centre of Mathematics of the University of Minho","award":["1200525","UIDB\/00013\/2020","UIDP\/00013\/2020"],"award-info":[{"award-number":["1200525","UIDB\/00013\/2020","UIDP\/00013\/2020"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JTAER"],"abstract":"<jats:p>Sentiment analysis is a cornerstone of natural language processing. However, it presents formidable challenges due to the intricacies of lexical diversity, complex linguistic structures, and the subtleties of context dependence. This study introduces a bespoke and integrated approach to analyzing customer sentiment, with a particular emphasis on a case study in the Portuguese retail market. Capitalizing on the strengths of SentiLex-PT, a sentiment lexicon curated for the Portuguese language, and an array of sophisticated machine learning algorithms, this research constructs advanced models that encapsulate both lexical features and the subtleties of linguistic composition. A meticulous comparative analysis singles out multinomial logistic regression as the pre-eminent model for its applicability and accuracy within our case study. The findings of this analysis highlight the pivotal role that sentiment data play in strategic decision-making processes such as reputation management, strategic planning, and forecasting market trends within the retail sector. To the extent of our knowledge, this work is pioneering in its provision of a holistic sentiment analysis framework tailored to the Portuguese retail context, marking an advancement for both the academic field and industry application.<\/jats:p>","DOI":"10.3390\/jtaer19020074","type":"journal-article","created":{"date-parts":[[2024,6,10]],"date-time":"2024-06-10T10:45:56Z","timestamp":1718016356000},"page":"1493-1516","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Optimizing Sentiment Analysis Models for Customer Support: Methodology and Case Study in the Portuguese Retail Sector"],"prefix":"10.3390","volume":"19","author":[{"given":"Catarina","family":"Almeida","sequence":"first","affiliation":[{"name":"Departament of Mathematics, Universidade do Minho, 4710-057 Braga, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9897-8186","authenticated-orcid":false,"given":"Cecilia","family":"Castro","sequence":"additional","affiliation":[{"name":"Centre of Mathematics, Universidade do Minho, 4710-057 Braga, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4755-3270","authenticated-orcid":false,"given":"V\u00edctor","family":"Leiva","sequence":"additional","affiliation":[{"name":"School of Industrial Engineering, Pontificia Universidad Cat\u00f3lica de Valpara\u00edso, Valpara\u00edso 2362807, Chile"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1991-9418","authenticated-orcid":false,"given":"Ana Cristina","family":"Braga","sequence":"additional","affiliation":[{"name":"ALGORITMI Research Centre, LASI, Universidade do Minho, 4710-057 Braga, Portugal"}]},{"given":"Ana","family":"Freitas","sequence":"additional","affiliation":[{"name":"MC Sonae, Senhora da Hora, 4460-503 Matosinhos, Portugal"}]}],"member":"1968","published-online":{"date-parts":[[2024,6,10]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"2257","DOI":"10.3390\/jtaer18040114","article-title":"Are eBay\u2019s feedback ratings consistent with the sentiments embedded in textual comments? An empirical study","volume":"18","author":"Zhang","year":"2023","journal-title":"J. Theor. Appl. Electron. Commer. Res."},{"key":"ref_2","unstructured":"Jurafsky, D., and Martin, J.H. (2009). Speech and Language Processing, Pearson."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2188","DOI":"10.3390\/jtaer18040110","article-title":"A brief survey of machine learning and deep learning techniques for e-commerce research","volume":"18","author":"Zhang","year":"2023","journal-title":"J. Theor. Appl. Electron. Commer. Res."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Cavalcante, T., Ospina, R., Leiva, V., Martin-Barreiro, C., and Cabezas, X. (2023). Weibull regression and machine learning survival models: Methodology, comparison, and application to biomedical data related to cardiac surgery. Biology, 11.","DOI":"10.3390\/biology12030442"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1007\/s10260-010-0135-y","article-title":"Influence diagnostics in the tobit censored response model","volume":"19","author":"Barros","year":"2010","journal-title":"Stat. Methods Appl."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"280","DOI":"10.1016\/j.ins.2022.10.076","article-title":"A new approach to data differential privacy based on regression models under heteroscedasticity with applications to machine learning repository data","volume":"627","author":"Manchini","year":"2023","journal-title":"Inf. Sci."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1007\/s00477-022-02307-x","article-title":"Machine learning and automatic ARIMA\/Prophet models-based forecasting of COVID-19: Methodology, evaluation, and case study in SAARC countries","volume":"37","author":"Sardar","year":"2023","journal-title":"Stoch. Environ. Res. Risk Assess."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"103038","DOI":"10.1016\/j.jretconser.2022.103038","article-title":"A novel data-driven weighted sentiment analysis based on information entropy for perceived satisfaction","volume":"68","author":"Wang","year":"2022","journal-title":"J. Retail. Consum. Serv."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1002\/asmb.2556","article-title":"Birnbaum-Saunders quantile regression and its diagnostics with application to economic data","volume":"37","author":"Sanchez","year":"2021","journal-title":"Appl. Stoch. Model. Bus. Ind."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1016\/j.eswa.2020.113234","article-title":"Aspect-based sentiment analysis using adaptive aspect-based lexicons","volume":"148","author":"Mowlaei","year":"2020","journal-title":"Expert Syst. Appl."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"12801","DOI":"10.1007\/s11227-021-03709-4","article-title":"Intelligent sentinet-based lexicon for context-aware sentiment analysis: Optimized neural network for sentiment classification on social media","volume":"77","author":"Kumar","year":"2021","journal-title":"J. Supercomput."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"2031","DOI":"10.1007\/s00296-020-04710-5","article-title":"Covid4Rheum: An analytical twitter study in the time of the COVID-19 pandemic","volume":"40","author":"Ruffer","year":"2020","journal-title":"Rheumatol. Int."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"e18897","DOI":"10.2196\/18897","article-title":"Conversations and medical news frames on Twitter: Infodemiological study on COVID-19 in South Korea","volume":"22","author":"Park","year":"2020","journal-title":"J. Med. Internet Res."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"4873","DOI":"10.1007\/s10462-021-10030-2","article-title":"Over a decade of social opinion mining: A systematic review","volume":"54","author":"Cortis","year":"2021","journal-title":"Artif. Intell. Rev."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"370","DOI":"10.1007\/s41347-020-00174-3","article-title":"Role of emotion in excessive use of twitter during COVID-19 imposed lockdown in India","volume":"6","author":"Arora","year":"2021","journal-title":"J. Technol. Behav. Sci."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1186\/s13673-019-0189-2","article-title":"Detection and classification of social media-based extremist affiliations using sentiment analysis techniques","volume":"9","author":"Ahmad","year":"2019","journal-title":"Hum.-Centric Comput. Inf. Sci."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"6343","DOI":"10.1007\/s10462-021-09955-5","article-title":"Mining and classifying customer reviews: A survey","volume":"54","author":"Subhashini","year":"2021","journal-title":"Artif. Intell. Rev."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"5731","DOI":"10.1007\/s10462-022-10144-1","article-title":"A survey on sentiment analysis methods, applications, and challenges","volume":"55","author":"Wankhade","year":"2022","journal-title":"Artif. Intell. Rev."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"103543","DOI":"10.1016\/j.jretconser.2023.103543","article-title":"Qualitative analysis of customer behavior in the retail industry during the COVID-19 pandemic: A word-cloud and sentiment analysis approach","volume":"75","author":"Stanca","year":"2023","journal-title":"J. Retail. Consum. Serv."},{"key":"ref_20","first-page":"23","article-title":"Machine learning techniques as an efficient alternative diagnostic tool for COVID-19 cases","volume":"18","author":"Bustos","year":"2022","journal-title":"Signa Vitae"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1087","DOI":"10.1007\/s10462-020-09870-1","article-title":"A survey of sentiment analysis in the Portuguese language","volume":"54","author":"Pereira","year":"2021","journal-title":"Artif. Intell. Rev."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1007\/s10579-023-09646-3","article-title":"Automatic generation of creative text in Portuguese: An overview","volume":"58","year":"2024","journal-title":"Lang. Resour. Eval."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Chaouch, H., Charfeddine, S., Aoun, S.B., Jerbi, H., and Leiva, V. (2022). Multiscale monitoring using machine learning methods: New methodology and an industrial application to a photovoltaic system. Mathematics, 10.","DOI":"10.3390\/math10060890"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Nor, A.K.M., Pedapati, S.R., Muhammad, M., and Leiva, V. (2022). Abnormality detection and failure prediction using explainable Bayesian deep learning: Methodology and case study with industrial data. Mathematics, 10.","DOI":"10.20944\/preprints202109.0034.v3"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"116143","DOI":"10.1016\/j.eswa.2021.116143","article-title":"A new clustering algorithm based on a radar scanning strategy with applications to machine learning data","volume":"191","author":"Ma","year":"2022","journal-title":"Expert Syst. Appl."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/j.knosys.2019.105353","article-title":"An efficient hybrid filter and evolutionary wrapper approach for sentiment analysis of various topics on Twitter","volume":"192","author":"Hassonah","year":"2020","journal-title":"Knowl.-Based Syst."},{"key":"ref_27","unstructured":"Chang, J.R., Liang, H.Y., Chen, L.S., and Chang, C.W. (2024). Novel feature selection approaches for improving the performance of sentiment classification. J. Ambient. Intell. Humaniz. Comput., in press."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Taylan, O., Alkabaa, A.S., Alqabbaa, H.S., and Pamukcu, E. (2023). Early prediction in classification of cardiovascular diseases with machine learning, neuro-fuzzy and statistical methods. Biology, 12.","DOI":"10.3390\/biology12010117"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"823","DOI":"10.1177\/0165551510388123","article-title":"Aspect-based sentiment analysis of movie reviews on discussion boards","volume":"36","author":"Thet","year":"2010","journal-title":"J. Inf. Sci."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Tan, K.L., Lee, C.P., and Lim, K.M. (2023). A Survey of Sentiment Analysis: Approaches, Datasets, and Future Research. Appl. Sci., 13.","DOI":"10.3390\/app13074550"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Sharma, P., Singh, A.K., Leiva, V., Cabezas, X., and Martin-Barreiro, C. (2022). Modern multivariate statistical methods for evaluating the impact of WhatsApp on academic performance: Methodology and case study in India. Appl. Sci., 12.","DOI":"10.3390\/app12126141"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Palacios, C.A., Reyes-Suarez, J.A., Bearzotti, L.A., Leiva, V., and Marchant, C. (2021). Knowledge discovery for higher education student retention based on data mining: Machine learning algorithms and case study in Chile. Entropy, 23.","DOI":"10.3390\/e23040485"},{"key":"ref_33","unstructured":"Rish, I. (2001, January 4\u20136). An empirical study of the naive Bayes classifier. Proceedings of the IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, Seattle, WA, USA."},{"key":"ref_34","first-page":"385","article-title":"Idiot\u2019s Bayes: Not so stupid after all?","volume":"69","author":"Hand","year":"2001","journal-title":"Int. Stat. Rev."},{"key":"ref_35","unstructured":"Mitchell, T.M. (1997). Machine Learning, McGraw Hill."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Lewis, D.D. (1998, January 21\u201323). Naive (Bayes) at forty: The independence assumption in information retrieval. Proceedings of the 10th European Conference on Machine Learning, Chemnitz, Germany.","DOI":"10.1007\/BFb0026666"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Agresti, A. (2010). Analysis of Ordinal Categorical Data, Wiley.","DOI":"10.1002\/9780470594001"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Hosmer, D., Lemeshow, S., and Sturdivant, R. (2013). Applied Logistic Regression, Wiley.","DOI":"10.1002\/9781118548387"},{"key":"ref_39","unstructured":"Cox, D.R., and Snell, E.J. (1989). The Analysis of Binary Data, Chapman and Hall."},{"key":"ref_40","unstructured":"Zarembka, P. (1974). Conditional logit analysis of qualitative choice behavior. Frontiers in Econometrics, Academic Press."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"691","DOI":"10.1093\/biomet\/78.3.691","article-title":"A note on a general definition of the coefficient of determination","volume":"78","author":"Nagelkerke","year":"1991","journal-title":"Biometrika"},{"key":"ref_42","unstructured":"Osborne, J. (2017). Best Practices in Logistic Regression, SAGE Publications."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Pituch, K., and Stevens, J. (2015). Applied Multivariate Statistics for the Social Sciences: Analyses with SAS and IBM\u2019s SPSS, Taylor and Francis.","DOI":"10.4324\/9781315814919"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Hensher, D., and Stopher, P. (2021). Behavioural Travel Modelling, Taylor and Francis.","DOI":"10.4324\/9781003156055"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Ashqar, H., Shaheen, Q., Ashur, S., and Rakha, H. (2021, January 19\u201322). Impact of risk factors on work zone crashes using logistic models and random forest. Proceedings of the IEEE Intelligent Transportation Systems Conference, Indianapolis, IN, USA.","DOI":"10.1109\/ITSC48978.2021.9564405"},{"key":"ref_46","unstructured":"Long, J. (1997). Regression Models for Categorical and Limited Dependent Variables, SAGE Publications."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1111\/j.2517-6161.1980.tb01109.x","article-title":"Regression models for ordinal data","volume":"42","author":"McCullagh","year":"1980","journal-title":"J. R. Stat. Soc. B"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1016\/S1090-3801(98)90018-0","article-title":"Ridit scores for analysis and interpretation of ordinal pain data","volume":"2","author":"Donaldson","year":"1998","journal-title":"Eur. J. Pain"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Vapnik, V. (1995). The Nature of Statistical Learning Theory, Springer.","DOI":"10.1007\/978-1-4757-2440-0"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/BF00994018","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Mach. Learn."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_52","first-page":"199","article-title":"Random forests machine learning","volume":"2","author":"Breiman","year":"2001","journal-title":"J. Clin. Microbiol."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Chen, T., and Guestrin, C. (2016, January 13\u201317). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA.","DOI":"10.1145\/2939672.2939785"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Akritidis, L., and Bozanis, P. (2022, January 17\u201320). How dimensionality reduction affects sentiment analysis NLP tasks: An experimental study. Proceedings of the 18th International Conference on Artificial Intelligence Applications and Innovations, Crete, Greece.","DOI":"10.1007\/978-3-031-08337-2_25"},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"474","DOI":"10.1007\/s42979-023-01913-y","article-title":"Low dimensional text representations for sentiment analysis NLP tasks","volume":"4","author":"Akritidis","year":"2023","journal-title":"SN Comput. Sci."}],"container-title":["Journal of Theoretical and Applied Electronic Commerce Research"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/0718-1876\/19\/2\/74\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T14:56:17Z","timestamp":1760108177000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/0718-1876\/19\/2\/74"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,10]]},"references-count":55,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2024,6]]}},"alternative-id":["jtaer19020074"],"URL":"https:\/\/doi.org\/10.3390\/jtaer19020074","relation":{},"ISSN":["0718-1876"],"issn-type":[{"type":"electronic","value":"0718-1876"}],"subject":[],"published":{"date-parts":[[2024,6,10]]}}}