{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,12]],"date-time":"2026-02-12T16:02:45Z","timestamp":1770912165274,"version":"3.50.1"},"reference-count":41,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2023,3,14]],"date-time":"2023-03-14T00:00:00Z","timestamp":1678752000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,3,14]],"date-time":"2023-03-14T00:00:00Z","timestamp":1678752000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Ministry of University (IT) - Prog. Dipartimenti di Eccellenza","award":["1.005.14\/2019"],"award-info":[{"award-number":["1.005.14\/2019"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Adv Data Anal Classif"],"published-print":{"date-parts":[[2024,6]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The Threshold-based Na\u00efve Bayes (Tb-NB) classifier is introduced as a (simple) improved version of the original Na\u00efve Bayes classifier. Tb-NB extracts the sentiment from a Natural Language text corpus and allows the user not only to predict how much a sentence is positive (negative) but also to quantify a sentiment with a numeric value. It is based on the estimation of a single threshold value that concurs to define a decision rule that classifies a text into a positive (negative) opinion based on its content. One of the main advantage deriving from Tb-NB is the possibility to utilize its results as the input of post-hoc analysis aimed at observing how the quality associated to the different dimensions of a product or a service or, in a mirrored fashion, the different dimensions of customer satisfaction evolve in time or change with respect to different locations. The effectiveness of Tb-NB is evaluated analyzing data concerning the tourism industry and, specifically, hotel guests\u2019 reviews from all hotels located in the Sardinian region and available on Booking.com. Moreover, Tb-NB is compared with other popular classifiers used in sentiment analysis in terms of model accuracy, resistance to noise and computational efficiency.<\/jats:p>","DOI":"10.1007\/s11634-023-00536-8","type":"journal-article","created":{"date-parts":[[2023,3,14]],"date-time":"2023-03-14T11:04:34Z","timestamp":1678791874000},"page":"325-361","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Threshold-based Na\u00efve Bayes classifier"],"prefix":"10.1007","volume":"18","author":[{"given":"Maurizio","family":"Romano","sequence":"first","affiliation":[]},{"given":"Giulia","family":"Contu","sequence":"additional","affiliation":[]},{"given":"Francesco","family":"Mola","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2020-5129","authenticated-orcid":false,"given":"Claudio","family":"Conversano","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,3,14]]},"reference":[{"issue":"3","key":"536_CR1","doi-asserted-by":"publisher","first-page":"291","DOI":"10.2307\/3149462","volume":"4","author":"J Arndt","year":"1967","unstructured":"Arndt J (1967) Role of product-related conversations in the diffusion of a new product. J Market Res 4(3):291\u2013295. https:\/\/doi.org\/10.2307\/3149462","journal-title":"J Market Res"},{"key":"536_CR2","unstructured":"Bachtiar FA, Paulina W, Rusydi AN (2020) Text mining for aspect based sentiment analysis on customer review: a case study in the hotel industry. In: Serd\u00fclt U, Loshchilov A, Mahmudy WF, Nurwasito H (eds) Proceedings of the 5th international workshop on innovations in information and communication science and technology (canceled by authorities due to SARS-CoV-2), CEUR workshop proceedings, vol 2627, pp 105\u2013112, Malang, Indonesia, CEUR-WS.org"},{"issue":"5","key":"536_CR3","doi-asserted-by":"publisher","first-page":"662","DOI":"10.1080\/1369118X.2012.678878","volume":"15","author":"D Boyd","year":"2012","unstructured":"Boyd D, Crawford K (2012) Critical questions for big data: provocations for a cultural, technological, and scholarly phenomenon. Inf Commun Soc 15(5):662\u2013679. https:\/\/doi.org\/10.1080\/1369118X.2012.678878","journal-title":"Inf Commun Soc"},{"key":"536_CR4","doi-asserted-by":"crossref","unstructured":"Brownlee J (2017) Deep learning for natural language processing: develop deep learning models for your natural language problems. In: Machine learning mastery, 1.7 edition","DOI":"10.1007\/978-1-4842-3733-5_1"},{"issue":"3","key":"536_CR5","doi-asserted-by":"publisher","first-page":"241","DOI":"10.1080\/096525498346658","volume":"6","author":"FA Buttle","year":"1998","unstructured":"Buttle FA (1998) Word of mouth: understanding and managing referral marketing. J Strateg Market 6(3):241\u2013254. https:\/\/doi.org\/10.1080\/096525498346658","journal-title":"J Strateg Market"},{"key":"536_CR6","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1017\/S1351324920000534","volume":"12","author":"C Chai","year":"2019","unstructured":"Chai C (2019) Text mining in survey data. Surv Pract 12:1\u201313. https:\/\/doi.org\/10.1017\/S1351324920000534","journal-title":"Surv Pract"},{"key":"536_CR7","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1016\/j.inffus.2017.12.006","volume":"44","author":"I Chaturvedi","year":"2018","unstructured":"Chaturvedi I, Cambria E, Welsch RE, Herrera F (2018) Distinguishing between facts and opinions for sentiment analysis: survey and challenges. Inf Fusion 44:65\u201377. https:\/\/doi.org\/10.1016\/j.inffus.2017.12.006","journal-title":"Inf Fusion"},{"key":"536_CR8","doi-asserted-by":"publisher","DOI":"10.1186\/s12864-019-6413-7","author":"D Chicco","year":"2020","unstructured":"Chicco D, Jurman G (2020) The advantages of the Matthews correlation coefficient (mcc) over f1 score and accuracy in binary classification evaluation. BMC Genom. https:\/\/doi.org\/10.1186\/s12864-019-6413-7","journal-title":"BMC Genom"},{"issue":"13","key":"536_CR9","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13040-021-00244-z","volume":"14","author":"D Chicco","year":"2021","unstructured":"Chicco D, T\u00f6tsch N, Jurman G (2021) The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation. BioData Min 14(13):1\u201322. https:\/\/doi.org\/10.1186\/s13040-021-00244-z","journal-title":"BioData Min"},{"key":"536_CR10","unstructured":"Esuli A, Sebastiani F (2006) Determining term subjectivity and term orientation for opinion mining. In: 11th conference of the European chapter of the association for computational linguistics, pp 193\u2013200, Trento, Italy, Association for Computational Linguistics. ISBN 1-932432-59-0"},{"issue":"1","key":"536_CR11","doi-asserted-by":"publisher","first-page":"1","DOI":"10.2200\/S00762ED1V01Y201703HLT037","volume":"10","author":"Y Goldberg","year":"2017","unstructured":"Goldberg Y (2017) Neural network methods in natural language processing. Synth Lect Hum Lang Technol 10(1):1\u2013309. https:\/\/doi.org\/10.2200\/S00762ED1V01Y201703HLT037","journal-title":"Synth Lect Hum Lang Technol"},{"issue":"2","key":"536_CR12","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1109\/MIS.2009.36","volume":"24","author":"A Halevy","year":"2009","unstructured":"Halevy A, Norvig P, Pereira F (2009) The unreasonable effectiveness of data. IEEE Intell Syst 24(2):8\u201312. https:\/\/doi.org\/10.1109\/MIS.2009.36","journal-title":"IEEE Intell Syst"},{"issue":"1","key":"536_CR13","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1177\/109467050141006","volume":"4","author":"LJ Harrison-Walker","year":"2001","unstructured":"Harrison-Walker LJ (2001) The measurement of word-of-mouth communication and an investigation of service quality and customer commitment as potential antecedents. J Serv Res 4(1):60\u201375. https:\/\/doi.org\/10.1177\/109467050141006","journal-title":"J Serv Res"},{"issue":"3","key":"536_CR14","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1016\/0148-2963(95)00126-3","volume":"35","author":"MD Hartline","year":"1996","unstructured":"Hartline MD, Jones KC (1996) Employee performance cues in a hotel service environment: influence on perceived service quality, value, and word-of-mouth intentions. J Bus Res 35(3):207\u2013215. https:\/\/doi.org\/10.1016\/0148-2963(95)00126-3","journal-title":"J Bus Res"},{"key":"536_CR15","doi-asserted-by":"publisher","unstructured":"Huang J, Lu J, Ling C (2003) Comparing Naive Bayes, decision trees, and svm with auc and accuracy. In: Third IEEE international conference on data mining, pp 553\u2013556. https:\/\/doi.org\/10.1109\/ICDM.2003.1250975","DOI":"10.1109\/ICDM.2003.1250975"},{"key":"536_CR16","doi-asserted-by":"publisher","DOI":"10.1016\/j.cosrev.2021.100413","volume":"41","author":"PK Jain","year":"2021","unstructured":"Jain PK, Pamula R, Srivastava G (2021) A systematic literature review on machine learning applications for consumer sentiment analysis using online reviews. Comput Sci Rev 41:100413. https:\/\/doi.org\/10.1016\/j.cosrev.2021.100413","journal-title":"Comput Sci Rev"},{"key":"536_CR17","doi-asserted-by":"publisher","unstructured":"Janowicz-Lomott M, \u0141yskawa K, Polychronidou P, Karasavvoglou A (eds) (2018) Economic and financial challenges for Balkan and eastern European countries. In: Proceedings of the 10th international conference on the economies of the Balkan and Eastern European Countries in the Changing World (EBEEC) in Warsaw, Poland Springer proceedings in business and economics. Springer, Cham, 2020. ISBN 978-3-030-39926-9 978-3-030-39927-6. https:\/\/doi.org\/10.1007\/978-3-030-39927-6","DOI":"10.1007\/978-3-030-39927-6"},{"issue":"43\u201344","key":"536_CR18","doi-asserted-by":"publisher","first-page":"32749","DOI":"10.1007\/s11042-020-09512-2","volume":"79","author":"AH Khan","year":"2020","unstructured":"Khan AH, Zubair M (2020) Classification of multi-lingual tweets, into multi-class model using Na\u00efve Bayes and semi-supervised learning. Multimed Tools Appl 79(43\u201344):32749\u201332767. https:\/\/doi.org\/10.1007\/s11042-020-09512-2","journal-title":"Multimed Tools Appl"},{"issue":"11\/12","key":"536_CR19","doi-asserted-by":"publisher","first-page":"1475","DOI":"10.1108\/03090560710821260","volume":"41","author":"T Mazzarol","year":"2007","unstructured":"Mazzarol T, Sweeney JC, Soutar GN (2007) Conceptualizing word-of-mouth activity, triggers and conditions: an exploratory study. Eur J Market 41(11\/12):1475\u20131494. https:\/\/doi.org\/10.1108\/03090560710821260","journal-title":"Eur J Market"},{"key":"536_CR20","unstructured":"Meyer D, Dimitriadou E, Hornik K, Weingessel A, Leisch F (2019) E1071: misc functions of the department of statistics, probability theory group (Formerly: E1071), TU Wien"},{"key":"536_CR21","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1007\/s10115-019-01410-w","volume":"27","author":"R Morante","year":"2021","unstructured":"Morante R, Blanco E (2021) Recent advances in processing negation. Nat Lang Eng 27:121\u2013130. https:\/\/doi.org\/10.1007\/s10115-019-01410-w","journal-title":"Nat Lang Eng"},{"key":"536_CR22","doi-asserted-by":"publisher","unstructured":"Narayanan V, Arora I, Bhatia A (2013) Fast and accurate sentiment classification using an enhanced Naive Bayes model. In: Hutchison D, Kanade T, Kittler J et\u00a0al (eds) Intelligent data engineering and automated learning\u2014IDEAL 2013, vol 8206, pp 194\u2013201. Springer, Berlin. ISBN 978-3-642-41277-6 978-3-642-41278-3. https:\/\/doi.org\/10.1007\/978-3-642-41278-3_24","DOI":"10.1007\/978-3-642-41278-3_24"},{"key":"536_CR23","unstructured":"Nielsen (2007) Trust in advertising. A global Nielsen consumer report"},{"issue":"8","key":"536_CR24","doi-asserted-by":"publisher","first-page":"567","DOI":"10.1080\/08839514.2021.1922843","volume":"35","author":"B Noori","year":"2021","unstructured":"Noori B (2021) Classification of customer reviews using machine learning algorithms. Appl Artif Intell 35(8):567\u2013588. https:\/\/doi.org\/10.1080\/08839514.2021.1922843","journal-title":"Appl Artif Intell"},{"issue":"7","key":"536_CR25","doi-asserted-by":"publisher","first-page":"754","DOI":"10.1080\/19368623.2010.508007","volume":"19","author":"P O\u2019Connor","year":"2010","unstructured":"O\u2019Connor P (2010) Managing a hotel\u2019s image on TripAdvisor. J Hosp Market Manag 19(7):754\u2013772. https:\/\/doi.org\/10.1080\/19368623.2010.508007","journal-title":"J Hosp Market Manag"},{"issue":"1\u20132","key":"536_CR26","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1561\/1500000011","volume":"2","author":"B Pang","year":"2008","unstructured":"Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2(1\u20132):1\u2013135. https:\/\/doi.org\/10.1561\/1500000011","journal-title":"Found Trends Inf Retr"},{"key":"536_CR27","volume-title":"Creating brand advocates","author":"S Rusticus","year":"2007","unstructured":"Rusticus S (2007) Creating brand advocates. Justin Kirby and Paul Marsden, Oxford"},{"issue":"1","key":"536_CR28","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1007\/s13278-020-00656-5","volume":"10","author":"G Santos","year":"2020","unstructured":"Santos G, Mota VFS, Benevenuto F, Silva TH (2020) Neutrality may matter: sentiment analysis in reviews of Airbnb, Booking, and Couchsurfing in Brazil and USA. Soc Netw Anal Min 10(1):45. https:\/\/doi.org\/10.1007\/s13278-020-00656-5","journal-title":"Soc Netw Anal Min"},{"key":"536_CR29","doi-asserted-by":"publisher","unstructured":"Schmunk S, H\u00f6pken W, Fuchs M, Lexhagen M (2013) Sentiment analysis: extracting decision-relevant knowledge from UGC. In: Xiang Z, Tussyadiah I (eds) Information and communication technologies in tourism 2014. Springer, Cham, pp 253\u2013265. ISBN 978-3-319-03972-5 978-3-319-03973-2. https:\/\/doi.org\/10.1007\/978-3-319-03973-2_19","DOI":"10.1007\/978-3-319-03973-2_19"},{"key":"536_CR30","doi-asserted-by":"publisher","first-page":"143","DOI":"10.1016\/j.ijhm.2014.12.007","volume":"48","author":"M Schuckert","year":"2015","unstructured":"Schuckert M, Liu X, Law R (2015) A segmentation of online reviews by language groups: how English and Non-English speakers rate hotels differently. Int J Hosp Manag 48:143\u2013149. https:\/\/doi.org\/10.1016\/j.ijhm.2014.12.007","journal-title":"Int J Hosp Manag"},{"key":"536_CR31","unstructured":"S\u0131rma E (2009) Word-of-mouth marketing from a global perspective. Ph.D. thesis, Instituto Universit\u00e0rio de Lisboa,"},{"key":"536_CR32","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.tourman.2013.03.007","volume":"39","author":"BA Sparks","year":"2013","unstructured":"Sparks BA, Perkins HE, Buckley R (2013) Online travel reviews as persuasive communication: the effects of content type, source, and certification logos on consumer behavior. Tour Manag 39:1\u20139. https:\/\/doi.org\/10.1016\/j.tourman.2013.03.007","journal-title":"Tour Manag"},{"key":"536_CR33","doi-asserted-by":"publisher","first-page":"1847","DOI":"10.1007\/s10115-019-01410-w","volume":"62","author":"F Tavazoee","year":"2020","unstructured":"Tavazoee F, Conversano C, Mola F (2020) Recurrent random forest for the assessment of popularity in social media. Knowl Inf Syst 62:1847\u20131879. https:\/\/doi.org\/10.1007\/s10115-019-01410-w","journal-title":"Knowl Inf Syst"},{"key":"536_CR34","doi-asserted-by":"publisher","unstructured":"Weihs C, Ligges U, Luebke K, Raabe N (2005) klaR analyzing German business cycles. In: Baier D, Decker R, Schmidt-Thieme L (eds) Data analysis and decision support. Springer, Berlin, pp 335\u2013343. ISBN 978-3-540-26007-3. https:\/\/doi.org\/10.1007\/3-540-28397-8_36","DOI":"10.1007\/3-540-28397-8_36"},{"key":"536_CR35","doi-asserted-by":"publisher","unstructured":"Wiebe JM, Bruce RF, O\u2019Hara TP (1999) Development and use of a gold-standard data set for subjectivity classifications. In: Proceedings of the 37th annual meeting of the association for computational linguistics, College Park, Maryland, USA. Association for Computational Linguistics, pp 246\u2013253. https:\/\/doi.org\/10.3115\/1034678.1034721","DOI":"10.3115\/1034678.1034721"},{"issue":"5","key":"536_CR36","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2020.102221","volume":"57","author":"F Xu","year":"2020","unstructured":"Xu F, Pan Z, Xia R (2020) E-commerce product review sentiment classification based on a Na\u00efve Bayes continuous learning framework. Inf Process Manag 57(5):102221. https:\/\/doi.org\/10.1016\/j.ipm.2020.102221","journal-title":"Inf Process Manag"},{"key":"536_CR37","doi-asserted-by":"publisher","unstructured":"Yang P, Chen Y (2017) A survey on sentiment analysis by using machine learning methods. In: 2017 IEEE 2nd information technology, networking, electronic and automation control conference (ITNEC), pp 117\u2013121. https:\/\/doi.org\/10.1109\/ITNEC.2017.8284920","DOI":"10.1109\/ITNEC.2017.8284920"},{"issue":"C","key":"536_CR38","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1016\/j.tourman.2016.03.021","volume":"56","author":"Y Yang","year":"2016","unstructured":"Yang Y, Mueller NJ, Croes RR (2016) Market accessibility and hotel prices in the Caribbean: the moderating effect of quality-signaling factors. Tour Manag 56(C):40\u201351","journal-title":"Tour Manag"},{"issue":"3","key":"536_CR39","doi-asserted-by":"publisher","first-page":"6527","DOI":"10.1016\/j.eswa.2008.07.035","volume":"36","author":"Q Ye","year":"2009","unstructured":"Ye Q, Zhang Z, Law R (2009) Sentiment classification of online reviews to travel destinations by supervised machine learning approaches. Expert Syst Appl 36(3):6527\u20136535. https:\/\/doi.org\/10.1016\/j.eswa.2008.07.035","journal-title":"Expert Syst Appl"},{"issue":"3","key":"536_CR40","doi-asserted-by":"publisher","first-page":"671","DOI":"10.1109\/TASLP.2017.2788182","volume":"26","author":"L-C Yu","year":"2018","unstructured":"Yu L-C, Wang J, Lai KR, Zhang X (2018) Refining word embeddings using intensity scores for sentiment analysis. IEEE\/ACM Trans Audio Speech Lang Process 26(3):671\u2013681. https:\/\/doi.org\/10.1109\/TASLP.2017.2788182","journal-title":"IEEE\/ACM Trans Audio Speech Lang Process"},{"issue":"8","key":"536_CR41","doi-asserted-by":"publisher","first-page":"5713","DOI":"10.1007\/s00500-019-04300-z","volume":"24","author":"Y-H Yuan","year":"2020","unstructured":"Yuan Y-H, Tsao S-H, Chyou J-T, Tsai S-B (2020) An empirical study on effects of electronic word-of-mouth and Internet risk avoidance on purchase intention: from the perspective of big data. Soft Comput 24(8):5713\u20135728. https:\/\/doi.org\/10.1007\/s00500-019-04300-z","journal-title":"Soft Comput"}],"container-title":["Advances in Data Analysis and Classification"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-023-00536-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11634-023-00536-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-023-00536-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,19]],"date-time":"2024-06-19T08:18:49Z","timestamp":1718785129000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11634-023-00536-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,14]]},"references-count":41,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2024,6]]}},"alternative-id":["536"],"URL":"https:\/\/doi.org\/10.1007\/s11634-023-00536-8","relation":{},"ISSN":["1862-5347","1862-5355"],"issn-type":[{"value":"1862-5347","type":"print"},{"value":"1862-5355","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,14]]},"assertion":[{"value":"30 September 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 February 2023","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 February 2023","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 March 2023","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}