{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T15:46:06Z","timestamp":1759333566894},"reference-count":64,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2019,4,27]],"date-time":"2019-04-27T00:00:00Z","timestamp":1556323200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,3,18]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Twitter is an online micro-blogging platform through which one can explore the hidden valuable and delightful information about the current context at any point of time, which also serves as a data source to carry out sentiment analysis. In this paper, the sentiments of large amount of tweets generated from Twitter in the form of big data have been analyzed using machine learning algorithms. A multi-tier architecture for sentiment classification is proposed in this paper, which includes modules such as tokenization, data cleaning, preprocessing, stemming, updated lexicon, stopwords and emoticon dictionaries, feature selection and machine learning classifier. Unigram and bigrams have been used as feature extractors together with \u03c72 (Chi-squared) and Singular Value Decomposition for dimensionality reduction together with two model types (Binary and Reg), with four types of scaling methods (No scaling, Standard, Signed and Unsigned) and represented them in three different vector formats (TF-IDF, Binary and Int). Accuracy is considered as the evaluation standard for random forest and bagged trees classification methods. Sentiments were analyzed through tokenization and having several stages of pre-processing and several combinations of feature vectors and classification methods. Through which it was possible to achieve an accuracy of 84.14%. Obtained results conclude that, the proposed scheme gives a better accuracy when compared with existing schemes in the literature.<\/jats:p>","DOI":"10.1093\/comjnl\/bxz038","type":"journal-article","created":{"date-parts":[[2019,3,27]],"date-time":"2019-03-27T15:00:30Z","timestamp":1553698830000},"page":"395-409","source":"Crossref","is-referenced-by-count":10,"title":["A Multiple-Layer Machine Learning Architecture for Improved Accuracy in Sentiment Analysis"],"prefix":"10.1093","volume":"63","author":[{"given":"L B","family":"Shyamasundar","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering, CMR Institute of Technology, Bengaluru, Karnataka 560037, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"P","family":"Jhansi Rani","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering, CMR Institute of Technology, Bengaluru, Karnataka 560037, India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2019,4,27]]},"reference":[{"key":"2020042105454109200_bxz038C1","first-page":"51","article-title":"Twitter news stratification using random forest","volume":"2","author":"Koyande","year":"2015","journal-title":"Int. J. Adv. Electron. Comput. Sci."},{"key":"2020042105454109200_bxz038C2","first-page":"101","article-title":"Classification and regression tree method for forecasting","volume":"5","author":"Muthu Visalatchi","year":"2016","journal-title":"Int. J. Comput. Sci. Mobile Comput."},{"key":"2020042105454109200_bxz038C3","author":"Jianqiang","year":"2016"},{"key":"2020042105454109200_bxz038C4","author":"Nakov","year":"2016"},{"key":"2020042105454109200_bxz038C5","volume-title":"Data Mining: Concepts and Techniques","author":"Han","year":"2006","edition":"2nd ed."},{"key":"2020042105454109200_bxz038C6","doi-asserted-by":"crossref","first-page":"1138","DOI":"10.1016\/j.ins.2010.11.023","article-title":"Ensemble of feature sets and classification algorithms for sentiment classification","volume":"181","author":"Xia","year":"2011","journal-title":"Inf. Sci. (Ny)"},{"key":"2020042105454109200_bxz038C7","first-page":"1","article-title":"Sentiment analysis in multiple languages: feature selection for opinion classification in web forums","volume":"26","author":"Abbasi","year":"2008","journal-title":"ACM Trans. Inform. Syst."},{"key":"2020042105454109200_bxz038C8","author":"Whitehead","year":"2010"},{"key":"2020042105454109200_bxz038C9","author":"Cambria","year":"2013"},{"key":"2020042105454109200_bxz038C10","author":"Cambria","year":"2013"},{"key":"2020042105454109200_bxz038C11","author":"Go","year":"2009"},{"key":"2020042105454109200_bxz038C12","author":"Pang","year":"2002"},{"key":"2020042105454109200_bxz038C13","author":"Bora","year":"2012"},{"key":"2020042105454109200_bxz038C14","author":"Kouloumpis","year":"2011"},{"key":"2020042105454109200_bxz038C15","author":"Saif","year":"2014"},{"key":"2020042105454109200_bxz038C16","doi-asserted-by":"crossref","first-page":"668","DOI":"10.1109\/TSMCA.2010.2093886","article-title":"Gender classification for web forums","volume":"41","author":"Zhang","year":"2011","journal-title":"IEEE Trans. Syst. Man Cybern. Syst. Hum."},{"key":"2020042105454109200_bxz038C17","author":"Pang","year":"2004"},{"key":"2020042105454109200_bxz038C18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1561\/1500000011","article-title":"Opinion mining and sentiment analysis","volume":"2","author":"Pang","year":"2008","journal-title":"Found. Trends Inf. Retr."},{"key":"2020042105454109200_bxz038C19","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1162\/0891201041850885","article-title":"Learning subjective language","volume":"30","author":"Wiebe","year":"2004","journal-title":"Comput. Ling."},{"key":"2020042105454109200_bxz038C20","first-page":"285","article-title":"Sentiment classification of movie reviews by supervised machine learning approaches","volume":"4","author":"Kalaivani","year":"2013","journal-title":"Indian J. Comput. Sci. Eng."},{"key":"2020042105454109200_bxz038C21","first-page":"1769","article-title":"Performance evaluation of feature selection method for sentiment classification of online reviews using machine learning techniques","volume":"8","author":"Kalaivani","year":"2013","journal-title":"Int. Rev. Comput. Software"},{"key":"2020042105454109200_bxz038C22","author":"Barbosa","year":"2010"},{"key":"2020042105454109200_bxz038C23","doi-asserted-by":"crossref","first-page":"723","DOI":"10.1613\/jair.4272","article-title":"Sentiment analysis of short informal texts","volume":"50","author":"Kiritchenko","year":"2014","journal-title":"J. Artif. Intell. Res."},{"key":"2020042105454109200_bxz038C24","author":"Pak","year":"2010"},{"key":"2020042105454109200_bxz038C25","author":"Bao","year":"2014"},{"key":"2020042105454109200_bxz038C26","first-page":"279","volume-title":"Twitter Sentiment Analysis Using Machine Learning Techniques","author":"Le","year":"2015"},{"key":"2020042105454109200_bxz038C27","author":"Turney","year":"2002"},{"key":"2020042105454109200_bxz038C28","author":"Yun","year":"2015"},{"key":"2020042105454109200_bxz038C29","author":"Moh","year":"2015"},{"key":"2020042105454109200_bxz038C30","author":"Steinbach","year":"2000"},{"key":"2020042105454109200_bxz038C31","first-page":"13","volume-title":"Analysis of Twitter Data Using a Multiple-Level Clustering Strategy","author":"Baralis","year":"2013"},{"key":"2020042105454109200_bxz038C32","author":"Cheng","year":"2007"},{"key":"2020042105454109200_bxz038C33","author":"Pereira","year":"1995"},{"key":"2020042105454109200_bxz038C34","year":"2016"},{"key":"2020042105454109200_bxz038C35","first-page":"18","article-title":"Classification and regression by random forest","volume":"2","author":"Liaw","year":"2002","journal-title":"R News"},{"key":"2020042105454109200_bxz038C36","author":"Cakra","year":"2015"},{"key":"2020042105454109200_bxz038C37","doi-asserted-by":"crossref","first-page":"2870","DOI":"10.1109\/ACCESS.2017.2672677","article-title":"Comparison research on text pre-processing methods on twitter sentiment analysis","volume":"5","author":"Jianqiang","year":"2017","journal-title":"IEEE Access"},{"key":"2020042105454109200_bxz038C38","author":"Zhu","year":"2016"},{"key":"2020042105454109200_bxz038C39","doi-asserted-by":"crossref","first-page":"132","DOI":"10.1016\/j.knosys.2016.05.035","article-title":"Figurative messages and affect in twitter: differences between #irony, #sarcasm and #not","volume":"108","author":"Sulis","year":"2016","journal-title":"Knowl. Base. Syst."},{"key":"2020042105454109200_bxz038C40","first-page":"85","article-title":"Author gender identification from arabic text","volume":"35","author":"Alsmearat","year":"2017","journal-title":"J. Inf. Secur. Appl."},{"key":"2020042105454109200_bxz038C41","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1016\/j.dss.2017.05.012","article-title":"A data analytics approach to building a clinical decision support system for diabetic retinopathy: developing and deploying a model ensemble","volume":"101","author":"Piri","year":"2017","journal-title":"Decis. Support Syst."},{"key":"2020042105454109200_bxz038C42","doi-asserted-by":"crossref","first-page":"886","DOI":"10.1016\/j.ipm.2017.03.005","article-title":"Multilingual author profiling on facebook","volume":"53","author":"Fatima","year":"2017","journal-title":"Inf. Process. Manage."},{"key":"2020042105454109200_bxz038C43","author":"Jain","year":"2015"},{"key":"2020042105454109200_bxz038C44","author":"Sboev","year":"2016"},{"key":"2020042105454109200_bxz038C45"},{"key":"2020042105454109200_bxz038C46","author":"Speriosu","year":"2011"},{"key":"2020042105454109200_bxz038C47","author":"Saif","year":"2012"},{"key":"2020042105454109200_bxz038C48","doi-asserted-by":"crossref","first-page":"170","DOI":"10.1016\/j.dss.2014.07.003","article-title":"Tweet sentiment analysis with classifier ensembles","volume":"66","author":"da Silva","year":"2014","journal-title":"Decis. Support Syst."},{"key":"2020042105454109200_bxz038C49","author":"Bakliwal","year":"2012"},{"key":"2020042105454109200_bxz038C50","author":"Bravo-Marquez","year":"2013"},{"key":"2020042105454109200_bxz038C51","author":"Conover","year":"2011"},{"key":"2020042105454109200_bxz038C52","author":"Weinberger","year":"2009"},{"key":"2020042105454109200_bxz038C53","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1016\/j.compeleceng.2016.11.015","article-title":"A comparison study on active learning integrated ensemble approaches in sentiment analysis","volume":"57","author":"Aldogan","year":"2017","journal-title":"Comput. Electr. Eng."},{"key":"2020042105454109200_bxz038C54","author":"Muthutantrige","year":"2016"},{"key":"2020042105454109200_bxz038C55","author":"Turdjai","year":"2016"},{"key":"2020042105454109200_bxz038C56","first-page":"48","article-title":"A novel approach to solve the challenges in sentiment analysis","volume":"4","author":"Mane","year":"2017","journal-title":"Int. J. Comput. Technol."},{"key":"2020042105454109200_bxz038C57","author":"Prasad","year":"2017"},{"key":"2020042105454109200_bxz038C58","first-page":"45","article-title":"Sentiment analysis of movie reviews using machine learning techniques","volume":"179","author":"Baid","year":"2017","journal-title":"Int. J. Comput. Appl."},{"key":"2020042105454109200_bxz038C59","author":"Enayet","year":"2017"},{"key":"2020042105454109200_bxz038C60","author":"Salamon","year":"2016"},{"key":"2020042105454109200_bxz038C61","author":"Govindarajan","year":"2016"},{"key":"2020042105454109200_bxz038C62","author":"Muthutantrige","year":"2017"},{"key":"2020042105454109200_bxz038C63","author":"Zhang","year":"2016"},{"key":"2020042105454109200_bxz038C64","author":"Melville","year":"2008"}],"container-title":["The Computer Journal"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/comjnl\/article-pdf\/63\/3\/395\/33106408\/bxz038.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/comjnl\/article-pdf\/63\/3\/395\/33106408\/bxz038.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,4,21]],"date-time":"2020-04-21T05:46:04Z","timestamp":1587447964000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/comjnl\/article\/63\/3\/395\/5480914"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,4,27]]},"references-count":64,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2019,4,27]]},"published-print":{"date-parts":[[2020,3,18]]}},"URL":"https:\/\/doi.org\/10.1093\/comjnl\/bxz038","relation":{},"ISSN":["0010-4620","1460-2067"],"issn-type":[{"value":"0010-4620","type":"print"},{"value":"1460-2067","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,3]]},"published":{"date-parts":[[2019,4,27]]}}}