{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T20:33:46Z","timestamp":1769632426984,"version":"3.49.0"},"reference-count":61,"publisher":"Wiley","license":[{"start":{"date-parts":[[2020,8,1]],"date-time":"2020-08-01T00:00:00Z","timestamp":1596240000000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100011665","name":"Deanship of Scientific Research, King Saud University","doi-asserted-by":"publisher","award":["RG-1438-089"],"award-info":[{"award-number":["RG-1438-089"]}],"id":[{"id":"10.13039\/501100011665","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Scientific Programming"],"published-print":{"date-parts":[[2020,8,1]]},"abstract":"<jats:p>Information is exploding on the web at exponential pace, and online movie review over the web is a substantial source of information for online users. However, users write millions of movie reviews on regular basis, and it is not possible for users to condense the reviews. Classification and summarization of reviews is a difficult task in computational linguistics. Hence, an automatic method is demanded to summarize the vast amount of movie reviews, and this method will permit the users to speedily distinguish between positive and negative features of a movie. This work has proposed a classification and summarization method for movie reviews. For movie review classification, bag-of-words feature extraction technique is used to extract unigrams, bigrams, and trigrams as a feature set from given review documents and represent the review documents as a vector. Next, the Na\u00a8\u0131ve Bayes algorithm is employed to categorize the movie reviews (signified as a feature vector) into negative and positive reviews. For the task of movie review summarization, word2vec model is used to extract features from classified movie review sentences, and then semantic clustering technique is used to cluster semantically related review sentences. Different text features are employed to compute the salience score of all review sentences in clusters. Finally, the best-ranked review sentences are picked based on top salience scores to form a summary of movie reviews. Empirical results indicate that the suggested machine learning approach performed better than benchmark summarization approaches.<\/jats:p>","DOI":"10.1155\/2020\/5812715","type":"journal-article","created":{"date-parts":[[2020,8,1]],"date-time":"2020-08-01T23:33:14Z","timestamp":1596324794000},"page":"1-13","source":"Crossref","is-referenced-by-count":12,"title":["Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics"],"prefix":"10.1155","volume":"2020","author":[{"given":"Atif","family":"Khan","sequence":"first","affiliation":[{"name":"Department of Computer Science, Islamia College Peshawar, Peshawar, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Muhammad Adnan","family":"Gul","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Islamia College Peshawar, Peshawar, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"M. Irfan","family":"Uddin","sequence":"additional","affiliation":[{"name":"Institute of Computing, Kohat University of Science and Technology, Kohat, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4275-9731","authenticated-orcid":true,"given":"Syed Atif","family":"Ali Shah","sequence":"additional","affiliation":[{"name":"Faculty of Engineering and Information Technology, Northern University, Nowshehra, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0712-9133","authenticated-orcid":true,"given":"Shafiq","family":"Ahmad","sequence":"additional","affiliation":[{"name":"Industrial Engineering Department, College of Engineering, King Saud University, P.O. Box 800, Riyadh 11421, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Muhammad Dzulqarnain","family":"Al Firdausi","sequence":"additional","affiliation":[{"name":"Industrial Engineering Department, College of Engineering, King Saud University, P.O. Box 800, Riyadh 11421, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mazen","family":"Zaindin","sequence":"additional","affiliation":[{"name":"Department of Statistics and Operations Research, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","reference":[{"key":"1","first-page":"168","article-title":"Mining and summarizing customer reviews","author":"M. Hu"},{"key":"2","first-page":"329","article-title":"Movie review summarization and sentiment analysis using rapidminer","author":"A. F. Alsaqer"},{"key":"3","first-page":"43","article-title":"Movie review mining and summarization","author":"L. Zhuang"},{"issue":"2","key":"4","first-page":"1026","article-title":"Survey on opinion mining and summarization of user reviews on web","volume":"5","author":"V. B. Raut","year":"2014","journal-title":"International Journal of Computer Science and Information Technologies"},{"key":"5","doi-asserted-by":"publisher","DOI":"10.1109\/tsmcc.2011.2136334"},{"key":"6","doi-asserted-by":"publisher","DOI":"10.1109\/jstsp.2012.2229690"},{"key":"7","first-page":"100","article-title":"Introduction to information retrieval","volume":"16","author":"C. Manning","year":"2010","journal-title":"Natural Language Engineering"},{"key":"8","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505283"},{"key":"9","first-page":"417","article-title":"Sentiwordnet: a publicly available lexical resource for opinion mining","volume":"6","author":"A. Esuli","year":"2006","journal-title":"Proceedings of LREC"},{"key":"10","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4614-3223-4_3"},{"issue":"10","key":"11","doi-asserted-by":"crossref","first-page":"3934","DOI":"10.1016\/j.eswa.2012.12.084","article-title":"Sentiment polarity detection in Spanish reviews combining supervised and unsupervised approaches","volume":"40","author":"M.-T. Mart\u00edn-Valdivia","year":"2013","journal-title":"Expert Systems with Applications"},{"key":"12","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2015.02.001"},{"key":"13","doi-asserted-by":"publisher","DOI":"10.1109\/tkde.2015.2405553"},{"key":"14","first-page":"311","article-title":"Product review summarization from a deeper perspective","author":"D. K. Ly"},{"issue":"3","key":"15","article-title":"Survey on movie rating and review summarization in mobile environment","volume":"2","author":"P. Mehta","year":"2013","journal-title":"International Journal of Engineering Research and Technology"},{"key":"16","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2014.02.001"},{"key":"17","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-016-9475-9"},{"key":"18","doi-asserted-by":"publisher","DOI":"10.4304\/jetwi.2.3.258-268"},{"key":"19","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2015.06.002"},{"key":"20","volume-title":"Advances in Automatic Text Summarization","author":"I. Mani","year":"1999"},{"key":"21","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-011-0238-6"},{"key":"22","doi-asserted-by":"publisher","DOI":"10.1016\/j.chb.2013.05.024"},{"key":"23","first-page":"127","article-title":"\u201cC-Feel-It: a sentiment analyzer for micro-blogs","author":"A. Joshi"},{"key":"24","doi-asserted-by":"publisher","DOI":"10.1561\/1500000011"},{"key":"25","doi-asserted-by":"publisher","DOI":"10.1109\/tasl.2012.2217129"},{"key":"26","doi-asserted-by":"publisher","DOI":"10.1155\/2019\/2537689"},{"key":"27","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2010.11.003"},{"key":"28","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1007\/978-1-84628-754-1_2","article-title":"Extracting product features and opinions from reviews","volume-title":"Natural Language Processing and Text Mining","author":"A.-M. Popescu","year":"2007"},{"key":"29","doi-asserted-by":"publisher","DOI":"10.1007\/11552253_12"},{"issue":"1","key":"30","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1016\/j.csl.2013.04.001","article-title":"Ranked wordnet graph for sentiment polarity classification in twitter","volume":"28","author":"A. Montejo-R\u00e1ez","year":"2014","journal-title":"Computer Speech & Language"},{"key":"31","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2016.12.002"},{"key":"32","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2016.05.001"},{"key":"33","article-title":"A non-linear topic detection method for text summarization using wordnet","author":"C. N. Silla"},{"key":"34","doi-asserted-by":"publisher","DOI":"10.1177\/0165551507077406"},{"key":"35","doi-asserted-by":"publisher","DOI":"10.1007\/s11518-009-5100-7"},{"key":"36","first-page":"73","article-title":"A user-oriented web retrieval summarization tool","author":"D. Vazhenin"},{"key":"37","doi-asserted-by":"publisher","DOI":"10.2478\/cait-2012-0011"},{"key":"38","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2016.01.030"},{"key":"39","doi-asserted-by":"publisher","DOI":"10.1155\/2016\/5130603"},{"key":"40","volume-title":"The Pagerank Citation Ranking: Bringing Order to the Web","author":"L. Page","year":"1999"},{"key":"41","doi-asserted-by":"publisher","DOI":"10.1023\/a:1009930203452"},{"key":"42","first-page":"365","article-title":"LexPageRank: prestige in multi- document text summarization","author":"G. Erkan"},{"key":"43","volume-title":"A Language Independent Algorithm for Single and Multiple Document Summarization","author":"R. Mihalcea","year":"2005"},{"key":"44","first-page":"181","article-title":"Improved affinity graph based multi- document summarization","author":"X. Wan"},{"key":"45","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-009-0194-2"},{"key":"46","first-page":"90","article-title":"Weighted graph model based sentence clustering and ranking for document summarization","author":"S. S. Ge"},{"key":"47","doi-asserted-by":"publisher","DOI":"10.1007\/s12652-012-0143-x"},{"issue":"15","key":"48","doi-asserted-by":"crossref","first-page":"6904","DOI":"10.1016\/j.eswa.2014.04.004","article-title":"Event graphs for information re- trieval and multi-document summarization","volume":"41","author":"G. Glava\u0161","year":"2014","journal-title":"Expert Systems with Applications"},{"key":"49","first-page":"136","article-title":"Document centered approach to text normalization","author":"A. Mikheev"},{"key":"50","article-title":"Automatic query expansion using SMART: trec 3","author":"C. Buckley"},{"key":"51","doi-asserted-by":"publisher","DOI":"10.1108\/eb046814"},{"key":"52","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-007-0114-2"},{"key":"53","doi-asserted-by":"publisher","DOI":"10.1155\/2020\/7526580"},{"key":"54","first-page":"142","article-title":"Learning word vectors for sentiment analysis","author":"A. L. Maas"},{"key":"55","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2012.06.017"},{"issue":"1","key":"56","doi-asserted-by":"crossref","first-page":"126","DOI":"10.1016\/j.csl.2008.04.002","article-title":"Based models for automatic text summarization","volume":"23","author":"M. A. Fattah","year":"2009","journal-title":"Computer Speech & Language"},{"key":"57","volume-title":"Automatic Text Processing: The Transformation, Analysis, and Retrieval of Reading","author":"G. Salton","year":"1989"},{"key":"58","first-page":"271","article-title":"A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts","author":"B. Pang"},{"key":"59","doi-asserted-by":"publisher","DOI":"10.1613\/jair.1523"},{"key":"60","first-page":"404","article-title":"Textrank: bringing order into text","author":"R. Mihalcea"},{"key":"61","first-page":"74","article-title":"Rouge: a package for automatic evaluation of summaries,","author":"C.-Y. Lin"}],"updated-by":[{"DOI":"10.1155\/2021\/7871490","type":"corrigendum","label":"Corrigendum","source":"publisher","updated":{"date-parts":[[2021,1,9]],"date-time":"2021-01-09T00:00:00Z","timestamp":1610150400000}}],"container-title":["Scientific Programming"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/sp\/2020\/5812715.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/sp\/2020\/5812715.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/sp\/2020\/5812715.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,7,13]],"date-time":"2021-07-13T10:59:58Z","timestamp":1626173998000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.hindawi.com\/journals\/sp\/2020\/5812715\/"}},"subtitle":[],"editor":[{"given":"Shaukat","family":"Ali","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2020,8,1]]},"references-count":61,"alternative-id":["5812715","5812715"],"URL":"https:\/\/doi.org\/10.1155\/2020\/5812715","relation":{"corrigendum":[{"id-type":"doi","id":"10.1155\/2021\/7871490","asserted-by":"object"}]},"ISSN":["1875-919X","1058-9244"],"issn-type":[{"value":"1875-919X","type":"electronic"},{"value":"1058-9244","type":"print"}],"subject":[],"published":{"date-parts":[[2020,8,1]]}}}