{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T13:22:32Z","timestamp":1753881752710,"version":"3.41.2"},"reference-count":37,"publisher":"World Scientific Pub Co Pte Ltd","issue":"04","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Image Grap."],"published-print":{"date-parts":[[2023,7]]},"abstract":"<jats:p> Scientific studies of the elements that influence the box office performance of Indian films have generally concentrated on post-production elements, such as those discovered after a film has been completed or released, and notably for Bollywood films. Only fewer studies have looked at regional film industries and pre-production factors, which are elements that are known before a decision to greenlight a film is made. This study looked at Indian films using natural language processing and machine learning approaches to see if they would be profitable in the pre-production stage. We extract movie data and English subtitles (as an approximation to the screenplay) for the top five Indian regional film industries: Bollywood, Kollywood, Tollywood, Mollywood, and Sandalwood, as they make up a major portion of the Indian film industry\u2019s revenue. Subtitle Vector (Sub2Vec), a Paragraph Vector model trained on English subtitles, was used to embed subtitle text into 50 and 100 dimensions. The proposed approach followed a two-stage pipeline. In the first stage, Return on Investment (ROI) was calculated using aggregated subtitle embeddings and associated movie data. Classification models used the ROI calculated in the first step to predicting a film\u2019s verdict in the second step. The optimal regressor\u2013classifier pair was determined by evaluating classification models using [Formula: see text]-score and Cohen\u2019s Kappa scores on various hyperparameters. When compared to benchmark methods, our proposed methodology forecasts box office success more accurately. <\/jats:p>","DOI":"10.1142\/s0219467823500304","type":"journal-article","created":{"date-parts":[[2022,4,7]],"date-time":"2022-04-07T14:31:35Z","timestamp":1649341895000},"source":"Crossref","is-referenced-by-count":0,"title":["Early Success Prediction of Indian Movies Using Subtitles: A Document Vector Approach"],"prefix":"10.1142","volume":"23","author":[{"given":"Vaddadi Sai","family":"Rahul","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India"}]},{"given":"M.","family":"Tejas","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India"}]},{"given":"N.\u00a0Narayanan","family":"Prasanth","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India"}]},{"given":"S.\u00a0P.","family":"Raja","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India"}]}],"member":"219","published-online":{"date-parts":[[2022,4,6]]},"reference":[{"issue":"2","key":"S0219467823500304BIB001","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1287\/mksc.18.2.115","volume":"18","author":"Neelamegham R.","year":"1999","journal-title":"Market. Sci."},{"issue":"3","key":"S0219467823500304BIB002","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1080\/09332480.2000.10542216","volume":"13","author":"Simonoff J. F.","year":"2000","journal-title":"CHANCE"},{"issue":"2","key":"S0219467823500304BIB003","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1016\/j.eswa.2005.07.018","volume":"30","author":"Sharda R.","year":"2006","journal-title":"Expert Syst. Appl."},{"key":"S0219467823500304BIB004","first-page":"1","volume-title":"Proc. 2006 IEEE Int. Conf. Computational Cybernetics","author":"Mazurowski M. A.","year":"2006"},{"key":"S0219467823500304BIB005","first-page":"301","volume-title":"Proc. 2009 IEEE\/WIC\/ACM Int. Joint Conf. Web Intelligence and Intelligent Agent Technology","volume":"1","author":"Zhang W.","year":"2009"},{"issue":"1","key":"S0219467823500304BIB006","doi-asserted-by":"crossref","first-page":"280","DOI":"10.1016\/j.eswa.2007.09.042","volume":"36","author":"Lee K.","year":"2009","journal-title":"Expert Syst. Appl."},{"issue":"1","key":"S0219467823500304BIB007","first-page":"1","volume":"56","author":"Reddy A.","year":"2012","journal-title":"Int. J. Comput. Appl."},{"issue":"9","key":"S0219467823500304BIB008","first-page":"69","volume":"2","author":"Kaur A.","year":"2013","journal-title":"Int. J. Sci. Res."},{"issue":"3","key":"S0219467823500304BIB009","doi-asserted-by":"crossref","first-page":"47","DOI":"10.4102\/sajbm.v44i3.162","volume":"44","author":"Pangarker N. A.","year":"2013","journal-title":"South Afr. J. Bus. Manag."},{"key":"S0219467823500304BIB010","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"crossref","first-page":"571","DOI":"10.1007\/978-3-642-39712-7_44","volume-title":"MLDM 2013: Machine Learning and Data Mining in Pattern Recognition","volume":"7988","author":"Parimi R.","year":"2013"},{"key":"S0219467823500304BIB011","first-page":"1209","volume-title":"Proc. 2013 IEEE\/ACM Int. Conf. Advances in Social Networks Analysis and Mining (ASONAM 2013)","author":"Apala K. R.","year":"2013"},{"issue":"8","key":"S0219467823500304BIB012","doi-asserted-by":"crossref","first-page":"e71226","DOI":"10.1371\/journal.pone.0071226","volume":"8","author":"Mesty\u00e1n M.","year":"2013","journal-title":"PLoS ONE"},{"issue":"2","key":"S0219467823500304BIB013","doi-asserted-by":"crossref","first-page":"350","DOI":"10.4236\/ojml.2014.42028","volume":"4","author":"Hunter S. D.","year":"2014","journal-title":"Open J. Mod. Linguist."},{"issue":"11","key":"S0219467823500304BIB014","doi-asserted-by":"crossref","first-page":"2639","DOI":"10.1109\/TKDE.2014.2306681","volume":"26","author":"Eliashberg J.","year":"2014","journal-title":"IEEE Trans. Knowl. Data Eng."},{"issue":"2","key":"S0219467823500304BIB016","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1016\/j.ijforecast.2014.05.006","volume":"31","author":"Kim T.","year":"2015","journal-title":"Int. J. Forecast."},{"issue":"10","key":"S0219467823500304BIB017","first-page":"651","volume":"5","author":"Taneja H.","year":"2016","journal-title":"Int. J. Sci. Res."},{"issue":"11","key":"S0219467823500304BIB018","first-page":"1","volume":"3","author":"Chaudhari N.","year":"2016","journal-title":"Int. J. Eng. Sci. Manag. Res."},{"issue":"2","key":"S0219467823500304BIB019","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1386\/josc.7.2.135_1","volume":"7","author":"Hunter S.","year":"2016","journal-title":"J. Screenwriting"},{"issue":"15","key":"S0219467823500304BIB020","doi-asserted-by":"crossref","first-page":"4111","DOI":"10.2298\/FIL1615111C","volume":"30","author":"Chen R.","year":"2016","journal-title":"Filomat"},{"issue":"3","key":"S0219467823500304BIB021","doi-asserted-by":"crossref","first-page":"874","DOI":"10.1080\/07421222.2016.1243969","volume":"33","author":"Lash M. T.","year":"2016","journal-title":"J. Manag. Inf. Syst."},{"key":"S0219467823500304BIB022","doi-asserted-by":"crossref","first-page":"608","DOI":"10.1016\/j.ins.2016.08.027","volume":"372","author":"Hur M.","year":"2016","journal-title":"Inf. Sci."},{"key":"S0219467823500304BIB023","first-page":"334","volume-title":"Proc. 2017 Int. Conf. Intelligent Computing and Control Systems (ICICCS)","author":"Magdum S. S.","year":"2017"},{"key":"S0219467823500304BIB024","first-page":"182","volume-title":"Proc. 2017 Int. Conf. Intelligent Sustainable Systems (ICISS)","author":"Subramaniyaswamy V.","year":"2017"},{"key":"S0219467823500304BIB025","doi-asserted-by":"crossref","first-page":"1855","DOI":"10.1007\/s00521-017-3162-x","volume":"31","author":"Zhou Y.","year":"2019","journal-title":"Neural Comput. Appl."},{"key":"S0219467823500304BIB026","first-page":"1","volume-title":"Proc. 2017 3rd Int. Conf. Electrical Information and Communication Technology (EICT)","author":"Quader N.","year":"2017"},{"key":"S0219467823500304BIB027","first-page":"4315419","volume":"2017","author":"Kim T.","year":"2017","journal-title":"Comput. Intell. Neurosci."},{"key":"S0219467823500304BIB028","first-page":"655","volume-title":"Proc. 24th ACM SIGKDD Int. Conf. Knowledge Discovery & Data Mining","author":"Ruhrl\u00e4nder R. P.","year":"2018"},{"key":"S0219467823500304BIB029","first-page":"1","volume-title":"Proc. 2018 IEEE Congr. Evolutionary Computation (CEC)","author":"Zhou Y.","year":"2018"},{"key":"S0219467823500304BIB030","first-page":"385","volume-title":"Proc. 2018 First Int. Conf. Secure Cyber Computing and Communication (ICSCCC)","author":"Dhir R.","year":"2018"},{"key":"S0219467823500304BIB031","first-page":"182","volume-title":"Cogn. Syst. Res.","volume":"52","author":"Ru Y.","year":"2018"},{"key":"S0219467823500304BIB032","first-page":"102","volume-title":"Proc. 2019 Amity Int. Conf. Artificial Intelligence (AICAI)","author":"Verma G.","year":"2019"},{"key":"S0219467823500304BIB033","first-page":"111","author":"Jayachandran S.","year":"2019","journal-title":"J. Appl. Sci. Comput."},{"key":"S0219467823500304BIB034","doi-asserted-by":"crossref","first-page":"127","DOI":"10.18653\/v1\/W19-3414","volume-title":"Proc. Second Storytelling Workshop","author":"Kim Y. J.","year":"2019"},{"issue":"2","key":"S0219467823500304BIB035","first-page":"516","volume":"7","author":"Reddy V. G.","year":"2020","journal-title":"Eur. J. Mol. Clin. Med."},{"issue":"5","key":"S0219467823500304BIB036","doi-asserted-by":"crossref","first-page":"102278","DOI":"10.1016\/j.ipm.2020.102278","volume":"57","author":"Ahmad I.","year":"2020","journal-title":"Inf. Process. Manag."},{"key":"S0219467823500304BIB037","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/j.inffus.2020.02.002","volume":"60","author":"Wang Z.","year":"2020","journal-title":"Inf. Fusion"},{"key":"S0219467823500304BIB038","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1007\/s10479-020-03804-4","volume":"308","author":"Liao Y.","year":"2022","journal-title":"Ann. Oper. Res."}],"container-title":["International Journal of Image and Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219467823500304","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T05:55:37Z","timestamp":1691128537000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0219467823500304"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,6]]},"references-count":37,"journal-issue":{"issue":"04","published-print":{"date-parts":[[2023,7]]}},"alternative-id":["10.1142\/S0219467823500304"],"URL":"https:\/\/doi.org\/10.1142\/s0219467823500304","relation":{},"ISSN":["0219-4678","1793-6756"],"issn-type":[{"type":"print","value":"0219-4678"},{"type":"electronic","value":"1793-6756"}],"subject":[],"published":{"date-parts":[[2022,4,6]]},"article-number":"2350030"}}