{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,28]],"date-time":"2026-04-28T15:44:39Z","timestamp":1777391079038,"version":"3.51.4"},"reference-count":52,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2018,11,24]],"date-time":"2018-11-24T00:00:00Z","timestamp":1543017600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>Nowadays, overwhelming stock data is available, which is only of use if it is properly examined and mined. In this paper, the last twelve years of ICICI Bank\u2019s stock data have been extensively examined using statistical and supervised learning techniques. This study may be of great interest for those who wish to mine or study the stock data of banks or any financial organization. Different statistical measures have been computed to explore the nature, range, distribution, and deviation of data. The different descriptive statistical measures assist in finding different valuable metrics such as mean, variance, skewness, kurtosis, p-value, a-squared, and 95% confidence mean interval level of ICICI Bank\u2019s stock data. Moreover, daily percentage changes occurring over the last 12 years have also been recorded and examined. Additionally, the intraday stock status has been mined using ten different classifiers. The performance of different classifiers has been evaluated on the basis of various parameters such as accuracy, misclassification rate, precision, recall, specificity, and sensitivity. Based upon different parameters, the predictive results obtained using logistic regression are more acceptable than the outcomes of other classifiers, whereas na\u00efve Bayes, C4.5, random forest, linear discriminant, and cubic support vector machine (SVM) merely act as a random guessing machine. 
The outstanding performance of logistic regression has been validated using TOPSIS (technique for order preference by similarity to ideal solution) and WSA (weighted sum approach).<\/jats:p>","DOI":"10.3390\/data3040054","type":"journal-article","created":{"date-parts":[[2018,11,26]],"date-time":"2018-11-26T03:24:27Z","timestamp":1543202667000},"page":"54","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":27,"title":["Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining"],"prefix":"10.3390","volume":"3","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5942-134X","authenticated-orcid":false,"given":"Manik","family":"Sharma","sequence":"first","affiliation":[{"name":"Department of Computer Science and Applications, DAV University, Jalandhar 144401, India"}]},{"given":"Samriti","family":"Sharma","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Guru Nanak Dev University, Amritsar 143001, India"}]},{"given":"Gurvinder","family":"Singh","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Guru Nanak Dev University, Amritsar 143001, India"}]}],"member":"1968","published-online":{"date-parts":[[2018,11,24]]},"reference":[{"key":"ref_1","unstructured":"Sharma, R. (2018, September 20). ICICI Bank Equity Research. Available online: https:\/\/www.sanasecurities.com\/icici-bank-equity-research."},{"key":"ref_2","unstructured":"IANS (2018, September 20). SBI India\u2019s Most Trusted Bank, ICICI Top in Private Sector: Report. 19 April 2018. 
Available online: https:\/\/economictimes.indiatimes.com\/industry\/banking\/finance\/banking\/sbi-indias-most-trusted-bank-icici-tops-in-private-sector-report\/articleshow\/63818576.cms."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40537-015-0030-3","article-title":"Big data analytics: A survey","volume":"2","author":"Tsai","year":"2015","journal-title":"J. Big Data"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"995","DOI":"10.1016\/j.eswa.2006.02.016","article-title":"Data mining techniques for the detection of fraudulent financial statements","volume":"32","author":"Kirkos","year":"2007","journal-title":"Expert Syst. Appl."},{"key":"ref_5","unstructured":"Han, J., Kamber, M., and Pei, J. (2015). Data Mining Concepts and Techniques, Morgan Kauffmann Publishers."},{"key":"ref_6","first-page":"2700","article-title":"Analysis of Data Mining and Soft Computing Techniques in Prospecting Diabetes Disorder in Human Beings: A Review","volume":"9","author":"Kaur","year":"2018","journal-title":"Int. J. Pharm. Sci. Res."},{"key":"ref_7","first-page":"7","article-title":"Application of spatial data mining for agriculture","volume":"15","author":"Rajesh","year":"2011","journal-title":"Int. J. Comput. Appl."},{"key":"ref_8","first-page":"117","article-title":"Applying naive Bayes data mining technique for classification of agricultural land soils","volume":"9","author":"Bhargavi","year":"2009","journal-title":"Int. J. Comput. Sci. Netw. Secur."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"11303","DOI":"10.1016\/j.eswa.2012.02.063","article-title":"Data mining techniques and applications\u2014A decade review from 2000 to 2011","volume":"39","author":"Liao","year":"2012","journal-title":"Expert Syst. Appl."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"377","DOI":"10.14445\/22315381\/IJETT-V16P275","article-title":"Data Mining in Finance","volume":"16","author":"Kadam","year":"2014","journal-title":"Int. 
J. Eng. Trends Technol."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"927","DOI":"10.1016\/j.eswa.2005.06.024","article-title":"The use of data mining and neural networks for forecasting stock market returns","volume":"29","author":"Enke","year":"2005","journal-title":"Expert Syst. Appl."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1016\/j.engappai.2010.09.007","article-title":"A review on time series data mining","volume":"24","author":"Fu","year":"2011","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2431","DOI":"10.1007\/s10916-011-9710-5","article-title":"Data mining in healthcare and biomedicine: A survey of the literature","volume":"36","author":"Yoo","year":"2012","journal-title":"J. Med. Syst."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"856","DOI":"10.1016\/j.eswa.2006.01.038","article-title":"Integrating data mining with the case-based reasoning for chronic diseases prognosis and diagnosis","volume":"32","author":"Huang","year":"2007","journal-title":"Expert Syst. Appl."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1016\/j.jksuci.2012.10.003","article-title":"Application of data mining: Diabetes health care in young and old patients","volume":"25","author":"Aljumah","year":"2013","journal-title":"J. King Saud Univ.-Comput. Inf. Sci."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1016\/j.irbm.2017.09.002","article-title":"Stark Assessment of Lifestyle Based Human Disorders Using Data Mining Based Learning Techniques","volume":"38","author":"Sharma","year":"2017","journal-title":"IRBM"},{"key":"ref_17","first-page":"1","article-title":"An Advanced Conceptual Diagnostic Healthcare Framework for Diabetes and Cardiovascular Disorders","volume":"5","author":"Sharma","year":"2018","journal-title":"EAI Endorsed Trans. Scalable Inf. 
Syst."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1016\/j.drudis.2008.12.005","article-title":"Target discovery from data mining approaches","volume":"14","author":"Yang","year":"2012","journal-title":"Drug Discov. Today"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"571","DOI":"10.1016\/j.tibtech.2006.10.002","article-title":"Text mining and its potential applications in systems biology","volume":"24","author":"Ananiadou","year":"2006","journal-title":"Trends Biotechnol."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"217","DOI":"10.18576\/amis\/120121","article-title":"Performance Analysis of Various Machine Learning Techniques to Predict Cardiovascular Disease: An Emprical Study","volume":"12","author":"Chandralekha","year":"2018","journal-title":"Appl. Math. Inf. Sci."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1016\/j.procs.2016.06.016","article-title":"Performance Evaluation of Supervised Machine Learning Algorithms for Intrusion Detection","volume":"89","author":"Manjula","year":"2016","journal-title":"Procedia Comput. Sci."},{"key":"ref_22","first-page":"1","article-title":"ICICI Bank: A Multivariate Analysis of Customer\u2019s Acceptability","volume":"11","author":"Sangeeta","year":"2011","journal-title":"Glob. J. Manag. Bus. Res."},{"key":"ref_23","first-page":"12","article-title":"A Study of Financial Performance: A Comparative Analysis of AXIS and ICICI Bank","volume":"4","author":"Pooja","year":"2017","journal-title":"Int. J. Multidiscipl. Res. Dev."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1016\/j.eswa.2014.07.040","article-title":"Predicting stock and stock price index movement using Trend Deterministic Data Preparation and machine learning techniques","volume":"42","author":"Patel","year":"2015","journal-title":"Expert Syst. Appl."},{"key":"ref_25","unstructured":"Al-Radaideh, Q.I., Assaf, A.A., and Alnagi, E. 
(2013, January 17\u201319). Predicting Stock Price Using Data Mining Technique. Proceedings of the International Arab Conference on Information Technology (ACIT\u20192013), Katumu, Sudan."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"6653","DOI":"10.1007\/s00500-016-2216-9","article-title":"A strength-biased prediction model for forecasting exchange rates using support vector machines and genetic algorithms","volume":"21","author":"Toroslu","year":"2017","journal-title":"Soft Comput."},{"key":"ref_27","first-page":"22","article-title":"Predicting Stock Market Behavior using Data Mining Technique and News Sentiment Analysis","volume":"7","author":"Khedr","year":"2017","journal-title":"Int. J. Intell. Syst. Appl."},{"key":"ref_28","first-page":"2780","article-title":"Stock Market Prediction Using Data Mining","volume":"2","author":"Desai","year":"2014","journal-title":"Int. J. Eng. Dev. Res."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Zhao, L., and Wang, L. (2015, January 26\u201328). Price Trend Prediction of Stock Market Using Outlier Data Mining Algorithm. Proceedings of the IEEE Fifth International Conference on Big Data and Cloud Computing, Dalian, China.","DOI":"10.1109\/BDCloud.2015.19"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1248","DOI":"10.1016\/j.protcy.2016.05.104","article-title":"Clustering and Regression Techniques for Stock Prediction","volume":"24","author":"Bini","year":"2016","journal-title":"Procedia Technol."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1016\/j.dss.2014.04.004","article-title":"A kernel entropy manifold learning approach for financial data analysis","volume":"64","author":"Huang","year":"2014","journal-title":"Decis. 
Support Syst."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40854-017-0056-y","article-title":"Internet big data and capital markets: A literature review","volume":"3","author":"Ye","year":"2017","journal-title":"Financ. Innov."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40854-017-0074-9","article-title":"Performance evaluation of series and parallel strategies for financial time series forecasting","volume":"3","author":"Khashei","year":"2017","journal-title":"Financ. Innov."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40854-018-0104-2","article-title":"Estimating stock closing indices using a GA-weighted condensed polynomial neural network","volume":"4","author":"Nayak","year":"2018","journal-title":"Financ. Innov."},{"key":"ref_35","first-page":"157","article-title":"Statistical methods and common problems in medical or biomedical science research","volume":"9","author":"Yan","year":"2017","journal-title":"Int. J. Physiol. Pathophysiol. Pharmacol."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Du Prel, J.-B., R\u00f6hrig, B., and Blettner, M. (2009). Statistical Methods in Medical Research, Deutsches \u00c4rzteblatt International.","DOI":"10.3238\/arztebl.2009.0099"},{"key":"ref_37","first-page":"65","article-title":"Application of Statistics in Engineering Technology Programs","volume":"1","author":"Zhan","year":"2010","journal-title":"Am. J. Eng. Educ."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1080\/07350015.1988.10509660","article-title":"The Role of Statistics in Accounting, Marketing, Finance, and Production","volume":"6","author":"Hamada","year":"1988","journal-title":"J. Bus. Econ. Stat."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Buenestado, P., and Acho, L. (2018). Image Segmentation Based on statistical confidence Intervals. 
Entropy, 20.","DOI":"10.3390\/e20010046"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"32","DOI":"10.18438\/B8FW2H","article-title":"A Statistical Primer: Understanding Descriptive and Inferential Statistics","volume":"2","author":"Gillian","year":"2007","journal-title":"Evid. Based Lib. Inf. Pract."},{"key":"ref_41","unstructured":"Du, H. (2013). Data Mining Techniques and Applications\u2014An Introduction, Cengage Learning. [1st ed.]."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"6297","DOI":"10.1007\/s00500-016-2183-1","article-title":"Developing a trust model for pervasive computing based on Apriori association rules learning and Bayesian classification","volume":"21","author":"Angelo","year":"2017","journal-title":"Soft Comput."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1007\/s00500-011-0734-z","article-title":"Parameter determination and feature selection for the C4.5 algorithm using scatter search approach","volume":"16","author":"Lin","year":"2011","journal-title":"Soft Comput."},{"key":"ref_44","first-page":"20","article-title":"Classification through Machine Learning Technique: C4.5 Algorithm based on Various Entropies","volume":"82","author":"Sharma","year":"2013","journal-title":"Int. J. Comput. Appl."},{"key":"ref_45","first-page":"278","article-title":"Towards Stock Market Data Mining Using Enriched Random Forests from Textual Resources and Technical Indicators","volume":"339","author":"Maragoudakis","year":"2010","journal-title":"IFIP Adv. Inf. Commun. 
Technol."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"1945","DOI":"10.1007\/s00500-015-1616-6","article-title":"An alternative model for the analysis of detecting electronic industries earnings management using stepwise regression, random forest, and decision tree","volume":"20","author":"Chen","year":"2015","journal-title":"Soft Comput."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"2513","DOI":"10.1016\/j.cor.2004.03.016","article-title":"Forecasting stock market movement direction with support vector machine","volume":"32","author":"Huang","year":"2005","journal-title":"Comput. Oper. Res."},{"key":"ref_48","unstructured":"Larose, D.T., and Larose, C.D. (2016). Discovering Knowledge in Data: An Introduction to Data Mining, Wiley Publishers. [2nd ed.]."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"4","DOI":"10.5120\/cae2016651990","article-title":"Predicting Thyroid Disease using Linear Discriminant Analysis (LDA) Data Mining Technique","volume":"4","author":"Banu","year":"2016","journal-title":"Commun. Appl. Electron. (CAE)"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1756-0500-4-299","article-title":"Data mining methods in the prediction of Dementia: A real-data comparison of the accuracy, sensitivity and specificity of LDA, logistic regression, neural networks, SVM, classification trees and random forests","volume":"4","author":"Maroco","year":"2011","journal-title":"BMC Res. Notes"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"308","DOI":"10.1016\/j.procs.2015.07.054","article-title":"A-TOPSIS\u2014An Approach Based on TOPSIS for Ranking Evolutionary Algorithms","volume":"55","author":"Krohling","year":"2015","journal-title":"Procedia Comput. Sci."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Kolios, A., Mytilinou, V., Lozano-Minguez, E., and Salonitis, K.A. (2016). 
Comparative Study of Multiple-Criteria Decision-Making Methods under Stochastic Inputs. Energies, 9.","DOI":"10.3390\/en9070566"}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/3\/4\/54\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T05:51:56Z","timestamp":1775281916000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/3\/4\/54"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,11,24]]},"references-count":52,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2018,12]]}},"alternative-id":["data3040054"],"URL":"https:\/\/doi.org\/10.3390\/data3040054","relation":{},"ISSN":["2306-5729"],"issn-type":[{"value":"2306-5729","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,11,24]]}}}