{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,17]],"date-time":"2026-04-17T08:12:39Z","timestamp":1776413559027,"version":"3.51.2"},"reference-count":39,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T00:00:00Z","timestamp":1675296000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>Machine learning (ML) was used to develop classification models to predict individual tumor patients\u2019 outcomes. Binary classification defined whether the tumor was malignant or benign. This paper presents a comparative analysis of machine learning algorithms used for breast cancer prediction. This study used a dataset obtained from the National Cancer Institute (NIH), USA, which contains 1.7 million data records. Classical and deep learning methods were included in the accuracy assessment. Classical decision tree (DT), linear discriminant (LD), logistic regression (LR), support vector machine (SVM), and ensemble techniques (ET) algorithms were used. Probabilistic neural network (PNN), deep neural network (DNN), and recurrent neural network (RNN) methods were used for comparison. Feature selection and its effect on accuracy were also investigated. The results showed that decision trees and ensemble techniques outperformed the other techniques, as they both achieved a 98.7% accuracy.<\/jats:p>","DOI":"10.3390\/data8020035","type":"journal-article","created":{"date-parts":[[2023,2,3]],"date-time":"2023-02-03T02:35:10Z","timestamp":1675391710000},"page":"35","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":45,"title":["Accuracy Assessment of Machine Learning Algorithms Used to Predict Breast Cancer"],"prefix":"10.3390","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3615-6811","authenticated-orcid":false,"given":"Mohamed","family":"Ebrahim","sequence":"first","affiliation":[{"name":"Department of Information Systems, College of Computing and Information Technology, Arab Academy for Science Technology and Maritime Transport, Alexandria 511511, Egypt"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8219-7681","authenticated-orcid":false,"given":"Ahmed Ahmed Hesham","family":"Sedky","sequence":"additional","affiliation":[{"name":"Department of Information Systems, College of Computing and Information Technology, Arab Academy for Science Technology and Maritime Transport, Alexandria 511511, Egypt"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1865-2627","authenticated-orcid":false,"given":"Saleh","family":"Mesbah","sequence":"additional","affiliation":[{"name":"Department of Information Systems, College of Computing and Information Technology, Arab Academy for Science Technology and Maritime Transport, Alexandria 511511, Egypt"}]}],"member":"1968","published-online":{"date-parts":[[2023,2,2]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1763","DOI":"10.1093\/ije\/dyaa035","article-title":"Intersections of machine learning and epidemiological methods for health services research","volume":"49","author":"Rose","year":"2021","journal-title":"Int. J. Epidemiol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"623","DOI":"10.1016\/j.jad.2019.06.034","article-title":"Identifying depression in the National Health and Nutrition Examination Survey data using a deep learning algorithm","volume":"257","author":"Oh","year":"2019","journal-title":"J. Affect. Disord."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1002\/eat.23232","article-title":"Interactions between different eating patterns on recurrent binge-eating behavior: A machine learning approach","volume":"53","author":"Linardon","year":"2020","journal-title":"Int. J. Eat. Disord."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"683","DOI":"10.1002\/erv.2850","article-title":"Machine learning to advance the prediction, prevention and treatment of eating disorders","volume":"29","author":"Wang","year":"2021","journal-title":"Eur. Eat. Disord. Rev. J. Eat. Disord. Assoc."},{"key":"ref_5","unstructured":"(2022, June 11). Breast Cancer-Metastatic: Statistics. Available online: https:\/\/www.cancer.net\/cancer-types\/breast-cancer-metastatic\/statistics#:~:text=The%205%2Dyear%20survival%20rate%20for%20women%20with%20metastatic%20breast,is%20treatable%20at%20any%20stage."},{"key":"ref_6","unstructured":"(2022, September 19). Cancer Facts & Figures 2022|American Cancer Society. Available online: https:\/\/www.cancer.org\/research\/cancer-facts-statistics\/all-cancer-facts-figures\/cancer-facts-figures-2022.html."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Hastie, T., Friedman, J., and Tibshirani, R. (2001). The Elements of Statistical Learning Data Mining, Inference, and Prediction, Springer. [1st ed.].","DOI":"10.1007\/978-0-387-21606-5"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Graves, A., Mohamed, A., and Hinton, G. (2013, January 26\u201331). Speech recognition with deep recurrent neural networks. Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada.","DOI":"10.1109\/ICASSP.2013.6638947"},{"key":"ref_9","unstructured":"Freund, Y., and Mason, L. The Alternating Decision Tree Learning Algorithm. Proceedings of the Sixteenth International Conference on Machine Learning, San Francisco, CA, USA."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"288","DOI":"10.1109\/TNNLS.2012.2230406","article-title":"Generalization Performance of Fisher Linear Discriminant Based on Markov Sampling","volume":"24","author":"Zou","year":"2013","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_11","unstructured":"Duda, R.O., and Hart, P.E. (1973). Pattern Classification and Scene Analysis, Wiley."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Fukunaga, K. (1990). Introduction to Statistical Pattern Recognition, Academic Press. [2nd ed.].","DOI":"10.1016\/B978-0-08-047865-4.50007-7"},{"key":"ref_13","unstructured":"(2022, September 16). Linear Regression in Python\u2014Real Python. Available online: https:\/\/realpython.com\/linear-regression-in-python\/#linear-regression."},{"key":"ref_14","unstructured":"(2022, September 16). \u201cSklearn.linear_model.LinearRegression,\u201d Scikit-Learn. Available online: https:\/\/scikit-learn\/stable\/modules\/generated\/sklearn.linear_model.LinearRegression.html."},{"key":"ref_15","unstructured":"Vapnik, V.N. (1998). Statistical Learning Theory, Wiley-Interscience."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1651","DOI":"10.1214\/aos\/1024691352","article-title":"Boosting the margin: A new explanation for the effectiveness of voting methods","volume":"26","author":"Bartlett","year":"1998","journal-title":"Ann. Stat."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1007\/BF00058655","article-title":"Bagging predictors","volume":"24","author":"Breiman","year":"1996","journal-title":"Mach. Learn."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1109\/72.80210","article-title":"Probabilistic neural networks and the polynomial Adaline as complementary techniques for classification","volume":"1","author":"Specht","year":"1990","journal-title":"IEEE Trans. Neural Netw."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Nazif, A.M., Sedky, A.A.H., and Badawy, O.M. (2020, January 28\u201330). MOOC\u2019s Student Results Classification by Comparing PNN and other Classifiers with Features Selection. Proceedings of the 2020 21st International Arab Conference on Information Technology (ACIT), Giza, Egypt.","DOI":"10.1109\/ACIT50332.2020.9300123"},{"key":"ref_20","unstructured":"Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning, MIT Press."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.neunet.2014.09.003","article-title":"Deep Learning in Neural Networks: An Overview","volume":"61","author":"Schmidhuber","year":"2015","journal-title":"Neural Netw."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Larochelle, H., Erhan, D., Courville, A., Bergstra, J., and Bengio, Y. (2007, January 20\u201324). An empirical evaluation of deep architectures on problems with many factors of variation. Proceedings of the 24th International Conference on Machine Learning, Corvalis, OR, USA.","DOI":"10.1145\/1273496.1273556"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1162\/neco.2006.18.7.1527","article-title":"A Fast Learning Algorithm for Deep Belief Nets","volume":"18","author":"Hinton","year":"2006","journal-title":"Neural Comput."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Liu, H. (2021). Single-point wind forecasting methods based on reinforcement learning. Wind Forecast. Railw. Eng., 177\u2013214.","DOI":"10.1016\/B978-0-12-823706-9.00005-3"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"89","DOI":"10.36548\/jei.2019.2.004","article-title":"Neural Network Analysis for Tumor Investigation and Cancer Prediction","volume":"2019","author":"Vijayakumar","year":"2019","journal-title":"J. Electron. Inform."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1451","DOI":"10.1007\/s12559-020-09813-6","article-title":"Survey on Machine Learning and Deep Learning Applications in Breast Cancer Diagnosis","volume":"13","author":"Chugh","year":"2021","journal-title":"Cogn. Comput."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1109\/TCBB.2018.2806438","article-title":"A Multimodal Deep Neural Network for Human Breast Cancer Prognosis Prediction by Integrating Multi-Dimensional Data","volume":"16","author":"Sun","year":"2019","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"522","DOI":"10.1055\/a-0855-3532","article-title":"A deep neural network improves endoscopic detection of early gastric cancer without blind spots","volume":"51","author":"Wu","year":"2019","journal-title":"Endoscopy"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Ferroni, P., Zanzotto, F.M., Riondino, S., Scarpato, N., Guadagni, F., and Roselli, M. (2019). Breast Cancer Prognosis Using a Machine Learning Approach. Cancers, 11.","DOI":"10.3390\/cancers11030328"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Zeid, M.A.-E., El-Bahnasy, K., and Abo-Youssef, S.E. (2022, January 8\u20139). DeepBreast: Building Optimized Framework for Prognosis of Breast Cancer Classification Based on Computational Intelligence. Proceedings of the 2022 2nd International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC), Cairo, Egypt.","DOI":"10.1109\/MIUCC55081.2022.9781677"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Yue, W., Wang, Z., Chen, H., Payne, A., and Liu, X. (2018). Machine Learning with Applications in Breast Cancer Diagnosis and Prognosis. Designs, 2.","DOI":"10.3390\/designs2020013"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1016\/j.artmed.2004.07.002","article-title":"Predicting breast cancer survivability: A comparison of three data mining methods","volume":"34","author":"Delen","year":"2005","journal-title":"Artif. Intell. Med."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"2387","DOI":"10.1007\/s00521-012-1196-7","article-title":"Decision tree classifiers for automated medical diagnosis","volume":"23","author":"Azar","year":"2012","journal-title":"Neural Comput. Appl."},{"key":"ref_35","first-page":"10","article-title":"Data Mining Techniques: To Predict and Resolve Breast Cancer Survivability","volume":"3","author":"Chaurasia","year":"2014","journal-title":"Int. J. Comput. Sci. Mob. Comput. IJCSMC"},{"key":"ref_36","unstructured":"Djebbari, A., Liu, Z., Phan, S., and AND Famili, F. (2008, January 3\u20136). International journal of computational biology and drug design (ijcbdd). Proceedings of the 21st Annual Conference on Neural Information Processing Systems, Vancouver, BC, Canada."},{"key":"ref_37","first-page":"37","article-title":"Knowledge based analysis of various statistical tools in detecting breast cancer","volume":"2","author":"Aruna","year":"2011","journal-title":"Comput. Sci. Inf. Technol."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Ojha, U., and Goel, S. (2017, January 12\u201313). A study on prediction of breast cancer recurrence using data mining techniques. Proceedings of the 7th International Conference on Cloud Computing, Data Science & Engineering\u2014Confluence, Noida, India.","DOI":"10.1109\/CONFLUENCE.2017.7943207"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1016\/j.ins.2010.08.047","article-title":"Simultaneous feature selection and classification using kernel-penalized support vect machines","volume":"181","author":"Maldonado","year":"2011","journal-title":"Inf. Sci."}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/8\/2\/35\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T18:22:29Z","timestamp":1760120549000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/8\/2\/35"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,2]]},"references-count":39,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2023,2]]}},"alternative-id":["data8020035"],"URL":"https:\/\/doi.org\/10.3390\/data8020035","relation":{},"ISSN":["2306-5729"],"issn-type":[{"value":"2306-5729","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,2]]}}}