{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T13:51:25Z","timestamp":1774446685050,"version":"3.50.1"},"reference-count":41,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2019,11,28]],"date-time":"2019-11-28T00:00:00Z","timestamp":1574899200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001871","name":"Funda\u00e7\u00e3o para a Ci\u00eancia e a Tecnologia","doi-asserted-by":"publisher","award":["UID\/CEC\/00319\/2019"],"award-info":[{"award-number":["UID\/CEC\/00319\/2019"]}],"id":[{"id":"10.13039\/501100001871","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>The development of malign cells that can grow in any part of the stomach, known as gastric cancer, is one of the most common causes of death worldwide. In order to increase the survival rate in patients with this condition, it is essential to improve the decision-making process leading to a better and more efficient selection of treatment strategies. Nowadays, with the large amount of information present in hospital institutions, it is possible to use data mining algorithms to improve the healthcare delivery. Thus, this study, using the CRISP methodology, aims to predict not only the mortality associated with this disease, but also the occurrence of any complication following surgery. A set of classification models were tested and compared in order to improve the prediction accuracy. The study showed that, on one hand, the J48 algorithm using oversampling is the best technique to predict the mortality in gastric cancer patients, with an accuracy of approximately 74%. On the other hand, the rain forest algorithm using oversampling presents the best results when predicting the possible occurrence of complications among gastric cancer patients after their in-hospital stays, with an accuracy of approximately 83%.<\/jats:p>","DOI":"10.3390\/e21121163","type":"journal-article","created":{"date-parts":[[2019,11,28]],"date-time":"2019-11-28T08:14:04Z","timestamp":1574928844000},"page":"1163","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":32,"title":["Application of Data Mining for the Prediction of Mortality and Occurrence of Complications for Gastric Cancer Patients"],"prefix":"10.3390","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8736-7443","authenticated-orcid":false,"given":"Cristiana","family":"Neto","sequence":"first","affiliation":[{"name":"Algoritmi Research Center, University of Minho, 4710-057 Braga, Portugal"}]},{"given":"Maria","family":"Brito","sequence":"additional","affiliation":[{"name":"Algoritmi Research Center, University of Minho, 4710-057 Braga, Portugal"}]},{"given":"V\u00edtor","family":"Lopes","sequence":"additional","affiliation":[{"name":"S\u00e3o Jo\u00e3o Hospital Center, 4200-319 Porto, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3957-2121","authenticated-orcid":false,"given":"Hugo","family":"Peixoto","sequence":"additional","affiliation":[{"name":"Algoritmi Research Center, University of Minho, 4710-057 Braga, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6457-0756","authenticated-orcid":false,"given":"Ant\u00f3nio","family":"Abelha","sequence":"additional","affiliation":[{"name":"Algoritmi Research Center, University of Minho, 4710-057 Braga, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4121-6169","authenticated-orcid":false,"given":"Jos\u00e9","family":"Machado","sequence":"additional","affiliation":[{"name":"Algoritmi Research Center, University of Minho, 4710-057 Braga, Portugal"}]}],"member":"1968","published-online":{"date-parts":[[2019,11,28]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"408","DOI":"10.1016\/j.procs.2015.04.021","article-title":"A survey of big data analytics in healthcare and government","volume":"50","author":"Archenaa","year":"2015","journal-title":"Procedia Comput. Sci."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1186\/2047-2501-2-3","article-title":"Big data analytics in healthcare: Promise and potential","volume":"2","author":"Raghupathi","year":"2014","journal-title":"Health Inf. Sci. Syst."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"21","DOI":"10.4172\/2472-1654.100131","article-title":"The Usefulness and Challenges of Big Data in Healthcare","volume":"3","author":"Fatt","year":"2018","journal-title":"J. Healthc. Commun."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Neto, C., Peixoto, H., Abelha, V., Abelha, A., and Machado, J. (2017). Knowledge Discovery from Surgical Waiting lists. Procedia Comput. Sci.","DOI":"10.1016\/j.procs.2017.11.141"},{"key":"ref_5","first-page":"2","article-title":"Data Mining: Concepts, Background and Methods of Integrating Uncertainty in Data Mining","volume":"3","author":"Li","year":"2010","journal-title":"CCSC SC Stud. E-J."},{"key":"ref_6","unstructured":"(2019, November 13). Stomach Cancer Statistics; World Cancer Research Fund. Available online: https:\/\/www.wcrf.org\/dietandcancer\/cancer-trends\/stomach-cancer-statistics."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R.L., Torre, L.A., and Jemal, A. (2018). Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin.","DOI":"10.3322\/caac.21492"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1149","DOI":"10.3748\/wjg.14.1149","article-title":"Role of symptoms in diagnosis and outcome of gastric cancer","volume":"14","author":"Maconi","year":"2008","journal-title":"World J. Gastroenterol. WJG"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"4421","DOI":"10.3748\/wjg.v17.i39.4421","article-title":"Comparative epidemiology of gastric cancer between Japan and China","volume":"17","author":"Lin","year":"2011","journal-title":"World J. Gastroenterol. WJG"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Ferreira, D., Peixoto, H., Machado, J., and Abelha, A. (2018, January 4\u20136). Predictive Data Mining in Nutrition Therapy. Proceedings of the IEEE 2018 13th APCA International Conference on Automatic Control and Soft Computing (CONTROLO), Ponta Delgada, Portugal.","DOI":"10.1109\/CONTROLO.2018.8516413"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Silwattananusarn, T., and Tuamsuk, K. (2012). Data Mining and Its Applications for Knowledge Management: A Literature Review from 2007 to 2012. Int. J. Data Min. Knowl. Manag. Process., 2.","DOI":"10.5121\/ijdkp.2012.2502"},{"key":"ref_12","unstructured":"Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., and Uthurusamy, R. (1996). Advances in Knowledge Discovery and Data Mining, AAAI Press."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1016\/j.csbj.2016.12.005","article-title":"Machine Learning and Data Mining Methods in Diabetes Research","volume":"15","author":"Kavakiotis","year":"2017","journal-title":"Comput. Struct. Biotechnol. J."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"690","DOI":"10.1086\/502460","article-title":"Application of Data Mining Techniques to Healthcare Data","volume":"25","author":"Obenshain","year":"2004","journal-title":"Infect. Control. Hosp. Epidemiol."},{"key":"ref_15","first-page":"150","article-title":"Mining the Complication Pattern of Gastric Cancer Patients by Using Artificial Neural Networks and Logistic Regression","volume":"2","author":"Lee","year":"2006","journal-title":"J. Hum. Resour. Adult Learn."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Polaka, I., Ga\u0161enko, E., Barash, O., Haick, H., and Leja, M. (2016). Constructing Interpretable Classifiers to Diagnose Gastric Cancer Based on Breath Tests. Procedia Comput. Sci.","DOI":"10.1016\/j.procs.2017.01.136"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Hosein Zadeh, R., Goshayeshi, L., Khooie, A., Etminani, K., Yousefli, Z., Nastarani, S., Farhang Nezhad, N., and Golabpoor, A. (2017). Predictive Model for Survival in Patients With Gastric Cancer. Acta Healthmed.","DOI":"10.19082\/6035"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Berner, E.S. (2007). Clinical Decision Support Systems, Springer.","DOI":"10.1007\/978-0-387-38319-4"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1007\/s11845-014-1100-9","article-title":"Predicting the probability of mortality of gastric cancer patients using decision tree","volume":"184","author":"Mohammadzadeh","year":"2015","journal-title":"Ir. J. Med. Sci."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Silva, E., Cardoso, L., Portela, F., Abelha, A., Santos, M.F., and Machado, J. (2015). Predicting nosocomial infection by using data mining technologies. New Contributions in Information Systems and Technologies, Springer.","DOI":"10.1007\/978-3-319-16528-8_18"},{"key":"ref_21","unstructured":"Chapman, P. (2000). CRISP-DM 1.0: Step-by-Step Data Mining Guide, SPSS."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Tapak, L., Mahjub, H., Hamidi, O., and Poorolajal, J. (2013). Real-data comparison of data mining methods in prediction of diabetes in Iran. Healthc. Inform. Res., 19.","DOI":"10.4258\/hir.2013.19.3.177"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1016\/j.procs.2015.03.182","article-title":"Performance Analysis of Classifier Models to Predict Diabetes Mellitus","volume":"47","author":"Kandhasamy","year":"2015","journal-title":"Procedia Comput. Sci."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Rokach, L., and Maimon, O. (2014). Data Mining with Decision Trees: Theory and Applications, World Scientific Publishing Co., Inc.. [2nd ed.].","DOI":"10.1142\/9097"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Yang, X.S., Nagar, A.K., and Joshi, A. (2017). Smart Trends in Systems, Security and Sustainability: Proceedings of WS4 2017, Springer.","DOI":"10.1007\/978-981-10-6916-1"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1","DOI":"10.5121\/ijdkp.2015.5101","article-title":"Diagnosis of Diabetes Using Classification Mining Techniques","volume":"5","author":"Iyer","year":"2015","journal-title":"Int. J. Data Min. Knowl. Manag. Process."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Landwehr, N., Hall, M., and Frank, E. (2005). Logistic Model Trees. Machine Learning, Springer.","DOI":"10.1007\/s10994-005-0466-3"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Sumner, M., Frank, E., and Hall, M. (2005, January 3\u20137). Speeding up Logistic Model Tree Induction. Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases, Porto, Portugal.","DOI":"10.1007\/11564126_72"},{"key":"ref_29","unstructured":"Area, S., and Mesra, R. (2012). Analysis of Bayes, Neural Network and Tree Classifier of Classification Technique in Data Mining Using WEKA. Computer Science & Information Technology, AIRCC Publishing Corporation."},{"key":"ref_30","unstructured":"Kulczycki, P., Kacprzyk, J., K\u00f3czy, L.T., Mesiar, R., and Wisniewski, R. (2017). Information Technology, Systems Research, and Computational Physics, Springer."},{"key":"ref_31","unstructured":"Frank, E., and Witten, I.H. (1998). Generating Accurate Rule Sets without Global Optimization, Morgan Kaufmann Publishers."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Ry\u017cko, D., Gawrysiak, P., Rybinski, H., and Kryszkiewicz, M. (2011). Emerging Intelligent Technologies in Industry, Springer.","DOI":"10.1007\/978-3-642-22732-5"},{"key":"ref_33","unstructured":"Freund, Y., and Schapire, R.E. (1996, January 3\u20136). Experiments with a new boosting algorithm. Proceedings of the Thirteenth International Conference on International Conference on Machine Learning, Bari, Italy."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1016\/j.procs.2016.04.016","article-title":"Performance Analysis of Data Mining Classification Techniques to Predict Diabetes","volume":"82","author":"Perveen","year":"2016","journal-title":"Procedia Comput. Sci."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1007\/s100440200011","article-title":"Bagging, boosting and the random subspace method for linear classifiers","volume":"5","author":"Skurichina","year":"2002","journal-title":"Pattern Anal. Appl."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Cornelis, C., Kryszkiewicz, M., Ciucci, D., Medina-Moreno, J., Motoda, H., and Ras, Z.W. (2014). Rough Sets and Intelligent Systems Paradigms, Springer.","DOI":"10.1007\/978-3-319-08729-0"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"239","DOI":"10.2147\/CMAR.S149619","article-title":"Gastric cancer: Epidemiology, prevention, classification, and treatment","volume":"10","author":"Sitarz","year":"2018","journal-title":"Cancer Manag. Res."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1016\/j.gtc.2013.01.002","article-title":"Gastric Cancer. Overview","volume":"42","author":"Correa","year":"2013","journal-title":"Gastroenterol. Clin. N. Am."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"vi57","DOI":"10.1093\/annonc\/mdt344","article-title":"Gastric cancer: ESMO\u2013ESSO\u2013ESTRO Clinical Practice Guidelines for diagnosis, treatment and follow-up","volume":"24","author":"Waddell","year":"2013","journal-title":"Ann. Oncol."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1007\/s10620-016-4368-y","article-title":"Gastrointestinal bleeding due to gastrointestinal tract malignancy: Natural history, management, and outcomes","volume":"62","author":"Schatz","year":"2017","journal-title":"Dig. Dis. Sci."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Biskup, E., Cai, F., Vetter, M., and Marsch, S. (2017). Oncological patients in the intensive care unit: Prognosis, decision-making, therapies and end-of-life care. Swiss Med. Wkly., 147.","DOI":"10.4414\/smw.2017.14557"}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/21\/12\/1163\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:38:11Z","timestamp":1760189891000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/21\/12\/1163"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,28]]},"references-count":41,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2019,12]]}},"alternative-id":["e21121163"],"URL":"https:\/\/doi.org\/10.3390\/e21121163","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11,28]]}}}