{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,20]],"date-time":"2026-02-20T00:24:53Z","timestamp":1771547093442,"version":"3.50.1"},"reference-count":61,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,3,30]],"date-time":"2023-03-30T00:00:00Z","timestamp":1680134400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,3,30]],"date-time":"2023-03-30T00:00:00Z","timestamp":1680134400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The random forest algorithm could be enhanced and produce better results with a well-designed and organized feature selection phase. The dependency structure between the variables is considered to be the most important criterion behind selecting the variables to be used in the algorithm during the feature selection phase. As the dependency structure is mostly nonlinear, making use of a tool that considers nonlinearity would be a more beneficial approach. Copula-Based Clustering technique (CoClust) clusters variables with copulas according to nonlinear dependency. We show that it is possible to achieve a remarkable improvement in CPU times and accuracy by adding the CoClust-based feature selection step to the random forest technique. We work with two different large datasets, namely, the MIMIC-III Sepsis Dataset and the SMS Spam Collection Dataset. The first dataset is large in terms of rows referring to individual IDs, while the latter is an example of longer column length data with many variables to be considered. In the proposed approach, first, random forest is employed without adding the CoClust step. Then, random forest is repeated in the clusters obtained with CoClust. The obtained results are compared in terms of CPU time, accuracy and ROC (receiver operating characteristic) curve. CoClust clustering results are compared with K-means and hierarchical clustering techniques. The Random Forest, Gradient Boosting and Logistic Regression results obtained with these clusters and the success of RF and CoClust working together are examined. <\/jats:p>","DOI":"10.1186\/s40537-023-00720-9","type":"journal-article","created":{"date-parts":[[2023,3,30]],"date-time":"2023-03-30T13:16:12Z","timestamp":1680182172000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["An enhanced random forest approach using CoClust clustering: MIMIC-III and SMS spam collection application"],"prefix":"10.1186","volume":"10","author":[{"given":"Zeynep","family":"Ilhan Taskin","sequence":"first","affiliation":[]},{"given":"Kasirga","family":"Yildirak","sequence":"additional","affiliation":[]},{"given":"Cagdas Hakan","family":"Aladag","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,3,30]]},"reference":[{"key":"720_CR1","doi-asserted-by":"crossref","unstructured":"Darwiche Aiman A. 2018. \u201cMachine learning methods for septic shock prediction.\u201d PhD Thesis, Nova Southeastern University. Retrieved from NSUWorks, College of Engineering and Computing. (1051) https:\/\/nsuworks.nova.edu\/gscis_etd\/1051","DOI":"10.1145\/3293663.3293673"},{"key":"720_CR2","doi-asserted-by":"publisher","DOI":"10.2196\/medinform.6690","author":"J Lee","year":"2017","unstructured":"Lee J. Patient-specific predictive modeling using random forests: an observational study for the critically Ill. JMIR Med Informat. 2017. https:\/\/doi.org\/10.2196\/medinform.6690.","journal-title":"JMIR Med Informat"},{"issue":"12","key":"720_CR3","doi-asserted-by":"publisher","first-page":"8553","DOI":"10.1007\/s00500-019-04427-z","volume":"24","author":"S Levantesi","year":"2020","unstructured":"Levantesi S, Nigri A. A random forest algorithm to improve the Lee-carter mortality forecasting: impact on q-forward. Soft Comput. 2020;24(12):8553\u201367. https:\/\/doi.org\/10.1007\/s00500-019-04427-z.","journal-title":"Soft Comput"},{"key":"720_CR4","doi-asserted-by":"publisher","DOI":"10.1136\/bmjopen-2018-025925","author":"CJ McWilliams","year":"2019","unstructured":"McWilliams CJ, et al. Towards a decision support tool for \u0131ntensive care discharge: machine learning algorithm development using electronic healthcare data from MIMIC-III and Bristol, UK. BMJ Open. 2019. https:\/\/doi.org\/10.1136\/bmjopen-2018-025925.","journal-title":"BMJ Open"},{"issue":"8","key":"720_CR5","doi-asserted-by":"publisher","first-page":"2967","DOI":"10.1007\/s00500-015-1925-9","volume":"20","author":"P Mistry","year":"2016","unstructured":"Mistry P, Neagu D, Trundle PR, Vessey JD. Using random forest and decision tree models for a new vehicle prediction approach in computational toxicology. Soft Comput. 2016;20(8):2967\u201379. https:\/\/doi.org\/10.1007\/s00500-015-1925-9.","journal-title":"Soft Comput"},{"key":"720_CR6","doi-asserted-by":"publisher","DOI":"10.5772\/intechopen.76988","author":"S Van Poucke","year":"2018","unstructured":"Van Poucke S, Kovacevic A, Vukicevic M. Early prediction of patient mortality based on routine laboratory tests and predictive models in critically Ill patients. In Data Mining InTech. 2018. https:\/\/doi.org\/10.5772\/intechopen.76988.","journal-title":"In Data Mining InTech"},{"key":"720_CR7","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1007\/BF00058655","volume":"24","author":"L Breiman","year":"1996","unstructured":"Breiman L. Bagging predictors. Mach Learn. 1996;24:123\u201340. https:\/\/doi.org\/10.1007\/BF00058655.","journal-title":"Mach Learn"},{"key":"720_CR8","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1023\/A:1007607513941","volume":"40","author":"TG Dietterich","year":"2000","unstructured":"Dietterich TG. An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization. Mach Learn. 2000;40:139\u201357. https:\/\/doi.org\/10.1023\/A:1007607513941.","journal-title":"Mach Learn"},{"issue":"8","key":"720_CR9","doi-asserted-by":"publisher","first-page":"832","DOI":"10.1109\/34.709601","volume":"20","author":"K Ho","year":"1998","unstructured":"Ho K. The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell. 1998;20(8):832\u201344. https:\/\/doi.org\/10.1109\/34.709601.","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"720_CR10","volume-title":"\"Using Adaptive Bagging To Debias Regressions.\" Technical Report 547","author":"L Breiman","year":"1999","unstructured":"Breiman L. \u201cUsing Adaptive Bagging To Debias Regressions.\u201d Technical Report 547. Berkeley: University of California at Berkeley; 1999."},{"key":"720_CR11","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L. Random forests. Mach Learn. 2001;45:5\u201332. https:\/\/doi.org\/10.1023\/A:1010933404324.","journal-title":"Mach Learn"},{"issue":"4","key":"720_CR12","doi-asserted-by":"publisher","first-page":"547","DOI":"10.1038\/modpathol.3800322","volume":"18","author":"T Shi","year":"2005","unstructured":"Shi T, et al. Tumor classification by tissue microarray profiling: random forest clustering applied to renal cell carcinoma. Mod Pathol. 2005;18(4):547\u201357. https:\/\/doi.org\/10.1038\/modpathol.3800322.","journal-title":"Mod Pathol"},{"issue":"1","key":"720_CR13","doi-asserted-by":"publisher","first-page":"118","DOI":"10.1198\/106186006X94072","volume":"15","author":"T Shi","year":"2006","unstructured":"Shi T, Horvath S. Unsupervised learning with random forest predictors. J Comput Graph Stat. 2006;15(1):118\u201338.","journal-title":"J Comput Graph Stat"},{"key":"720_CR14","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1016\/j.csda.2014.06.017","volume":"80","author":"A Hapfelmeier","year":"2014","unstructured":"Hapfelmeier A, Ulm K. Variable selection by random forests using data with missing values. Comput Stat Data Anal. 2014;80:129\u201339. https:\/\/doi.org\/10.1016\/j.csda.2014.06.017.","journal-title":"Comput Stat Data Anal"},{"key":"720_CR15","doi-asserted-by":"publisher","unstructured":"Uddin Taufeeq, Azher Uddin. 2015. \u201cA guided random forest based feature selection for activity recognition.\u201d In 2nd Int\u2019l Conf. On electrical engineering and \u0131nfonnation & communication technology (ICEEICT). https:\/\/doi.org\/10.1109\/ICEEICT.2015.7307376","DOI":"10.1109\/ICEEICT.2015.7307376"},{"key":"720_CR16","unstructured":"Gupta Chelsi. 2019. \u201cFeature selection and analysis for standard machine learning of audio beehive samples.\u201d Msc Thesis, Utah State University. https:\/\/digitalcommons.usu.edu\/etd\/7564."},{"key":"720_CR17","first-page":"229","volume":"8","author":"A Sklar","year":"1959","unstructured":"Sklar A. Fonctions de repartition \u00e1 n dimensions et leurs marges. Publications de l\u2019Institut Statistiquede l\u2019Universit\u00e9 de Paris. 1959;8:229\u201331.","journal-title":"Publications de l'Institut Statistiquede l'Universit\u00e9 de Paris"},{"key":"720_CR18","volume-title":"An \u0131ntroduction to copulas","author":"RB Nelsen","year":"2006","unstructured":"Nelsen RB. An \u0131ntroduction to copulas. 2nd ed. Berlin: Springer Science & Business Media; 2006.","edition":"2"},{"key":"720_CR19","doi-asserted-by":"publisher","unstructured":"Jaworski Piotr, Fabrizio Durante, Wolfgang Hardle, Tomasz Rychlik. 2009. \u201cCopula Theory And Its Applications.\u201d Proceedings of the Workshop Held in Warsaw, 25\u201326. https:\/\/doi.org\/10.1007\/978-3-642-12465-5","DOI":"10.1007\/978-3-642-12465-5"},{"key":"720_CR20","doi-asserted-by":"publisher","first-page":"7140","DOI":"10.3390\/app11157140","volume":"11","author":"R Mesiar","year":"2021","unstructured":"Mesiar R, Sheikhi A. Nonlinear random forest classification, a copula-based approach. Appl Sci. 2021;11:7140. https:\/\/doi.org\/10.3390\/app11157140.","journal-title":"Appl Sci"},{"key":"720_CR21","unstructured":"Di Lascio, Francesca Marta Lilja. 2008. \u201cAnalyzing the dependence structure of microarray data: a copula-based approach.\u201d PhD Thesis, University of Bologna."},{"key":"720_CR22","first-page":"994","volume":"2017","author":"AEW Johnson","year":"2018","unstructured":"Johnson AEW, Mark RG. Real-time mortality prediction in the \u0131ntensive care unit. AMIA Annu Symp Proc. 2018;2017:994\u20131003.","journal-title":"AMIA Annu Symp Proc"},{"issue":"1","key":"720_CR23","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1007\/s00357-012-9099-y","volume":"29","author":"Di Lascio","year":"2012","unstructured":"Lascio Di, Lilja FM, Giannerini S. A copula-based algorithm for discovering patterns of dependent observations. J Classif. 2012;29(1):50\u201375. https:\/\/doi.org\/10.1007\/s00357-012-9099-y.","journal-title":"J Classif"},{"issue":"1","key":"720_CR24","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1007\/s00362-016-0822-3","volume":"60","author":"Di Lascio","year":"2019","unstructured":"Lascio Di, Lilja FM, Giannerini S. Clustering dependent observations with copula functions. Stat Pap. 2019;60(1):35\u201351. https:\/\/doi.org\/10.1007\/s00362-016-0822-3.","journal-title":"Stat Pap"},{"issue":"15","key":"720_CR25","doi-asserted-by":"publisher","first-page":"9677","DOI":"10.1007\/s00500-020-05399-1","volume":"25","author":"YA Khan","year":"2021","unstructured":"Khan YA, Shan QS, Liu Q, Abbas SZ. A nonparametric copula-based decision tree for two random variables using MIC as a classification index. Soft Comput. 2021;25(15):9677\u201392. https:\/\/doi.org\/10.1007\/s00500-020-05399-1.","journal-title":"Soft Comput"},{"key":"720_CR26","doi-asserted-by":"publisher","first-page":"651","DOI":"10.1111\/j.1539-6975.2009.01318.x","volume":"76","author":"M Eling","year":"2009","unstructured":"Eling M, Toplek D. Modeling and management of nonlinear dependencies-copulas in dynamic financial analysis. J Risk Insur. 2009;76:651\u201381. https:\/\/doi.org\/10.1111\/j.1539-6975.2009.01318.x.","journal-title":"J Risk Insur"},{"key":"720_CR27","doi-asserted-by":"publisher","first-page":"662340","DOI":"10.3389\/fmed.2021.662340","volume":"8","author":"Y Zhu","year":"2021","unstructured":"Zhu Y, et al. Machine learning prediction models for mechanically ventilated patients analyses of the MIMIC-III database. Front Med. 2021;8:662340. https:\/\/doi.org\/10.3389\/fmed.2021.662340.","journal-title":"Front Med"},{"key":"720_CR28","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1007\/s41019-022-00176-6","volume":"7","author":"SR Khope","year":"2022","unstructured":"Khope SR, Elias S. Critical correlation of predictors for an efficient risk prediction framework of ICU patient using correlation and transformation of MIMIC-III dataset. Data Sci Eng. 2022;7:71\u201386. https:\/\/doi.org\/10.1007\/s41019-022-00176-6.","journal-title":"Data Sci Eng"},{"issue":"3","key":"720_CR29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/10920277.1998.10595667","volume":"2","author":"EW Frees","year":"1998","unstructured":"Frees EW, Valdez EA. Understanding relationships using copulas. North Am Actuar J. 1998;2(3):1\u201325. https:\/\/doi.org\/10.1080\/10920277.1998.10595667.","journal-title":"North Am Actuar J"},{"key":"720_CR30","volume-title":"The estimation method of \u0131nference functions for margins for multivariate models","author":"H Joe","year":"1996","unstructured":"Joe H, Xu JJ. The estimation method of \u0131nference functions for margins for multivariate models. Vancouver: University of British Columbia; 1996."},{"issue":"3","key":"720_CR31","doi-asserted-by":"publisher","first-page":"543","DOI":"10.1093\/biomet\/82.3.543","volume":"82","author":"C Genest","year":"1995","unstructured":"Genest C, Ghoudi K, Rivest L-P. A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika. 1995;82(3):543\u201352.","journal-title":"Biometrika"},{"key":"720_CR32","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1007\/978-3-319-64221-5_4","volume-title":"Copulas and dependence models with applications. in copulas and dependence models with applications","author":"Di Lascio","year":"2017","unstructured":"Lascio Di, Lilja FM, Durante F, Pappada R. Copulas and dependence models with applications. in copulas and dependence models with applications. Berlin: Springer International Publishing; 2017;49\u201365."},{"key":"720_CR33","doi-asserted-by":"publisher","unstructured":"Lascio Di FML, Disegna M. A copula-based clustering algorithm to analyse EU country diets, Knowledge-Based Systems.  2017;132:72\u201384. https:\/\/doi.org\/10.1016\/j.knosys.2017.06.004","DOI":"10.1016\/j.knosys.2017.06.004"},{"key":"720_CR34","doi-asserted-by":"publisher","first-page":"108387","DOI":"10.1016\/j.apacoust.2020.107387","volume":"167","author":"Ji Xue","year":"2020","unstructured":"Xue Ji, Yang B, Tang Q. Seabed sediment classification using multibeam backscatter data based on the selecting optimal random forest model. Appl Acoust. 2020;167:108387. https:\/\/doi.org\/10.1016\/j.apacoust.2020.107387.","journal-title":"Appl Acoust"},{"issue":"7","key":"720_CR35","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1145\/129902.129905","volume":"35","author":"RL Rivest","year":"1992","unstructured":"Rivest RL, Hellman ME, Anderson JC. Responses to NIST\u2019s proposal. Commun ACM. 1992;35(7):41\u201354. https:\/\/doi.org\/10.1145\/129902.129905.","journal-title":"Commun ACM"},{"key":"720_CR36","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1016\/j.neuroimage.2012.09.065","volume":"65","author":"KR Gray","year":"2013","unstructured":"Gray KR, et al. Random forest-based similarity measures for multi-modal classification of Alzheimer\u2019s disease. Neuroimage. 2013;65:167\u201375. https:\/\/doi.org\/10.1016\/j.neuroimage.2012.09.065.","journal-title":"Neuroimage"},{"key":"720_CR37","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1016\/j.jtbi.2012.10.028","volume":"317","author":"Z Qiu","year":"2013","unstructured":"Qiu Z, Qin C, Jiu M, Wang X. A simple iterative method to optimize protein-ligand-binding residue prediction. J Theor Biol. 2013;317:219\u201323. https:\/\/doi.org\/10.1016\/j.jtbi.2012.10.028.","journal-title":"J Theor Biol"},{"key":"720_CR38","unstructured":"Friedman Jerome, Trevor Hastie, Robert Tibshirani. 2008. The elements of statistical learning preface to the second edition."},{"key":"720_CR39","doi-asserted-by":"publisher","DOI":"10.1155\/2014\/957107","author":"G Sonam","year":"2014","unstructured":"Sonam G, Jamal S, Open source drug discovery consortium, and Vinod Scaria. \u201cCheminformatics models for inhibitors of Schistosoma Mansoni Thioredoxin glutathione reductase.\u201d Sci World J. 2014. https:\/\/doi.org\/10.1155\/2014\/957107.","journal-title":"Sci World J"},{"issue":"4","key":"720_CR40","doi-asserted-by":"publisher","first-page":"2249","DOI":"10.1016\/j.csda.2007.08.015","volume":"52","author":"KJ Archer","year":"2008","unstructured":"Archer KJ, Kimes RV. Empirical characterization of random forest variable importance measures. Comput Stat Data Anal. 2008;52(4):2249\u201360. https:\/\/doi.org\/10.1016\/j.csda.2007.08.015.","journal-title":"Comput Stat Data Anal"},{"key":"720_CR41","doi-asserted-by":"publisher","first-page":"30","DOI":"10.1016\/j.chemolab.2015.07.014","volume":"147","author":"BK Li","year":"2015","unstructured":"Li BK, et al. Modeling, predicting and virtual screening of selective inhibitors of MMP-3 and MMP-9 over MMP-1 using random forest classification. Chemom Intell Lab Syst. 2015;147:30\u201340. https:\/\/doi.org\/10.1016\/j.chemolab.2015.07.014.","journal-title":"Chemom Intell Lab Syst"},{"issue":"1","key":"720_CR42","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1186\/1471-2105-14-329","volume":"14","author":"S Jamal","year":"2013","unstructured":"Jamal S, Scaria V. Cheminformatic models based on machine learning for pyruvate kinase \u0131nhibitors of leishmania mexicana. BMC Bioinformatics. 2013;14(1):329. https:\/\/doi.org\/10.1186\/1471-2105-14-329.","journal-title":"BMC Bioinformatics"},{"key":"720_CR43","doi-asserted-by":"publisher","first-page":"32","DOI":"10.1016\/j.jmgm.2011.10.001","volume":"32","author":"V Kovalishyn","year":"2012","unstructured":"Kovalishyn V, et al. Predictive QSAR modeling of phosphodiesterase 4 inhibitors. J Mol Graph Model. 2012;32:32\u20138. https:\/\/doi.org\/10.1016\/j.jmgm.2011.10.001.","journal-title":"J Mol Graph Model"},{"issue":"8","key":"720_CR44","doi-asserted-by":"publisher","first-page":"e70166","DOI":"10.1371\/journal.pone.0070166","volume":"8","author":"KY Chang","year":"2013","unstructured":"Chang KY, Yang J-R. Analysis and prediction of highly effective antiviral peptides based on random forests. PLoS ONE. 2013;8(8):e70166.","journal-title":"PLoS ONE"},{"key":"720_CR45","doi-asserted-by":"publisher","unstructured":"Metz CE. Basic principles of ROC analysis. Seminars in nuclear medicine. 1978;8(4):283\u2013298. https:\/\/doi.org\/10.1016\/s0001-2998(78)80014-2","DOI":"10.1016\/s0001-2998(78)80014-2"},{"key":"720_CR46","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1016\/j.envsoft.2018.03.003","volume":"104","author":"J Rohmer","year":"2018","unstructured":"Rohmer J, et al. Casting light on forcing and breaching scenarios that lead to marine inundation: combining numerical simulations with a random-forest classification approach. Environ Model Softw. 2018;104:64\u201380. https:\/\/doi.org\/10.1016\/j.envsoft.2018.03.003.","journal-title":"Environ Model Softw"},{"key":"720_CR47","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/sdata.2016.35","volume":"3","author":"AEW Johnson","year":"2016","unstructured":"Johnson AEW, et al. MIMIC-III, a freely accessible critical care database. Sci Data. 2016;3:1.","journal-title":"Sci Data"},{"key":"720_CR48","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.gloplacha.2015.03.001","volume":"129","author":"Q Zhang","year":"2015","unstructured":"Zhang Q, Xiao M, Singh VP. Uncertainty evaluation of copula analysis of hydrological droughts in the east river Basin, China. Global Planet Change. 2015;129:1\u20139. https:\/\/doi.org\/10.1016\/j.gloplacha.2015.03.001.","journal-title":"Global Planet Change"},{"issue":"7","key":"720_CR49","doi-asserted-by":"publisher","first-page":"707","DOI":"10.1007\/BF01709751","volume":"22","author":"J-L Vincent","year":"1996","unstructured":"Vincent J-L, et al. The SOFA (sepsis-related organ failure assessment) score to describe organ dysfunction\/failure. Intensive Care Med. 1996;22(7):707\u201310.","journal-title":"Intensive Care Med"},{"issue":"1","key":"720_CR50","first-page":"1","volume":"2","author":"TA Almeida","year":"2013","unstructured":"Almeida TA, Hidalgo JMG, Hidalgo JMG, Silva TP. Towards SMS spam filtering: results under a new dataset. Int J Informat Secur Sci. 2013;2(1):1\u201318.","journal-title":"Int J Informat Secur Sci"},{"key":"720_CR51","doi-asserted-by":"publisher","unstructured":"TA Almeida, JMG Hidalgo, A Yamakami. 2011. \u201cContributions to the study of SMS spam filtering: new collection and results.\u201d In proceedings of the 2011 ACM symposium on document engineering, Association for Computing Machinery. 259-262. https:\/\/doi.org\/10.1145\/2034691.2034742","DOI":"10.1145\/2034691.2034742"},{"key":"720_CR52","doi-asserted-by":"publisher","unstructured":"Hidalgo JMG, Tiago AA, Akebo Y. 2012. \u201cOn the Validity of a New SMS Spam Collection.\u201d In Proceedings\u20142012 11th International Conference on Machine Learning and Applications, ICMLA. 240\u2013245. https:\/\/doi.org\/10.1109\/ICMLA.2012.211","DOI":"10.1109\/ICMLA.2012.211"},{"key":"720_CR53","doi-asserted-by":"publisher","DOI":"10.1145\/1321440.1321486","author":"GV Cormack","year":"2007","unstructured":"Cormack GV, Mar\u00eda J, S\u00e1nz EP, Hidalgo G. Spam filtering for short messages. Int Conf Informat Knowl Manag Proc. 2007. https:\/\/doi.org\/10.1145\/1321440.1321486.","journal-title":"Int Conf Informat Knowl Manag Proc"},{"key":"720_CR54","doi-asserted-by":"publisher","unstructured":"Hidalgo, Jos\u00e9 Mar\u00eda G\u00f3mez, Guillermo Cajigas Bringas, Enrique Puertas S\u00e1nz, and Francisco Carrero Garc\u00eda. 2006. \u201cContent Based SMS Spam Filtering.\u201d In Proceedings of the 2006 ACM symposium on document engineering, DocEng. 2006, 107\u2013114. https:\/\/doi.org\/10.1145\/1166160.1166191","DOI":"10.1145\/1166160.1166191"},{"key":"720_CR55","unstructured":"\u0130lhan, Zeynep. 2019. \u201cKopula Temelli De\u011fi\u015fken K\u00fcmeleme Tekniklerinin \u0130ncelenmesi ve Mortalite Tahmini Uygulamas\u0131.\u201d PhD Thesis, Eskisehir Osmangazi University."},{"issue":"3","key":"720_CR56","doi-asserted-by":"publisher","first-page":"589","DOI":"10.1016\/j.clinph.2012.09.008","volume":"124","author":"Y Machado-Ferrer","year":"2013","unstructured":"Machado-Ferrer Y, et al. Heart rate variability for assessing comatose patients with different Glasgow coma scale scores. Clin Neurophysiol. 2013;124(3):589\u201397. https:\/\/doi.org\/10.1016\/j.clinph.2012.09.008.","journal-title":"Clin Neurophysiol"},{"issue":"2","key":"720_CR57","doi-asserted-by":"publisher","first-page":"363","DOI":"10.1097\/01.ta.0000196623.48952.0e","volume":"60","author":"WH Cooke","year":"2006","unstructured":"Cooke WH, et al. Heart rate variability and its association with mortality inprehospital trauma patients. J Trauma Injury Infect Crit Care. 2006;60(2):363\u201370. https:\/\/doi.org\/10.1097\/01.ta.0000196623.48952.0e.","journal-title":"J Trauma Injury Infect Crit Care"},{"issue":"1","key":"720_CR58","doi-asserted-by":"publisher","first-page":"2095","DOI":"10.1038\/s41598-020-59044-w","volume":"10","author":"C Wan-Ting","year":"2020","unstructured":"Wan-Ting C, et al. Reverse shock index multiplied by Glasgow coma scale (RSIG) predicts mortality in severe trauma patients with head injury. Sci Rep. 2020;10(1):2095. https:\/\/doi.org\/10.1038\/s41598-020-59044-w.","journal-title":"Sci Rep"},{"issue":"5","key":"720_CR59","doi-asserted-by":"publisher","first-page":"1555","DOI":"10.1016\/j.athoracsur.2004.10.017","volume":"79","author":"K Hekmat","year":"2005","unstructured":"Hekmat K, et al. Daily assessment of organ dysfunction and survival in intensive care unit cardiac surgical patients. Ann Thorac Surg. 2005;79(5):1555\u201362. https:\/\/doi.org\/10.1016\/j.athoracsur.2004.10.017.","journal-title":"Ann Thorac Surg"},{"issue":"1","key":"720_CR60","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13049-016-0246-z","volume":"24","author":"A Hasanin","year":"2016","unstructured":"Hasanin A, et al. Incidence and outcome of cardiac injury in patients with severe head trauma. Scand J Trauma Resusc Emerg Med. 2016;24(1):1\u20136. https:\/\/doi.org\/10.1186\/s13049-016-0246-z.","journal-title":"Scand J Trauma Resusc Emerg Med"},{"issue":"3","key":"720_CR61","doi-asserted-by":"publisher","first-page":"215","DOI":"10.14744\/anatoljcardiol.2017.7716","volume":"18","author":"B Kaz\u0131m","year":"2017","unstructured":"Kaz\u0131m B, et al. Changes in neutrophil-to-lymphocyte ratios in postcardiac arrest patients treated with targeted temperature management. Anatol J Cardiol. 2017;18(3):215\u201322. https:\/\/doi.org\/10.14744\/anatoljcardiol.2017.7716.","journal-title":"Anatol J Cardiol"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-023-00720-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-023-00720-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-023-00720-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,3,30]],"date-time":"2023-03-30T13:22:07Z","timestamp":1680182527000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-023-00720-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,30]]},"references-count":61,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["720"],"URL":"https:\/\/doi.org\/10.1186\/s40537-023-00720-9","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,30]]},"assertion":[{"value":"27 April 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 March 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 March 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"38"}}