{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,16]],"date-time":"2026-05-16T21:10:59Z","timestamp":1778965859962,"version":"3.51.4"},"reference-count":79,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,6,18]],"date-time":"2024-06-18T00:00:00Z","timestamp":1718668800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,6,18]],"date-time":"2024-06-18T00:00:00Z","timestamp":1718668800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Deraya University"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Hepatocellular carcinoma (HCC) is a highly prevalent form of liver cancer that necessitates accurate prediction models for early diagnosis and effective treatment. Machine learning algorithms have demonstrated promising results in various medical domains, including cancer prediction. In this study, we propose a comprehensive approach for HCC prediction by comparing the performance of different machine learning algorithms before and after applying feature reduction methods. We employ popular feature reduction techniques, such as weighting features, hidden features correlation, feature selection, and optimized selection, to extract a reduced feature subset that captures the most relevant information related to HCC. Subsequently, we apply multiple algorithms, including Naive Bayes, support vector machines (SVM), Neural Networks, Decision Tree, and K nearest neighbors (KNN), to both the original high-dimensional dataset and the reduced feature set. By comparing the predictive accuracy, precision, F Score, recall, and execution time of each algorithm, we assess the effectiveness of feature reduction in enhancing the performance of HCC prediction models. Our experimental results, obtained using a comprehensive dataset comprising clinical features of HCC patients, demonstrate that feature reduction significantly improves the performance of all examined algorithms. Notably, the reduced feature set consistently outperforms the original high-dimensional dataset in terms of prediction accuracy and execution time. After applying feature reduction techniques, the employed algorithms, namely decision trees, Naive Bayes, KNN, neural networks, and SVM achieved accuracies of 96%, 97.33%, 94.67%, 96%, and 96.00%, respectively.<\/jats:p>","DOI":"10.1186\/s40537-024-00944-3","type":"journal-article","created":{"date-parts":[[2024,6,18]],"date-time":"2024-06-18T13:03:52Z","timestamp":1718715832000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":201,"title":["Feature reduction for hepatocellular carcinoma prediction using machine learning algorithms"],"prefix":"10.1186","volume":"11","author":[{"given":"Ghada","family":"Mostafa","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hamdi","family":"Mahmoud","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tarek","family":"Abd El-Hafeez","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mohamed E.","family":"ElAraby","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,6,18]]},"reference":[{"issue":"2","key":"944_CR1","doi-asserted-by":"publisher","first-page":"87","DOI":"10.3322\/caac.21262","volume":"65","author":"LA Torre","year":"2015","unstructured":"Torre LA, et al. Global cancer statistics, 2012. CA Cancer J Clin. 2015;65(2):87\u2013108.","journal-title":"CA Cancer J Clin"},{"issue":"1","key":"944_CR2","doi-asserted-by":"publisher","first-page":"446","DOI":"10.1038\/s41467-017-02733-4","volume":"9","author":"D DeWaal","year":"2018","unstructured":"DeWaal D, et al. Hexokinase-2 depletion inhibits glycolysis and induces oxidative phosphorylation in hepatocellular carcinoma and sensitizes to metformin. Nat Commun. 2018;9(1):446.","journal-title":"Nat Commun"},{"key":"944_CR3","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1016\/j.jbi.2015.09.012","volume":"58","author":"MS Santos","year":"2015","unstructured":"Santos MS, et al. A new cluster-based oversampling method for improving survival prediction of hepatocellular carcinoma patients. J Biomed Inform. 2015;58:49\u201359.","journal-title":"J Biomed Inform"},{"issue":"5","key":"944_CR4","doi-asserted-by":"publisher","first-page":"345","DOI":"10.1016\/j.irbm.2020.04.003","volume":"42","author":"L Ali","year":"2021","unstructured":"Ali L, Bukhari S. An approach based on mutually informed neural networks to optimize the generalization capabilities of decision support systems developed for heart failure prediction. Irbm. 2021;42(5):345\u201352.","journal-title":"Irbm"},{"key":"944_CR5","doi-asserted-by":"publisher","first-page":"116","DOI":"10.1016\/j.cogsys.2018.12.001","volume":"54","author":"W Ksi\u0105\u017cek","year":"2019","unstructured":"Ksi\u0105\u017cek W, et al. A novel machine learning approach for early detection of hepatocellular carcinoma patients. Cogn Syst Res. 2019;54:116\u201327.","journal-title":"Cogn Syst Res"},{"key":"944_CR6","doi-asserted-by":"crossref","unstructured":"Ali L et al. A multi-model framework for evaluating type of speech samples having complementary information about Parkinson\u2019s disease. In: 2019 International conference on electrical, communication, and computer engineering (ICECCE). IEEE; 2019.","DOI":"10.1109\/ICECCE47252.2019.8940696"},{"key":"944_CR7","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1016\/j.patrec.2018.11.004","volume":"132","author":"M Abdar","year":"2020","unstructured":"Abdar M, et al. A new nested ensemble technique for automated diagnosis of breast cancer. Pattern Recogn Lett. 2020;132:123\u201331.","journal-title":"Pattern Recogn Lett"},{"issue":"4","key":"944_CR8","doi-asserted-by":"publisher","first-page":"1476","DOI":"10.1016\/j.eswa.2013.08.044","volume":"41","author":"B Zheng","year":"2014","unstructured":"Zheng B, Yoon SW, Lam SS. Breast cancer diagnosis based on feature extraction using a hybrid of K-means and support vector machine algorithms. Expert Syst Appl. 2014;41(4):1476\u201382.","journal-title":"Expert Syst Appl"},{"issue":"1","key":"944_CR9","doi-asserted-by":"publisher","first-page":"173","DOI":"10.1109\/JBHI.2017.2655720","volume":"22","author":"J Shi","year":"2017","unstructured":"Shi J, et al. Multimodal neuroimaging feature learning with multimodal stacked deep polynomial networks for diagnosis of Alzheimer\u2019s disease. IEEE J Biomed Health Inform. 2017;22(1):173\u201383.","journal-title":"IEEE J Biomed Health Inform"},{"key":"944_CR10","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1016\/j.knosys.2018.04.031","volume":"153","author":"X Zhi","year":"2018","unstructured":"Zhi X, et al. Efficient discriminative clustering via QR decomposition-based linear discriminant analysis. Knowl-Based Syst. 2018;153:117\u201332.","journal-title":"Knowl-Based Syst"},{"key":"944_CR11","doi-asserted-by":"crossref","unstructured":"Ali L et al. Early detection of heart failure by reducing the time complexity of the machine learning based predictive model. In: 2019 international conference on electrical, communication, and computer engineering (ICECCE). IEEE; 2019.","DOI":"10.1109\/ICECCE47252.2019.8940737"},{"issue":"5","key":"944_CR12","doi-asserted-by":"publisher","first-page":"424","DOI":"10.3748\/wjg.v30.i5.424","volume":"30","author":"A Ravikulan","year":"2024","unstructured":"Ravikulan A, Rostami K. Leveraging machine learning for early recurrence prediction in hepatocellular carcinoma: a step towards precision medicine. World J Gastroenterol. 2024;30(5):424.","journal-title":"World J Gastroenterol"},{"issue":"1","key":"944_CR13","doi-asserted-by":"publisher","first-page":"147","DOI":"10.5009\/gnl220386","volume":"18","author":"H Hong","year":"2024","unstructured":"Hong H, et al. Prediction of hepatocellular carcinoma development in Korean patients after hepatitis C cure with direct-acting antivirals. Gut and Liver. 2024;18(1):147.","journal-title":"Gut and Liver"},{"issue":"6","key":"944_CR14","doi-asserted-by":"publisher","first-page":"850","DOI":"10.1016\/j.jvir.2018.01.769","volume":"29","author":"A Abajian","year":"2018","unstructured":"Abajian A, et al. Predicting treatment response to intra-arterial therapies for hepatocellular carcinoma with the use of supervised machine learning\u2014an artificial intelligence concept. J Vasc Intervent Radiol. 2018;29(6):850\u20137.","journal-title":"J Vasc Intervent Radiol"},{"issue":"9","key":"944_CR15","doi-asserted-by":"publisher","first-page":"e2015626","DOI":"10.1001\/jamanetworkopen.2020.15626","volume":"3","author":"GN Ioannou","year":"2020","unstructured":"Ioannou GN, et al. Assessment of a deep learning model to predict hepatocellular carcinoma in patients with hepatitis C cirrhosis. JAMA Netw Open. 2020;3(9):e2015626\u2013e2015626.","journal-title":"JAMA Netw Open"},{"issue":"6","key":"944_CR16","doi-asserted-by":"publisher","DOI":"10.1016\/j.jhepr.2020.100175","volume":"2","author":"JY Nam","year":"2020","unstructured":"Nam JY, et al. Deep learning model for prediction of hepatocellular carcinoma in patients with HBV-related cirrhosis on antiviral therapy. JHEP Rep. 2020;2(6): 100175.","journal-title":"JHEP Rep"},{"issue":"10","key":"944_CR17","doi-asserted-by":"publisher","first-page":"2791","DOI":"10.3390\/cancers12102791","volume":"12","author":"JY Nam","year":"2020","unstructured":"Nam JY, et al. Novel model to predict HCC recurrence after liver transplantation obtained using deep learning: a multicenter study. Cancers. 2020;12(10):2791.","journal-title":"Cancers"},{"issue":"13","key":"944_CR18","doi-asserted-by":"publisher","first-page":"6427","DOI":"10.3390\/app12136427","volume":"12","author":"MA Ali","year":"2022","unstructured":"Ali MA, et al. A novel method for survival prediction of hepatocellular carcinoma using feature-selection techniques. Appl Sci. 2022;12(13):6427.","journal-title":"Appl Sci"},{"issue":"2","key":"944_CR19","doi-asserted-by":"publisher","DOI":"10.1002\/widm.1390","volume":"11","author":"Y Cao","year":"2021","unstructured":"Cao Y, et al. Prediction model for recurrence of hepatocellular carcinoma after resection by using neighbor2vec based algorithms. Wiley Interdiscip R Data Min Knowl Discov. 2021;11(2): e1390.","journal-title":"Wiley Interdiscip R Data Min Knowl Discov"},{"issue":"1","key":"944_CR20","doi-asserted-by":"publisher","first-page":"134","DOI":"10.1002\/jmri.27538","volume":"54","author":"Y Zhang","year":"2021","unstructured":"Zhang Y, et al. Deep learning with 3D convolutional neural network for noninvasive prediction of microvascular invasion in hepatocellular carcinoma. J Magn Reson Imaging. 2021;54(1):134\u201343.","journal-title":"J Magn Reson Imaging"},{"issue":"43","key":"944_CR21","doi-asserted-by":"publisher","first-page":"5804","DOI":"10.3748\/wjg.v29.i43.5804","volume":"29","author":"Y-B Zhang","year":"2023","unstructured":"Zhang Y-B, et al. Development of a machine learning-based model for predicting risk of early postoperative recurrence of hepatocellular carcinoma. World J Gastroenterol. 2023;29(43):5804.","journal-title":"World J Gastroenterol"},{"issue":"2","key":"944_CR22","doi-asserted-by":"publisher","DOI":"10.1002\/ctm2.102","volume":"10","author":"H Liao","year":"2020","unstructured":"Liao H, et al. Deep learning-based classification and mutation prediction from histopathological images of hepatocellular carcinoma. Clin Transl Med. 2020;10(2): e102.","journal-title":"Clin Transl Med"},{"key":"944_CR23","doi-asserted-by":"publisher","first-page":"2408348","DOI":"10.1155\/2019\/2408348","volume":"2019","author":"Z Deng","year":"2019","unstructured":"Deng Z, et al. Mining TCGA database for tumor microenvironment-related genes of prognostic value in hepatocellular carcinoma. BioMed Res Int. 2019;2019:2408348.","journal-title":"BioMed Res Int"},{"issue":"1","key":"944_CR24","doi-asserted-by":"publisher","first-page":"675","DOI":"10.18632\/aging.202173","volume":"13","author":"K Wang","year":"2021","unstructured":"Wang K, et al. A novel immune-related genes prognosis biomarker for hepatocellular carcinoma. Aging (Albany NY). 2021;13(1):675.","journal-title":"Aging (Albany NY)"},{"issue":"9","key":"944_CR25","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0202685","volume":"13","author":"CA Bannister","year":"2018","unstructured":"Bannister CA, et al. A genetic programming approach to development of clinical prediction models: a case study in symptomatic cardiovascular disease. PLoS ONE. 2018;13(9): e0202685.","journal-title":"PLoS ONE"},{"issue":"46","key":"944_CR26","doi-asserted-by":"publisher","DOI":"10.1097\/MD.0000000000017510","volume":"98","author":"Y Dong","year":"2019","unstructured":"Dong Y, et al. A novel surgical predictive model for Chinese Crohn\u2019s disease patients. Medicine. 2019;98(46): e17510.","journal-title":"Medicine"},{"issue":"11","key":"944_CR27","doi-asserted-by":"publisher","first-page":"1764","DOI":"10.1016\/j.spinee.2019.06.002","volume":"19","author":"AV Karhade","year":"2019","unstructured":"Karhade AV, et al. Development of machine learning algorithms for prediction of prolonged opioid prescription after surgery for lumbar disc herniation. Spine J. 2019;19(11):1764\u201371.","journal-title":"Spine J"},{"issue":"6","key":"944_CR28","doi-asserted-by":"publisher","first-page":"736","DOI":"10.3171\/2016.10.SPINE16197","volume":"26","author":"JK Scheer","year":"2017","unstructured":"Scheer JK, et al. Development of a preoperative predictive model for major complications following adult spinal deformity surgery. J Neurosurg Spine. 2017;26(6):736\u201343.","journal-title":"J Neurosurg Spine"},{"key":"944_CR29","doi-asserted-by":"publisher","first-page":"1642","DOI":"10.1109\/ACCESS.2016.2552478","volume":"4","author":"S Adams","year":"2016","unstructured":"Adams S, Beling PA, Cogill R. Feature selection for hidden Markov models and hidden semi-Markov models. IEEE Access. 2016;4:1642\u201357.","journal-title":"IEEE Access"},{"key":"944_CR30","doi-asserted-by":"crossref","unstructured":"Bjaoui M et al. Depth insight for data scientist with RapidMiner \u00aban innovative tool for AI and big data towards medical applications\u00bb. In: Proceedings of the 2nd international conference on digital tools & uses congress; 2020.","DOI":"10.1145\/3423603.3424059"},{"key":"944_CR31","doi-asserted-by":"crossref","unstructured":"Roy SP, Kasat A. Diabetic prediction with ensemble model and feature selection using information gain method. In: 2024 2nd international conference on intelligent data communication technologies and internet of things (IDCIoT). IEEE; 2024.","DOI":"10.1109\/IDCIoT59759.2024.10467649"},{"key":"944_CR32","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.122490","volume":"239","author":"IK Ihianle","year":"2024","unstructured":"Ihianle IK, et al. Minimising redundancy, maximising relevance: HRV feature selection for stress classification. Expert Syst Appl. 2024;239: 122490.","journal-title":"Expert Syst Appl"},{"key":"944_CR33","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1023\/A:1025667309714","volume":"53","author":"M Robnik-\u0160ikonja","year":"2003","unstructured":"Robnik-\u0160ikonja M, Kononenko I. Theoretical and empirical analysis of ReliefF and RReliefF. Mach Learn. 2003;53:23\u201369.","journal-title":"Mach Learn"},{"issue":"5","key":"944_CR34","doi-asserted-by":"publisher","first-page":"6637","DOI":"10.3233\/JIFS-179743","volume":"38","author":"AK Shukla","year":"2020","unstructured":"Shukla AK, et al. Knowledge discovery in medical and biological datasets by integration of Relief-F and correlation feature selection techniques. J Intell Fuzzy Syst. 2020;38(5):6637\u201348.","journal-title":"J Intell Fuzzy Syst"},{"key":"944_CR35","first-page":"1","volume":"2018","author":"AU Haq","year":"2018","unstructured":"Haq AU, et al. A hybrid intelligent system framework for the prediction of heart disease using machine learning algorithms. Mob Inf Syst. 2018;2018:1\u201321.","journal-title":"Mob Inf Syst"},{"issue":"3","key":"944_CR36","doi-asserted-by":"publisher","first-page":"1575","DOI":"10.1007\/s10115-023-02010-5","volume":"66","author":"D Theng","year":"2024","unstructured":"Theng D, Bhoyar KK. Feature selection techniques for machine learning: a survey of more than two decades of research. Knowl Inf Syst. 2024;66(3):1575\u2013637.","journal-title":"Knowl Inf Syst"},{"key":"944_CR37","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2024.111380","volume":"286","author":"J Gao","year":"2024","unstructured":"Gao J, et al. Information gain ratio-based subfeature grouping empowers particle swarm optimization for feature selection. Knowl-Based Syst. 2024;286: 111380.","journal-title":"Knowl-Based Syst"},{"key":"944_CR38","doi-asserted-by":"publisher","first-page":"1339","DOI":"10.1007\/s11063-020-10307-7","volume":"52","author":"X Wang","year":"2020","unstructured":"Wang X, Yan Y, Ma X. Feature selection method based on differential correlation information entropy. Neural Process Lett. 2020;52:1339\u201358.","journal-title":"Neural Process Lett"},{"key":"944_CR39","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2019.105524","volume":"97","author":"D Singh","year":"2020","unstructured":"Singh D, Singh B. Investigating the impact of data normalization on classification performance. Appl Soft Comput. 2020;97: 105524.","journal-title":"Appl Soft Comput"},{"key":"944_CR40","doi-asserted-by":"crossref","unstructured":"Raju VG et al. Study the influence of normalization\/transformation process on the accuracy of supervised classification. In: 2020 third international conference on smart systems and inventive technology (ICSSIT). IEEE; 2020.","DOI":"10.1109\/ICSSIT48917.2020.9214160"},{"key":"944_CR41","doi-asserted-by":"publisher","first-page":"153303382412347","DOI":"10.1177\/15330338241234791","volume":"23","author":"S Zhou","year":"2024","unstructured":"Zhou S, et al. Breast cancer prediction based on multiple machine learning algorithms. Technol Cancer Res Treat. 2024;23:15330338241234792.","journal-title":"Technol Cancer Res Treat"},{"issue":"5","key":"944_CR42","doi-asserted-by":"publisher","first-page":"563","DOI":"10.1016\/S0167-8655(00)00112-4","volume":"22","author":"S Aksoy","year":"2001","unstructured":"Aksoy S, Haralick RM. Feature normalization and likelihood-based similarity measures for image retrieval. Pattern Recogn Lett. 2001;22(5):563\u201382.","journal-title":"Pattern Recogn Lett"},{"key":"944_CR43","doi-asserted-by":"publisher","DOI":"10.1016\/j.flowmeasinst.2024.102541","volume":"96","author":"W Ajbar","year":"2024","unstructured":"Ajbar W, et al. Development of artificial neural networks for the prediction of the pressure field along a horizontal pipe conveying high-viscosity two-phase flow. Flow Meas Instrum. 2024;96: 102541.","journal-title":"Flow Meas Instrum"},{"key":"944_CR44","unstructured":"Hsu CW, Chang CC, Lin CJ. A practical guide to support vector classification, Taipei, Taiwan; 2003."},{"issue":"1","key":"944_CR45","doi-asserted-by":"publisher","first-page":"196","DOI":"10.1007\/s42979-023-02529-y","volume":"5","author":"G Parashar","year":"2024","unstructured":"Parashar G, Chaudhary A, Pandey D. Machine learning for prediction of cardiovascular disease and respiratory disease: a review. SN Comput Sci. 2024;5(1):196.","journal-title":"SN Comput Sci"},{"issue":"1","key":"944_CR46","first-page":"1793","volume":"3","author":"T Jayalakshmi","year":"2011","unstructured":"Jayalakshmi T, Santhakumaran A. Statistical normalization and back propagation for classification. Int J Comput Theory Eng. 2011;3(1):1793\u20138201.","journal-title":"Int J Comput Theory Eng"},{"issue":"3","key":"944_CR47","doi-asserted-by":"publisher","first-page":"449","DOI":"10.1109\/TITB.2011.2119322","volume":"15","author":"UR Acharya","year":"2011","unstructured":"Acharya UR, et al. Automated diagnosis of glaucoma using texture and higher order spectra features. IEEE Trans Inf Technol Biomed. 2011;15(3):449\u201355.","journal-title":"IEEE Trans Inf Technol Biomed"},{"issue":"3","key":"944_CR48","doi-asserted-by":"publisher","first-page":"450","DOI":"10.1109\/TPAMI.2005.57","volume":"27","author":"R Snelick","year":"2005","unstructured":"Snelick R, et al. Large-scale evaluation of multimodal biometric authentication using state-of-the-art systems. IEEE Trans Pattern Anal Mach Intell. 2005;27(3):450\u20135.","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"3","key":"944_CR49","first-page":"508","volume":"25","author":"X Wen","year":"2014","unstructured":"Wen X, et al. Efficient feature selection and classification for vehicle detection. IEEE Trans Circuits Syst Video Technol. 2014;25(3):508\u201317.","journal-title":"IEEE Trans Circuits Syst Video Technol"},{"issue":"3","key":"944_CR50","doi-asserted-by":"publisher","first-page":"818","DOI":"10.1109\/TMECH.2013.2260865","volume":"19","author":"ET Esfahani","year":"2013","unstructured":"Esfahani ET, Wang S, Sundararajan V. Multisensor wireless system for eccentricity and bearing fault detection in induction motors. IEEE\/ASME Trans Mechatron. 2013;19(3):818\u201326.","journal-title":"IEEE\/ASME Trans Mechatron"},{"key":"944_CR51","unstructured":"Pan J, Zhuang Y, Fong S. The impact of data normalization on stock market prediction: using SVM and technical indicators. In: Soft computing in data science: second international conference, SCDS 2016, Kuala Lumpur, Malaysia, September 21\u201322, 2016, Proceedings 2. Springer; 2016."},{"key":"944_CR52","unstructured":"Kadir A et al. Leaf classification using shape, color, and texture features; 2013. arXiv preprint arXiv:1401.4447."},{"issue":"3","key":"944_CR53","doi-asserted-by":"publisher","first-page":"5900","DOI":"10.1016\/j.eswa.2008.07.026","volume":"36","author":"C-M Wang","year":"2009","unstructured":"Wang C-M, Huang Y-F. Evolutionary-based feature selection approaches with new criteria for data mining: a case study of credit approval data. Expert Syst Appl. 2009;36(3):5900\u20138.","journal-title":"Expert Syst Appl"},{"key":"944_CR54","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1471-2105-6-191","volume":"6","author":"W Wu","year":"2005","unstructured":"Wu W, et al. Evaluation of normalization methods for cDNA microarray data by k-NN classification. BMC Bioinform. 2005;6:1\u201321.","journal-title":"BMC Bioinform"},{"key":"944_CR55","doi-asserted-by":"publisher","first-page":"256","DOI":"10.1016\/j.proenv.2011.12.040","volume":"11","author":"Z Liu","year":"2011","unstructured":"Liu Z. A method of SVM with normalization in intrusion detection. Procedia Environ Sci. 2011;11:256\u201362.","journal-title":"Procedia Environ Sci"},{"key":"944_CR56","doi-asserted-by":"crossref","unstructured":"Su D et al. Anomadroid: profiling android applications\u2019 behaviors for identifying unknown malapps. In: 2016 IEEE Trustcom\/BigDataSE\/ISPA. IEEE; 2016.","DOI":"10.1109\/TrustCom.2016.0127"},{"issue":"1","key":"944_CR57","doi-asserted-by":"publisher","first-page":"310","DOI":"10.32614\/RJ-2021-041","volume":"13","author":"RA Peterson","year":"2021","unstructured":"Peterson RA. Finding optimal normalizing transformations via best normalize. R Journal. 2021;13(1):310\u201329.","journal-title":"R Journal"},{"key":"944_CR58","doi-asserted-by":"publisher","first-page":"66989","DOI":"10.1109\/ACCESS.2020.2986232","volume":"8","author":"IM El-Hasnony","year":"2020","unstructured":"El-Hasnony IM, et al. Improved feature selection model for big data analytics. IEEE Access. 2020;8:66989\u20137004.","journal-title":"IEEE Access"},{"issue":"9","key":"944_CR59","doi-asserted-by":"publisher","first-page":"9573","DOI":"10.1109\/TCYB.2021.3061152","volume":"52","author":"X-F Song","year":"2021","unstructured":"Song X-F, et al. A fast hybrid feature selection based on correlation-guided clustering and particle swarm optimization for high-dimensional data. IEEE Trans Cybern. 2021;52(9):9573\u201386.","journal-title":"IEEE Trans Cybern"},{"issue":"23","key":"944_CR60","doi-asserted-by":"publisher","first-page":"2984","DOI":"10.3390\/electronics10232984","volume":"10","author":"M Mohamad","year":"2021","unstructured":"Mohamad M, et al. Enhancing big data feature selection using a hybrid correlation-based feature selection. Electronics. 2021;10(23):2984.","journal-title":"Electronics"},{"issue":"4","key":"944_CR61","first-page":"1060","volume":"34","author":"UM Khaire","year":"2022","unstructured":"Khaire UM, Dhanalakshmi R. Stability of feature selection algorithm: a review. J King Saud Univ Comput Inf Sci. 2022;34(4):1060\u201373.","journal-title":"J King Saud Univ Comput Inf Sci"},{"key":"944_CR62","doi-asserted-by":"crossref","unstructured":"Camattari F et al. Greedy feature selection: Classifier-dependent feature selection via greedy methods. arXiv preprint arXiv:2403.05138; 2024.","DOI":"10.1007\/s11222-024-10460-2"},{"key":"944_CR63","doi-asserted-by":"publisher","DOI":"10.1007\/s13042-024-02098-3","author":"W Chen","year":"2024","unstructured":"Chen W, Sun X. Dynamic multi-label feature selection algorithm based on label importance and label correlation. Int J Mach Learn Cybern. 2024. https:\/\/doi.org\/10.1007\/s13042-024-02098-3.","journal-title":"Int J Mach Learn Cybern"},{"key":"944_CR64","doi-asserted-by":"publisher","DOI":"10.1007\/s13369-024-08776-2","author":"M Habib","year":"2024","unstructured":"Habib M, Okayli M. Evaluating the sensitivity of machine learning models to data preprocessing technique in concrete compressive strength estimation. Arab J Sci Eng. 2024. https:\/\/doi.org\/10.1007\/s13369-024-08776-2.","journal-title":"Arab J Sci Eng"},{"issue":"1","key":"944_CR65","doi-asserted-by":"publisher","first-page":"264","DOI":"10.1186\/s12864-024-10160-1","volume":"25","author":"M Peng","year":"2024","unstructured":"Peng M, et al. scFSNN: a feature selection method based on neural network for single-cell RNA-seq data. BMC Genomics. 2024;25(1):264.","journal-title":"BMC Genomics"},{"key":"944_CR66","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1016\/j.inffus.2020.01.005","volume":"59","author":"S Ayesha","year":"2020","unstructured":"Ayesha S, Hanif MK, Talib R. Overview and comparative study of dimensionality reduction techniques for high dimensional data. Inf Fus. 2020;59:44\u201358.","journal-title":"Inf Fus"},{"key":"944_CR67","doi-asserted-by":"publisher","first-page":"3473","DOI":"10.1007\/s10462-020-09928-0","volume":"54","author":"P Ray","year":"2021","unstructured":"Ray P, Reddy SS, Banerjee T. Various dimension reduction techniques for high dimensional data analysis: a review. Artif Intell Rev. 2021;54:3473\u2013515.","journal-title":"Artif Intell Rev"},{"issue":"2","key":"944_CR68","doi-asserted-by":"publisher","first-page":"56","DOI":"10.38094\/jastt1224","volume":"1","author":"R Zebari","year":"2020","unstructured":"Zebari R, et al. A comprehensive review of dimensionality reduction techniques for feature selection and feature extraction. J Appl Sci Technol Trends. 2020;1(2):56\u201370.","journal-title":"J Appl Sci Technol Trends"},{"issue":"12","key":"944_CR69","doi-asserted-by":"publisher","first-page":"5763","DOI":"10.1007\/s12652-021-03256-z","volume":"13","author":"M Sharma","year":"2022","unstructured":"Sharma M, Kumar N. Improved hepatocellular carcinoma fatality prognosis using ensemble learning approach. J Ambient Intell Humaniz Comput. 2022;13(12):5763\u201377.","journal-title":"J Ambient Intell Humaniz Comput"},{"issue":"6","key":"944_CR70","doi-asserted-by":"publisher","DOI":"10.1002\/cnm.3599","volume":"38","author":"W Ksi\u0105\u017cek","year":"2022","unstructured":"Ksi\u0105\u017cek W, Turza F, P\u0142awiak P. NCA-GA-SVM: a new two-level feature selection method based on neighborhood component analysis and genetic algorithm in hepatocellular carcinoma fatality prognosis. Int J Numer Methods Biomed Eng. 2022;38(6): e3599.","journal-title":"Int J Numer Methods Biomed Eng"},{"issue":"1","key":"944_CR71","doi-asserted-by":"publisher","first-page":"248","DOI":"10.1186\/s12859-022-04805-9","volume":"23","author":"B Cheng","year":"2022","unstructured":"Cheng B, Zhou P, Chen Y. Machine-learning algorithms based on personalized pathways for a novel predictive model for the diagnosis of hepatocellular carcinoma. BMC Bioinform. 2022;23(1):248.","journal-title":"BMC Bioinform"},{"issue":"1","key":"944_CR72","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1038\/s41746-020-0232-8","volume":"3","author":"A Kiani","year":"2020","unstructured":"Kiani A, et al. Impact of a deep learning assistant on the histopathologic classification of liver cancer. NPJ Dig Med. 2020;3(1):23.","journal-title":"NPJ Dig Med"},{"issue":"12","key":"944_CR73","doi-asserted-by":"publisher","first-page":"3562","DOI":"10.3390\/cancers12123562","volume":"12","author":"H Wang","year":"2020","unstructured":"Wang H, et al. Single-cell spatial analysis of tumor and immune microenvironment on whole-slide image reveals hepatocellular carcinoma subtypes. Cancers. 2020;12(12):3562.","journal-title":"Cancers"},{"key":"944_CR74","doi-asserted-by":"publisher","DOI":"10.1155\/2022\/2014625","author":"W Deng","year":"2022","unstructured":"Deng W, et al. Classification and prognostic characteristics of hepatocellular carcinoma based on glycolysis cholesterol synthesis axis. J Oncol. 2022. https:\/\/doi.org\/10.1155\/2022\/2014625.","journal-title":"J Oncol"},{"issue":"5","key":"944_CR75","doi-asserted-by":"publisher","DOI":"10.1002\/jcla.24377","volume":"36","author":"D Cheng","year":"2022","unstructured":"Cheng D, et al. Identification and construction of a 13-gene risk model for prognosis prediction in hepatocellular carcinoma patients. J Clin Lab Anal. 2022;36(5): e24377.","journal-title":"J Clin Lab Anal"},{"issue":"1","key":"944_CR76","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-021-81506-y","volume":"11","author":"R Yamashita","year":"2021","unstructured":"Yamashita R, et al. Deep learning predicts postsurgical recurrence of hepatocellular carcinoma from digital histopathologic images. Sci Rep. 2021;11(1):1\u201314.","journal-title":"Sci Rep"},{"issue":"6","key":"944_CR77","doi-asserted-by":"publisher","first-page":"2000","DOI":"10.1002\/hep.31207","volume":"72","author":"C Saillard","year":"2020","unstructured":"Saillard C, et al. Predicting survival after hepatocellular carcinoma resection using deep learning on histological slides. Hepatology. 2020;72(6):2000\u201313.","journal-title":"Hepatology"},{"key":"944_CR78","doi-asserted-by":"publisher","DOI":"10.1155\/2021\/5212953","author":"S Tohme","year":"2021","unstructured":"Tohme S, et al. The use of machine learning to create a risk score to predict survival in patients with hepatocellular carcinoma: a TCGA cohort analysis. Can J Gastroenterol Hepatol. 2021. https:\/\/doi.org\/10.1155\/2021\/5212953.","journal-title":"Can J Gastroenterol Hepatol"},{"issue":"5","key":"944_CR79","doi-asserted-by":"publisher","first-page":"951","DOI":"10.1136\/gutjnl-2020-320930","volume":"70","author":"J-Y Shi","year":"2021","unstructured":"Shi J-Y, et al. Exploring prognostic indicators in the pathological images of hepatocellular carcinoma based on deep learning. Gut. 2021;70(5):951\u201361.","journal-title":"Gut"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-024-00944-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-024-00944-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-024-00944-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,22]],"date-time":"2024-11-22T03:11:47Z","timestamp":1732245107000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-024-00944-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,18]]},"references-count":79,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["944"],"URL":"https:\/\/doi.org\/10.1186\/s40537-024-00944-3","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,6,18]]},"assertion":[{"value":"27 February 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 May 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 June 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"This article does not contain any studies with human participants or animals performed by any of the authors.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"All authors have read and agreed to the published version of the manuscript.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"88"}}