{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,16]],"date-time":"2026-02-16T10:08:58Z","timestamp":1771236538622,"version":"3.50.1"},"reference-count":31,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2022,2,8]],"date-time":"2022-02-08T00:00:00Z","timestamp":1644278400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["81701794"],"award-info":[{"award-number":["81701794"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>Early diagnosis of cancer is beneficial in the formulation of the best treatment plan; it can improve the survival rate and the quality of patient life. However, imaging detection and needle biopsy usually used not only find it difficult to effectively diagnose tumors at early stage, but also do great harm to the human body. Since the changes in a patient\u2019s health status will cause changes in blood protein indexes, if cancer can be diagnosed by the changes in blood indexes in the early stage of cancer, it can not only conveniently track and detect the treatment process of cancer, but can also reduce the pain of patients and reduce the costs. In this paper, 39 serum protein markers were taken as research objects. The difference of the entropies of serum protein marker sequences in different types of patients was analyzed, and based on this, a cost-sensitive analysis model was established for the purpose of improving the accuracy of cancer recognition. 
The results showed that there were significant differences in entropy of different cancer patients, and the complexity of serum protein markers in normal people was higher than that in cancer patients. Although the dataset was rather imbalanced, containing 897 instances, including 799 normal instances, 44 liver cancer instances, and 54 ovarian cancer instances, the accuracy of our model still reached 95.21%. Other evaluation indicators were also stable and satisfactory; precision, recall, F1 and AUC reach 0.807, 0.833, 0.819 and 0.92, respectively. This study has certain theoretical and practical significance for cancer prediction and clinical application and can also provide a research basis for the intelligent medical treatment.<\/jats:p>","DOI":"10.3390\/e24020253","type":"journal-article","created":{"date-parts":[[2022,2,9]],"date-time":"2022-02-09T01:03:40Z","timestamp":1644368620000},"page":"253","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":20,"title":["Cost-Sensitive KNN Algorithm for Cancer Prediction Based on Entropy Analysis"],"prefix":"10.3390","volume":"24","author":[{"given":"Chaohong","family":"Song","sequence":"first","affiliation":[{"name":"Department of Mathematics and Statistics, Huazhong Agricultural University, Wuhan 430070, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5678-6829","authenticated-orcid":false,"given":"Xinran","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Statistics, Huazhong Agricultural University, Wuhan 430070, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,2,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Alwohaibi, M., Alzaqebah, M., Alotaibi, N.M., Alzahrania, A.M., and Zouchab, M. (2021). A hybrid multi-stage learning technique based on brain storming optimization algorithm for breast cancer recurrence prediction. J. King Saud Univ. 
Sci.","DOI":"10.1016\/j.jksuci.2021.05.004"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1016\/j.cca.2018.12.028","article-title":"Blood-based protein biomarkers in breast cancer","volume":"490","year":"2019","journal-title":"Clin. Chim. Acta"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"926","DOI":"10.1126\/science.aar3247","article-title":"Detection and localization of surgically resectable cancers with a multi-analyte blood test","volume":"359","author":"Cohen","year":"2018","journal-title":"Science"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"15552","DOI":"10.1038\/s41598-020-72510-9","article-title":"Quantitative proteomics identifies a plasma multi protein model for detection of hepatocellular carcinoma","volume":"10","author":"Du","year":"2020","journal-title":"Sci. Rep."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1016\/j.csbj.2014.11.005","article-title":"Machine learning applications in cancer prognosis and prediction","volume":"13","author":"Konstantina","year":"2015","journal-title":"Comput. Struct. Biotechnol."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"7402","DOI":"10.1038\/s41598-017-07408-0","article-title":"Machine Learning Applications for Prediction of Relapse in Childhood Acute Lymphoblastic Leukemia","volume":"7","author":"Pan","year":"2017","journal-title":"Sci. Rep."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"52.1","DOI":"10.1145\/2988544","article-title":"Predicting Breast Cancer Recurrence using Machine Learning Techniques: A Systematic Review","volume":"49","author":"Abreu","year":"2017","journal-title":"ACM Comput. 
Surv."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1195","DOI":"10.1016\/j.pan.2020.07.399","article-title":"A machine learning approach identified a diagnostic model for pancreatic cancer through using circulating microRNA signatures","volume":"20","author":"Savareh","year":"2020","journal-title":"Pancreatology"},{"key":"ref_9","first-page":"i446","article-title":"Deep learning with multimodal representation for pancancer prognosis prediction","volume":"14","author":"Anika","year":"2019","journal-title":"Bioinformatics"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1248","DOI":"10.1158\/1078-0432.CCR-17-0853","article-title":"Deep learning-based multi-omics integration robustly predicts survival in liver cancer","volume":"24","author":"Chaudhary","year":"2018","journal-title":"Clin. Cancer Res."},{"key":"ref_11","first-page":"107277","article-title":"Incorporating deep learning and multi-omics autoencoding for analysis of lung adenocarcinoma prognostication","volume":"87","author":"Lee","year":"2020","journal-title":"Comput. Biol."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1007\/BF01001956","article-title":"Rough sets","volume":"11","author":"Pawlak","year":"1982","journal-title":"J. Comput. Inform. Sci."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Domingos, P. (1999, January 15\u201318). MetaCost: A general method for making classifiers cost-sensitive. Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA.","DOI":"10.1145\/312129.312220"},{"key":"ref_14","unstructured":"Elkan, C. (2001, January 4\u201310). The foundations of cost-sensitive learning. Proceedings of the Seventeenth International Joint Conference of Artificial Intelligence, Seattle, WA, USA."},{"key":"ref_15","unstructured":"Turney, P. (July, January 29). Types of cost in inductive concept learning. 
Proceedings of the Workshop on Cost-Sensitive Learning at the Seventeenth International Conference on Machine Learning, Stanford, CA, USA."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Zadrozny, B. (2005, January 21). One-Benefit learning: Cost-sensitive learning with restricted cost information. Proceedings of the 1st International Workshop on Utility-Based Data Mining, Chicago, IL, USA.","DOI":"10.1145\/1089827.1089834"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1016\/j.ins.2017.09.013","article-title":"Cost-sensitive and hybrid-attribute measure multi-decision tree over imbalanced data sets","volume":"422","author":"Li","year":"2018","journal-title":"Inf. Sci."},{"key":"ref_18","unstructured":"Veropoulos, K., Campbell, C., and Cristianini, N. (August, January 31). Controlling the sensitivity of support vector machines. Proceedings of the 1999 International Joint Conference on AI, Stockholm, Sweden."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1016\/j.ins.2019.02.062","article-title":"Self-adaptive cost weights-based support vector machine cost-sensitive ensemble for imbalanced data classification","volume":"487","author":"Tao","year":"2019","journal-title":"Inf. Sci."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1109\/TIT.1967.1053964","article-title":"Nearest neighbour pattern classification","volume":"13","author":"Cover","year":"1967","journal-title":"IEEE Trans. Inf. Theor."},{"key":"ref_21","first-page":"302","article-title":"Survey of nearest neighbour techniques","volume":"8","author":"Bhatia","year":"2010","journal-title":"Int. J. Comput. Sci. Inf. Secur."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1605","DOI":"10.1002\/ijc.28792","article-title":"Prospective cohort studies of association between family history of liver cancer and risk of liver cancer","volume":"135","author":"Yang","year":"2014","journal-title":"Int. J. 
Cancer"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"S20","DOI":"10.1097\/IGC.0000000000001118","article-title":"Ovarian cancer prevention, screening, and early detection: Report from the 11th biennial ovarian cancer research symposium","volume":"27","author":"Chien","year":"2017","journal-title":"Int. J. Gynecol. Cancer"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"424","DOI":"10.1111\/j.1399-5618.2006.00373.x","article-title":"Approximate entropy of self-reported mood prior to episodes in bipolar disorder","volume":"8","author":"Glenn","year":"2006","journal-title":"Bipolar Disord."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1007\/BF01619355","article-title":"A regularity statistic for medical data analysis","volume":"7","author":"Pincus","year":"1991","journal-title":"J. Clin. Monit."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Delgado-Bonal, A., and Marshak, A. (2019). Approximate Entropy and Sample Entropy: A Comprehensive Tutorial. Entropy, 21.","DOI":"10.3390\/e21060541"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1016\/j.physa.2018.01.002","article-title":"Mixture models with entropy regularization for community detection in networks","volume":"496","author":"Chang","year":"2018","journal-title":"Physica A"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1016\/j.compbiomed.2012.11.005","article-title":"Analysis of heart rate variability using fuzzy measure entropy","volume":"43","author":"Liu","year":"2013","journal-title":"Comput. Biol. Med."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1016\/j.medengphy.2008.04.005","article-title":"Measuring complexity using fuzzyen, apen, and sampen","volume":"31","author":"Chen","year":"2009","journal-title":"Med. Eng. 
Phys."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Fern\u00e1ndez, A., Garc\u00eda, S., Galar, M., Prati, R.C., Krawczyk, B., and Herrera, F. (2018). Learning from Imbalanced Data Sets, Springer.","DOI":"10.1007\/978-3-319-98074-4"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"2872","DOI":"10.1109\/TKDE.2014.2312336","article-title":"A new strategy of cost-free learning in the class imbalance problem","volume":"26","author":"Zhang","year":"2014","journal-title":"IEEE Trans. Knowl. Data Eng."}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/24\/2\/253\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:16:25Z","timestamp":1760134585000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/24\/2\/253"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,8]]},"references-count":31,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2022,2]]}},"alternative-id":["e24020253"],"URL":"https:\/\/doi.org\/10.3390\/e24020253","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,2,8]]}}}