{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T02:24:03Z","timestamp":1760235843163,"version":"build-2065373602"},"reference-count":64,"publisher":"MDPI AG","issue":"19","license":[{"start":{"date-parts":[[2021,9,28]],"date-time":"2021-09-28T00:00:00Z","timestamp":1632787200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100005270","name":"Fujian Provincial Department of Science and Technology","doi-asserted-by":"publisher","award":["2019H0001"],"award-info":[{"award-number":["2019H0001"]}],"id":[{"id":"10.13039\/501100005270","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Due to the high mortality of many cancers and their related diseases, the prediction and prognosis techniques of cancers are being extensively studied to assist doctors in making diagnoses. Many machine-learning-based cancer predictors have been put forward, but many of them have failed to become widely utilised due to some crucial problems. For example, most methods require too much training data, which is not always applicable to institutes, and the complicated genetic mutual effects of cancers are generally ignored in many proposed methods. Moreover, a majority of these assist models are actually not safe to use, as they are generally built on black-box machine learners that lack references from related field knowledge. We observe that few machine-learning-based cancer predictors are capable of employing prior knowledge (PrK) to mitigate these issues. Therefore, in this paper, we propose a generalisable informed machine learning architecture named the Informed Attentive Predictor (IAP) to make PrK available to the predictor\u2019s decision-making phases and apply it to the field of cancer prediction. Specifically, we make several implementations of the IAP and evaluate its performance on six TCGA datasets to demonstrate the effectiveness of our architecture as an assist system framework for actual clinical usage. The experimental results show a noticeable improvement in IAP models on accuracies, f1-scores and recall rates compared to their non-IAP counterparts (i.e., basic predictors).<\/jats:p>","DOI":"10.3390\/s21196484","type":"journal-article","created":{"date-parts":[[2021,9,28]],"date-time":"2021-09-28T21:39:29Z","timestamp":1632865169000},"page":"6484","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Informed Attentive Predictors: A Generalisable Architecture for Prior Knowledge-Based Assisted Diagnosis of Cancers"],"prefix":"10.3390","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3344-8691","authenticated-orcid":false,"given":"Han","family":"Li","sequence":"first","affiliation":[{"name":"School of Informatics, Xiamen University, Xiamen 361001, China"}]},{"given":"Linling","family":"Qiu","sequence":"additional","affiliation":[{"name":"School of Informatics, Xiamen University, Xiamen 361001, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8639-7675","authenticated-orcid":false,"given":"Meihong","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Informatics, Xiamen University, Xiamen 361001, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,9,28]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1002\/jso.2930130205","article-title":"An investigation of the cause of death from cancer","volume":"13","author":"Houten","year":"1980","journal-title":"J. Surg. Oncol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"928","DOI":"10.1109\/TCBB.2014.2377729","article-title":"Integrative Data Analysis of Multi-Platform Cancer Data with a Multimodal Deep Learning Approach","volume":"12","author":"Liang","year":"2015","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1016\/j.tig.2015.11.003","article-title":"Understanding the Genetic Mechanisms of Cancer Drug Resistance Using Genomic Approaches","volume":"32","author":"Hu","year":"2016","journal-title":"Trends Genet."},{"key":"ref_4","unstructured":"Ray, S. (2021). A Survey on Application of Machine Learning Algorithms in Cancer Prediction and Prognosis. Data Management, Analytics and Innovation, Springer."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1109\/JBHI.2018.2803793","article-title":"SetSVM: An approach to set classification in nuclei-based cancer detection","volume":"23","author":"Liu","year":"2018","journal-title":"IEEE J. Biomed. Health Inform."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"3155","DOI":"10.1093\/jxb\/erw497","article-title":"An outlook on protein S-acylation in plants: What are the next steps?","volume":"68","author":"Hemsley","year":"2017","journal-title":"J. Exp. Bot."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Kretowska, M. (2014). Computational Intelligence in Survival Analysis. Encyclopedia of Business Analytics and Optimization, IGI Global.","DOI":"10.4018\/978-1-4666-5202-6.ch044"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1013","DOI":"10.1158\/1535-7163.MCT-07-0177","article-title":"Improved grading and survival prediction of human astrocytic brain tumors by artificial neural network analysis of gene expression microarray data","volume":"7","author":"Petalidis","year":"2008","journal-title":"Mol. Cancer Ther."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1413","DOI":"10.1109\/JBHI.2019.2944643","article-title":"Deep Learning-Based Gleason grading of prostate cancer from histopathology Images\u2014Role of multiscale decision aggregation and data augmentation","volume":"24","author":"Karimi","year":"2019","journal-title":"IEEE J. Biomed. Health Informatics"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1109\/TRPMS.2018.2872406","article-title":"Treatment Outcome Prediction for Cancer Patients Based on Radiomics and Belief Function Theory","volume":"3","author":"Wu","year":"2019","journal-title":"IEEE Trans. Radiat. Plasma Med Sci."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Murthy, N.S., and Bethala, C. (2021). Review paper on research direction towards cancer prediction and prognosis using machine learning and deep learning models. J. Ambient. Intell. Humaniz. Comput., 1\u201319.","DOI":"10.1007\/s12652-021-03147-3"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.artmed.2019.06.001","article-title":"A deep survival analysis method based on ranking","volume":"98","author":"Jing","year":"2019","journal-title":"Artif. Intell. Med."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1016\/j.ebiom.2019.07.046","article-title":"A 23 gene\u2013based molecular prognostic score precisely predicts overall survival of breast cancer patients","volume":"46","author":"Shimizu","year":"2019","journal-title":"eBioMedicine"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1519","DOI":"10.1038\/s41591-019-0583-3","article-title":"Deep learning-based classification of mesothelioma improves prediction of patient outcome","volume":"25","author":"Courtiol","year":"2019","journal-title":"Nat. Med."},{"key":"ref_15","unstructured":"Shin, H.C. (2012, January 4\u20138). Hybrid clustering and logistic regression for multi-modal brain tumor segmentation. Proceedings of the Workshops and Challanges in Medical Image Computing and Computer-Assisted Intervention (MICCAI\u201912), Lima, Peru."},{"key":"ref_16","unstructured":"Hu, Y., Huber, A., Anumula, J., and Liu, S.C. (2018). Overcoming the vanishing gradient problem in plain recurrent networks. arXiv."},{"key":"ref_17","unstructured":"Su, J. (2018). GAN-QP: A novel GAN framework without gradient vanishing and lipschitz constraint. arXiv."},{"key":"ref_18","first-page":"19","article-title":"Informed machine learning\u2014Towards a taxonomy of explicit integration of knowledge into machine learning","volume":"18","author":"Mayer","year":"2019","journal-title":"Learning"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"101656","DOI":"10.1016\/j.isci.2020.101656","article-title":"Integrating Machine Learning with Human Knowledge","volume":"23","author":"Deng","year":"2020","journal-title":"Iscience"},{"key":"ref_20","first-page":"11","article-title":"Does machine learning really work?","volume":"18","author":"Mitchell","year":"1997","journal-title":"AI Mag."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1038\/323533a0","article-title":"Learning representations by back-propagating errors","volume":"323","author":"Rumelhart","year":"1986","journal-title":"Nature"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/BF00994018","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Mach. Learn."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1007\/BF00116251","article-title":"Induction of decision trees","volume":"1","author":"Quinlan","year":"1986","journal-title":"Mach. Learn."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1007\/BF00993877","article-title":"Using neural networks to diagnose cancer","volume":"15","author":"Maclin","year":"1991","journal-title":"J. Med Syst."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1016\/0021-9681(85)90090-6","article-title":"Treatment selection for cancer patients: Application of statistical decision theory to the treatment of advanced ovarian cancer","volume":"38","author":"Simes","year":"1985","journal-title":"J. Chronic Dis."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"2725","DOI":"10.1158\/1078-0432.CCR-1115-03","article-title":"Predictive models for breast cancer susceptibility from multiple single nucleotide polymorphisms","volume":"10","author":"Listgarten","year":"2004","journal-title":"Clin. Cancer Res."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Hu, Y., Ashenayi, K., Veltri, R., O\u2019Dowd, G., Miller, G., Hurst, R., and Bonner, R. (July, January 28). A comparison of neural network and fuzzy c-means methods in bladder cancer cell classification. Proceedings of the 1994 IEEE International Conference on Neural Networks (ICNN\u201994), San Francisco, CA, USA.","DOI":"10.1109\/ICNN.1994.374891"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Bevilacqua, V., Mastronardi, G., Menolascina, F., Pannarale, P., and Pedone, A. (2006, January 16\u201321). A novel multi-objective genetic algorithm approach to artificial neural network topology optimisation: The breast cancer classification problem. Proceedings of the 2006 IEEE International Joint Conference on Neural Network, Vancouver, BC, Canada.","DOI":"10.1109\/IJCNN.2006.246940"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1007\/s10489-006-0020-4","article-title":"Cancer classification using ensemble of neural networks with multiple significant gene subsets","volume":"26","author":"Cho","year":"2007","journal-title":"Appl. Intell."},{"key":"ref_30","first-page":"25","article-title":"Human liver cancer classification using microarray gene expression data","volume":"34","author":"Rajeswari","year":"2011","journal-title":"Int. J. Comput. Appl."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1016\/j.proeng.2012.06.005","article-title":"A Novel Feature Selection Algorithm using Particle Swarm Optimization for Cancer Microarray Data","volume":"38","author":"Sahu","year":"2012","journal-title":"PRocedia Eng."},{"key":"ref_32","first-page":"1534","article-title":"A classification technique for microarray gene expression data using PSO-FLANN","volume":"4","author":"Dev","year":"2012","journal-title":"Int. J. Comput. Sci. Eng."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"470","DOI":"10.1016\/j.procs.2014.09.023","article-title":"A Latent Space Support Vector Machine (LSSVM) Model for Cancer Prognosis","volume":"36","author":"Ford","year":"2014","journal-title":"Procedia Comput. Sci."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1127","DOI":"10.1109\/TITB.2011.2165076","article-title":"Multiparametric decision support system for the prediction of oral cancer reoccurrence","volume":"16","author":"Exarchos","year":"2011","journal-title":"IEEE Trans. Inf. Technol. Biomed."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"3240","DOI":"10.1016\/j.eswa.2008.01.009","article-title":"Support vector machines combined with feature selection for breast cancer diagnosis","volume":"36","author":"Akay","year":"2009","journal-title":"Expert Syst. Appl."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"103498","DOI":"10.1016\/j.compbiomed.2019.103498","article-title":"Deep learning analysis of breast MRIs for prediction of occult invasive disease in ductal carcinoma in situ","volume":"115","author":"Zhu","year":"2019","journal-title":"Comput. Biol. Med."},{"key":"ref_37","first-page":"3","article-title":"Using three machine learning techniques for predicting breast cancer recurrence","volume":"4","author":"Ahmad","year":"2013","journal-title":"J. Health Med. Inform."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1136\/amiajnl-2012-001570","article-title":"Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data","volume":"20","author":"Kim","year":"2013","journal-title":"J. Am. Med Inform. Assoc."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"317","DOI":"10.1016\/j.ebiom.2017.12.026","article-title":"Deep convolutional neural networks enable discrimination of heterogeneous digital pathology images","volume":"27","author":"Khosravi","year":"2018","journal-title":"eBioMedicine"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1002\/ima.22403","article-title":"Residual learning based CNN for breast cancer histopathological image classification","volume":"30","author":"Gour","year":"2020","journal-title":"Int. J. Imag. Syst. Technol."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"128613","DOI":"10.1109\/ACCESS.2020.3008868","article-title":"PROMETEO: A CNN-based computer-aided diagnosis system for WSI prostate cancer detection","volume":"8","year":"2020","journal-title":"IEEE Access"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Zhu, W., Xie, L., Han, J., and Guo, X. (2020). The application of deep learning in cancer prognosis prediction. Cancers, 12.","DOI":"10.3390\/cancers12030603"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"102439","DOI":"10.1016\/j.ipm.2020.102439","article-title":"Improved breast cancer classification through combining graph convolutional network and convolutional neural network","volume":"58","author":"Zhang","year":"2021","journal-title":"Inf. Process. Manag."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Althubaiti, S., Kulmanov, M., Liu, Y., Gkoutos, G., Schofield, P., and Hoehndorf, R. (2021). DeepMOCCA: A pan-cancer prognostic model identifies personalized prognostic markers through graph attention and multi-omics data integration. bioRxiv.","DOI":"10.1101\/2021.03.02.433454"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Qiu, L., Li, H., Wang, M., and Wang, X. (2021). Gated Graph Attention Network for Cancer Prediction. Sensors, 21.","DOI":"10.3390\/s21061938"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"034603","DOI":"10.1103\/PhysRevFluids.2.034603","article-title":"Physics-informed machine learning approach for reconstructing Reynolds stress modeling discrepancies based on DNS data","volume":"2","author":"Wang","year":"2017","journal-title":"Phys. Rev. Fluids"},{"key":"ref_48","unstructured":"Xu, J., Zhang, Z., Friedman, T., Liang, Y., and Broeck, G. (2018, January 10\u201315). A semantic loss function for deep learning with symbolic knowledge. Proceedings of the International Conference on Machine Learning, Stockholm, Sweden."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Ren, H., Stewart, R., Song, J., Kuleshov, V., and Ermon, S. (2018). Adversarial constraint learning for structured prediction. arXiv.","DOI":"10.24963\/ijcai.2018\/366"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"619","DOI":"10.1038\/s42256-020-00232-8","article-title":"Integration of mechanistic immunological knowledge into a machine learning pipeline improves predictions","volume":"2","author":"Culos","year":"2020","journal-title":"Nat. Mach. Intell."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Cun, Y., and Fr\u00f6hlich, H. (2012). Prognostic gene signatures for patient stratification in breast cancer-accuracy, stability and interpretability of gene selection approaches using prior knowledge on protein-protein interactions. BMC Bioinform., 13.","DOI":"10.1186\/1471-2105-13-69"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1756-9966-28-103","article-title":"Lung cancer gene expression database analysis incorporating prior knowledge with support vector machine-based classification method","volume":"28","author":"Guan","year":"2009","journal-title":"J. Exp. Clin. Cancer Res."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1038\/nature12634","article-title":"Mutational landscape and significance across 12 major cancer types","volume":"502","author":"Kandoth","year":"2013","journal-title":"Nature"},{"key":"ref_54","first-page":"797","article-title":"An improved weighted decision tree approach for breast cancer prediction","volume":"12","author":"Juneja","year":"2020","journal-title":"Int. J. Inf. Technol."},{"key":"ref_55","first-page":"8026","article-title":"Pytorch: An imperative style, high-performance deep learning library","volume":"32","author":"Paszke","year":"2019","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_56","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40537-019-0192-5","article-title":"Survey on deep learning with class imbalance","volume":"6","author":"Johnson","year":"2019","journal-title":"J. Big Data"},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"962","DOI":"10.1109\/72.286891","article-title":"An improved algorithm for neural network classification of imbalanced training sets","volume":"4","author":"Anand","year":"1993","journal-title":"IEEE Trans. Neural Netw."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Van Hulse, J., Khoshgoftaar, T.M., and Napolitano, A. (2007, January 20\u201324). Experimental perspectives on learning from imbalanced data. Proceedings of the 24th International Conference on Machine Learning, Corvallis, OR, USA.","DOI":"10.1145\/1273496.1273614"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1613\/jair.953","article-title":"SMOTE: Synthetic Minority Over-sampling Technique","volume":"16","author":"Chawla","year":"2002","journal-title":"J. Artif. Intell. Res."},{"key":"ref_61","unstructured":"He, H., Yang, B., Garcia, E.A., and Li, S. (2008, January 1\u20138). ADASYN: Adaptive synthetic sampling approach for imbalanced learning. Proceedings of the 2008 IEEE International Joint Conference on Neural Networks, Hong Kong, China."},{"key":"ref_62","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1007\/s13748-016-0094-0","article-title":"Learning from imbalanced data: Open challenges and future directions","volume":"5","author":"Krawczyk","year":"2016","journal-title":"Prog. Artif. Intell."},{"key":"ref_63","unstructured":"Glorot, X., and Bengio, Y. (2010, January 13\u201315). Understanding the difficulty of training deep feedforward neural networks. Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Sardinia, Italy."},{"key":"ref_64","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/19\/6484\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:06:31Z","timestamp":1760166391000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/19\/6484"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,9,28]]},"references-count":64,"journal-issue":{"issue":"19","published-online":{"date-parts":[[2021,10]]}},"alternative-id":["s21196484"],"URL":"https:\/\/doi.org\/10.3390\/s21196484","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2021,9,28]]}}}