{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,14]],"date-time":"2026-01-14T19:11:34Z","timestamp":1768417894086,"version":"3.49.0"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2019,11,14]],"date-time":"2019-11-14T00:00:00Z","timestamp":1573689600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2019,11,14]],"date-time":"2019-11-14T00:00:00Z","timestamp":1573689600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61872261"],"award-info":[{"award-number":["61872261"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61872261"],"award-info":[{"award-number":["61872261"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2019,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Lung cancer is one of the most common types of cancer, among which lung adenocarcinoma accounts for the largest proportion. Currently, accurate staging is a prerequisite for effective diagnosis and treatment of lung adenocarcinoma. Previous research has used mainly single-modal data, such as gene expression data, for classification and prediction. Integrating multi-modal genetic data (gene expression RNA-seq, methylation data and copy number variation) from the same patient provides the possibility of using multi-modal genetic data for cancer prediction. A new machine learning method called gcForest has recently been proposed. This method has been proven to be suitable for classification in some fields. However, the model may face challenges when applied to small samples and high-dimensional genetic data.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>In this paper, we propose a multi-weighted gcForest algorithm (MLW-gcForest) to construct a lung adenocarcinoma staging model using multi-modal genetic data. The new algorithm is based on the standard gcForest algorithm. First, different weights are assigned to different random forests according to the classification performance of these forests in the standard gcForest model. Second, because the feature vectors generated under different scanning granularities have a diverse influence on the final classification result, the feature vectors are given weights according to the proposed sorting optimization algorithm. Then, we train three MLW-gcForest models based on three single-modal datasets (gene expression RNA-seq, methylation data, and copy number variation) and then perform decision fusion to stage lung adenocarcinoma. Experimental results suggest that the MLW-gcForest model is superior to the standard gcForest model in constructing a staging model of lung adenocarcinoma and is better than the traditional classification methods. The accuracy, precision, recall, and AUC reached 0.908, 0.896, 0.882, and 0.96, respectively.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>The MLW-gcForest model has great potential in lung adenocarcinoma staging, which is helpful for the diagnosis and personalized treatment of lung adenocarcinoma. The results suggest that the MLW-gcForest algorithm is effective on multi-modal genetic data, which consist of small samples and are high dimensional.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-019-3172-z","type":"journal-article","created":{"date-parts":[[2019,11,14]],"date-time":"2019-11-14T14:03:10Z","timestamp":1573740190000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":27,"title":["MLW-gcForest: a multi-weighted gcForest model towards the staging of lung adenocarcinoma based on multi-modal genetic data"],"prefix":"10.1186","volume":"20","author":[{"given":"Yunyun","family":"Dong","sequence":"first","affiliation":[]},{"given":"Wenkai","family":"Yang","sequence":"additional","affiliation":[]},{"given":"Jiawen","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Juanjuan","family":"Zhao","sequence":"additional","affiliation":[]},{"given":"Yan","family":"Qiang","sequence":"additional","affiliation":[]},{"given":"Zijuan","family":"Zhao","sequence":"additional","affiliation":[]},{"given":"Ntikurako Guy Fernand","family":"Kazihise","sequence":"additional","affiliation":[]},{"given":"Yanfen","family":"Cui","sequence":"additional","affiliation":[]},{"given":"Xiaotong","family":"Yang","sequence":"additional","affiliation":[]},{"given":"Siyuan","family":"Liu","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2019,11,14]]},"reference":[{"issue":"3","key":"3172_CR1","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1007\/s12032-018-1082-y","volume":"35","author":"N Motono","year":"2018","unstructured":"Motono N, Funasaki A, Sekimura A, et al. Prognostic value of epidermal growth factor receptor mutations and histologic subtypes with lung adenocarcinoma[J]. Med Oncol. 2018;35(3):22.","journal-title":"Med Oncol"},{"issue":"10","key":"3172_CR2","doi-asserted-by":"publisher","first-page":"e0186567","DOI":"10.1371\/journal.pone.0186567","volume":"12","author":"CY Lin","year":"2017","unstructured":"Lin CY, Wu YM, Hsieh MH, et al. Prognostic implication of EGFR gene mutations and histological classification in patients with resected stage I lung adenocarcinoma. [J]. Plos One. 2017;12(10):e0186567.","journal-title":"[J]. Plos One"},{"issue":"9","key":"3172_CR3","doi-asserted-by":"publisher","first-page":"2443","DOI":"10.1158\/1078-0432.CCR-11-2370","volume":"18","author":"P Perezmoreno","year":"2012","unstructured":"Perezmoreno P, Brambilla E, Thomas R, et al. Squamous cell carcinoma of the lung: molecular subtypes and therapeutic opportunities. [J]. Clinical Cancer Research An Official Journal of the American Association for Cancer Research. 2012;18(9):2443.","journal-title":"[J]. Clinical Cancer Research An Official Journal of the American Association for Cancer Research"},{"key":"3172_CR4","doi-asserted-by":"publisher","first-page":"12474","DOI":"10.1038\/ncomms12474","volume":"7","author":"KH Yu","year":"2016","unstructured":"Yu KH, Zhang C, Berry GJ, et al. Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features:[J]. Nat Commun. 2016;7:12474.","journal-title":"Nat Commun"},{"issue":"3","key":"3172_CR5","doi-asserted-by":"publisher","first-page":"791","DOI":"10.1039\/C4MB00659C","volume":"11","author":"Z Cai","year":"2015","unstructured":"Cai Z, Xu D, Zhang Q, et al. Classification of lung cancer using ensemble-based feature selection and machine learning methods[J]. Mol BioSyst. 2015;11(3):791\u2013800.","journal-title":"Mol BioSyst"},{"issue":"3","key":"3172_CR6","doi-asserted-by":"publisher","first-page":"275","DOI":"10.1504\/IJCBDD.2015.072109","volume":"8","author":"X Li","year":"2015","unstructured":"Li X, Scheich B. Predicting tumour stages of lung cancer adenocarcinoma tumours from pooled microarray data using machine learning methods[J]. International Journal of Computational Biology and Drug Design. 2015;8(3):275\u201392.","journal-title":"International Journal of Computational Biology and Drug Design"},{"key":"3172_CR7","doi-asserted-by":"crossref","unstructured":"Nguyen A, Moore D, McCowan I, et al. Multi-class classification of cancer stages from free-text histology reports using support vector machines[C]\/\/2007 29th annual international conference of the IEEE engineering in medicine and biology society. IEEE. 2007:5140\u20133.","DOI":"10.1109\/IEMBS.2007.4353497"},{"key":"3172_CR8","doi-asserted-by":"crossref","unstructured":"Singh NP, Bapi RS, Vinod PK. Machine learning models to predict the progression from early to late stages of papillary renal cell carcinoma. [J]. Computers in Biology & Medicine. 2018:100.","DOI":"10.1016\/j.compbiomed.2018.06.030"},{"key":"3172_CR9","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.cmpb.2017.09.005","volume":"153","author":"Y Xiao","year":"2018","unstructured":"Xiao Y, Wu J, Lin Z, et al. A deep learning-based multi--model ensemble method for cancer prediction[J]. Computer Methods & Programs in Biomedicine. 2018;153:1.","journal-title":"Computer Methods & Programs in Biomedicine"},{"issue":"1","key":"3172_CR10","doi-asserted-by":"publisher","first-page":"319","DOI":"10.1186\/1471-2105-9-319","volume":"9","author":"A Statnikov","year":"2008","unstructured":"Statnikov A, Wang L, Aliferis CF. A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification[J]. BMC bioinformatics. 2008;9(1):319.","journal-title":"BMC bioinformatics"},{"issue":"6","key":"3172_CR11","doi-asserted-by":"publisher","first-page":"1248","DOI":"10.1158\/1078-0432.CCR-17-0853","volume":"24","author":"K Chaudhary","year":"2018","unstructured":"Chaudhary K, Poirion OB, Lu L, et al. Deep learning\u2013based multi-omics integration robustly predicts survival in liver cancer[J]. Clin Cancer Res. 2018;24(6):1248\u201359.","journal-title":"Clin Cancer Res"},{"issue":"2","key":"3172_CR12","doi-asserted-by":"publisher","first-page":"687","DOI":"10.1016\/j.ejor.2017.12.001","volume":"267","author":"Haifeng Wang","year":"2018","unstructured":"Wang H, Zheng B, Sang WY, et al. A support vector machine-based ensemble algorithm for breast Cancer diagnosis[J]. Eur J Oper Res. 2018;267.","journal-title":"European Journal of Operational Research"},{"key":"3172_CR13","doi-asserted-by":"publisher","DOI":"10.7717\/peerj.5285","volume":"6","author":"MS Tan","year":"2018","unstructured":"Tan MS, Chang SW, Cheah PL, et al. Integrative machine learning analysis of multiple gene expression profiles in cervical cancer[J]. PeerJ. 2018;6:e5285.","journal-title":"PeerJ"},{"issue":"1","key":"3172_CR14","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1021\/acs.jproteome.7b00595","volume":"17","author":"FM Alkawaa","year":"2018","unstructured":"Alkawaa FM, Chaudhary K, Garmire LX. Deep learning accurately predicts estrogen receptor status in breast cancer metabolomics data[J]. J Proteome Res. 2018;17(1):337\u201347.","journal-title":"J Proteome Res"},{"key":"3172_CR15","doi-asserted-by":"crossref","unstructured":"Coudray N. Et al. classification and mutation prediction from non\u2013small cell lung cancer histopathology images using deep learning. Nat Med. 2018;1.","DOI":"10.1101\/197574"},{"key":"3172_CR16","doi-asserted-by":"publisher","first-page":"26286","DOI":"10.1038\/srep26286","volume":"6","author":"G Litjens","year":"2016","unstructured":"Litjens G, S\u00e1nchez CI, Timofeeva N, et al. Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis[J]. Sci Rep. 2016;6:26286.","journal-title":"Sci Rep"},{"issue":"3","key":"3172_CR17","doi-asserted-by":"publisher","first-page":"841","DOI":"10.1109\/TCBB.2018.2806438","volume":"16","author":"D Sun","year":"2019","unstructured":"Sun D, Wang M, Li A. A multimodal deep neural network for human breast cancer prognosis prediction by integrating multi-dimensional data[J]. IEEE\/ACM Transactions on Computational Biology and Bioinformatics (TCBB). 2019;16(3):841\u201350.","journal-title":"IEEE\/ACM Transactions on Computational Biology and Bioinformatics (TCBB)"},{"key":"3172_CR18","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1007\/978-3-642-40763-5_50","volume-title":"Advanced Information Systems Engineering","author":"Angel Alfonso Cruz-Roa","year":"2013","unstructured":"Cruz-Roa AA, Ovalle JEA, Madabhushi A, et al. A deep learning architecture for image representation, visual interpretability and automated basal-cell carcinoma cancer detection[C]\/\/International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, Berlin, Heidelberg; 2013. p. 403\u201310."},{"issue":"1","key":"3172_CR19","doi-asserted-by":"publisher","first-page":"426","DOI":"10.1186\/s12859-019-2992-1","volume":"20","author":"CM Wilson","year":"2019","unstructured":"Wilson CM, Li K, Yu X, et al. Multiple-kernel learning for genomic data mining and prediction[J]. BMC bioinformatics. 2019;20(1):426.","journal-title":"BMC bioinformatics"},{"key":"3172_CR20","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1016\/j.cmpb.2018.05.016","volume":"163","author":"PH Feng","year":"2018","unstructured":"Feng PH, Chen TT, Lin YT, et al. Classification of lung cancer subtypes based on autofluorescence bronchoscopic pattern recognition: a preliminary study[J]. Comput Methods Prog Biomed. 2018;163:33\u20138.","journal-title":"Comput Methods Prog Biomed"},{"issue":"1","key":"3172_CR21","doi-asserted-by":"publisher","first-page":"57","DOI":"10.2174\/1574893611666160609081155","volume":"13","author":"Z Liao","year":"2018","unstructured":"Liao Z, Li D, Wang X, et al. Cancer diagnosis through IsomiR expression with machine learning method[J]. Curr Bioinforma. 2018;13(1):57\u201363.","journal-title":"Curr Bioinforma"},{"issue":"10","key":"3172_CR22","doi-asserted-by":"publisher","first-page":"6777","DOI":"10.1002\/jcp.26531","volume":"233","author":"Xiao-Feng Zhu","year":"2018","unstructured":"Zhu XF, Zhu BS, Wu FM, et al. DNA methylation biomarkers for the occurrence of lung adenocarcinoma from TCGA data mining. [J]. Journal of Cellular Physiology. 2018;233(3 Suppl).","journal-title":"Journal of Cellular Physiology"},{"key":"3172_CR23","doi-asserted-by":"crossref","unstructured":"Tran N, Abhyankar V, Nguyen KT, et al. Microrna dysregulational synergistic network: learning context-specific microrna dysregulations in lung cancer subtypes[C]\/\/2017 IEEE international conference on bioinformatics and biomedicine (BIBM). IEEE. 2017:142\u20135.","DOI":"10.1109\/BIBM.2017.8217640"},{"key":"3172_CR24","doi-asserted-by":"publisher","first-page":"188","DOI":"10.1016\/j.humpath.2017.06.002","volume":"66","author":"C Molina-Romero","year":"2017","unstructured":"Molina-Romero C, Rangel-Escare\u00f1o C, Ortega-G\u00f3mez A, et al. Differential gene expression profiles according to the Association for the Study of lung Cancer\/American Thoracic Society\/European Respiratory Society histopathological classification in lung adenocarcinoma subtypes[J]. Hum Pathol. 2017;66:188\u201399.","journal-title":"Hum Pathol"},{"issue":"3","key":"3172_CR25","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1016\/j.jtho.2016.11.2216","volume":"12","author":"J Shin","year":"2017","unstructured":"Shin J, Keam B, Kim M, et al. Prognostic impact of newly proposed M descriptors in TNM classification of non-small cell lung Cancer[J]. J Thorac Oncol. 2017;12(3):520\u20138.","journal-title":"J Thorac Oncol"},{"issue":"5","key":"3172_CR26","doi-asserted-by":"publisher","first-page":"449","DOI":"10.1038\/nm.3850","volume":"21","author":"R Cristescu","year":"2015","unstructured":"Cristescu R, Lee J, Nebozhyn M, et al. Molecular analysis of gastric cancer identifies subtypes associated with distinct clinical outcomes[J]. Nat Med. 2015;21(5):449.","journal-title":"Nat Med"},{"issue":"1092","key":"3172_CR27","doi-asserted-by":"publisher","first-page":"20180334","DOI":"10.1259\/bjr.20180334","volume":"91","author":"JF Xiong","year":"2018","unstructured":"Xiong JF, Jia TY, Li XY, et al. Identifying epidermal growth factor receptor mutation status in patients with lung adenocarcinoma by three-dimensional convolutional neural networks[J]. Br J Radiol. 2018;91(1092):20180334.","journal-title":"Br J Radiol"},{"key":"3172_CR28","doi-asserted-by":"crossref","unstructured":"Zhou Z H, Feng J. Deep forest: Towards an alternative to deep neural networks. arXiv 2017[J]. arXiv preprint arXiv:1702.08835.","DOI":"10.24963\/ijcai.2017\/497"},{"issue":"17","key":"3172_CR29","doi-asserted-by":"publisher","first-page":"3589","DOI":"10.3390\/app9173589","volume":"9","author":"Y Dong","year":"2019","unstructured":"Dong Y, Yang W, Wang J, et al. MLW-gcForest: a multi-weighted gcForest model for Cancer subtype classification by methylation data[J]. Appl Sci. 2019;9(17):3589.","journal-title":"Appl Sci"},{"issue":"4","key":"3172_CR30","doi-asserted-by":"publisher","first-page":"385","DOI":"10.1002\/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3","volume":"16","author":"R Tibshirani","year":"1997","unstructured":"Tibshirani R. The lasso method for variable selection in the cox model. Stat Med. 1997;16(4):385\u201395.","journal-title":"Stat Med"},{"issue":"9","key":"3172_CR31","doi-asserted-by":"publisher","first-page":"1617","DOI":"10.1007\/s11425-016-9116-6","volume":"61","author":"Y Lin","year":"2018","unstructured":"Lin Y, Liu X, Hao M. Model-free feature screening for high-dimensional survival data[J]. SCIENCE CHINA Math. 2018;61(9):1617\u201336.","journal-title":"SCIENCE CHINA Math"},{"issue":"1","key":"3172_CR32","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L. Random forests[J]. Mach Learn. 2001;45(1):5\u201332.","journal-title":"Mach Learn"},{"key":"3172_CR33","unstructured":"Fan W, Wang H, Yu PS, et al. Is random model better? On its accuracy and efficiency[C]\/\/third IEEE international conference on data mining. IEEE. 2003:51\u20138."},{"key":"3172_CR34","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1016\/j.csbj.2014.11.005","volume":"13","author":"K Kourou","year":"2015","unstructured":"Kourou K, Exarchos TP, Exarchos KP, et al. Machine learning applications in cancer prognosis and prediction[J]. Computational and structural biotechnology journal. 2015;13:8\u201317.","journal-title":"Computational and structural biotechnology journal"},{"issue":"1","key":"3172_CR35","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1006\/jmps.1997.1183","volume":"42","author":"BK Scurfield","year":"1998","unstructured":"Scurfield BK. Generalization of the theory of signal detectability ton-Eventm-dimensional forced-choice tasks[J]. J Math Psychol. 1998;42(1):5\u201331.","journal-title":"J Math Psychol"},{"issue":"3","key":"3172_CR36","doi-asserted-by":"publisher","first-page":"566","DOI":"10.1093\/biostatistics\/kxm050","volume":"9","author":"J Li","year":"2008","unstructured":"Li J, Fine JP. ROC analysis with multiple classes and multiple tests: methodology and its application in microarray studies[J]. Biostatistics. 2008;9(3):566\u201376.","journal-title":"Biostatistics"},{"issue":"7511","key":"3172_CR37","doi-asserted-by":"publisher","first-page":"543","DOI":"10.1038\/nature13385","volume":"511","author":"TCGA Network","year":"2014","unstructured":"Network TCGA. Comprehensive molecular profiling of lung adenocarcinoma[J]. Nature. 2014;511(7511):543\u201350.","journal-title":"Nature"},{"issue":"1","key":"3172_CR38","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1186\/1471-2105-7-91","volume":"7","author":"S Varma","year":"2006","unstructured":"Varma S, Simon R. Bias in error estimation when using cross-validation for model selection[J]. BMC bioinformatics. 2006;7(1):91.","journal-title":"BMC bioinformatics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-019-3172-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12859-019-3172-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-019-3172-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,13]],"date-time":"2020-11-13T00:11:31Z","timestamp":1605226291000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-019-3172-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,14]]},"references-count":38,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,12]]}},"alternative-id":["3172"],"URL":"https:\/\/doi.org\/10.1186\/s12859-019-3172-z","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11,14]]},"assertion":[{"value":"30 October 2018","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 October 2019","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 November 2019","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"578"}}