{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,16]],"date-time":"2026-02-16T20:22:59Z","timestamp":1771273379877,"version":"3.50.1"},"reference-count":47,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2020,9,7]],"date-time":"2020-09-07T00:00:00Z","timestamp":1599436800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61533011"],"award-info":[{"award-number":["61533011"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U1806202"],"award-info":[{"award-number":["U1806202"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61877064"],"award-info":[{"award-number":["61877064"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,5,20]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Cancer is a highly heterogeneous disease caused by dysregulation in different cell types and tissues. However, different cancers may share common mechanisms. It is critical to identify decisive genes involved in the development and progression of cancer, and joint analysis of multiple cancers may help to discover overlapping mechanisms among different cancers. In this study, we proposed a fusion feature selection framework attributed to ensemble method named Fisher score and Gradient Boosting Decision Tree (FS\u2013GBDT) to select robust and decisive feature genes in high-dimensional gene expression datasets. Joint analysis of 11 human cancers types was conducted to explore the key feature genes subset of cancer. To verify the efficacy of FS\u2013GBDT, we compared it with four other common feature selection algorithms by Support Vector Machine (SVM) classifier. The algorithm achieved highest indicators, outperforms other four methods. In addition, we performed gene ontology analysis and literature validation of the key gene subset, and this subset were classified into several functional modules. Functional modules can be used as markers of disease to replace single gene which is difficult to be found repeatedly in applications of gene chip, and to study the core mechanisms of cancer.<\/jats:p>","DOI":"10.1093\/bib\/bbaa189","type":"journal-article","created":{"date-parts":[[2020,8,15]],"date-time":"2020-08-15T11:10:55Z","timestamp":1597489855000},"source":"Crossref","is-referenced-by-count":35,"title":["FS\u2013GBDT: identification multicancer-risk module via a feature selection algorithm by integrating Fisher score and GBDT"],"prefix":"10.1093","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7661-623X","authenticated-orcid":false,"given":"Jialin","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Mathematics and Statistics at Shandong University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Da","family":"Xu","sequence":"additional","affiliation":[{"name":"School of Mathematics and Statistics at Shandong University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kaijing","family":"Hao","sequence":"additional","affiliation":[{"name":"School of Mathematics and Statistics at Shandong University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yusen","family":"Zhang","sequence":"additional","affiliation":[{"name":"academic leader of Computer Engineering in Shandong University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Mathematics and Statistics at Shandong University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiaguo","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Mathematics and Statistics at Shandong University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rui","family":"Gao","sequence":"additional","affiliation":[{"name":"School of Control Science and Engineering, Shandong University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chuanyan","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Intelligent Engineering in Shandong Management University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yang","family":"De Marinis","sequence":"additional","affiliation":[{"name":"Diabetes Centre at Lund University, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2020,9,7]]},"reference":[{"key":"2021052110411676300_ref1","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2021052110411676300_ref2","first-page":"5974","article-title":"Analysis of gene expression identifies candidate markers and pharmacological targets in prostate cancer","volume":"61","author":"Welsh","year":"2001","journal-title":"Cancer Res"},{"key":"2021052110411676300_ref3","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"5439","author":"Golub","year":"1999","journal-title":"Science"},{"issue":"4","key":"2021052110411676300_ref4","doi-asserted-by":"crossref","first-page":"0","DOI":"10.1016\/S0002-9440(10)62551-5","article-title":"Discovery of novel tumor markers of pancreatic cancer using global gene expression technology","volume":"160","author":"Iacobuzio-Donahue","year":"2002","journal-title":"Am J Pathol"},{"issue":"2004","key":"2021052110411676300_ref5","first-page":"22","article-title":"Gene expression profiles and molecular markers to predict recurrence of Dukes' B colon cancer","volume":"9","author":"Wang","year":"1564","journal-title":"J Clin Oncol Off J Am Soc Clin Oncol"},{"issue":"19","key":"2021052110411676300_ref6","doi-asserted-by":"crossref","first-page":"3741","DOI":"10.1093\/bioinformatics\/bti618","article-title":"Analysis of recursive gene selection approaches from microarray data","volume":"21","author":"Li","journal-title":"Bioinformatics"},{"key":"2021052110411676300_ref7","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1023\/A:1012487302797","article-title":"Gene selection for cancer classification using support vector machines","volume":"46","author":"Guyon","year":"2002","journal-title":"Mach Learn"},{"issue":"5","key":"2021052110411676300_ref8","first-page":"475","article-title":"A survey of dimension reduction techniques","volume":"7","author":"Fodor","year":"2002","journal-title":"Neoplasia"},{"key":"2021052110411676300_ref9","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1109\/CSB.2003.1227396","volume-title":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","author":"Ding","year":"2003"},{"issue":"1","key":"2021052110411676300_ref10","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1016\/j.compbiolchem.2007.09.005","article-title":"Improved binary PSO for feature selection using gene expression data","volume":"32","author":"Chuang","year":"2008","journal-title":"Comput Biol Chem"},{"key":"2021052110411676300_ref11","doi-asserted-by":"crossref","first-page":"1131","DOI":"10.1109\/TCBB.2014.2344655","article-title":"GECC: gene expression based ensemble classification of colon samples","volume":"11","author":"Rathore","year":"2014","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2021052110411676300_ref12","article-title":"An introduction of variable and feature selection","volume":"3","author":"Guyon","year":"2003","journal-title":"J Mach Learn Res"},{"issue":"8","key":"2021052110411676300_ref13","doi-asserted-by":"crossref","first-page":"1259","DOI":"10.1109\/TCYB.2013.2281820","article-title":"Feature selection inspired classifier ensemble reduction","volume":"44","author":"Diao","year":"2014","journal-title":"IEEE Trans Cybern"},{"key":"2021052110411676300_ref14","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1093\/nar\/30.1.207","article-title":"Gene expression omnibus: NCBI gene expression and hybridization array data repository","volume":"30","author":"Edgar","year":"2002","journal-title":"Nucleic Acids Res"},{"issue":"8","key":"2021052110411676300_ref15","doi-asserted-by":"crossref","first-page":"3146","DOI":"10.1158\/0008-5472.CAN-04-2490","article-title":"Progression of Barrett's metaplasia to adenocarcinoma is associated with the suppression of the transcriptional programs of epidermal differentiation","volume":"65","author":"Kimchi","year":"2005","journal-title":"Cancer Res"},{"issue":"30","key":"2021052110411676300_ref16","article-title":"Identification of differentially expressed genes in cutaneous squamous cell carcinoma by microarray expression profiling","volume":"5","author":"Nindl","year":"2006","journal-title":"Mol Cancer"},{"issue":"3","key":"2021052110411676300_ref17","doi-asserted-by":"crossref","first-page":"288","DOI":"10.1001\/archdermatol.2009.378","article-title":"Gene expression patterns of normal human skin, actinic keratosis, and squamous cell carcinoma: a spectrum of disease progression","volume":"146","author":"Padilla","year":"2010","journal-title":"Arch Dermatol"},{"issue":"5","key":"2021052110411676300_ref18","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1016\/j.ccr.2005.10.001","article-title":"Integrative genomic and proteomic analysis of prostate cancer reveals signatures of metastatic progression","volume":"8","author":"Varambally","year":"2005","journal-title":"Cancer Cell"},{"issue":"4","key":"2021052110411676300_ref19","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1002\/ijc.22769","article-title":"HPV related VIN: highly proliferative and diminished responsiveness to extracellular signals","volume":"121","author":"Santegoets","year":"2007","journal-title":"Int J Cancer"},{"issue":"12","key":"2021052110411676300_ref20","doi-asserted-by":"crossref","first-page":"2874","DOI":"10.1002\/ijc.26345","article-title":"Different DNA damage and cell cycle checkpoint control in low- and high-risk human papillomavirus infections of the vulva","volume":"130","author":"Santegoets","year":"2012","journal-title":"Int J Cancer"},{"issue":"55","key":"2021052110411676300_ref21","article-title":"Novel markers for differentiation of lobular and ductal invasive breast carcinomas by laser microdissection and microarray analysis","volume":"7","author":"Turashvili","year":"2007","journal-title":"BMC Cancer"},{"issue":"10","key":"2021052110411676300_ref22","doi-asserted-by":"crossref","first-page":"2153","DOI":"10.1038\/sj.leu.2404877","article-title":"Combined single nucleotide polymorphism-based genomic mapping and global gene expression profiling identifies novel chromosomal imbalances, mechanisms and candidate genes important in the pathogenesis of T-cell prolymphocytic leukemia with inv(14)(q11q32)","volume":"21","author":"D\u00fcrig","year":"2007","journal-title":"Leukemia"},{"issue":"9","key":"2021052110411676300_ref23","doi-asserted-by":"crossref","first-page":"e6986","DOI":"10.1371\/journal.pone.0006986","article-title":"A comprehensive microarray-based DNA methylation study of 367 hematological neoplasms","volume":"4","author":"Martin-Subero","year":"2009","journal-title":"PLoS One"},{"issue":"12","key":"2021052110411676300_ref24","doi-asserted-by":"crossref","first-page":"1263","DOI":"10.1158\/1541-7786.MCR-07-0267","article-title":"Transcriptome profile of human colorectal adenomas","volume":"5","author":"Sabates-Bellver","year":"2007","journal-title":"Mol Cancer Res"},{"key":"2021052110411676300_ref25","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1186\/1471-2164-9-69","article-title":"Transcriptomic dissection of tongue squamous cell carcinoma","volume":"9","author":"Ye","year":"2008","journal-title":"BMC Genomics"},{"issue":"2","key":"2021052110411676300_ref26","doi-asserted-by":"crossref","first-page":"e1651","DOI":"10.1371\/journal.pone.0001651","article-title":"Gene expression signature of cigarette smoking and its role in lung adenocarcinoma development and survival","volume":"3","author":"Landi","year":"2008","journal-title":"PLoS One"},{"issue":"3","key":"2021052110411676300_ref27","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1016\/j.ejca.2008.10.032","article-title":"Genome-wide expression profile of sporadic gastric cancers with microsatellite instability","volume":"45","author":"D'Errico","year":"2009","journal-title":"Eur J Cancer"},{"issue":"88","key":"2021052110411676300_ref28","first-page":"2016","article-title":"Combined gene expression analysis of whole-tissue and microdissected pancreatic ductal adenocarcinoma identifies genes specifically overexpressed in tumor epithelia","volume":"55","author":"Badea","year":"2008","journal-title":"Hepatogastroenterology"},{"issue":"32","key":"2021052110411676300_ref29","doi-asserted-by":"publisher","first-page":"53180","DOI":"10.18632\/oncotarget.18261","article-title":"Regulation of actin-binding protein ANLN by antitumor miR-217 inhibits cancer cell aggressiveness in pancreatic ductal adenocarcinoma","volume":"8","author":"Idichi","year":"2017","journal-title":"Oncotarget"},{"key":"2021052110411676300_ref30","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1093\/biostatistics\/4.2.249","article-title":"Exploration, normalization, and summaries of high density oligonucleotide array probe level data","volume":"4","author":"Irizarry","year":"2003","journal-title":"Biostatistics"},{"key":"2021052110411676300_ref31","volume-title":"Pattern Classification","author":"Duda","year":"2001"},{"key":"2021052110411676300_ref32","first-page":"856","volume-title":"Proceedings of the Twentieth International Conference on Machine Learning (ICML\u201903)","author":"Yu","year":"2003"},{"key":"2021052110411676300_ref33","doi-asserted-by":"publisher","first-page":"1106","DOI":"10.1109\/TCBB.2012.33","volume-title":"IEEE\/ACM Trans Comput Biol Bioinform","author":"Lazar","year":"2012"},{"key":"2021052110411676300_ref34","author":"Gu","year":"2012"},{"key":"2021052110411676300_ref35","doi-asserted-by":"crossref","first-page":"313","DOI":"10.1007\/978-3-540-87481-2_21","article-title":"Robust feature selection using ensemble feature selection techniques","author":"Saeys","year":"2008","journal-title":"J Eur Conf Mach Learn Knowl Discovery Databases"},{"key":"2021052110411676300_ref36","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1016\/j.patcog.2016.11.003","article-title":"A Survey on semi-supervised feature selection methods","volume":"64","author":"Sheikhpour","year":"2017","journal-title":"Pattern Recogn"},{"key":"2021052110411676300_ref37","volume-title":"Introduction to Machine Learning (Adaptive Computation and Machine Learning)","author":"Alpaydn","year":"2004"},{"key":"2021052110411676300_ref38","doi-asserted-by":"crossref","first-page":"1484","DOI":"10.1093\/bioinformatics\/btg182","article-title":"Class prediction and discovery using gene microarray and proteomics mass spectroscopy data: curses, caveats, cautions","volume":"19","author":"Somorjai","year":"2003","journal-title":"Bioinformatics"},{"issue":"8","key":"2021052110411676300_ref39","doi-asserted-by":"publisher","first-page":"2229","DOI":"10.1039\/c4mb00316k","article-title":"Identification of bacteriophage virion proteins by the ANOVA feature selection and analysis","volume":"10","author":"Ding","year":"2014","journal-title":"Mol Biosyst"},{"issue":"5","key":"2021052110411676300_ref40","doi-asserted-by":"publisher","first-page":"1063","DOI":"10.1039\/c3mb70489k","article-title":"Improving enzyme regulatory protein classification by means of SVM-RFE feature selection","volume":"10","author":"Fernandez-Lozano","year":"2014","journal-title":"Mol Biosyst"},{"key":"2021052110411676300_ref41","doi-asserted-by":"publisher","first-page":"517","DOI":"10.1109\/ICPR.2014.99","volume-title":"22nd International Conference on Pattern Recognition","year":"2014"},{"key":"2021052110411676300_ref42","doi-asserted-by":"publisher","first-page":"14","DOI":"10.1016\/j.jneumeth.2017.12.010.0","article-title":"Random forest feature selection, fusion and ensemble strategy: combining multiple morphological MRI measures to discriminate among healhy elderly, MCI, cMCI and alzheimer's disease patients: from the alzheimer's disease neuroimaging initiative (ADNI) database","volume":"302","author":"Dimitriadis","year":"2018","journal-title":"J Neurosci Methods"},{"key":"2021052110411676300_ref43","doi-asserted-by":"crossref","first-page":"696","DOI":"10.1038\/s41568-018-0060-1","article-title":"The COSMIC cancer gene census: describing genetic dysfunction across all human cancers","volume":"18","author":"Sondka","year":"2018","journal-title":"Nat Rev Cancer"},{"key":"2021052110411676300_ref44","doi-asserted-by":"crossref","DOI":"10.1186\/s13059-018-1612-0","article-title":"The network of cancer genes (NCG): a comprehensive catalogue of known and candidate cancer genes from cancer sequencing screens","volume":"20","author":"Repana","year":"2019","journal-title":"Genome Biol"},{"key":"2021052110411676300_ref45","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1089\/omi.2011.0118","article-title":"clusterProfiler: an R package for comparing biological themes among gene clusters","volume":"16","author":"Yu","year":"2012","journal-title":"Omics"},{"key":"2021052110411676300_ref46","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1186\/1471-2105-5-81","article-title":"Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes","volume":"5","author":"Jiang","year":"2004","journal-title":"BMC Bioinf"},{"key":"2021052110411676300_ref47","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1016\/j.neuroimage.2014.01.021","article-title":"Sparse representation based biomarker selection for schizophrenia with integrated analysis of fMRI and SNPs","volume":"102","author":"Cao","year":"2014","journal-title":"Neuroimage"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/3\/bbaa189\/37965671\/bbaa189.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/3\/bbaa189\/37965671\/bbaa189.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,5,21]],"date-time":"2021-05-21T10:43:02Z","timestamp":1621593782000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaa189\/5901960"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,7]]},"references-count":47,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2021,5,20]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaa189","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,5]]},"published":{"date-parts":[[2020,9,7]]},"article-number":"bbaa189"}}