{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T16:39:50Z","timestamp":1772210390566,"version":"3.50.1"},"reference-count":48,"publisher":"Oxford University Press (OUP)","issue":"24","license":[{"start":{"date-parts":[[2021,7,21]],"date-time":"2021-07-21T00:00:00Z","timestamp":1626825600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61873089"],"award-info":[{"award-number":["61873089"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61772313"],"award-info":[{"award-number":["61772313"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U1836216"],"award-info":[{"award-number":["U1836216"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Major Fundamental Research Project of Shandong Province","award":["ZR2019ZD03"],"award-info":[{"award-number":["ZR2019ZD03"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,12,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Cancer subtype identification aims to divide cancer patients into subgroups with distinct clinical phenotypes and facilitate the development for subgroup specific therapies. The massive amount of multi-omics datasets accumulated in the public databases have provided unprecedented opportunities to fulfill this task. As a result, great computational efforts have been made to accurately identify cancer subtypes via integrative analysis of these multi-omics datasets.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this article, we propose a Consensus Guided Graph Autoencoder (CGGA) to effectively identify cancer subtypes. First, we learn for each omic a new feature matrix by using graph autoencoders, where both structure information and node features can be effectively incorporated during the learning process. Second, we learn a set of omic-specific similarity matrices together with a consensus matrix based on the features obtained in the first step. The learned omic-specific similarity matrices are then fed back to the graph autoencoders to guide the feature learning. By iterating the two steps above, our method obtains a final consensus similarity matrix for cancer subtyping. To comprehensively evaluate the prediction performance of our method, we compare CGGA with several approaches ranging from general-purpose multi-view clustering algorithms to multi-omics-specific integrative methods. The experimental results on both generic datasets and cancer datasets confirm the superiority of our method. Moreover, we validate the effectiveness of our method in leveraging multi-omics datasets to identify cancer subtypes. In addition, we investigate the clinical implications of the obtained clusters for glioblastoma and provide new insights into the treatment for patients with different subtypes.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availabilityand implementation<\/jats:title>\n                  <jats:p>The source code of our method is freely available at https:\/\/github.com\/alcs417\/CGGA.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab535","type":"journal-article","created":{"date-parts":[[2021,7,19]],"date-time":"2021-07-19T11:15:30Z","timestamp":1626693330000},"page":"4779-4786","source":"Crossref","is-referenced-by-count":28,"title":["Cancer subtype identification by consensus guided graph autoencoders"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3832-0969","authenticated-orcid":false,"given":"Cheng","family":"Liang","sequence":"first","affiliation":[{"name":"School of Information Science and Engineering, Shandong Normal University , Jinan 250358, China"}]},{"given":"Mingchao","family":"Shang","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, Shandong Normal University , Jinan 250358, China"}]},{"given":"Jiawei","family":"Luo","sequence":"additional","affiliation":[{"name":"College of Computer Science and Electronic Engineering, Hunan University , Changsha 410082, China"}]}],"member":"286","published-online":{"date-parts":[[2021,7,21]]},"reference":[{"key":"2023051607130293000_btab535-B1","doi-asserted-by":"crossref","first-page":"462","DOI":"10.1016\/j.cell.2013.09.034","article-title":"The somatic genomic landscape of glioblastoma","volume":"155","author":"Brennan","year":"2013","journal-title":"Cell"},{"key":"2023051607130293000_btab535-B2","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1186\/s12920-017-0306-x","article-title":"Subtype identification from heterogeneous TCGA datasets on a genomic scale by multi-view clustering with enhanced consensus","volume":"10","author":"Cai","year":"2017","journal-title":"BMC Med. Genomics"},{"key":"2023051607130293000_btab535-B3","doi-asserted-by":"crossref","first-page":"1113","DOI":"10.1038\/ng.2764","article-title":"The Cancer Genome Atlas Pan-Cancer analysis project","volume":"45","year":"2013","journal-title":"Nat. Genet"},{"key":"2023051607130293000_btab535-B4","doi-asserted-by":"crossref","first-page":"5679","DOI":"10.1038\/s41467-019-13528-0","article-title":"Pan-cancer molecular subtypes revealed by mass-spectrometry-based proteomic characterization of more than 500 human cancers","volume":"10","author":"Chen","year":"2019","journal-title":"Nat. Commun"},{"key":"2023051607130293000_btab535-B5","doi-asserted-by":"crossref","first-page":"1271","DOI":"10.1109\/TIP.2019.2940693","article-title":"Scalable deep hashing for large-scale social image retrieval","volume":"29","author":"Cui","year":"2020","journal-title":"IEEE Trans. Image Process"},{"key":"2023051607130293000_btab535-B6","first-page":"2929","article-title":"Breast cancer intrinsic subtype classification, clinical use and future trends","volume":"5","author":"Dai","year":"2015","journal-title":"Am. J. Cancer Res"},{"key":"2023051607130293000_btab535-B7","author":"Dua","year":"2019"},{"key":"2023051607130293000_btab535-B8","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1016\/j.cviu.2005.09.012","article-title":"Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories","volume":"106","author":"Fei-Fei","year":"2007","journal-title":"Comput. Vis. Image Understand"},{"key":"2023051607130293000_btab535-B9","first-page":"377","author":"Greene","year":"2006"},{"key":"2023051607130293000_btab535-B10","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1016\/j.ebiom.2019.06.034","article-title":"Development and validation of radiomic signatures of head and neck squamous cell carcinoma molecular features and subtypes","volume":"45","author":"Huang","year":"2019","journal-title":"EBioMedicine"},{"key":"2023051607130293000_btab535-B11","first-page":"3569","author":"Huang","year":"2015"},{"key":"2023051607130293000_btab535-B12","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1177\/1758834013517414","article-title":"Tumor heterogeneity in the clinic: is it a real problem?","volume":"6","author":"Janku","year":"2014","journal-title":"Ther. Adv. Med. Oncol"},{"key":"2023051607130293000_btab535-B13","author":"Jiang","year":"2019"},{"key":"2023051607130293000_btab535-B14","doi-asserted-by":"crossref","first-page":"20","DOI":"10.3389\/fgene.2019.00020","article-title":"Discovering cancer subtypes via an accurate fusion strategy on multiple profile data","volume":"10","author":"Jiang","year":"2019","journal-title":"Front. Genet"},{"key":"2023051607130293000_btab535-B15","author":"Kipf","year":"2016"},{"key":"2023051607130293000_btab535-B16","author":"Kipf","year":"2016"},{"key":"2023051607130293000_btab535-B17","doi-asserted-by":"crossref","first-page":"1492","DOI":"10.1038\/s41416-018-0109-7","article-title":"Cancer subtype identification using somatic mutation data","volume":"118","author":"Kuijjer","year":"2018","journal-title":"Br. J. Cancer"},{"key":"2023051607130293000_btab535-B18","first-page":"393","author":"Kumar","year":"2011"},{"key":"2023051607130293000_btab535-B19","first-page":"1413","author":"Kumar","year":"2011"},{"key":"2023051607130293000_btab535-B20","first-page":"325","article-title":"A review on machine learning principles for multi-view biological data integration","volume":"19","author":"Li","year":"2018","journal-title":"Brief. Bioinform"},{"key":"2023051607130293000_btab535-B21","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1093\/biostatistics\/kxx017","article-title":"A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data","volume":"19","author":"Mo","year":"2018","journal-title":"Biostatistics"},{"key":"2023051607130293000_btab535-B22","author":"Nene","year":"1996"},{"key":"2023051607130293000_btab535-B23","doi-asserted-by":"crossref","first-page":"e1007677","DOI":"10.1371\/journal.pcbi.1007677","article-title":"Multiview learning for understanding functional multiomics","volume":"16","author":"Nguyen","year":"2020","journal-title":"PLoS Comput. Biol"},{"key":"2023051607130293000_btab535-B24","doi-asserted-by":"crossref","first-page":"2025","DOI":"10.1101\/gr.215129.116","article-title":"A novel approach for data integration and disease subtyping","volume":"27","author":"Nguyen","year":"2017","journal-title":"Genome Res"},{"key":"2023051607130293000_btab535-B25","first-page":"977","author":"Nie","year":"2014"},{"key":"2023051607130293000_btab535-B26","first-page":"2022","author":"Nie","year":"2018"},{"key":"2023051607130293000_btab535-B27","doi-asserted-by":"crossref","first-page":"2877","DOI":"10.1093\/bioinformatics\/btw324","article-title":"R.JIVE for exploration of multi-source molecular data","volume":"32","author":"O'Connell","year":"2016","journal-title":"Bioinformatics"},{"key":"2023051607130293000_btab535-B28","doi-asserted-by":"crossref","first-page":"10546","DOI":"10.1093\/nar\/gky889","article-title":"Multi-omic and multi-view clustering algorithms: review and cancer benchmark","volume":"46","author":"Rappoport","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023051607130293000_btab535-B29","doi-asserted-by":"crossref","first-page":"e47","DOI":"10.1093\/nar\/gkv007","article-title":"limma powers differential expression analyses for RNA-sequencing and microarray studies","volume":"43","author":"Ritchie","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023051607130293000_btab535-B30","author":"Salha","year":"2019"},{"key":"2023051607130293000_btab535-B31","doi-asserted-by":"crossref","first-page":"eaba1862","DOI":"10.1126\/sciadv.aba1862","article-title":"Matching cell lines with cancer type and subtype of origin via mutational, epigenomic, and transcriptomic patterns","volume":"6","author":"Salvadores","year":"2020","journal-title":"Sci. Adv"},{"key":"2023051607130293000_btab535-B32","doi-asserted-by":"crossref","first-page":"2906","DOI":"10.1093\/bioinformatics\/btp543","article-title":"Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis","volume":"25","author":"Shen","year":"2009","journal-title":"Bioinformatics"},{"key":"2023051607130293000_btab535-B33","doi-asserted-by":"crossref","first-page":"i268","DOI":"10.1093\/bioinformatics\/btv244","article-title":"Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery","volume":"31","author":"Speicher","year":"2015","journal-title":"Bioinformatics"},{"key":"2023051607130293000_btab535-B34","doi-asserted-by":"crossref","first-page":"1724","DOI":"10.1109\/TMM.2018.2889560","article-title":"Learning a joint affinity graph for multiview subspace clustering","volume":"21","author":"Tang","year":"2019","journal-title":"IEEE Trans. Multimedia"},{"key":"2023051607130293000_btab535-B35","doi-asserted-by":"crossref","first-page":"5237","DOI":"10.1093\/bioinformatics\/btaa655","article-title":"PAMOGK: a Pathway Graph Kernel based Multi-Omics Approach for Patient Clustering","volume":"36","author":"Tepeli","year":"2020","journal-title":"Bioinformatics"},{"key":"2023051607130293000_btab535-B36","doi-asserted-by":"crossref","first-page":"i237","DOI":"10.1093\/bioinformatics\/btq182","article-title":"Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM","volume":"26","author":"Vaske","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051607130293000_btab535-B37","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1016\/j.ccr.2009.12.020","article-title":"Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1","volume":"17","author":"Verhaak","year":"2010","journal-title":"Cancer Cell"},{"key":"2023051607130293000_btab535-B38","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1038\/nmeth.2810","article-title":"Similarity network fusion for aggregating data types on a genomic scale","volume":"11","author":"Wang","year":"2014","journal-title":"Nat. Methods"},{"key":"2023051607130293000_btab535-B39","first-page":"889","author":"Wang","year":"2017"},{"key":"2023051607130293000_btab535-B40","doi-asserted-by":"crossref","first-page":"1116","DOI":"10.1109\/TKDE.2019.2903810","article-title":"GMC: graph-based multi-view clustering","volume":"32","author":"Wang","year":"2020","journal-title":"IEEE Trans. Knowl. Data Eng"},{"key":"2023051607130293000_btab535-B41","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1109\/TPAMI.2018.2875002","article-title":"Detecting coherent groups in crowd scenes by multiview clustering","volume":"42","author":"Wang","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023051607130293000_btab535-B42","doi-asserted-by":"crossref","first-page":"1022","DOI":"10.1186\/s12864-015-2223-8","article-title":"Fast dimension reduction and integrative clustering of multi-omics data using low-rank approximation: application to cancer molecular classification","volume":"16","author":"Wu","year":"2015","journal-title":"BMC Genomics"},{"key":"2023051607130293000_btab535-B43","first-page":"6861","author":"Wu","year":"2019"},{"key":"2023051607130293000_btab535-B44","doi-asserted-by":"crossref","first-page":"236","DOI":"10.3389\/fgene.2019.00236","article-title":"Simultaneous interrogation of cancer omics to identify subtypes with significant clinical differences","volume":"10","author":"Xu","year":"2019","journal-title":"Front. Genet"},{"key":"2023051607130293000_btab535-B45","first-page":"5356","author":"Xu","year":"2016"},{"key":"2023051607130293000_btab535-B46","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/bioinformatics\/btv544","article-title":"A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data","volume":"32","author":"Yang","year":"2016","journal-title":"Bioinformatics"},{"key":"2023051607130293000_btab535-B47","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1089\/omi.2011.0118","article-title":"clusterProfiler: an R package for comparing biological themes among gene clusters","volume":"16","author":"Yu","year":"2012","journal-title":"Omics J. Integr. Biol"},{"key":"2023051607130293000_btab535-B48","first-page":"1601","author":"Zelnik-Manor","year":"2004"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab535\/39590329\/btab535.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/24\/4779\/50334705\/btab535.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/24\/4779\/50334705\/btab535.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T07:39:37Z","timestamp":1684222777000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/24\/4779\/6325018"}},"subtitle":[],"editor":[{"given":"Janet","family":"Kelso","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,7,21]]},"references-count":48,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2021,12,11]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab535","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,12,15]]},"published":{"date-parts":[[2021,7,21]]}}}