{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T05:39:25Z","timestamp":1775799565186,"version":"3.50.1"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2021,11,11]],"date-time":"2021-11-11T00:00:00Z","timestamp":1636588800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,1,27]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Accurate disease diagnosis and prognosis based on omics data rely on the effective identification of robust prognostic and diagnostic markers that reflect the states of the biological processes underlying the disease pathogenesis and progression. In this article, we present GCNCC, a Graph Convolutional Network-based approach for Clustering and Classification, that can identify highly effective and robust network-based disease markers. Based on a geometric deep learning framework, GCNCC learns deep network representations by integrating gene expression data with protein interaction data to identify highly reproducible markers with consistently accurate prediction performance across independent datasets possibly from different platforms. 
GCNCC identifies these markers by clustering the nodes in the protein interaction network based on latent similarity measures learned by the deep architecture of a graph convolutional network, followed by a supervised feature selection procedure that extracts clusters that are highly predictive of the disease state.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>By benchmarking GCNCC based on independent datasets from different diseases (psychiatric disorder and cancer) and different platforms (microarray and RNA-seq), we show that GCNCC outperforms other state-of-the-art methods in terms of accuracy and reproducibility.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>https:\/\/github.com\/omarmaddouri\/GCNCC.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab772","type":"journal-article","created":{"date-parts":[[2021,11,5]],"date-time":"2021-11-05T12:23:59Z","timestamp":1636115039000},"page":"1075-1086","source":"Crossref","is-referenced-by-count":10,"title":["Deep graph representations embed network information for robust disease marker identification"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0305-0348","authenticated-orcid":false,"given":"Omar","family":"Maddouri","sequence":"first","affiliation":[{"name":"Department of Electrical and Computer Engineering, Texas A&M University , College Station, TX 77843, USA"}]},{"given":"Xiaoning","family":"Qian","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Texas A&M University , College Station, TX 77843, USA"},{"name":"Computational Science Initiative, Brookhaven National Laboratory , Upton, NY 11973, 
USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9328-1101","authenticated-orcid":false,"given":"Byung-Jun","family":"Yoon","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Texas A&M University , College Station, TX 77843, USA"},{"name":"Computational Science Initiative, Brookhaven National Laboratory , Upton, NY 11973, USA"}]}],"member":"286","published-online":{"date-parts":[[2021,11,11]]},"reference":[{"key":"2023020108525139100_btab772-B1","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1089\/106652700750050943","article-title":"Tissue classification with gene expression profiles","volume":"7","author":"Ben-Dor","year":"2000","journal-title":"J. Comput. Biol"},{"key":"2023020108525139100_btab772-B2","doi-asserted-by":"crossref","first-page":"617395","DOI":"10.3389\/fpsyt.2020.617395","article-title":"Editorial: comorbidity and autism spectrum disorder","volume":"11","author":"Casanova","year":"2020","journal-title":"Front. Psychiatry"},{"key":"2023020108525139100_btab772-B3","doi-asserted-by":"crossref","first-page":"e1002820","DOI":"10.1371\/journal.pcbi.1002820","article-title":"Network biology approach to complex diseases","volume":"8","author":"Cho","year":"2012","journal-title":"PLoS Comput. Biol"},{"key":"2023020108525139100_btab772-B4","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1038\/msb4100180","article-title":"Network-based classification of breast cancer metastasis","volume":"3","author":"Chuang","year":"2007","journal-title":"Mol. Syst. Biol"},{"key":"2023020108525139100_btab772-B5","first-page":"pp. 
3844","article-title":"Convolutional neural networks on graphs with fast localized spectral filtering","author":"Defferrard","year":"2016","journal-title":"Proceedings of the 30th International Conference on Neural Information Processing Systems (NIPS'16),"},{"key":"2023020108525139100_btab772-B6","author":"Dongen","year":"2000"},{"key":"2023020108525139100_btab772-B7","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1093\/bib\/bbs032","article-title":"A comparative analysis of biclustering algorithms for gene expression data","volume":"14","author":"Eren","year":"2013","journal-title":"Brief. Bioinf"},{"key":"2023020108525139100_btab772-B8","doi-asserted-by":"crossref","first-page":"972","DOI":"10.1126\/science.1136800","article-title":"Clustering by passing messages between data points","volume":"315","author":"Frey","year":"2007","journal-title":"Science"},{"key":"2023020108525139100_btab772-B9","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1126\/science.aad6469","article-title":"Shared molecular neuropathology across major psychiatric disorders parallels polygenic overlap","volume":"359","author":"Gandal","year":"2018","journal-title":"Science"},{"key":"2023020108525139100_btab772-B10","author":"Gao","year":"2018"},{"key":"2023020108525139100_btab772-B11","doi-asserted-by":"crossref","first-page":"6994","DOI":"10.1073\/pnas.0912708107","article-title":"A pathway-based classification of human breast cancer","volume":"107","author":"Gatza","year":"2010","journal-title":"Proc. Natl. Acad. Sci. 
USA"},{"key":"2023020108525139100_btab772-B12","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2023020108525139100_btab772-B13","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1016\/S0924-9338(02)80108-0","article-title":"Genome-wide expression analysis reveals dysregulation of myelination-related genes in chronic schizophrenia","volume":"17","author":"Hakak","year":"2002","journal-title":"Eur. Psychiatry"},{"key":"2023020108525139100_btab772-B14","doi-asserted-by":"crossref","first-page":"1108","DOI":"10.1038\/nmeth.2651","article-title":"Network-based stratification of tumor mutations","volume":"10","author":"Hofree","year":"2013","journal-title":"Nat. Methods"},{"key":"2023020108525139100_btab772-B15","doi-asserted-by":"crossref","first-page":"633","DOI":"10.1038\/ni.2587","article-title":"Identification of transcriptional regulators in the mouse immune system","volume":"14","author":"Jojic","year":"2013","journal-title":"Nat. Immunol"},{"key":"2023020108525139100_btab772-B16","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1016\/j.compbiomed.2007.11.001","article-title":"Techniques for clustering gene expression data","volume":"38","author":"Kerr","year":"2008","journal-title":"Comput. Biol. Med"},{"key":"2023020108525139100_btab772-B17","doi-asserted-by":"crossref","first-page":"618461","DOI":"10.1155\/2013\/618461","article-title":"Identification of robust pathway markers for cancer through rank-based pathway activity inference","volume":"2013","author":"Khunlertgit","year":"2013","journal-title":"Adv. 
Bioinf"},{"key":"2023020108525139100_btab772-B18","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1186\/s12859-016-1224-1","article-title":"Incorporating topological information for predicting robust cancer subnetwork markers in human protein\u2013protein interaction network","volume":"17","author":"Khunlertgit","year":"2016","journal-title":"BMC Bioinformatics"},{"key":"2023020108525139100_btab772-B19","author":"Kipf","year":"2016"},{"key":"2023020108525139100_btab772-B20","doi-asserted-by":"crossref","first-page":"W90","DOI":"10.1093\/nar\/gkw377","article-title":"Enrichr: a comprehensive gene set enrichment analysis web server 2016 update","volume":"44","author":"Kuleshov","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023020108525139100_btab772-B21","doi-asserted-by":"crossref","first-page":"e1000217","DOI":"10.1371\/journal.pcbi.1000217","article-title":"Inferring pathway activity toward precise disease classification","volume":"4","author":"Lee","year":"2008","journal-title":"PLoS Comput. Biol"},{"key":"2023020108525139100_btab772-B22","author":"Maddouri","year":"2021"},{"key":"2023020108525139100_btab772-B23","doi-asserted-by":"crossref","first-page":"796","DOI":"10.1038\/nmeth.2016","article-title":"Wisdom of crowds for robust gene network inference","volume":"9","author":"Marbach","year":"2012","journal-title":"Nat. Methods"},{"key":"2023020108525139100_btab772-B24","article-title":"Psychiatric genome-wide association study analyses implicate neuronal, immune and histone pathways","volume":"18","year":"2015","journal-title":"Nature Neuroscience"},{"key":"2023020108525139100_btab772-B25","doi-asserted-by":"crossref","first-page":"8577","DOI":"10.1073\/pnas.0601602103","article-title":"Modularity and community structure in networks","volume":"103","author":"Newman","year":"2006","journal-title":"Proc. Natl. Acad. Sci. 
USA"},{"key":"2023020108525139100_btab772-B26","doi-asserted-by":"crossref","first-page":"D833","DOI":"10.1093\/nar\/gkw943","article-title":"DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants","volume":"45","author":"Pi\u00f1ero","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023020108525139100_btab772-B27","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1038\/ng1060","article-title":"A molecular signature of metastasis in primary solid tumors","volume":"33","author":"Ramaswamy","year":"2003","journal-title":"Nat. Genet"},{"key":"2023020108525139100_btab772-B28","doi-asserted-by":"crossref","first-page":"e1002367","DOI":"10.1371\/journal.pgen.1002367","article-title":"Integrating genome-wide genetic variations and monocyte expression data reveals trans-regulated gene modules in humans","volume":"7","author":"Rotival","year":"2011","journal-title":"PLoS Genet"},{"key":"2023020108525139100_btab772-B29","doi-asserted-by":"crossref","first-page":"e1003252","DOI":"10.1371\/journal.pcbi.1003252","article-title":"Integrated module and gene-specific regulatory inference implicates upstream signaling networks","volume":"9","author":"Roy","year":"2013","journal-title":"PLoS Comput. Biol"},{"key":"2023020108525139100_btab772-B30","doi-asserted-by":"crossref","first-page":"1090","DOI":"10.1038\/s41467-018-03424-4","article-title":"A comprehensive evaluation of module detection methods for gene expression data","volume":"9","author":"Saelens","year":"2018","journal-title":"Nat. 
Commun"},{"key":"2023020108525139100_btab772-B31","first-page":"327","article-title":"Kernel principal component analysis","author":"Sch\u00f6lkopf","year":"1999","journal-title":"Advances in Kernel Methods: Support Vector Learning"},{"key":"2023020108525139100_btab772-B32","doi-asserted-by":"crossref","first-page":"717","DOI":"10.1038\/nrmicro2419","article-title":"Advantages and limitations of current network inference methods","volume":"8","author":"Smet","year":"2010","journal-title":"Nat. Rev. Microbiol"},{"key":"2023020108525139100_btab772-B33","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1093\/bioinformatics\/18.1.207","article-title":"Genesis: cluster analysis of microarray data","volume":"18","author":"Sturn","year":"2002","journal-title":"Bioinformatics"},{"key":"2023020108525139100_btab772-B34","doi-asserted-by":"crossref","first-page":"e8161","DOI":"10.1371\/journal.pone.0008161","article-title":"Accurate and reliable cancer classification based on probabilistic inference of pathway activity","volume":"4","author":"Su","year":"2009","journal-title":"PLoS One"},{"key":"2023020108525139100_btab772-B35","doi-asserted-by":"crossref","first-page":"S8","DOI":"10.1186\/1471-2105-11-S6-S8","article-title":"Identification of diagnostic subnetwork markers for cancer in human protein\u2013protein interaction network","volume":"11","author":"Su","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023020108525139100_btab772-B36","author":"Su","year":"2010"},{"key":"2023020108525139100_btab772-B37","doi-asserted-by":"crossref","first-page":"D447","DOI":"10.1093\/nar\/gku1003","article-title":"STRING v10: protein\u2013protein interaction networks, integrated over the tree of life","volume":"43","author":"Szklarczyk","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023020108525139100_btab772-B38","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1146\/annurev.genom.5.061903.180050","article-title":"Autism as a paradigmatic 
complex genetic disorder","volume":"5","author":"Veenstra-VanderWeele","year":"2004","journal-title":"Annu. Rev. Genomics Hum. Genet"},{"key":"2023020108525139100_btab772-B39","author":"Wang","year":"2020"},{"key":"2023020108525139100_btab772-B40","first-page":"80","article-title":"Extracting a biologically relevant latent space from cancer transcriptomes with variational autoencoders","volume":"23","author":"Way","year":"2018","journal-title":"Pac. Symposium Biocomput"},{"key":"2023020108525139100_btab772-B41","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1038\/nature11981","article-title":"Dynamic regulatory network controlling TH17 cell differentiation","volume":"496","author":"Yosef","year":"2013","journal-title":"Nature"},{"key":"2023020108525139100_btab772-B42","doi-asserted-by":"crossref","first-page":"Article17","DOI":"10.2202\/1544-6115.1128","article-title":"A general framework for weighted gene co-expression network analysis","volume":"4","author":"Zhang","year":"2005","journal-title":"Stat. Appl. Genet. Mol. 
Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab772\/41381121\/btab772.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/4\/1075\/49008578\/btab772.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/4\/1075\/49008578\/btab772.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,12]],"date-time":"2023-11-12T08:53:04Z","timestamp":1699779184000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/4\/1075\/6425669"}},"subtitle":[],"editor":[{"given":"Jinbo","family":"Xu","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,11,11]]},"references-count":42,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2022,1,27]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab772","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,2,15]]},"published":{"date-parts":[[2021,11,11]]}}}