{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T19:50:56Z","timestamp":1772135456967,"version":"3.50.1"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2022,3,23]],"date-time":"2022-03-23T00:00:00Z","timestamp":1647993600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["12071273"],"award-info":[{"award-number":["12071273"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Shanghai Research Center for Data Science and Decision Technology"},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["CA204120"],"award-info":[{"award-number":["CA204120"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Platform of Public Health & Disease Control and Prevention"},{"name":"Major Innovation & Planning Interdisciplinary Platform for the \u2018Double-First Class\u2019 Initiative"},{"DOI":"10.13039\/501100004260","name":"Renmin University of China","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100004260","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,5,13]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Cancer genetic heterogeneity analysis has critical implications for tumour classification, response to therapy and choice of biomarkers to guide personalized cancer medicine. However, existing heterogeneity analysis based solely on molecular profiling data usually suffers from a lack of information and has limited effectiveness. Many biomedical and life sciences databases have accumulated a substantial volume of meaningful biological information. They can provide additional information beyond molecular profiling data, yet pose challenges arising from potential noise and uncertainty.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this study, we aim to develop a more effective heterogeneity analysis method with the help of prior information. A network-based penalization technique is proposed to innovatively incorporate a multi-view of prior information from multiple databases, which accommodates heterogeneity attributed to both differential genes and gene relationships. To account for the fact that the prior information might not be fully credible, we propose a weighted strategy, where the weight is determined dependent on the data and can ensure that the present model is not excessively disturbed by incorrect information. Simulation and analysis of The Cancer Genome Atlas glioblastoma multiforme data demonstrate the practical applicability of the proposed method.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>R code implementing the proposed method is available at https:\/\/github.com\/mengyunwu2020\/PECM. The data that support the findings in this paper are openly available in TCGA (The Cancer Genome Atlas) at https:\/\/portal.gdc.cancer.gov\/.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btac183","type":"journal-article","created":{"date-parts":[[2022,3,22]],"date-time":"2022-03-22T12:22:00Z","timestamp":1647951720000},"page":"2855-2862","source":"Crossref","is-referenced-by-count":5,"title":["Network-based cancer heterogeneity analysis incorporating multi-view of prior information"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6287-5094","authenticated-orcid":false,"given":"Yang","family":"Li","sequence":"first","affiliation":[{"name":"Center for Applied Statistics, School of Statistics, Statistical Consulting Center, and RSS and China-Re Life Joint Lab on Public Health and Risk Management, Renmin University of China, Beijing 100872, China"}]},{"given":"Shaodong","family":"Xu","sequence":"additional","affiliation":[{"name":"Center for Applied Statistics, School of Statistics, Statistical Consulting Center, and RSS and China-Re Life Joint Lab on Public Health and Risk Management, Renmin University of China, Beijing 100872, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9001-4999","authenticated-orcid":false,"given":"Shuangge","family":"Ma","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Yale School of Public Health , New Haven, CT 06520, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0970-4712","authenticated-orcid":false,"given":"Mengyun","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Statistics and Management, Shanghai University of Finance and Economics , Shanghai 200433, China"}]}],"member":"286","published-online":{"date-parts":[[2022,3,23]]},"reference":[{"key":"2023020109064074800_btac183-B1","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1016\/j.csda.2016.08.003","article-title":"A simple approach to sparse clustering","volume":"105","author":"Arias-Castro","year":"2017","journal-title":"Comput. Stat. Data Anal"},{"key":"2023020109064074800_btac183-B2","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/j.csda.2012.12.008","article-title":"Model-based clustering of high-dimensional data: a review","volume":"71","author":"Bouveyron","year":"2014","journal-title":"Comput. Stat. Data Anal"},{"key":"2023020109064074800_btac183-B3","first-page":"1265","article-title":"Sparse k-means with l\u221e\/l0 penalty for high-dimensional data clustering","volume":"28","author":"Chang","year":"2018","journal-title":"Stat. Sin"},{"key":"2023020109064074800_btac183-B4","doi-asserted-by":"crossref","first-page":"551","DOI":"10.1038\/nrg.2017.38","article-title":"Network propagation: a universal amplifier of genetic associations","volume":"18","author":"Cowen","year":"2017","journal-title":"Nat. Rev. Genet"},{"key":"2023020109064074800_btac183-B5","doi-asserted-by":"crossref","first-page":"492","DOI":"10.1093\/bib\/bbx124","article-title":"Evaluation of variable selection methods for random forests and omics data sets","volume":"20","author":"Degenhardt","year":"2019","journal-title":"Brief. Bioinform"},{"key":"2023020109064074800_btac183-B6","doi-asserted-by":"crossref","first-page":"698","DOI":"10.1111\/cpr.12291","article-title":"p53-dependent up-regulation of CDKN1A and down-regulation of CCNE2 in response to beryllium","volume":"49","author":"Gorjala","year":"2016","journal-title":"Cell Prolif"},{"key":"2023020109064074800_btac183-B7","first-page":"1","article-title":"Simultaneous clustering and estimation of heterogeneous graphical models","volume":"18","author":"Hao","year":"2018","journal-title":"J. Mach. Learn. Res"},{"key":"2023020109064074800_btac183-B8","doi-asserted-by":"crossref","first-page":"866","DOI":"10.1214\/15-AOAS813","article-title":"Multi-species distribution modeling using penalized mixture of regressions","volume":"9","author":"Hui","year":"2015","journal-title":"Ann. Appl. Stat"},{"key":"2023020109064074800_btac183-B9","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1186\/s12929-016-0269-9","article-title":"Zinc finger proteins in cancer progression","volume":"23","author":"Jen","year":"2016","journal-title":"J. Biomed. Sci"},{"key":"2023020109064074800_btac183-B10","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1080\/01621459.2015.1008363","article-title":"Variable selection with prior information for generalized linear models via the prior lasso method","volume":"111","author":"Jiang","year":"2016","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020109064074800_btac183-B11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v072.i05","article-title":"RSKC: an R package for a robust and sparse K-means clustering algorithm","volume":"72","author":"Kondo","year":"2016","journal-title":"J. Stat. Softw"},{"key":"2023020109064074800_btac183-B12","doi-asserted-by":"crossref","first-page":"2293","DOI":"10.1214\/12-AOS1037","article-title":"High dimensional semiparametric Gaussian copula graphical models","volume":"40","author":"Liu","year":"2012","journal-title":"Ann. Stat"},{"key":"2023020109064074800_btac183-B13","doi-asserted-by":"crossref","first-page":"bbaa395","DOI":"10.1093\/bib\/bbaa395","article-title":"Classification and gene selection of triple-negative breast cancer subtype embedding gene connectivity matrix in deep neural network","volume":"22","author":"Liu","year":"2021","journal-title":"Brief. Bioinform"},{"key":"2023020109064074800_btac183-B14","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1186\/1471-2105-15-37","article-title":"A network-assisted co-clustering algorithm to discover cancer subtypes based on gene expression","volume":"15","author":"Liu","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"2023020109064074800_btac183-B15","doi-asserted-by":"crossref","first-page":"2388","DOI":"10.1093\/bioinformatics\/btl393","article-title":"Semi-supervised learning via penalized mixture model with application to microarray sample classification","volume":"22","author":"Pan","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020109064074800_btac183-B16","doi-asserted-by":"crossref","first-page":"103620","DOI":"10.1016\/j.jbi.2020.103620","article-title":"Weighted dimensionality reduction and robust Gaussian mixture model based cancer patient subtyping from gene expression data","volume":"112","author":"Rafique","year":"2020","journal-title":"J. Biomed. Inform"},{"key":"2023020109064074800_btac183-B17","article-title":"Gaussian graphical model-based heterogeneity analysis via penalized fusion","author":"Ren","year":"2021","journal-title":"Biometrics"},{"key":"2023020109064074800_btac183-B18","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1038\/s41568-019-0133-9","article-title":"Molecular subtypes of small cell lung cancer: a synthesis of human and mouse model data","volume":"19","author":"Rudin","year":"2019","journal-title":"Nat. Rev. Cancer"},{"key":"2023020109064074800_btac183-B19","doi-asserted-by":"crossref","first-page":"e1007357","DOI":"10.1371\/journal.pcbi.1007357","article-title":"SourceSet: a graphical model approach to identify primary genes in perturbed biological pathways","volume":"15","author":"Salviato","year":"2019","journal-title":"PLoS Comput. Biol"},{"key":"2023020109064074800_btac183-B20","doi-asserted-by":"crossref","first-page":"3818","DOI":"10.1093\/bioinformatics\/btaa203","article-title":"Cancer subtype classification and modeling by pathway attention and propagation","volume":"36","author":"Sangseon","year":"2020","journal-title":"Bioinformatics"},{"key":"2023020109064074800_btac183-B21","doi-asserted-by":"crossref","first-page":"85","DOI":"10.3389\/fmed.2018.00085","article-title":"Overview on clinical relevance of intra-tumor heterogeneity","volume":"5","author":"Stanta","year":"2018","journal-title":"Front. Med"},{"key":"2023020109064074800_btac183-B22","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1038\/s41571-019-0241-1","article-title":"Biomarker-guided therapy for colorectal cancer: strength in complexity","volume":"17","author":"Sveen","year":"2020","journal-title":"Nat. Rev. Clin. Oncol"},{"key":"2023020109064074800_btac183-B23","doi-asserted-by":"crossref","DOI":"10.1523\/ENEURO.0066-20.2020","article-title":"Peripheral nerve single-cell analysis identifies mesenchymal ligands that promote axonal growth","volume":"7","author":"Toma","year":"2020","journal-title":"eNeuro"},{"key":"2023020109064074800_btac183-B24","doi-asserted-by":"crossref","first-page":"2247","DOI":"10.1093\/bioinformatics\/btm320","article-title":"Penalized and weighted K-means for clustering with scattered objects and prior information in high-throughput biological data","volume":"23","author":"Tseng","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020109064074800_btac183-B25","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1038\/s41576-019-0114-6","article-title":"Resolving genetic heterogeneity in cancer","volume":"20","author":"Turajlic","year":"2019","journal-title":"Nat. Rev. Genet"},{"key":"2023020109064074800_btac183-B26","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1016\/j.ccr.2009.12.020","article-title":"Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1","volume":"17","author":"Verhaak","year":"2010","journal-title":"Cancer Cell"},{"key":"2023020109064074800_btac183-B27","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1080\/10618600.2017.1377081","article-title":"Sparse convex clustering","volume":"27","author":"Wang","year":"2018","journal-title":"J. Comput. Graph. Stat"},{"key":"2023020109064074800_btac183-B28","doi-asserted-by":"crossref","first-page":"1620","DOI":"10.1002\/sim.8064","article-title":"Identifying gene-environment interactions incorporating prior information","volume":"38","author":"Wang","year":"2019","journal-title":"Stat. Med"},{"key":"2023020109064074800_btac183-B29","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/j.neucom.2020.10.105","article-title":"Convex clustering method for compositional data via sparse group lasso","volume":"425","author":"Wang","year":"2021","journal-title":"Neurocomputing"},{"key":"2023020109064074800_btac183-B30","doi-asserted-by":"crossref","first-page":"2633","DOI":"10.1093\/bioinformatics\/btt443","article-title":"Incorporating prior knowledge into gene network study","volume":"29","author":"Wang","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020109064074800_btac183-B31","doi-asserted-by":"crossref","first-page":"713","DOI":"10.1198\/jasa.2010.tm09415","article-title":"A framework for feature selection in clustering","volume":"105","author":"Witten","year":"2010","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020109064074800_btac183-B32","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-020-20225-w","article-title":"Glioblastoma epigenome profiling identifies SOX10 as a master regulator of molecular tumour subtype","volume":"11","author":"Wu","year":"2020","journal-title":"Nat. Commun"},{"key":"2023020109064074800_btac183-B33","article-title":"Information-incorporated Gaussian graphical model for gene expression data","author":"Yi","year":"2021","journal-title":"Biometrics"},{"key":"2023020109064074800_btac183-B34","doi-asserted-by":"crossref","first-page":"bbaa316","DOI":"10.1093\/bib\/bbaa316","article-title":"scGMAI: a Gaussian mixture model for clustering single-cell RNA-Seq data based on deep autoencoder","volume":"22","author":"Yu","year":"2021","journal-title":"Brief. Bioinform"},{"key":"2023020109064074800_btac183-B35","doi-asserted-by":"crossref","first-page":"2436","DOI":"10.1093\/bioinformatics\/btx208","article-title":"Incorporating prior information into differential network analysis using non-paranormal graphical models","volume":"33","author":"Zhang","year":"2017","journal-title":"Bioinformatics"},{"key":"2023020109064074800_btac183-B36","doi-asserted-by":"crossref","first-page":"1043","DOI":"10.1109\/TCYB.2019.2952711","article-title":"A joint graphical model for inferring gene networks across multiple subpopulations and data types","volume":"51","author":"Zhang","year":"2021","journal-title":"IEEE Trans. Cybern"},{"key":"2023020109064074800_btac183-B37","article-title":"Heterogeneity analysis via integrating multi-sources high-dimensional data with applications to cancer studies","author":"Zhong","year":"2021","journal-title":"Stat. Sin"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btac183\/43288812\/btac183.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/10\/2855\/49009746\/btac183.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/10\/2855\/49009746\/btac183.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T21:00:29Z","timestamp":1675285229000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/10\/2855\/6553013"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,3,23]]},"references-count":37,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2022,5,13]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btac183","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,5,15]]},"published":{"date-parts":[[2022,3,23]]}}}