{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T17:37:38Z","timestamp":1775324258535,"version":"3.50.1"},"reference-count":47,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2021,4,5]],"date-time":"2021-04-05T00:00:00Z","timestamp":1617580800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U1611265"],"award-info":[{"award-number":["U1611265"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["11631015"],"award-info":[{"award-number":["11631015"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,2]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>With diverse types of omics data widely available, many computational methods have been recently developed to integrate these heterogeneous data, providing a comprehensive understanding of diseases and biological mechanisms. But most of them hardly take noise effects into account. Data-specific patterns unique to data types also make it challenging to uncover the consistent patterns and learn a compact representation of multi-omics data. Here we present a multi-omics integration method considering these issues. We explicitly model the error term in data reconstruction and simultaneously consider noise effects and data-specific patterns. We utilize a denoised network regularization in which we build a fused network using a denoising procedure to suppress noise effects and data-specific patterns. The error term collaborates with the denoised network regularization to capture data-specific patterns. We solve the optimization problem via an inexact alternating minimization algorithm. A comparative simulation study shows the method\u2019s superiority at discovering common patterns among data types at three noise levels. Transcriptomics-and-epigenomics integration, in seven cancer cohorts from The Cancer Genome Atlas, demonstrates that the learned integrative representation extracted in an unsupervised manner can depict survival information. Specially in liver hepatocellular carcinoma, the learned integrative representation attains average Harrell\u2019s C-index of 0.78 in 10 times 3-fold cross-validation for survival prediction, which far exceeds competing methods, and we discover an aggressive subtype in liver hepatocellular carcinoma with this latent representation, which is validated by an external dataset GSE14520. We also show that DeFusion is applicable to the integration of other omics types.<\/jats:p>","DOI":"10.1093\/bib\/bbab057","type":"journal-article","created":{"date-parts":[[2021,2,5]],"date-time":"2021-02-05T15:17:39Z","timestamp":1612538259000},"source":"Crossref","is-referenced-by-count":22,"title":["DeFusion: a denoised network regularization framework for multi-omics integration"],"prefix":"10.1093","volume":"22","author":[{"given":"Weiwen","family":"Wang","sequence":"first","affiliation":[{"name":"Intelligent Data Center, School of Mathematics, Sun Yat-Sen University, Guangzhou, 510275, China"}]},{"given":"Xiwen","family":"Zhang","sequence":"additional","affiliation":[{"name":"Intelligent Data Center, School of Mathematics, Sun Yat-Sen University, Guangzhou, 510275, China"}]},{"given":"Dao-Qing","family":"Dai","sequence":"additional","affiliation":[{"name":"Intelligent Data Center, School of Mathematics, Sun Yat-Sen University, Guangzhou, 510275, China"}]}],"member":"286","published-online":{"date-parts":[[2021,4,5]]},"reference":[{"key":"2021090815240554800_ref1","first-page":"1","article-title":"More is better: recent progress in multi-omics data integration methods","volume":"Jun","author":"Huang","year":"2017","journal-title":"Front Genet"},{"issue":"2","key":"2021090815240554800_ref2","first-page":"325","article-title":"A review on machine learning principles for multi-view biological data integration","volume":"19","author":"Li","year":"2018","journal-title":"Brief Bioinform"},{"issue":"20","key":"2021090815240554800_ref3","doi-asserted-by":"crossref","first-page":"10546","DOI":"10.1093\/nar\/gky889","article-title":"Multi-omic and multi-view clustering algorithms: review and cancer benchmark","volume":"46","author":"Rappoport","year":"2018","journal-title":"Nucleic Acids Res"},{"issue":"16","key":"2021090815240554800_ref4","doi-asserted-by":"crossref","first-page":"2626","DOI":"10.1093\/bioinformatics\/bth294","article-title":"A statistical framework for genomic data fusion","volume":"20","author":"Lanckriet","year":"2004","journal-title":"Bioinformatics"},{"issue":"12","key":"2021090815240554800_ref5","doi-asserted-by":"crossref","first-page":"i268","DOI":"10.1093\/bioinformatics\/btv244","article-title":"Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery","volume":"31","author":"Speicher","year":"2016","journal-title":"Bioinformatics"},{"key":"2021090815240554800_ref6","article-title":"An interpretable multiple kernel learning approach for the discovery of integrative cancer subtypes","author":"Speicher","year":"2018"},{"issue":"3","key":"2021090815240554800_ref7","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1038\/nmeth.2810","article-title":"Similarity network fusion for aggregating data types on a genomic scale","volume":"11","author":"Wang","year":"2014","journal-title":"Nat Methods"},{"issue":"4","key":"2021090815240554800_ref8","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pone.0152792","article-title":"Identifying cancer subtypes from miRNA-TF-mRNA regulatory networks and expression data","volume":"11","author":"Xu","year":"2016","journal-title":"PLoS One"},{"issue":"18","key":"2021090815240554800_ref9","doi-asserted-by":"crossref","first-page":"3348","DOI":"10.1093\/bioinformatics\/btz058","article-title":"NEMO: cancer subtyping by integration of partial multi-omic data","volume":"35","author":"Rappoport","year":"2019","journal-title":"Bioinformatics"},{"issue":"5","key":"2021090815240554800_ref10","doi-asserted-by":"crossref","first-page":"1115","DOI":"10.1109\/TCBB.2016.2621769","article-title":"Cancer subtype discovery based on integrative model of multigenomic data","volume":"14","author":"Ge","year":"2017","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"24","key":"2021090815240554800_ref11","doi-asserted-by":"crossref","first-page":"3290","DOI":"10.1093\/bioinformatics\/bts595","article-title":"Bayesian correlated clustering to integrate multiple datasets","volume":"28","author":"Kirk","year":"2012","journal-title":"Bioinformatics"},{"issue":"20","key":"2021090815240554800_ref12","doi-asserted-by":"crossref","first-page":"2610","DOI":"10.1093\/bioinformatics\/btt425","article-title":"Bayesian consensus clustering","volume":"29","author":"Lock","year":"2013","journal-title":"Bioinformatics"},{"issue":"10","key":"2021090815240554800_ref13","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1002227","article-title":"Patient-specific data fusion defines prognostic cancer subtypes","volume":"7","author":"Yuan","year":"2011","journal-title":"PLoS Comput Biol"},{"issue":"10","key":"2021090815240554800_ref14","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1005781","article-title":"Clusternomics: integrative context-dependent clustering for heterogeneous datasets","volume":"13","author":"Gabasova","year":"2017","journal-title":"PLoS Comput Biol"},{"issue":"4","key":"2021090815240554800_ref15","doi-asserted-by":"crossref","first-page":"928","DOI":"10.1109\/TCBB.2014.2377729","article-title":"Integrative data analysis of multi-platform cancer data with a multimodal deep learning approach","volume":"12","author":"Liang","year":"2015","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"6","key":"2021090815240554800_ref16","doi-asserted-by":"crossref","first-page":"1248","DOI":"10.1158\/1078-0432.CCR-17-0853","article-title":"Deep learning\u2013based multi-omics integration robustly predicts survival in liver cancer","volume":"24","author":"Chaudhary","year":"2018","journal-title":"Clin Cancer Res"},{"issue":"22","key":"2021090815240554800_ref17","doi-asserted-by":"crossref","first-page":"2906","DOI":"10.1093\/bioinformatics\/btp543","article-title":"Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis","volume":"25","author":"Shen","year":"2009","journal-title":"Bioinformatics"},{"issue":"1","key":"2021090815240554800_ref18","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1214\/12-AOAS578","article-title":"Sparse integrative clustering of multiple omics data sets","volume":"7","author":"Shen","year":"2013","journal-title":"Ann Appl Stat"},{"issue":"11","key":"2021090815240554800_ref19","doi-asserted-by":"crossref","first-page":"4245","DOI":"10.1073\/pnas.1208949110","article-title":"Pattern discovery and cancer gene identification in integrated cancer genomic data","volume":"110","author":"Mo","year":"2013","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"1","key":"2021090815240554800_ref20","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1093\/biostatistics\/kxx017","article-title":"A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data","volume":"19","author":"Mo","year":"2018","journal-title":"Biostatistics"},{"issue":"19","key":"2021090815240554800_ref21","doi-asserted-by":"crossref","first-page":"9379","DOI":"10.1093\/nar\/gks725","article-title":"Discovery of multi-dimensional modules by integrative analysis of cancer genomic data","volume":"40","author":"Zhang","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2021090815240554800_ref22","first-page":"252","article-title":"Multi-view clustering via joint nonnegative matrix factorization","volume":"2013","author":"Liu","journal-title":"In: Proceedings of the 2013 SIAM International Conference on Data Mining"},{"issue":"1","key":"2021090815240554800_ref23","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1214\/12-AOAS597","article-title":"Joint and individual variation explained (JIVE) for integrated analysis of multiple data types","volume":"7","author":"Lock","year":"2013","journal-title":"Ann Appl Stat"},{"issue":"3","key":"2021090815240554800_ref24","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1093\/biostatistics\/kxw005","article-title":"Integrative clustering of high-dimensional data with joint and individual clusters","volume":"17","author":"Hellton","year":"2016","journal-title":"Biostatistics"},{"issue":"1","key":"2021090815240554800_ref25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/bioinformatics\/btw552","article-title":"A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data","volume":"32","author":"Yang","year":"2016","journal-title":"Bioinformatics"},{"issue":"13","key":"2021090815240554800_ref26","doi-asserted-by":"crossref","first-page":"6606","DOI":"10.1093\/nar\/gkz488","article-title":"Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization","volume":"47","author":"Zhang","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"6755","key":"2021090815240554800_ref27","doi-asserted-by":"crossref","first-page":"788","DOI":"10.1038\/44565","article-title":"Learning the parts of objects by non-negative matrix factorization","volume":"401","author":"Lee","year":"1999","journal-title":"Nature"},{"key":"2021090815240554800_ref28","article-title":"Evaluation of integrative clustering methods for the analysis of multi-omics data","volume":"Feb","author":"Chauvel","year":"2019","journal-title":"Brief Bioinform"},{"key":"2021090815240554800_ref29","first-page":"1813","article-title":"Efficient and robust feature selection via joint l$_2,1$-norms minimization","volume":"2","author":"Nie","year":"2010","journal-title":"24th Annual Conference on Neural Information Processing Systems"},{"key":"2021090815240554800_ref30","first-page":"1589","article-title":"L $_2,1$ -norm regularized discriminative feature selection for unsupervised learning. Proceedings of the twenty-second international joint conference on","volume":"2","author":"Yang","year":"2011","journal-title":"Artificial Intelligence"},{"issue":"4","key":"2021090815240554800_ref31","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1109\/TBDATA.2017.2735991","article-title":"Low-rank graph-regularized structured sparse regression for identifying genetic biomarkers","volume":"3","author":"Zhu","year":"2017","journal-title":"IEEE Trans Big Data"},{"issue":"1","key":"2021090815240554800_ref32","doi-asserted-by":"crossref","first-page":"3108","DOI":"10.1038\/s41467-018-05469-x","article-title":"Network enhancement as a general method to denoise weighted biological network","volume":"9","author":"Wang","year":"2018","journal-title":"Nat Commun"},{"key":"2021090815240554800_ref33","first-page":"2399","article-title":"Manifold regularization: a geometric framework for learning from labeled and unlabeled examples","volume":"7","author":"Belkin","year":"2006","journal-title":"J Mach Learn Res"},{"issue":"8","key":"2021090815240554800_ref34","doi-asserted-by":"crossref","first-page":"1548","DOI":"10.1109\/TPAMI.2010.231","article-title":"Graph regularized nonnegative matrix factorization for data representation","volume":"33","author":"Cai","year":"2011","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"1","key":"2021090815240554800_ref35","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1109\/TPAMI.2014.2343973","article-title":"Data fusion by matrix factorization","volume":"37","author":"\u017eitnik","year":"2015","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"3","key":"2021090815240554800_ref36","doi-asserted-by":"crossref","first-page":"974","DOI":"10.1109\/TCBB.2017.2665557","article-title":"Regularized non-negative matrix factorization for identifying differentially expressed genes and clustering samples: a survey","volume":"15","author":"Liu","year":"2018","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"10","key":"2021090815240554800_ref37","doi-asserted-by":"crossref","first-page":"2756","DOI":"10.1162\/neco.2007.19.10.2756","article-title":"Projected gradient methods for nonnegative matrix factorization","volume":"19","author":"Lin","year":"2007","journal-title":"Neural Comput"},{"issue":"1","key":"2021090815240554800_ref38","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1137\/080716542","article-title":"A fast iterative shrinkage thresholding algorithm for linear inverse problems","volume":"2","author":"Beck","year":"2009","journal-title":"SIAM J Imaging Sci"},{"issue":"22","key":"2021090815240554800_ref39","doi-asserted-by":"crossref","first-page":"3206","DOI":"10.1093\/bioinformatics\/btr511","article-title":"survcomp: an R\/Bioconductor package for performance assessment and comparison of survival models","volume":"27","author":"Schr\u00f6der","year":"2011","journal-title":"Bioinformatics"},{"issue":"6","key":"2021090815240554800_ref40","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1016\/j.cels.2015.12.004","article-title":"The molecular signatures database hallmark gene set collection","volume":"1","author":"Liberzon","year":"2015","journal-title":"Cell Syst"},{"issue":"7","key":"2021090815240554800_ref41","doi-asserted-by":"crossref","first-page":"e47","DOI":"10.1093\/nar\/gkv007","article-title":"Limma powers differential expression analyses for RNA-sequencing and microarray studies","volume":"43","author":"Ritchie","year":"2015","journal-title":"Nucleic Acids Res"},{"issue":"8","key":"2021090815240554800_ref42","doi-asserted-by":"crossref","first-page":"1091","DOI":"10.1093\/bioinformatics\/btp101","article-title":"ClueGO: a cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks","volume":"25","author":"Bindea","year":"2009","journal-title":"Bioinformatics"},{"issue":"3","key":"2021090815240554800_ref43","doi-asserted-by":"crossref","first-page":"792","DOI":"10.1109\/TCBB.2018.2874238","article-title":"Identifying key genes of liver cancer by networking of multiple data sets","volume":"16","author":"Deng","year":"2019","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"1","key":"2021090815240554800_ref44","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: Kyoto encyclopedia of genes and genomes","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2021090815240554800_ref45","doi-asserted-by":"crossref","first-page":"80","DOI":"10.33590\/emjhepatol\/10311638","article-title":"Targeting the relaxin pathway for liver disease treatment","volume":"6","author":"Bennett","year":"2018","journal-title":"EMJ Hepatol"},{"issue":"24","key":"2021090815240554800_ref46","doi-asserted-by":"crossref","first-page":"10202","DOI":"10.1158\/0008-5472.CAN-10-2607","article-title":"A unique metastasis gene signature enables prediction of tumor relapse in early-stage hepatocellular carcinoma patients","volume":"70","author":"Roessler","year":"2010","journal-title":"Cancer Res"},{"issue":"1","key":"2021090815240554800_ref47","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1016\/j.cell.2020.05.043","article-title":"Integrative proteomic characterization of human lung adenocarcinoma","volume":"182","author":"Xu","year":"2020","journal-title":"Cell"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/5\/bbab057\/40259436\/bbab057.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/5\/bbab057\/40259436\/bbab057.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,15]],"date-time":"2022-12-15T00:15:20Z","timestamp":1671063320000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbab057\/6210063"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,5]]},"references-count":47,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2021,9,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbab057","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,9]]},"published":{"date-parts":[[2021,4,5]]},"article-number":"bbab057"}}