{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T20:08:02Z","timestamp":1767989282693,"version":"3.49.0"},"reference-count":29,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2024,6,10]],"date-time":"2024-06-10T00:00:00Z","timestamp":1717977600000},"content-version":"vor","delay-in-days":9,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62172028"],"award-info":[{"award-number":["62172028"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61772197"],"award-info":[{"award-number":["61772197"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,6,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The identification of cancer subtypes plays a crucial role in cancer research and treatment. With the rapid development of high-throughput sequencing technologies, there has been an exponential accumulation of cancer multi-omics data. Integrating multi-omics data has emerged as a cost-effective and efficient strategy for cancer subtyping. While current methods primarily rely on genomics data, protein expression data offers a closer representation of phenotype. Therefore, integrating protein expression data holds promise for enhancing subtyping accuracy. However, the scarcity of protein expression data compared to genomics data presents a challenge in its direct incorporation into existing methods. Moreover, striking a balance between omics-specific learning and cross-omics learning remains a prevalent challenge in current multi-omics integration methods.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We introduce Subtype-MGTP, a novel cancer subtyping framework based on the translation of Multiple Genomics To Proteomics. Subtype-MGTP comprises two modules: a translation module, which leverages available protein data to translate multi-type genomics data into predicted protein expression data, and an improved deep subspace clustering module, which integrates contrastive learning to cluster the predicted protein data, yielding refined subtyping results. Extensive experiments conducted on benchmark datasets demonstrate that Subtype-MGTP outperforms nine state-of-the-art cancer subtyping methods. The interpretability of clustering results is further supported by the clinical and survival analysis. Subtype-MGTP also exhibits strong robustness against varying rates of missing protein data and demonstrates distinct advantages in integrating multi-omics data with imbalanced multi-omics data.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The code and results are available at https:\/\/github.com\/kybinn\/Subtype-MGTP.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae360","type":"journal-article","created":{"date-parts":[[2024,6,7]],"date-time":"2024-06-07T21:47:24Z","timestamp":1717796844000},"source":"Crossref","is-referenced-by-count":7,"title":["Subtype-MGTP: a cancer subtype identification framework based on multi-omics translation"],"prefix":"10.1093","volume":"40","author":[{"given":"Minzhu","family":"Xie","sequence":"first","affiliation":[{"name":"College of Information Science and Engineering, Hunan Normal University , Changsha 410081, China"},{"name":"Key Laboratory of Computing and Stochastic Mathematics (Ministry of Education) , Changsha 410081, China"},{"name":"College of Mathematics and Statistics, Hunan Normal University , Changsha 410081, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-6551-0434","authenticated-orcid":false,"given":"Yabin","family":"Kuang","sequence":"additional","affiliation":[{"name":"College of Information Science and Engineering, Hunan Normal University , Changsha 410081, China"}]},{"given":"Mengyun","family":"Song","sequence":"additional","affiliation":[{"name":"College of Information Science and Engineering, Hunan Normal University , Changsha 410081, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7636-5709","authenticated-orcid":false,"given":"Ergude","family":"Bao","sequence":"additional","affiliation":[{"name":"School of Software Engineering, Beijing Jiaotong University , Beijing 100044, China"}]}],"member":"286","published-online":{"date-parts":[[2024,6,10]]},"reference":[{"key":"2024062407563713200_btae360-B1","doi-asserted-by":"crossref","first-page":"674","DOI":"10.1038\/nrclinonc.2016.66","article-title":"Triple-negative breast cancer: challenges and opportunities of a heterogeneous disease","volume":"13","author":"Bianchini","year":"2016","journal-title":"Nat Rev Clin Oncol"},{"key":"2024062407563713200_btae360-B2","doi-asserted-by":"crossref","first-page":"bbad282","DOI":"10.1093\/bib\/bbad282","article-title":"Deep multi-view contrastive learning for cancer subtype identification","volume":"24","author":"Chen","year":"2023","journal-title":"Brief Bioinform"},{"key":"2024062407563713200_btae360-B3","doi-asserted-by":"crossref","first-page":"107378","DOI":"10.1016\/j.isci.2023.107378","article-title":"Mocss: multi-omics data clustering and cancer subtyping via shared and specific representation learning","volume":"26","author":"Chen","year":"2023","journal-title":"Iscience"},{"key":"2024062407563713200_btae360-B4","first-page":"29","author":"Ding","year":"2004"},{"key":"2024062407563713200_btae360-B5","doi-asserted-by":"crossref","first-page":"2765","DOI":"10.1109\/TPAMI.2013.57","article-title":"Sparse subspace clustering: algorithm, theory, and applications","volume":"35","author":"Elhamifar","year":"2013","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2024062407563713200_btae360-B6","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1007\/BF01908075","article-title":"Comparing partitions","volume":"2","author":"Hubert","year":"1985","journal-title":"J Classif"},{"key":"2024062407563713200_btae360-B7","volume-title":"Advances in Neural Information Processing Systems","author":"Ji","year":"2017"},{"key":"2024062407563713200_btae360-B8","doi-asserted-by":"crossref","first-page":"61","DOI":"10.4137\/BIC.S9455","article-title":"Molecular profiling for breast cancer: a comprehensive review","volume":"5","author":"Kittaneh","year":"2013","journal-title":"Biomark Cancer"},{"key":"2024062407563713200_btae360-B9","doi-asserted-by":"crossref","first-page":"1046","DOI":"10.1038\/nmeth.2650","article-title":"TCPA: a resource for cancer functional proteomics data","volume":"10","author":"Li","year":"2013","journal-title":"Nat Methods"},{"key":"2024062407563713200_btae360-B10","doi-asserted-by":"crossref","first-page":"1795","DOI":"10.1093\/bib\/bby051","article-title":"Review of applications of high-throughput sequencing in personalized medicine: barriers and facilitators of future progress in research and clinical application","volume":"20","author":"Lightbody","year":"2019","journal-title":"Brief Bioinform"},{"key":"2024062407563713200_btae360-B11","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1002\/9780470479216.corpsy0491","article-title":"Kruskal\u2013Wallis test","author":"McKight","year":"2010","journal-title":"Corsini Encyclopedia Psychol"},{"key":"2024062407563713200_btae360-B12","article-title":"On spectral clustering: analysis and an algorithm","volume":"14","author":"Ng","year":"2001","journal-title":"Advances in Neural Information Processing Systems,"},{"key":"2024062407563713200_btae360-B13","doi-asserted-by":"crossref","first-page":"2025","DOI":"10.1101\/gr.215129.116","article-title":"A novel approach for data integration and disease subtyping","volume":"27","author":"Nguyen","year":"2017","journal-title":"Genome Res"},{"key":"2024062407563713200_btae360-B14","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1145\/1007730.1007731","article-title":"Subspace clustering for high dimensional data: a review","volume":"6","author":"Parsons","year":"2004","journal-title":"SIGKDD Explor Newsl"},{"key":"2024062407563713200_btae360-B15","doi-asserted-by":"crossref","first-page":"10546","DOI":"10.1093\/nar\/gky889","article-title":"Multi-omic and multi-view clustering algorithms: review and cancer benchmark","volume":"46","author":"Rappoport","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024062407563713200_btae360-B16","doi-asserted-by":"crossref","first-page":"3348","DOI":"10.1093\/bioinformatics\/btz058","article-title":"NEMO: cancer subtyping by integration of partial multi-omic data","volume":"35","author":"Rappoport","year":"2019","journal-title":"Bioinformatics"},{"key":"2024062407563713200_btae360-B17","doi-asserted-by":"crossref","first-page":"2906","DOI":"10.1093\/bioinformatics\/btp543","article-title":"Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis","volume":"25","author":"Shen","year":"2009","journal-title":"Bioinformatics"},{"key":"2024062407563713200_btae360-B18","doi-asserted-by":"crossref","first-page":"516","DOI":"10.1038\/ncponc0908","article-title":"Origins of breast cancer subtypes and therapeutic implications","volume":"4","author":"Sims","year":"2007","journal-title":"Nat Rev Clin Oncol"},{"key":"2024062407563713200_btae360-B19","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1093\/bib\/bbx167","article-title":"Multi-omics integration\u2014a comparison of unsupervised clustering methodologies","volume":"20","author":"Tini","year":"2019","journal-title":"Brief Bioinform"},{"key":"2024062407563713200_btae360-B20","first-page":"2579","article-title":"Visualizing data using t-sne","volume":"9","author":"Van der Maaten","year":"2008","journal-title":"J Mach Learn Res"},{"key":"2024062407563713200_btae360-B21","first-page":"235","article-title":"Clustering stability: an overview","volume":"2","author":"Von Luxburg","year":"2010","journal-title":"Found Trends Mach Learn"},{"key":"2024062407563713200_btae360-B22","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1038\/nmeth.2810","article-title":"Similarity network fusion for aggregating data types on a genomic scale","volume":"11","author":"Wang","year":"2014","journal-title":"Nat Methods"},{"key":"2024062407563713200_btae360-B23","doi-asserted-by":"crossref","first-page":"Article28","DOI":"10.2202\/1544-6115.1470","article-title":"Extensions of sparse canonical correlation analysis with applications to genomic data","volume":"8","author":"Witten","year":"2009","journal-title":"Stat Appl Genet Mol Biol"},{"key":"2024062407563713200_btae360-B24","doi-asserted-by":"crossref","first-page":"1022","DOI":"10.1186\/s12864-015-2223-8","article-title":"Fast dimension reduction and integrative clustering of multi-omics data using low-rank approximation: application to cancer molecular classification","volume":"16","author":"Wu","year":"2015","journal-title":"BMC Genomics"},{"key":"2024062407563713200_btae360-B25","doi-asserted-by":"crossref","first-page":"2231","DOI":"10.1093\/bioinformatics\/btab109","article-title":"Subtype-GAN: a deep learning approach for integrative cancer subtyping of multi-omics data","volume":"37","author":"Yang","year":"2021","journal-title":"Bioinformatics"},{"key":"2024062407563713200_btae360-B26","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1089\/omi.2011.0118","article-title":"clusterprofiler: an r package for comparing biological themes among gene clusters","volume":"16","author":"Yu","year":"2012","journal-title":"OMICS"},{"key":"2024062407563713200_btae360-B27","doi-asserted-by":"crossref","first-page":"1398","DOI":"10.1073\/pnas.1205299110","article-title":"Genetic heterogeneity of diffuse large B-cell lymphoma","volume":"110","author":"Zhang","year":"2013","journal-title":"Proc Natl Acad Sci USA"},{"key":"2024062407563713200_btae360-B28","doi-asserted-by":"crossref","first-page":"bbad025","DOI":"10.1093\/bib\/bbad025","article-title":"Subtype-DCC: decoupled contrastive clustering method for cancer subtype identification based on multi-omics data","volume":"24","author":"Zhao","year":"2023","journal-title":"Brief Bioinform"},{"key":"2024062407563713200_btae360-B29","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1093\/bib\/bby026","article-title":"Molecular subtyping of cancer: current status and moving toward clinical applications","volume":"20","author":"Zhao","year":"2019","journal-title":"Brief Bioinform"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae360\/58185191\/btae360.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/6\/btae360\/58310108\/btae360.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/6\/btae360\/58310108\/btae360.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,24]],"date-time":"2024-06-24T07:57:08Z","timestamp":1719215828000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae360\/7690747"}},"subtitle":[],"editor":[{"given":"Can","family":"Alkan","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,6]]},"references-count":29,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2024,6,3]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae360","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,6]]},"published":{"date-parts":[[2024,6]]},"article-number":"btae360"}}