{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,11]],"date-time":"2025-12-11T11:56:48Z","timestamp":1765454208802,"version":"3.46.0"},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2025,12,1]],"date-time":"2025-12-01T00:00:00Z","timestamp":1764547200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Single-cell multi-omics technologies enable the simultaneous profiling of gene expression and chromatin accessibility, providing complementary insights into cellular identity and gene regulatory mechanisms. However, integrating paired scRNA-seq and scATAC-seq data (i.e. profiles from the same single cell) remains challenging due to inherent sparsity, technical noise, and the limited availability of high-quality paired measurements. In contrast, large-scale unpaired scRNA-seq datasets often exhibit robust and biologically meaningful cell cluster structures.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We introduce Guided Co-clustering Transfer (GuidedCoC), a novel framework that transfers structural knowledge from unpaired scRNA-seq source data to improve both cell clustering and feature alignment in paired scRNA-seq\/scATAC-seq target data. GuidedCoC jointly co-cluster cells and features across modalities and domains via a unified information-theoretic objective, aligning gene expression modules with regulatory elements while implicitly performing cross-modal dimensionality reduction to reduce noise. Additionally, it automatically aligns cell populations across unpaired and paired datasets without requiring explicit annotations. Extensive experiments on multiple benchmark datasets demonstrate that GuidedCoC achieves superior clustering accuracy and biological interpretability compared to existing methods. These results highlight the promise of structure-guided transfer learning for robust, scalable, and interpretable integration of single-cell multi-omics data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>GuidedCoC is available as open-source code at https:\/\/github.com\/No-AgCl\/GuidedCoC.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf639","type":"journal-article","created":{"date-parts":[[2025,11,24]],"date-time":"2025-11-24T12:34:48Z","timestamp":1763987688000},"source":"Crossref","is-referenced-by-count":0,"title":["Guided co-clustering transfer across unpaired and paired single-cell multi-omics data"],"prefix":"10.1093","volume":"41","author":[{"given":"Hongyao","family":"Li","sequence":"first","affiliation":[{"name":"Institute of Mathematical Sciences, ShanghaiTech University , Shanghai, 201210,","place":["China"]}]},{"given":"Yunrui","family":"Liu","sequence":"additional","affiliation":[{"name":"Institute of Mathematical Sciences, ShanghaiTech University , Shanghai, 201210,","place":["China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0724-5313","authenticated-orcid":false,"given":"Pengcheng","family":"Zeng","sequence":"additional","affiliation":[{"name":"Institute of Mathematical Sciences, ShanghaiTech University , Shanghai, 201210,","place":["China"]}]}],"member":"286","published-online":{"date-parts":[[2025,12,1]]},"reference":[{"key":"2025121106422528400_btaf639-B1","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1038\/s41590-018-0276-y","article-title":"Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage","volume":"20","author":"Aran","year":"2019","journal-title":"Nat Immunol"},{"key":"2025121106422528400_btaf639-B2","doi-asserted-by":"crossref","first-page":"e8124","DOI":"10.15252\/msb.20178124","article-title":"Multi-omics factor analysis\u2014a framework for unsupervised integration of multi-omics data sets","volume":"14","author":"Argelaguet","year":"2018","journal-title":"Mol Syst Biol"},{"key":"2025121106422528400_btaf639-B3","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1186\/s13059-020-02015-1","article-title":"Mofa+: a statistical framework for comprehensive integration of multi-modal single-cell data","volume":"21","author":"Argelaguet","year":"2020","journal-title":"Genome Biol"},{"key":"2025121106422528400_btaf639-B4","doi-asserted-by":"publisher","first-page":"1222","DOI":"10.1038\/s41592-023-01909-9","article-title":"Multivi: deep generative model for the integration of multimodal data","volume":"20","author":"Ashuach","year":"2023","journal-title":"Nat Methods"},{"key":"2025121106422528400_btaf639-B5","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1038\/nbt.4096","article-title":"Integrating single-cell transcriptomic data across different conditions, technologies, and species","volume":"36","author":"Butler","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2025121106422528400_btaf639-B6","doi-asserted-by":"crossref","first-page":"317","DOI":"10.3389\/fgene.2019.00317","article-title":"Single-cell RNA-seq technologies and related computational data analysis","volume":"10","author":"Chen","year":"2019","journal-title":"Front Genet"},{"year":"2008","author":"Dai","key":"2025121106422528400_btaf639-B7"},{"key":"2025121106422528400_btaf639-B8","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1089\/cmb.2021.0446","article-title":"SCOT: single-cell multi-omics alignment with optimal transport","volume":"29","author":"Demetci","year":"2022","journal-title":"J Comput Biol"},{"first-page":"89","year":"2003","author":"Dhillon","key":"2025121106422528400_btaf639-B9"},{"key":"2025121106422528400_btaf639-B10","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1186\/s13059-021-02556-z","article-title":"Cobolt: integrative analysis of multimodal single-cell sequencing data","volume":"22","author":"Gong","year":"2021","journal-title":"Genome Biol"},{"key":"2025121106422528400_btaf639-B11","doi-asserted-by":"publisher","first-page":"421","DOI":"10.1038\/nbt.4091","article-title":"Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors","volume":"36","author":"Haghverdi","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2025121106422528400_btaf639-B12","doi-asserted-by":"crossref","first-page":"3573","DOI":"10.1016\/j.cell.2021.04.048","article-title":"Integrated analysis of multimodal single-cell data","volume":"184","author":"Hao","year":"2021","journal-title":"Cell"},{"key":"2025121106422528400_btaf639-B13","doi-asserted-by":"publisher","first-page":"293","DOI":"10.1038\/s41587-023-01767-y","article-title":"Dictionary learning for integrative, multimodal and scalable single-cell analysis","volume":"42","author":"Hao","year":"2024","journal-title":"Nat Biotechnol"},{"key":"2025121106422528400_btaf639-B14","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1038\/nprot.2008.211","article-title":"Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources","volume":"4","author":"Huang","year":"2009","journal-title":"Nat Protoc"},{"key":"2025121106422528400_btaf639-B15","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/nar\/gkn923","article-title":"Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists","volume":"37","author":"Huang","year":"2009","journal-title":"Nucleic Acids Res"},{"key":"2025121106422528400_btaf639-B16","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1186\/s13059-020-1926-6","article-title":"Eleven grand challenges in single-cell data science","volume":"21","author":"L\u00e4hnemann","year":"2020","journal-title":"Genome Biol"},{"first-page":"162","year":"2022","author":"Lance","key":"2025121106422528400_btaf639-B17"},{"key":"2025121106422528400_btaf639-B18","doi-asserted-by":"publisher","first-page":"244","DOI":"10.1186\/s13059-023-03073-x","article-title":"Benchmarking algorithms for joint integration of unpaired and paired single-cell RNA-seq and ATAC-seq data","volume":"24","author":"Lee","year":"2023","journal-title":"Genome Biol"},{"key":"2025121106422528400_btaf639-B19","doi-asserted-by":"publisher","first-page":"703","DOI":"10.1038\/s41587-022-01284-4","article-title":"Scjoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning","volume":"40","author":"Lin","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2025121106422528400_btaf639-B20","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1038\/s41587-021-01001-7","article-title":"Mapping single-cell data to reference atlases by transfer learning","volume":"40","author":"Lotfollahi","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2025121106422528400_btaf639-B21","doi-asserted-by":"publisher","first-page":"e2313719121","DOI":"10.1073\/pnas.2313719121","article-title":"Principled and interpretable alignability testing and integration of single-cell data","volume":"121","author":"Ma","year":"2024","journal-title":"Proc Natl Acad Sci USA"},{"key":"2025121106422528400_btaf639-B22","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1016\/j.tig.2016.12.003","article-title":"Single-cell multiomics: multiple measurements from single cells","volume":"33","author":"Macaulay","year":"2017","journal-title":"Trends Genet"},{"key":"2025121106422528400_btaf639-B23","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511809071","volume-title":"Introduction to Information Retrieval","author":"Manning","year":"2008"},{"key":"2025121106422528400_btaf639-B24","doi-asserted-by":"crossref","first-page":"858","DOI":"10.1016\/j.molcel.2018.06.044","article-title":"Cicero predicts cis-regulatory dna interactions from single-cell chromatin accessibility data","volume":"71","author":"Pliner","year":"2018","journal-title":"Mol Cell"},{"key":"2025121106422528400_btaf639-B25","doi-asserted-by":"crossref","first-page":"e27041","DOI":"10.7554\/eLife.27041","article-title":"The human cell atlas","volume":"6","author":"Regev","year":"2017","journal-title":"eLife"},{"key":"2025121106422528400_btaf639-B26","doi-asserted-by":"crossref","first-page":"975","DOI":"10.1038\/nmeth.4401","article-title":"Chromvar: inferring transcription-factor-associated accessibility from single-cell epigenomic data","volume":"14","author":"Schep","year":"2017","journal-title":"Nat Methods"},{"key":"2025121106422528400_btaf639-B27","doi-asserted-by":"crossref","first-page":"1888","DOI":"10.1016\/j.cell.2019.05.031","article-title":"Comprehensive integration of single-cell data","volume":"177","author":"Stuart","year":"2019","journal-title":"Cell"},{"key":"2025121106422528400_btaf639-B28","doi-asserted-by":"publisher","first-page":"1333","DOI":"10.1038\/s41592-021-01282-5","article-title":"Single-cell chromatin state analysis with Signac","volume":"18","author":"Stuart","year":"2021","journal-title":"Nat Methods"},{"key":"2025121106422528400_btaf639-B29","doi-asserted-by":"crossref","first-page":"590","DOI":"10.1038\/s41586-020-2496-1","article-title":"A single-cell transcriptomic atlas characterizes ageing tissues in the mouse","volume":"583","author":"Tabula Muris Consortium","year":"2020","journal-title":"Nature"},{"key":"2025121106422528400_btaf639-B30","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1016\/j.cels.2019.06.004","article-title":"Singlecellnet: a computational tool to classify single cell RNA-seq data across platforms and across species","volume":"9","author":"Tan","year":"2019","journal-title":"Cell Syst"},{"key":"2025121106422528400_btaf639-B31","doi-asserted-by":"publisher","first-page":"5233","DOI":"10.1038\/s41598-019-41695-z","article-title":"From louvain to leiden: guaranteeing well-connected communities","volume":"9","author":"Traag","year":"2019","journal-title":"Sci Rep"},{"key":"2025121106422528400_btaf639-B32","doi-asserted-by":"crossref","first-page":"e2023070118","DOI":"10.1073\/pnas.2023070118","article-title":"Babel enables cross-modality translation between multi-omic profiles at single-cell resolution","volume":"118","author":"Wu","year":"2021","journal-title":"Proc Nat Acad Sci USA"},{"key":"2025121106422528400_btaf639-B33","doi-asserted-by":"publisher","first-page":"btad505","DOI":"10.1093\/bioinformatics\/btad505","article-title":"Scncl: transferring labels from scrna-seq to scatac-seq data with neighborhood contrastive regularization","volume":"39","author":"Yan","year":"2023","journal-title":"Bioinformatics"},{"key":"2025121106422528400_btaf639-B35","doi-asserted-by":"publisher","first-page":"e1009064","DOI":"10.1371\/journal.pcbi.1009064","article-title":"Couplecoc+: an information-theoretic co-clustering-based transfer learning framework for the integrative analysis of single-cell genomic data","volume":"17","author":"Zeng","year":"2021","journal-title":"PLoS Comput Biol"},{"key":"2025121106422528400_btaf639-B299751588","doi-asserted-by":"publisher","first-page":"bbaa347","DOI":"10.1093\/bib\/bbaa347","article-title":"Coupled co-clustering-based unsupervised transfer learning for the integrative analysis of single-cell genomics data","volume":"22","author":"Zeng","year":"2021","journal-title":"Briefings in Bioinformatics"},{"key":"2025121106422528400_btaf639-B36","doi-asserted-by":"publisher","first-page":"384","DOI":"10.1038\/s41467-023-36066-2","article-title":"scMoMaT jointly performs single cell mosaic integration and multi-modal bio-marker detection","volume":"14","author":"Zhang","year":"2023","journal-title":"Nat Commun"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf639\/65667524\/btaf639.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/12\/btaf639\/65667524\/btaf639.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/12\/btaf639\/65667524\/btaf639.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,11]],"date-time":"2025-12-11T11:42:33Z","timestamp":1765453353000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf639\/8362267"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,1]]},"references-count":36,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2025,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf639","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2025,12]]},"published":{"date-parts":[[2025,12,1]]},"article-number":"btaf639"}}