{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T23:26:59Z","timestamp":1773790019863,"version":"3.50.1"},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2025,8,26]],"date-time":"2025-08-26T00:00:00Z","timestamp":1756166400000},"content-version":"vor","delay-in-days":56,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,7,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>The rapid advancement of single-cell omics technologies such as single-cell RNA sequencing and single-cell assay for transposase-accessible chromatin with high throughput sequencing has transformed our understanding of cellular heterogeneity and regulatory mechanisms. However, integrating these data types remains challenging due to distributional discrepancies and distinct feature spaces. To address this, we present a novel single-cell Contrastive INtegration framework (sCIN) that integrates different omics modalities into a shared low-dimensional latent space. sCIN uses modality-specific encoders and contrastive learning to generate latent representations for each modality, aligning cells across modalities and removing technology-specific biases. The framework was designed to rigorously prevent data leakage between training and testing, and was extensively evaluated on three real-world paired datasets namely simultaneous high-throughput ATAC and RNA expression with sequencing, 10X PBMC (10k version), and cellular indexing of transcriptomes and epitopes, and one unpaired dataset of gene expression and chromatin accessibility. Paired datasets refer to multi-omics data generated using technologies capable of capturing different omics features from the same cell population while unpaired datasets are measured from different cell populations from a tissue. Results on paired and unpaired datasets show that sCIN outperforms state-of-the-art models, including scGLUE, scBridge, sciCAN, Con-AAE, Harmony, and MOFA+, across multiple metrics: average silhouette width for clustering quality, Recall@k, cell type@k, cell type accuracy, and median rank for integration quality. Moreover, sCIN was evaluated on simulated unpaired datasets derived from paired data, demonstrating its ability to leverage available biological information for effective multimodal integration. In summary, sCIN reliably integrates omics modalities while preserving biological meaning in both paired and unpaired settings.<\/jats:p>","DOI":"10.1093\/bib\/bbaf411","type":"journal-article","created":{"date-parts":[[2025,8,26]],"date-time":"2025-08-26T12:12:14Z","timestamp":1756210334000},"source":"Crossref","is-referenced-by-count":2,"title":["sCIN: a contrastive learning framework for single-cell multi-omics data integration"],"prefix":"10.1093","volume":"26","author":[{"given":"Amir","family":"Ebrahimi","sequence":"first","affiliation":[{"name":"Department of Biotechnology, College of Science, University of Tehran , Ghods 37, Tehran , 1417763135,","place":["Iran"]}]},{"given":"Alireza Fotuhi","family":"Siahpirani","sequence":"additional","affiliation":[{"name":"Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran , Ghods 37, Tehran , 1417763135,","place":["Iran"]}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-8771-2308","authenticated-orcid":false,"given":"Hesam","family":"Montazeri","sequence":"additional","affiliation":[{"name":"Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran , Ghods 37, Tehran , 1417763135,","place":["Iran"]}]}],"member":"286","published-online":{"date-parts":[[2025,8,26]]},"reference":[{"key":"2025082608120596500_ref1","doi-asserted-by":"publisher","first-page":"176","DOI":"10.1126\/science.aam8999","article-title":"Single-cell profiling of the developing mouse brain and spinal cord with split-pool barcoding","volume":"360","author":"Rosenberg","year":"2018","journal-title":"Science"},{"key":"2025082608120596500_ref2","doi-asserted-by":"publisher","first-page":"1309","DOI":"10.1016\/j.cell.2018.06.052","article-title":"A single-cell atlas of in vivo mammalian chromatin accessibility","volume":"174","author":"Cusanovich","year":"2018","journal-title":"Cell"},{"key":"2025082608120596500_ref3","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1038\/s41592-019-0330-1","article-title":"Methods to study RNA-protein interactions","volume":"16","author":"Ramanathan","year":"2019","journal-title":"Nat Methods"},{"key":"2025082608120596500_ref4","doi-asserted-by":"publisher","first-page":"70","DOI":"10.1038\/nbt.4038","article-title":"Integrative single-cell analysis of transcriptional and epigenetic states in the human adult brain","volume":"36","author":"Lake","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2025082608120596500_ref5","doi-asserted-by":"publisher","first-page":"1007","DOI":"10.1016\/j.tibtech.2020.02.013","article-title":"Integrative methods and practical challenges for single-cell multi-omics","volume":"38","author":"Ma","year":"2020","journal-title":"Trends Biotechnol"},{"key":"2025082608120596500_ref6","doi-asserted-by":"publisher","first-page":"600","DOI":"10.1126\/science.aan3351","article-title":"Single-cell methylomes identify neuronal subtypes and regulatory elements in mammalian cortex","volume":"357","author":"Luo","year":"2017","journal-title":"Science"},{"key":"2025082608120596500_ref7","doi-asserted-by":"publisher","first-page":"428","DOI":"10.1038\/nbt.4112","article-title":"Highly scalable generation of DNA methylation profiles in single cells","volume":"36","author":"Mulqueen","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2025082608120596500_ref8","doi-asserted-by":"publisher","first-page":"1103","DOI":"10.1016\/j.cell.2020.09.056","article-title":"Chromatin potential identified by shared single-cell profiling of RNA and chromatin","volume":"183","author":"Ma","year":"2020","journal-title":"Cell"},{"key":"2025082608120596500_ref9","doi-asserted-by":"publisher","first-page":"1452","DOI":"10.1038\/s41587-019-0290-0","article-title":"High-throughput sequencing of the transcriptome and chromatin accessibility in the same cell","volume":"37","author":"Chen","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2025082608120596500_ref10","doi-asserted-by":"publisher","first-page":"865","DOI":"10.1038\/nmeth.4380","article-title":"Simultaneous epitope and transcriptome measurement in single cells","volume":"14","author":"Stoeckius","year":"2017","journal-title":"Nat Methods"},{"key":"2025082608120596500_ref11","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1093\/bib\/bbac317","article-title":"Learning discriminative and structural samples for rare cell types with deep generative model","volume":"23","author":"Wang","year":"2022","journal-title":"Brief Bioinform"},{"key":"2025082608120596500_ref12","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1093\/bib\/bbac068","article-title":"Learning deep features and topological structure of cells for clustering of scRNA-sequencing data","volume":"23","author":"Wang","year":"2022","journal-title":"Brief Bioinform"},{"key":"2025082608120596500_ref13","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1093\/bib\/bbac585","article-title":"scGMAAE: Gaussian mixture adversarial autoencoders for diversification analysis of scRNA-seq data","volume":"24","author":"Wang","year":"2023","journal-title":"Brief Bioinform"},{"key":"2025082608120596500_ref14","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1186\/s12859-015-0857-9","article-title":"Methods for the integration of multi-omics data: mathematical aspects","volume":"17","author":"Bersanelli","year":"2016","journal-title":"BMC Bioinformatics"},{"key":"2025082608120596500_ref15","doi-asserted-by":"publisher","first-page":"695","DOI":"10.1038\/s41580-023-00615-w","article-title":"The technological landscape and applications of single-cell multi-omics","volume":"24","author":"Baysoy","year":"2023","journal-title":"Nat Rev Mol Cell Biol"},{"key":"2025082608120596500_ref16","first-page":"1247","volume-title":"ICML","author":"Andrew","year":"2013"},{"key":"2025082608120596500_ref17","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1093\/biomet\/28.3-4.321","article-title":"Relations between two sets of variates","volume":"28","author":"Hotelling","year":"1936","journal-title":"Biometrika"},{"key":"2025082608120596500_ref18","doi-asserted-by":"publisher","first-page":"272","DOI":"10.1038\/s41592-020-01050-x","article-title":"Joint probabilistic modeling of single-cell multi-omic data with totalVI","volume":"18","author":"Gayoso","year":"2021","journal-title":"Nat Methods"},{"key":"2025082608120596500_ref19","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1186\/s13059-022-02706-x","article-title":"scDART: integrating unmatched scRNA-seq and scATAC-seq data and learning cross-modality relationship simultaneously","volume":"23","author":"Zhang","year":"2022","journal-title":"Genome Biol"},{"key":"2025082608120596500_ref20","doi-asserted-by":"publisher","first-page":"1458","DOI":"10.1038\/s41587-022-01284-4","article-title":"Multi-omics single-cell data integration and regulatory inference with graph-linked embedding","volume":"40","author":"Cao","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2025082608120596500_ref21","doi-asserted-by":"publisher","first-page":"1888","DOI":"10.1016\/j.cell.2019.05.031","article-title":"Comprehensive integration of single-cell data","volume":"177","author":"Stuart","year":"2019","journal-title":"Cell"},{"key":"2025082608120596500_ref22","doi-asserted-by":"publisher","first-page":"e8124","DOI":"10.15252\/msb.20178124","article-title":"Multi-omics factor analysis-a framework for unsupervised integration of multi-omics data sets","volume":"14","author":"Argelaguet","year":"2018","journal-title":"Mol Syst Biol"},{"key":"2025082608120596500_ref23","doi-asserted-by":"publisher","first-page":"1289","DOI":"10.1038\/s41592-019-0619-0","article-title":"Fast, sensitive and accurate integration of single-cell data with harmony","volume":"16","author":"Korsunsky","year":"2019","journal-title":"Nat Methods"},{"key":"2025082608120596500_ref24","doi-asserted-by":"publisher","first-page":"3134","DOI":"10.1109\/JBHI.2024.3370868","article-title":"Learning consistency and specificity of cells from single-cell multi-omic data","volume":"28","author":"Wang","year":"2024","journal-title":"IEEE J Biomed Health Inform"},{"key":"2025082608120596500_ref25","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbab546","article-title":"Network-based integrative analysis of single-cell transcriptomic and epigenomic data for cell types","volume":"23","author":"Wu","year":"2022","journal-title":"Brief Bioinform"},{"key":"2025082608120596500_ref26","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbad081","article-title":"A universal framework for single-cell multi-omics data integration with graph convolutional networks","volume":"24","author":"Gao","year":"2023","journal-title":"Brief Bioinform"},{"key":"2025082608120596500_ref27","doi-asserted-by":"publisher","first-page":"1222","DOI":"10.1038\/s41592-023-01909-9","article-title":"MultiVI: deep generative model for the integration of multimodal data","volume":"20","author":"Ashuach","year":"2023","journal-title":"Nat Methods"},{"key":"2025082608120596500_ref28","doi-asserted-by":"publisher","first-page":"btad162","DOI":"10.1093\/bioinformatics\/btad162","article-title":"Con-AAE: contrastive cycle adversarial autoencoders for single-cell multi-omics alignment and integration","volume":"39","author":"Wang","year":"2023","journal-title":"Bioinformatics"},{"key":"2025082608120596500_ref29","article-title":"Learning transferable visual models from natural language supervision","volume-title":"Proceedings of the 38th International Conference on Machine Learning","author":"Radford"},{"key":"2025082608120596500_ref30"},{"key":"2025082608120596500_ref31","volume-title":"Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1 (NeurIPS Datasets and Benchmarks 2021)","author":"Luecken","year":"2021"},{"key":"2025082608120596500_ref32","doi-asserted-by":"publisher","first-page":"2190","DOI":"10.1038\/s41467-021-22368-w","article-title":"Single cell transcriptional and chromatin accessibility profiling redefine cellular heterogeneity in the adult human kidney","volume":"12","author":"Muto","year":"2021","journal-title":"Nat Commun"},{"key":"2025082608120596500_ref33","doi-asserted-by":"publisher","first-page":"6045","DOI":"10.1038\/s41467-023-41795-5","article-title":"scBridge embraces cell heterogeneity in single-cell RNA-seq and ATAC-seq data integration","volume":"14","author":"Li","year":"2023","journal-title":"Nat Commun"},{"key":"2025082608120596500_ref34","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1038\/s41540-022-00245-6","article-title":"sciCAN: single-cell chromatin accessibility and gene expression data integration via cycle-consistent adversarial network","volume":"8","author":"Xu","year":"2022","journal-title":"NPJ Syst Biol Appl"},{"key":"2025082608120596500_ref35","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1186\/s13059-020-02015-1","article-title":"MOFA+: a statistical framework for comprehensive integration of multi-modal single-cell data","volume":"21","author":"Argelaguet","year":"2020","journal-title":"Genome Biol"},{"key":"2025082608120596500_ref36","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1186\/s13059-017-1382-0","article-title":"SCANPY: Large-scale single-cell gene expression data analysis","volume":"19","author":"Wolf","year":"2018","journal-title":"Genome Biol"},{"key":"2025082608120596500_ref37","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.01874","article-title":"RegionPLC: regional point-language contrastive learning for open-world 3D scene understanding","volume-title":"2024 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Yang"},{"key":"2025082608120596500_ref38","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.01472","article-title":"Enhancing visual document understanding with contrastive learning in large visual-language models","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)","author":"Li"},{"key":"2025082608120596500_ref39","doi-asserted-by":"crossref","first-page":"627","DOI":"10.1109\/ICECA63461.2024.10800778","volume-title":"2024 8th International Conference on Electronics, Communication and Aerospace Technology (ICECA)","author":"Kosuru","year":"2024"},{"key":"2025082608120596500_ref40","doi-asserted-by":"publisher","DOI":"10.1101\/2024.06.20.599868","article-title":"Contrastive learning for robust cell annotation and representation from single-cell transcriptomics","author":"Andrekson","year":"2024"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/4\/bbaf411\/64137607\/bbaf411.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/4\/bbaf411\/64137607\/bbaf411.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,26]],"date-time":"2025-08-26T12:12:16Z","timestamp":1756210336000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaf411\/8241298"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7]]},"references-count":40,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,7,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaf411","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,7]]},"published":{"date-parts":[[2025,7]]},"article-number":"bbaf411"}}