{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T05:45:23Z","timestamp":1775627123436,"version":"3.50.1"},"reference-count":43,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2025,4,21]],"date-time":"2025-04-21T00:00:00Z","timestamp":1745193600000},"content-version":"vor","delay-in-days":51,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"publisher","award":["2022ZD0116004"],"award-info":[{"award-number":["2022ZD0116004"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Guangdong Talent Program","award":["2021CX02Y145"],"award-info":[{"award-number":["2021CX02Y145"]}]},{"name":"Guangdong Provincial Key Laboratory of Big Data Computing"},{"name":"Shenzhen Science and Technology Program","award":["ZDSYS20230626091302006"],"award-info":[{"award-number":["ZDSYS20230626091302006"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,3,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>The rapid progress of single-cell technology has facilitated cost-effective acquisition of diverse omics data, allowing biologists to unravel the complexities of cell populations, disease states, and more. Additionally, single-cell multi-omics technologies have opened new avenues for studying biological interactions. However, the high dimensionality and sparsity of omics data present significant analytical challenges. Dimension reduction (DR) techniques are hence essential for analyzing such complex data, yet many existing methods have inherent limitations. Linear methods like principal component analysis (PCA) struggle to capture intricate associations within data. In response, nonlinear techniques have emerged, but they may face scalability issues, be restricted to single-omics data, or prioritize visualization over generating informative embeddings. Here, we introduce dissimilarity based on conditional ordered list (DCOL) correlation, a novel measure for quantifying nonlinear relationships between variables. Based on this measure, we propose DCOL-PCA and DCOL-Canonical Correlation Analysis for dimension reduction and integration of single- and multi-omics data. In simulations, our methods outperformed nine DR methods and four joint dimension reduction methods, demonstrating stable performance across various settings. We also validated these methods on real datasets, with our method demonstrating its ability to detect intricate signals within and between omics data and generate lower dimensional embeddings that preserve the essential information and latent structures.<\/jats:p>","DOI":"10.1093\/bib\/bbaf184","type":"journal-article","created":{"date-parts":[[2025,4,21]],"date-time":"2025-04-21T03:57:00Z","timestamp":1745207820000},"source":"Crossref","is-referenced-by-count":2,"title":["Nonlinear embedding and integration of omics data: a fast and tuning-free approach"],"prefix":"10.1093","volume":"26","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-0260-8878","authenticated-orcid":false,"given":"Shengjie","family":"Liu","sequence":"first","affiliation":[{"name":"School of Data Science, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen) , 2001 Longxiang Boulevard, Longgang District, Shenzhen 518172, Guangdong ,","place":["P.R. China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2502-1628","authenticated-orcid":false,"given":"Tianwei","family":"Yu","sequence":"additional","affiliation":[{"name":"School of Data Science, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen) , 2001 Longxiang Boulevard, Longgang District, Shenzhen 518172, Guangdong ,","place":["P.R. China"]}]}],"member":"286","published-online":{"date-parts":[[2025,4,21]]},"reference":[{"key":"2025042023564405800_ref1","doi-asserted-by":"publisher","first-page":"1007","DOI":"10.1016\/j.tibtech.2020.02.013","article-title":"Integrative methods and practical challenges for single-cell multi-omics","volume":"38","author":"Ma","year":"2020","journal-title":"Trends Biotechnol"},{"key":"2025042023564405800_ref2","doi-asserted-by":"publisher","first-page":"109087","DOI":"10.1016\/j.compbiomed.2024.109087","article-title":"m5c-seq: Machine learning-enhanced profiling of rna 5-methylcytosine modifications","volume":"182","author":"Abbas","year":"2024","journal-title":"Comput Biol Med"},{"key":"2025042023564405800_ref3","doi-asserted-by":"publisher","first-page":"104536","DOI":"10.1016\/j.chemolab.2022.104536","article-title":"Alzheimer\u2019s disease prediction based on continuous feature representation using multi-omics data integration","volume":"223","author":"Abbas","year":"2022","journal-title":"Chemom Intel Lab Syst"},{"key":"2025042023564405800_ref4","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1080\/14786440109462720","article-title":"Liii. On lines and planes of closest fit to systems of points in space","volume":"2","author":"Pearson","year":"1901","journal-title":"London Edinburgh Dublin Philos Mag J Sci"},{"key":"2025042023564405800_ref5","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1016\/S0893-6080(00)00026-5","article-title":"Independent component analysis: Algorithms and applications","volume":"13","author":"Hyv\u00e4rinen","year":"2000","journal-title":"Neural Netw"},{"key":"2025042023564405800_ref6","doi-asserted-by":"crossref","first-page":"169","DOI":"10.3233\/AIC-170729","article-title":"Linear discriminant analysis: A detailed tutorial","volume":"30","author":"Tharwat","year":"2017","journal-title":"AI Commun"},{"key":"2025042023564405800_ref7","doi-asserted-by":"publisher","DOI":"10.1007\/BFb0020217","article-title":"Kernel principal component analysis","volume-title":"Artificial Neural Networks \u2013 ICANN\u201997","author":"Sch\u00f6lkopf"},{"key":"2025042023564405800_ref8","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-018-07931-2","article-title":"Single-cell rna-seq denoising using a deep count autoencoder","volume":"10","author":"Eraslan","year":"2019","journal-title":"Nat Commun"},{"key":"2025042023564405800_ref9","doi-asserted-by":"publisher","first-page":"2323","DOI":"10.1126\/science.290.5500.2323","article-title":"Nonlinear dimensionality reduction by locally linear embedding","volume":"290","author":"Roweis","year":"2000","journal-title":"Science"},{"key":"2025042023564405800_ref10","doi-asserted-by":"publisher","first-page":"1373","DOI":"10.1162\/089976603321780317","article-title":"Laplacian eigenmaps for dimensionality reduction and data representation","volume":"15","author":"Belkin","year":"2003","journal-title":"Neural Comput"},{"key":"2025042023564405800_ref11","doi-asserted-by":"publisher","volume-title":"Quantitative Applications in the Social Sciences","author":"Kruskal","DOI":"10.4135\/9781412985130"},{"key":"2025042023564405800_ref12","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1109\/T-C.1969.222678","article-title":"A nonlinear mapping for data structure analysis","volume":"C-18","author":"Sammon","year":"1969","journal-title":"IEEE Trans Comput"},{"key":"2025042023564405800_ref13","doi-asserted-by":"publisher","first-page":"2319","DOI":"10.1126\/science.290.5500.2319","article-title":"Global geometric framework for nonlinear dimensionality reduction [j]","volume":"290","author":"Teenbaum","year":"2000","journal-title":"Science"},{"key":"2025042023564405800_ref14","article-title":"Visualizing data using t-sne","volume":"9","author":"Van der Maaten","year":"2008","journal-title":"J Mach Learn Res"},{"key":"2025042023564405800_ref15","doi-asserted-by":"publisher","DOI":"10.21105\/joss.00861","article-title":"Umap: Uniform manifold approximation and projection for dimension reduction","volume-title":"Journal of Open Source Software","author":"McInnes"},{"key":"2025042023564405800_ref16","doi-asserted-by":"publisher","first-page":"229","DOI":"10.1038\/nmeth.3728","article-title":"Parallel single-cell sequencing links transcriptional and epigenetic heterogeneity","volume":"13","author":"Angermueller","year":"2016","journal-title":"Nat Methods"},{"key":"2025042023564405800_ref17","doi-asserted-by":"publisher","first-page":"833","DOI":"10.1038\/nmeth.3961","article-title":"Single-cell multimodal profiling reveals cellular epigenetic heterogeneity","volume":"13","author":"Cheow","year":"2016","journal-title":"Nat Methods"},{"key":"2025042023564405800_ref18","doi-asserted-by":"publisher","first-page":"865","DOI":"10.1038\/nmeth.4380","article-title":"Simultaneous epitope and transcriptome measurement in single cells","volume":"14","author":"Stoeckius","year":"2017","journal-title":"Nat Methods"},{"key":"2025042023564405800_ref19","doi-asserted-by":"publisher","first-page":"1103","DOI":"10.1016\/j.cell.2020.09.056","article-title":"Chromatin potential identified by shared single-cell profiling of rna and chromatin","volume":"183","author":"Ma","year":"2020","journal-title":"Cell"},{"key":"2025042023564405800_ref20","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1038\/nmeth.2810","article-title":"Similarity network fusion for aggregating data types on a genomic scale","volume":"11","author":"Wang","year":"2014","journal-title":"Nat Methods"},{"key":"2025042023564405800_ref21","doi-asserted-by":"publisher","first-page":"3573","DOI":"10.1016\/j.cell.2021.04.048","article-title":"Integrated analysis of multimodal single-cell data","volume":"184","author":"Hao","year":"2021","journal-title":"Cell"},{"key":"2025042023564405800_ref22","doi-asserted-by":"publisher","first-page":"841","DOI":"10.1109\/TCBB.2018.2806438","article-title":"A multimodal deep neural network for human breast cancer prognosis prediction by integrating multi-dimensional data","volume":"16","author":"Sun","year":"2018","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2025042023564405800_ref23","doi-asserted-by":"publisher","first-page":"e0176278","DOI":"10.1371\/journal.pone.0176278","article-title":"Integrative clustering of multi-level \u2018omic data based on non-negative matrix factorization algorithm","volume":"12","author":"Chalise","year":"2017","journal-title":"PloS One"},{"key":"2025042023564405800_ref24","doi-asserted-by":"publisher","first-page":"2906","DOI":"10.1093\/bioinformatics\/btp543","article-title":"Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis","volume":"25","author":"Shen","year":"2009","journal-title":"Bioinformatics"},{"key":"2025042023564405800_ref25","doi-asserted-by":"publisher","first-page":"e8124","DOI":"10.15252\/msb.20178124","article-title":"Multi-omics factor analysis\u2014A framework for unsupervised integration of multi-omics data sets","volume":"14","author":"Argelaguet","year":"2018","journal-title":"Mol Syst Biol"},{"key":"2025042023564405800_ref43","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-020-02015-1","article-title":"Mofa+: A statistical framework for comprehensive integration of multi-modal single-cell data","volume":"21","author":"Argelaguet","year":"2020","journal-title":"Genome Biol"},{"key":"2025042023564405800_ref26","doi-asserted-by":"publisher","first-page":"13","DOI":"10.14495\/jsiaml.7.13","article-title":"Joint singular value decomposition algorithm based on the Riemannian trust-region method","volume":"7","author":"Sato","year":"2015","journal-title":"JSIAM Lett"},{"key":"2025042023564405800_ref27","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4612-4380-9_14","article-title":"Relations between two sets of variates","volume-title":"Breakthroughs in Statistics","author":"Hotelling"},{"key":"2025042023564405800_ref28","article-title":"A kernel method for canonical correlation analysis","author":"Akaho","year":"2006"},{"key":"2025042023564405800_ref29","doi-asserted-by":"publisher","first-page":"277","DOI":"10.1111\/j.1365-2427.1994.tb01741.x","article-title":"Co-inertia analysis: An alternative method for studying species\u2013environment relationships","volume":"31","author":"Dol\u00e9dec","year":"1994","journal-title":"Freshwater Biol"},{"key":"2025042023564405800_ref30","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbaa287","article-title":"Deep-joint-learning analysis model of single cell transcriptome and open chromatin accessibility data","volume":"22","author":"Zuo","year":"2021","journal-title":"Brief Bioinform"},{"key":"2025042023564405800_ref31","doi-asserted-by":"publisher","first-page":"723","DOI":"10.1109\/TCBB.2010.73","article-title":"Incorporating nonlinear relationships in microarray missing value imputation","volume":"8","author":"Tianwei","year":"2010","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2025042023564405800_ref32","doi-asserted-by":"publisher","first-page":"3458","DOI":"10.1038\/s41467-020-17281-7","article-title":"Searching large-scale scrna-seq databases via unbiased cell embedding with cell blast","volume":"11","author":"Cao","year":"2020","journal-title":"Nat Commun"},{"key":"2025042023564405800_ref33","doi-asserted-by":"publisher","first-page":"571","DOI":"10.1038\/s41586-020-2316-7","article-title":"Deciphering human macrophage development at single-cell resolution","volume":"582","author":"Bian","year":"2020","journal-title":"Nature"},{"key":"2025042023564405800_ref34","volume-title":"10X Genomics. PBMC from a Healthy Donor, Single Cell Multiome ATAC Gene Expression Demonstration Data by Cell Ranger ARC 1.0.0","year":"2023"},{"key":"2025042023564405800_ref35","doi-asserted-by":"publisher","first-page":"1333","DOI":"10.1038\/s41592-021-01282-5","article-title":"Single-cell chromatin state analysis with signac","volume":"18","author":"Stuart","year":"2021","journal-title":"Nat Methods"},{"key":"2025042023564405800_ref36","doi-asserted-by":"publisher","first-page":"496521","DOI":"10.1101\/496521","article-title":"Improved peak-calling with macs2","author":"Gaspar","year":"2018"},{"key":"2025042023564405800_ref37","doi-asserted-by":"publisher","DOI":"10.1038\/s41587-023-01767-y","article-title":"Dictionary learning for integrative, multimodal and scalable single-cell analysis","volume":"42","author":"Hao","journal-title":"Nat Biotechnol"},{"key":"2025042023564405800_ref38","doi-asserted-by":"crossref","first-page":"996","DOI":"10.1214\/aos\/1176343954","article-title":"General distribution theory of the concomitants of order statistics","volume":"5","author":"Yang","year":"1977","journal-title":"Ann Stat"},{"key":"2025042023564405800_ref39","doi-asserted-by":"publisher","first-page":"1034","DOI":"10.1214\/aos\/1176342823","article-title":"Convergence of sample paths of normalized sums of induced order statistics","volume":"2","author":"Bhattacharya","year":"1974","journal-title":"Ann Stat"},{"key":"2025042023564405800_ref40","doi-asserted-by":"publisher","DOI":"10.1201\/9781315119427","volume-title":"Sufficient Dimension Reduction: Methods and Applications with R","author":"Li","year":"2018"},{"key":"2025042023564405800_ref41","doi-asserted-by":"publisher","volume":"334","journal-title":"Science","DOI":"10.1126\/science.1205438"},{"key":"2025042023564405800_ref42","doi-asserted-by":"publisher","DOI":"10.1214\/09-AOAS312","article-title":"Brownian distance covariance","volume":"3","author":"Szekely","year":"2009","journal-title":"Ann Appl Stat"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/2\/bbaf184\/62958299\/bbaf184.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/2\/bbaf184\/62958299\/bbaf184.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,21]],"date-time":"2025-04-21T03:57:02Z","timestamp":1745207822000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaf184\/8116688"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3]]},"references-count":43,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2025,3,4]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaf184","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,3]]},"published":{"date-parts":[[2025,3]]},"article-number":"bbaf184"}}