{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T00:10:44Z","timestamp":1760659844130,"version":"build-2065373602"},"reference-count":49,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2025,10,16]],"date-time":"2025-10-16T00:00:00Z","timestamp":1760572800000},"content-version":"vor","delay-in-days":46,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"Deutsche Forschungsgemeinschaft KFO5011","award":["445703531"],"award-info":[{"award-number":["445703531"]}]},{"DOI":"10.13039\/501100002347","name":"Bundesministerium f\u00fcr Bildung und Forschung","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,8,31]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>The analysis of single-cell disease atlases represents a challenge due to the presence of batch effects, low quality of disease samples, and the multiscale nature of the data, i.e. samples are described by different cell distributions. Because of these, few computational approaches are performing sample-level disease progression analysis so far. Here, we introduce Patient-Level Analysis with Optimal Transport based on Gaussian Mixture Variational Autoencoders (PILOT-GM-VAE). PILOT-GM-VAE explores the power of GM-VAE to estimate models describing complex single-cell distributions through efficient optimal transport algorithms for estimating the distance between Gaussian Mixtures. Extensive benchmarking on 12 single-cell disease atlases and competing approaches demonstrate the performance of PILOT-GM-VAE in sample-level clustering, sample-level trajectory inference, and batch correction tasks. Moreover, we performed a case study on a breast cancer disease atlas, where PILOT-GM-VAE highlighted cellular and molecular changes associated with breast cancer disease progression.<\/jats:p>","DOI":"10.1093\/bib\/bbaf547","type":"journal-article","created":{"date-parts":[[2025,10,16]],"date-time":"2025-10-16T04:03:33Z","timestamp":1760587413000},"source":"Crossref","is-referenced-by-count":0,"title":["PILOT-GM-VAE: patient-level analysis of single-cell disease atlas with optimal transport of Gaussian mixture variational autoencoders"],"prefix":"10.1093","volume":"26","author":[{"given":"Mehdi","family":"Joodaki","sequence":"first","affiliation":[{"name":"Institute for Computational Genomics, Center for Computational Life Sciences, RWTH Aachen University Medical School , Pauwelstr. 19, 52074 Aachen ,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mina","family":"Shaigan","sequence":"additional","affiliation":[{"name":"Institute for Computational Genomics, Center for Computational Life Sciences, RWTH Aachen University Medical School , Pauwelstr. 19, 52074 Aachen ,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Samaneh","family":"Samiei","sequence":"additional","affiliation":[{"name":"Department of Nephrology, RWTH Aachen University Medical School , Pauwelstr, 30, 52074 Aachen ,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"James","family":"Nagai","sequence":"additional","affiliation":[{"name":"Institute for Computational Genomics, Center for Computational Life Sciences, RWTH Aachen University Medical School , Pauwelstr. 19, 52074 Aachen ,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tiago","family":"Mai\u00e9","sequence":"additional","affiliation":[{"name":"Institute for Computational Genomics, Center for Computational Life Sciences, RWTH Aachen University Medical School , Pauwelstr. 19, 52074 Aachen ,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christoph","family":"Kuppe","sequence":"additional","affiliation":[{"name":"Department of Nephrology, RWTH Aachen University Medical School , Pauwelstr, 30, 52074 Aachen ,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ivan G","family":"Costa","sequence":"additional","affiliation":[{"name":"Institute for Computational Genomics, Center for Computational Life Sciences, RWTH Aachen University Medical School , Pauwelstr. 19, 52074 Aachen ,","place":["Germany"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2025,10,16]]},"reference":[{"key":"2025101600032659900_ref1","first-page":"377","article-title":"mRNA-seq whole-transcriptome analysis of a single cell","volume":"2015","author":"Tang","year":"2014","journal-title":"Nat Methods"},{"key":"2025101600032659900_ref2","doi-asserted-by":"publisher","first-page":"1202","DOI":"10.1016\/j.cell.2015.05.002","article-title":"Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets","volume":"161","author":"Macosko","year":"2015","journal-title":"Cell"},{"key":"2025101600032659900_ref3","doi-asserted-by":"publisher","first-page":"553","DOI":"10.1038\/d41586-024-03552-6","article-title":"Cellular atlases are unlocking the mysteries of the human body","volume":"635","author":"Yanai","year":"2024","journal-title":"Nature"},{"key":"2025101600032659900_ref4","doi-asserted-by":"publisher","first-page":"1563","DOI":"10.1038\/s41591-023-02327-2","article-title":"An integrated cell atlas of the lung in health and disease","volume":"29","author":"Sikkema","year":"2023","journal-title":"Nat Med"},{"key":"2025101600032659900_ref5","doi-asserted-by":"publisher","first-page":"1895","DOI":"10.1016\/j.cell.2021.01.053","article-title":"COVID-19 immune features revealed by a large-scale single-cell transcriptome atlas","volume":"184","author":"Ren","year":"2021","journal-title":"Cell"},{"key":"2025101600032659900_ref6","doi-asserted-by":"publisher","first-page":"766","DOI":"10.1038\/s41586-022-05060-x","article-title":"Spatial multi-omic map of human myocardial infarction","volume":"608","author":"Kuppe","year":"2022","journal-title":"Nature"},{"key":"2025101600032659900_ref7","doi-asserted-by":"publisher","first-page":"585","DOI":"10.1038\/s41586-023-05769-3","article-title":"An atlas of healthy and injured cell states and niches in the human kidney","volume":"619","author":"Lake","year":"2023","journal-title":"Nature"},{"key":"2025101600032659900_ref8","doi-asserted-by":"crossref","first-page":"1274","DOI":"10.1038\/s41597-024-04117-y","article-title":"A multi-region single nucleus transcriptomic atlas of Parkinson\u2019s disease","volume":"11","author":"Prashant","year":"2024","journal-title":"Scientific Data"},{"key":"2025101600032659900_ref9","doi-asserted-by":"publisher","first-page":"2366","DOI":"10.1038\/s41593-024-01774-5","article-title":"Integrated multimodal cell atlas of Alzheimer\u2019s disease","volume":"27","author":"Gabitto","year":"2024","journal-title":"Nat Neurosci"},{"key":"2025101600032659900_ref10","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1038\/s41592-021-01336-8","article-title":"Benchmarking atlas-level data integration in single-cell genomics","volume":"19","author":"Luecken","year":"2022","journal-title":"Nat Methods"},{"key":"2025101600032659900_ref11","doi-asserted-by":"publisher","first-page":"302","DOI":"10.1038\/s41592-019-0689-z","article-title":"Uncovering axes of variation among single-cell cancer specimens","volume":"17","author":"Chen","year":"2020","journal-title":"Nat Methods"},{"key":"2025101600032659900_ref12","doi-asserted-by":"publisher","first-page":"57","DOI":"10.1038\/s44320-023-00003-8","article-title":"Detection of PatIent-level distances from single cell genomics and pathomics data with optimal transport (PILOT)","volume":"20","author":"Joodaki","year":"2024","journal-title":"Mol Syst Biol"},{"key":"2025101600032659900_ref13","first-page":"355","article-title":"Computational optimal transport: with applications to data science. Foundations and trends$\\circledR $","volume":"11","author":"Peyr\u00e9","year":"2019","journal-title":"Mach Learn"},{"key":"2025101600032659900_ref14","first-page":"1","article-title":"QOT: quantized optimal transport for sample-level distance matrix in single-cell omics","volume":"26","author":"Wang","year":"2025","journal-title":"Brief Bioinform"},{"key":"2025101600032659900_ref15","doi-asserted-by":"publisher","first-page":"259","DOI":"10.1186\/s13059-024-03398-1","article-title":"Visualizing scRNA-seq data at population scale with GloScope","volume":"25","author":"Wang","year":"2024","journal-title":"Genome Biol"},{"key":"2025101600032659900_ref16","doi-asserted-by":"publisher","first-page":"6269","DOI":"10.1109\/ACCESS.2018.2889838","article-title":"Optimal transport for Gaussian mixture models","volume":"7","author":"Chen","year":"2018","journal-title":"IEEE Access"},{"key":"2025101600032659900_ref17","first-page":"1005","article-title":"Wasserstein geometry of Gaussian measures","volume":"48","author":"Takatsu","year":"2011","journal-title":"Osaka J Math"},{"key":"2025101600032659900_ref18","article-title":"Semi-supervised learning with deep generative models","volume-title":"Advances in Neural Information Processing Systems","author":"Kingma","year":"2014"},{"key":"2025101600032659900_ref19","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1038\/s41586-023-06252-9","article-title":"A spatially resolved single-cell genomic atlas of the adult human breast","volume":"620","author":"Kumar","year":"2023","journal-title":"Nature"},{"key":"2025101600032659900_ref20","doi-asserted-by":"publisher","author":"Jang","year":"2016","DOI":"10.48550\/arXiv.1611.01144"},{"key":"2025101600032659900_ref21","first-page":"2825","article-title":"Scikit-learn: machine learning in python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J Mach Learn Res"},{"key":"2025101600032659900_ref22","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2070781.2024192","article-title":"Displacement interpolation using Lagrangian mass transport","volume":"30","author":"Bonneel","year":"2011","journal-title":"ACM Trans Graph"},{"key":"2025101600032659900_ref23","doi-asserted-by":"publisher","first-page":"165","DOI":"10.1016\/j.exmath.2018.01.002","article-title":"On the Bures\u2013Wasserstein distance between positive definite matrices","volume":"37","author":"Bhatia","year":"2019","journal-title":"Expo Math"},{"key":"2025101600032659900_ref24","doi-asserted-by":"publisher","first-page":"5233","DOI":"10.1038\/s41598-019-41695-z","article-title":"From Louvain to Leiden: guaranteeing well-connected communities","volume":"9","author":"Traag","year":"2019","journal-title":"Sci Rep"},{"key":"2025101600032659900_ref25","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1016\/j.acha.2006.04.006","article-title":"Diffusion maps","volume":"21","author":"Coifman","year":"2006","journal-title":"Appl Comput Harmon Anal"},{"key":"2025101600032659900_ref26","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1016\/j.acha.2015.01.001","article-title":"Variable bandwidth diffusion kernels","volume":"40","author":"Berry","year":"2016","journal-title":"Appl Comput Harmon Anal"},{"key":"2025101600032659900_ref27","doi-asserted-by":"publisher","first-page":"296","DOI":"10.3390\/e22030296","article-title":"Robust and scalable learning of complex intrinsic dataset geometry via ElPiGraph","volume":"22","author":"Albergante","year":"2020","journal-title":"Entropy (Basel)"},{"key":"2025101600032659900_ref28","doi-asserted-by":"publisher","first-page":"1615","DOI":"10.1038\/s42255-023-00876-x","article-title":"Delineating mouse cell identity during lifetime and in diabetes with a single cell atlas","volume":"5","author":"Hrovatin","year":"2023","journal-title":"Nat Metab"},{"key":"2025101600032659900_ref29","doi-asserted-by":"publisher","first-page":"780","DOI":"10.1038\/s42003-024-06478-x","article-title":"Single-cell transcriptional profiling of clear cell renal cell carcinoma reveals a tumor-associated endothelial tip cell phenotype","volume":"7","author":"Zvirblyte","year":"2024","journal-title":"Commun Biol"},{"key":"2025101600032659900_ref30","doi-asserted-by":"publisher","DOI":"10.1126\/science.abf1970","article-title":"Single-cell RNA-seq reveals cell type-specific molecular and genetic associations to lupus","volume":"376","author":"Perez","year":"2022","journal-title":"Science"},{"key":"2025101600032659900_ref31","doi-asserted-by":"publisher","first-page":"777","DOI":"10.1038\/s41422-019-0212-1","article-title":"Author correction: single-cell RNA-seq highlights intra-tumoral heterogeneity and malignant progression in pancreatic ductal adenocarcinoma","volume":"29","author":"Peng","year":"2019","journal-title":"Cell Res"},{"key":"2025101600032659900_ref32","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1093\/nar\/30.1.207","article-title":"Gene expression omnibus: NCBI gene expression and hybridization array data repository","volume":"30","author":"Edgar","year":"2002","journal-title":"Nucleic Acids Res"},{"key":"2025101600032659900_ref33","doi-asserted-by":"publisher","first-page":"381","DOI":"10.1038\/nbt.2859","article-title":"The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells","volume":"32","author":"Trapnell","year":"2014","journal-title":"Nat Biotechnol"},{"key":"2025101600032659900_ref34","doi-asserted-by":"crossref","first-page":"2493","DOI":"10.1214\/11-AOAS493","article-title":"Classification and clustering of sequencing data using a Poisson model","volume":"5","author":"Witten","year":"2011","journal-title":"Ann Appl Stat"},{"key":"2025101600032659900_ref35","article-title":"Rectified linear units improve restricted Boltzmann machines","volume-title":"Proceedings of the 27th International Conference on Machine Learning (ICML-10)","author":"Nair"},{"key":"2025101600032659900_ref36","article-title":"A method for stochastic optimization","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Kinga","year":"2015"},{"key":"2025101600032659900_ref37","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1007\/BF01908075","article-title":"Comparing partitions","volume":"2","author":"Hubert","year":"1985","journal-title":"J Classif"},{"key":"2025101600032659900_ref38","volume-title":"Distribution-Free Multiple Comparisons","author":"Nemenyi","year":"1963"},{"key":"2025101600032659900_ref39","doi-asserted-by":"publisher","first-page":"1753","DOI":"10.1214\/aoms\/1177699803","article-title":"A robust version of the probability ratio test","volume":"36","author":"Huber","year":"1965","journal-title":"Ann Math Stat"},{"key":"2025101600032659900_ref40","doi-asserted-by":"publisher","first-page":"1201","DOI":"10.1038\/s41467-020-14766-3","article-title":"Trajectory-based differential expression analysis for single-cell sequencing data","volume":"11","author":"Van den Berge","year":"2020","journal-title":"Nat Commun"},{"key":"2025101600032659900_ref41","doi-asserted-by":"publisher","first-page":"W193","DOI":"10.1093\/nar\/gkm226","article-title":"G profiler\u2013a web-based toolset for functional profiling of gene lists from large-scale experiments","volume":"35","author":"Reimand","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2025101600032659900_ref42","doi-asserted-by":"publisher","first-page":"188356","DOI":"10.1016\/j.bbcan.2020.188356","article-title":"Fibrosis and cancer: a strained relationship","volume":"1873","author":"Piersma","year":"2020","journal-title":"Biochim Biophys Acta Rev Cancer"},{"key":"2025101600032659900_ref43","doi-asserted-by":"publisher","first-page":"5094","DOI":"10.1158\/1078-0432.CCR-24-1518","article-title":"High mechanical conditioning by tumor extracellular matrix stiffness is a predictive biomarker for antifibrotic therapy in HER2-negative breast cancer","volume":"30","author":"Quintela-Fandino","year":"2024","journal-title":"Clin Cancer Res"},{"key":"2025101600032659900_ref44","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1038\/nature11412","article-title":"Comprehensive molecular portraits of human breast tumours","volume":"490","author":"Cancer Genome Atlas Network","year":"2012","journal-title":"Nature"},{"key":"2025101600032659900_ref45","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1186\/s13058-015-0636-6","article-title":"Prosaposin activates the androgen receptor and potentiates resistance to endocrine treatment in breast cancer","volume":"17","author":"Ali","year":"2015","journal-title":"Breast Cancer Res"},{"key":"2025101600032659900_ref46","doi-asserted-by":"publisher","first-page":"1820","DOI":"10.1111\/j.1349-7006.2012.02374.x","article-title":"Prosaposin, a regulator of estrogen receptor alpha, promotes breast cancer growth","volume":"103","author":"Wu","year":"2012","journal-title":"Cancer Sci"},{"key":"2025101600032659900_ref47","doi-asserted-by":"publisher","first-page":"3535","DOI":"10.1109\/TCBB.2023.3298334","article-title":"Multi-view clustering with graph learning for scRNA-seq data","volume":"20","author":"Wu","year":"2023","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2025101600032659900_ref48","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbac068","article-title":"Learning deep features and topological structure of cells for clustering of scRNA-sequencing data","author":"Wang","year":"2022","journal-title":"Brief Bioinform"},{"key":"2025101600032659900_ref49","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbac317","article-title":"Learning discriminative and structural samples for rare cell types with deep generative model","author":"Wang","year":"2022","journal-title":"Brief Bioinform"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/5\/bbaf547\/64714085\/bbaf547.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/5\/bbaf547\/64714085\/bbaf547.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,16]],"date-time":"2025-10-16T04:03:39Z","timestamp":1760587419000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaf547\/8287234"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,31]]},"references-count":49,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2025,8,31]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaf547","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,9]]},"published":{"date-parts":[[2025,8,31]]},"article-number":"bbaf547"}}