{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,6]],"date-time":"2026-03-06T03:16:43Z","timestamp":1772767003344,"version":"3.50.1"},"reference-count":48,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2022,3,25]],"date-time":"2022-03-25T00:00:00Z","timestamp":1648166400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,5,13]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Single-cell sequencing brings about a revolutionarily high resolution for finding differentially expressed genes (DEGs) by disentangling highly heterogeneous cell tissues. Yet, such analysis is so far mostly focused on comparing between different cell types from the same individual. As single-cell sequencing becomes cheaper and easier to use, an increasing number of datasets from case\u2013control studies are becoming available, which call for new methods for identifying differential expressions between case and control individuals.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>To bridge this gap, we propose barycenter single-cell differential expression (BSDE), a nonparametric method for finding DEGs for case\u2013control studies. Through the use of optimal transportation for aggregating distributions and computing their distances, our method overcomes the restrictive parametric assumptions imposed by standard mixed-effect-modeling approaches. Through simulations, we show that BSDE can accurately detect a variety of differential expressions while maintaining the type-I error at a prescribed level. Further, 1345 and 1568 cell type-specific DEGs are identified by BSDE from datasets on pulmonary fibrosis and multiple sclerosis, among which the top findings are supported by previous results from the literature.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>R package BSDE is freely available from doi.org\/10.5281\/zenodo.6332254. For real data analysis with the R package, see doi.org\/10.5281\/zenodo.6332566. These can also be accessed thorough GitHub at github.com\/mqzhanglab\/BSDE and github.com\/mqzhanglab\/BSDE_pipeline. The two single-cell sequencing datasets can be download with UCSC cell browser from cells.ucsc.edu\/?ds=ms and cells.ucsc.edu\/?ds=lung-pf-control.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btac171","type":"journal-article","created":{"date-parts":[[2022,3,23]],"date-time":"2022-03-23T20:13:00Z","timestamp":1648066380000},"page":"2765-2772","source":"Crossref","is-referenced-by-count":11,"title":["BSDE: barycenter single-cell differential expression for case\u2013control studies"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1091-1014","authenticated-orcid":false,"given":"Mengqi","family":"Zhang","sequence":"first","affiliation":[{"name":"Department of Surgery, Perelman Medical School, University of Pennsylvania , Philadelphia, PA 19104, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2081-7398","authenticated-orcid":false,"given":"F Richard","family":"Guo","sequence":"additional","affiliation":[{"name":"Statistical Laboratory, University of Cambridge , Cambridge CB3 0WB, UK"}]}],"member":"286","published-online":{"date-parts":[[2022,3,25]]},"reference":[{"key":"2023020109123629800_btac171-B1","doi-asserted-by":"crossref","first-page":"904","DOI":"10.1137\/100805741","article-title":"Barycenters in the Wasserstein space","volume":"43","author":"Agueh","year":"2011","journal-title":"SIAM J. Math. Anal"},{"key":"2023020109123629800_btac171-B2","first-page":"1961","author":"Altschuler","year":"2017"},{"key":"2023020109123629800_btac171-B3","doi-asserted-by":"crossref","first-page":"A1111","DOI":"10.1137\/141000439","article-title":"Iterative Bregman projections for regularized transportation problems","volume":"37","author":"Benamou","year":"2015","journal-title":"SIAM J. Sci. Comput"},{"key":"2023020109123629800_btac171-B4","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1007\/s00401-013-1112-y","article-title":"Insufficient OPC migration into demyelinated lesions is a cause of poor remyelination in MS and mouse models","volume":"125","author":"Boyd","year":"2013","journal-title":"Acta Neuropathol"},{"key":"2023020109123629800_btac171-B5","doi-asserted-by":"crossref","first-page":"e0142497","DOI":"10.1371\/journal.pone.0142497","article-title":"Secretoglobin 3A2 exhibits anti-fibrotic activity in bleomycin-induced pulmonary fibrosis model mice","volume":"10","author":"Cai","year":"2015","journal-title":"PLoS One"},{"key":"2023020109123629800_btac171-B6","doi-asserted-by":"crossref","first-page":"1540","DOI":"10.1164\/rccm.201904-0792OC","article-title":"Single-cell reconstruction of human basal cell diversity in normal and idiopathic pulmonary fibrosis lungs","volume":"202","author":"Carraro","year":"2020","journal-title":"Am. J. Respir. Crit. Care Med"},{"key":"2023020109123629800_btac171-B7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-020-19894-4","article-title":"Muscat detects subpopulation-specific state transitions from multi-sample multi-condition single-cell transcriptomics data","volume":"11","author":"Crowell","year":"2020","journal-title":"Nat. Commun"},{"key":"2023020109123629800_btac171-B8","volume-title":"Advances in Neural Information Processing Systems","author":"Cuturi","year":"2013"},{"key":"2023020109123629800_btac171-B9","first-page":"685","volume-title":"Proceedings of the 31st International Conference on Machine Learning","author":"Cuturi","year":"2014"},{"key":"2023020109123629800_btac171-B10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-016-0944-6","article-title":"Discrete distributional differential expression (D3E)\u2014a tool for gene expression analysis of single-cell RNA-seq data","volume":"17","author":"Delmans","year":"2016","journal-title":"BMC Bioinformatics"},{"key":"2023020109123629800_btac171-B11","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1038\/nature12647","article-title":"A regenerative approach to the treatment of multiple sclerosis","volume":"502","author":"Deshmukh","year":"2013","journal-title":"Nature"},{"key":"2023020109123629800_btac171-B12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-015-0844-5","article-title":"MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data","volume":"16","author":"Finak","year":"2015","journal-title":"Genome Biol"},{"key":"2023020109123629800_btac171-B13","first-page":"1","article-title":"POT: python optimal transport","volume":"22","author":"Flamary","year":"2021","journal-title":"J. Mach. Learn. Res"},{"key":"2023020109123629800_btac171-B14","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1016\/j.cell.2017.11.011","article-title":"Single-cell genomics: a stepping stone for future immunology discoveries","volume":"172","author":"Giladi","year":"2018","journal-title":"Cell"},{"key":"2023020109123629800_btac171-B15","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1007\/978-3-319-19992-4_20","volume-title":"International Conference on Information Processing in Medical Imaging","author":"Gramfort","year":"2015"},{"key":"2023020109123629800_btac171-B16","doi-asserted-by":"crossref","first-page":"e1004575","DOI":"10.1371\/journal.pcbi.1004575","article-title":"SINCERA: a pipeline for single-cell RNA-seq profiling analysis","volume":"11","author":"Guo","year":"2015","journal-title":"PLoS Comput. Biol"},{"key":"2023020109123629800_btac171-B17","doi-asserted-by":"crossref","first-page":"eaba1972","DOI":"10.1126\/sciadv.aba1972","article-title":"Single-cell RNA sequencing reveals profibrotic roles of distinct epithelial and mesenchymal lineages in pulmonary fibrosis","volume":"6","author":"Habermann","year":"2020","journal-title":"Sci. Adv"},{"key":"2023020109123629800_btac171-B18","doi-asserted-by":"crossref","first-page":"740","DOI":"10.1038\/nmeth.2967","article-title":"Bayesian approach to single-cell differential expression analysis","volume":"11","author":"Kharchenko","year":"2014","journal-title":"Nat. Methods"},{"key":"2023020109123629800_btac171-B19","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-016-1077-y","article-title":"A statistical approach for identifying differential distributions in single-cell RNA-seq experiments","volume":"17","author":"Korthauer","year":"2016","journal-title":"Genome Biol"},{"key":"2023020109123629800_btac171-B20","doi-asserted-by":"crossref","first-page":"647","DOI":"10.1038\/nrneurol.2012.168","article-title":"Progressive multiple sclerosis: pathology and pathogenesis","volume":"8","author":"Lassmann","year":"2012","journal-title":"Nat. Rev. Neurol"},{"key":"2023020109123629800_btac171-B21","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2023020109123629800_btac171-B22","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1093\/biostatistics\/kxw055","article-title":"Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data","volume":"18","author":"Lun","year":"2017","journal-title":"Biostatistics"},{"key":"2023020109123629800_btac171-B23","doi-asserted-by":"crossref","first-page":"3223","DOI":"10.1093\/bioinformatics\/bty332","article-title":"DEsingle for detecting three types of differential expression in single-cell RNA-seq data","volume":"34","author":"Miao","year":"2018","journal-title":"Bioinformatics"},{"key":"2023020109123629800_btac171-B24","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1002\/0471142735.im1501s88","article-title":"Experimental autoimmune encephalomyelitis in the mouse","volume":"88","author":"Miller","year":"2010","journal-title":"Curr. Protoc. Immunol"},{"key":"2023020109123629800_btac171-B25","doi-asserted-by":"crossref","first-page":"963","DOI":"10.1109\/GlobalSIP.2015.7418340","volume-title":"2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP).","author":"Nabavi","year":"2015"},{"key":"2023020109123629800_btac171-B26","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1093\/bioinformatics\/btv634","article-title":"EMDomics: a robust and powerful method for the identification of genes differentially expressed between heterogeneous classes","volume":"32","author":"Nabavi","year":"2016","journal-title":"Bioinformatics"},{"key":"2023020109123629800_btac171-B27","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1038\/nm.2324","article-title":"A reversible form of axon damage in experimental autoimmune encephalomyelitis and multiple sclerosis","volume":"17","author":"Niki\u0107","year":"2011","journal-title":"Nat. Med"},{"key":"2023020109123629800_btac171-B28","doi-asserted-by":"crossref","first-page":"2269","DOI":"10.3390\/ijms21072269","article-title":"Alveolar epithelial type II cells as drivers of lung fibrosis in idiopathic pulmonary fibrosis","volume":"21","author":"Parimon","year":"2020","journal-title":"Int. J. Mol. Sci"},{"key":"2023020109123629800_btac171-B29","doi-asserted-by":"crossref","first-page":"eaav7188","DOI":"10.1126\/science.aav7188","article-title":"Multiple sclerosis genomic map implicates peripheral immune cells and microglia in susceptibility","volume":"365","author":"Patsopoulos","year":"2019","journal-title":"Science"},{"key":"2023020109123629800_btac171-B30","doi-asserted-by":"crossref","first-page":"2232","DOI":"10.1001\/jama.2013.5827","article-title":"Association between the MUC5B promoter polymorphism and survival in patients with idiopathic pulmonary fibrosis","volume":"309","author":"Peljto","year":"2013","journal-title":"JAMA"},{"key":"2023020109123629800_btac171-B31","doi-asserted-by":"crossref","first-page":"217","DOI":"10.3389\/fimmu.2018.00217","article-title":"The role of astrocytes in multiple sclerosis","volume":"9","author":"Ponath","year":"2018","journal-title":"Front. Immunol"},{"key":"2023020109123629800_btac171-B32","doi-asserted-by":"crossref","first-page":"646","DOI":"10.1002\/ana.1255","article-title":"Immunopathology of secondary-progressive multiple sclerosis","volume":"50","author":"Prineas","year":"2001","journal-title":"Ann. Neurol"},{"key":"2023020109123629800_btac171-B33","doi-asserted-by":"crossref","first-page":"979","DOI":"10.1038\/nmeth.4402","article-title":"Reversed graph embedding resolves complex single-cell trajectories","volume":"14","author":"Qiu","year":"2017","journal-title":"Nat. Methods"},{"key":"2023020109123629800_btac171-B34","first-page":"435","volume-title":"International Conference on Scale Space and Variational Methods in Computer Vision","author":"Rabin","year":"2011"},{"key":"2023020109123629800_btac171-B35","doi-asserted-by":"crossref","first-page":"1941","DOI":"10.1016\/S0140-6736(17)30866-8","article-title":"Idiopathic pulmonary fibrosis","volume":"389","author":"Richeldi","year":"2017","journal-title":"Lancet"},{"key":"2023020109123629800_btac171-B36","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020109123629800_btac171-B37","doi-asserted-by":"crossref","first-page":"928","DOI":"10.1016\/j.cell.2019.01.006","article-title":"Optimal-transport analysis of single-cell gene expression identifies developmental trajectories in reprogramming","volume":"176","author":"Schiebinger","year":"2019","journal-title":"Cell"},{"key":"2023020109123629800_btac171-B38","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1038\/s41586-019-1404-z","article-title":"Neuronal vulnerability and multilineage diversity in multiple sclerosis","volume":"573","author":"Schirmer","year":"2019","journal-title":"Nature"},{"key":"2023020109123629800_btac171-B39","doi-asserted-by":"crossref","first-page":"1503","DOI":"10.1056\/NEJMoa1013660","article-title":"A common MUC5B promoter polymorphism and pulmonary fibrosis","volume":"364","author":"Seibold","year":"2011","journal-title":"N. Engl. J. Med"},{"key":"2023020109123629800_btac171-B40","doi-asserted-by":"crossref","first-page":"3243","DOI":"10.1093\/bioinformatics\/btab337","article-title":"Differential gene expression analysis for multi-subject single cell RNA sequencing studies with aggregateBioVar","volume":"37","author":"Thurman","year":"2021","journal-title":"Bioinformatics"},{"key":"2023020109123629800_btac171-B41","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1038\/nbt.2859","article-title":"The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells","volume":"32","author":"Trapnell","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2023020109123629800_btac171-B42","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1126\/science.aav8130","article-title":"Single-cell genomics identifies cell type\u2013specific molecular changes in autism","volume":"364","author":"Velmeshev","year":"2019","journal-title":"Science"},{"key":"2023020109123629800_btac171-B43","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-540-71050-9","volume-title":"Optimal Transport: Old and New.","author":"Villani","year":"2009"},{"key":"2023020109123629800_btac171-B44","doi-asserted-by":"crossref","first-page":"202","DOI":"10.1109\/BIBM.2017.8217650","volume-title":"2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","author":"Wang","year":"2017"},{"key":"2023020109123629800_btac171-B45","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/j.ymeth.2018.04.017","article-title":"SigEMD: a powerful method for differential gene expression analysis in single-cell RNA sequencing data","volume":"145","author":"Wang","year":"2018","journal-title":"Methods"},{"key":"2023020109123629800_btac171-B46","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.cell.2019.11.027","article-title":"Progressive pulmonary fibrosis is caused by elevated mechanical tension on alveolar stem cells","volume":"180","author":"Wu","year":"2020","journal-title":"Cell"},{"key":"2023020109123629800_btac171-B47","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1186\/s13059-022-02605-1","article-title":"Ideas: individual level differential expression analysis for single-cell RNA-seq data","volume":"23","author":"Zhang","year":"2022","journal-title":"Genome Biol"},{"key":"2023020109123629800_btac171-B48","doi-asserted-by":"crossref","first-page":"e0237529","DOI":"10.1371\/journal.pone.0237529","article-title":"Dysregulation of club cell biology in idiopathic pulmonary fibrosis","volume":"15","author":"Zuo","year":"2020","journal-title":"PLoS One"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btac171\/43293470\/btac171.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/10\/2765\/49010571\/btac171.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/10\/2765\/49010571\/btac171.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T21:06:11Z","timestamp":1675285571000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/10\/2765\/6554192"}},"subtitle":[],"editor":[{"given":"Anthony","family":"Mathelier","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,3,25]]},"references-count":48,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2022,5,13]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btac171","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,5,15]]},"published":{"date-parts":[[2022,3,25]]}}}