{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:42:25Z","timestamp":1753875745031,"version":"3.41.2"},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2025,2,10]],"date-time":"2025-02-10T00:00:00Z","timestamp":1739145600000},"content-version":"vor","delay-in-days":80,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000049","name":"National Institute on Aging","doi-asserted-by":"publisher","award":["1U54AG075936"],"award-info":[{"award-number":["1U54AG075936"]}],"id":[{"id":"10.13039\/100000049","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000066","name":"National Institute of Environmental Health Sciences","doi-asserted-by":"publisher","award":["NIH 1R21ES032159"],"award-info":[{"award-number":["NIH 1R21ES032159"]}],"id":[{"id":"10.13039\/100000066","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000049","name":"National Institute on Aging","doi-asserted-by":"publisher","award":["NIH 1U54AG075931"],"award-info":[{"award-number":["NIH 1U54AG075931"]}],"id":[{"id":"10.13039\/100000049","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000051","name":"National Human Genome Research Institute","doi-asserted-by":"publisher","award":["NIH 1R01HG012555"],"award-info":[{"award-number":["NIH 1R01HG012555"]}],"id":[{"id":"10.13039\/100000051","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Instite on Aging","award":["1U54AG075936"],"award-info":[{"award-number":["1U54AG075936"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,11,22]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>In single-cell studies, cells can be characterized with multiple sources of heterogeneity (SOH) such as cell type, developmental stage, cell cycle phase, activation state, and so on. In some studies, many nuisance SOH are of no interest, but may confound the identification of the SOH of interest, and thus affect the accurate annotate the corresponding cell subpopulations. In this paper, we develop B-Lightning, a novel and robust method designed to identify marker genes and cell subpopulations corresponding to an SOH (e.g. cell activation status), isolating it from other SOH (e.g. cell type, cell cycle phase). B-Lightning uses an iterative approach to enrich a small set of trustworthy marker genes to more reliable marker genes and boost the signals of the SOH of interest. Multiple numerical and experimental studies showed that B-Lightning outperforms existing methods in terms of sensitivity and robustness in identifying marker genes. Moreover, it increases the power to differentiate cell subpopulations of interest from other heterogeneous cohorts. B-Lightning successfully identified new senescence markers in ciliated cells from human idiopathic pulmonary fibrosis lung tissues, new T-cell memory and effector markers in the context of SARS-COV-2 infections, and their synchronized patterns that were previously neglected, new AD markers that can better differentiate AD severity, and new dendritic cell functioning markers with differential transcriptomics profiles across breast cancer subtypes. This paper highlights B-Lightning\u2019s potential as a powerful tool for single-cell data analysis, particularly in complex data sets where SOH of interest are entangled with numerous nuisance factors.<\/jats:p>","DOI":"10.1093\/bib\/bbaf033","type":"journal-article","created":{"date-parts":[[2025,2,4]],"date-time":"2025-02-04T04:23:44Z","timestamp":1738643024000},"source":"Crossref","is-referenced-by-count":0,"title":["B-Lightning: using bait genes for marker gene hunting in single-cell data with complex heterogeneity"],"prefix":"10.1093","volume":"26","author":[{"given":"Yiren","family":"Shao","sequence":"first","affiliation":[{"name":"Department of Data Science, Dana-Farber Cancer Institute , Boston, MA 02215 ,","place":["United States"]}]},{"given":"Qi","family":"Gao","sequence":"additional","affiliation":[{"name":"Department of Computational Medicine and Bioinformatics, University of Michigan , Ann Arbor, MI 48104 ,","place":["United States"]}]},{"given":"Liuyang","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Molecular Genetics and Microbiology, Duke University , Durham, NC 27708 ,","place":["United States"]}]},{"given":"Dongmei","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Clinical and Translational Research, Unversity of Rochester Medical Center , Rochester, NY 14642 ,","place":["United States"]}]},{"given":"Andrew B","family":"Nixon","sequence":"additional","affiliation":[{"name":"Department of Medicine, Duke University , Durham, NC 27708 ,","place":["United States"]}]},{"given":"Cliburn","family":"Chan","sequence":"additional","affiliation":[{"name":"Department of Biostatistics and Bioinformatics, Duke University , Durham, NC 27708 ,","place":["United States"]},{"name":"Center for Human Systems Immunology, Duke University , Durham, NC 27708 ,","place":["United States"]}]},{"given":"Qi-Jing","family":"Li","sequence":"additional","affiliation":[{"name":"Institute of Molecular and Cell Biology, Agency for Science, Technology and Research , 138673 ,","place":["Singapore"]},{"name":"Singapore Immunology Network, Agency for Science, Technology and Research , 138648 ,","place":["Singapore"]}]},{"given":"Jichun","family":"Xie","sequence":"additional","affiliation":[{"name":"Department of Biostatistics and Bioinformatics, Duke University , Durham, NC 27708 ,","place":["United States"]},{"name":"Center for Human Systems Immunology, Duke University , Durham, NC 27708 ,","place":["United States"]},{"name":"Department of Mathematics, Duke University , Durham, NC 27708 ,","place":["United States"]}]}],"member":"286","published-online":{"date-parts":[[2025,2,10]]},"reference":[{"key":"2025021013271509800_ref1","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1038\/nbt.4096","article-title":"Integrating single-cell transcriptomic data across different conditions, technologies, and species","volume":"36","author":"Butler","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2025021013271509800_ref2","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1186\/s13059-017-1382-0","article-title":"SCANPY: large-scale single-cell gene expression data analysis","volume":"19","author":"Alexander Wolf","year":"2018","journal-title":"Genome Biol"},{"key":"2025021013271509800_ref3","doi-asserted-by":"publisher","first-page":"609","DOI":"10.1534\/genetics.107.074609","article-title":"Naive application of permutation testing leads to inflated type I error rates","volume":"178","author":"Churchill","year":"2008","journal-title":"Genetics"},{"key":"2025021013271509800_ref4","doi-asserted-by":"publisher","first-page":"535","DOI":"10.1038\/nn.2303","article-title":"Circular analysis in systems neuroscience: the dangers of double dipping","volume":"12","author":"Nikolaus Kriegeskorte","year":"2009","journal-title":"Nat Neurosci"},{"key":"2025021013271509800_ref5","doi-asserted-by":"publisher","first-page":"18943","DOI":"10.1073\/pnas.1820340116","article-title":"Genefishing to reconstruct context specific portraits of biological processes","volume":"116","author":"Liu","year":"2019","journal-title":"Proc Natl Acad Sci USA"},{"key":"2025021013271509800_ref6","doi-asserted-by":"publisher","first-page":"e46","DOI":"10.1093\/nar\/gkae307","article-title":"SifiNet: a robust and accurate method to identify feature gene sets and annotate cells","volume":"52","author":"Gao","year":"2024","journal-title":"Nucleic Acids Res"},{"key":"2025021013271509800_ref7","doi-asserted-by":"publisher","first-page":"278","DOI":"10.1186\/s13059-015-0844-5","article-title":"MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data","volume":"16","author":"Finak","year":"2015","journal-title":"Genome Biol"},{"key":"2025021013271509800_ref8","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J R Stat Soc B Methodol"},{"key":"2025021013271509800_ref9","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1186\/1471-2105-14-7","article-title":"GSVA: gene set variation analysis for microarray and RNA-seq data","volume":"14","author":"H\u00e4nzelmann","year":"2013","journal-title":"BMC Bioinform"},{"volume-title":"Genescape: Simulation of Single Cell RNA-seq Data with Complex Structure","year":"2023","author":"Gao","key":"2025021013271509800_ref10"},{"key":"2025021013271509800_ref11","doi-asserted-by":"publisher","first-page":"eaba1972","DOI":"10.1126\/sciadv.aba1972","article-title":"Single-cell RNA sequencing reveals profibrotic roles of distinct epithelial and mesenchymal lineages in pulmonary fibrosis","volume":"6","author":"Habermann","year":"2020","journal-title":"Sci Adv"},{"key":"2025021013271509800_ref12","doi-asserted-by":"publisher","first-page":"4827","DOI":"10.1038\/s41467-022-32552-1","article-title":"A new gene set identifies senescent cells and predicts senescence-associated pathways across tissues","volume":"13","author":"Saul","year":"2022","journal-title":"Nat Commun"},{"key":"2025021013271509800_ref13","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1186\/s13024-021-00507-7","article-title":"The landscape of human tissue and cell type specific expression and co-regulation of senescence genes","volume":"17","author":"Peng","year":"2022","journal-title":"Mol Neurodegener"},{"key":"2025021013271509800_ref14","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1016\/j.tcb.2022.04.011","article-title":"The heterogeneity of cellular senescence: insights at the single-cell level","volume":"33","author":"Cohn","year":"2023","journal-title":"Trends Cell Biol"},{"key":"2025021013271509800_ref15","doi-asserted-by":"publisher","first-page":"797","DOI":"10.1016\/j.immuni.2021.03.005","article-title":"Longitudinal profiling of respiratory and systemic immune responses reveals myeloid cell-driven lung inflammation in severe COVID-19","volume":"54","author":"Szabo","year":"2021","journal-title":"Immunity"},{"key":"2025021013271509800_ref16","doi-asserted-by":"publisher","first-page":"276","DOI":"10.1038\/s41593-020-00764-7","article-title":"Molecular characterization of selectively vulnerable neurons in Alzheimer\u2019s disease","volume":"24","author":"Leng","year":"2021","journal-title":"Nat Neurosci"},{"key":"2025021013271509800_ref17","doi-asserted-by":"publisher","first-page":"5694","DOI":"10.1038\/s41467-024-49916-4","article-title":"Single-cell resolution characterization of myeloid-derived cell states with implication in cancer outcome","volume":"15","author":"Guimar\u00e3es","year":"2024","journal-title":"Nat Commun"},{"key":"2025021013271509800_ref18","doi-asserted-by":"publisher","first-page":"6823","DOI":"10.1038\/s41467-022-34581-2","article-title":"Single cell profiling of primary and paired metastatic lymph node tumors in breast cancer patients","volume":"13","author":"Liu","year":"2022","journal-title":"Nat Commun"},{"key":"2025021013271509800_ref19","doi-asserted-by":"publisher","first-page":"1334","DOI":"10.1038\/s41588-021-00911-1","article-title":"A single-cell and spatially resolved atlas of human breast cancers","volume":"53","author":"Wu","year":"2021","journal-title":"Nat Genet"},{"key":"2025021013271509800_ref20","doi-asserted-by":"publisher","first-page":"807","DOI":"10.1183\/09031936.00186914","article-title":"Hallmarks of the ageing lung","volume":"45","author":"Meiners","year":"2015","journal-title":"Eur Respir J"},{"key":"2025021013271509800_ref21","doi-asserted-by":"publisher","first-page":"6309","DOI":"10.1038\/s41467-021-26603-2","article-title":"Molecular programs of fibrotic change in aging human lung","volume":"12","author":"Lee","year":"2021","journal-title":"Nat Commun"},{"key":"2025021013271509800_ref22","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1155\/2022\/5578284","article-title":"COVID-19 impact on public health, environment, human psychology, global socioeconomy, and education","volume":"2022","author":"Miyah","year":"2022","journal-title":"Sci World J"},{"key":"2025021013271509800_ref23","doi-asserted-by":"publisher","first-page":"e0245532","DOI":"10.1371\/journal.pone.0245532","article-title":"T cell response to SARS-CoV-2 infection in humans: a systematic review","volume":"16","author":"Shrotri","year":"2021","journal-title":"PloS One"},{"key":"2025021013271509800_ref24","doi-asserted-by":"publisher","first-page":"295","DOI":"10.1007\/s00251-023-01294-9","article-title":"Role of T cells in severe COVID-19 disease, protection, and long term immunity","volume":"75","author":"Hermens","year":"2023","journal-title":"Immunogenetics"},{"key":"2025021013271509800_ref25","doi-asserted-by":"publisher","first-page":"1336","DOI":"10.1038\/s41590-020-0782-6","article-title":"Broad and strong memory CD4+ and CD8+ T cells induced by SARS-CoV-2 in UK convalescent individuals following COVID-19","volume":"21","author":"Peng","year":"2020","journal-title":"Nat Immunol"},{"key":"2025021013271509800_ref26","doi-asserted-by":"publisher","first-page":"5742","DOI":"10.1158\/1078-0432.CCR-21-0206","article-title":"CD4 T-cell exhaustion: does it exist and what are its roles in cancer?","volume":"27","author":"Miggelbrink","year":"2021","journal-title":"Clin Cancer Res"},{"key":"2025021013271509800_ref27","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.ppat.1005177","article-title":"CD39 expression identifies terminally exhausted CD8+ T cells","volume":"11","author":"Gupta","year":"2015","journal-title":"PLoS Pathog"},{"key":"2025021013271509800_ref28","doi-asserted-by":"publisher","first-page":"827","DOI":"10.3389\/fimmu.2020.00827","article-title":"Reduction and functional exhaustion of T cells in patients with coronavirus disease 2019 (COVID-19)","volume":"11","author":"Diao","year":"2020","journal-title":"Front Immunol"},{"key":"2025021013271509800_ref29","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1007\/978-981-32-9721-0_3","article-title":"Genetic markers of Alzheimer\u2019s disease","volume":"1192","author":"Perkovic","year":"2019","journal-title":"Adv Exp Med Biol"},{"key":"2025021013271509800_ref30","doi-asserted-by":"publisher","first-page":"271","DOI":"10.1016\/0197-4580(95)00021-6","article-title":"Staging of Alzheimer\u2019s disease-related neurofibrillary changes","volume":"16","author":"Braak","year":"1995","journal-title":"Neurobiol Aging"},{"key":"2025021013271509800_ref31","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1186\/alzrt214","article-title":"Propagation of tau pathology in Alzheimer\u2019s disease: identification of novel therapeutic targets","volume":"5","author":"Pooler","year":"2013","journal-title":"Alzheimers Res Ther"},{"key":"2025021013271509800_ref32","doi-asserted-by":"publisher","first-page":"eaah4573","DOI":"10.1126\/science.aah4573","article-title":"Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors","volume":"356","author":"Villani","year":"2017","journal-title":"Science"},{"key":"2025021013271509800_ref33","doi-asserted-by":"publisher","first-page":"e1512942","DOI":"10.1080\/2162402X.2018.1512942","article-title":"HER2 signaling regulates the tumor immune microenvironment and trastuzumab efficacy","volume":"8","author":"Triulzi","year":"2019","journal-title":"Onco Targets Ther"},{"key":"2025021013271509800_ref34","doi-asserted-by":"publisher","first-page":"190","DOI":"10.1261\/rna.076422.120","article-title":"Cold-inducible RNA binding protein promotes breast cancer cell malignancy by regulating cystatin c levels","volume":"27","author":"Indacochea","year":"2021","journal-title":"RNA"},{"key":"2025021013271509800_ref35","doi-asserted-by":"publisher","first-page":"3725","DOI":"10.3390\/cancers13153725","article-title":"Endogenous and therapeutic estrogens: maestro conductors of the microenvironment of ER+ breast cancers","volume":"13","author":"Schuler","year":"2021","journal-title":"Cancers (Basel)"},{"key":"2025021013271509800_ref36","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1155\/2020\/5618786","article-title":"Inflammation is associated with worse outcome in the whole cohort but with better outcome in triple-negative subtype of breast cancer patients","volume":"2020","author":"Oshi","year":"2020","journal-title":"J Immunol Res"},{"key":"2025021013271509800_ref37","doi-asserted-by":"publisher","first-page":"3694","DOI":"10.1038\/s41598-024-53999-w","article-title":"An inflamed tumor cell subpopulation promotes chemotherapy resistance in triple negative breast cancer","volume":"14","author":"Jacobo","year":"2024","journal-title":"Sci Rep"},{"key":"2025021013271509800_ref38","doi-asserted-by":"publisher","first-page":"lqad024","DOI":"10.1093\/nargab\/lqad024","article-title":"Single-cell gene set enrichment analysis and transfer learning for functional annotation of scrna-seq data","volume":"5","author":"Franchini","year":"2023","journal-title":"NAR Genom Bioinform"},{"key":"2025021013271509800_ref39","doi-asserted-by":"publisher","first-page":"1083","DOI":"10.1038\/nmeth.4463","article-title":"SCENIC: Single-Cell rEgulatory Network Inference and Clustering","volume":"14","author":"Aibar","year":"2017","journal-title":"Nat Methods"},{"key":"2025021013271509800_ref40","doi-asserted-by":"publisher","first-page":"eaba1972","DOI":"10.1126\/sciadv.aba1972","article-title":"Single-cell RNA sequencing reveals profibrotic roles of distinct epithelial and mesenchymal lineages in pulmonary fibrosis","volume":"6","author":"Habermann","year":"2020","journal-title":"Sci Adv"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/1\/bbaf033\/61817483\/bbaf033.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/1\/bbaf033\/61817483\/bbaf033.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,10]],"date-time":"2025-02-10T13:27:37Z","timestamp":1739194057000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaf033\/8006246"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,22]]},"references-count":40,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,11,22]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaf033","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"type":"print","value":"1467-5463"},{"type":"electronic","value":"1477-4054"}],"subject":[],"published-other":{"date-parts":[[2025,1]]},"published":{"date-parts":[[2024,11,22]]},"article-number":"bbaf033"}}