{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:35Z","timestamp":1772138075626,"version":"3.50.1"},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"9","license":[{"start":{"date-parts":[[2025,8,26]],"date-time":"2025-08-26T00:00:00Z","timestamp":1756166400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"European Union\u2019s Horizon 2020 research and innovation program under"},{"name":"German Federal Ministry of Education and Research","award":["031L0173"],"award-info":[{"award-number":["031L0173"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>The cellular composition of a solid tissue can be assessed either through the physical dissociation of the tissue followed by single-cell analysis techniques or by computational deconvolution of bulk gene expression profiles. However, both approaches are prone to significant biases. Tissue dissociation often results in disproportionate cell loss, while deconvolution is hindered by biological and technological inconsistencies between the datasets it relies on.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Using calibration datasets that include both experimentally measured and deconvolution-based cell compositions, we present a new method, Harp, which reconciles these approaches to produce more reliable deconvolution results in applications where only gene expression data is available. Both on simulated and real data, harmonizing cell reference profiles proved advantageous over competing state-of-the-art deconvolution tools, overcoming technological and biological batch effects.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>R package available at https:\/\/github.com\/spang-lab\/harp (archived as 10.5281\/zenodo.16851930). Code and data for reproducing the results of this paper are available at https:\/\/github.com\/spang-lab\/harplication (archived as 10.5281\/zenodo.16851705) and https:\/\/doi.org\/10.5281\/zenodo.15650057, respectively.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf455","type":"journal-article","created":{"date-parts":[[2025,8,25]],"date-time":"2025-08-25T11:23:08Z","timestamp":1756120988000},"source":"Crossref","is-referenced-by-count":0,"title":["Harp: data harmonization for computational tissue deconvolution across diverse transcriptomics platforms"],"prefix":"10.1093","volume":"41","author":[{"ORCID":"https:\/\/orcid.org\/0009-0009-4207-8256","authenticated-orcid":false,"given":"Zahra","family":"Nozari","sequence":"first","affiliation":[{"name":"Institute for Statistical Bioinformatics, Faculty of Informatics and Data Science, University of Regensburg , 93053 Regensburg,","place":["Germany"]}]},{"given":"Paul","family":"H\u00fcttl","sequence":"additional","affiliation":[{"name":"Institute for Statistical Bioinformatics, Faculty of Informatics and Data Science, University of Regensburg , 93053 Regensburg,","place":["Germany"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7832-2582","authenticated-orcid":false,"given":"Jakob","family":"Simeth","sequence":"additional","affiliation":[{"name":"Institute for Statistical Bioinformatics, Faculty of Informatics and Data Science, University of Regensburg , 93053 Regensburg,","place":["Germany"]},{"name":"NGS and Data Technologies Core, Leibniz Institute for Immunotherapy (LIT), c\/o Universit\u00e4tsklinikum Regensburg , 93053 Regensburg,","place":["Germany"]}]},{"given":"Marian","family":"Sch\u00f6n","sequence":"additional","affiliation":[{"name":"Institute for Statistical Bioinformatics, Faculty of Informatics and Data Science, University of Regensburg , 93053 Regensburg,","place":["Germany"]}]},{"given":"James A","family":"Hutchinson","sequence":"additional","affiliation":[{"name":"Department of Surgery, Universit\u00e4tsklinikum Regensburg , 93053 Regensburg,","place":["Germany"]}]},{"given":"Rainer","family":"Spang","sequence":"additional","affiliation":[{"name":"Institute for Statistical Bioinformatics, Faculty of Informatics and Data Science, University of Regensburg , 93053 Regensburg,","place":["Germany"]}]}],"member":"286","published-online":{"date-parts":[[2025,8,26]]},"reference":[{"key":"2025091918574856500_btaf455-B1","doi-asserted-by":"publisher","first-page":"1969","DOI":"10.1093\/bioinformatics\/bty019","article-title":"Computational deconvolution of transcriptomics data from mixed cell populations","volume":"34","author":"Avila Cobos","year":"2018","journal-title":"Bioinformatics"},{"key":"2025091918574856500_btaf455-B2","doi-asserted-by":"publisher","first-page":"3189","DOI":"10.1038\/s41598-025-87256-5","article-title":"Characterizing the omics landscape based on 10,000+ datasets","volume":"15","author":"Brombacher","year":"2025","journal-title":"Sci Rep"},{"key":"2025091918574856500_btaf455-B3","doi-asserted-by":"publisher","first-page":"502","DOI":"10.1038\/nrrheum.2011.110","article-title":"Screening: CyTOF-the next generation of cell detection","volume":"7","author":"Cheung","year":"2011","journal-title":"Nat Rev Rheumatol"},{"key":"2025091918574856500_btaf455-B4","doi-asserted-by":"publisher","first-page":"505","DOI":"10.1038\/s43018-022-00356-3","article-title":"Cell type and gene expression deconvolution with Bayesprism enables Bayesian integrative analysis across bulk and single-cell RNA sequencing in oncology","volume":"3","author":"Chu","year":"2022","journal-title":"Nat Cancer"},{"key":"2025091918574856500_btaf455-B5","doi-asserted-by":"publisher","first-page":"1018","DOI":"10.1016\/j.cub.2006.03.092","article-title":"Transcriptional pulsing of a developmental gene","volume":"16","author":"Chubb","year":"2006","journal-title":"Curr Biol"},{"key":"2025091918574856500_btaf455-B6","doi-asserted-by":"publisher","first-page":"130","DOI":"10.1186\/s13059-020-02048-6","article-title":"Systematic assessment of tissue dissociation and storage biases in single-cell and single-nucleus RNA-seq workflows","volume":"21","author":"Denisenko","year":"2020","journal-title":"Genome Biol"},{"key":"2025091918574856500_btaf455-B7","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1038\/s41592-023-02166-6","article-title":"Challenges and perspectives in computational deconvolution of genomics data","volume":"21","author":"Garmire","year":"2024","journal-title":"Nat Methods"},{"key":"2025091918574856500_btaf455-B8","doi-asserted-by":"publisher","first-page":"342","DOI":"10.1089\/cmb.2019.0462","article-title":"Loss-function learning for digital tissue deconvolution","volume":"27","author":"G\u00f6rtler","year":"2020","journal-title":"J Comput Biol"},{"key":"2025091918574856500_btaf455-B9","doi-asserted-by":"publisher","first-page":"i100","DOI":"10.1093\/bioinformatics\/btae263","article-title":"Adaptive digital tissue deconvolution","volume":"40","author":"G\u00f6rtler","year":"2024","journal-title":"Bioinformatics"},{"key":"2025091918574856500_btaf455-B10","doi-asserted-by":"publisher","first-page":"75","DOI":"10.1186\/s13073-017-0467-4","article-title":"A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications","volume":"9","author":"Haque","year":"2017","journal-title":"Genome Med"},{"key":"2025091918574856500_btaf455-B8511773","doi-asserted-by":"crossref","DOI":"10.1038\/s43586-024-00363-x","article-title":"Uniform manifold approximation and projection","volume":"4","author":"Healy","year":"2024","journal-title":"Nat Rev Methods Primers"},{"key":"2025091918574856500_btaf455-B11","doi-asserted-by":"publisher","first-page":"116","DOI":"10.3389\/fcell.2016.00116","article-title":"Single cell isolation and analysis","volume":"4","author":"Hu","year":"2016","journal-title":"Front Cell Dev Biol"},{"key":"2025091918574856500_btaf455-B12","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1093\/biostatistics\/kxj037","article-title":"Adjusting batch effects in microarray expression data using empirical bayes methods","volume":"8","author":"Johnson","year":"2007","journal-title":"Biostatistics"},{"key":"2025091918574856500_btaf455-B13","doi-asserted-by":"publisher","first-page":"52","DOI":"10.4132\/jptm.2022.12.19","article-title":"Perspectives on single-nucleus RNA sequencing in different cell types and tissues","volume":"57","author":"Kim","year":"2023","journal-title":"J Pathol Transl Med"},{"key":"2025091918574856500_btaf455-B15","doi-asserted-by":"publisher","first-page":"1627","DOI":"10.1016\/j.celrep.2019.01.041","article-title":"RNA-seq signatures normalized by mRNA abundance allow absolute deconvolution of human immune cell types","volume":"26","author":"Monaco","year":"2019","journal-title":"Cell Rep"},{"key":"2025091918574856500_btaf455-B16","doi-asserted-by":"publisher","first-page":"453","DOI":"10.1038\/nmeth.3337","article-title":"Robust enumeration of cell subsets from tissue expression profiles","volume":"12","author":"Newman","year":"2015","journal-title":"Nat Methods"},{"key":"2025091918574856500_btaf455-B17","doi-asserted-by":"publisher","first-page":"773","DOI":"10.1038\/s41587-019-0114-2","article-title":"Determining cell type abundance and expression from bulk tissues with digital cytometry","volume":"37","author":"Newman","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2025091918574856500_btaf455-B18","doi-asserted-by":"publisher","first-page":"e26476","DOI":"10.7554\/eLife.26476","article-title":"Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data","volume":"6","author":"Racle","year":"2017","journal-title":"eLife"},{"key":"2025091918574856500_btaf455-B19","doi-asserted-by":"publisher","first-page":"896","DOI":"10.1038\/s41556-020-0532-x","article-title":"Dissecting intratumour heterogeneity of nodal b-cell lymphomas at the transcriptional, genetic and drug-response levels","volume":"22","author":"Roider","year":"2020","journal-title":"Nat Cell Biol"},{"key":"2025091918574856500_btaf455-B20","doi-asserted-by":"publisher","first-page":"e10717","DOI":"10.7717\/peerj.10717","article-title":"Effective ribosomal RNA depletion for single-cell total RNA-seq by SCDASH","volume":"9","author":"Shek","year":"2021","journal-title":"PeerJ"},{"key":"2025091918574856500_btaf455-B21","doi-asserted-by":"publisher","first-page":"btae709","DOI":"10.1093\/bioinformatics\/btae709","article-title":"Virtual tissue expression analysis","volume":"40","author":"Simeth","year":"2024","journal-title":"Bioinformatics"},{"key":"2025091918574856500_btaf455-B22","doi-asserted-by":"publisher","first-page":"1422","DOI":"10.1016\/j.ccell.2021.08.011","article-title":"The landscape of tumor cell states and ecosystems in diffuse large b cell lymphoma","volume":"39","author":"Steen","year":"2021","journal-title":"Cancer Cell"},{"key":"2025091918574856500_btaf455-B23","doi-asserted-by":"publisher","first-page":"380","DOI":"10.1038\/s41467-018-08023-x","article-title":"Bulk tissue cell type deconvolution with multi-subject single-cell expression reference","volume":"10","author":"Wang","year":"2019","journal-title":"Nat Commun"},{"key":"2025091918574856500_btaf455-B24","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1038\/nmeth.2694","article-title":"Quantitative assessment of single-cell RNA-sequencing methods","volume":"11","author":"Wu","year":"2014","journal-title":"Nat Methods"},{"key":"2025091918574856500_btaf455-B25","doi-asserted-by":"publisher","first-page":"14049","DOI":"10.1038\/ncomms14049","article-title":"Massively parallel digital transcriptional profiling of single cells","volume":"8","author":"Zheng","year":"2017","journal-title":"Nat Commun"},{"key":"2025091918574856500_btaf455-B26","doi-asserted-by":"publisher","first-page":"e0152034","DOI":"10.1371\/journal.pone.0152034","article-title":"System-wide associations between DNA-methylation, gene expression, and humoral immune response to influenza vaccination","volume":"11","author":"Zimmermann","year":"2016","journal-title":"PLoS One"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf455\/64140923\/btaf455.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/9\/btaf455\/64140923\/btaf455.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/9\/btaf455\/64140923\/btaf455.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T22:57:52Z","timestamp":1758322672000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf455\/8241958"}},"subtitle":[],"editor":[{"given":"Macha","family":"Nikolski","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,8,26]]},"references-count":26,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2025,9,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf455","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2025.02.26.640330","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,9]]},"published":{"date-parts":[[2025,8,26]]},"article-number":"btaf455"}}