{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,2]],"date-time":"2025-10-02T10:48:17Z","timestamp":1759402097304,"version":"build-2065373602"},"reference-count":51,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,10,2]],"date-time":"2025-10-02T00:00:00Z","timestamp":1759363200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,10,2]],"date-time":"2025-10-02T00:00:00Z","timestamp":1759363200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["U54CA217378","T32GM136624","P30AR075047","U54CA217378","U54CA217378"],"award-info":[{"award-number":["U54CA217378","T32GM136624","P30AR075047","U54CA217378","U54CA217378"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>The high dimensionality of data in single cell transcriptomics (scRNAseq) requires investigators to choose subsets of genes (\u201cfeature selection\u201d) for downstream analysis (e.g., unsupervised cell clustering). The evaluation of different approaches to feature selection is hampered by the fact that, as we show here, the difficulty of feature selection can vary greatly, depending on the dataset being analyzed.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>For routine cell type identification, even randomly chosen features can perform well, but for cell type differences that are subtle, both number of features and selection strategy matter strongly. We present a simple feature selection method grounded in an analytical model that allows for interpretable delineation of how many and which features to choose, facilitating identification of biologically meaningful rare cell types. We compare this method to default methods in scanpy and Seurat, as well as SCTransform, showing how greater accuracy can often be achieved with surprisingly few, well-chosen features.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>Feature selection is a critical step in scRNAseq for downstream analyses. We explore the pitfalls that can arise from incautious feature selection and present a statistical method to facilitate improved outcomes.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/s12859-025-06240-y","type":"journal-article","created":{"date-parts":[[2025,10,2]],"date-time":"2025-10-02T10:00:28Z","timestamp":1759399228000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Statistically principled feature selection for single cell transcriptomics"],"prefix":"10.1186","volume":"26","author":[{"given":"Emmanuel P.","family":"Dollinger","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kai","family":"Silkwood","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Scott","family":"Atwood","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qing","family":"Nie","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Arthur D.","family":"Lander","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,10,2]]},"reference":[{"issue":"7","key":"6240_CR1","doi-asserted-by":"publisher","first-page":"995","DOI":"10.3390\/e24070995","volume":"24","author":"S Das","year":"2022","unstructured":"Das S, Rai A, Rai SN. Differential expression analysis of single-cell RNA-Seq data: current statistical approaches and outstanding challenges. Entropy (Basel). 2022;24(7):995.","journal-title":"Entropy (Basel)"},{"issue":"5","key":"6240_CR2","doi-asserted-by":"publisher","first-page":"bbac286","DOI":"10.1093\/bib\/bbac286","volume":"23","author":"S Junttila","year":"2022","unstructured":"Junttila S, Smolander J, Elo LL. Benchmarking methods for detecting differential States between conditions from multi-subject single-cell RNA-seq data. Brief Bioinform. 2022;23(5):bbac286.","journal-title":"Brief Bioinform"},{"issue":"3","key":"6240_CR3","doi-asserted-by":"publisher","first-page":"bbaa190","DOI":"10.1093\/bib\/bbaa190","volume":"22","author":"H Nguyen","year":"2021","unstructured":"Nguyen H, Tran D, Tran B, Pehlivan B, Nguyen T. A comprehensive survey of regulatory network inference methods using single cell RNA sequencing data. Brief Bioinform. 2021;22(3):bbaa190.","journal-title":"Brief Bioinform"},{"key":"6240_CR4","doi-asserted-by":"crossref","unstructured":"Simmons S. Cell type composition analysis: comparison of statistical methods. bioRxiv. 2022:2022.02.04.479123.","DOI":"10.1101\/2022.02.04.479123"},{"issue":"2","key":"6240_CR5","doi-asserted-by":"publisher","first-page":"dev179788","DOI":"10.1242\/dev.179788","volume":"147","author":"PPL Tam","year":"2020","unstructured":"Tam PPL, Ho JWK. Cellular diversity and lineage trajectory: insights from mouse single cell transcriptomes. Development. 2020;147(2):dev179788.","journal-title":"Development"},{"key":"6240_CR6","doi-asserted-by":"crossref","unstructured":"Tritschler S, B\u00fcttner M, Fischer DS, Lange M, Bergen V, Lickert H et al. Concepts and limitations for learning developmental trajectories from single cell genomics. Development. 2019;146(12):dev170506.","DOI":"10.1242\/dev.170506"},{"key":"6240_CR7","doi-asserted-by":"publisher","first-page":"5874","DOI":"10.1016\/j.csbj.2021.10.027","volume":"19","author":"B Xie","year":"2021","unstructured":"Xie B, Jiang Q, Mora A, Li X. Automatic cell type identification methods for single-cell RNA sequencing. Comput Struct Biotechnol J. 2021;19:5874\u201387.","journal-title":"Comput Struct Biotechnol J"},{"issue":"8","key":"6240_CR8","doi-asserted-by":"publisher","first-page":"550","DOI":"10.1038\/s41576-023-00586-w","volume":"24","author":"L Heumos","year":"2023","unstructured":"Heumos L, Schaar AC, Lance C, Litinetskaya A, Drost F, Zappia L, et al. Best practices for single-cell analysis across modalities. Nat Rev Genet. 2023;24(8):550\u201372.","journal-title":"Nat Rev Genet"},{"key":"6240_CR9","doi-asserted-by":"crossref","unstructured":"Li J, Cheng K, Wang S, Morstatter F, Trevino RP, Tang J et al. Feature selection: A data perspective. ACM Comput Surv. 2017;50(6):94.","DOI":"10.1145\/3136625"},{"issue":"1","key":"6240_CR10","doi-asserted-by":"publisher","first-page":"144","DOI":"10.1186\/s13059-016-1010-4","volume":"17","author":"L Jiang","year":"2016","unstructured":"Jiang L, Chen H, Pinello L, Yuan GC. GiniClust: detecting rare cell types from single-cell gene expression data with Gini index. Genome Biol. 2016;17(1):144.","journal-title":"Genome Biol"},{"key":"6240_CR11","doi-asserted-by":"crossref","unstructured":"Brennecke P, Anders S, Kim JK, Kolodziejczyk AA, Zhang X, Proserpio V et al. Accounting for technical noise in single-cell RNA-seq experiments. Nat Methods. 2013;10(11):1093\u20135.","DOI":"10.1038\/nmeth.2645"},{"issue":"16","key":"6240_CR12","doi-asserted-by":"publisher","first-page":"2865","DOI":"10.1093\/bioinformatics\/bty1044","volume":"35","author":"TS Andrews","year":"2018","unstructured":"Andrews TS, Hemberg M. M3Drop: dropout-based feature selection for ScRNASeq. Bioinformatics. 2018;35(16):2865\u20137.","journal-title":"Bioinformatics"},{"key":"6240_CR13","doi-asserted-by":"crossref","unstructured":"Su K, Yu T, Wu H. Accurate feature selection improves single-cell RNA-seq cell clustering. Brief Bioinf. 2021;22(5).","DOI":"10.1093\/bib\/bbab034"},{"issue":"1","key":"6240_CR14","doi-asserted-by":"publisher","first-page":"295","DOI":"10.1186\/s13059-019-1861-6","volume":"20","author":"FW Townes","year":"2019","unstructured":"Townes FW, Hicks SC, Aryee MJ, Irizarry RA. Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model. Genome Biol. 2019;20(1):295.","journal-title":"Genome Biol"},{"key":"6240_CR15","doi-asserted-by":"crossref","unstructured":"Tyler SR, Lozano-Ojalvo D, Guccione E, Schadt EE. Anti-correlated feature selection prevents false discovery of subpopulations in ScRNAseq. Nat Commun. 2024;15:699.","DOI":"10.1038\/s41467-023-43406-9"},{"issue":"2","key":"6240_CR16","doi-asserted-by":"publisher","first-page":"bbab517","DOI":"10.1093\/bib\/bbab517","volume":"23","author":"S Lall","year":"2022","unstructured":"Lall S, Ghosh A, Ray S, Bandyopadhyay S. sc-REnF: an entropy guided robust feature selection for single-cell RNA-seq data. Brief Bioinform. 2022;23(2):bbab517.","journal-title":"Brief Bioinform"},{"key":"6240_CR17","doi-asserted-by":"crossref","unstructured":"Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck WM et al. comprehensive integration of single-cell data. Cell. 2019;177(7):1888\u2013902.","DOI":"10.1016\/j.cell.2019.05.031"},{"issue":"5","key":"6240_CR18","doi-asserted-by":"publisher","first-page":"495","DOI":"10.1038\/nbt.3192","volume":"33","author":"R Satija","year":"2015","unstructured":"Satija R, Farrell JA, Gennert D, Schier AF, Regev A. Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. 2015;33(5):495\u2013502.","journal-title":"Nat Biotechnol"},{"issue":"1","key":"6240_CR19","doi-asserted-by":"publisher","first-page":"296","DOI":"10.1186\/s13059-019-1874-1","volume":"20","author":"C Hafemeister","year":"2019","unstructured":"Hafemeister C, Satija R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 2019;20(1):296.","journal-title":"Genome Biol"},{"issue":"1","key":"6240_CR20","doi-asserted-by":"publisher","first-page":"227","DOI":"10.1186\/s13059-020-02136-7","volume":"21","author":"PL Germain","year":"2020","unstructured":"Germain PL, Sonrel A, Robinson MD. pipeComp, a general framework for the evaluation of computational pipelines, reveals performant single cell RNA-seq preprocessing tools. Genome Biol. 2020;21(1):227.","journal-title":"Genome Biol"},{"key":"6240_CR21","doi-asserted-by":"crossref","unstructured":"Zhao R, Lu J, Zhou W, Zhao N, Ji H. A systematic evaluation of highly variable gene selection methods for single-cell RNA-sequencing. BioRxiv. 2024.","DOI":"10.1101\/2024.08.25.608519"},{"issue":"1","key":"6240_CR22","doi-asserted-by":"publisher","first-page":"305","DOI":"10.1186\/s12859-024-05926-z","volume":"25","author":"K Silkwood","year":"2024","unstructured":"Silkwood K, Dollinger E, Gervin J, Atwood S, Nie Q, Lander AD. Leveraging gene correlations in single cell transcriptomic data. BMC Bioinf. 2024;25(1):305.","journal-title":"BMC Bioinformatics"},{"key":"6240_CR23","doi-asserted-by":"crossref","unstructured":"Hao Y, Hao S, Andersen-Nissen E, Mauck WM, Zheng S, Butler A et al. 2021 integrated analysis of multimodal single-cell data. Cell 184(13):3573\u201387.","DOI":"10.1016\/j.cell.2021.04.048"},{"issue":"1","key":"6240_CR24","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1186\/s13059-017-1382-0","volume":"19","author":"FA Wolf","year":"2018","unstructured":"Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018;19(1):15.","journal-title":"Genome Biol"},{"issue":"1","key":"6240_CR25","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1186\/s13059-021-02584-9","volume":"23","author":"S Choudhary","year":"2022","unstructured":"Choudhary S, Satija R. Comparison and evaluation of statistical error models for scRNA-seq. Genome Biol. 2022;23(1):27.","journal-title":"Genome Biol"},{"issue":"4","key":"6240_CR26","doi-asserted-by":"publisher","first-page":"239","DOI":"10.1016\/j.cels.2016.04.001","volume":"2","author":"G Heimberg","year":"2016","unstructured":"Heimberg G, Bhatnagar R, El-Samad H, Thomson M. Low dimensionality in gene expression data enables the accurate extraction of transcriptional programs from shallow sequencing. Cell Syst. 2016;2(4):239\u201350.","journal-title":"Cell Syst"},{"key":"6240_CR27","doi-asserted-by":"publisher","first-page":"25696","DOI":"10.1038\/srep25696","volume":"6","author":"M Lenz","year":"2016","unstructured":"Lenz M, Muller FJ, Zenke M, Schuppert A. Principal components analysis and the reported low intrinsic dimensionality of gene expression microarray data. Sci Rep. 2016;6:25696.","journal-title":"Sci Rep"},{"issue":"4","key":"6240_CR28","doi-asserted-by":"publisher","first-page":"322","DOI":"10.1038\/nbt0410-322","volume":"28","author":"M Lukk","year":"2010","unstructured":"Lukk M, Kapushesky M, Nikkila J, Parkinson H, Goncalves A, Huber W, et al. A global map of human gene expression. Nat Biotechnol. 2010;28(4):322\u20134.","journal-title":"Nat Biotechnol"},{"issue":"2","key":"6240_CR29","doi-asserted-by":"publisher","first-page":"493","DOI":"10.1016\/j.immuni.2019.01.001","volume":"50","author":"RJ Miragaia","year":"2019","unstructured":"Miragaia RJ, Gomes T, Chomka A, Jardine L, Riedel A, Hegazy AN, et al. Single-Cell transcriptomics of regulatory T cells reveals trajectories of tissue adaptation. Immunity. 2019;50(2):493\u2013504.","journal-title":"Immunity"},{"issue":"1","key":"6240_CR30","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1109\/TIT.1968.1054102","volume":"14","author":"G Hughes","year":"1968","unstructured":"Hughes G. On the mean accuracy of statistical pattern recognizers. IEEE Trans Inf Theory. 1968;14(1):55\u201363.","journal-title":"IEEE Trans Inf Theory"},{"issue":"5","key":"6240_CR31","doi-asserted-by":"publisher","first-page":"363","DOI":"10.1002\/sam.11161","volume":"5","author":"AS Zimek","year":"2012","unstructured":"Zimek AS, Kriegel E, H. P. A survey on unsupervised outlier detection in high-dimensional numerical data. Stat Anal Data Min. 2012;5(5):363\u201387.","journal-title":"Stat Anal Data Min"},{"key":"6240_CR32","doi-asserted-by":"crossref","unstructured":"Menon M, Mohammadi S, Davila-Velderrain J, Goods BA, Cadwell TD, Xing Y et al. 2019 Single-cell transcriptomic atlas of the human retina identifies cell types associated with age-related macular degeneration. Nat Commun. 2019;10(1):4902.","DOI":"10.1038\/s41467-019-12780-8"},{"issue":"1","key":"6240_CR33","doi-asserted-by":"publisher","first-page":"5233","DOI":"10.1038\/s41598-019-41695-z","volume":"9","author":"VA Traag","year":"2019","unstructured":"Traag VA, Waltman L, van Eck NJ. From Louvain to leiden: guaranteeing well-connected communities. Sci Rep. 2019;9(1):5233.","journal-title":"Sci Rep"},{"key":"6240_CR34","doi-asserted-by":"crossref","unstructured":"Bahar Halpern K, Tanami S, Landen S, Chapal M, Szlak L, Hutzler A et al. Bursty gene expression in the intact mammalian liver. Mol Cell. 2015;58(1):147\u201356.","DOI":"10.1016\/j.molcel.2015.01.027"},{"issue":"1","key":"6240_CR35","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1049\/enb.2017.0004","volume":"1","author":"J Beal","year":"2017","unstructured":"Beal J. Biochemical complexity drives log-normal variation in genetic expression. Eng Biology. 2017;1(1):55\u201360.","journal-title":"Eng Biology"},{"issue":"5","key":"6240_CR36","doi-asserted-by":"publisher","first-page":"363","DOI":"10.1002\/sam.11161","volume":"5","author":"A Zimek","year":"2012","unstructured":"Zimek A, Schubert E, Kriegel H-P. A survey on unsupervised outlier detection in high-dimensional numerical data. Stat Anal Data Mining: ASA Data Sci J. 2012;5(5):363\u201387.","journal-title":"Stat Anal Data Mining: ASA Data Sci J"},{"issue":"1","key":"6240_CR37","doi-asserted-by":"publisher","first-page":"258","DOI":"10.1186\/s13059-021-02451-7","volume":"22","author":"J Lause","year":"2021","unstructured":"Lause J, Berens P, Kobak D. Analytic pearson residuals for normalization of single-cell RNA-seq UMI data. Genome Biol. 2021;22(1):258.","journal-title":"Genome Biol"},{"issue":"4","key":"6240_CR38","doi-asserted-by":"publisher","first-page":"307","DOI":"10.2307\/1400905","volume":"5","author":"EA Cornish","year":"1938","unstructured":"Cornish EA, Fisher RA. Moments and cumulants in the specification of distributions. Revue De l\u2019Institut Int De Statistique \/ Rev Int Stat Inst. 1938;5(4):307\u201320.","journal-title":"Revue De l\u2019Institut Int De Statistique \/ Rev Int Stat Inst"},{"issue":"1","key":"6240_CR39","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","volume":"57","author":"Y Benjamini","year":"1995","unstructured":"Benjamini Y, Hochberg Y. Controlling the false discovery Rate - a practical and powerful approach to multiple testing. J R Stat Soc B. 1995;57(1):289\u2013300.","journal-title":"J R Stat Soc B"},{"key":"6240_CR40","doi-asserted-by":"crossref","unstructured":"Guerrero-Juarez CF, Lee GH, Liu Y, Wang S, Karikomi M, Sha Y et al. 2022 Single-cell analysis of human basal cell carcinoma reveals novel regulators of tumor growth and the tumor microenvironment. Sci Adv 8(23):eabm7981.","DOI":"10.1126\/sciadv.abm7981"},{"key":"6240_CR41","doi-asserted-by":"crossref","unstructured":"Tabula Sapiens C, Jones RC, Karkanias J, Krasnow MA, Pisco AO, Quake SR et al. 2022 the Tabula sapiens: a multiple-organ, single-cell transcriptomic atlas of humans. Science. 376(6594):eabl4896.","DOI":"10.1126\/science.abl4896"},{"issue":"1","key":"6240_CR42","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1080\/03081078608934927","volume":"12","author":"U Kumar","year":"1986","unstructured":"Kumar U, Kapur VK JN. Normalized measures of entropy. Int J Gen Syst. 1986;12(1):55\u201369.","journal-title":"Int J Gen Syst"},{"issue":"1","key":"6240_CR43","doi-asserted-by":"publisher","first-page":"101","DOI":"10.2307\/2529621","volume":"30","author":"MG Bulmer","year":"1974","unstructured":"Bulmer MG. On fitting the Poisson lognormal distribution to Species-Abundance data. Biometrics. 1974;30(1):101\u201310.","journal-title":"Biometrics"},{"key":"6240_CR44","doi-asserted-by":"crossref","unstructured":"Wolock SL, Lopez R, Klein AM, Scrublet. Computational identification of cell doublets in Single-Cell transcriptomic data. Cell Syst. 2019;8(4):281\u201391.","DOI":"10.1016\/j.cels.2018.11.005"},{"key":"6240_CR45","doi-asserted-by":"crossref","unstructured":"Harris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D et al. 2020 array programming with numpy. Nature. 2020;585(7825):357\u201362.","DOI":"10.1038\/s41586-020-2649-2"},{"key":"6240_CR46","unstructured":"Pedregosa FV, Gramfort G, Michel A, Thirion V, Grisel B, Blondel O, Prettenhofer M, Weiss P, Dubourg R, Vanderplas V, Passos J, Cournapeau A, Brucher D, Perrot M, Duchesnay M. E 2011 Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825\u201330."},{"issue":"1","key":"6240_CR47","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1214\/aoms\/1177730491","volume":"18","author":"HB Mann","year":"1947","unstructured":"Mann HB, Whitney DR. On a test of whether one of two random variables is stochastically larger than the other. Annals Math Stat. 1947;18(1):50\u201360.","journal-title":"Annals Math Stat"},{"key":"6240_CR48","unstructured":"Bengtsson H. profmem: Simple memory profiling for R. 2020."},{"key":"6240_CR49","doi-asserted-by":"crossref","unstructured":"Carvalho K, Rebboah E, Jansen C, Williams K, Dowey A, McGill C et al. Uncovering the gene regulatory networks underlying macrophage polarization through comparative analysis of bulk and single-cell data. bioRxiv. 2021.","DOI":"10.1101\/2021.01.20.427499"},{"key":"6240_CR50","doi-asserted-by":"crossref","unstructured":"Townes, F.W., Hicks, S.C., Aryee, M.J. et al. Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model. Genome Biol 20, 295 (2019).","DOI":"10.1186\/s13059-019-1861-6"},{"key":"6240_CR51","doi-asserted-by":"crossref","unstructured":"Sarkar, A., Stephens, M. Separating measurement and expression models clarifies confusion in single-cell RNA sequencing analysis. Nat Genet 53, 770\u2013777 (2021). https:\/\/doi.org\/10.1038\/s41588-021-00873-4","DOI":"10.1038\/s41588-021-00873-4"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-025-06240-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-025-06240-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-025-06240-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,2]],"date-time":"2025-10-02T10:00:38Z","timestamp":1759399238000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06240-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,2]]},"references-count":51,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["6240"],"URL":"https:\/\/doi.org\/10.1186\/s12859-025-06240-y","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,10,2]]},"assertion":[{"value":"16 January 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 July 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 October 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not Applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"238"}}