{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T21:50:04Z","timestamp":1775598604391,"version":"3.50.1"},"reference-count":25,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2020,8,25]],"date-time":"2020-08-25T00:00:00Z","timestamp":1598313600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100007904","name":"Texas A&M University","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100007904","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Texas A&M Institute of Data Science"},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,5,17]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Quality control (QC) is a critical step in single-cell RNA-seq (scRNA-seq) data analysis. Low-quality cells are removed from the analysis during the QC process to avoid misinterpretation of the data. An important QC metric is the mitochondrial proportion (mtDNA%), which is used as a threshold to filter out low-quality cells. Early publications in the field established a threshold of 5% and since then, it has been used as a default in several software packages for scRNA-seq data analysis, and adopted as a standard in many scRNA-seq studies. However, the validity of using a uniform threshold across different species, single-cell technologies, tissues and cell types has not been adequately assessed.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We systematically analyzed 5\u00a0530\u00a0106 cells reported in 1349 annotated datasets available in the PanglaoDB database and found that the average mtDNA% in scRNA-seq data across human tissues is significantly higher than in mouse tissues. This difference is not confounded by the platform used to generate the data. Based on this finding, we propose new reference values of the mtDNA% for 121 tissues of mouse and 44 tissues of humans. In general, for mouse tissues, the 5% threshold performs well to distinguish between healthy and low-quality cells. However, for human tissues, the 5% threshold should be reconsidered as it fails to accurately discriminate between healthy and low-quality cells in 29.5% (13 of 44) tissues analyzed. We conclude that omitting the mtDNA% QC filter or adopting a suboptimal mtDNA% threshold may lead to erroneous biological interpretations of scRNA-seq data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availabilityand implementation<\/jats:title>\n                    <jats:p>The code used to download datasets, perform the analyzes and produce the figures is available at https:\/\/github.com\/dosorio\/mtProportion.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa751","type":"journal-article","created":{"date-parts":[[2020,8,18]],"date-time":"2020-08-18T16:55:35Z","timestamp":1597769735000},"page":"963-967","source":"Crossref","is-referenced-by-count":167,"title":["Systematic determination of the mitochondrial proportion in human and mice tissues for single-cell RNA-sequencing data quality control"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4424-8422","authenticated-orcid":false,"given":"Daniel","family":"Osorio","sequence":"first","affiliation":[{"name":"Department of Veterinary Integrative Biosciences, Texas A&M University , College Station, TX 77843-4458, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8081-6725","authenticated-orcid":false,"given":"James J","family":"Cai","sequence":"additional","affiliation":[{"name":"Department of Veterinary Integrative Biosciences, Texas A&M University , College Station, TX 77843-4458, USA"},{"name":"Department of Electrical & Computer Engineering, Texas A&M University , College Station, TX 77843-4458, USA"},{"name":"Interdisciplinary Program of Genetics, Texas A&M University , College Station, TX 77843-4458, USA"}]}],"member":"286","published-online":{"date-parts":[[2020,8,25]]},"reference":[{"key":"2023051612170121300_btaa751-B1","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1016\/j.omtm.2018.07.003","article-title":"An introduction to the analysis of single-cell RNA-sequencing data","volume":"10","author":"AlJanahi","year":"2018","journal-title":"Mol. Ther. Methods Clin. Dev"},{"key":"2023051612170121300_btaa751-B2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-015-0844-5","article-title":"MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data","volume":"16","author":"Finak","year":"2015","journal-title":"Genome Biol"},{"key":"2023051612170121300_btaa751-B3","first-page":"1","article-title":"PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data. Database (Oxford)","author":"et","year":"2019","journal-title":"baz046"},{"key":"2023051612170121300_btaa751-B4","author":"Germain","year":"2020"},{"key":"2023051612170121300_btaa751-B5","doi-asserted-by":"crossref","first-page":"633","DOI":"10.1101\/gr.178426.114","article-title":"Global variability in gene expression and alternative splicing is modulated by mitochondrial content","volume":"25","author":"Guantes","year":"2015","journal-title":"Genome Res"},{"key":"2023051612170121300_btaa751-B6","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1038\/s12276-018-0071-8","article-title":"Single-cell RNA sequencing technologies and bioinformatics pipelines","volume":"50","author":"Hwang","year":"2018","journal-title":"Exp. Mol. Med"},{"key":"2023051612170121300_btaa751-B7","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1186\/s13059-016-0888-1","article-title":"Classification of low quality cells from single-cell RNA-seq data","volume":"17","author":"Ilicic","year":"2016","journal-title":"Genome Biol"},{"key":"2023051612170121300_btaa751-B8","doi-asserted-by":"crossref","first-page":"e92","DOI":"10.1002\/cpmb.92","article-title":"Single-cell RNA-seq: introduction to bioinformatics analysis","volume":"127","author":"Ji","year":"2019","journal-title":"Curr. Protoc. Mol. Biol"},{"key":"2023051612170121300_btaa751-B9","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: kyoto encyclopedia of genes and genomes","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023051612170121300_btaa751-B10","author":"Korotkevich","year":"2019"},{"key":"2023051612170121300_btaa751-B11","author":"Lang","year":"2012"},{"key":"2023051612170121300_btaa751-B12","doi-asserted-by":"crossref","first-page":"e8746","DOI":"10.15252\/msb.20188746","article-title":"Current best practices in single-cell RNA-seq analysis: a tutorial","volume":"15","author":"Luecken","year":"2019","journal-title":"Mol. Syst. Biol"},{"key":"2023051612170121300_btaa751-B13","doi-asserted-by":"crossref","first-page":"180192","DOI":"10.1038\/sdata.2018.192","article-title":"Single-cell RNA sequencing of adult mouse testes","volume":"5","author":"Lukassen","year":"2018","journal-title":"Sci. Data"},{"key":"2023051612170121300_btaa751-B14","first-page":"2122","article-title":"A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor","volume":"5","author":"Lun","year":"2016","journal-title":"F1000Res"},{"key":"2023051612170121300_btaa751-B15","first-page":"493","author":"Ma","year":"2019"},{"key":"2023051612170121300_btaa751-B16","doi-asserted-by":"crossref","first-page":"1179","DOI":"10.1093\/bioinformatics\/btw777","article-title":"Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R","volume":"33","author":"McCarthy","year":"2017","journal-title":"Bioinformatics"},{"key":"2023051612170121300_btaa751-B17","doi-asserted-by":"crossref","first-page":"645","DOI":"10.1016\/j.cell.2011.06.051","article-title":"The human mitochondrial transcriptome","volume":"146","author":"Mercer","year":"2011","journal-title":"Cell"},{"key":"2023051612170121300_btaa751-B18","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1002\/bies.201500105","article-title":"Mitochondrial content is central to nuclear gene expression: profound implications for human health","volume":"38","author":"Muir","year":"2016","journal-title":"Bioessays"},{"key":"2023051612170121300_btaa751-B19","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1002\/cyto.a.23898","article-title":"Apoptotic cell exclusion and bias-free single-cell selection are important quality control requirements for successful single-cell sequencing applications","volume":"97","author":"Ordonez-Rueda","year":"2020","journal-title":"Cytometry A"},{"key":"2023051612170121300_btaa751-B20","year":"2013"},{"key":"2023051612170121300_btaa751-B21","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1038\/nmeth.2764","article-title":"Entering the era of single-cell transcriptomics in biology and medicine","volume":"11","author":"Sandberg","year":"2014","journal-title":"Nat. Methods"},{"key":"2023051612170121300_btaa751-B22","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1038\/nbt.3192","article-title":"Spatial reconstruction of single-cell gene expression data","volume":"33","author":"Satija","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023051612170121300_btaa751-B23","author":"Svensson","year":"2019"},{"key":"2023051612170121300_btaa751-B24","article-title":"xml2: parse XML","volume":".2","author":"Wickham","year":"2018","journal-title":"R Package Version 1.2"},{"key":"2023051612170121300_btaa751-B25","doi-asserted-by":"crossref","first-page":"4411","DOI":"10.1093\/emboj\/cdf445","article-title":"A mitochondrial specific stress response in mammalian cells","volume":"21","author":"Zhao","year":"2002","journal-title":"EMBO J"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa751\/33785747\/btaa751.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/7\/963\/50341166\/btaa751.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/7\/963\/50341166\/btaa751.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T08:18:38Z","timestamp":1684225118000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/7\/963\/5896986"}},"subtitle":[],"editor":[{"given":"Anthony","family":"Mathelier","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,8,25]]},"references-count":25,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2021,5,17]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa751","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.02.20.958793","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,4,1]]},"published":{"date-parts":[[2020,8,25]]}}}