{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,13]],"date-time":"2025-11-13T21:57:17Z","timestamp":1763071037428,"version":"3.37.3"},"reference-count":25,"publisher":"Oxford University Press (OUP)","issue":"15","license":[{"start":{"date-parts":[[2018,12,10]],"date-time":"2018-12-10T00:00:00Z","timestamp":1544400000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"National Program on Key Research Project of China","award":["2016YFC0902700"],"award-info":[{"award-number":["2016YFC0902700"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["31601070","81672745"],"award-info":[{"award-number":["31601070","81672745"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100007219","name":"Natural Science Foundation of Shanghai","doi-asserted-by":"publisher","award":["16ZR1417900"],"award-info":[{"award-number":["16ZR1417900"]}],"id":[{"id":"10.13039\/100007219","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Shanghai Pujiang program","award":["16PJ1405200","16PJ1405100"],"award-info":[{"award-number":["16PJ1405200","16PJ1405100"]}]},{"name":"Interdisciplinary Program of Shanghai Jiao Tong University"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Single-cell transcriptomic data are commonly accompanied by extremely high technical noise due to the low RNA concentrations from individual cells. Precise identification of differentially expressed genes and cell populations are heavily dependent on the effective reduction of technical noise, e.g. by gene filtering. However, there is still no well-established standard in the current approaches of gene filtering. Investigators usually filter out genes based on single fixed threshold, which commonly leads to both over- and under-stringent errors.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this study, we propose a novel algorithm, termed as Optimal Gene Filtering for Single-Cell data, to construct a thresholding curve based on gene expression levels and the corresponding variances. We validated our method on multiple single-cell RNA-seq datasets, including simulated and published experimental datasets. The results show that the known signal and known noise are reliably discriminated in the simulated datasets. In addition, the results of seven experimental datasets demonstrate that these cells of the same annotated types are more sharply clustered using our method. Interestingly, when we re-analyze the dataset from an aging research recently published in Science, we find a list of regulated genes which is different from that reported in the original study, because of using different filtering methods. However, the knowledge based on our findings better matches the progression of immunosenescence. In summary, we here provide an alternative opportunity to probe into the true level of technical noise in single-cell transcriptomic data.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>https:\/\/github.com\/XZouProjects\/OGFSC.git<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty1016","type":"journal-article","created":{"date-parts":[[2018,12,7]],"date-time":"2018-12-07T12:09:49Z","timestamp":1544184589000},"page":"2602-2609","source":"Crossref","is-referenced-by-count":17,"title":["Optimal Gene Filtering for Single-Cell data (OGFSC)\u2014a gene filtering algorithm for single-cell RNA-seq data"],"prefix":"10.1093","volume":"35","author":[{"given":"Jie","family":"Hao","sequence":"first","affiliation":[{"name":"Shanghai Centre for Systems Biomedicine, Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Centre for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai, China"}]},{"given":"Wei","family":"Cao","sequence":"additional","affiliation":[{"name":"Department of Oral Maxillofacial-Head and Neck Oncology, Ninth People\u2019s Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China"},{"name":"Shanghai Key Laboratory of Stomatology & Shanghai Research Institute of Stomatology, National Clinical Research Center of Stomatology, Shanghai, China"}]},{"given":"Jian","family":"Huang","sequence":"additional","affiliation":[{"name":"Shanghai Centre for Systems Biomedicine, Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Centre for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai, China"}]},{"given":"Xin","family":"Zou","sequence":"additional","affiliation":[{"name":"Shanghai Centre for Systems Biomedicine, Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Centre for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai, China"}]},{"given":"Ze-Guang","family":"Han","sequence":"additional","affiliation":[{"name":"Shanghai Centre for Systems Biomedicine, Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Centre for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai, China"}]}],"member":"286","published-online":{"date-parts":[[2018,12,10]]},"reference":[{"key":"2023062713154048700_bty1016-B1","doi-asserted-by":"crossref","first-page":"1093","DOI":"10.1038\/nmeth.2645","article-title":"Accounting for technical noise in single-cell RNA-seq experiments","volume":"10","author":"Brennecke","year":"2013","journal-title":"Nat. Methods"},{"key":"2023062713154048700_bty1016-B2","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1038\/nbt.3102","article-title":"Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells","volume":"33","author":"Buettner","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023062713154048700_bty1016-B3","doi-asserted-by":"crossref","first-page":"2225","DOI":"10.1093\/bioinformatics\/btv122","article-title":"Normalization and noise reduction for single cell RNA-seq experiments","volume":"31","author":"Ding","year":"2015","journal-title":"Bioinformatics"},{"key":"2023062713154048700_bty1016-B4","doi-asserted-by":"crossref","first-page":"278","DOI":"10.1186\/s13059-015-0844-5","article-title":"MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data","volume":"16","author":"Finak","year":"2015","journal-title":"Genome Biol"},{"key":"2023062713154048700_bty1016-B5","doi-asserted-by":"crossref","first-page":"e1002330.","DOI":"10.1371\/journal.pcbi.1002330","article-title":"Joint modelling of confounding factors and prominent genetic regulators provides increased accuracy in genetical genomics studies","volume":"8","author":"Fusi","year":"2012","journal-title":"PLoS Comput. Biol"},{"key":"2023062713154048700_bty1016-B6","doi-asserted-by":"crossref","first-page":"692","DOI":"10.1038\/nm.4336","article-title":"Single-cell transcriptomics uncovers distinct molecular signatures of stem cells in chronic myeloid leukemia","volume":"23","author":"Giustacchini","year":"2017","journal-title":"Nat. Med"},{"key":"2023062713154048700_bty1016-B7","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1016\/j.cell.2016.01.047","article-title":"Heterogeneity in Oct4 and Sox2 Targets Biases Cell Fate in 4-Cell Mouse Embryos","volume":"165","author":"Goolam","year":"2016","journal-title":"Cell"},{"key":"2023062713154048700_bty1016-B8","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1038\/nmeth.2930","article-title":"Validation of noise models for single-cell transcriptomics","volume":"11","author":"Grun","year":"2014","journal-title":"Nat. Methods"},{"key":"2023062713154048700_bty1016-B9","doi-asserted-by":"crossref","first-page":"740","DOI":"10.1038\/nmeth.2967","article-title":"Bayesian approach to single-cell differential expression analysis","volume":"11","author":"Kharchenko","year":"2014","journal-title":"Nat. Methods"},{"key":"2023062713154048700_bty1016-B10","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1038\/nmeth.4236","article-title":"SC3: consensus clustering of single-cell RNA-seq data","volume":"14","author":"Kiselev","year":"2017","journal-title":"Nat. Methods"},{"key":"2023062713154048700_bty1016-B11","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1038\/nmeth.4644","article-title":"scmap: projection of single-cell RNA-seq data across data sets","volume":"15","author":"Kiselev","year":"2018","journal-title":"Nat. Methods"},{"key":"2023062713154048700_bty1016-B12","doi-asserted-by":"crossref","first-page":"1187","DOI":"10.1016\/j.cell.2015.04.044","article-title":"Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells","volume":"161","author":"Klein","year":"2015","journal-title":"Cell"},{"key":"2023062713154048700_bty1016-B13","doi-asserted-by":"crossref","first-page":"1433","DOI":"10.1126\/science.aah4115","article-title":"Aging increases cell-to-cell transcriptional variability upon immune stimulation","volume":"355","author":"Martinez-Jimenez","year":"2017","journal-title":"Science"},{"key":"2023062713154048700_bty1016-B14","doi-asserted-by":"crossref","first-page":"889","DOI":"10.15252\/msb.20166969","article-title":"Single-cell sequencing maps gene expression to mutational phylogenies in PDGF- and EGF-driven gliomas","volume":"12","author":"Muller","year":"2016","journal-title":"Mol. Syst. Biol"},{"key":"2023062713154048700_bty1016-B15","doi-asserted-by":"crossref","first-page":"e190.","DOI":"10.1371\/journal.pgen.0020190","article-title":"Population structure and eigenanalysis","volume":"2","author":"Patterson","year":"2006","journal-title":"PLoS Genet"},{"key":"2023062713154048700_bty1016-B16","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1038\/nbt.2967","article-title":"Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex","volume":"32","author":"Pollen","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2023062713154048700_bty1016-B17","doi-asserted-by":"crossref","first-page":"1611","DOI":"10.1016\/j.cell.2017.10.044","article-title":"Single-Cell Transcriptomic Analysis of Primary and Metastatic Tumor Ecosystems in Head and Neck Cancer","volume":"171","author":"Puram","year":"2017","journal-title":"Cell"},{"key":"2023062713154048700_bty1016-B18","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1038\/nature13437","article-title":"Single-cell RNA-seq reveals dynamic paracrine control of cellular variation","volume":"510","author":"Shalek","year":"2014","journal-title":"Nature"},{"key":"2023062713154048700_bty1016-B19","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1038\/nmeth.4612","article-title":"Bias, robustness and scalability in single-cell differential expression analysis","volume":"15","author":"Soneson","year":"2018","journal-title":"Nat. Methods"},{"key":"2023062713154048700_bty1016-B20","doi-asserted-by":"crossref","first-page":"1905","DOI":"10.1016\/j.celrep.2014.08.029","article-title":"Single-cell RNA sequencing identifies extracellular matrix gene expression by pancreatic circulating tumor cells","volume":"8","author":"Ting","year":"2014","journal-title":"Cell Rep"},{"key":"2023062713154048700_bty1016-B21","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1038\/nature13173","article-title":"Reconstructing lineage hierarchies of the distal lung epithelium using single-cell RNA-seq","volume":"509","author":"Treutlein","year":"2014","journal-title":"Nature"},{"key":"2023062713154048700_bty1016-B22","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1038\/nn.3881","article-title":"Unbiased classification of sensory neuron types by large-scale single-cell RNA sequencing","volume":"18","author":"Usoskin","year":"2015","journal-title":"Nat. Neurosci"},{"key":"2023062713154048700_bty1016-B23","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1186\/s13059-016-0930-3","article-title":"Beyond comparisons of means: understanding changes in gene expression at the single-cell level","volume":"17","author":"Vallejos","year":"2016","journal-title":"Genome Biol"},{"key":"2023062713154048700_bty1016-B24","doi-asserted-by":"crossref","first-page":"84.","DOI":"10.1186\/s13059-017-1218-y","article-title":"Challenges and emerging directions in single-cell analysis","volume":"18","author":"Yuan","year":"2017","journal-title":"Genome Biol"},{"key":"2023062713154048700_bty1016-B25","doi-asserted-by":"crossref","first-page":"1138","DOI":"10.1126\/science.aaa1934","article-title":"Brain structure. Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq","volume":"347","author":"Zeisel","year":"2015","journal-title":"Science"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/15\/2602\/50722520\/bioinformatics_35_15_2602.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/15\/2602\/50722520\/bioinformatics_35_15_2602.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,27]],"date-time":"2023-06-27T13:16:53Z","timestamp":1687871813000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/15\/2602\/5237553"}},"subtitle":[],"editor":[{"given":"Bonnie","family":"Berger","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,12,10]]},"references-count":25,"journal-issue":{"issue":"15","published-print":{"date-parts":[[2019,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty1016","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2019,8,1]]},"published":{"date-parts":[[2018,12,10]]}}}