{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T04:37:09Z","timestamp":1775191029866,"version":"3.50.1"},"reference-count":52,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2019,11,28]],"date-time":"2019-11-28T00:00:00Z","timestamp":1574899200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2019,11,28]],"date-time":"2019-11-28T00:00:00Z","timestamp":1574899200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Nat Commun"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Single-cell transcriptomics yields ever growing data sets containing RNA expression levels for thousands of genes from up to millions of cells. Common data analysis pipelines include a dimensionality reduction step for visualising the data in two dimensions, most frequently performed using t-distributed stochastic neighbour embedding (t-SNE). It excels at revealing local structure in high-dimensional data, but naive applications often suffer from severe shortcomings, e.g. the global structure of the data is not represented accurately. Here we describe how to circumvent such pitfalls, and develop a protocol for creating more faithful t-SNE visualisations. It includes PCA initialisation, a high learning rate, and multi-scale similarity kernels; for very large data sets, we additionally use exaggeration and downsampling-based initialisation. 
We use published single-cell RNA-seq data sets to demonstrate that this protocol yields superior results compared to the naive application of t-SNE.<\/jats:p>","DOI":"10.1038\/s41467-019-13056-x","type":"journal-article","created":{"date-parts":[[2019,11,28]],"date-time":"2019-11-28T06:03:08Z","timestamp":1574920988000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":949,"title":["The art of using t-SNE for single-cell transcriptomics"],"prefix":"10.1038","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5639-7209","authenticated-orcid":false,"given":"Dmitry","family":"Kobak","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0199-4727","authenticated-orcid":false,"given":"Philipp","family":"Berens","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2019,11,28]]},"reference":[{"key":"13056_CR1","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1038\/nmeth.2764","volume":"11","author":"R Sandberg","year":"2014","unstructured":"Sandberg, R. Entering the era of single-cell transcriptomics in biology and medicine. Nat. Methods 11, 22 (2014).","journal-title":"Nat. Methods"},{"key":"13056_CR2","doi-asserted-by":"publisher","first-page":"1131","DOI":"10.1038\/nn.4366","volume":"19","author":"JF Poulin","year":"2016","unstructured":"Poulin, J. F., Tasic, B., Hjerling-Leffler, J., Trimarchi, J. M. & Awatramani, R. Disentangling neural cell diversity using single-cell transcriptomics. Nat. Neuroscience 19, 1131 (2016).","journal-title":"Nat. Neuroscience"},{"key":"13056_CR3","doi-asserted-by":"publisher","first-page":"72","DOI":"10.1038\/s41586-018-0654-5","volume":"563","author":"B Tasic","year":"2018","unstructured":"Tasic, B. et al. Shared and distinct transcriptomic cell types across neocortical areas. 
Nature 563, 72 (2018).","journal-title":"Nature"},{"key":"13056_CR4","doi-asserted-by":"crossref","unstructured":"The Tabula Muris Consortium. Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris. Nature 562, 367\u2013372 (2018).","DOI":"10.1038\/s41586-018-0590-4"},{"key":"13056_CR5","doi-asserted-by":"publisher","first-page":"999","DOI":"10.1016\/j.cell.2018.06.021","volume":"174","author":"A Zeisel","year":"2018","unstructured":"Zeisel, A. et al. Molecular architecture of the mouse nervous system. Cell 174, 999\u20131014 (2018).","journal-title":"Cell"},{"key":"13056_CR6","doi-asserted-by":"publisher","first-page":"1091","DOI":"10.1016\/j.cell.2018.02.001","volume":"172","author":"X Han","year":"2018","unstructured":"Han, X. et al. Mapping the mouse cell atlas by Microwell-seq. Cell 172, 1091\u20131107 (2018).","journal-title":"Cell"},{"key":"13056_CR7","doi-asserted-by":"publisher","first-page":"1015","DOI":"10.1016\/j.cell.2018.07.028","volume":"174","author":"A Saunders","year":"2018","unstructured":"Saunders, A. et al. Molecular diversity and specializations among the cells of the adult mouse brain. Cell 174, 1015\u20131030 (2018).","journal-title":"Cell"},{"key":"13056_CR8","doi-asserted-by":"publisher","first-page":"496","DOI":"10.1038\/s41586-019-0969-x","volume":"566","author":"J Cao","year":"2019","unstructured":"Cao, J. et al. The single-cell transcriptional landscape of mammalian organogenesis. Nature 566, 496 (2019).","journal-title":"Nature"},{"key":"13056_CR9","first-page":"2579","volume":"9","author":"L van der Maaten","year":"2008","unstructured":"van der Maaten, L. & Hinton, G. Visualizing data using t-SNE. J. Mach. Learning Res. 9, 2579\u20132605 (2008).","journal-title":"J. Mach. Learning Res."},{"key":"13056_CR10","doi-asserted-by":"crossref","unstructured":"McInnes, L., Healy, J. & Melville, J. UMAP: Uniform manifold approximation and projection for dimension reduction. 
https:\/\/arxiv.org\/abs\/1802.03426 (2018).","DOI":"10.21105\/joss.00861"},{"key":"13056_CR11","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1038\/nbt.4314","volume":"37","author":"E Becht","year":"2019","unstructured":"Becht, E. et al. Dimensionality reduction for visualizing single-cell data using UMAP. Nat. Biotechnol. 37, 38 (2019).","journal-title":"Nat. Biotechnol."},{"key":"13056_CR12","doi-asserted-by":"crossref","unstructured":"Wattenberg, M., Vi\u00e9gas, F., & Johnson, I. How to use t-SNE effectively. Distill, http:\/\/distill.pub\/2016\/misread-tsne (2016).","DOI":"10.23915\/distill.00002"},{"key":"13056_CR13","doi-asserted-by":"publisher","first-page":"246","DOI":"10.1016\/j.neucom.2014.12.095","volume":"169","author":"JA Lee","year":"2015","unstructured":"Lee, J. A., Peluffo-Ord\u00f3\u00f1ez, D. H. & Verleysen, M. Multi-scale similarities in stochastic neighbour embedding: Reducing dimensionality while preserving both local and global structure. Neurocomputing 169, 246\u2013261 (2015).","journal-title":"Neurocomputing"},{"key":"13056_CR14","unstructured":"Bodt, C. D., Mulders, D., Verleysen, M., & Lee, J. A. Perplexity-free t-SNE and twice student tt-SNE. In European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning 123\u2013128 (2018)."},{"key":"13056_CR15","doi-asserted-by":"publisher","unstructured":"Belkina, A. C. et al. Automated optimized parameters for t-distributed stochastic neighbor embedding improve visualization and allow analysis of large datasets. Nat. Comms, https:\/\/doi.org\/10.1038\/s41467-019-13055-y\u00a0(2019).","DOI":"10.1038\/s41467-019-13055-y"},{"key":"13056_CR16","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1038\/s41592-018-0308-4","volume":"16","author":"GC Linderman","year":"2019","unstructured":"Linderman, G. C., Rachh, M., Hoskins, J. G., Steinerberger, S. & Kluger, Y. Fast interpolation-based t-SNE for improved visualization of single-cell RNA-seq data. 
Nat. Methods 16, 243 (2019).","journal-title":"Nat. Methods"},{"key":"13056_CR17","doi-asserted-by":"publisher","first-page":"545","DOI":"10.1038\/nbt.2594","volume":"31","author":"ED Amir","year":"2013","unstructured":"Amir, E. D. et al. viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia. Nat. Biotechnol. 31, 545 (2013).","journal-title":"Nat. Biotechnol."},{"key":"13056_CR18","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-017-01689-9","volume":"8","author":"V Unen","year":"2017","unstructured":"Unen, V. et al. Visual analysis of mass cytometry data by hierarchical stochastic neighbour embedding reveals rare cell types. Nat. Commun. 8, 1740 (2017).","journal-title":"Nat. Commun."},{"key":"13056_CR19","doi-asserted-by":"publisher","first-page":"1750017","DOI":"10.1142\/S0219720017500172","volume":"15","author":"W Li","year":"2017","unstructured":"Li, W., Cerise, J. E., Yang, Y. & Han, H. Application of t-SNE to human genetic data. J. Bioinform. Comput. Biol. 15, 1750017 (2017).","journal-title":"J. Bioinform. Comput. Biol."},{"key":"13056_CR20","doi-asserted-by":"crossref","unstructured":"Diaz-Papkovich, A., Anderson-Trocme, L. Gravel, S. Revealing multi-scale population structure in large cohorts. https:\/\/www.biorxiv.org\/content\/10.1101\/423632v2 (2018).","DOI":"10.1101\/423632"},{"key":"13056_CR21","doi-asserted-by":"crossref","unstructured":"Schmidt, B. Stable random projection: lightweight, general-purpose dimensionality reduction for digitized libraries. http:\/\/culturalanalytics.org\/2018\/09\/stable-random-projection-lightweight-general-purpose-dimensionality-reduction-for-digitized-libraries\/\u00a0(2018).","DOI":"10.31235\/osf.io\/36neu"},{"key":"13056_CR22","doi-asserted-by":"publisher","first-page":"1431","DOI":"10.1016\/j.neucom.2008.12.017","volume":"72","author":"JA Lee","year":"2009","unstructured":"Lee, J. A. & Verleysen, M. 
Quality assessment of dimensionality reduction: Rank-based criteria. Neurocomputing 72, 1431\u20131443 (2009).","journal-title":"Neurocomputing"},{"key":"13056_CR23","doi-asserted-by":"publisher","DOI":"10.1186\/s13059-017-1382-0","volume":"19","author":"FA Wolf","year":"2018","unstructured":"Wolf, F. A., Angerer, P. & Theis, F. J. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 19, 15 (2018).","journal-title":"Genome Biol."},{"key":"13056_CR24","doi-asserted-by":"publisher","first-page":"1202","DOI":"10.1016\/j.cell.2015.05.002","volume":"161","author":"EZ Macosko","year":"2015","unstructured":"Macosko, E. Z. et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell 161, 1202\u20131214 (2015).","journal-title":"Cell"},{"key":"13056_CR25","doi-asserted-by":"publisher","first-page":"1308","DOI":"10.1016\/j.cell.2016.07.054","volume":"166","author":"K Shekhar","year":"2016","unstructured":"Shekhar, K. et al. Comprehensive classification of retinal bipolar neurons by single-cell transcriptomics. Cell 166, 1308\u20131323 (2016).","journal-title":"Cell"},{"key":"13056_CR26","doi-asserted-by":"publisher","first-page":"e2006387","DOI":"10.1371\/journal.pbio.2006387","volume":"16","author":"KD Harris","year":"2018","unstructured":"Harris, K. D. et al. Classes and continua of hippocampal CA1 inhibitory neurons revealed by single-cell transcriptomics. PLoS Biol. 16, e2006387 (2018).","journal-title":"PLoS Biol."},{"key":"13056_CR27","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1038\/nbt.3445","volume":"34","author":"CR Cadwell","year":"2016","unstructured":"Cadwell, C. R. et al. Electrophysiological, transcriptomic and morphologic profiling of single neurons using Patch-seq. Nat. Biotechnol. 34, 199 (2016).","journal-title":"Nat. 
Biotechnol."},{"key":"13056_CR28","doi-asserted-by":"publisher","first-page":"359","DOI":"10.1038\/nmeth.4644","volume":"15","author":"VY Kiselev","year":"2018","unstructured":"Kiselev, V. Y., Yiu, A. & Hemberg, M. scmap: projection of single-cell RNA-seq data across data sets. Nat. Methods 15, 359 (2018).","journal-title":"Nat. Methods"},{"key":"13056_CR29","doi-asserted-by":"publisher","first-page":"20140672","DOI":"10.1098\/rsif.2014.0672","volume":"11","author":"GJ Berman","year":"2014","unstructured":"Berman, G. J., Choi, D. M., Bialek, W. & Shaevitz, J. W. Mapping the stereotyped behaviour of freely moving fruit flies. J. Roy. Soc. Interface 11, 20140672 (2014).","journal-title":"J. Roy. Soc. Interface"},{"key":"13056_CR30","doi-asserted-by":"crossref","unstructured":"Poli\u010dar, P. G., Stra\u017ear, M. & Zupan, B. Embedding to reference t-SNE space addresses batch effects in single-cell classification. https:\/\/www.biorxiv.org\/content\/10.1101\/671404v1 (2019).","DOI":"10.1101\/671404"},{"key":"13056_CR31","doi-asserted-by":"publisher","first-page":"335","DOI":"10.1038\/nn.4216","volume":"19","author":"B Tasic","year":"2016","unstructured":"Tasic, B. et al. Adult mouse cortical cell taxonomy revealed by single cell transcriptomics. Nat. Neurosci. 19, 335 (2016).","journal-title":"Nat. Neurosci."},{"key":"13056_CR32","first-page":"3221","volume":"15","author":"L van der Maaten","year":"2014","unstructured":"van der Maaten, L. Accelerating t-SNE using tree-based algorithms. J. Mach. Learning Res. 15, 3221\u20133245 (2014).","journal-title":"J. Mach. Learning Res."},{"key":"13056_CR33","doi-asserted-by":"publisher","first-page":"313","DOI":"10.1137\/18M1216134","volume":"1","author":"GC Linderman","year":"2019","unstructured":"Linderman, G. C. & Steinerberger, S. Clustering with t-SNE, provably. SIAM J. Math. Data Sci. 1, 313\u2013332 (2019).","journal-title":"SIAM J. Math. Data Sci."},{"key":"13056_CR34","unstructured":"Linderman, G. 
C., Rachh, M., Hoskins, J. G., Steinerberger, S. & Kluger, Y. Efficient algorithms for t-distributed stochastic neighborhood embedding. https:\/\/arxiv.org\/abs\/1712.09005 (2017)."},{"key":"13056_CR35","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1523\/JNEUROSCI.2899-04.2005","volume":"25","author":"C Englund","year":"2005","unstructured":"Englund, C. et al. Pax6, Tbr2, and Tbr1 are expressed sequentially by radial glia, intermediate progenitor cells, and postmitotic neurons in developing neocortex. J. Neurosci. 25, 247\u2013251 (2005).","journal-title":"J. Neurosci."},{"key":"13056_CR36","doi-asserted-by":"publisher","first-page":"3970","DOI":"10.1016\/j.celrep.2017.12.017","volume":"21","author":"SA Yuzwa","year":"2017","unstructured":"Yuzwa, S. A. et al. Developmental emergence of adult neural stem cells as revealed by single-cell transcriptional profiling. Cell Rep. 21, 3970\u20133986 (2017).","journal-title":"Cell Rep."},{"issue":"6","key":"13056_CR37","doi-asserted-by":"publisher","first-page":"878","DOI":"10.1101\/gr.230771.117","volume":"28","author":"Giovanni Iacono","year":"2018","unstructured":"Iacono, G. et al. bigSCale: an analytical framework for big-scale single-cell data. Genome Res. 28, 870\u2013890 (2018).","journal-title":"Genome Research"},{"key":"13056_CR38","doi-asserted-by":"publisher","DOI":"10.1186\/s12915-018-0580-x","volume":"16","author":"A Bhaduri","year":"2018","unstructured":"Bhaduri, A., Nowakowski, T. J., Pollen, A. A. & Kriegstein, A. R. Identification of cell types in a mouse brain single-cell atlas using low sampling coverage. BMC Biol. 16, 113 (2018).","journal-title":"BMC Biol."},{"key":"13056_CR39","doi-asserted-by":"crossref","unstructured":"Tang, J. Liu, J., Zhang, M. & Mei, Q. Visualizing large-scale and high-dimensional data. In Proc. 
25th International Conference on World Wide Web 287\u2013297 (2016).","DOI":"10.1145\/2872427.2883041"},{"key":"13056_CR40","doi-asserted-by":"crossref","unstructured":"Chan, D. M., Rao, R., Huang, F. & Canny, J. F. GPU accelerated t-distributed stochastic neighbor embedding. J. Parallel Distributed Comput. 131, 1\u201313 (2019).","DOI":"10.1016\/j.jpdc.2019.04.008"},{"key":"13056_CR41","doi-asserted-by":"crossref","unstructured":"Kobak, D., Linderman, G., Steinerberger, S., Kluger, Y. & Berens, P. Heavy-tailed kernels reveal a finer cluster structure in t-SNE visualisations. In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, in print. https:\/\/arxiv.org\/abs\/1902.05804 (2019).","DOI":"10.1007\/978-3-030-46150-8_8"},{"key":"13056_CR42","unstructured":"van der Maaten, L. Learning a parametric embedding by preserving local structure. In Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics 384\u2013391 (2009)."},{"key":"13056_CR43","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1016\/j.cels.2018.05.017","volume":"7","author":"H Cho","year":"2018","unstructured":"Cho, H., Berger, B. & Peng, J. Generalizable and scalable visualization of single-cell data using neural networks. Cell Syst. 7, 185\u2013191 (2018).","journal-title":"Cell Syst."},{"key":"13056_CR44","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-018-04368-5","volume":"9","author":"J Ding","year":"2018","unstructured":"Ding, J., Condon, A. & Shah, S. P. Interpretable dimensionality reduction of single cell transcriptome data with deep generative models. Nat. Commun. 9, 2002 (2018).","journal-title":"Nat. Commun."},{"key":"13056_CR45","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1111\/cgf.12878","volume":"35","author":"N Pezzotti","year":"2016","unstructured":"Pezzotti, N., H\u00f6llt, T., Lelieveldt, B., Eisemann, E. & Vilanova, A. 
Hierarchical stochastic neighbor embedding. Comput. Graphics Forum 35, 21\u201330 (2016).","journal-title":"Comput. Graphics Forum"},{"key":"13056_CR46","doi-asserted-by":"publisher","DOI":"10.1186\/s13059-019-1663-x","volume":"20","author":"FA Wolf","year":"2019","unstructured":"Wolf, F. A. et al. PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells. Genome Biol. 20, 59 (2019).","journal-title":"Genome Biol."},{"key":"13056_CR47","unstructured":"Hinton, G. E. & Roweis, S. T. Stochastic neighbor embedding. In Advances in Neural Information Processing Systems 857\u2013864 (2003)."},{"key":"13056_CR48","doi-asserted-by":"publisher","first-page":"1739","DOI":"10.1109\/TVCG.2016.2570755","volume":"23","author":"N Pezzotti","year":"2017","unstructured":"Pezzotti, N. et al. Approximated and user steerable tSNE for progressive visual analytics. IEEE Trans. Visualization Comput. Graphics 23, 1739\u20131752 (2017).","journal-title":"IEEE Trans. Visualization Comput. Graphics"},{"key":"13056_CR49","doi-asserted-by":"crossref","unstructured":"Andrews, T. S. & Hemberg, M. M3Drop: Dropout-based feature selection for scRNASeq. Bioinformatics (2018).","DOI":"10.1093\/bioinformatics\/bty1044"},{"key":"13056_CR50","doi-asserted-by":"publisher","DOI":"10.1038\/ncomms14049","volume":"8","author":"GXY Zheng","year":"2017","unstructured":"Zheng, G. X. Y. et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 8, 14049 (2017).","journal-title":"Nat. Commun."},{"key":"13056_CR51","doi-asserted-by":"crossref","unstructured":"Townes, F. W., Hicks, S. C., Aryee, M. J. & Irizarry, R. A. Feature selection and dimension reduction for single cell RNA-seq based on a multinomial model. https:\/\/www.biorxiv.org\/content\/10.1101\/574574v1 (2019).","DOI":"10.1101\/574574"},{"key":"13056_CR52","doi-asserted-by":"crossref","unstructured":"Poli\u010dar, P. G., Stra\u017ear, M. & Zupan, B. 
openTSNE: a modular Python library for t-SNE dimensionality reduction and embedding. https:\/\/www.biorxiv.org\/content\/10.1101\/731877v3 (2019).","DOI":"10.1101\/731877"}],"container-title":["Nature Communications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.nature.com\/articles\/s41467-019-13056-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41467-019-13056-x","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41467-019-13056-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,16]],"date-time":"2022-12-16T21:06:52Z","timestamp":1671224812000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.nature.com\/articles\/s41467-019-13056-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,28]]},"references-count":52,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2019,12]]}},"alternative-id":["13056"],"URL":"https:\/\/doi.org\/10.1038\/s41467-019-13056-x","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/453449","asserted-by":"object"}]},"ISSN":["2041-1723"],"issn-type":[{"value":"2041-1723","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11,28]]},"assertion":[{"value":"20 November 2018","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 August 2019","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 November 2019","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing 
interests"}}],"article-number":"5416"}}