{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:29Z","timestamp":1772138069774,"version":"3.50.1"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2024,8,5]],"date-time":"2024-08-05T00:00:00Z","timestamp":1722816000000},"content-version":"vor","delay-in-days":4,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"New Frontiers in Research Fund Exploration","award":["GR003572"],"award-info":[{"award-number":["GR003572"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,8,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Lineage tracing and trajectory inference from single-cell RNA-sequencing data hold tremendous potential for uncovering the genetic programs driving development and disease. Single cell datasets are thought to provide an unbiased view on the diverse cellular architecture of tissues. Sampling bias, however, can skew single cell datasets away from the cellular composition they are meant to represent.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We demonstrate a novel form of sampling bias, caused by a statistical phenomenon related to repeated sampling from a growing, heterogeneous population. Relative growth rates of cells influence the probability that they will be sampled in clones observed across multiple time points. We support our probabilistic derivations with a simulation study and an analysis of a real time-course of T-cell development. We find that this bias can impact fate probability predictions, and we explore how to develop trajectory inference methods which are robust to this bias.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>Source code for the simulated datasets and to create the figures in this manuscript is freely available in python at https:\/\/github.com\/rbonhamcarter\/simulate-clones. A python implementation of the extension of the LineageOT method is freely available at https:\/\/github.com\/rbonhamcarter\/LineageOT\/tree\/multi-time-clones.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae483","type":"journal-article","created":{"date-parts":[[2024,8,1]],"date-time":"2024-08-01T16:46:16Z","timestamp":1722530776000},"source":"Crossref","is-referenced-by-count":2,"title":["Cellular proliferation biases clonal lineage tracing and trajectory inference"],"prefix":"10.1093","volume":"40","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0163-4673","authenticated-orcid":false,"given":"Becca","family":"Bonham-Carter","sequence":"first","affiliation":[{"name":"Department of Mathematics, University of British Columbia , Vancouver, BC V6T 1Z4, Canada"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8290-7997","authenticated-orcid":false,"given":"Geoffrey","family":"Schiebinger","sequence":"additional","affiliation":[{"name":"Department of Mathematics, University of British Columbia , Vancouver, BC V6T 1Z4, Canada"}]}],"member":"286","published-online":{"date-parts":[[2024,8,5]]},"reference":[{"key":"2024081723155823200_btae483-B1","doi-asserted-by":"crossref","first-page":"753","DOI":"10.1038\/s41580-019-0186-3","article-title":"Unravelling cellular relationships during development and regeneration using genetic lineage tracing","volume":"20","author":"Baron","year":"2019","journal-title":"Nat Rev Mol Cell Biol"},{"key":"2024081723155823200_btae483-B2","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1038\/s41586-018-0744-4","article-title":"Single-cell mapping of lineage and identity in direct reprogramming","volume":"564","author":"Biddy","year":"2018","journal-title":"Nature"},{"key":"2024081723155823200_btae483-B3","doi-asserted-by":"crossref","first-page":"1410","DOI":"10.1016\/j.cell.2020.04.048","article-title":"An engineered crispr-cas9 mouse line for simultaneous readout of lineage histories and gene expression profiles in single cells","volume":"181","author":"Bowling","year":"2020","journal-title":"Cell"},{"key":"2024081723155823200_btae483-B4","doi-asserted-by":"crossref","first-page":"496","DOI":"10.1038\/s41586-019-0969-x","article-title":"The single-cell transcriptional landscape of mammalian organogenesis","volume":"566","author":"Cao","year":"2019","journal-title":"Nature"},{"key":"2024081723155823200_btae483-B5","doi-asserted-by":"crossref","first-page":"1903","DOI":"10.1038\/s41467-019-09670-4","article-title":"Single-cell trajectories reconstruction, exploration and mapping of omics data with stream","volume":"10","author":"Chen","year":"2019","journal-title":"Nat Commun"},{"key":"2024081723155823200_btae483-B6","doi-asserted-by":"crossref","first-page":"eaar3131","DOI":"10.1126\/science.aar3131","article-title":"Single-cell reconstruction of developmental trajectories during zebrafish embryogenesis","volume":"360","author":"Farrell","year":"2018","journal-title":"Science"},{"key":"2024081723155823200_btae483-B7","doi-asserted-by":"crossref","first-page":"1800056","DOI":"10.1002\/bies.201800056","article-title":"Creating lineage trajectory maps via integration of single-cell RNA-sequencing and lineage tracing: integrating transgenic lineage tracing and single-cell RNA-sequencing is a robust approach for mapping developmental lineage trajectories and cell fate changes","volume":"40","author":"Fletcher","year":"2018","journal-title":"BioEssays"},{"key":"2024081723155823200_btae483-B8","doi-asserted-by":"crossref","first-page":"4940","DOI":"10.1038\/s41467-021-25133-1","article-title":"LineageOT is a unified framework for lineage tracing and trajectory inference","volume":"12","author":"Forrow","year":"2021","journal-title":"Nat Commun"},{"key":"2024081723155823200_btae483-B9","first-page":"6930","author":"He","year":"2021"},{"key":"2024081723155823200_btae483-B10","doi-asserted-by":"crossref","first-page":"593","DOI":"10.1016\/j.stem.2019.12.009","article-title":"Reconstructed single-cell fate trajectories define lineage plasticity windows during differentiation of human psc-derived distal lung progenitors","volume":"26","author":"Hurley","year":"2020","journal-title":"Cell Stem Cell"},{"key":"2024081723155823200_btae483-B11","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.stem.2018.04.014","article-title":"Single-cell transcriptomics meets lineage tracing","volume":"23","author":"Kester","year":"2018","journal-title":"Cell Stem Cell"},{"key":"2024081723155823200_btae483-B12","first-page":"14567","article-title":"Distribution aligning refinery of pseudo-label for imbalanced semi-supervised learning","volume":"33","author":"Kim","year":"2020","journal-title":"Adv Neural Inf Process Syst"},{"key":"2024081723155823200_btae483-B13","author":"Lange","year":"2021"},{"key":"2024081723155823200_btae483-B14","doi-asserted-by":"crossref","first-page":"4707","DOI":"10.1093\/bioinformatics\/btz296","article-title":"Continuous-state hmms for modeling time-series single-cell rna-seq data","volume":"35","author":"Lin","year":"2019","journal-title":"Bioinformatics"},{"key":"2024081723155823200_btae483-B15","doi-asserted-by":"crossref","first-page":"dev169730","DOI":"10.1242\/dev.169730","article-title":"Recording development with single cell dynamic lineage tracing","volume":"146","author":"McKenna","year":"2019","journal-title":"Development"},{"key":"2024081723155823200_btae483-B16","article-title":"Time- and lineage-resolved transcriptional profiling uncovers gene expression programs and clonal relationships that underlie human T lineage specification","author":"Michaels","year":"2023","journal-title":"bioRxiv"},{"key":"2024081723155823200_btae483-B17","author":"Prasad","year":"2020"},{"key":"2024081723155823200_btae483-B18","doi-asserted-by":"crossref","first-page":"442","DOI":"10.1038\/nbt.4103","article-title":"Simultaneous single-cell profiling of lineages and cell types in the vertebrate brain","volume":"36","author":"Raj","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2024081723155823200_btae483-B19","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1038\/s41587-019-0071-9","article-title":"A comparison of single-cell trajectory inference methods","volume":"37","author":"Saelens","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2024081723155823200_btae483-B20","doi-asserted-by":"crossref","first-page":"865","DOI":"10.1038\/s41587-020-0509-0","article-title":"Base editors for simultaneous introduction of C-to-T and A-to-G mutations","volume":"38","author":"Sakata","year":"2020","journal-title":"Nat Biotechnol"},{"key":"2024081723155823200_btae483-B21","doi-asserted-by":"crossref","first-page":"928","DOI":"10.1016\/j.cell.2019.01.006","article-title":"Optimal-transport analysis of single-cell gene expression identifies developmental trajectories in reprogramming","volume":"176","author":"Schiebinger","year":"2019","journal-title":"Cell"},{"key":"2024081723155823200_btae483-B22","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1038\/nbt.3569","article-title":"Wishbone identifies bifurcating developmental trajectories from single-cell data","volume":"34","author":"Setty","year":"2016","journal-title":"Nat Biotechnol"},{"key":"2024081723155823200_btae483-B23","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1038\/s41587-019-0068-4","article-title":"Characterization of cell fate probabilities in single-cell data with palantir","volume":"37","author":"Setty","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2024081723155823200_btae483-B24","doi-asserted-by":"crossref","first-page":"469","DOI":"10.1038\/nbt.4124","article-title":"Simultaneous lineage tracing and cell-type identification using CRISPR-Cas9-induced genetic scars","volume":"36","author":"Spanjaard","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2024081723155823200_btae483-B25","doi-asserted-by":"crossref","first-page":"477","DOI":"10.1186\/s12864-018-4772-0","article-title":"Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics","volume":"19","author":"Street","year":"2018","journal-title":"BMC Genomics"},{"key":"2024081723155823200_btae483-B26","first-page":"12267","author":"Taherkhani","year":"2021"},{"key":"2024081723155823200_btae483-B27","first-page":"9526","author":"Tong","year":"2020"},{"key":"2024081723155823200_btae483-B28","doi-asserted-by":"crossref","first-page":"e1008205","DOI":"10.1371\/journal.pcbi.1008205","article-title":"Tempora: cell trajectory inference using time-series single-cell RNA sequencing data","volume":"16","author":"Tran","year":"2020","journal-title":"PLoS Comput Biol"},{"key":"2024081723155823200_btae483-B29","doi-asserted-by":"crossref","first-page":"dev170506","DOI":"10.1242\/dev.170506","article-title":"Concepts and limitations for learning developmental trajectories from single cell genomics","volume":"146","author":"Tritschler","year":"2019","journal-title":"Development"},{"key":"2024081723155823200_btae483-B30","doi-asserted-by":"crossref","first-page":"410","DOI":"10.1038\/s41576-020-0223-2","article-title":"Lineage tracing meets single-cell omics: opportunities and challenges","volume":"21","author":"Wagner","year":"2020","journal-title":"Nat Rev Genet"},{"key":"2024081723155823200_btae483-B31","doi-asserted-by":"crossref","first-page":"1066","DOI":"10.1038\/s41587-022-01209-1","article-title":"CoSpar identifies early cell fate biases from single-cell transcriptomic and lineage information","volume":"40","author":"Wang","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2024081723155823200_btae483-B32","doi-asserted-by":"crossref","first-page":"E2467","DOI":"10.1073\/pnas.1714723115","article-title":"Fundamental limits on dynamic inference from single-cell snapshots","volume":"115","author":"Weinreb","year":"2018","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2024081723155823200_btae483-B33","doi-asserted-by":"crossref","first-page":"eaaw3381","DOI":"10.1126\/science.aaw3381","article-title":"Lineage tracing on transcriptional landscapes links state to fate during differentiation","volume":"367","author":"Weinreb","year":"2020","journal-title":"Science"},{"key":"2024081723155823200_btae483-B34","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1186\/s13059-019-1663-x","article-title":"PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells","volume":"20","author":"Wolf","year":"2019","journal-title":"Genome Biol"},{"key":"2024081723155823200_btae483-B35","doi-asserted-by":"crossref","first-page":"3222","DOI":"10.1038\/s41467-021-23518-w","article-title":"Generative modeling of single-cell time series with prescient enables prediction of cell trajectories with interventions","volume":"12","author":"Yeo","year":"2021","journal-title":"Nat Commun"},{"key":"2024081723155823200_btae483-B36","doi-asserted-by":"crossref","first-page":"3055","DOI":"10.1038\/s41467-020-16821-5","article-title":"Single-cell lineage tracing by integrating CRISPR-Cas9 mutations with transcriptomic data","volume":"11","author":"Zafar","year":"2020","journal-title":"Nat Commun"},{"key":"2024081723155823200_btae483-B37","doi-asserted-by":"crossref","first-page":"e1009466","DOI":"10.1371\/journal.pcbi.1009466","article-title":"Optimal transport analysis reveals trajectories in steady-state systems","volume":"17","author":"Zhang","year":"2021","journal-title":"PLoS Comput Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae483\/58739188\/btae483.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/8\/btae483\/58844153\/btae483.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/8\/btae483\/58844153\/btae483.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,17]],"date-time":"2024-08-17T19:16:18Z","timestamp":1723922178000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae483\/7727666"}},"subtitle":[],"editor":[{"given":"Can","family":"Alkan","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,8]]},"references-count":37,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2024,8,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae483","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.07.20.549801","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,8]]},"published":{"date-parts":[[2024,8]]},"article-number":"btae483"}}