{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,10]],"date-time":"2026-05-10T06:00:42Z","timestamp":1778392842461,"version":"3.51.4"},"reference-count":21,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,5,10]],"date-time":"2021-05-10T00:00:00Z","timestamp":1620604800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,5,10]],"date-time":"2021-05-10T00:00:00Z","timestamp":1620604800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000936","name":"Gordon and Betty Moore Foundation","doi-asserted-by":"publisher","award":["GBMF4554"],"award-info":[{"award-number":["GBMF4554"]}],"id":[{"id":"10.13039\/100000936","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["R01GM122935"],"award-info":[{"award-number":["R01GM122935"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["DBI-1937540"],"award-info":[{"award-number":["DBI-1937540"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100010319","name":"Shurl and Kay Curci Foundation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100010319","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004897","name":"Pennsylvania Department of Health","doi-asserted-by":"publisher","award":["4100070287"],"award-info":[{"award-number":["4100070287"]}],"id":[{"id":"10.13039\/100004897","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Algorithms Mol Biol"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>The probability of sequencing a set of RNA-seq reads can be directly modeled using the abundances of splice junctions in splice graphs instead of the abundances of a list of transcripts. We call this model graph quantification, which was first proposed by Bernard et al. (Bioinformatics 30:2447\u201355, 2014). The model can be viewed as a generalization of transcript expression quantification where every full path in the splice graph is a possible transcript. However, the previous graph quantification model assumes the length of single-end reads or paired-end fragments is fixed.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We provide an improvement of this model to handle variable-length reads or fragments and incorporate bias correction. We prove that our model is equivalent to running a transcript quantifier with exactly the set of all compatible transcripts. The key to our method is constructing an extension of the splice graph based on Aho-Corasick automata. The proof of equivalence is based on a novel reparameterization of the read generation model of a state-of-art transcript quantification method.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>We propose a new approach for graph quantification, which is useful for modeling scenarios where reference transcriptome is incomplete or not available and can be further used in transcriptome assembly or alternative splicing analysis.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s13015-021-00184-7","type":"journal-article","created":{"date-parts":[[2021,5,10]],"date-time":"2021-05-10T19:03:47Z","timestamp":1620673427000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Exact transcript quantification over splice graphs"],"prefix":"10.1186","volume":"16","author":[{"given":"Cong","family":"Ma","sequence":"first","affiliation":[]},{"given":"Hongyu","family":"Zheng","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0118-5516","authenticated-orcid":false,"given":"Carl","family":"Kingsford","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,5,10]]},"reference":[{"issue":"1","key":"184_CR1","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1186\/1471-2105-12-323","volume":"12","author":"B Li","year":"2011","unstructured":"Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinf. 2011;12(1):323.","journal-title":"BMC Bioinf"},{"issue":"5","key":"184_CR2","doi-asserted-by":"publisher","first-page":"525","DOI":"10.1038\/nbt.3519","volume":"34","author":"NL Bray","year":"2016","unstructured":"Bray NL, Pimentel H, Melsted P, Pachter L. Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016;34(5):525\u20137.","journal-title":"Nat Biotechnol"},{"issue":"4","key":"184_CR3","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1038\/nmeth.4197","volume":"14","author":"R Patro","year":"2017","unstructured":"Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017;14(4):417\u20139.","journal-title":"Nat Methods"},{"issue":"17","key":"184_CR4","doi-asserted-by":"publisher","first-page":"2447","DOI":"10.1093\/bioinformatics\/btu317","volume":"30","author":"E Bernard","year":"2014","unstructured":"Bernard E, Jacob L, Mairal J, Vert J-P. Efficient RNA isoform identification and quantification from RNA-Seq data with network flows. Bioinformatics. 2014;30(17):2447\u201355.","journal-title":"Bioinformatics"},{"issue":"5","key":"184_CR5","doi-asserted-by":"publisher","first-page":"511","DOI":"10.1038\/nbt.1621","volume":"28","author":"C Trapnell","year":"2010","unstructured":"Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, Van Baren MJ, Salzberg SL, Wold BJ, Pachter L. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28(5):511\u20135.","journal-title":"Nat Biotechnol"},{"issue":"3","key":"184_CR6","doi-asserted-by":"publisher","first-page":"290","DOI":"10.1038\/nbt.3122","volume":"33","author":"M Pertea","year":"2015","unstructured":"Pertea M, Pertea GM, Antonescu CM, Chang T-C, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33(3):290\u20135.","journal-title":"Nat Biotechnol"},{"issue":"1","key":"184_CR7","doi-asserted-by":"publisher","first-page":"213","DOI":"10.1186\/s13059-016-1074-1","volume":"17","author":"J Liu","year":"2016","unstructured":"Liu J, Yu T, Jiang T, Li G. TransComb: genome-guided transcriptome assembly via combing junctions in splicing graphs. Genome Biol. 2016;17(1):213.","journal-title":"Genome Biol"},{"issue":"12","key":"184_CR8","doi-asserted-by":"publisher","first-page":"1167","DOI":"10.1038\/nbt.4020","volume":"35","author":"M Shao","year":"2017","unstructured":"Shao M, Kingsford C. Accurate assembly of transcripts through phase-preserving graph decomposition. Nat Biotechnol. 2017;35(12):1167\u20139.","journal-title":"Nat Biotechnol"},{"issue":"3","key":"184_CR9","doi-asserted-by":"publisher","first-page":"396","DOI":"10.1101\/gr.222976.117","volume":"28","author":"M Tardaguila","year":"2018","unstructured":"Tardaguila M, De La Fuente L, Marti C, Pereira C, Pardo-Palacios FJ, Del Risco H, Ferrell M, Mellado M, Macchietto M, Verheggen K, et al. SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification. Genome Res. 2018;28(3):396\u2013411.","journal-title":"Genome Res"},{"issue":"2","key":"184_CR10","doi-asserted-by":"publisher","first-page":"658","DOI":"10.1109\/TCBB.2017.2779509","volume":"16","author":"M Shao","year":"2019","unstructured":"Shao M, Kingsford C. Theory and a heuristic for the minimum path flow decomposition problem. IEEE\/ACM Trans Comput Biol Bioinf. 2019;16(2):658\u201370.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"issue":"18","key":"184_CR11","doi-asserted-by":"publisher","first-page":"2300","DOI":"10.1093\/bioinformatics\/btt396","volume":"29","author":"LH LeGault","year":"2013","unstructured":"LeGault LH, Dewey CN. Inference of alternative splicing from RNA-Seq data with probabilistic splice graphs. Bioinformatics. 2013;29(18):2300\u201310.","journal-title":"Bioinformatics"},{"issue":"12","key":"184_CR12","doi-asserted-by":"publisher","first-page":"1009","DOI":"10.1038\/nmeth.1528","volume":"7","author":"Y Katz","year":"2010","unstructured":"Katz Y, Wang ET, Airoldi EM, Burge CB. Analysis and design of RNA sequencing experiments for identifying isoform regulation. Nat Methods. 2010;7(12):1009.","journal-title":"Nat Methods"},{"issue":"51","key":"184_CR13","doi-asserted-by":"publisher","first-page":"5593","DOI":"10.1073\/pnas.1419161111","volume":"111","author":"S Shen","year":"2014","unstructured":"Shen S, Park JW, Lu Z-X, Lin L, Henry MD, Wu YN, Zhou Q, Xing Y. rMATS: robust and flexible detection of differential alternative splicing from replicate RNA-Seq data. Proc Natl Acad Sci. 2014;111(51):5593\u2013601.","journal-title":"Proc Natl Acad Sci"},{"issue":"1","key":"184_CR14","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1186\/s13059-018-1417-1","volume":"19","author":"JL Trincado","year":"2018","unstructured":"Trincado JL, Entizne JC, Hysenaj G, Singh B, Skalic M, Elliott DJ, Eyras E. SUPPA2: fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions. Genome Biol. 2018;19(1):40.","journal-title":"Genome Biol"},{"issue":"24","key":"184_CR15","doi-asserted-by":"crossref","first-page":"3881","DOI":"10.1093\/bioinformatics\/btv483","volume":"31","author":"J Hensman","year":"2015","unstructured":"Hensman J, Papastamoulis P, Glaus P, Honkela A, Rattray M. Fast and accurate approximate inference of transcript expression from RNA-seq data. Bioinformatics. 2015;31(24):3881\u20139.","journal-title":"Bioinformatics"},{"key":"184_CR16","unstructured":"Pachter L. Models for transcript quantification from RNA-Seq. arXiv preprint arXiv:1104.3889 2011."},{"issue":"6","key":"184_CR17","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1145\/360825.360855","volume":"18","author":"AV Aho","year":"1975","unstructured":"Aho AV, Corasick MJ. Efficient string matching: an aid to bibliographic search. Commun ACM. 1975;18(6):333\u201340.","journal-title":"Commun ACM"},{"issue":"11","key":"184_CR18","doi-asserted-by":"publisher","first-page":"1179","DOI":"10.1038\/mp.2013.170","volume":"19","author":"N Akula","year":"2014","unstructured":"Akula N, Barb J, Jiang X, Wendland J, Choi K, Sen S, Hou L, Chen D, Laje G, Johnson K, et al. RNA-sequencing of the brain transcriptome implicates dysregulation of neuroplasticity, circadian rhythms and GTPase binding in bipolar disorder. Mol Psychiatry. 2014;19(11):1179\u201385.","journal-title":"Mol Psychiatry"},{"issue":"D1","key":"184_CR19","doi-asserted-by":"publisher","first-page":"766","DOI":"10.1093\/nar\/gky955","volume":"47","author":"A Frankish","year":"2018","unstructured":"Frankish A, Diekhans M, Ferreira A-M, Johnson R, Jungreis I, Loveland J, Mudge JM, Sisu C, Wright J, Armstrong J, et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2018;47(D1):766\u201373.","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"184_CR20","doi-asserted-by":"publisher","first-page":"669","DOI":"10.1016\/j.neuron.2015.01.009","volume":"85","author":"YC Yung","year":"2015","unstructured":"Yung YC, Stoddard NC, Mirendil H, Chun J. Lysophosphatidic acid signaling in the nervous system. Neuron. 2015;85(4):669\u201382.","journal-title":"Neuron"},{"issue":"17","key":"184_CR21","doi-asserted-by":"publisher","first-page":"2778","DOI":"10.1093\/bioinformatics\/btv272","volume":"31","author":"AC Frazee","year":"2015","unstructured":"Frazee AC, Jaffe AE, Langmead B, Leek JT. Polyester: simulating RNA-seq datasets with differential transcript expression. Bioinformatics. 2015;31(17):2778\u201384.","journal-title":"Bioinformatics"}],"container-title":["Algorithms for Molecular Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13015-021-00184-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13015-021-00184-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13015-021-00184-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,3]],"date-time":"2023-11-03T14:54:10Z","timestamp":1699023250000},"score":1,"resource":{"primary":{"URL":"https:\/\/almob.biomedcentral.com\/articles\/10.1186\/s13015-021-00184-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,10]]},"references-count":21,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["184"],"URL":"https:\/\/doi.org\/10.1186\/s13015-021-00184-7","relation":{},"ISSN":["1748-7188"],"issn-type":[{"value":"1748-7188","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,10]]},"assertion":[{"value":"10 February 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 April 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 May 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"C.K. is a co-founder of Ocean Genomics, Inc.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"5"}}