{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T16:05:25Z","timestamp":1770739525585,"version":"3.49.0"},"reference-count":17,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":2433,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,3,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Next-generation sequencing (NGS) has enabled whole genome and transcriptome single nucleotide variant (SNV) discovery in cancer. NGS produces millions of short sequence reads that, once aligned to a reference genome sequence, can be interpreted for the presence of SNVs. Although tools exist for SNV discovery from NGS data, none are specifically suited to work with data from tumors, where altered ploidy and tumor cellularity impact the statistical expectations of SNV discovery.<\/jats:p>\n               <jats:p>Results: We developed three implementations of a probabilistic Binomial mixture model, called SNVMix, designed to infer SNVs from NGS data from tumors to address this problem. The first models allelic counts as observations and infers SNVs and model parameters using an expectation maximization (EM) algorithm and is therefore capable of adjusting to deviation of allelic frequencies inherent in genomically unstable tumor genomes. The second models nucleotide and mapping qualities of the reads by probabilistically weighting the contribution of a read\/nucleotide to the inference of a SNV based on the confidence we have in the base call and the read alignment. The third combines filtering out low-quality data in addition to probabilistic weighting of the qualities. We quantitatively evaluated these approaches on 16 ovarian cancer RNASeq datasets with matched genotyping arrays and a human breast cancer genome sequenced to &amp;gt;40\u00d7 (haploid) coverage with ground truth data and show systematically that the SNVMix models outperform competing approaches.<\/jats:p>\n               <jats:p>Availability: Software and data are available at http:\/\/compbio.bccrc.ca<\/jats:p>\n               <jats:p>Contact: \u00a0sshah@bccrc.ca<\/jats:p>\n               <jats:p>Supplemantary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq040","type":"journal-article","created":{"date-parts":[[2010,2,4]],"date-time":"2010-02-04T01:55:22Z","timestamp":1265248522000},"page":"730-736","source":"Crossref","is-referenced-by-count":182,"title":["SNVMix: predicting single nucleotide variants from next-generation sequencing of tumors"],"prefix":"10.1093","volume":"26","author":[{"given":"Rodrigo","family":"Goya","sequence":"first","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"},{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Mark G.F.","family":"Sun","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Ryan D.","family":"Morin","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Gillian","family":"Leung","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Gavin","family":"Ha","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Kimberley C.","family":"Wiegand","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"},{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Janine","family":"Senz","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"},{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Anamaria","family":"Crisan","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Marco A.","family":"Marra","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Martin","family":"Hirst","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"David","family":"Huntsman","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"},{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Kevin P.","family":"Murphy","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Sam","family":"Aparicio","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]},{"given":"Sohrab P.","family":"Shah","sequence":"additional","affiliation":[{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"},{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"},{"name":"1 Department of Molecular Oncology Breast Cancer Research Program, British Columbia Cancer Research Centre, 2 Genome Sciences Centre, British Columbia Cancer Agency, 3 Centre for Translational and Applied Genomics of British Columbia Cancer Agency, 4 Provincial Health Services Authority Laboratories and 5 Department of Computer Science, University of British Columbia, Vancouver, BC, Canada"}]}],"member":"286","published-online":{"date-parts":[[2010,2,3]]},"reference":[{"key":"2023012508015648400_B1","doi-asserted-by":"crossref","first-page":"1801","DOI":"10.1126\/science.1164368","article-title":"Core signaling pathways in human pancreatic cancers revealed by global genomic analyses","volume":"321","author":"Jones","year":"2008","journal-title":"Science"},{"key":"2023012508015648400_B2","doi-asserted-by":"crossref","first-page":"R25","DOI":"10.1186\/gb-2009-10-3-r25","article-title":"Ultrafast and memory-efficient alignment of short DNA sequences to the human genome","volume":"10","author":"Langmead","year":"2009","journal-title":"Genome Biol."},{"key":"2023012508015648400_B3","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1038\/nature07485","article-title":"DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome","volume":"456","author":"Ley","year":"2008","journal-title":"Nature"},{"key":"2023012508015648400_B4","doi-asserted-by":"crossref","first-page":"1851","DOI":"10.1101\/gr.078212.108","article-title":"Mapping short DNA sequencing reads and calling variants using mapping quality scores","volume":"18","author":"Li","year":"2008","journal-title":"Genome Res."},{"key":"2023012508015648400_B5","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short read alignment with Burrows-Wheeler transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012508015648400_B6","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The Sequence Alignment\/Map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012508015648400_B7","doi-asserted-by":"crossref","first-page":"713","DOI":"10.1093\/bioinformatics\/btn025","article-title":"SOAP: short oligonucleotide alignment program","volume":"24","author":"Li","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508015648400_B8","doi-asserted-by":"crossref","first-page":"R63","DOI":"10.1186\/gb-2008-9-4-r63","article-title":"Validation and extension of an empirical Bayes method for SNP calling on Affymetrix microarrays","volume":"9","author":"Lin","year":"2008","journal-title":"Genome Biol."},{"key":"2023012508015648400_B9","doi-asserted-by":"crossref","first-page":"1058","DOI":"10.1056\/NEJMoa0903840","article-title":"Recurring mutations found by sequencing an acute myeloid leukemia genome","volume":"361","author":"Mardis","year":"2009","journal-title":"N. Engl. J. Med."},{"key":"2023012508015648400_B10","doi-asserted-by":"crossref","first-page":"1509","DOI":"10.1101\/gr.079558.108","article-title":"RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays","volume":"18","author":"Marioni","year":"2008","journal-title":"Genome Res."},{"key":"2023012508015648400_B11","doi-asserted-by":"crossref","first-page":"81","DOI":"10.2144\/000112900","article-title":"Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing","volume":"45","author":"Morin","year":"2008","journal-title":"BioTechniques"},{"key":"2023012508015648400_B12","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1038\/nmeth.1226","article-title":"Mapping and quantifying mammalian transcriptomes by RNA-seq","volume":"5","author":"Mortazavi","year":"2008","journal-title":"Nat. Methods"},{"key":"2023012508015648400_B13","doi-asserted-by":"crossref","first-page":"e1000386","DOI":"10.1371\/journal.pcbi.1000386","article-title":"SHRiMP: accurate mapping of short color-space reads","volume":"5","author":"Rumble","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2023012508015648400_B14","doi-asserted-by":"crossref","first-page":"2719","DOI":"10.1056\/NEJMoa0902542","article-title":"Mutation of FOXL2 in granulosa-cell tumors of the ovary","volume":"360","author":"Shah","year":"2009","journal-title":"New Engl J. Med."},{"key":"2023012508015648400_B15","doi-asserted-by":"crossref","first-page":"809","DOI":"10.1038\/nature08489","article-title":"Mutational evolution in a lobular breast tumor profiled at single nucleotide resolution","volume":"461","author":"Shah","year":"2009","journal-title":"Nature"},{"key":"2023012508015648400_B16","doi-asserted-by":"crossref","first-page":"1135","DOI":"10.1038\/nbt1486","article-title":"Next-generation DNA sequencing","volume":"26","author":"Shendure","year":"2008","journal-title":"Nat. Biotechnol."},{"key":"2023012508015648400_B17","doi-asserted-by":"crossref","first-page":"719","DOI":"10.1038\/nature07943","article-title":"The cancer genome","volume":"458","author":"Stratton","year":"2009","journal-title":"Nature"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/6\/730\/48855386\/bioinformatics_26_6_730.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/6\/730\/48855386\/bioinformatics_26_6_730.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:02:40Z","timestamp":1674633760000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/6\/730\/245170"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,2,3]]},"references-count":17,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2010,3,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq040","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,3,15]]},"published":{"date-parts":[[2010,2,3]]}}}