{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T01:44:50Z","timestamp":1774575890683,"version":"3.50.1"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2020,4,13]],"date-time":"2020-04-13T00:00:00Z","timestamp":1586736000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Next-generation sequencing technologies have accelerated the discovery of single nucleotide variants in the human genome, stimulating the development of predictors for classifying which of these variants are likely functional in disease, and which neutral. Recently, we proposed CScape, a method for discriminating between cancer driver mutations and presumed benign variants. For the neutral class, this method relied on benign germline variants found in the 1000 Genomes Project database. Discrimination could, therefore, be influenced by the distinction of germline versus somatic, rather than neutral versus disease driver. This motivates this article in which we consider predictive discrimination between recurrent and rare somatic single point mutations based solely on using cancer data, and the distinction between these two somatic classes and germline single point mutations.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>For somatic point mutations in coding and non-coding regions of the genome, we propose CScape-somatic, an integrative classifier for predictively discriminating between recurrent and rare variants in the human cancer genome. In this study, we use purely cancer genome data and investigate the distinction between minimal occurrence and significantly recurrent somatic single point mutations in the human cancer genome. We show that this type of predictive distinction can give novel insight, and may deliver more meaningful prediction in both coding and non-coding regions of the cancer genome. Tested on somatic mutations, CScape-somatic outperforms alternative methods, reaching 74% balanced accuracy in coding regions and 69% in non-coding regions, whereas even higher accuracy may be achieved using thresholds to isolate high-confidence predictions.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Predictions and software are available at http:\/\/CScape-somatic.biocompute.org.uk\/.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Contact<\/jats:title>\n                  <jats:p>mark.f.rogers.phd@gmail.com or C.Campbell@bristol.ac.uk<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa242","type":"journal-article","created":{"date-parts":[[2020,4,7]],"date-time":"2020-04-07T22:01:57Z","timestamp":1586296917000},"page":"3637-3644","source":"Crossref","is-referenced-by-count":33,"title":["<i>CScape-somatic<\/i>: distinguishing driver and passenger point mutations in the cancer genome"],"prefix":"10.1093","volume":"36","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7886-0674","authenticated-orcid":false,"given":"Mark F","family":"Rogers","sequence":"first","affiliation":[{"name":"Intelligent Systems Laboratory , University of Bristol, Bristol BS8 1UB, UK"}]},{"given":"Tom R","family":"Gaunt","sequence":"additional","affiliation":[{"name":"MRC Integrative Epidemiology Unit (IEU) , University of Bristol, Bristol BS8 2BN, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6485-6014","authenticated-orcid":false,"given":"Colin","family":"Campbell","sequence":"additional","affiliation":[{"name":"Intelligent Systems Laboratory , University of Bristol, Bristol BS8 1UB, UK"}]}],"member":"286","published-online":{"date-parts":[[2020,4,13]]},"reference":[{"key":"2023063010284100800_btaa242-B1","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1038\/nmeth0410-248","article-title":"A method and server for predicting damaging missense mutations","volume":"7","author":"Adzhubei","year":"2010","journal-title":"Nat. Methods"},{"key":"2023063010284100800_btaa242-B2","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1038\/nature12477","article-title":"Signatures of mutational processes in human cancer","volume":"500","author":"Alexandrov","year":"2013","journal-title":"Nature"},{"key":"2023063010284100800_btaa242-B3","doi-asserted-by":"crossref","first-page":"18545","DOI":"10.1073\/pnas.1010978107","article-title":"Accumulation of driver and passenger mutations during tumor progression","volume":"107","author":"Bozic","year":"2010","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023063010284100800_btaa242-B4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41598-019-48765-2","article-title":"Estimating the frequency of single point driver mutations across common solid tumours","volume":"9","author":"Darbyshire","year":"2019","journal-title":"Sci. Rep"},{"key":"2023063010284100800_btaa242-B5","doi-asserted-by":"crossref","first-page":"D652","DOI":"10.1093\/nar\/gkp995","article-title":"COSMIC: mining complete cancer genomes in the catalogue of somatic mutations in cancer","volume":"38","author":"Forbes","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2023063010284100800_btaa242-B6","doi-asserted-by":"crossref","first-page":"480","DOI":"10.1186\/s13059-014-0480-5","article-title":"FunSeq2: a framework for prioritizing noncoding regulatory variants in cancer","volume":"15","author":"Fu","year":"2014","journal-title":"Genome Biol"},{"key":"2023063010284100800_btaa242-B7","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1038\/nrc1299","article-title":"A census of human cancer genes","volume":"4","author":"Futreal","year":"2004","journal-title":"Nat. Rev. Cancer"},{"key":"2023063010284100800_btaa242-B8","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1186\/gm390","article-title":"Improving the prediction of the functional impact of cancer mutations by baseline tolerance transformation","volume":"4","author":"Gonzalez-Perez","year":"2012","journal-title":"Genome Med"},{"key":"2023063010284100800_btaa242-B9","doi-asserted-by":"crossref","first-page":"959","DOI":"10.1126\/science.1230062","article-title":"TERT promoter mutations in familial and sporadic melanoma","volume":"339","author":"Horn","year":"2013","journal-title":"Science"},{"key":"2023063010284100800_btaa242-B10","doi-asserted-by":"crossref","first-page":"957","DOI":"10.1126\/science.1229259","article-title":"Highly recurrent TERT promoter mutations in human melanoma","volume":"339","author":"Huang","year":"2013","journal-title":"Science"},{"key":"2023063010284100800_btaa242-B11","doi-asserted-by":"crossref","first-page":"e22769","DOI":"10.1371\/journal.pone.0022769","article-title":"PIK3CA mutations frequently coexist with RAS and BRAF mutations in patients with advanced cancers","volume":"6","author":"Janku","year":"2011","journal-title":"PLoS One"},{"key":"2023063010284100800_btaa242-B12","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1038\/ng.2892","article-title":"A general framework for estimating the relative pathogenicity of human genetic variants","volume":"46","author":"Kircher","year":"2014","journal-title":"Nat. Genet"},{"key":"2023063010284100800_btaa242-B13","doi-asserted-by":"crossref","first-page":"1073","DOI":"10.1038\/nprot.2009.86","article-title":"Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm","volume":"4","author":"Kumar","year":"2009","journal-title":"Nat. Protoc"},{"key":"2023063010284100800_btaa242-B14","first-page":"564","article-title":"The spectrum kernel: a string kernel for SVM protein classification","volume":"7","author":"Leslie","year":"2002","journal-title":"Pac. Symp. Biocomput"},{"key":"2023063010284100800_btaa242-B15","doi-asserted-by":"crossref","first-page":"134","DOI":"10.1136\/jmedgenet-2016-104369","article-title":"The performance of deleteriousness prediction scores for rare non-protein-changing single nucleotide variants in human genes","volume":"54","author":"Liu","year":"2017","journal-title":"J. Med. Genet"},{"key":"2023063010284100800_btaa242-B16","doi-asserted-by":"crossref","first-page":"1483","DOI":"10.1126\/science.aab4082","article-title":"Somatic mutation in cancer and normal cells","volume":"349","author":"Martincorena","year":"2015","journal-title":"Science"},{"key":"2023063010284100800_btaa242-B17","doi-asserted-by":"crossref","first-page":"1029","DOI":"10.1016\/j.cell.2017.09.042","article-title":"Universal patterns of selection in cancer and somatic tissues","volume":"171","author":"Martincorena","year":"2017","journal-title":"Cell"},{"key":"2023063010284100800_btaa242-B18","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1038\/nmeth.2891","article-title":"Cancer genomes: discerning drivers from passengers","volume":"11","author":"Marx","year":"2014","journal-title":"Nat. Methods"},{"key":"2023063010284100800_btaa242-B19","doi-asserted-by":"crossref","first-page":"15138","DOI":"10.1073\/pnas.1404341111","article-title":"Tug-of-war between driver and passenger mutations in cancer and other adaptive processes","volume":"111","author":"McFarland","year":"2014","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023063010284100800_btaa242-B20","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1186\/s13059-016-0974-4","article-title":"The ensembl variant effect predictor","volume":"17","author":"McLaren","year":"2016","journal-title":"Genome Biol"},{"key":"2023063010284100800_btaa242-B21","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1101\/gr.097857.109","article-title":"Detection of non-neutral substitution rates on mammalian phylogenies","volume":"20","author":"Pollard","year":"2010","journal-title":"Genome Res"},{"key":"2023063010284100800_btaa242-B22","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1146\/annurev-pathol-012414-040312","article-title":"Driver and passenger mutations in cancer","volume":"10","author":"Pon","year":"2015","journal-title":"Annu. Rev. Pathol. Mech. Dis"},{"key":"2023063010284100800_btaa242-B23","doi-asserted-by":"crossref","first-page":"761","DOI":"10.1093\/bioinformatics\/btu703","article-title":"DANN: a deep learning approach for annotating the pathogenicity of genetic variants","volume":"31","author":"Quang","year":"2015","journal-title":"Bioinformatics"},{"key":"2023063010284100800_btaa242-B24","doi-asserted-by":"crossref","first-page":"e118","DOI":"10.1093\/nar\/gkr407","article-title":"Predicting the functional impact of protein mutations: application to cancer genomics","volume":"39","author":"Reva","year":"2011","journal-title":"Nucleic Acids Res"},{"key":"2023063010284100800_btaa242-B25","author":"Rheinbay","year":"2017"},{"key":"2023063010284100800_btaa242-B26","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1038\/nmeth.2832","article-title":"Functional annotation of noncoding sequence variants","volume":"11","author":"Ritchie","year":"2014","journal-title":"Nat. Methods"},{"key":"2023063010284100800_btaa242-B27","first-page":"639","author":"Rogers","year":"2015"},{"key":"2023063010284100800_btaa242-B28","doi-asserted-by":"crossref","first-page":"11597","DOI":"10.1038\/s41598-017-11746-4","article-title":"CScape: a tool for predicting oncogenic single-point mutations in the cancer genome","volume":"7","author":"Rogers","year":"2017","journal-title":"Sci. Rep"},{"key":"2023063010284100800_btaa242-B29","doi-asserted-by":"crossref","first-page":"511","DOI":"10.1093\/bioinformatics\/btx536","article-title":"FATHMM-XF: accurate prediction of pathogenic point mutations via extended features","volume":"34","author":"Rogers","year":"2017","journal-title":"Bioinformatics"},{"key":"2023063010284100800_btaa242-B30","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1002\/humu.22225","article-title":"Predicting the functional, molecular and phenotypic consequences of amino acid substitutions using hidden Markov models","volume":"34","author":"Shihab","year":"2013","journal-title":"Hum. Mutat"},{"key":"2023063010284100800_btaa242-B31","doi-asserted-by":"crossref","first-page":"1536","DOI":"10.1093\/bioinformatics\/btv009","article-title":"An integrative approach to predicting the functional effects of non-coding and coding sequence variation","volume":"31","author":"Shihab","year":"2015","journal-title":"Bioinformatics"},{"key":"2023063010284100800_btaa242-B32","doi-asserted-by":"crossref","first-page":"1034","DOI":"10.1101\/gr.3715005","article-title":"Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes","volume":"15","author":"Siepel","year":"2005","journal-title":"Genome Res"},{"key":"2023063010284100800_btaa242-B33","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s00439-013-1358-4","article-title":"The human gene mutation database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine","volume":"133","author":"Stenson","year":"2014","journal-title":"Hum. Genet"},{"key":"2023063010284100800_btaa242-B34","doi-asserted-by":"crossref","first-page":"719","DOI":"10.1038\/nature07943","article-title":"The cancer genome","volume":"458","author":"Stratton","year":"2009","journal-title":"Nature"},{"key":"2023063010284100800_btaa242-B35","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nature11632","article-title":"An integrated map of genetic variation from 1,092 human genomes","volume":"491","year":"2012","journal-title":"Nature"},{"key":"2023063010284100800_btaa242-B36","doi-asserted-by":"crossref","first-page":"1160","DOI":"10.1038\/ng.3101","article-title":"Genome-wide analysis of noncoding regulatory mutations in cancer","volume":"46","author":"Weinhold","year":"2014","journal-title":"Nat. Genet"},{"key":"2023063010284100800_btaa242-B37","doi-asserted-by":"crossref","first-page":"1113","DOI":"10.1038\/ng.2764","article-title":"The cancer genome Atlas Pan-cancer analysis project","volume":"45","author":"Weinstein","year":"2013","journal-title":"Nat. Genet"},{"key":"2023063010284100800_btaa242-B38","doi-asserted-by":"crossref","first-page":"2147","DOI":"10.1093\/bioinformatics\/btr357","article-title":"CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer","volume":"27","author":"Wong","year":"2011","journal-title":"Bioinformatics"},{"key":"2023063010284100800_btaa242-B39","doi-asserted-by":"crossref","first-page":"bar026","DOI":"10.1093\/database\/bar026","article-title":"International cancer genome consortium data portal \u2013 a one-stop shop for cancer genomics data","volume":"2011","author":"Zhang","year":"2011","journal-title":"Database"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa242\/33152989\/btaa242.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/12\/3637\/50746777\/bioinformatics_36_12_3637.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/12\/3637\/50746777\/bioinformatics_36_12_3637.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T10:29:21Z","timestamp":1688120961000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/12\/3637\/5819545"}},"subtitle":[],"editor":[{"given":"Peter","family":"Robinson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,4,13]]},"references-count":39,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2020,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa242","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,6,15]]},"published":{"date-parts":[[2020,4,13]]}}}