{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T04:32:26Z","timestamp":1773289946044,"version":"3.50.1"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"16","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,8,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Genome and transcriptome analyses can be used to explore cancers comprehensively, and it is increasingly common to have multiple omics data measured from each individual. Furthermore, there are rich functional data such as predicted impact of mutations on protein coding and gene\/protein networks. However, integration of the complex information across the different omics and functional data is still challenging. Clinical validation, particularly based on patient outcomes such as survival, is important for assessing the relevance of the integrated information and for comparing different procedures.<\/jats:p>\n               <jats:p>Results: An analysis pipeline is built for integrating genomic and transcriptomic alterations from whole-exome and RNA sequence data and functional data from protein function prediction and gene interaction networks. The method accumulates evidence for the functional implications of mutated potential driver genes found within and across patients. A driver-gene score (DGscore) is developed to capture the cumulative effect of such genes. To contribute to the score, a gene has to be frequently mutated, with high or moderate mutational impact at protein level, exhibiting an extreme expression and functionally linked to many differentially expressed neighbors in the functional gene network. The pipeline is applied to 60 matched tumor and normal samples of the same patient from The Cancer Genome Atlas breast-cancer project. In clinical validation, patients with high DGscores have worse survival than those with low scores (P\u2009=\u20090.001). Furthermore, the DGscore outperforms the established expression-based signatures MammaPrint and PAM50 in predicting patient survival. In conclusion, integration of mutation, expression and functional data allows identification of clinically relevant potential driver genes in cancer.<\/jats:p>\n               <jats:p>Availability and implementation: The documented pipeline including annotated sample scripts can be found in http:\/\/fafner.meb.ki.se\/biostatwiki\/driver-genes\/.<\/jats:p>\n               <jats:p>Contact: \u00a0yudi.pawitan@ki.se<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btv164","type":"journal-article","created":{"date-parts":[[2015,3,26]],"date-time":"2015-03-26T04:39:55Z","timestamp":1427344795000},"page":"2607-2613","source":"Crossref","is-referenced-by-count":45,"title":["Integration of somatic mutation, expression and functional data reveals potential driver genes predictive of breast cancer survival"],"prefix":"10.1093","volume":"31","author":[{"given":"Chen","family":"Suo","sequence":"first","affiliation":[{"name":"1 School of Life Sciences, Peking University, Beijing, China,"},{"name":"2 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden,"}]},{"given":"Olga","family":"Hrydziuszko","sequence":"additional","affiliation":[{"name":"2 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden,"}]},{"given":"Donghwan","family":"Lee","sequence":"additional","affiliation":[{"name":"3 Department of Statistics, Ewha Womans University, Seoul, South Korea,"}]},{"given":"Setia","family":"Pramana","sequence":"additional","affiliation":[{"name":"2 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden,"},{"name":"4 Department of Computational Statistics, Institute of Statistics, Jakarta, Indonesia and"}]},{"given":"Dhany","family":"Saputra","sequence":"additional","affiliation":[{"name":"2 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden,"}]},{"given":"Himanshu","family":"Joshi","sequence":"additional","affiliation":[{"name":"2 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden,"}]},{"given":"Stefano","family":"Calza","sequence":"additional","affiliation":[{"name":"2 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden,"},{"name":"5 Department of Molecular and Translational Medicine, University of Brescia, Brescia, Italy"}]},{"given":"Yudi","family":"Pawitan","sequence":"additional","affiliation":[{"name":"2 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden,"}]}],"member":"286","published-online":{"date-parts":[[2015,3,24]]},"reference":[{"key":"2023020202210207700_btv164-B1","doi-asserted-by":"crossref","first-page":"1005","DOI":"10.1016\/j.cell.2010.11.013","article-title":"An integrated approach to uncover drivers of cancer","volume":"143","author":"Akavia","year":"2010","journal-title":"Cell"},{"key":"2023020202210207700_btv164-B2","first-page":"1109","article-title":"Global networks of functional coupling in eukaryotes from comprehensive data integration","volume":"19","author":"Alexeyenko","year":"2009","journal-title":"Genomes Res."},{"key":"2023020202210207700_btv164-B3","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1186\/1471-2105-13-226","article-title":"Network enrichment analysis: extension of gene-set enrichment analysis to gene networks","volume":"13","author":"Alexeyenko","year":"2012","journal-title":"BMC Bioinformatics"},{"key":"2023020202210207700_btv164-B5","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1038\/nature11412","article-title":"Comprehensive molecular portraits of human breast tumours","volume":"490","author":"Cancer Genome Atlas Network","year":"2012","journal-title":"Nature"},{"key":"2023020202210207700_btv164-B6","doi-asserted-by":"crossref","first-page":"3759","DOI":"10.1182\/blood-2010-08-299917","article-title":"Origin, functional role, and clinical impact of Fanconi anemia FANCA mutations","volume":"117","author":"Castella","year":"2011","journal-title":"Blood"},{"key":"2023020202210207700_btv164-B7","first-page":"80","article-title":"A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3","volume":"6","author":"Cingolani","year":"2012","journal-title":"Fly(Austin)"},{"key":"2023020202210207700_btv164-B8","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1101\/gr.125567.111","article-title":"Mutual exclusivity analysis identifies oncogenic network modules","volume":"22","author":"Ciriello","year":"2012","journal-title":"Genome Res."},{"key":"2023020202210207700_btv164-B9","doi-asserted-by":"crossref","first-page":"1169","DOI":"10.1016\/S1470-2045(11)70123-1","article-title":"Evolution of platinum resistance in high-grade serous ovarian cancer","volume":"12","author":"Cooke","year":"2011","journal-title":"Lancet Oncol."},{"key":"2023020202210207700_btv164-B10","doi-asserted-by":"crossref","first-page":"3126","DOI":"10.1074\/jbc.M112.403279","article-title":"Type 10 soluble adenylyl cyclase is overexpressed in prostate carcinoma and controls proliferation of prostate cancer cells","volume":"288","author":"Flacke","year":"2013","journal-title":"J. Biol. Chem."},{"key":"2023020202210207700_btv164-B11","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1038\/nrc1299","article-title":"A cencus of human cancer genes","volume":"4","author":"Futreal","year":"2004","journal-title":"Nat. Rev. Cancer"},{"key":"2023020202210207700_btv164-B12","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1200\/JCO.2010.33.0068","article-title":"Breast cancer prognosis in BRCA1 and BRCA2 mutation carriers: an international prospective breast cancer family registry population based cohort study","volume":"30","author":"Goodwin","year":"2012","journal-title":"J. Clin. Oncol."},{"key":"2023020202210207700_btv164-B13","article-title":"genefu: Relevant Functions for Gene Expression Analysis, Especially in Breast Cancer","author":"Haibe-Kains","year":"2012"},{"key":"2023020202210207700_btv164-B14","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1186\/s13073-014-0056-8","article-title":"DawnRank: discovering personalized driver genes in caner","volume":"6","author":"Hou","year":"2014","journal-title":"Genome Med."},{"key":"2023020202210207700_btv164-B15","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1186\/1755-8794-6-53","article-title":"Integrated molecular portrait of non-small cell lung cancers","volume":"6","author":"Lazar","year":"2013","journal-title":"BMC Med. Genomics"},{"key":"2023020202210207700_btv164-B16","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1093\/carcin\/21.3.379","article-title":"Significance of multiple mutations in cancer","volume":"21","author":"Loeb","year":"2000","journal-title":"Carcinogensis"},{"key":"2023020202210207700_btv164-B17","doi-asserted-by":"crossref","first-page":"1297","DOI":"10.1101\/gr.107524.110","article-title":"The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data","volume":"20","author":"McKenna","year":"2010","journal-title":"Genome Res."},{"key":"2023020202210207700_btv164-B18","doi-asserted-by":"crossref","first-page":"e54455","DOI":"10.1371\/journal.pone.0054455","article-title":"MUC4 overexpression augments cell migration and metastasis through EGFR family proteins in triple negativebreast cancer cells","volume":"8","author":"Mukhopadhyay","year":"2013","journal-title":"PLoS One"},{"key":"2023020202210207700_btv164-B19","doi-asserted-by":"crossref","first-page":"3379","DOI":"10.1093\/hmg\/ddi369","article-title":"Detecting tissue-specific alternative splicing and disease-associated aberrant splicing of the PTCH gene with exon junction microarrays","volume":"14","author":"Nagao","year":"2005","journal-title":"Hum. Mol. Genet."},{"key":"2023020202210207700_btv164-B20","first-page":"259","article-title":"Prognostic significance of Ki-67 in chemotherapy-naive breast cancer patients with 10-year follow-up","volume":"34","author":"Nishimiya","year":"2014","journal-title":"Anticancer Res."},{"key":"2023020202210207700_btv164-B21","doi-asserted-by":"crossref","first-page":"1160","DOI":"10.1200\/JCO.2008.18.1370","article-title":"Supervised predictor of breast cancer based on intrinsic subtypes","volume":"27","author":"Parker","year":"2009","journal-title":"J. Clin. Oncol."},{"key":"2023020202210207700_btv164-B22","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1083\/jcb.201112132","article-title":"PARP1 promotes nucleotide excision repair through DDB2 stabilization and recruitment of ALC1","volume":"199","author":"Pines","year":"2012","journal-title":"J. Cell. Biol."},{"key":"2023020202210207700_btv164-B23","doi-asserted-by":"crossref","first-page":"e76496","DOI":"10.1371\/journal.pone.0076496","article-title":"Genome instability: a stronger prognostic marker than proliferation for early stage luminal breast carcinomas","volume":"8","author":"Salomon","year":"2013","journal-title":"PLoS One"},{"key":"2023020202210207700_btv164-B24","doi-asserted-by":"crossref","first-page":"1461","DOI":"10.1016\/j.cell.2014.10.048","article-title":"Integration of genomic data enables selective discovery of breast cancer drivers","volume":"159","author":"Sanchez-Garcia","year":"2014","journal-title":"Cell"},{"key":"2023020202210207700_btv164-B25","doi-asserted-by":"crossref","first-page":"506","DOI":"10.1093\/bioinformatics\/btt704","article-title":"Joint estimation of isoform expression and isoform-specific read distribution using multi-sample RNA-seq data","volume":"30","author":"Suo","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020202210207700_btv164-B26","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/j.canlet.2013.12.026","article-title":"BRCA1 deficiency induces protective autophagy to mitigate stress and provides a mechanism for BRCA1 haploinsufficiency in tumorigenesis","volume":"346","author":"Tang","year":"2014","journal-title":"Cancer Lett."},{"key":"2023020202210207700_btv164-B27","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/415530a","article-title":"Gene expression profiling predicts clinical outcome of breast cancer","volume":"415","author":"van \u2019t Veer","year":"2002","journal-title":"Nature"},{"key":"2023020202210207700_btv164-B28","doi-asserted-by":"crossref","first-page":"382","DOI":"10.1016\/j.cell.2014.01.066","article-title":"Mutant p53 drives pancreatic cancer metastasis through cell-autonomous PDGF receptor \u03b2 signalling","volume":"157","author":"Weissmueller","year":"2014","journal-title":"Cell"},{"key":"2023020202210207700_btv164-B29","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1093\/bioinformatics\/btq630","article-title":"Identifying cancer driver genes in tumor genome sequencing studies","volume":"27","author":"Youn","year":"2011","journal-title":"Bioinformatics"},{"key":"2023020202210207700_btv164-B30","doi-asserted-by":"crossref","first-page":"e1001114","DOI":"10.1371\/journal.pcbi.1001114","article-title":"CAERUS: Predicting CAncER oUtcomeS using relationship between protein structural information, protein networks, gene expression data, and mutation data","volume":"7","author":"Zhang","year":"2011","journal-title":"PLoS Comput. Biol."},{"key":"2023020202210207700_btv164-B31","doi-asserted-by":"crossref","DOI":"10.1093\/database\/bar026","article-title":"International Cancer Genome Consortium Data Portal - a one-stop shop for cancer genomics data","author":"Zhang","year":"2011","journal-title":"Database"},{"key":"2023020202210207700_btv164-B32","first-page":"2482","article-title":"Exome sequencing identifies compound heterozygous PKHD1 mutations as a cause of autosomal recessive polycystic kidney disease","volume":"125","author":"Zhang","year":"2012","journal-title":"Chin. Med. J."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/16\/2607\/49035050\/bioinformatics_31_16_2607.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/16\/2607\/49035050\/bioinformatics_31_16_2607.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T03:45:17Z","timestamp":1675309517000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/31\/16\/2607\/320717"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,3,24]]},"references-count":31,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2015,8,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btv164","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,8,15]]},"published":{"date-parts":[[2015,3,24]]}}}