{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,4]],"date-time":"2025-10-04T18:32:07Z","timestamp":1759602727860,"version":"3.37.3"},"reference-count":21,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2020,7,13]],"date-time":"2020-07-13T00:00:00Z","timestamp":1594598400000},"content-version":"vor","delay-in-days":12,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100003006","name":"ETH Z\u00fcrich","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100003006","id-type":"DOI","asserted-by":"publisher"}]},{"name":"PHRT","award":["#106"],"award-info":[{"award-number":["#106"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Understanding the underlying mutational processes of cancer patients has been a long-standing goal in the community and promises to provide new insights that could improve cancer diagnoses and treatments. Mutational signatures are summaries of the mutational processes, and improving the derivation of mutational signatures can yield new discoveries previously obscured by technical and biological confounders. Results from existing mutational signature extraction methods depend on the size of available patient cohort and solely focus on the analysis of mutation count data without considering the exploitation of metadata.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Here we present a supervised method that utilizes cancer type as metadata to extract more distinctive signatures. More specifically, we use a negative binomial non-negative matrix factorization and add a support vector machine loss. We show that mutational signatures extracted by our proposed method have a lower reconstruction error and are designed to be more predictive of cancer type than those generated by unsupervised methods. This design reduces the need for elaborate post-processing strategies in order to recover most of the known signatures unlike the existing unsupervised signature extraction methods. Signatures extracted by a supervised model used in conjunction with cancer-type labels are also more robust, especially when using small and potentially cancer-type limited patient cohorts. Finally, we adapted our model such that molecular features can be utilized to derive an according mutational signature. We used APOBEC expression and MUTYH mutation status to demonstrate the possibilities that arise from this ability. We conclude that our method, which exploits available metadata, improves the quality of mutational signatures as well as helps derive more interpretable representations.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>https:\/\/github.com\/ratschlab\/SNBNMF-mutsig-public.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa473","type":"journal-article","created":{"date-parts":[[2020,5,4]],"date-time":"2020-05-04T19:13:41Z","timestamp":1588619621000},"page":"i154-i160","source":"Crossref","is-referenced-by-count":6,"title":["Mutational signature learning with supervised negative binomial non-negative matrix factorization"],"prefix":"10.1093","volume":"36","author":[{"given":"Xinrui","family":"Lyu","sequence":"first","affiliation":[{"name":"Department of Computer Science"}]},{"given":"Jean","family":"Garret","sequence":"additional","affiliation":[{"name":"Department of Mathematics, ETH Z\u00fcrich , Z\u00fcrich 8092, Switzerland"}]},{"given":"Gunnar","family":"R\u00e4tsch","sequence":"additional","affiliation":[{"name":"Department of Computer Science"},{"name":"Swiss , Lausanne 1015, Switzerland"},{"name":"Institute for Bioinformatics , Lausanne 1015, Switzerland"},{"name":"University Hospital Zurich , Z\u00fcrich 8091, Switzerland"},{"name":"Department , Z\u00fcrich 8093, Switzerland"},{"name":"of Biology, ETH Z\u00fcrich , Z\u00fcrich 8093, Switzerland"},{"name":"Center Switzerland"},{"name":"for Learning Systems, ETH Z\u00fcrich Switzerland"}]},{"given":"Kjong-Van","family":"Lehmann","sequence":"additional","affiliation":[{"name":"Department of Computer Science"},{"name":"Swiss , Lausanne 1015, Switzerland"},{"name":"Institute for Bioinformatics , Lausanne 1015, Switzerland"},{"name":"University Hospital Zurich , Z\u00fcrich 8091, Switzerland"}]}],"member":"286","published-online":{"date-parts":[[2020,7,13]]},"reference":[{"key":"2024021913321570800_btaa473-B1","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1016\/j.celrep.2012.12.008","article-title":"Deciphering signatures of mutational processes operative in human cancer","volume":"3","author":"Alexandrov","year":"2013","journal-title":"Cell Rep"},{"key":"2024021913321570800_btaa473-B2","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1038\/nature12477","article-title":"Signatures of mutational processes in human cancer","volume":"500","author":"Alexandrov","year":"2013","journal-title":"Nature"},{"key":"2024021913321570800_btaa473-B3","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1038\/s41586-020-1943-3","article-title":"The repertoire of mutational signatures in human cancer","volume":"578","author":"Alexandrov","year":"2020","journal-title":"Nature"},{"key":"2024021913321570800_btaa473-B4","doi-asserted-by":"crossref","first-page":"977","DOI":"10.1038\/ng.2701","article-title":"Evidence for APOBEC3b mutagenesis in multiple human cancers","volume":"45","author":"Burns","year":"2013","journal-title":"Nat. Genet"},{"key":"2024021913321570800_btaa473-B5","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1038\/s41586-020-1970-0","article-title":"Genomic basis for RNA alterations in cancer","volume":"578","author":"Calabrese","year":"2020","journal-title":"Nature"},{"key":"2024021913321570800_btaa473-B6","doi-asserted-by":"crossref","first-page":"e67","DOI":"10.1093\/nar\/gks1443","article-title":"Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation","volume":"41","author":"Costello","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2024021913321570800_btaa473-B7","doi-asserted-by":"crossref","first-page":"2421","DOI":"10.1162\/NECO_a_00168","article-title":"Algorithms for nonnegative matrix factorization with the \u03b2-divergence","volume":"23","author":"F\u00e9votte","year":"2011","journal-title":"Neural Comput"},{"year":"2020","author":"Gouvert","key":"2024021913321570800_btaa473-B8"},{"key":"2024021913321570800_btaa473-B9","doi-asserted-by":"crossref","first-page":"585","DOI":"10.1038\/nrg3729","article-title":"Mechanisms underlying mutational signatures in human cancers","volume":"15","author":"Helleday","year":"2014","journal-title":"Nat. Rev. Genet"},{"key":"2024021913321570800_btaa473-B10","doi-asserted-by":"crossref","first-page":"D964","DOI":"10.1093\/nar\/gkx1133","article-title":"msignaturedb: a database for deciphering mutational signatures in human cancers","volume":"46","author":"Huang","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024021913321570800_btaa473-B11","doi-asserted-by":"crossref","first-page":"D1062","DOI":"10.1093\/nar\/gkx1153","article-title":"Clinvar: improving access to variant interpretations and supporting evidence","volume":"46","author":"Landrum","year":"2018","journal-title":"Nucleic Acids Res"},{"first-page":"556","year":"2001","author":"Lee","key":"2024021913321570800_btaa473-B12"},{"key":"2024021913321570800_btaa473-B13","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with deseq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2024021913321570800_btaa473-B14","doi-asserted-by":"crossref","first-page":"3292","DOI":"10.1038\/s41467-018-05228-y","article-title":"The therapeutic significance of mutational signatures from DNA repair deficiency in cancer","volume":"9","author":"Ma","year":"2018","journal-title":"Nat. Commun"},{"key":"2024021913321570800_btaa473-B15","first-page":"1","volume-title":"Intrinsic Immunity","author":"Refsland","year":"2013"},{"key":"2024021913321570800_btaa473-B16","doi-asserted-by":"crossref","first-page":"970","DOI":"10.1038\/ng.2702","article-title":"An APOBEC cytidine deaminase mutagenesis pattern is widespread in human cancers","volume":"45","author":"Roberts","year":"2013","journal-title":"Nat. Genet."},{"key":"2024021913321570800_btaa473-B17","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"Silhouettes: a graphical aid to the interpretation and validation of cluster analysis","volume":"20","author":"Rousseeuw","year":"1987","journal-title":"J.\u00a0Comput. Appl. Math"},{"key":"2024021913321570800_btaa473-B18","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1186\/s12918-019-0699-6","article-title":"A fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data","volume":"13","author":"Sun","year":"2019","journal-title":"BMC Syst. Biol"},{"key":"2024021913321570800_btaa473-B19","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1038\/s41586-020-1969-6","article-title":"Pan-cancer analysis of whole genomes","volume":"578","author":"The","year":"2020","journal-title":"Nature"},{"key":"2024021913321570800_btaa473-B20","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1016\/j.ebiom.2017.04.022","article-title":"A specific mutational signature associated with DNA 8-oxoguanine persistence in MUTYH-defective colorectal cancer","volume":"20","author":"Viel","year":"2017","journal-title":"EBioMedicine"},{"key":"2024021913321570800_btaa473-B21","doi-asserted-by":"crossref","first-page":"3924","DOI":"10.1038\/s41388-018-0245-9","article-title":"APOBEC3b and APOBEC mutational signature as potential predictive markers for immunotherapy response in non-small cell lung cancer","volume":"37","author":"Wang","year":"2018","journal-title":"Oncogene"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/Supplement_1\/i154\/56702318\/bioinformatics_36_supplement1_i154.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/Supplement_1\/i154\/56702318\/bioinformatics_36_supplement1_i154.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,19]],"date-time":"2024-02-19T13:37:24Z","timestamp":1708349844000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/Supplement_1\/i154\/5870496"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,1]]},"references-count":21,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2020,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa473","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2020,7]]},"published":{"date-parts":[[2020,7,1]]}}}