{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T06:16:02Z","timestamp":1772172962032,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1009309","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,4,14]],"date-time":"2022-04-14T00:00:00Z","timestamp":1649894400000}}],"reference-count":58,"publisher":"Public Library of Science (PLoS)","issue":"4","license":[{"start":{"date-parts":[[2022,4,4]],"date-time":"2022-04-04T00:00:00Z","timestamp":1649030400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/publicdomain\/zero\/1.0\/"}],"funder":[{"DOI":"10.13039\/100000054","name":"national cancer institute","doi-asserted-by":"publisher","award":["Intramural Research Program"],"award-info":[{"award-number":["Intramural Research Program"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>For <jats:italic>de novo<\/jats:italic> mutational signature analysis, the critical first step is to decide how many signatures should be expected in a cancer genomics study. An incorrect number could mislead downstream analyses. 
Here we present SUITOR (Selecting the nUmber of mutatIonal signaTures thrOugh cRoss-validation), an unsupervised cross-validation method that requires few assumptions and no numerical approximations to select the optimal number of signatures without overfitting the data. <jats:italic>In vitro<\/jats:italic> studies and <jats:italic>in silico<\/jats:italic> simulations demonstrated that SUITOR can correctly identify signatures, some of which were missed by other widely used methods. Applied to 2,540 whole-genome sequenced tumors across 22 cancer types, SUITOR selected signatures with the smallest prediction errors, and almost all signatures of breast cancer selected by SUITOR were validated in an independent breast cancer study. SUITOR is a powerful tool to select the optimal number of mutational signatures, facilitating downstream analyses with etiological or therapeutic importance.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1009309","type":"journal-article","created":{"date-parts":[[2022,4,4]],"date-time":"2022-04-04T13:33:01Z","timestamp":1649079181000},"page":"e1009309","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":4,"title":["SUITOR: Selecting the number of mutational signatures through cross-validation"],"prefix":"10.1371","volume":"18","author":[{"given":"Donghyuk","family":"Lee","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4088-3859","authenticated-orcid":true,"given":"Difei","family":"Wang","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4451-8664","authenticated-orcid":true,"given":"Xiaohong 
R.","family":"Yang","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8606-4707","authenticated-orcid":true,"given":"Jianxin","family":"Shi","sequence":"additional","affiliation":[]},{"given":"Maria Teresa","family":"Landi","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0172-5516","authenticated-orcid":true,"given":"Bin","family":"Zhu","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,4,4]]},"reference":[{"issue":"7793","key":"pcbi.1009309.ref001","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1038\/s41586-020-1943-3","article-title":"The repertoire of mutational signatures in human cancer","volume":"578","author":"LB Alexandrov","year":"2020","journal-title":"Nature"},{"issue":"7605","key":"pcbi.1009309.ref002","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1038\/nature17676","article-title":"Landscape of somatic mutations in 560 breast cancer whole-genome sequences","volume":"534","author":"S Nik-Zainal","year":"2016","journal-title":"Nature"},{"issue":"7793","key":"pcbi.1009309.ref003","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1038\/s41586-019-1913-9","article-title":"Patterns of somatic structural variation in human cancer genomes","volume":"578","author":"Y Li","year":"2020","journal-title":"Nature"},{"issue":"9","key":"pcbi.1009309.ref004","doi-asserted-by":"crossref","first-page":"1262","DOI":"10.1038\/s41588-018-0179-8","article-title":"Copy number signatures and mutational processes in ovarian carcinoma","volume":"50","author":"G Macintyre","year":"2018","journal-title":"Nat Genet"},{"issue":"5","key":"pcbi.1009309.ref005","doi-asserted-by":"crossref","first-page":"e1009557","DOI":"10.1371\/journal.pgen.1009557","article-title":"Copy number signature analysis tool and its application in prostate cancer reveals distinct mutational processes and clinical outcomes","volume":"17","author":"S 
Wang","year":"2021","journal-title":"PLoS Genet"},{"key":"pcbi.1009309.ref006","article-title":"Signatures of copy number alterations in human cancer","author":"CD Steele","year":"2021","journal-title":"bioRxiv"},{"issue":"D1","key":"pcbi.1009309.ref007","doi-asserted-by":"crossref","first-page":"D941","DOI":"10.1093\/nar\/gky1015","article-title":"COSMIC: the Catalogue Of Somatic Mutations In Cancer","volume":"47","author":"JG Tate","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"pcbi.1009309.ref008","doi-asserted-by":"crossref","first-page":"821","DOI":"10.1016\/j.cell.2019.03.001","article-title":"A Compendium of Mutational Signatures of Environmental Agents","volume":"177","author":"JE Kucab","year":"2019","journal-title":"Cell"},{"issue":"6312","key":"pcbi.1009309.ref009","doi-asserted-by":"crossref","first-page":"618","DOI":"10.1126\/science.aag0299","article-title":"Mutational signatures associated with tobacco smoking in human cancer","volume":"354","author":"LB Alexandrov","year":"2016","journal-title":"Science"},{"issue":"1","key":"pcbi.1009309.ref010","doi-asserted-by":"crossref","first-page":"1315","DOI":"10.1038\/s41467-017-01358-x","article-title":"Mutational signatures reveal the dynamic interplay of risk factors and cellular processes during liver tumorigenesis","volume":"8","author":"E Letouze","year":"2017","journal-title":"Nat Commun"},{"issue":"1","key":"pcbi.1009309.ref011","doi-asserted-by":"crossref","first-page":"1744","DOI":"10.1038\/s41467-018-04052-8","article-title":"Validating the concept of mutational signatures with isogenic cell models","volume":"9","author":"X Zou","year":"2018","journal-title":"Nat Commun"},{"issue":"10","key":"pcbi.1009309.ref012","doi-asserted-by":"crossref","first-page":"1476","DOI":"10.1038\/ng.3934","article-title":"A mutational signature reveals alterations underlying deficient homologous recombination repair in breast cancer","volume":"49","author":"P 
Polak","year":"2017","journal-title":"Nat Genet"},{"issue":"1","key":"pcbi.1009309.ref013","doi-asserted-by":"crossref","first-page":"1746","DOI":"10.1038\/s41467-018-04002-4","article-title":"Distinct mutational signatures characterize concurrent loss of polymerase proofreading and mismatch repair.","volume":"9","author":"NJ Haradhvala","year":"2018","journal-title":"Nat Commun."},{"issue":"6","key":"pcbi.1009309.ref014","doi-asserted-by":"crossref","first-page":"1282","DOI":"10.1016\/j.cell.2019.02.012","article-title":"Characterizing Mutational Signatures in Human Cancer Cell Lines Reveals Episodic APOBEC Mutagenesis","volume":"176","author":"M Petljak","year":"2019","journal-title":"Cell"},{"issue":"2","key":"pcbi.1009309.ref015","doi-asserted-by":"crossref","first-page":"256","DOI":"10.1016\/j.ccell.2018.12.011","article-title":"Mutational Signature Analysis Reveals NTHL1 Deficiency to Cause a Multi-tumor Phenotype","volume":"35","author":"JE Grolleman","year":"2019","journal-title":"Cancer Cell"},{"issue":"6360","key":"pcbi.1009309.ref016","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1126\/science.aao3130","article-title":"Use of CRISPR-modified human stem cell organoids to study the origin of mutational signatures in cancer","volume":"358","author":"J Drost","year":"2017","journal-title":"Science"},{"issue":"10","key":"pcbi.1009309.ref017","doi-asserted-by":"crossref","first-page":"1131","DOI":"10.1038\/ng.3659","article-title":"Mutational signatures in esophageal adenocarcinoma define etiologically distinct subgroups with therapeutic relevance","volume":"48","author":"M Secrier","year":"2016","journal-title":"Nat Genet"},{"issue":"10","key":"pcbi.1009309.ref018","doi-asserted-by":"crossref","first-page":"1526","DOI":"10.1038\/s41591-019-0582-4","article-title":"Whole-genome sequencing of triple-negative breast cancers in a population-based clinical study","volume":"25","author":"J Staaf","year":"2019","journal-title":"Nat 
Med"},{"issue":"6","key":"pcbi.1009309.ref019","doi-asserted-by":"crossref","first-page":"774","DOI":"10.1001\/jamaoncol.2016.3916","article-title":"Association of Distinct Mutational Signatures With Correlates of Increased Immune Activity in Pancreatic Ductal Adenocarcinoma","volume":"3","author":"AA Connor","year":"2017","journal-title":"JAMA Oncol"},{"issue":"7","key":"pcbi.1009309.ref020","doi-asserted-by":"crossref","first-page":"1724","DOI":"10.1158\/0008-5472.CAN-15-2443","article-title":"Distinct Subtypes of Gastric Cancer Defined by Molecular Characterization Include Novel Mutational Signatures with Prognostic Capability","volume":"76","author":"X Li","year":"2016","journal-title":"Cancer Res"},{"issue":"9","key":"pcbi.1009309.ref021","doi-asserted-by":"crossref","first-page":"e0221235","DOI":"10.1371\/journal.pone.0221235","article-title":"Computational tools to detect signatures of mutational processes in DNA from tumours: A review and empirical comparison of performance","volume":"14","author":"H Omichessan","year":"2019","journal-title":"PLoS One"},{"key":"pcbi.1009309.ref022","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1016\/j.mam.2019.05.002","article-title":"Somatic mutational signatures in polyposis and colorectal cancer","volume":"69","author":"JE Grolleman","year":"2019","journal-title":"Mol Aspects Med"},{"issue":"1","key":"pcbi.1009309.ref023","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1093\/bib\/bbx082","article-title":"Computational approaches for discovery of mutational signatures in cancer","volume":"20","author":"A Baez-Ortega","year":"2019","journal-title":"Brief Bioinform"},{"issue":"22","key":"pcbi.1009309.ref024","doi-asserted-by":"crossref","first-page":"3673","DOI":"10.1093\/bioinformatics\/btv408","article-title":"SomaticSignatures: inferring mutational signatures from single-nucleotide variants","volume":"31","author":"JS 
Gehring","year":"2015","journal-title":"Bioinformatics"},{"issue":"1","key":"pcbi.1009309.ref025","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1016\/j.celrep.2012.12.008","article-title":"Deciphering signatures of mutational processes operative in human cancer","volume":"3","author":"LB Alexandrov","year":"2013","journal-title":"Cell Rep"},{"issue":"4","key":"pcbi.1009309.ref026","doi-asserted-by":"crossref","first-page":"R39","DOI":"10.1186\/gb-2013-14-4-r39","article-title":"EMu: probabilistic inference of mutational processes and their localization in the cancer genome","volume":"14","author":"A Fischer","year":"2013","journal-title":"Genome Biol"},{"issue":"1","key":"pcbi.1009309.ref027","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1093\/bioinformatics\/btw572","article-title":"signeR: an empirical Bayesian approach to mutational signature discovery","volume":"33","author":"RA Rosales","year":"2017","journal-title":"Bioinformatics"},{"issue":"6","key":"pcbi.1009309.ref028","doi-asserted-by":"crossref","first-page":"600","DOI":"10.1038\/ng.3557","article-title":"Somatic ERCC2 mutations are associated with a distinct genomic signature in urothelial tumors","volume":"48","author":"J Kim","year":"2016","journal-title":"Nat Genet"},{"key":"pcbi.1009309.ref029","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1186\/s13059-016-0893-4","article-title":"DeconstructSigs: delineating mutational processes in single tumors distinguishes DNA repair deficiencies and patterns of carcinoma evolution","volume":"17","author":"R Rosenthal","year":"2016","journal-title":"Genome Biol"},{"issue":"2","key":"pcbi.1009309.ref030","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1093\/bioinformatics\/btx604","article-title":"Detecting presence of mutational signatures in cancer with confidence","volume":"34","author":"X 
Huang","year":"2018","journal-title":"Bioinformatics"},{"issue":"1","key":"pcbi.1009309.ref031","doi-asserted-by":"crossref","first-page":"2969","DOI":"10.1038\/s41467-019-11037-8","article-title":"A practical guide for mutational signature analysis in hematological malignancies.","volume":"10","author":"F Maura","year":"2019","journal-title":"Nat Commun."},{"issue":"2","key":"pcbi.1009309.ref032","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1038\/s43018-020-0027-5","article-title":"A practical framework and online tool for mutational signature analyses show inter-tissue variation and driver dependencies","volume":"1","author":"A Degasperi","year":"2020","journal-title":"Nat Cancer"},{"issue":"2","key":"pcbi.1009309.ref033","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1214\/aos\/1176344136","article-title":"Estimating the dimension of a model","volume":"6","author":"G. Schwarz","year":"1978","journal-title":"The annals of statistics"},{"issue":"7","key":"pcbi.1009309.ref034","doi-asserted-by":"crossref","first-page":"1592","DOI":"10.1109\/TPAMI.2012.240","article-title":"Automatic relevance determination in nonnegative matrix factorization with the \u03b2-divergence","volume":"35","author":"VY Tan","year":"2013","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"1","key":"pcbi.1009309.ref035","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1016\/j.jeconom.2015.02.006","article-title":"Cross-validation for selecting a model selection procedure","volume":"187","author":"Y Zhang","year":"2015","journal-title":"Journal of Econometrics"},{"key":"pcbi.1009309.ref036","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1214\/09-SS054","article-title":"A survey of cross-validation procedures for model selection.","volume":"4","author":"S Arlot","year":"2010","journal-title":"Statistics 
surveys."},{"issue":"1","key":"pcbi.1009309.ref037","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1038\/ng1061","article-title":"Identifying distinct classes of bladder carcinoma using microarrays","volume":"33","author":"L Dyrskjot","year":"2003","journal-title":"Nat Genet"},{"issue":"1","key":"pcbi.1009309.ref038","doi-asserted-by":"crossref","first-page":"4556","DOI":"10.1038\/s41467-020-18418-4","article-title":"Dutch population structure across space, time and GWAS design.","volume":"11","author":"Project Min EALSGC","year":"2020","journal-title":"Nat Commun."},{"issue":"1","key":"pcbi.1009309.ref039","doi-asserted-by":"crossref","first-page":"4807","DOI":"10.1038\/s41467-020-18497-3","article-title":"Lymph node metastasis prediction of papillary thyroid carcinoma based on transfer learning radiomics","volume":"11","author":"J Yu","year":"2020","journal-title":"Nat Commun"},{"issue":"2","key":"pcbi.1009309.ref040","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1093\/biomet\/80.2.267","article-title":"Maximum Likelihood Estimation via the ECM Algorithm: A General Framework","volume":"80","author":"X-L Meng","year":"1993","journal-title":"Biometrika"},{"issue":"1","key":"pcbi.1009309.ref041","first-page":"015013","article-title":"An automated approach for determining the number of components in non-negative matrix factorization with application to mutational signature learning.","volume":"2","author":"G Gilad","year":"2020","journal-title":"Machine Learning: Science and Technology."},{"issue":"6","key":"pcbi.1009309.ref042","doi-asserted-by":"crossref","first-page":"e1009119","DOI":"10.1371\/journal.pcbi.1009119","article-title":"De novo mutational signature discovery in tumor genomes using SparseSignatures.","volume":"17","author":"A Lal","year":"2021","journal-title":"PLoS Comput Biol."},{"key":"pcbi.1009309.ref043","doi-asserted-by":"crossref","unstructured":"F\u00e9votte C, Cemgil AT, editors. 
Nonnegative matrix factorizations as probabilistic inference in composite models. 2009 17th European Signal Processing Conference; 2009 24\u201328 Aug. 2009.","DOI":"10.1109\/SIU.2009.5136487"},{"key":"pcbi.1009309.ref044","doi-asserted-by":"crossref","unstructured":"Gaussier E, Goutte C. Relation between PLSA and NMF and implications. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval; Salvador, Brazil: Association for Computing Machinery; 2005. p. 601\u20132.","DOI":"10.1145\/1076034.1076148"},{"key":"pcbi.1009309.ref045","doi-asserted-by":"crossref","unstructured":"Friedman J, Hastie T, Tibshirani R. The elements of statistical learning: Springer series in statistics New York; 2001.","DOI":"10.1007\/978-0-387-21606-5"},{"key":"pcbi.1009309.ref046","volume-title":"Statistical analysis with missing data","author":"RJ Little","year":"2019"},{"issue":"1","key":"pcbi.1009309.ref047","doi-asserted-by":"crossref","first-page":"2169","DOI":"10.1038\/s41467-020-15912-7","article-title":"Mutational signatures are jointly shaped by DNA damage and repair","volume":"11","author":"NV Volkova","year":"2020","journal-title":"Nat Commun"},{"key":"pcbi.1009309.ref048","unstructured":"Kohavi R. A study of cross-validation and bootstrap for accuracy estimation and model selection. Proceedings of the 14th international joint conference on Artificial intelligence\u2014Volume 2; Montreal, Quebec, Canada: Morgan Kaufmann Publishers Inc.; 1995. p. 1137\u201343."},{"issue":"3","key":"pcbi.1009309.ref049","first-page":"291","article-title":"Submodel Selection and Evaluation in Regression. The X-Random Case.","volume":"60","author":"L Breiman","year":"1992","journal-title":"International Statistical Review \/ Revue Internationale de Statistique."},{"key":"pcbi.1009309.ref050","unstructured":"Ding C, Li T, Peng W. 
Nonnegative matrix factorization and probabilistic latent semantic indexing: equivalence, chi-square statistic, and a hybrid method. Proceedings of the 21st national conference on Artificial intelligence\u2014Volume 1; Boston, Massachusetts: AAAI Press; 2006. p. 342\u20137."},{"issue":"6755","key":"pcbi.1009309.ref051","doi-asserted-by":"crossref","first-page":"788","DOI":"10.1038\/44565","article-title":"Learning the parts of objects by non-negative matrix factorization","volume":"401","author":"DD Lee","year":"1999","journal-title":"Nature"},{"issue":"5","key":"pcbi.1009309.ref052","doi-asserted-by":"crossref","first-page":"1241","DOI":"10.1007\/s00216-007-1790-1","article-title":"Cross-validation of component models: A critical look at current methods","volume":"390","author":"R Bro","year":"2008","journal-title":"Analytical and Bioanalytical Chemistry"},{"issue":"4","key":"pcbi.1009309.ref053","doi-asserted-by":"crossref","first-page":"397","DOI":"10.1080\/00401706.1978.10489693","article-title":"Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models.","volume":"20","author":"S. 
Wold","year":"1978","journal-title":"Technometrics"},{"issue":"6","key":"pcbi.1009309.ref054","doi-asserted-by":"crossref","first-page":"932","DOI":"10.1016\/j.ccell.2019.04.007","article-title":"Genomic and Transcriptomic Profiling of Combined Hepatocellular and Intrahepatic Cholangiocarcinoma Reveals Distinct Molecular Subtypes","volume":"35","author":"R Xue","year":"2019","journal-title":"Cancer Cell"},{"issue":"1","key":"pcbi.1009309.ref055","doi-asserted-by":"crossref","first-page":"394","DOI":"10.1038\/s41467-019-14261-4","article-title":"Mutational signatures in tumours induced by high and low energy radiation in Trp53 deficient mice.","volume":"11","author":"Y Rose Li","year":"2020","journal-title":"Nat Commun."},{"key":"pcbi.1009309.ref056","doi-asserted-by":"crossref","first-page":"8866","DOI":"10.1038\/ncomms9866","article-title":"Whole-genome sequencing reveals activation-induced cytidine deaminase signatures during indolent chronic lymphocytic leukaemia evolution","volume":"6","author":"S Kasar","year":"2015","journal-title":"Nat Commun"},{"issue":"Nov","key":"pcbi.1009309.ref057","first-page":"2579","article-title":"Visualizing data using t-SNE.","volume":"9","author":"Maaten Lvd","year":"2008","journal-title":"Journal of machine learning research"},{"issue":"5","key":"pcbi.1009309.ref058","doi-asserted-by":"crossref","first-page":"1042","DOI":"10.1016\/j.cell.2017.09.048","article-title":"Comprehensive Analysis of Hypermutation in Human Cancer","volume":"171","author":"BB Campbell","year":"2017","journal-title":"Cell"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1009309","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,4,14]],"date-time":"2022-04-14T00:00:00Z","timestamp":1649894400000}}],"container-title":["PLOS Computational 
Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009309","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,19]],"date-time":"2023-11-19T11:38:45Z","timestamp":1700393925000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009309"}},"subtitle":[],"editor":[{"given":"Anna R","family":"Panchenko","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,4,4]]},"references-count":58,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2022,4,4]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1009309","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.07.28.454269","asserted-by":"object"},{"id-type":"doi","id":"10.21203\/rs.3.rs-67930\/v1","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,4,4]]}}}