{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:45:39Z","timestamp":1740185139352,"version":"3.37.3"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2020,7,13]],"date-time":"2020-07-13T00:00:00Z","timestamp":1594598400000},"content-version":"vor","delay-in-days":12,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["NRF-2017M3C9A5031597","NRF-2017R1E1A1A01077412","NRF-2019M3E5D3073568"],"award-info":[{"award-number":["NRF-2017M3C9A5031597","NRF-2017R1E1A1A01077412","NRF-2019M3E5D3073568"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001321","name":"National Research Foundation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001321","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Ministry of Education of Korea"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Proteogenomics has proven its utility by integrating genomics and proteomics. Typical approaches use data from next-generation sequencing to infer proteins expressed. A sample-specific protein sequence database is often adopted to identify novel peptides from matched mass spectrometry-based proteomics; nevertheless, there is no software that can practically identify all possible forms of mutated peptides suggested by various genomic information sources.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We propose MutCombinator, which enables us to practically identify mutated peptides from tandem mass spectra allowing combinatorial mutations during the database search. It uses an upgraded version of a variant graph, keeping track of frame information. The variant graph is indexed by nine nucleotides for fast access. Using MutCombinator, we could identify more mutated peptides than previous methods, because combinations of point mutations are considered and also because it can be practically applied together with a large mutation database such as COSMIC. Furthermore, MutCombinator supports in-frame search for coding regions and three-frame search for non-coding regions.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>https:\/\/prix.hanyang.ac.kr\/download\/mutcombinator.jsp.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa504","type":"journal-article","created":{"date-parts":[[2020,5,9]],"date-time":"2020-05-09T11:09:46Z","timestamp":1589022586000},"page":"i203-i209","source":"Crossref","is-referenced-by-count":1,"title":["MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search"],"prefix":"10.1093","volume":"36","author":[{"given":"Seunghyuk","family":"Choi","sequence":"first","affiliation":[{"name":"Department of Computer Science, Hanyang University , Seongdong-gu, Seoul 04763, Republic of Korea"}]},{"given":"Eunok","family":"Paek","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Hanyang University , Seongdong-gu, Seoul 04763, Republic of Korea"}]}],"member":"286","published-online":{"date-parts":[[2020,7,13]]},"reference":[{"key":"2024021913322536300_btaa504-B3","doi-asserted-by":"crossref","first-page":"1218","DOI":"10.1093\/bioinformatics\/btw787","article-title":"ACTG: novel peptide mapping onto gene models","volume":"33","author":"Choi","year":"2017","journal-title":"Bioinformatics"},{"key":"2024021913322536300_btaa504-B4","doi-asserted-by":"crossref","first-page":"860","DOI":"10.1038\/nature01322","article-title":"Inflammation and cancer","volume":"420","author":"Coussens","year":"2002","journal-title":"Nature"},{"key":"2024021913322536300_btaa504-B5","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1038\/nprot.2008.211","article-title":"Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources","volume":"4","author":"Huang da","year":"2009","journal-title":"Nat. Protoc"},{"key":"2024021913322536300_btaa504-B6","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1038\/nature13302","article-title":"A draft map of the human proteome","volume":"509","author":"Kim","year":"2014","journal-title":"Nature"},{"key":"2024021913322536300_btaa504-B7","doi-asserted-by":"crossref","first-page":"5277","DOI":"10.1038\/ncomms6277","article-title":"MS-GF+ makes progress towards a universal database search tool for proteomics","volume":"5","author":"Kim","year":"2014","journal-title":"Nat. Commun"},{"key":"2024021913322536300_btaa504-B8","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1038\/nmeth.4256","article-title":"MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics","volume":"14","author":"Kong","year":"2017","journal-title":"Nat. Methods"},{"key":"2024021913322536300_btaa504-B9","doi-asserted-by":"crossref","first-page":"D1062","DOI":"10.1093\/nar\/gkx1153","article-title":"ClinVar: improving access to variant interpretations and supporting evidence","volume":"46","author":"Landrum","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024021913322536300_btaa504-B10","doi-asserted-by":"crossref","first-page":"4390","DOI":"10.1021\/ac00096a002","article-title":"Error-tolerant identification of peptides in sequence databases by peptide sequence tags","volume":"66","author":"Mann","year":"1994","journal-title":"Anal. Chem"},{"key":"2024021913322536300_btaa504-B11","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.ijms.2017.08.015","article-title":"Comprehensive and sensitive proteogenomics data analysis strategy based on complementary multi-stage database search","volume":"427","author":"Madar","year":"2018","journal-title":"Int. J. Mass Spectrom"},{"key":"2024021913322536300_btaa504-B12","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1038\/nature18003","article-title":"Proteogenomics connects somatic mutations to signalling in breast cancer","volume":"534","author":"Mertins","year":"2016","journal-title":"Nature"},{"key":"2024021913322536300_btaa504-B13","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1016\/j.ccell.2018.12.003","article-title":"Proteogenomic characterization of human early-onset gastric cancer","volume":"35","author":"Mun","year":"2019","journal-title":"Cancer Cell"},{"key":"2024021913322536300_btaa504-B14","doi-asserted-by":"crossref","DOI":"10.1074\/mcp.M111.010199","article-title":"Fast multi-blind modification search through tandem mass spectrometry","volume":"11","author":"Na","year":"2012","journal-title":"Mol. Cell Proteomics"},{"key":"2024021913322536300_btaa504-B15","doi-asserted-by":"crossref","first-page":"1114","DOI":"10.1038\/nmeth.3144","article-title":"Proteogenomics: concepts, applications and computational strategies","volume":"11","author":"Nesvizhskii","year":"2014","journal-title":"Nat. Methods"},{"key":"2024021913322536300_btaa504-B16","doi-asserted-by":"crossref","first-page":"2742","DOI":"10.1002\/pmic.201400225","article-title":"Compact variant-rich customized sequence database and a fast and sensitive database search for efficient proteogenomic analyses","volume":"14","author":"Park","year":"2014","journal-title":"Proteomics"},{"key":"2024021913322536300_btaa504-B17","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1038\/nbt.1754","article-title":"Integrative genomics viewer","volume":"29","author":"Robinson","year":"2011","journal-title":"Nat. Biotechnol"},{"key":"2024021913322536300_btaa504-B18","doi-asserted-by":"crossref","first-page":"1124","DOI":"10.1074\/mcp.M700419-MCP200","article-title":"Post-experiment monoisotopic mass filtering and refinement (PE-MMR) of tandem mass spectrometric data increases accuracy of peptide identification in LC\/MS\/MS","volume":"7","author":"Shin","year":"2008","journal-title":"Mol. Cell. Proteomics"},{"key":"2024021913322536300_btaa504-B19","doi-asserted-by":"crossref","first-page":"138","DOI":"10.3389\/fgene.2019.00138","article-title":"Aberrant expression of pseudogene-derived lncRNAs as an alternative mechanism of cancer gene regulation in lung adenocarcinoma","volume":"10","author":"Stewart","year":"2019","journal-title":"Front. Genet"},{"key":"2024021913322536300_btaa504-B20","doi-asserted-by":"crossref","first-page":"6415","DOI":"10.1021\/ac0347462","article-title":"GutenTag: high-throughput sequence tagging via an empirically derived fragmentation model","volume":"75","author":"Tabb","year":"2003","journal-title":"Anal. Chem"},{"key":"2024021913322536300_btaa504-B21","doi-asserted-by":"crossref","first-page":"D941","DOI":"10.1093\/nar\/gky1015","article-title":"COSMIC: the catalogue of somatic mutations in cancer","volume":"47","author":"Tate","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2024021913322536300_btaa504-B22","doi-asserted-by":"crossref","first-page":"3235","DOI":"10.1093\/bioinformatics\/btt543","article-title":"customProDB: an R package to generate customized protein databases from RNA-Seq data for proteomics search","volume":"29","author":"Wang","year":"2013","journal-title":"Bioinformatics"},{"key":"2024021913322536300_btaa504-B24","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1021\/pr400294c","article-title":"Proteogenomic database construction driven from large scale RNA-seq data","volume":"13","author":"Woo","year":"2014","journal-title":"J. Proteome Res"},{"key":"2024021913322536300_btaa504-B25","doi-asserted-by":"crossref","first-page":"2719","DOI":"10.1002\/pmic.201400206","article-title":"Proteogenomic strategies for identification of aberrant cancer peptides using large-scale next-generation sequencing data","volume":"14","author":"Woo","year":"2014","journal-title":"Proteomics"},{"key":"2024021913322536300_btaa504-B26","doi-asserted-by":"crossref","first-page":"382","DOI":"10.1038\/nature13438","article-title":"Proteogenomic characterization of human colon and rectal cancer","volume":"513","author":"Zhang","year":"2014","journal-title":"Nature"},{"key":"2024021913322536300_btaa504-B27","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1021\/acs.jproteome.6b00505","article-title":"CanProVar 2.0: an updated database of human cancer proteome variation","volume":"16","author":"Zhang","year":"2017","journal-title":"J. Proteome Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/Supplement_1\/i203\/56702360\/bioinformatics_36_supplement1_i203.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/Supplement_1\/i203\/56702360\/bioinformatics_36_supplement1_i203.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,19]],"date-time":"2024-02-19T13:37:50Z","timestamp":1708349870000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/Supplement_1\/i203\/5870524"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,1]]},"references-count":24,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2020,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa504","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2020,7]]},"published":{"date-parts":[[2020,7,1]]}}}