{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,31]],"date-time":"2025-10-31T07:12:23Z","timestamp":1761894743453},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":3015,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Tandem mass spectrometry (MS\/MS) is an indispensable technology for identification of proteins from complex mixtures. Proteins are digested to peptides that are then identified by their fragmentation patterns in the mass spectrometer. Thus, at its core, MS\/MS protein identification relies on the relative predictability of peptide fragmentation. Unfortunately, peptide fragmentation is complex and not fully understood, and what is understood is not always exploited by peptide identification algorithms.<\/jats:p><jats:p>Results: We use a hybrid dynamic Bayesian network (DBN)\/support vector machine (SVM) approach to address these two problems. We train a set of DBNs on high-confidence peptide-spectrum matches. These DBNs, known collectively as Riptide, comprise a probabilistic model of peptide fragmentation chemistry. Examination of the distributions learned by Riptide allows identification of new trends, such as prevalent a-ion fragmentation at peptide cleavage sites C-term to hydrophobic residues. In addition, Riptide can be used to produce likelihood scores that indicate whether a given peptide-spectrum match is correct. A vector of such scores is evaluated by an SVM, which produces a final score to be used in peptide identification. Using Riptide in this way yields improved discrimination when compared to other state-of-the-art MS\/MS identification algorithms, increasing the number of positive identifications by as much as 12% at a 1% false discovery rate.<\/jats:p><jats:p>Availability: Python and C source code are available upon request from the authors. The curated training sets are available at http:\/\/noble.gs.washington.edu\/proj\/intense\/. The Graphical Model Tool Kit (GMTK) is freely available at http:\/\/ssli.ee.washington.edu\/bilmes\/gmtk.<\/jats:p><jats:p>Contact: \u00a0noble@gs.washington.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn189","type":"journal-article","created":{"date-parts":[[2008,6,27]],"date-time":"2008-06-27T07:43:13Z","timestamp":1214552593000},"page":"i348-i356","source":"Crossref","is-referenced-by-count":46,"title":["Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification"],"prefix":"10.1093","volume":"24","author":[{"given":"Aaron A.","family":"Klammer","sequence":"first","affiliation":[{"name":"1 Department of Genome Sciences, 2Department of Electrical Engineering and 3Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA"}]},{"given":"Sheila M.","family":"Reynolds","sequence":"additional","affiliation":[{"name":"1 Department of Genome Sciences, 2Department of Electrical Engineering and 3Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA"}]},{"given":"Jeff A.","family":"Bilmes","sequence":"additional","affiliation":[{"name":"1 Department of Genome Sciences, 2Department of Electrical Engineering and 3Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA"},{"name":"1 Department of Genome Sciences, 2Department of Electrical Engineering and 3Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA"}]},{"given":"Michael J.","family":"MacCoss","sequence":"additional","affiliation":[{"name":"1 Department of Genome Sciences, 2Department of Electrical Engineering and 3Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA"}]},{"given":"William Stafford","family":"Noble","sequence":"additional","affiliation":[{"name":"1 Department of Genome Sciences, 2Department of Electrical Engineering and 3Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA"},{"name":"1 Department of Genome Sciences, 2Department of Electrical Engineering and 3Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA"}]}],"member":"286","published-online":{"date-parts":[[2008,7,1]]},"reference":[{"key":"2023020210392250400_B1","doi-asserted-by":"crossref","first-page":"S13","DOI":"10.1093\/bioinformatics\/17.suppl_1.S13","article-title":"SCOPE: a probabilistic model for scoring tandem mass spectra against a peptide database","volume":"17","author":"Bafna","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020210392250400_B2","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1109\/MSP.2005.1511827","article-title":"Graphical model architectures for speech recognition","volume":"22","author":"Bilmes","year":"2005","journal-title":"IEEE Signal Proc. Mag"},{"key":"2023020210392250400_B3","doi-asserted-by":"crossref","first-page":"8365","DOI":"10.1021\/ja9542193","article-title":"Influence of peptide composition, gas-phase basicity, and chemical modification on fragmentation efficiency: evidence for the mobile proton model","volume":"118","author":"Dongre","year":"1996","journal-title":"J. Am. Chem. Soc"},{"key":"2023020210392250400_B4","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1089\/106652799318300","article-title":"De novopeptide sequencing via tandem mass spectrometry","volume":"6","author":"Dancik","year":"1999","journal-title":"J. Comput. Biol"},{"key":"2023020210392250400_B5","doi-asserted-by":"crossref","first-page":"976","DOI":"10.1016\/1044-0305(94)80016-2","article-title":"An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database","volume":"5","author":"Eng","year":"1994","journal-title":"J. Am. Soc. Mass Spectr"},{"key":"2023020210392250400_B6","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1038\/nbt930","article-title":"Intensity-based protein identification by machine learning from a library of tandem mass spectra","volume":"22","author":"Elias","year":"2004","journal-title":"Nature Biotechnology"},{"key":"2023020210392250400_B7","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1002\/1615-9861(200201)2:1<36::AID-PROT36>3.0.CO;2-W","article-title":"Radars, a bioinformatics solution that automates proteome mass spectral analysis, optimises protein identification, and archives data in a relational database","volume":"2","author":"Field","year":"2002","journal-title":"Proteomics"},{"key":"2023020210392250400_B8","doi-asserted-by":"crossref","first-page":"964","DOI":"10.1021\/ac048788h","article-title":"Pepnovo: de novo peptide sequencing via probabilistic network modeling","volume":"77","author":"Frank","year":"2005","journal-title":"Anal. Chem"},{"key":"2023020210392250400_B9","doi-asserted-by":"crossref","first-page":"958","DOI":"10.1021\/pr0499491","article-title":"Open mass spectrometry search algorithm","volume":"3","author":"Geer","year":"2004","journal-title":"J. Proteome Res"},{"key":"2023020210392250400_B10","article-title":"A tutorial on learning with Bayesian Networks","volume-title":"Technical report","author":"Heckerman","year":"1995"},{"key":"2023020210392250400_B11","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1021\/ac0258913","article-title":"Intensity-based statistical scorer for tandem mass spectrometry","volume":"75","author":"Havilio","year":"2003","journal-title":"Anal. Chem"},{"key":"2023020210392250400_B12","doi-asserted-by":"crossref","first-page":"5620","DOI":"10.1021\/ac0700833","article-title":"High speed data reduction, feature detection, and MS\/MS spectrum quality assessment of shotgun proteomics datasets using high resolution mass spectrometry","volume":"79","author":"Hoopmann","year":"2007","journal-title":"Anal. Chem"},{"key":"2023020210392250400_B13","doi-asserted-by":"crossref","DOI":"10.1002\/0471721980","article-title":"Protein sequencing and identification using tandem mass spectrometry","author":"Kinter","year":"2000"},{"key":"2023020210392250400_B14","doi-asserted-by":"crossref","first-page":"6111","DOI":"10.1021\/ac070262k","article-title":"Improving tandem mass spectrum identification using peptide retention time prediction across diverse chromatography conditions","volume":"79","author":"Klammer","year":"2007","journal-title":"Anal. Chem"},{"key":"2023020210392250400_B15","doi-asserted-by":"crossref","first-page":"923","DOI":"10.1038\/nmeth1113","article-title":"A semi-supervised machine learning technique for peptide identification from shotgun proteomics datasets","volume":"4","author":"K\u00e4ll","year":"2007","journal-title":"Nat. Methods"},{"key":"2023020210392250400_B16","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1021\/pr700600n","article-title":"Assigning significance to peptides identified by tandem mass spectrometry using decoy databases","volume":"7","author":"K\u00e4ll","year":"2008","journal-title":"J. Proteome Res"},{"key":"2023020210392250400_B17","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198522195.001.0001","volume-title":"Graphical Models","author":"Lauritzen","year":"1996"},{"key":"2023020210392250400_B18","doi-asserted-by":"crossref","first-page":"437","DOI":"10.1146\/annurev.biochem.70.1.437","article-title":"Analysis of proteins and proteomes by mass spectrometry","volume":"70","author":"Mann","year":"2001","journal-title":"Ann. Rev. Biochem"},{"key":"2023020210392250400_B19","doi-asserted-by":"crossref","first-page":"1811","DOI":"10.1016\/j.bbapap.2006.10.003","article-title":"The utility of ETD mass spectrometry in proteomic analysis","volume":"1764","author":"Mikesh","year":"2007","journal-title":"Biochim. Biophys. Acta"},{"key":"2023020210392250400_B20","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1093\/bioinformatics\/19.2.295","article-title":"Matrix2png: a utility for visualizing matrix data","volume":"19","author":"Pavlidis","year":"2003","journal-title":"Bioinformatics"},{"key":"2023020210392250400_B21","doi-asserted-by":"crossref","first-page":"508","DOI":"10.1002\/mas.20024","article-title":"Fragmentation pathways of protonated peptides","volume":"24","author":"Paizs","year":"2004","journal-title":"Mass Spectro. Rev"},{"key":"2023020210392250400_B22","doi-asserted-by":"crossref","DOI":"10.1021\/pr800127y","article-title":"Rapid and accurate peptide identification from tandem mass spectra","volume-title":"J. Proteome Res","author":"Park","year":"2008"},{"key":"2023020210392250400_B23","doi-asserted-by":"crossref","first-page":"9440","DOI":"10.1073\/pnas.1530509100","article-title":"Statistical significance for genome-wide studies","volume":"100","author":"Storey","year":"2003","journal-title":"Pro. Natl. Acad. Sci.USA"},{"key":"2023020210392250400_B24","doi-asserted-by":"crossref","first-page":"1067","DOI":"10.1002\/(SICI)1097-0231(19970615)11:9<1067::AID-RCM953>3.0.CO;2-L","article-title":"Sequence database searches via de novopeptide sequencing by tandem mass spectrometry","volume":"11","author":"Taylor","year":"1997","journal-title":"Rapid commun. Mass Spectr"},{"key":"2023020210392250400_B25","doi-asserted-by":"crossref","first-page":"6415","DOI":"10.1021\/ac0347462","article-title":"Gutentag: high-throughput sequence tagging via an empirically derived fragmentation model","volume":"75","author":"Tabb","year":"2003","journal-title":"Anal. Chem"},{"key":"2023020210392250400_B26","doi-asserted-by":"crossref","first-page":"1243","DOI":"10.1021\/ac0351163","article-title":"Influence of basic residue content on fragment ion peak intensities in low-energy collision-induced dissociation spectra of peptides","volume":"76","author":"Tabb","year":"2004","journal-title":"Anal. Chem"},{"key":"2023020210392250400_B27","doi-asserted-by":"crossref","first-page":"4626","DOI":"10.1021\/ac050102d","article-title":"InsPecT: Identification of posttranslationally modified peptides from tandem mass spectra","volume":"77","author":"Tanner","year":"2005","journal-title":"Anal. Chem"},{"key":"2023020210392250400_B28","doi-asserted-by":"crossref","first-page":"1399","DOI":"10.1002\/1096-9888(200012)35:12<1399::AID-JMS86>3.0.CO;2-R","article-title":"Mobile and localized protons: a framework for understanding peptide dissociation","volume":"35","author":"Wysocki","year":"2000","journal-title":"J. Am. Soc. Mass Spectr"},{"key":"2023020210392250400_B29","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1038\/85686","article-title":"Large-scale analysis of the yeast proteome by multidimensional protein identification technology","volume":"19","author":"Washburn","year":"2001","journal-title":"Nat. Biotechnol"},{"key":"2023020210392250400_B30","doi-asserted-by":"crossref","first-page":"432","DOI":"10.1021\/ac051319a","article-title":"PepHMM: a hidden Markov model based scoring function for mass spectrometry database search","volume":"78","author":"Wan","year":"2005","journal-title":"Anal.l Chem"},{"key":"2023020210392250400_B31","doi-asserted-by":"crossref","first-page":"1426","DOI":"10.1021\/ac00104a020","article-title":"Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database","volume":"67","author":"Yates,III","year":"1995","journal-title":"Anal. Chem"},{"key":"2023020210392250400_B32","first-page":"1","article-title":"Mass spectrometry and the age of the proteome","volume":"33","author":"Yates,III","year":"1998","journal-title":"Anal. Chem"},{"key":"2023020210392250400_B33","doi-asserted-by":"crossref","first-page":"1406","DOI":"10.1002\/1615-9861(200210)2:10<1406::AID-PROT1406>3.0.CO;2-9","article-title":"ProbID: a probabilistic algorithm to identify peptides through sequence database searching using tandem mass spectral data","volume":"2","author":"Zhang","year":"2002","journal-title":"Proteomics"},{"key":"2023020210392250400_B34","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1016\/j.copbio.2003.12.002","article-title":"Electron-capture dissociation tandem mass spectrometry","volume":"15","author":"Zubarev","year":"2004","journal-title":"Curr. Opin. Biotechnol"},{"key":"2023020210392250400_B35","doi-asserted-by":"crossref","first-page":"3908","DOI":"10.1021\/ac049951b","article-title":"Prediction of low-energy collision-induced dissociation spectra of peptides","volume":"76","author":"Zhang","year":"2004","journal-title":"Anal. Chem"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i348\/49050839\/bioinformatics_24_13_i348.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i348\/49050839\/bioinformatics_24_13_i348.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,27]],"date-time":"2024-02-27T22:21:25Z","timestamp":1709072485000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/13\/i348\/236975"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,7,1]]},"references-count":35,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2008,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn189","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,7,1]]},"published":{"date-parts":[[2008,7,1]]}}}