{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T23:30:12Z","timestamp":1773271812771,"version":"3.50.1"},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":3015,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Despite a recent surge of interest in database-independent peptide identifications, accurate de novo peptide sequencing remains an elusive goal. While the recently introduced spectral network approach resulted in accurate peptide sequencing in low-complexity samples, its success depends on the chance of presence of spectra from overlapping peptides. On the other hand, while multistage mass spectrometry (collecting multiple MS 3 spectra from each MS 2 spectrum) can be applied to all spectra in a complex sample, there are currently no software tools for de novo peptide sequencing by multistage mass spectrometry. We describe a rigorous probabilistic framework for analyzing spectra of overlapping peptides and show how to apply it for multistage mass spectrometry. Our software results in both accurate de novo peptide sequencing from multistage mass spectra (despite the inferior quality of MS 3 spectra) and improved interpretation of spectral networks. We further study the problem of de novo peptide sequencing with accurate parent mass (but inaccurate fragment masses), the protocol that may soon become the dominant mode of spectral acquisition. Most existing peptide sequencing algorithms (based on the spectrum graph approach) do not track the accurate parent mass and are thus not equipped for solving this problem. We describe a de novo peptide sequencing algorithm aimed at this experimental protocol and show that it improves the sequencing accuracy on both tandem and multistage mass spectrometry.<\/jats:p>\n               <jats:p>Availability: The open-source implementation of our software is available at http:\/\/proteomics.bioprojects.org.<\/jats:p>\n               <jats:p>Contact: \u00a0bandeira@ucsd.edu<\/jats:p>\n               <jats:p>Supplementary information:: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn184","type":"journal-article","created":{"date-parts":[[2008,7,29]],"date-time":"2008-07-29T20:24:44Z","timestamp":1217363084000},"page":"i416-i423","source":"Crossref","is-referenced-by-count":24,"title":["Multi-spectra peptide sequencing and its applications to multistage mass spectrometry"],"prefix":"10.1093","volume":"24","author":[{"given":"Nuno","family":"Bandeira","sequence":"first","affiliation":[{"name":"1 Department of Computer Science and Engineering, University of California, San Diego, USA and 2Max-Planck Institute for Biochemistry, Germany"}]},{"given":"Jesper V.","family":"Olsen","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, University of California, San Diego, USA and 2Max-Planck Institute for Biochemistry, Germany"}]},{"given":"Matthias","family":"Mann","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, University of California, San Diego, USA and 2Max-Planck Institute for Biochemistry, Germany"}]},{"given":"Pavel A.","family":"Pevzner","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, University of California, San Diego, USA and 2Max-Planck Institute for Biochemistry, Germany"}]}],"member":"286","published-online":{"date-parts":[[2008,7,1]]},"reference":[{"key":"2023020210410171000_B1","doi-asserted-by":"crossref","first-page":"198","DOI":"10.1038\/nature01511","article-title":"Mass spectrometry-based proteomics","volume":"422","author":"Aebersold","year":"2003","journal-title":"Nature"},{"key":"2023020210410171000_B2","doi-asserted-by":"crossref","first-page":"3726","DOI":"10.1093\/bioinformatics\/bti620","article-title":"Robust accurate identification of peptides (raid): deciphering ms2 data using a structured library search with de novo based statistics","volume":"21","author":"Alves","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020210410171000_B3","first-page":"9","article-title":"On de-novo interpretation of tandem mass spectra for peptide identification","author":"Bafna","year":"2003"},{"key":"2023020210410171000_B4","doi-asserted-by":"crossref","first-page":"7221","DOI":"10.1021\/ac0489162","article-title":"Shotgun protein sequencing by tandem mass spectra assembly","volume":"76","author":"Bandeira","year":"2004","journal-title":"Anal. Chem"},{"key":"2023020210410171000_B5","doi-asserted-by":"crossref","first-page":"1123","DOI":"10.1074\/mcp.M700001-MCP200","article-title":"Shotgun protein sequencing: assembly of tandem mass spectra from mixtures of modified proteins","volume":"6","author":"Bandeira","year":"2007","journal-title":"Mol. Cell Proteomics"},{"key":"2023020210410171000_B6","doi-asserted-by":"crossref","first-page":"6140","DOI":"10.1073\/pnas.0701130104","article-title":"Protein identification via spectral networks analysis","volume":"104","author":"Bandeira","year":"2007","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210410171000_B7","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1002\/bms.1200190607","article-title":"Fast algorithm for peptide sequencing by mass spectroscopy","volume":"19","author":"Bartels","year":"1990","journal-title":"Biomedical and Environmental Mass Spectrometry"},{"key":"2023020210410171000_B8","doi-asserted-by":"crossref","DOI":"10.1074\/mcp.M600419-MCP200","article-title":"The diversity of bioactive proteins in australian snake venoms","volume-title":"Mol. Cell Proteomics.","author":"Birrell","year":"2007"},{"key":"2023020210410171000_B9","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1089\/10665270152530872","article-title":"A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry","volume":"8","author":"Chen","year":"2001","journal-title":"J. Comput. Biol"},{"key":"2023020210410171000_B10","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1089\/106652799318300","article-title":"De novo peptide sequencing via tandem mass spectrometry","volume":"6","author":"Danc\u00edk","year":"1999","journal-title":"J. Comput. Biol"},{"key":"2023020210410171000_B11","volume-title":"Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids.","author":"Durbin","year":"1999"},{"key":"2023020210410171000_B12","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1038\/msb4100142","article-title":"Novel peptide identification from tandem mass spectra using ests and sequence database compression","volume":"3","author":"Edwards","year":"2007","journal-title":"Mol. Syst. Biol"},{"key":"2023020210410171000_B13","first-page":"427","article-title":"A computer program to aid the sequencing of peptides in collision-activated decomposition experiments","volume":"11","author":"Fern\u00e1ndez-de Coss\u00edo","year":"1995","journal-title":"Comput. Appl. Biosci"},{"key":"2023020210410171000_B14","doi-asserted-by":"crossref","first-page":"7265","DOI":"10.1021\/ac0508853","article-title":"Novohmm: a hidden markov model for de novo peptide sequencing","volume":"77","author":"Fischer","year":"2005","journal-title":"Anal. Chem"},{"key":"2023020210410171000_B15","doi-asserted-by":"crossref","first-page":"964","DOI":"10.1021\/ac048788h","article-title":"PepNovo: de novo peptide sequencing via probabilistic network modeling","volume":"77","author":"Frank","year":"2005","journal-title":"Anal. Chem"},{"key":"2023020210410171000_B16","doi-asserted-by":"crossref","first-page":"1362","DOI":"10.1101\/gr.6427907","article-title":"Whole proteome analysis of post-translational modifications: applications of mass-spectrometry for proteogenomic annotation","volume":"17","author":"Gupta","year":"2007","journal-title":"Genome Res"},{"key":"2023020210410171000_B17","doi-asserted-by":"crossref","first-page":"655","DOI":"10.1016\/j.drudis.2006.05.009","article-title":"Recombinant polyclonal antibodies: the next generation of antibody therapeutics?","volume":"11","author":"Haurum","year":"2006","journal-title":"Drug. Discov. Today"},{"key":"2023020210410171000_B18","doi-asserted-by":"crossref","first-page":"2795","DOI":"10.1073\/pnas.0436605100","article-title":"Detection of secreted peptides by using hypothesis-driven multistage mass spectrometry","volume":"100","author":"Kalkum","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210410171000_B19","doi-asserted-by":"crossref","first-page":"5162","DOI":"10.1021\/ac980823v","article-title":"C-terminal peptide sequencing via multistage mass spectrometry","volume":"70","author":"Lin","year":"1998","journal-title":"Anal. Chem"},{"key":"2023020210410171000_B20","doi-asserted-by":"crossref","first-page":"2337","DOI":"10.1002\/rcm.1196","article-title":"PEAKS: powerful software for peptide de novo sequencing by tandem mass spectrometry","volume":"17","author":"Ma","year":"2003","journal-title":"Rapid Commun. Mass Spectrom"},{"key":"2023020210410171000_B21","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1146\/annurev.genet.39.073003.110544","article-title":"Immunoglobulin gene diversification","volume":"39","author":"Maizels","year":"2005","journal-title":"Annu. Rev. Genet"},{"key":"2023020210410171000_B22","doi-asserted-by":"crossref","first-page":"4870","DOI":"10.1021\/ac070039n","article-title":"Msnovo: a dynamic programming algorithm for de novo peptide sequencing via tandem mass spectrometry","volume":"79","author":"Mo","year":"2007","journal-title":"Anal. Chem"},{"key":"2023020210410171000_B23","doi-asserted-by":"crossref","first-page":"13417","DOI":"10.1073\/pnas.0405549101","article-title":"Improved peptide identification in proteomics by two consecutive stages of mass spectrometric fragmentation","volume":"101","author":"Olsen","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210410171000_B24","doi-asserted-by":"crossref","first-page":"3551","DOI":"10.1002\/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2","article-title":"Probability-based protein identification by searching sequence databases using mass spectrometry data","volume":"20","author":"Perkins","year":"1999","journal-title":"Electrophoresis"},{"key":"2023020210410171000_B25","doi-asserted-by":"crossref","first-page":"3018","DOI":"10.1021\/pr060222h","article-title":"Performance evaluation of existing de novo sequencing algorithms","volume":"5","author":"Pevtsov","year":"2006","journal-title":"J. Proteome. Res"},{"key":"2023020210410171000_B26","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1016\/j.ab.2006.02.001","article-title":"De novo proteomic sequencing of a monoclonal antibody raised against ox40 ligand","volume":"352","author":"Pham","year":"2006","journal-title":"Anal. Biochem"},{"key":"2023020210410171000_B27","doi-asserted-by":"crossref","first-page":"670","DOI":"10.1002\/psc.701","article-title":"Small peptides, big world: biotechnological potential in neglected bioactive peptides from arthropod venoms","volume":"11","author":"Pimenta","year":"2005","journal-title":"J. Pept. Sci"},{"key":"2023020210410171000_B28","doi-asserted-by":"crossref","first-page":"1015","DOI":"10.1002\/(SICI)1097-0231(19970615)11:9<1015::AID-RCM958>3.0.CO;2-H","article-title":"Rapid \u2018de novo\u2019 peptide sequencing by a combination of nanoelectrospray, isotopic labeling and a quadrupole\/time-of-flight mass spectrometer","volume":"11","author":"Shevchenko","year":"1997","journal-title":"Rapid. Commun. Mass. Spectrom"},{"key":"2023020210410171000_B29","doi-asserted-by":"crossref","first-page":"4626","DOI":"10.1021\/ac050102d","article-title":"InsPecT: identification of posttranslationally modified peptides from tandem mass spectra","volume":"77","author":"Tanner","year":"2005","journal-title":"Anal. Chem"},{"key":"2023020210410171000_B30","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1101\/gr.5646507","article-title":"Improving gene annotation using peptide mass spectrometry","volume":"17","author":"Tanner","year":"2007","journal-title":"Genome Res"},{"key":"2023020210410171000_B31","doi-asserted-by":"crossref","first-page":"1562","DOI":"10.1038\/nbt1168","article-title":"Identification of post-translational modifications by blind search of mass spectra","volume":"23","author":"Tsur","year":"2005","journal-title":"Nat. Biotechnol"},{"key":"2023020210410171000_B32","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1074\/mcp.M700128-MCP200","article-title":"Investigating ms2-ms3 matching statistics: A model for coupling consecutive stage mass spectrometry data for increased peptide identification confidence","volume":"7","author":"Ulintz","year":"2008","journal-title":"Mol. Cell Proteomics"},{"key":"2023020210410171000_B33","doi-asserted-by":"crossref","first-page":"260","DOI":"10.1109\/TIT.1967.1054010","article-title":"Error bounds for convolutional codes and an asymptotically optimum decoding algorithm","volume":"13","author":"Viterbi","year":"1967","journal-title":"IEEE Transactions on Information Theory"},{"key":"2023020210410171000_B34","doi-asserted-by":"crossref","first-page":"3202","DOI":"10.1021\/ac00114a016","article-title":"Mining genomes: correlating tandem mass spectra of modified and unmodified peptides to sequences in nucleotide databases","volume":"67","author":"Yates","year":"1995","journal-title":"Anal. Chem"},{"key":"2023020210410171000_B35","doi-asserted-by":"crossref","first-page":"2337","DOI":"10.1021\/ac000226k","article-title":"De novo peptide sequencing by two-dimensional fragment correlation mass spectrometry","volume":"72","author":"Zhang","year":"2000","journal-title":"Anal. Chem"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i416\/49054238\/bioinformatics_24_13_i416.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i416\/49054238\/bioinformatics_24_13_i416.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T12:26:33Z","timestamp":1675340793000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/13\/i416\/236294"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,7,1]]},"references-count":35,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2008,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn184","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,7,1]]},"published":{"date-parts":[[2008,7,1]]}}}