{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T04:57:22Z","timestamp":1761541042005},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"8","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,4,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Although many methods and statistical approaches have been developed for protein identification by mass spectrometry, the problem of accurate assessment of statistical significance of protein identifications remains an open question. The main issues are as follows: (i) statistical significance of inferring peptide from experimental mass spectra must be platform independent and spectrum specific and (ii) individual spectrum matches at the peptide level must be combined into a single statistical measure at the protein level.<\/jats:p>\n               <jats:p>Results: We present a method and software to assign statistical significance to protein identifications from search engines for mass spectrometric data. The approach is based on asymptotic theory of order statistics. The parameters of the asymptotic distributions of identification scores are estimated for each spectrum individually. The method relies on new unbiased estimators for parameters of extreme value distribution. The estimated parameters are used to assign a spectrum-specific P-value to each peptide-spectrum match. The protein-level confidence measure combines P-values of peptide-to-spectrum matches.<\/jats:p>\n               <jats:p>Conclusion: We extensively tested the method using triplicate mouse and yeast high-throughput proteomic experiments. The proposed statistical approach improves the sensitivity of protein identifications without compromising specificity. While the method was primarily designed to work with Mascot, it is platform-independent and is applicable to any search engine which outputs a single score for a peptide-spectrum match. We demonstrate this by testing the method in conjunction with X!Tandem.<\/jats:p>\n               <jats:p>Availability: The software is available for download at ftp:\/\/genetics.bwh.harvard.edu\/SSPV\/.<\/jats:p>\n               <jats:p>Contact: \u00a0ssunyaev@rics.bwh.harvard.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr089","type":"journal-article","created":{"date-parts":[[2011,2,25]],"date-time":"2011-02-25T01:38:20Z","timestamp":1298597900000},"page":"1128-1134","source":"Crossref","is-referenced-by-count":33,"title":["Assigning spectrum-specific <i>P<\/i>-values to protein identifications by mass spectrometry"],"prefix":"10.1093","volume":"27","author":[{"given":"Victor","family":"Spirin","sequence":"first","affiliation":[{"name":"1 Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, 2Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02143, 3Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115, USA and 4MPI of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany"}]},{"given":"Alexander","family":"Shpunt","sequence":"additional","affiliation":[{"name":"1 Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, 2Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02143, 3Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115, USA and 4MPI of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany"},{"name":"1 Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, 2Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02143, 3Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115, USA and 4MPI of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany"}]},{"given":"Jan","family":"Seebacher","sequence":"additional","affiliation":[{"name":"1 Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, 2Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02143, 3Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115, USA and 4MPI of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany"}]},{"given":"Marc","family":"Gentzel","sequence":"additional","affiliation":[{"name":"1 Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, 2Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02143, 3Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115, USA and 4MPI of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany"}]},{"given":"Andrej","family":"Shevchenko","sequence":"additional","affiliation":[{"name":"1 Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, 2Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02143, 3Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115, USA and 4MPI of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany"}]},{"given":"Steven","family":"Gygi","sequence":"additional","affiliation":[{"name":"1 Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, 2Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02143, 3Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115, USA and 4MPI of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany"}]},{"given":"Shamil","family":"Sunyaev","sequence":"additional","affiliation":[{"name":"1 Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115, 2Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02143, 3Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115, USA and 4MPI of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany"}]}],"member":"286","published-online":{"date-parts":[[2011,2,23]]},"reference":[{"key":"2023061311473711700_B1","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1021\/pr0255654","article-title":"A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: support vector machine classification of peptide MS\/MS spectra and SEQUEST scores","volume":"2","author":"Anderson","year":"2003","journal-title":"J. Proteome Res."},{"key":"2023061311473711700_B2","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1186\/1745-6150-2-25","article-title":"RAId_DbS: peptide identification using database searches with realistic statistics","volume":"2","author":"Alves","year":"2007","journal-title":"Biol. Direct"},{"key":"2023061311473711700_B3","volume-title":"A First Course in Order Statistics.","author":"Arnold","year":"1992"},{"key":"2023061311473711700_B4","doi-asserted-by":"crossref","first-page":"1409","DOI":"10.1007\/s00216-007-1563-x","article-title":"The effect of mass accuracy, data acquisition speed, and search algorithm choice on peptide identification rates in phosphoproteomics","volume":"389","author":"Bakalarski","year":"2007","journal-title":"Anal. Bioanal. Chem."},{"key":"2023061311473711700_B5","doi-asserted-by":"crossref","first-page":"705","DOI":"10.1089\/cmb.2007.0119","article-title":"Improved ranking functions for protein and modification-site identification","volume":"15","author":"Bern","year":"2008","journal-title":"J. Comput. Biol."},{"key":"2023061311473711700_B6","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1074\/mcp.T400006-MCP200","article-title":"The need for guidelines in publication of peptide and protein identification data: working group on publication guidelines for peptide and protein identification data","volume":"3","author":"Carr","year":"2004","journal-title":"Mol. Cell. Proteomics"},{"key":"2023061311473711700_B7","doi-asserted-by":"crossref","first-page":"1367","DOI":"10.1038\/nbt.1511","article-title":"MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification","volume":"26","author":"Cox","year":"2008","journal-title":"Nat. Biotechonol."},{"key":"2023061311473711700_B8","doi-asserted-by":"crossref","first-page":"1466","DOI":"10.1093\/bioinformatics\/bth092","article-title":"TANDEM: matching proteins with tandem mass spectra","volume":"20","author":"Craig","year":"2004","journal-title":"Bioinformatics"},{"key":"2023061311473711700_B9","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1038\/nmeth1019","article-title":"Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry","volume":"4","author":"Elias","year":"2007","journal-title":"Nat. Methods"},{"key":"2023061311473711700_B10","doi-asserted-by":"crossref","first-page":"4598","DOI":"10.1021\/pr800420s","article-title":"A Fast SEQUEST cross correlation algorithm","volume":"7","author":"Eng","year":"2008","journal-title":"J. Proteome Res."},{"key":"2023061311473711700_B11","doi-asserted-by":"crossref","first-page":"3901","DOI":"10.1021\/ac070202e","article-title":"Probability model for assessing proteins assembled from peptide sequences inferred from tandem mass spectrometry data","volume":"79","author":"Feng","year":"2007","journal-title":"Anal. Chem."},{"key":"2023061311473711700_B12","doi-asserted-by":"crossref","first-page":"958","DOI":"10.1021\/pr0499491","article-title":"Open mass spectrometry search algorithm","volume":"3","author":"Geer","year":"2004","journal-title":"J. Proteome Res."},{"key":"2023061311473711700_B13","volume-title":"Statistics of Extremes.","author":"Gumbel","year":"2004"},{"key":"2023061311473711700_B14","doi-asserted-by":"crossref","first-page":"923","DOI":"10.1038\/nmeth1113","article-title":"Semi-supervised learning for peptide identification from shotgun proteomics datasets","volume":"4","author":"Kall","year":"2007","journal-title":"Nat. Methods"},{"key":"2023061311473711700_B15","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1021\/pr700600n","article-title":"Assigning significance to peptides identified by tandem mass spectrometry using decoy databases","volume":"7","author":"Kall","year":"2008","journal-title":"J. Proteome Res."},{"key":"2023061311473711700_B16","doi-asserted-by":"crossref","first-page":"5383","DOI":"10.1021\/ac025747h","article-title":"Empirical statistical model to estimate the accuracy of peptide identifications made by MS\/MS and database search","volume":"74","author":"Keller","year":"2002","journal-title":"Anal. Chem."},{"key":"2023061311473711700_B17","doi-asserted-by":"crossref","first-page":"3354","DOI":"10.1021\/pr8001244","article-title":"Spectral probabilities and generating functions of tandem mass spectra: a strike against decoy databases","volume":"7","author":"Kim","year":"2009","journal-title":"J. Proteome Res."},{"key":"2023061311473711700_B18","doi-asserted-by":"crossref","first-page":"2106","DOI":"10.1021\/pr8011107","article-title":"Statistical calibration of the SEQUEST Xcorr funtion","volume":"8","author":"Klammer","year":"2009","journal-title":"J. Proteome Res."},{"key":"2023061311473711700_B19","doi-asserted-by":"crossref","first-page":"378","DOI":"10.1016\/S1044-0305(02)00352-5","article-title":"Qscore: an algorithm for evaluating SEQUEST database search results","volume":"13","author":"Moore","year":"2002","journal-title":"J. Am. Soc. Mass. Spectrom."},{"key":"2023061311473711700_B20","doi-asserted-by":"crossref","first-page":"1419","DOI":"10.1074\/mcp.R500012-MCP200","article-title":"Interpretation of shotgun proteomic data. The protein inference problem","volume":"4","author":"Nesvizhskii","year":"2005","journal-title":"Mol. Cell. Proteomics"},{"key":"2023061311473711700_B21","doi-asserted-by":"crossref","first-page":"4646","DOI":"10.1021\/ac0341261","article-title":"A statistical model for identifying proteins by tandem mass spectrometry","volume":"75","author":"Nesvizhskii","year":"2003","journal-title":"Anal. Chem."},{"key":"2023061311473711700_B22","doi-asserted-by":"crossref","first-page":"3022","DOI":"10.1021\/pr800127y","article-title":"Rapid and accurate peptide identification from tandem mass spectra","volume":"7","author":"Park","year":"2008","journal-title":"J. Proteome Res."},{"key":"2023061311473711700_B23","doi-asserted-by":"crossref","first-page":"1748","DOI":"10.1074\/mcp.M800122-MCP200","article-title":"Generalized method for probability-based peptide and protein identification from tandem mass spectrometry data and sequence database searching","volume":"7","author":"Ramos-Fernandez","year":"2008","journal-title":"Mol. Cell. Proteomics"},{"key":"2023061311473711700_B24","doi-asserted-by":"crossref","first-page":"2405","DOI":"10.1074\/mcp.M900317-MCP200","article-title":"Protein identification false discovery rates for very large proteomics data sets generated by tandem mass spectrometry","volume":"8","author":"Reiter","year":"2009","journal-title":"Mol. Cell. Proteomics"},{"key":"2023061311473711700_B25","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1021\/pr070540w","article-title":"Improving sensitivity by probabilistically combining results from multiple MS\/MS search methodologies","volume":"7","author":"Searle","year":"2008","journal-title":"J. Proteome Res."},{"key":"2023061311473711700_B26","doi-asserted-by":"crossref","first-page":"762","DOI":"10.1074\/mcp.M400215-MCP200","article-title":"A euristic method for assigning a false-discovery rates for protein identifications from Mascot database search results","volume":"4","author":"Weatherly","year":"2005","journal-title":"Mol. Cell. Proteomics"},{"key":"2023061311473711700_B27","doi-asserted-by":"crossref","first-page":"1368","DOI":"10.1111\/j.1420-9101.2005.00917.x","article-title":"Combining probability from independent tests: the weighted Z-method is superior to Fisher's approach","volume":"18","author":"Whitlock","year":"2005","journal-title":"J. Evol. Biol."},{"key":"2023061311473711700_B28","doi-asserted-by":"crossref","first-page":"3549","DOI":"10.1021\/pr070230d","article-title":"Proteomic parsimony through bipartite graph analysis improves accuracy and transparency","volume":"6","author":"Zhang","year":"2007","journal-title":"J. Proteome Res."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/8\/1128\/50580150\/bioinformatics_27_8_1128.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/8\/1128\/50580150\/bioinformatics_27_8_1128.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,13]],"date-time":"2023-06-13T11:48:48Z","timestamp":1686656928000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/8\/1128\/228019"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,2,23]]},"references-count":28,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2011,4,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr089","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2011,4,15]]},"published":{"date-parts":[[2011,2,23]]}}}