{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,23]],"date-time":"2026-01-23T17:51:35Z","timestamp":1769190695751,"version":"3.49.0"},"reference-count":27,"publisher":"Springer Science and Business Media LLC","issue":"S2","license":[{"start":{"date-parts":[[2021,4,1]],"date-time":"2021-04-01T00:00:00Z","timestamp":1617235200000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2021,4,26]],"date-time":"2021-04-26T00:00:00Z","timestamp":1619395200000},"content-version":"vor","delay-in-days":25,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2021,4]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Mass spectrometry remains the privileged method to characterize proteins. Nevertheless, most of the spectra generated by an experiment remain unidentified after their analysis, mostly because of the modifications they carry. Open Modification Search (OMS) methods offer a promising answer to this problem. However, assessing the quality of OMS identifications remains a difficult task.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Methods<\/jats:title>\n                <jats:p>Aiming at better understanding the relationship between (1) similarity of pairs of spectra provided by OMS methods and (2) relevance of their corresponding peptide sequences, we used a dataset composed of theoretical spectra only, on which we applied two OMS strategies. 
We also introduced two appropriately defined measures for evaluating the above-mentioned spectra\/sequence relevance in this context: one is a color classification representing the level of difficulty to retrieve the proper sequence of the peptide that generated the identified spectrum; the other, called LIPR, is the proportion of common masses, in a given Peptide Spectrum Match (PSM), that represent dissimilar sequences. These two measures were also considered in conjunction with the False Discovery Rate (FDR).<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>According to our measures, the strategy that selects the best candidate by taking the mass difference between two spectra into account yields better quality results. Besides, although the FDR remains an interesting indicator in OMS methods (as shown by LIPR), it is questionable: indeed, our color classification shows that a non-negligible proportion of relevant spectra\/sequence interpretations corresponds to PSMs coming from the decoy database.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>The three above-mentioned measures allowed us to clearly determine which of the two studied OMS strategies outperformed the other, both in terms of number of identifications and of accuracy of these identifications. 
Even though quality evaluation of PSMs in OMS methods remains challenging, the study of theoretical spectra is a favorable framework for going further in this direction.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-021-03963-6","type":"journal-article","created":{"date-parts":[[2021,4,26]],"date-time":"2021-04-26T07:02:49Z","timestamp":1619420569000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Evaluation of open search methods based on theoretical mass spectra comparison"],"prefix":"10.1186","volume":"22","author":[{"given":"Albane","family":"Lysiak","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8251-2012","authenticated-orcid":false,"given":"Guillaume","family":"Fertin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"G\u00e9raldine","family":"Jean","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dominique","family":"Tessier","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,4,26]]},"reference":[{"issue":"6","key":"3963_CR1","doi-asserted-by":"publisher","first-page":"565","DOI":"10.1002\/wsbm.1185","volume":"4","author":"S Prabakaran","year":"2012","unstructured":"Prabakaran S, Lippens G, Steen H, Gunawardena J. Post-translational modification: nature\u2019s escape from genetic imprisonment and the basis for dynamic information encoding. Wiley Interdiscip Rev Syst Biol Med. 2012;4(6):565\u201383.","journal-title":"Wiley Interdiscip Rev Syst Biol Med"},{"issue":"3","key":"3963_CR2","doi-asserted-by":"publisher","first-page":"186","DOI":"10.1038\/nmeth.2369","volume":"10","author":"LM Smith","year":"2013","unstructured":"Smith LM, Kelleher NL, Consortium for Top Down Proteomics. 
Proteoform: a single term describing protein complexity. Nat Methods. 2013;10(3):186\u20137.","journal-title":"Nat Methods"},{"issue":"8","key":"3963_CR3","doi-asserted-by":"publisher","first-page":"651","DOI":"10.1038\/nmeth.3902","volume":"13","author":"J Griss","year":"2016","unstructured":"Griss J, Perez-Riverol Y, Lewis S, Tabb DL, Dianes JA, Del-Toro N, et al. Recognizing millions of consistently unidentified spectra across hundreds of shotgun proteomics datasets. Nat Methods. 2016;13(8):651\u20136.","journal-title":"Nat Methods"},{"issue":"7","key":"3963_CR4","doi-asserted-by":"publisher","first-page":"743","DOI":"10.1038\/nbt.3267","volume":"33","author":"JM Chick","year":"2015","unstructured":"Chick JM, Kolippakkam D, Nusinow DP, Zhai B, Rad R, Huttlin EL, et al. An ultra-tolerant database search reveals that a myriad of modified peptides contributes to unassigned spectra in shotgun proteomics. Nat Biotechnol. 2015;33(7):743\u20139.","journal-title":"Nat Biotechnol"},{"key":"3963_CR5","doi-asserted-by":"crossref","unstructured":"Tsur D, Tanner S, Zandi E, Bafna V, Pevzner PA. Identification of post-translational modifications via blind search of mass-spectra. In: Proceedings IEEE computational systems bioinformatics conference. 2005; p. 157\u201366.","DOI":"10.1109\/CSB.2005.34"},{"issue":"2","key":"3963_CR6","doi-asserted-by":"publisher","first-page":"546","DOI":"10.1021\/pr049781j","volume":"4","author":"BC Searle","year":"2005","unstructured":"Searle BC, Dasari S, Wilmarth PA, Turner M, Reddy AP, David LL, et al. Identification of protein modifications using MS\/MS de novo sequencing and the OpenSea alignment algorithm. J Proteome Res. 2005;4(2):546\u201354.","journal-title":"J Proteome Res"},{"issue":"3","key":"3963_CR7","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1038\/nmeth1019","volume":"4","author":"JE Elias","year":"2007","unstructured":"Elias JE, Gygi SP. 
Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat Methods. 2007;4(3):207\u201314.","journal-title":"Nat Methods"},{"issue":"3","key":"3963_CR8","doi-asserted-by":"publisher","first-page":"721","DOI":"10.1021\/acs.jproteome.5b00877","volume":"15","author":"O Horlacher","year":"2016","unstructured":"Horlacher O, Lisacek F, M\u00fcller M. Mining large scale tandem mass spectrometry data for protein modifications using spectral libraries. J Proteome Res. 2016;15(3):721\u201331.","journal-title":"J Proteome Res"},{"issue":"5","key":"3963_CR9","doi-asserted-by":"publisher","first-page":"1924","DOI":"10.1021\/acs.jproteome.6b00988","volume":"16","author":"MC Burke","year":"2017","unstructured":"Burke MC, Mirokhin YA, Tchekhovskoi DV, Markey SP, Heidbrink Thompson J, Larkin C, et al. The hybrid search: a mass spectral library search method for discovery of modifications in proteomics. J Proteome Res. 2017;16(5):1924\u201335.","journal-title":"J Proteome Res"},{"issue":"10","key":"3963_CR10","doi-asserted-by":"publisher","first-page":"3463","DOI":"10.1021\/acs.jproteome.8b00359","volume":"17","author":"W Bittremieux","year":"2018","unstructured":"Bittremieux W, Meysman P, Noble WS, Laukens K. Fast open modification spectral library searching through approximate nearest neighbor indexing. J Proteome Res. 2018;17(10):3463\u201374.","journal-title":"J Proteome Res"},{"issue":"10","key":"3963_CR11","doi-asserted-by":"publisher","first-page":"3792","DOI":"10.1021\/acs.jproteome.9b00291","volume":"18","author":"W Bittremieux","year":"2019","unstructured":"Bittremieux W, Laukens K, Noble WS. Extremely fast and accurate open modification spectral library searching of high-resolution mass spectra using feature hashing and graphics processing units. J Proteome Res. 
2019;18(10):3792\u20139.","journal-title":"J Proteome Res"},{"issue":"5","key":"3963_CR12","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1038\/nmeth.4256","volume":"14","author":"AT Kong","year":"2017","unstructured":"Kong AT, Leprevost FV, Avtonomov DM, Mellacheruvu D, Nesvizhskii AI. MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nat Methods. 2017;14(5):513\u201320.","journal-title":"Nat Methods"},{"issue":"5","key":"3963_CR13","doi-asserted-by":"publisher","first-page":"1844","DOI":"10.1021\/acs.jproteome.7b00873","volume":"17","author":"SK Solntsev","year":"2018","unstructured":"Solntsev SK, Shortreed MR, Frey BL, Smith LM. Enhanced global post-translational modification discovery with MetaMorpheus. J Proteome Res. 2018;17(5):1844\u201351.","journal-title":"J Proteome Res"},{"issue":"8","key":"3963_CR14","doi-asserted-by":"publisher","first-page":"3030","DOI":"10.1021\/acs.jproteome.7b00308","volume":"16","author":"M David","year":"2017","unstructured":"David M, Fertin G, Rogniaux H, Tessier D. SpecOMS: a full open modification search method performing all-to-all spectra comparisons within minutes. J Proteome Res. 2017;16(8):3030\u20138. https:\/\/doi.org\/10.1021\/acs.jproteome.7b00308.","journal-title":"J Proteome Res"},{"key":"3963_CR15","doi-asserted-by":"publisher","first-page":"1059","DOI":"10.1038\/nbt.4236","volume":"36","author":"H Chi","year":"2018","unstructured":"Chi H, Liu C, Yang H, Zeng WF, Wu L, Zhou WJ, et al. Comprehensive identification of peptides in tandem mass spectra using an efficient open search engine. Nat Biotechnol. 2018;36:1059\u201361.","journal-title":"Nat Biotechnol"},{"issue":"17","key":"3963_CR16","doi-asserted-by":"publisher","first-page":"11324","DOI":"10.1021\/acs.analchem.9b02445","volume":"91","author":"S Na","year":"2019","unstructured":"Na S, Kim J, Paek E. 
MODplus: robust and unrestrictive identification of post-translational modifications using mass spectrometry. Anal Chem. 2019;91(17):11324\u201333.","journal-title":"Anal Chem"},{"issue":"4","key":"3963_CR17","doi-asserted-by":"publisher","first-page":"469","DOI":"10.1038\/s41587-019-0067-5","volume":"37","author":"A Devabhaktuni","year":"2019","unstructured":"Devabhaktuni A, Lin S, Zhang L, Swaminathan K, Gonzalez CG, Olsson N, et al. TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry datasets. Nat Biotechnol. 2019;37(4):469\u201379.","journal-title":"Nat Biotechnol"},{"key":"3963_CR18","doi-asserted-by":"publisher","first-page":"116266","DOI":"10.1016\/j.ijms.2019.116266","volume":"448","author":"DL Tabb","year":"2020","unstructured":"Tabb DL, Murugan BD, Okendo J, Nair O, Blackburn JM, Buthelezi SG, et al. Open search unveils modification patterns in formalin-fixed, paraffin-embedded thermo HCD and SCIEX TripleTOF shotgun proteomes. Int J Mass Spectrom. 2020;448:116266.","journal-title":"Int J Mass Spectrom"},{"issue":"7","key":"3963_CR19","doi-asserted-by":"publisher","first-page":"605","DOI":"10.1038\/nmeth.3450","volume":"12","author":"WS Noble","year":"2015","unstructured":"Noble WS. Mass spectrometrists should search only for peptides they care about. Nat Methods. 2015;12(7):605\u20138.","journal-title":"Nat Methods"},{"issue":"7","key":"3963_CR20","doi-asserted-by":"publisher","first-page":"643","DOI":"10.1038\/nmeth.4338","volume":"14","author":"A Sticker","year":"2017","unstructured":"Sticker A, Martens L, Clement L. Mass spectrometrists should search for all peptides, but assess only the ones they care about. Nat Methods. 2017;14(7):643\u20134.","journal-title":"Nat Methods"},{"key":"3963_CR21","doi-asserted-by":"crossref","unstructured":"Fertin G, David M, Rogniaux H, Tessier DT. MS\/MS spectra interpretation and the interest of SpecFit for identifying uncommon modifications. 
In: Proceedings 16th international conference on computational intelligence methods for bioinformatics (CIBB\u201919). LNBI. Springer; 2020.","DOI":"10.1007\/978-3-030-63061-4_8"},{"issue":"5","key":"3963_CR22","doi-asserted-by":"publisher","first-page":"700","DOI":"10.1002\/pmic.201500355","volume":"16","author":"MS Kim","year":"2016","unstructured":"Kim MS, Zhong J, Pandey A. Common errors in mass spectrometry-based analysis of post-translational modifications. Proteomics. 2016;16(5):700\u201314.","journal-title":"Proteomics"},{"issue":"10","key":"3963_CR23","doi-asserted-by":"publisher","first-page":"1419","DOI":"10.1074\/mcp.R500012-MCP200","volume":"4","author":"AI Nesvizhskii","year":"2005","unstructured":"Nesvizhskii AI, Aebersold R. Interpretation of shotgun proteomic data: the protein inference problem. Mol Cell Proteomics. 2005;4(10):1419\u201340.","journal-title":"Mol Cell Proteomics"},{"issue":"5","key":"3963_CR24","doi-asserted-by":"publisher","first-page":"586","DOI":"10.1093\/bib\/bbs004","volume":"13","author":"T Huang","year":"2012","unstructured":"Huang T, Wang J, Yu W, He Z. Protein inference: a review. Brief Bioinform. 2012;13(5):586\u2013614.","journal-title":"Brief Bioinform"},{"key":"3963_CR25","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1007\/978-3-319-43681-4_6","volume-title":"Algorithms in bioinformatics. Lecture notes in computer science","author":"M David","year":"2016","unstructured":"David M, Fertin G, Tessier D. SpecTrees: an efficient without a priori data structure for MS\/MS spectra identification. In: Frith M, Storm Pedersen CN, editors. Algorithms in bioinformatics. Lecture notes in computer science. Cham: Springer International Publishing; 2016. p. 65\u201376."},{"issue":"4","key":"3963_CR26","doi-asserted-by":"publisher","first-page":"671","DOI":"10.1002\/pmic.200900502","volume":"10","author":"E Ahrn\u00e9","year":"2010","unstructured":"Ahrn\u00e9 E, M\u00fcller M, Lisacek F. 
Unrestricted identification of modified proteins using MS\/MS. Proteomics. 2010;10(4):671\u201386.","journal-title":"Proteomics"},{"issue":"D1","key":"3963_CR27","first-page":"D682","volume":"48","author":"AD Yates","year":"2020","unstructured":"Yates AD, Achuthan P, Akanni W, Allen J, Allen J, Alvarez-Jarreta J, et al. Ensembl 2020. Nucleic Acids Res. 2020;48(D1):D682\u20138.","journal-title":"Nucleic Acids Res"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-021-03963-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-021-03963-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-021-03963-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,4,26]],"date-time":"2021-04-26T07:03:21Z","timestamp":1619420601000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-021-03963-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4]]},"references-count":27,"journal-issue":{"issue":"S2","published-print":{"date-parts":[[2021,4]]}},"alternative-id":["3963"],"URL":"https:\/\/doi.org\/10.1186\/s12859-021-03963-6","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,4]]},"assertion":[{"value":"16 December 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 January 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 April 
2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"65"}}