{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,26]],"date-time":"2025-10-26T20:36:26Z","timestamp":1761510986048},"reference-count":27,"publisher":"Oxford University Press (OUP)","issue":"17","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2014,9,1]]},"abstract":"<jats:p>Motivation: In liquid chromatography\u2013mass spectrometry\/tandem mass spectrometry (LC-MS\/MS), it is necessary to link tandem MS-identified peptide peaks so that protein expression changes between the two runs can be tracked. However, only a small number of peptides can be identified and linked by tandem MS in two runs, and it becomes necessary to link peptide peaks with tandem identification in one run to their corresponding ones in another run without identification. In the past, peptide peaks have been linked based on similarities in retention time (rt), mass or peak shape after rt alignment, which corrects mean rt shifts between runs. However, the accuracy in linking is still limited, especially for complex samples collected from different conditions. Consequently, large-scale proteomics studies that require comparison of protein expression profiles of hundreds of patients cannot be carried out effectively.<\/jats:p>\n               <jats:p>Method: In this article, we consider the problem of linking peptides from a pair of LC-MS\/MS runs and propose a new method, PeakLink (PL), which uses information in both the time and frequency domain as inputs to a non-linear support vector machine (SVM) classifier. The PL algorithm first uses a threshold on an rt likelihood ratio score to remove candidate corresponding peaks with excessively large elution time shifts; PL then calculates the correlation between a pair of candidate peaks after reducing noise through wavelet transformation. 
After converting rt and peak shape correlation to statistical scores, an SVM classifier is trained and applied for differentiating corresponding and non-corresponding peptide peaks.<\/jats:p>\n               <jats:p>Results: PL is tested in multiple challenging cases, in which LC-MS\/MS samples are collected from different disease states, different instruments and different laboratories. Testing results show significant improvement in linking accuracy compared with other algorithms.<\/jats:p>\n               <jats:p>Availability and implementation: M files for the PL alignment method are available at http:\/\/compgenomics.utsa.edu\/zgroup\/PeakLink<\/jats:p>\n               <jats:p>Contact: Michelle.Zhang@utsa.edu<\/jats:p>\n               <jats:p>Supplementary information: Supplementary Data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btu299","type":"journal-article","created":{"date-parts":[[2014,5,10]],"date-time":"2014-05-10T00:17:52Z","timestamp":1399681072000},"page":"2464-2470","source":"Crossref","is-referenced-by-count":5,"title":["PeakLink: a new peptide peak linking method in LC-MS\/MS using wavelet and SVM"],"prefix":"10.1093","volume":"30","author":[{"given":"Mehrab","family":"Ghanat Bari","sequence":"first","affiliation":[{"name":"Department of Electrical and Computer Engineering, The University of Texas at San Antonio, San Antonio, TX 78246, USA"}]},{"given":"Xuepo","family":"Ma","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, The University of Texas at San Antonio, San Antonio, TX 78246, USA"}]},{"given":"Jianqiu","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, The University of Texas at San Antonio, San Antonio, TX 78246, 
USA"}]}],"member":"286","published-online":{"date-parts":[[2014,5,9]]},"reference":[{"key":"2023012711525369800_btu299-B2","doi-asserted-by":"crossref","first-page":"1902","DOI":"10.1093\/bioinformatics\/btl276","article-title":"A suite of algorithms for the comprehensive analysis of complex protein mixtures using high-resolution LC-MS","volume":"22","author":"Bellew","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012711525369800_btu299-B3","doi-asserted-by":"crossref","first-page":"1466","DOI":"10.1093\/bioinformatics\/bth092","article-title":"TANDEM: matching proteins with tandem mass spectra","volume":"20","author":"Craig","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012711525369800_btu299-B4","doi-asserted-by":"crossref","first-page":"1794","DOI":"10.1021\/pr101065j","article-title":"Andromeda: a peptide search engine integrated into the MaxQuant environment","volume":"10","author":"Cox","year":"2011","journal-title":"J. Proteome Res."},{"key":"2023012711525369800_btu299-B5","doi-asserted-by":"crossref","first-page":"1373","DOI":"10.1007\/s13361-011-0142-8","article-title":"Software lock mass by two-dimensional minimization of peptide mass errors","volume":"22","author":"Cox","year":"2011","journal-title":"J. Am. Soc. Mass Spectrom."},{"key":"2023012711525369800_btu299-B6","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1186\/1471-2105-12-439","article-title":"SCFIA: a statistical corresponding feature identification algorithm for LC\/MS","volume":"12","author":"Cui","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023012711525369800_btu299-B7","doi-asserted-by":"crossref","first-page":"976","DOI":"10.1016\/1044-0305(94)80016-2","article-title":"An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database","volume":"5","author":"Eng","year":"1994","journal-title":"J. Am. Soc. 
Mass Spectrom."},{"key":"2023012711525369800_btu299-B8","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1038\/nmeth.1446","article-title":"Super-SILAC mix for quantitative proteomics of human tumor tissue","volume":"7","author":"Geiger","year":"2010","journal-title":"Nat. Methods"},{"key":"2023012711525369800_btu299-B12","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1186\/1471-2105-14-49","article-title":"MultiAlign: a multiple LC-MS analysis tool for targeted omics analysis","volume":"14","author":"LaMarche","year":"2013","journal-title":"BMC Bioinformatics"},{"key":"2023012711525369800_btu299-B13","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1186\/1471-2105-9-375","article-title":"Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements","volume":"9","author":"Lange","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012711525369800_btu299-B14","doi-asserted-by":"crossref","first-page":"1768","DOI":"10.1093\/bioinformatics\/btt274","article-title":"A combinatorial approach to the peptide feature matching problem for label-free quantification","volume":"29","author":"Lin","year":"2013","journal-title":"Bioinformatics"},{"key":"2023012711525369800_btu299-B15","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1016\/S0925-2312(03)00431-4","article-title":"The support vector machine under test","volume":"55","author":"Meyer","year":"2003","journal-title":"Neurocomputing"},{"key":"2023012711525369800_btu299-B16","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1021\/pr900721e","article-title":"MSQuant, an open source platform for mass spectrometry-based quantitative proteomics","volume":"7","author":"Mortensen","year":"2010","journal-title":"J. 
Proteome Res."},{"key":"2023012711525369800_btu299-B17","doi-asserted-by":"crossref","first-page":"3470","DOI":"10.1002\/pmic.200700057","article-title":"SuperHirn- a novel tool for high resolution LC-MS-based peptide\/protein profiling","volume":"7","author":"Mueller","year":"2008","journal-title":"Proteomics"},{"key":"2023012711525369800_btu299-B18","doi-asserted-by":"crossref","DOI":"10.1074\/mcp.M111.013722","article-title":"System-wide perturbation analysis with nearly complete coverage of the yeast proteome by single-shot ultra HPLC runs on a bench top orbitrap","volume":"11","author":"Nagaraj","year":"2012","journal-title":"Mol. Cell. Proteomics"},{"key":"2023012711525369800_btu299-B19","doi-asserted-by":"crossref","first-page":"418","DOI":"10.1109\/TSP.2003.821103","article-title":"ForWaRD: Fourier-wavelet regularized deconvolution for ill-conditioned systems","volume":"52","author":"Neelamani","year":"2004","journal-title":"IEEE Trans. Signal Process"},{"key":"2023012711525369800_btu299-B20","doi-asserted-by":"crossref","first-page":"535","DOI":"10.1002\/pmic.201000553","article-title":"Less label, more free: approaches in label-free quantitative mass spectrometry","volume":"11","author":"Neilson","year":"2011","journal-title":"Proteomics"},{"key":"2023012711525369800_btu299-B22","doi-asserted-by":"crossref","first-page":"621","DOI":"10.2144\/04374RV01","article-title":"Proteomic analyses using an accurate mass and time tag strategy","volume":"37","author":"Pasa-Toli","year":"2004","journal-title":"Biotechniques"},{"key":"2023012711525369800_btu299-B23","doi-asserted-by":"crossref","first-page":"3551","DOI":"10.1002\/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2","article-title":"Probability-based protein identification by searching sequence databases using mass spectrometry 
data","volume":"20","author":"Perkins","year":"1999","journal-title":"Electrophoresis"},{"key":"2023012711525369800_btu299-B24","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1186\/1471-2105-11-395","article-title":"MZmine 2: modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data","volume":"11","author":"Pluskal","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023012711525369800_btu299-B28","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1074\/mcp.M500321-MCP200","article-title":"Simultaneous qualitative and quantitative analysis of the Escherichia coli proteome","volume":"5","author":"Silva","year":"2006","journal-title":"Mol. Cell. Proteomics"},{"key":"2023012711525369800_btu299-B29","article-title":"LC-MS alignment in theory and practice: a comprehensive algorithmic review","author":"Smith","year":"2013","journal-title":"Brief. Bioinform."},{"key":"2023012711525369800_btu299-B30","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1186\/1471-2105-9-163","article-title":"OpenMS - an open-source software framework for mass spectrometry","volume":"9","author":"Sturm","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012711525369800_btu299-B31","doi-asserted-by":"crossref","first-page":"959","DOI":"10.1038\/nmeth.1260","article-title":"Decision tree-driven tandem mass spectrometry for shotgun proteomics","volume":"5","author":"Swaney","year":"2008","journal-title":"Nat. 
Methods"},{"key":"2023012711525369800_btu299-B33","doi-asserted-by":"crossref","first-page":"4415","DOI":"10.1109\/TSP.2007.896255","article-title":"Generalized Daubechies wavelet families","volume":"55","author":"Vonesch","year":"2007","journal-title":"IEEE Trans. Signal Process"},{"key":"2023012711525369800_btu299-B34","doi-asserted-by":"crossref","first-page":"987","DOI":"10.1093\/bioinformatics\/btr051","article-title":"SIMA: simultaneous multiple alignment of LC\/MS peak lists","volume":"27","author":"Voss","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012711525369800_btu299-B36","doi-asserted-by":"crossref","first-page":"388","DOI":"10.2174\/138920209789177638","article-title":"Review of peak detection algorithms in liquid-chromatography-mass spectrometry","volume":"10","author":"Zhang","year":"2009","journal-title":"Curr. Genomics"},{"key":"2023012711525369800_btu299-B37","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1109\/TCBB.2008.17","article-title":"Sparse support vector machines with L-p penalty for biomarker identification","volume":"7","author":"Zhenqiu","year":"2010","journal-title":"IEEE\/ACM Trans. Comput. Biol. 
Bioinform."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/17\/2464\/48927202\/bioinformatics_30_17_2464.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/17\/2464\/48927202\/bioinformatics_30_17_2464.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T12:16:01Z","timestamp":1674821761000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/30\/17\/2464\/2748152"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,5,9]]},"references-count":27,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2014,9,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btu299","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,5,9]]}}}