{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,16]],"date-time":"2025-10-16T20:15:14Z","timestamp":1760645714551,"version":"3.37.0"},"reference-count":19,"publisher":"Oxford University Press (OUP)","issue":"24","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,12,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The need to align spectra to correct for mass-to-charge experimental variation is a problem that arises in mass spectrometry (MS). Most of the MS-based proteomic data analysis methods involve a two-step approach, identify peaks first and then do the alignment and statistical inference on these identified peaks only. However, the peak identification step relies on prior information on the proteins of interest or a peak detection model, which are subject to error. Also numerous additional features such as peak shape and peak width are lost in simple peak detection, and these are informative for correcting mass variation in the alignment step.<\/jats:p><jats:p>Results: Here, we present a novel Bayesian approach to align the complete spectra. The approach is based on a parametric model which assumes that the spectrum and alignment function are Gaussian processes, but the alignment function is monotone. We show how to use the expectation\u2013maximization algorithm to find the posterior mode of the set of alignment functions and the mean spectrum for a patient population. After alignment, we conduct tests while controlling for error attributable to multiple comparisons on the level of the peaks identified from the absolute mean spectra difference of two patient populations.<\/jats:p><jats:p>Contact: \u00a0cavanr@biostat.umn.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp582","type":"journal-article","created":{"date-parts":[[2009,10,10]],"date-time":"2009-10-10T01:23:47Z","timestamp":1255137827000},"page":"3213-3220","source":"Crossref","is-referenced-by-count":11,"title":["A Bayesian approach to the alignment of mass spectra"],"prefix":"10.1093","volume":"25","author":[{"given":"Xiaoxiao","family":"Kong","sequence":"first","affiliation":[{"name":"Division of Biostatistics, School of Public Health, University of Minnesota, A460 Mayo Building (MMC 303), Minneapolis, MN 55455-0378, USA"}]},{"given":"Cavan","family":"Reilly","sequence":"additional","affiliation":[{"name":"Division of Biostatistics, School of Public Health, University of Minnesota, A460 Mayo Building (MMC 303), Minneapolis, MN 55455-0378, USA"}]}],"member":"286","published-online":{"date-parts":[[2009,10,9]]},"reference":[{"key":"2023013112150906900_B1","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. Ser. B (Methodol.)"},{"key":"2023013112150906900_B2","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1016\/j.neuroimage.2008.05.021","article-title":"False discovery rate revisited: FDR and topological inference using Gaussian random fields","volume":"44","author":"Chumbley","year":"2009","journal-title":"Neuroimage"},{"key":"2023013112150906900_B3","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1177\/117693510500100103","article-title":"Understanding the characteristics of mass spectrometry data through the use of simulation","volume":"1","author":"Coombes","year":"2005","journal-title":"Cancer Inform."},{"key":"2023013112150906900_B4","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1016\/j.stamet.2005.09.006","article-title":"Feature selection and machine learning with mass spectrometry data for distinguishing cancer and noncancer samples","volume":"3","author":"Datta","year":"2006","journal-title":"Stat. Methodol."},{"key":"2023013112150906900_B5","doi-asserted-by":"crossref","first-page":"2059","DOI":"10.1093\/bioinformatics\/btl355","article-title":"Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching","volume":"22","author":"Du","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013112150906900_B6","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1021\/ac034800e","article-title":"Parametric time warping","volume":"76","author":"Eilers","year":"2004","journal-title":"Anal. Chem"},{"key":"2023013112150906900_B7","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1016\/j.neuroimage.2006.04.233","article-title":"Cluster based analysis of fMRI data","volume":"33","author":"Heller","year":"2006","journal-title":"Neuroimage"},{"key":"2023013112150906900_B8","article-title":"Nonparametric models for peak identification and quantification in mass spectroscopy, with application to MALDI-TOF","volume-title":"Discussion Paper 2006-24","author":"House","year":"2006"},{"key":"2023013112150906900_B9","article-title":"Multiple alignment of continuous time series","volume-title":"Advances in Neural Information Processing Systems","author":"Listgarten","year":"2005"},{"key":"2023013112150906900_B10","doi-asserted-by":"crossref","first-page":"1764","DOI":"10.1093\/bioinformatics\/bti254","article-title":"Feature extraction and quantification for mass spectrometry in biomedical applications using the mean spectrum","volume":"21","author":"Morris","year":"2005","journal-title":"Bioinformatics"},{"key":"2023013112150906900_B11","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1111\/j.1541-0420.2007.00895.x","article-title":"Bayesian analysis of mass spectrometry proteomic data using wavelet-based functional mixed models","volume":"64","author":"Morris","year":"2007","journal-title":"Biometrics"},{"key":"2023013112150906900_B12","doi-asserted-by":"crossref","first-page":"954","DOI":"10.1111\/j.0006-341X.2004.00251.x","article-title":"Using image and curve registration for measuring the goodness of fit of spatial and temporal predictions","volume":"60","author":"Reilly","year":"2004","journal-title":"Biometrics"},{"key":"2023013112150906900_B13","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1109\/TASSP.1978.1163055","article-title":"Dynamic programming algorithm optimization for spoken word recognition","volume":"26","author":"Sakoe","year":"1978","journal-title":"IEEE Trans. ASSP"},{"key":"2023013112150906900_B14","doi-asserted-by":"crossref","first-page":"3034","DOI":"10.1093\/bioinformatics\/bth357","article-title":"Sample classification from protein mass spectrometry, \u2018by peak probability contrasts\u2019","volume":"20","author":"Tibshirani","year":"2004","journal-title":"Bioinformatics"},{"key":"2023013112150906900_B15","doi-asserted-by":"crossref","first-page":"650","DOI":"10.1002\/pmic.200700791","article-title":"Alignment of LC-MS images, with applications to biomarker discovery and protein identification","volume":"8","author":"Vandenbogaert","year":"2008","journal-title":"Proteomics"},{"key":"2023013112150906900_B16","article-title":"Statistical methods in analyzing mass spectrometry dataset","author":"Wu","year":"2004","journal-title":"Dissertation"},{"key":"2023013112150906900_B17","article-title":"Profiling high-dimensional protein expression using MALDI-TOF mass spectrometry for biomarker discovery","volume-title":"Handbook of Statistics in Clinical Oncology","author":"Yasui","year":"2006","edition":"2"},{"key":"2023013112150906900_B18","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1109\/TCBB.2006.41","article-title":"Multiple peak alignment in sequential data analysis: a scale-space based approach","volume":"3","author":"Yu","year":"2006","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"2023013112150906900_B19","doi-asserted-by":"crossref","first-page":"1001","DOI":"10.1002\/pmic.200500105","article-title":"Analysis of chronic lung transplant rejection by MALDI-TOF profiles of bronchoalveolar lavage fluid","volume":"6","author":"Zhang","year":"2005","journal-title":"Proteomics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/24\/3213\/48998024\/bioinformatics_25_24_3213.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/24\/3213\/48998024\/bioinformatics_25_24_3213.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,12]],"date-time":"2025-02-12T17:34:21Z","timestamp":1739381661000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/24\/3213\/235416"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,10,9]]},"references-count":19,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2009,12,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp582","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2009,12,15]]},"published":{"date-parts":[[2009,10,9]]}}}