{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,15]],"date-time":"2026-03-15T04:46:04Z","timestamp":1773549964643,"version":"3.50.1"},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"20","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":1872,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.5"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,10,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: In the analysis of differential peptide peak intensities (i.e. abundance measures), LC-MS analyses with poor quality peptide abundance data can bias downstream statistical analyses and hence the biological interpretation for an otherwise high-quality dataset. Although considerable effort has been placed on assuring the quality of the peptide identification with respect to spectral processing, to date quality assessment of the subsequent peptide abundance data matrix has been limited to a subjective visual inspection of run-by-run correlation or individual peptide components. Identifying statistical outliers is a critical step in the processing of proteomics data as many of the downstream statistical analyses [e.g. analysis of variance (ANOVA)] rely upon accurate estimates of sample variance, and their results are influenced by extreme values.<\/jats:p>\n               <jats:p>Results: We describe a novel multivariate statistical strategy for the identification of LC-MS runs with extreme peptide abundance distributions. Comparison with current method (run-by-run correlation) demonstrates a significantly better rate of identification of outlier runs by the multivariate strategy. Simulation studies also suggest that this strategy significantly outperforms correlation alone in the identification of statistically extreme liquid chromatography-mass spectrometry (LC-MS) runs.<\/jats:p>\n               <jats:p>Availability: \u00a0https:\/\/www.biopilot.org\/docs\/Software\/RMD.php<\/jats:p>\n               <jats:p>Contact: \u00a0bj@pnl.gov<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary material is available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr479","type":"journal-article","created":{"date-parts":[[2011,8,19]],"date-time":"2011-08-19T00:51:34Z","timestamp":1313715094000},"page":"2866-2872","source":"Crossref","is-referenced-by-count":108,"title":["Improved quality control processing of peptide-centric LC-MS proteomics data"],"prefix":"10.1093","volume":"27","author":[{"given":"Melissa M.","family":"Matzke","sequence":"first","affiliation":[{"name":"1 Pacific Northwest National Laboratory, PO Box 999, Richland, WA 99352 and 2Department of Microbiology and Immunology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA"}]},{"given":"Katrina M.","family":"Waters","sequence":"additional","affiliation":[{"name":"1 Pacific Northwest National Laboratory, PO Box 999, Richland, WA 99352 and 2Department of Microbiology and Immunology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA"}]},{"given":"Thomas O.","family":"Metz","sequence":"additional","affiliation":[{"name":"1 Pacific Northwest National Laboratory, PO Box 999, Richland, WA 99352 and 2Department of Microbiology and Immunology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA"}]},{"given":"Jon M.","family":"Jacobs","sequence":"additional","affiliation":[{"name":"1 Pacific Northwest National Laboratory, PO Box 999, Richland, WA 99352 and 2Department of Microbiology and Immunology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA"}]},{"given":"Amy C.","family":"Sims","sequence":"additional","affiliation":[{"name":"1 Pacific Northwest National Laboratory, PO Box 999, Richland, WA 99352 and 2Department of Microbiology and Immunology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA"}]},{"given":"Ralph S.","family":"Baric","sequence":"additional","affiliation":[{"name":"1 Pacific Northwest National Laboratory, PO Box 999, Richland, WA 99352 and 2Department of Microbiology and Immunology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA"}]},{"given":"Joel G.","family":"Pounds","sequence":"additional","affiliation":[{"name":"1 Pacific Northwest National Laboratory, PO Box 999, Richland, WA 99352 and 2Department of Microbiology and Immunology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA"}]},{"given":"Bobbie-Jo M.","family":"Webb-Robertson","sequence":"additional","affiliation":[{"name":"1 Pacific Northwest National Laboratory, PO Box 999, Richland, WA 99352 and 2Department of Microbiology and Immunology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA"}]}],"member":"286","published-online":{"date-parts":[[2011,8,18]]},"reference":[{"key":"2023012512012643800_B1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1477-5956-4-1","article-title":"Estimating probabilities of peptide database identifications to LC-FTICR-MS observations","volume":"4","author":"Anderson","year":"2006","journal-title":"Proteome Sci."},{"key":"2023012512012643800_B2","volume-title":"Outliers in Statistical Data.","author":"Barnett","year":"1994"},{"key":"2023012512012643800_B3","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1142\/S0219720008003321","article-title":"Design and analysis of quantitative differential proteomics investigations using LC-MS technology","volume":"6","author":"Bukhman","year":"2008","journal-title":"J. Bioinform. Comput. Biol."},{"key":"2023012512012643800_B4","first-page":"355","article-title":"Sequential application of Wilks's multivariate outlier test","volume":"41","author":"Caroni","year":"1992","journal-title":"J. R. Stat. Soc. Ser. C (Appl Stat)"},{"key":"2023012512012643800_B5","doi-asserted-by":"crossref","first-page":"882","DOI":"10.1093\/bioinformatics\/btn012","article-title":"OutlierD: an R package for outlier detection using quantile regression on mass spectrometry data","volume":"24","author":"Cho","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012512012643800_B6","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1016\/j.jmva.2004.08.002","article-title":"High breakdown estimators for prinicpal components: the projection-pursuit approach revisited","volume":"95","author":"Croux","year":"2005","journal-title":"J. Multivariate Anal."},{"key":"2023012512012643800_B7","doi-asserted-by":"crossref","first-page":"1209","DOI":"10.1021\/pr070441i","article-title":"Mixed-effects statistical model for comparative LC-MS proteomics studies","volume":"7","author":"Daly","year":"2008","journal-title":"J. Proteome Res."},{"key":"2023012512012643800_B8","doi-asserted-by":"crossref","first-page":"488","DOI":"10.1214\/aoms\/1177729747","article-title":"Analysis of extreme values","volume":"21","author":"Dixon","year":"1950","journal-title":"Ann. Math. Stat."},{"key":"2023012512012643800_B9","doi-asserted-by":"crossref","first-page":"1694","DOI":"10.1016\/j.csda.2007.05.018","article-title":"Outlier identification in high dimensions","volume":"52","author":"Filzmoser","year":"2008","journal-title":"Comput. Stat. Data Anal."},{"key":"2023012512012643800_B10","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1214\/aoms\/1177729885","article-title":"Sample criteria for testing outlying observations","volume":"21","author":"Grubbs","year":"1950","journal-title":"Ann. Math. Stat."},{"key":"2023012512012643800_B11","doi-asserted-by":"crossref","DOI":"10.1007\/978-94-015-3994-4","volume-title":"Identification of Outliers.","author":"Hawkins","year":"1980"},{"key":"2023012512012643800_B12","volume-title":"Understanding Robust and Exploratory Data Analysis.","author":"Hoaglin","year":"2000"},{"key":"2023012512012643800_B13","doi-asserted-by":"crossref","first-page":"1030","DOI":"10.1016\/j.clinbiochem.2010.04.071","article-title":"A recursive version of Grubbs' test for detecting multiple outliers in environmental and chemical data","volume":"43","author":"Jain","year":"2010","journal-title":"Clin. Biochem."},{"key":"2023012512012643800_B14","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1186\/1471-2105-10-87","article-title":"Decon2LS: An open-source software package for automated processing and visualization of high resolution mass spectrometry data","volume":"10","author":"Jaitly","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012512012643800_B15","doi-asserted-by":"crossref","first-page":"2028","DOI":"10.1093\/bioinformatics\/btp362","article-title":"A statistical framework for protein quantitation in bottom-up MS-based proteomics","volume":"25","author":"Karpievitch","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012512012643800_B16","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1093\/bioinformatics\/btn647","article-title":"arrayQualityMetrics\u2013a bioconductor package for quality assessment of microarray data","volume":"25","author":"Kauffmann","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012512012643800_B17","doi-asserted-by":"crossref","first-page":"1644","DOI":"10.1093\/bioinformatics\/bti103","article-title":"Predicting gene function through systematic analysis and quality assessment of high-throughput data","volume":"21","author":"Kemmeren","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012512012643800_B18","doi-asserted-by":"crossref","first-page":"2305","DOI":"10.1093\/bioinformatics\/btl367","article-title":"arrayQCplot: software for checking the quality of microarray data","volume":"22","author":"Lee","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012512012643800_B19","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1080\/01621459.1985.10478181","article-title":"Projection-pursuit approach to robust dispersion matrices and principal components: primary theory and Monte Carlo","volume":"80","author":"Li","year":"1985","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012512012643800_B20","doi-asserted-by":"crossref","first-page":"6912","DOI":"10.1021\/ac034790h","article-title":"A correlation algorithm for the automated quantitative analysis of shotgun proteomics data","volume":"75","author":"MacCoss","year":"2003","journal-title":"Anal. Chem."},{"key":"2023012512012643800_B21","first-page":"49","article-title":"On the generalized distance in statistics","volume":"12","author":"Mahalanobis","year":"1936","journal-title":"Proc. Indian Natl Sci. Acad."},{"key":"2023012512012643800_B22","doi-asserted-by":"crossref","first-page":"698","DOI":"10.1021\/pr700606w","article-title":"Application of proteomics in the discovery of candidate protein biomarkers in a diabetes autoantibody standardization program sample subset","volume":"7","author":"Metz","year":"2008","journal-title":"J. Proteome Res."},{"key":"2023012512012643800_B23","doi-asserted-by":"crossref","first-page":"2021","DOI":"10.1093\/bioinformatics\/btm281","article-title":"VIPER: an advanced software package to support high-throughput LC-MS peptide identification","volume":"23","author":"Monroe","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012512012643800_B24","doi-asserted-by":"crossref","first-page":"2144","DOI":"10.1021\/pr8010099","article-title":"Statistical design of quantitative mass spectrometry-based proteomic experiments","volume":"8","author":"Oberg","year":"2009","journal-title":"J. Proteome Res."},{"key":"2023012512012643800_B25","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1021\/pr700734f","article-title":"Statistical analysis of relative labeled mass spectrometry data from complex samples using ANOVA","volume":"7","author":"Oberg","year":"2008","journal-title":"J. Proteome Res."},{"key":"2023012512012643800_B26","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1021\/pr050436j","article-title":"Quality control metrics for LC-MS feature detection tools demonstrated on Saccharomyces cerevisiae proteomic profiles","volume":"5","author":"Piening","year":"2006","journal-title":"J. Proteome Res."},{"key":"2023012512012643800_B27","doi-asserted-by":"crossref","first-page":"1047","DOI":"10.1080\/01621459.1996.10476975","article-title":"Identification of outliers in multivariate data","volume":"91","author":"Rocke","year":"1996","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012512012643800_B28","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1093\/bioinformatics\/btp038","article-title":"Papers on normalization, variable selection, classification or clustering of microarray data","volume":"25","author":"Rocke","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012512012643800_B29","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1074\/mcp.M900223-MCP200","article-title":"Performance metrics for liquid chromatography-tandem mass spectrometry systems in proteomics analyses","volume":"9","author":"Rudnick","year":"2010","journal-title":"Mol. Cell Proteomics"},{"key":"2023012512012643800_B30","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1186\/1756-0381-2-4","article-title":"Statistical quality assessment and outlier detection for liquid chromatography-mass spectrometry experiments","volume":"2","author":"Schulz-Trieglaff","year":"2009","journal-title":"BioData Min."},{"key":"2023012512012643800_B31","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1002\/1615-9861(200205)2:5<513::AID-PROT513>3.0.CO;2-W","article-title":"An accurate mass tag strategy for quantitative and high-throughput proteome measurements","volume":"2","author":"Smith","year":"2002","journal-title":"Proteomics"},{"key":"2023012512012643800_B32","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1093\/bib\/bbn004","article-title":"Information quality in proteomics","volume":"9","author":"Stead","year":"2008","journal-title":"Brief. Bioinform."},{"key":"2023012512012643800_B33","doi-asserted-by":"crossref","first-page":"5748","DOI":"10.1021\/pr1005247","article-title":"Combined statistical analysis of peptide intensities and peptide occurrences improves identification of significant peptides from MS-based proteomics data","volume":"9","author":"Webb-Robertson","year":"2010","journal-title":"J. Proteome Res."},{"key":"2023012512012643800_B34","doi-asserted-by":"crossref","first-page":"3683","DOI":"10.1093\/bioinformatics\/bti605","article-title":"Simpleaffy: a BioConductor package for affymetrix quality control and data analysis","volume":"21","author":"Wilson","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012512012643800_B35","doi-asserted-by":"crossref","first-page":"868","DOI":"10.1074\/mcp.M500369-MCP200","article-title":"Quantitative proteomics of the archaeon Methanococcus maripaludis validated by microarray analysis and real time PCR","volume":"5","author":"Xia","year":"2006","journal-title":"Mol. Cell Proteomics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/20\/2866\/48870457\/bioinformatics_27_20_2866.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/20\/2866\/48870457\/bioinformatics_27_20_2866.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T14:10:58Z","timestamp":1674655858000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/20\/2866\/202024"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,8,18]]},"references-count":35,"journal-issue":{"issue":"20","published-print":{"date-parts":[[2011,10,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr479","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2011,10,15]]},"published":{"date-parts":[[2011,8,18]]}}}