{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T05:22:30Z","timestamp":1773724950118,"version":"3.50.1"},"reference-count":20,"publisher":"Oxford University Press (OUP)","issue":"19","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: LC-MS allows for the identification and quantification of proteins from biological samples. As with any high-throughput technology, systematic biases are often observed in LC-MS data, making normalization an important preprocessing step. Normalization models need to be flexible enough to capture biases of arbitrary complexity, while avoiding overfitting that would invalidate downstream statistical inference. Careful normalization of MS peak intensities would enable greater accuracy and precision in quantitative comparisons of protein abundance levels.<\/jats:p>\n               <jats:p>Results: We propose an algorithm, called EigenMS, that uses singular value decomposition to capture and remove biases from LC-MS peak intensity measurements. EigenMS is an adaptation of the surrogate variable analysis (SVA) algorithm of Leek and Storey, with the adaptations including (i) the handling of the widespread missing measurements that are typical in LC-MS, and (ii) a novel approach to preventing overfitting that facilitates the incorporation of EigenMS into an existing proteomics analysis pipeline. EigenMS is demonstrated using both large-scale calibration measurements and simulations to perform well relative to existing alternatives.<\/jats:p>\n               <jats:p>Availability: The software has been made available in the open source proteomics platform DAnTE (Polpitiya et al., 2008)) (http:\/\/omics.pnl.gov\/software\/), as well as in standalone software available at SourceForge (http:\/\/sourceforge.net).<\/jats:p>\n               <jats:p>Contact: \u00a0yuliya@stat.tamu.edu<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp426","type":"journal-article","created":{"date-parts":[[2009,7,15]],"date-time":"2009-07-15T02:04:52Z","timestamp":1247623492000},"page":"2573-2580","source":"Crossref","is-referenced-by-count":105,"title":["Normalization of peak intensities in bottom-up MS-based proteomics using singular value decomposition"],"prefix":"10.1093","volume":"25","author":[{"given":"Yuliya V.","family":"Karpievitch","sequence":"first","affiliation":[{"name":"1 Department of Statistics, 3143 TAMU, College Station, TX 77843 and 2Pacific Northwest National Laboratory, P.O. Box 999, Richland, WA 99352, USA"}]},{"given":"Thomas","family":"Taverner","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, 3143 TAMU, College Station, TX 77843 and 2Pacific Northwest National Laboratory, P.O. Box 999, Richland, WA 99352, USA"}]},{"given":"Joshua N.","family":"Adkins","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, 3143 TAMU, College Station, TX 77843 and 2Pacific Northwest National Laboratory, P.O. Box 999, Richland, WA 99352, USA"}]},{"given":"Stephen J.","family":"Callister","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, 3143 TAMU, College Station, TX 77843 and 2Pacific Northwest National Laboratory, P.O. Box 999, Richland, WA 99352, USA"}]},{"given":"Gordon A.","family":"Anderson","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, 3143 TAMU, College Station, TX 77843 and 2Pacific Northwest National Laboratory, P.O. Box 999, Richland, WA 99352, USA"}]},{"given":"Richard D.","family":"Smith","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, 3143 TAMU, College Station, TX 77843 and 2Pacific Northwest National Laboratory, P.O. Box 999, Richland, WA 99352, USA"}]},{"given":"Alan R.","family":"Dabney","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, 3143 TAMU, College Station, TX 77843 and 2Pacific Northwest National Laboratory, P.O. Box 999, Richland, WA 99352, USA"}]}],"member":"286","published-online":{"date-parts":[[2009,7,14]]},"reference":[{"key":"2023013112124580300_B1","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1093\/bioinformatics\/19.2.185","article-title":"A comparison of normalization methods for high density oligonucleotide array data based on variance and bias","volume":"19","author":"Bolstad","year":"2003","journal-title":"Bioinformatics"},{"key":"2023013112124580300_B2","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1021\/pr050300l","article-title":"Normalization approaches for removing systematic biases associated with mass spectrometry and label-free proteomics","volume":"5","author":"Callister","year":"2006","journal-title":"J. Proteome Res."},{"key":"2023013112124580300_B3","doi-asserted-by":"crossref","first-page":"490","DOI":"10.1038\/ng1031","article-title":"Fundamentals of experimental design for cDNA microarrays","volume":"32","author":"Churchill","year":"2002","journal-title":"Nat. Genet."},{"key":"2023013112124580300_B4","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1186\/gb-2006-7-3-401","article-title":"A reanalysis of a published Affymetrix GeneChip control dataset","volume":"7","author":"Dabney","year":"2006","journal-title":"Genome Biol."},{"key":"2023013112124580300_B5","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1093\/biostatistics\/kxj038","article-title":"A new approach to intensity-dependent normalization of two-channel microarrays","volume":"8","author":"Dabney","year":"2007","journal-title":"Biostatistics"},{"key":"2023013112124580300_B6","doi-asserted-by":"crossref","first-page":"R44","DOI":"10.1186\/gb-2007-8-3-r44","article-title":"Normalization of two-channel microarrays accounting for experimental design and intensity-dependent relationships","volume":"8","author":"Dabney","year":"2007","journal-title":"Genome Biol."},{"key":"2023013112124580300_B7","doi-asserted-by":"crossref","first-page":"961","DOI":"10.1021\/ac701649e","article-title":"Label-free comparative analysis of proteomics mixtures using chromatographic alignment of high resolution \u03bcLC-MS data","volume":"80","author":"Finney","year":"2008","journal-title":"Anal. Chem."},{"key":"2023013112124580300_B8","doi-asserted-by":"crossref","first-page":"3091","DOI":"10.1021\/pr070520u","article-title":"A statistical model for iTRAQ data analysis","volume":"7","author":"Hill","year":"2008","journal-title":"J. Proteome Res."},{"key":"2023013112124580300_B9","doi-asserted-by":"crossref","first-page":"7397","DOI":"10.1021\/ac052197p","article-title":"Robust algorithm for alignment of liquid chromatography-mass spectrometry analyses in an accurate mass and time tag data analysis pipeline","volume":"78","author":"Jaitly","year":"2006","journal-title":"Anal. Chem."},{"key":"2023013112124580300_B10","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/btp362","article-title":"A statistical framework for protein quantitation in bottom-up MS-based proteomics","author":"Karpievitch","year":"2009","journal-title":"Bioinformatics"},{"key":"2023013112124580300_B11","doi-asserted-by":"crossref","first-page":"819","DOI":"10.1089\/10665270050514954","article-title":"Analysis of variance for gene expression microarray data","volume":"7","author":"Kerr","year":"2000","journal-title":"J. Comput. Biol."},{"key":"2023013112124580300_B12","doi-asserted-by":"crossref","first-page":"e161","DOI":"10.1371\/journal.pgen.0030161","article-title":"Capturing heterogeneity in gene expression studies by surrogate variable analysis","volume":"3","author":"Leek","year":"2007","journal-title":"PLoS Genet."},{"key":"2023013112124580300_B13","doi-asserted-by":"crossref","first-page":"787","DOI":"10.1038\/nmeth1088","article-title":"Analysis and validation of proteomic data generated by tandem mass spectrometry","volume":"4","author":"Nesvizhskii","year":"2007","journal-title":"Nat. Methods"},{"key":"2023013112124580300_B14","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1021\/ac701863d","article-title":"Elimination of systematic mass measurement errors in liquid chromatography-mass spectrometry based proteomics using regression models and a priori partial knowledge of the sample content","volume":"80","author":"Petyuk","year":"2008","journal-title":"Anal. Chem."},{"key":"2023013112124580300_B15","doi-asserted-by":"crossref","first-page":"1556","DOI":"10.1093\/bioinformatics\/btn217","article-title":"DAnTE: a statistical tool for quantitative analysis of -omics data","volume":"24","author":"Polpitiya","year":"2008","journal-title":"Bioinformatics"},{"key":"2023013112124580300_B16","doi-asserted-by":"crossref","first-page":"496","DOI":"10.1038\/ng1032","article-title":"Microarray normalization and transformation","volume":"32","author":"Quackenbush","year":"2002","journal-title":"Nat. Genet."},{"key":"2023013112124580300_B17","doi-asserted-by":"crossref","first-page":"9440","DOI":"10.1073\/pnas.1530509100","article-title":"Statistical significance for genomewide studies","volume":"100","author":"Storey","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013112124580300_B18","doi-asserted-by":"crossref","first-page":"2540","DOI":"10.1093\/nar\/29.12.2549","article-title":"Issues in cDNA microarray analysis: Quality filtering, channel normalization, models of variations and assessment of gene effects","volume":"29","author":"Tseng","year":"2001","journal-title":"Nucleic Acids Res."},{"key":"2023013112124580300_B19","doi-asserted-by":"crossref","first-page":"e15","DOI":"10.1093\/nar\/30.4.e15","article-title":"Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation","volume":"30","author":"Yang","year":"2002","journal-title":"Nucleic Acids Res."},{"key":"2023013112124580300_B20","doi-asserted-by":"crossref","first-page":"450","DOI":"10.1002\/mas.20071","article-title":"Advances in proteomics data analysis and display using an accurate mass and time tag approach","volume":"23","author":"Zimmer","year":"2006","journal-title":"Mass Spectrom. Rev."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/19\/2573\/48995348\/bioinformatics_25_19_2573.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/19\/2573\/48995348\/bioinformatics_25_19_2573.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T21:38:58Z","timestamp":1675201138000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/19\/2573\/179927"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,7,14]]},"references-count":20,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2009,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp426","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,10,1]]},"published":{"date-parts":[[2009,7,14]]}}}