{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,3,5]],"date-time":"2023-03-05T23:32:34Z","timestamp":1678059154763},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":3015,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: In recent years stable isotopic labeling has become a standard approach for quantitative proteomic analyses. Among the many available isotopic labeling strategies, metabolic labeling is attractive for the excellent internal control it provides. However, analysis of data from metabolic labeling experiments can be complicated because the spacing between labeled and unlabeled forms of each peptide depends on its sequence, and is thus variable from analyte to analyte. As a result, one generally needs to know the sequence of a peptide to identify its matching isotopic distributions in an automated fashion. In some experimental situations it would be necessary or desirable to match pairs of labeled and unlabeled peaks from peptides of unknown sequence. This article addresses this largely overlooked problem in the analysis of quantitative mass spectrometry data by presenting an algorithm that not only identifies isotopic distributions within a mass spectrum, but also annotates matches between natural abundance light isotopic distributions and their metabolically labeled counterparts. This algorithm is designed in two stages: first we annotate the isotopic peaks using a modified version of the IDM algorithm described last year; then we use a probabilistic classifier that is supplemented by dynamic programming to find the metabolically labeled matched isotopic pairs. Such a method is needed for high-throughput quantitative proteomic metabolomic experiments measured via mass spectrometry.<\/jats:p>\n               <jats:p>Results: The primary result of this article is that the dynamic programming approach performs well given perfect isotopic distribution annotations. Our algorithm achieves a true positive rate of 99% and a false positive rate of 1% using perfect isotopic distribution annotations. When the isotopic distributions are annotated given \u2018expert\u2019 selected peaks, the same algorithm gets a true positive rate of 77% and a false positive rate of 1%. Finally, when annotating using machine selected peaks, which may contain noise, the dynamic programming algorithm gives a true positive rate of 36% and a false positive rate of 1%. It is important to mention that these rates arise from the requirement of exact annotations of both the light and heavy isotopic distributions. In our evaluations, a match is considered \u2018entirely incorrect\u2019 if it is missing even one peak or containing an extraneous peak. If we only require that the \u2018monoisotopic\u2019 peaks exist within the two matched distributions, our algorithm obtains a positive rate of 45% and a false positive rate of 1% on the \u2018machine\u2019 selected data. Changes to the algorithm's scoring function and training example generation improves our \u2018monoisotopic\u2019 peak score true positive rate to 65% while obtaining a false positive rate of 2%. All results were obtained within 10-fold cross-validation of 41 mass spectra with a mass-to-charge range of 800\u20134000m\/z. There are a total of 713 isotopic distributions and 255 matched isotopic pairs that are hand-annotated for this study.<\/jats:p>\n               <jats:p>Availability: Programs are available via http:\/\/www.cs.wisc.edu\/~mcilwain\/IDM\/<\/jats:p>\n               <jats:p>Contact: \u00a0mcilwain@cs.wisc.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn190","type":"journal-article","created":{"date-parts":[[2008,6,27]],"date-time":"2008-06-27T07:43:13Z","timestamp":1214552593000},"page":"i339-i347","source":"Crossref","is-referenced-by-count":3,"title":["Matching isotopic distributions from metabolically labeled samples"],"prefix":"10.1093","volume":"24","author":[{"given":"Sean","family":"McIlwain","sequence":"first","affiliation":[{"name":"1 Department of Computer Sciences, 2Department of Computer Sciences and Department of Biostatistics, and 3Department of Biochemistry, University of Wisconsin, Madison, WI, USA"}]},{"given":"David","family":"Page","sequence":"additional","affiliation":[{"name":"1 Department of Computer Sciences, 2Department of Computer Sciences and Department of Biostatistics, and 3Department of Biochemistry, University of Wisconsin, Madison, WI, USA"}]},{"given":"Edward L.","family":"Huttlin","sequence":"additional","affiliation":[{"name":"1 Department of Computer Sciences, 2Department of Computer Sciences and Department of Biostatistics, and 3Department of Biochemistry, University of Wisconsin, Madison, WI, USA"}]},{"given":"Michael R.","family":"Sussman","sequence":"additional","affiliation":[{"name":"1 Department of Computer Sciences, 2Department of Computer Sciences and Department of Biostatistics, and 3Department of Biochemistry, University of Wisconsin, Madison, WI, USA"}]}],"member":"286","published-online":{"date-parts":[[2008,7,1]]},"reference":[{"key":"2023020210395239300_B1","doi-asserted-by":"crossref","first-page":"584","DOI":"10.1002\/pmic.200300396","article-title":"Proteome web: a web-based interface for the display and interrogation of proteomes","volume":"3","author":"Babnigg","year":"2003","journal-title":"Proteomics"},{"key":"2023020210395239300_B2","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1074\/mcp.R400010-MCP200","article-title":"Metabolic labeling of proteins for proteomics","volume":"4","author":"Beynon","year":"2005","journal-title":"Mol. Cell. Proteomics"},{"key":"2023020210395239300_B3","doi-asserted-by":"crossref","first-page":"2437","DOI":"10.1002\/elps.200410336","article-title":"A comparison of the consistency of proteome quantitation using two-dimensional electrophoresis and shotgun isobaric tagging in Escherichia Coli cells","volume":"26","author":"Choe","year":"2005","journal-title":"Electrophoresis"},{"issue":"14","key":"2023020210395239300_B4","doi-asserted-by":"crossref","first-page":"2871","DOI":"10.1021\/ac9810516","article-title":"Role of accurate mass measurement (+\/\u2212 10 ppm) in protein identification strategies employing MS or MS\/MS and database searching","volume":"71","author":"Clauser","year":"1999","journal-title":"Anal. Chem"},{"key":"2023020210395239300_B5","first-page":"116","article-title":"A probablistic learning approach to whole-genome operon prediction","volume-title":"Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology","author":"Craven","year":"2000"},{"key":"2023020210395239300_B6","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1145\/1143844.1143874","article-title":"The relationship between precision-recall and ROC curves","volume-title":"ICML'06: Proceedings of the 23rd International Conference on Machine Learning","author":"Davis","year":"2006"},{"key":"2023020210395239300_B7","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1021\/pr049821j","article-title":"Search for cancer markers from endometrial tissues using differentially labeled tags itraq and cicat with multidimensional liquid chromatography and tandem mass spectrometry","volume":"4","author":"DeSouza","year":"2005","journal-title":"J. Proteome Res"},{"key":"2023020210395239300_B8","doi-asserted-by":"crossref","first-page":"212","DOI":"10.1126\/science.1124619","article-title":"Mass spectrometry and protein analysis","volume":"312","author":"Domon","year":"2006","journal-title":"Science"},{"key":"2023020210395239300_B9","doi-asserted-by":"crossref","first-page":"976","DOI":"10.1016\/1044-0305(94)80016-2","article-title":"An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database","volume":"5","author":"Eng","year":"1994","journal-title":"J. Am. Soc. Mass Spectrom"},{"key":"2023020210395239300_B10","doi-asserted-by":"crossref","first-page":"S23","DOI":"10.1016\/S1471-1931(02)00203-3","article-title":"Advances in quantitative proteomics using stable isotope tags","volume":"20","author":"Flory","year":"2002","journal-title":"Trends Biotechnol"},{"key":"2023020210395239300_B11","doi-asserted-by":"crossref","first-page":"742","DOI":"10.1016\/S1044-0305(03)00133-8","article-title":"Proteomic analysis of pseudomonas aeruginsosa grown under magnesium limitation","volume":"14","author":"Guina","year":"2003","journal-title":"J. Am. Soc. Mass Spectrom"},{"key":"2023020210395239300_B12","doi-asserted-by":"crossref","first-page":"994","DOI":"10.1038\/13690","article-title":"Quantitative analysis of complex protein mixtures using isotope-coded affinity tags","volume":"17","author":"Gygi","year":"1999","journal-title":"Nat. Biotechnol"},{"key":"2023020210395239300_B13","doi-asserted-by":"crossref","first-page":"946","DOI":"10.1038\/nbt1001-946","article-title":"Quantitative profiling of differentiation-induced microsomal proteins using isotope-coded affinity tags and mass spectrometry","volume":"19","author":"Han","year":"2001","journal-title":"Nat. Biotechnol"},{"key":"2023020210395239300_B14","doi-asserted-by":"crossref","first-page":"4947","DOI":"10.1021\/ac050161r","article-title":"Assessing the effects of diurnal variation on the composition of human parotid saliva: quantitative analysis of native peptides using itraq reagents","volume":"77","author":"Hardt","year":"2005","journal-title":"Anal. Chem"},{"key":"2023020210395239300_B15","doi-asserted-by":"crossref","first-page":"6912","DOI":"10.1021\/ac070346t","article-title":"Stable isotope assisted assignment of elemental compositions for metabolomics","volume":"79","author":"Hegeman","year":"2007","journal-title":"Anal. Chem"},{"key":"2023020210395239300_B16","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1016\/S1044-0305(99)00157-9","article-title":"Automated reduction and interpretation of high resolution electrospray mass spectra of large molecules","volume":"11","author":"Horn","year":"2000","journal-title":"J. Am. Soc. Mass Spectrom"},{"key":"2023020210395239300_B17","doi-asserted-by":"crossref","first-page":"860","DOI":"10.1074\/mcp.M600347-MCP200","article-title":"Comparison of full versus partial metabolic labeling for quantitative proteomics analysis in Arabidopsis thaliana","volume":"6","author":"Huttlin","year":"2007","journal-title":"Mol. Cell. Proteomics"},{"key":"2023020210395239300_B18","doi-asserted-by":"crossref","first-page":"927","DOI":"10.1038\/nbt848","article-title":"Metabolic labeling of C. elegans and D. melanogaster for quantitative proteomics","volume":"21","author":"Krijsveld","year":"2003","journal-title":"Nat. Biotechnol"},{"key":"2023020210395239300_B19","doi-asserted-by":"crossref","first-page":"i328","DOI":"10.1093\/bioinformatics\/btm198","article-title":"Using dynamic programming to create isotopic distribution maps from mass spectra","volume":"23","author":"McIlwain","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020210395239300_B20","doi-asserted-by":"crossref","first-page":"1279","DOI":"10.1002\/pmic.200600832","article-title":"Implications of 15N-metabolic labeling for automated peptide identification in Arabidopsis thaliana","volume":"7","author":"Nelson","year":"2007","journal-title":"Proteomics"},{"key":"2023020210395239300_B21","doi-asserted-by":"crossref","first-page":"3551","DOI":"10.1002\/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2","article-title":"Probability-based protein identification by searching sequence data bases using mass spectrometry data","volume":"20","author":"Perkins","year":"1999","journal-title":"Electrophoresis"},{"key":"2023020210395239300_B22","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1002\/1615-9861(200202)2:2<157::AID-PROT157>3.0.CO;2-M","article-title":"Stable isotope labeling in vivo as an aid to protein identification in peptide mass fingerprinting","volume":"2","author":"Pratt","year":"2002","journal-title":"Proteomics"},{"key":"2023020210395239300_B23","doi-asserted-by":"crossref","first-page":"349","DOI":"10.1038\/ng1101","article-title":"The study of macromolecular complexes by quantitative proteomics","volume":"33","author":"Ranish","year":"2003","journal-title":"Nat. Genet"},{"key":"2023020210395239300_B24","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1002\/(SICI)1097-0231(19960115)10:1<54::AID-RCM444>3.0.CO;2-Z","article-title":"Ultrahigh resolution isotope distribution calculations","volume":"10","author":"Rockwood","year":"1996","journal-title":"Rapid Commun. Mass Spectrom"},{"key":"2023020210395239300_B25","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1016\/j.jasms.2005.12.001","article-title":"Efficient calculation of accurate masses of isotopic peaks","volume":"17","author":"Rockwood","year":"2006","journal-title":"J. Am. Soc. Mass Spectrom"},{"key":"2023020210395239300_B26","doi-asserted-by":"crossref","first-page":"1154","DOI":"10.1074\/mcp.M400129-MCP200","article-title":"Multiplexed protein quantitation in saccharomyces cerevisiae using amine-reactive isobaric tagging reagents","volume":"3","author":"Ross","year":"2004","journal-title":"Mol. Cell. Proteomics"},{"key":"2023020210395239300_B27","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1038\/nmeth725","article-title":"Large-scale database searching using tandem mass spectra: looking up the answer in the back of the book","volume":"1","author":"Sadygov","year":"2004","journal-title":"Nat. Methods"},{"key":"2023020210395239300_B28","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1016\/1044-0305(95)00017-8","article-title":"Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions","volume":"6","author":"Senko","year":"1995","journal-title":"J. Am. Soc. Mass Spectrom"},{"key":"2023020210395239300_B29","doi-asserted-by":"crossref","first-page":"5088","DOI":"10.1093\/emboj\/cdf525","article-title":"Quantitative proteomic analysis of myc oncoprotein function","volume":"21","author":"Shiio","year":"2002","journal-title":"EMBO J"},{"key":"2023020210395239300_B30","doi-asserted-by":"crossref","first-page":"696","DOI":"10.1016\/S1044-0305(03)00204-6","article-title":"Quantitative proteomic analysis of chromatin-associated factors","volume":"14","author":"Shiio","year":"2003","journal-title":"J. Am. Soc. Mass Spectrom"},{"key":"2023020210395239300_B31","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1021\/pr0497733","article-title":"Novel approach for peptide quantitation and sequencing based on 15N and 13C metabolic labeling","volume":"4","author":"Snijders","year":"2005","journal-title":"J. Proteome Res"},{"key":"2023020210395239300_B32","doi-asserted-by":"crossref","first-page":"426","DOI":"10.1074\/mcp.D300002-MCP200","article-title":"The application of new software tools to quantitative protein profiling via icat and tandem mass spectrometry: I. statistically annotated data sets for peptide sequences and proteins identified via the application of icat and tandem mass spectrometry to proteins co-purifying with t cell lipid rafts","volume":"2","author":"von Haller","year":"2003","journal-title":"Mol. Cell. Proteomics"},{"key":"2023020210395239300_B33","doi-asserted-by":"crossref","first-page":"428","DOI":"10.1074\/mcp.M300041-MCP200","article-title":"The application of new software tools to quantitative protein profiling via icat and tandem mass spectrometry: Ii. evaluation of tandem mass spectrometry methodologies for large-scale protein analysis and the application of statistical tools for data analysis and interpretation","volume":"2","author":"von Haller","year":"2003","journal-title":"Mol. Cell. Proteomics"},{"key":"2023020210395239300_B34","article-title":"Induction of model trees for predicting continuous classes","volume-title":"In Proceedings of the poster papers of the European Conference of Machine Learning","author":"Wang","year":"1997"},{"key":"2023020210395239300_B35","volume-title":"Data Mining: Practical Machine Learning Tools with Java Implementations","author":"Witten","year":"2000"},{"key":"2023020210395239300_B36","doi-asserted-by":"crossref","first-page":"W701","DOI":"10.1093\/nar\/gkm371","article-title":"ProSight PTM 2.0: improved protein identification and characterization for top down mass spectrometry","volume":"35","author":"Zamdborg","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023020210395239300_B37","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1021\/pr049900v","article-title":"Two-dimensional mass spectra generated from the analysis of 15N-labeled and unlabeled peptides for efficient protein identification and de novo peptide sequencing","volume":"3","author":"Zhong","year":"2004","journal-title":"J. Proteome Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i339\/49052687\/bioinformatics_24_13_i339.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i339\/49052687\/bioinformatics_24_13_i339.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T12:23:11Z","timestamp":1675340591000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/13\/i339\/237064"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,7,1]]},"references-count":37,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2008,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn190","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,7,1]]},"published":{"date-parts":[[2008,7,1]]}}}