{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T04:53:13Z","timestamp":1761540793455},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"13","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: This article presents a method to identify the isotopic distributions within a mass spectrum using a probabilistic classifier supplemented with dynamic programming. Such a system is needed for a variety of purposes, including generating robust and meaningful features from mass spectra to be used in classification.<\/jats:p>\n               <jats:p>Results: The primary result of this article is that the dynamic programming approach significantly improves sensitivity, without harming specificity, of a probabilistic classifier for identifying the isotopic distributions. When annotating isotopic distributions where an expert has performed the initial \u2018peak-picking\u2019 (removal of noise peaks), the dynamic programming approach gives a true positive rate of 96% and a false positive rate of 0.0%, whereas the classifier alone has a true positive rate of only 47% when the false positive rate is 0.0%. When annotating isotopic distributions in machine peak-picked spectra, which may contain many noise peaks, the dynamic programming approach gives a true positive rate of only 22.0%, but it still keeps a low false positive rate of 1.0% and still outperforms the classifier alone. It is important to note that all these rates are when we require exact matches with the distributions in annotated spectra; in our evaluation a distribution is considered \u2018entirely incorrect\u2019 if it is missing even one peak or contains even one extraneous peak. We compared to the THRASH and AID-MS systems using a looser requirement: correctly identifying the distribution that contains the mono-isotopic mass. Under this measure, our dynamic programming approach achieves a true positive rate of 82% and a false positive rate of 1%, which again outperforms the classifier alone. The dynamic programming approach ends up being more conservative than THRASH and AID-MS, yielding both fewer true and false peaks, but the F-score of the dynamic programming approach is significantly better than those of THRASH and AID-MS. All results were obtained with 10-fold cross-validation of 99 sections of mass spectra with a total of 214 hand-annotated isotopic distributions.<\/jats:p>\n               <jats:p>Availability: Programs are available via http:\/\/www.cs.wisc.edu\/~mcilwain\/IDM<\/jats:p>\n               <jats:p>Contact: mcilwain@cs.wisc.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm198","type":"journal-article","created":{"date-parts":[[2007,7,23]],"date-time":"2007-07-23T16:13:46Z","timestamp":1185207226000},"page":"i328-i336","source":"Crossref","is-referenced-by-count":13,"title":["Using dynamic programming to create isotopic distribution maps from mass spectra"],"prefix":"10.1093","volume":"23","author":[{"given":"Sean","family":"McIlwain","sequence":"first","affiliation":[{"name":"1 Department of Computer Sciences, University of Wisconsin, Madison, WI, 2Department of Computer Sciences and Department of Biostatistics, University of Wisconsin, Madison, WI and 3Department of Biochemistry, University of Wisconsin, Madison, WI, USA"}]},{"given":"David","family":"Page","sequence":"additional","affiliation":[{"name":"1 Department of Computer Sciences, University of Wisconsin, Madison, WI, 2Department of Computer Sciences and Department of Biostatistics, University of Wisconsin, Madison, WI and 3Department of Biochemistry, University of Wisconsin, Madison, WI, USA"}]},{"given":"Edward L.","family":"Huttlin","sequence":"additional","affiliation":[{"name":"1 Department of Computer Sciences, University of Wisconsin, Madison, WI, 2Department of Computer Sciences and Department of Biostatistics, University of Wisconsin, Madison, WI and 3Department of Biochemistry, University of Wisconsin, Madison, WI, USA"}]},{"given":"Michael R.","family":"Sussman","sequence":"additional","affiliation":[{"name":"1 Department of Computer Sciences, University of Wisconsin, Madison, WI, 2Department of Computer Sciences and Department of Biostatistics, University of Wisconsin, Madison, WI and 3Department of Biochemistry, University of Wisconsin, Madison, WI, USA"}]}],"member":"286","published-online":{"date-parts":[[2007,7,1]]},"reference":[{"key":"2023062708515038300_B1","doi-asserted-by":"crossref","first-page":"777","DOI":"10.1093\/bioinformatics\/btg484","article-title":"Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments","volume":"20","author":"Baggerly","year":"2004","journal-title":"Bioinformatics"},{"key":"2023062708515038300_B2","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1074\/mcp.R400010-MCP200","article-title":"Metabolic labeling of proteins for proteomics","volume":"4","author":"Beynon","year":"2005","journal-title":"Mol. Cell. Proteomics"},{"key":"2023062708515038300_B3","doi-asserted-by":"crossref","first-page":"5006","DOI":"10.1021\/ac060099d","article-title":"Automated intensity descent algorithm for interpretation of complex high-resolution mass spectra","volume":"78","author":"Chen","year":"2006","journal-title":"Anal. Chem"},{"key":"2023062708515038300_B4","doi-asserted-by":"crossref","first-page":"2871","DOI":"10.1021\/ac9810516","article-title":"Role of accurate mass measurement (+\/\u221210 ppm) in protein identification strategies employing MS or MS\/MS and database searching","volume":"71","author":"Clauser","year":"1999","journal-title":"Anal. Chem"},{"key":"2023062708515038300_B5","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1038\/nbt0305-291","article-title":"Serum proteomics profiling\u2014a young technology begins to mature","volume":"23","author":"Coombes","year":"2005","journal-title":"Nat. Biotechnol"},{"key":"2023062708515038300_B6","first-page":"116","article-title":"A probablistic learning approach to whole-genome operon prediction","author":"Craven","year":"2000"},{"key":"2023062708515038300_B7","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1145\/1143844.1143874","article-title":"The relationship between Precision-Recall and ROC curves","volume-title":"ICML 06: Proceedings of the 23rd International Conference on Machine Learning","author":"Davis","year":"2006"},{"key":"2023062708515038300_B8","doi-asserted-by":"crossref","first-page":"865","DOI":"10.1002\/rcm.1864","article-title":"A new method to analyze matrix-assisted laser desorption\/ionization time-of-flight peptide profiling mass spectra","volume":"19","author":"Dekker","year":"2005","journal-title":"Rapid Commun. in Mass Spectrom"},{"key":"2023062708515038300_B9","doi-asserted-by":"crossref","first-page":"R9.1","DOI":"10.1186\/gb-2004-6-1-r9","article-title":"Integration with the human genome of peptide sequences obtained by highthroughput mass spectrometry","volume":"6","author":"Desiere","year":"2004","journal-title":"Genome Biol"},{"key":"2023062708515038300_B10","doi-asserted-by":"crossref","first-page":"976","DOI":"10.1016\/1044-0305(94)80016-2","article-title":"An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database","volume":"5","author":"Eng","year":"1994","journal-title":"J. Am. Soc. Mass Spectrom"},{"key":"2023062708515038300_B11","doi-asserted-by":"crossref","first-page":"865","DOI":"10.1002\/pmic.200401071","article-title":"Automatic annotation of matrix-assisted laser desorption\/ionization n-glycan spectra","volume":"5","author":"Goldberg","year":"2005","journal-title":"Proteomics"},{"key":"2023062708515038300_B12","doi-asserted-by":"crossref","first-page":"1716","DOI":"10.1002\/pmic.200300523","article-title":"Machine learning approaches to lung cancer prediction from mass spectra","volume":"3","author":"Hilario","year":"2003","journal-title":"Proteomics"},{"key":"2023062708515038300_B13","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1016\/S1044-0305(99)00157-9","article-title":"Automated reduction and interpretation of high resolution electrospray mass spectra of large molecules","volume":"11","author":"Horn","year":"2000","journal-title":"J. Am. Soc. Mass Spectrom"},{"key":"2023062708515038300_B14","doi-asserted-by":"crossref","first-page":"860","DOI":"10.1074\/mcp.M600347-MCP200","article-title":"Comparison of full versus partial metabolic labeling for quantitiative proteomic analysis in arabidopsis thaliana","volume":"6","author":"Huttlin","year":"2007","journal-title":"Mol. Cell Proteomics"},{"key":"2023062708515038300_B15","doi-asserted-by":"crossref","first-page":"5383","DOI":"10.1021\/ac025747h","article-title":"Empirical statistical model to estimate the accuracy of peptide identifications made by ms\/ms and database search","volume":"74","author":"Keller","year":"2002","journal-title":"Anal. Chem"},{"key":"2023062708515038300_B16","doi-asserted-by":"crossref","first-page":"927","DOI":"10.1038\/nbt848","article-title":"Metabolic labeling of C. elegans and D. melanogaster for quantitative proteomics","volume":"21","author":"Krijgsveld","year":"2003","journal-title":"Nat. Biotechol"},{"key":"2023062708515038300_B17","doi-asserted-by":"crossref","first-page":"1296","DOI":"10.1093\/clinchem\/48.8.1296","article-title":"Proteomics and bioinformatics approaches for identification of serum biomarkers to detect breast cancer","volume":"48","author":"Li","year":"2002","journal-title":"Clin. Chem"},{"key":"2023062708515038300_B18","doi-asserted-by":"crossref","first-page":"1279","DOI":"10.1002\/pmic.200600832","article-title":"Implications of 15N metabolic labeling for automated peptide identification using multiple search engine in arabidopsis thaliana","volume":"7","author":"Nelson","year":"2007","journal-title":"Proteomics"},{"key":"2023062708515038300_B19","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1074\/mcp.M200025-MCP200","article-title":"Stable isotope labeling by amino acids in cell culture, silac, as a simple and accurate approach to expression proteomics","volume":"1","author":"Ong","year":"2002","journal-title":"Mol. Cell Proteomics"},{"key":"2023062708515038300_B20","doi-asserted-by":"crossref","first-page":"3551","DOI":"10.1002\/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2","article-title":"Probability-based protein identification by searching sequence data bases using mass spectrometry data","volume":"20","author":"Perkins","year":"1999","journal-title":"Electrophoresis"},{"key":"2023062708515038300_B21","doi-asserted-by":"crossref","first-page":"1835","DOI":"10.1093\/clinchem\/48.10.1835","article-title":"Boosted decision tree analysis of surface-enhanced laser desorption\/ionization mass spectral serum profiles discriminates prostate cancer from noncancer patients","volume":"48","author":"Qu","year":"2002","journal-title":"Clini. Chem"},{"key":"2023062708515038300_B22","doi-asserted-by":"crossref","first-page":"1518","DOI":"10.5858\/2002-126-1518-PATTMD","article-title":"Proteomic approaches to tumor marker discovery","volume":"126","author":"Rai","year":"2002","journal-title":"Arch. Pathol. Lab. Med"},{"key":"2023062708515038300_B23","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1016\/j.jasms.2005.12.001","article-title":"Efficient calculation of accurate masses of isotopic peaks","volume":"17","author":"Rockwood","year":"2006","journal-title":"J. Am. Soc. Mass Spectrom"},{"key":"2023062708515038300_B24","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1002\/(SICI)1097-0231(19960115)10:1<54::AID-RCM444>3.0.CO;2-Z","article-title":"Ultrahigh resolution isotope distribution calculations","volume":"10","author":"Rockwood","year":"1996","journal-title":"Rapid Commun. Mass Spectrom"},{"key":"2023062708515038300_B25","doi-asserted-by":"crossref","first-page":"634","DOI":"10.1002\/hep.20577","article-title":"SELDI-TOF MS profiling of serum for detection of the progression of chronic hepatitis c to hepatocellular carcinoma","volume":"41","author":"Schwegler","year":"2005","journal-title":"Hepatology"},{"key":"2023062708515038300_B26","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/1044-0305(94)00091-D","article-title":"Automated assignment of charge states from resolved isotopic peaks for multiply charged ions","volume":"6","author":"Senko","year":"1995","journal-title":"J. AM. Soc. Mass Spectrom"},{"key":"2023062708515038300_B27","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1016\/1044-0305(95)00017-8","article-title":"Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions","volume":"6","author":"Senko","year":"1995","journal-title":"J. Am. Soc. Mass Spectrom"},{"key":"2023062708515038300_B28","doi-asserted-by":"crossref","first-page":"4806","DOI":"10.1158\/1078-0432.CCR-03-0469","article-title":"The use of plasma surface-enhanced laser desorption\/ionization time-of-flight mass spectrometry proteomic patterns for detection of head and neck squamous cell cancers","volume":"10","author":"Soltys","year":"2004","journal-title":"Clini. Cancer Res"},{"key":"2023062708515038300_B29","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1214\/ss\/1056397488","article-title":"Class predictaion by nearest shrunken centroids, with applications to DNA microarrays","volume":"18","author":"Tibshirani","year":"2003","journal-title":"Stat. Science"},{"key":"2023062708515038300_B34","doi-asserted-by":"crossref","first-page":"3034","DOI":"10.1093\/bioinformatics\/bth357","article-title":"Sample classification from protein mass spectrometry, by \u2018peak probability contrasts\u2019","volume":"20","author":"Tibshirani","year":"2004","journal-title":"Bioinformatics"},{"key":"2023062708515038300_B30","doi-asserted-by":"crossref","first-page":"1507","DOI":"10.1016\/j.phytochem.2004.05.018","article-title":"Subtle modification of isotope ratio proteomics; an integrated strategy for expression proteomics","volume":"65","author":"Whitelegge","year":"2004","journal-title":"Phytochemistry"},{"key":"2023062708515038300_B31","doi-asserted-by":"crossref","first-page":"1636","DOI":"10.1093\/bioinformatics\/btg210","article-title":"Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data","volume":"19","author":"Wu","year":"2003","journal-title":"Bioinformatics"},{"key":"2023062708515038300_B32","doi-asserted-by":"crossref","first-page":"2836","DOI":"10.1021\/ac001404c","article-title":"Proteolytic 18O labeling for comparative proteomics: model studies with two serotypes of adenovirus","volume":"73","author":"Yao","year":"2001","journal-title":"Anal. Chem"},{"key":"2023062708515038300_B33","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1016\/S0378-4347(00)81455-3","article-title":"Capillary column gas chromatographic profile analysis of volatile compounds in sera of normal and virus-infected patients","volume":"163","author":"Zlatkis","year":"1979","journal-title":"J. Chromatogr"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/13\/i328\/50717440\/bioinformatics_23_13_i328.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/13\/i328\/50717440\/bioinformatics_23_13_i328.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,27]],"date-time":"2023-06-27T08:58:05Z","timestamp":1687856285000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/13\/i328\/231409"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,7,1]]},"references-count":34,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2007,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm198","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,7]]},"published":{"date-parts":[[2007,7,1]]}}}