{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,19]],"date-time":"2025-10-19T15:39:31Z","timestamp":1760888371007},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"22","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,11,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Common contemporary practice within the nuclear magnetic resonance (NMR) metabolomics community is to evaluate and validate novel algorithms on empirical data or simplified simulated data. Empirical data captures the complex characteristics of experimental data, but the optimal or most correct analysis is unknown a priori; therefore, researchers are forced to rely on indirect performance metrics, which are of limited value. In order to achieve fair and complete analysis of competing techniques more exacting metrics are required. Thus, metabolomics researchers often evaluate their algorithms on simplified simulated data with a known answer. Unfortunately, the conclusions obtained on simulated data are only of value if the data sets are complex enough for results to generalize to true experimental data. Ideally, synthetic data should be indistinguishable from empirical data, yet retain a known best analysis.<\/jats:p>\n               <jats:p>Results: We have developed a technique for creating realistic synthetic metabolomics validation sets based on NMR spectroscopic data. The validation sets are developed by characterizing the salient distributions in sets of empirical spectroscopic data. Using this technique, several validation sets are constructed with a variety of characteristics present in \u2018real\u2019 data. A case study is then presented to compare the relative accuracy of several alignment algorithms using the increased precision afforded by these synthetic data sets.<\/jats:p>\n               <jats:p>Availability: These data sets are available for download at http:\/\/birg.cs.wright.edu\/nmr_synthetic_data_sets.<\/jats:p>\n               <jats:p>Contact: \u00a0travis.doom@wright.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp540","type":"journal-article","created":{"date-parts":[[2009,9,17]],"date-time":"2009-09-17T01:30:31Z","timestamp":1253151031000},"page":"2992-3000","source":"Crossref","is-referenced-by-count":9,"title":["Characterization of 1H NMR spectroscopic data and the generation of synthetic validation sets"],"prefix":"10.1093","volume":"25","author":[{"given":"Paul E.","family":"Anderson","sequence":"first","affiliation":[{"name":"1 Department of Computer Science and Engineering, Dayton, OH 45435, 2Department of Biochemistry and Molecular Biology, Boonshoft School of Medicine, Cox Institute, Dayton, OH 45429 and 3Air Force Research Laboratory, Biosciences and Protection Division, Wright-Patterson AFB, OH 45433, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael L.","family":"Raymer","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, Dayton, OH 45435, 2Department of Biochemistry and Molecular Biology, Boonshoft School of Medicine, Cox Institute, Dayton, OH 45429 and 3Air Force Research Laboratory, Biosciences and Protection Division, Wright-Patterson AFB, OH 45433, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Benjamin J.","family":"Kelly","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, Dayton, OH 45435, 2Department of Biochemistry and Molecular Biology, Boonshoft School of Medicine, Cox Institute, Dayton, OH 45429 and 3Air Force Research Laboratory, Biosciences and Protection Division, Wright-Patterson AFB, OH 45433, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nicholas V.","family":"Reo","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, Dayton, OH 45435, 2Department of Biochemistry and Molecular Biology, Boonshoft School of Medicine, Cox Institute, Dayton, OH 45429 and 3Air Force Research Laboratory, Biosciences and Protection Division, Wright-Patterson AFB, OH 45433, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nicholas J.","family":"DelRaso","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, Dayton, OH 45435, 2Department of Biochemistry and Molecular Biology, Boonshoft School of Medicine, Cox Institute, Dayton, OH 45429 and 3Air Force Research Laboratory, Biosciences and Protection Division, Wright-Patterson AFB, OH 45433, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"T. E.","family":"Doom","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, Dayton, OH 45435, 2Department of Biochemistry and Molecular Biology, Boonshoft School of Medicine, Cox Institute, Dayton, OH 45429 and 3Air Force Research Laboratory, Biosciences and Protection Division, Wright-Patterson AFB, OH 45433, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2009,9,16]]},"reference":[{"key":"2023013112152682100_B1","author":"Anderson","year":"2009","journal-title":"Nuclear magnetic resonance synthetic validation sets."},{"key":"2023013112152682100_B2","doi-asserted-by":"crossref","first-page":"839","DOI":"10.1080\/00498250500297940","article-title":"Chemometric analysis of biofluids following toxicant induced hepatotoxicity: a metabonomic approach to distinguish the effects of 1-naphthylisothiocyanate from its products","volume":"35","author":"Azmi","year":"2005","journal-title":"Xenobiotica: Fate Safety Eval. Foreign Comp. Biol. Syst."},{"key":"2023013112152682100_B3","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/S0003-2670(03)00060-6","article-title":"NMR-based metabonomic toxicity classification: hierarchical cluster analysis and k-nearest-neighbour approaches","volume":"490","author":"Beckonert","year":"2003","journal-title":"Anal. Chim. Acta"},{"key":"2023013112152682100_B4","doi-asserted-by":"crossref","first-page":"1136","DOI":"10.1021\/tx020020+","article-title":"NMR-based metabonomic studies on the biochemical effects of commonly used drug carrier vehicles in the rat","volume":"15","author":"Beckwith-Hall","year":"2002","journal-title":"Chem. Res. Toxicol."},{"key":"2023013112152682100_B5","doi-asserted-by":"crossref","first-page":"593","DOI":"10.1002\/nbm.1372","article-title":"Detecting colorectal cancer by 1H magnetic resonance spectroscopy of fecal extracts","volume":"22","author":"Bezabeh","year":"2009","journal-title":"NMR Biomed."},{"key":"2023013112152682100_B6","doi-asserted-by":"crossref","first-page":"567","DOI":"10.1021\/ac051495j","article-title":"Large-scale human metabolomics studies: a strategy for data (pre-) processing and validation","volume":"78","author":"Bijlsma","year":"2006","journal-title":"Anal. Chem."},{"key":"2023013112152682100_B7","doi-asserted-by":"crossref","first-page":"1439","DOI":"10.1038\/nm1202-802","article-title":"Rapid and noninvasive diagnosis of the presence and severity of coronary heart disease using 1H-NMR-based metabonomics","volume":"8","author":"Brindle","year":"2002","journal-title":"Nature Med."},{"key":"2023013112152682100_B8","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1016\/S0014-5793(02)02854-5","article-title":"Earthworm species of the genus Eisenia can be phenotypically differentiated by metabolic profiling","volume":"521","author":"Bundy","year":"2002","journal-title":"FEBS Lett."},{"key":"2023013112152682100_B9","doi-asserted-by":"crossref","first-page":"517","DOI":"10.1021\/ac048803i","article-title":"Evaluation of the orthogonal projection on latent structure model limitations caused by chemical shift variability and improved visualization of Biomarker changes in 1H NMR spectroscopic metabonomic studies","volume":"77","author":"Cloarec","year":"2005","journal-title":"Anal. Chem."},{"key":"2023013112152682100_B10","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1007\/BF01582221","article-title":"On the convergence of interior-reflective Newton methods for nonlinear minimization subject to bounds","volume":"67","author":"Coleman","year":"1994","journal-title":"Math. Prog."},{"key":"2023013112152682100_B11","doi-asserted-by":"crossref","first-page":"418","DOI":"10.1137\/0806023","article-title":"An interior trust region approach for nonlinear minimization subject to bounds","volume":"6","author":"Coleman","year":"1996","journal-title":"SIAM J. Optim."},{"key":"2023013112152682100_B12","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1016\/j.chemolab.2006.08.014","article-title":"Adaptive binning: an improved binning method for metabolomics data using the undecimated wavelet transform","volume":"85","author":"Davis","year":"2007","journal-title":"Chemometr. Intell. Lab. Syst."},{"key":"2023013112152682100_B13","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1023\/A:1013713905833","article-title":"Metabolomics\u2014the link between genotypes and phenotypes","volume":"48","author":"Fiehn","year":"2002","journal-title":"Plant Mol. Biol."},{"key":"2023013112152682100_B14","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1016\/S0003-2670(03)00570-1","article-title":"Peak alignment of NMR signals by means of a genetic algorithm","volume":"487","author":"Forshed","year":"2003","journal-title":"Anal. Chim. Acta"},{"key":"2023013112152682100_B15","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1016\/j.jpba.2005.01.042","article-title":"A comparison of methods for alignment of NMR peaks in the context of cluster analysis","volume":"38","author":"Forshed","year":"2005","journal-title":"J. Pharm. Biomed. Anal."},{"key":"2023013112152682100_B16","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1137\/0717021","article-title":"Monotone piecewise cubic interpolation","volume":"17","author":"Fritsch","year":"1980","journal-title":"SIAM J. Numer. Anal."},{"key":"2023013112152682100_B17","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1016\/S0014-5793(00)02147-5","article-title":"An NMR-based metabonomic approach to investigate the biochemical consequences of genetic strain differences: application to the C57BL10J and Alpk:ApfCD mouse","volume":"484","author":"Gavaghan","year":"2000","journal-title":"FEBS Lett."},{"key":"2023013112152682100_B18","doi-asserted-by":"crossref","first-page":"949","DOI":"10.1038\/nature06802","article-title":"The search for new cardiovascular biomarkers","volume":"451","author":"Gerszten","year":"2008","journal-title":"Nature"},{"key":"2023013112152682100_B19","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1016\/S1090-7807(03)00038-7","article-title":"A statistical analysis of NMR spectrometer noise","volume":"162","author":"Grage","year":"2003","journal-title":"J. Magn. Reson."},{"key":"2023013112152682100_B20","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1006\/abio.2001.5096","article-title":"Metabolic profiling of genetic disorders: a multitissue 1H nuclear magnetic resonance spectroscopic and pattern recognition study into dystrophic tissue","volume":"293","author":"Griffin","year":"2001","journal-title":"Anal. Biochem."},{"key":"2023013112152682100_B21","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1021\/tx990210t","article-title":"Chemometric models for toxicity classification based on NMR spectra of biofluids","volume":"13","author":"Holmes","year":"2000","journal-title":"Chem. Res. Toxicol."},{"key":"2023013112152682100_B22","doi-asserted-by":"crossref","DOI":"10.1201\/9781420011371","volume-title":"Handbook of Statistical Distributions with Applications.","author":"Krishnamoorthy","year":"2006"},{"key":"2023013112152682100_B23","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1186\/1471-2105-9-375","article-title":"Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements","volume":"9","author":"Lange","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023013112152682100_B24","doi-asserted-by":"crossref","first-page":"3503","DOI":"10.1172\/JCI35111","article-title":"Metabolite profiling of blood from individuals undergoing planned myocardial infarction reveals early markers of myocardial injury","volume":"118","author":"Lewis","year":"2008","journal-title":"J. Clin. Invest."},{"key":"2023013112152682100_B25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0079-6565(00)00036-4","article-title":"Pattern recognition methods and applications in biomedical magnetic resonance","volume":"39","author":"Lindon","year":"2001","journal-title":"Progr. Nucl. Magn. Reson. Spectr."},{"key":"2023013112152682100_B26","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1081\/DCT-120014789","article-title":"NMR-based metabolomics","volume":"25","author":"Reo","year":"2002","journal-title":"Drug Chem. Toxicol."},{"key":"2023013112152682100_B27","doi-asserted-by":"crossref","first-page":"809","DOI":"10.1093\/toxsci\/kfi102","article-title":"Metabonomics in toxicology: a review","volume":"85","author":"Robertson","year":"2005","journal-title":"Toxicol. Sci."},{"key":"2023013112152682100_B28","doi-asserted-by":"crossref","first-page":"35","DOI":"10.2174\/1568026023394498","article-title":"Metabonomic applications in toxicity screening and disease diagnosis","volume":"2","author":"Shockcor","year":"2002","journal-title":"Curr. Topics Med. Chem."},{"key":"2023013112152682100_B29","doi-asserted-by":"crossref","first-page":"72","DOI":"10.2307\/1412159","article-title":"The proof and measurement of association between two things","volume":"15","author":"Spearman","year":"1904","journal-title":"Amer. J. Psychol."},{"key":"2023013112152682100_B30","doi-asserted-by":"crossref","first-page":"730","DOI":"10.1080\/01621459.1974.10480196","article-title":"EDF statistics for goodness of fit and some comparisons","volume":"69","author":"Stevens","year":"1974","journal-title":"J. Amer. Statist. Assoc."},{"key":"2023013112152682100_B31","first-page":"357","article-title":"Asymptotic results for goodness-of-fit statistics with unknown parameters","volume":"4","author":"Stevens","year":"1976","journal-title":"Ann. Statist."},{"key":"2023013112152682100_B32","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1093\/biomet\/64.3.583","article-title":"Goodness of fit for the extreme value distribution","volume":"64","author":"Stevens","year":"1977","journal-title":"Biometrika"},{"key":"2023013112152682100_B33","doi-asserted-by":"crossref","first-page":"591","DOI":"10.1093\/biomet\/66.3.591","article-title":"Tests of fit for the logistic distribution based on the empirical distribution function","volume":"66","author":"Stevens","year":"1979","journal-title":"Biometrika"},{"key":"2023013112152682100_B34","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1006\/jmre.2001.2486","article-title":"NMR spectral quantitation by principal component analysis","volume":"154","author":"Stoyanova","year":"2002","journal-title":"J. Magn. Reson."},{"key":"2023013112152682100_B35","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1016\/S0031-9422(01)00232-1","article-title":"Identification and quantification of catecholamines in potato plants (Solanum tuberosum) by GC-MS","volume":"58","author":"Szopa","year":"2001","journal-title":"Phytochemistry"},{"key":"2023013112152682100_B36","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1002\/cem.824","article-title":"Peak alignment using reduced set mapping","volume":"17","author":"Torgrip","year":"2003","journal-title":"J. Chemometr."},{"key":"2023013112152682100_B37","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1186\/1471-2164-7-142","article-title":"Centering, scaling, and transformations: improving the biological information content of metabolomics data","volume":"7","author":"van den Berg","year":"2006","journal-title":"BMC Genomics"},{"key":"2023013112152682100_B38","doi-asserted-by":"crossref","first-page":"830","DOI":"10.1016\/j.jpba.2005.05.012","article-title":"A study of spectral integration and normalization in NMR-based metabonomic analyses","volume":"39","author":"Webb-Robertson","year":"2005","journal-title":"J. Pharm. Biomed. Anal."},{"key":"2023013112152682100_B39","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1016\/j.jchromb.2004.07.045","article-title":"HPLC-MS-based methods for the study of metabonomics","volume":"817","author":"Wilson","year":"2005","journal-title":"J. Chromatogr. B"},{"key":"2023013112152682100_B40","doi-asserted-by":"crossref","first-page":"2088","DOI":"10.1093\/bioinformatics\/bti300","article-title":"SpecAlign\u2014processing and alignment of mass spectra datasets","volume":"21","author":"Wong","year":"2005","journal-title":"Bioinformatics"},{"key":"2023013112152682100_B41","doi-asserted-by":"crossref","first-page":"5655","DOI":"10.1021\/ac050619p","article-title":"Application of fast Fourier transform cross-correlation for the alignment of large chromatographic and spectral datasets","volume":"77","author":"Wong","year":"2005","journal-title":"Anal. Chem."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/22\/2992\/48997295\/bioinformatics_25_22_2992.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/22\/2992\/48997295\/bioinformatics_25_22_2992.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T21:51:58Z","timestamp":1675201918000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/22\/2992\/180632"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,9,16]]},"references-count":41,"journal-issue":{"issue":"22","published-print":{"date-parts":[[2009,11,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp540","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,11,15]]},"published":{"date-parts":[[2009,9,16]]}}}