{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,26]],"date-time":"2025-10-26T14:17:12Z","timestamp":1761488232856,"version":"build-2065373602"},"reference-count":46,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2009,4,3]],"date-time":"2009-04-03T00:00:00Z","timestamp":1238716800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/3.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>A robust and complete workflow for metabolic profiling and data mining was described in detail. Three independent and complementary analytical techniques for metabolic profiling were applied: hydrophilic interaction chromatography (HILIC\u2013LC\u2013ESI\u2013MS), reversed-phase liquid chromatography (RP\u2013LC\u2013ESI\u2013MS), and gas chromatography (GC\u2013TOF\u2013MS) all coupled to mass spectrometry (MS). Unsupervised methods, such as principle component analysis (PCA) and clustering, and supervised methods, such as classification and PCA-DA (discriminatory analysis) were used for data mining. Genetic Algorithms (GA), a multivariate approach, was probed for selection of the smallest subsets of potentially discriminative predictors. From thousands of peaks found in total, small subsets selected by GA were considered as highly potential predictors allowing discrimination among groups. It was found that small groups of potential top predictors selected with PCA-DA and GA are different and unique. Annotated GC\u2013TOF\u2013MS data generated identified feature metabolites. Metabolites putatively detected with LC\u2013ESI\u2013MS profiling require further elemental composition assignment with accurate mass measurement by Fourier-transform ion cyclotron resonance mass spectrometry (FT-ICR-MS) and structure elucidation by nuclear magnetic resonance spectroscopy (NMR). GA was also used to generate correlated networks for pathway analysis. Several case studies, comprising groups of plant samples bearing different genotypes and groups of samples of human origin, namely patients and healthy volunteers\u2019 urine samples, demonstrated that such a workflow combining comprehensive metabolic profiling and advanced data mining techniques provides a powerful approach for pattern recognition and biomarker discovery<\/jats:p>","DOI":"10.3390\/a2020638","type":"journal-article","created":{"date-parts":[[2009,4,3]],"date-time":"2009-04-03T07:24:36Z","timestamp":1238743476000},"page":"638-666","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["Pattern Recognition and Pathway Analysis with Genetic Algorithms in Mass Spectrometry Based Metabolomics"],"prefix":"10.3390","volume":"2","author":[{"given":"Wei","family":"Zou","sequence":"first","affiliation":[{"name":"UC Davis Genome Center, 451 Health Sciences Drive, Davis, CA 95616-8816, U.S.A"}]},{"given":"Vladimir","family":"Tolstikov","sequence":"additional","affiliation":[{"name":"UC Davis Genome Center, 451 Health Sciences Drive, Davis, CA 95616-8816, U.S.A"}]}],"member":"1968","published-online":{"date-parts":[[2009,4,3]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1126\/science.274.5287.533","article-title":"Genomic sequence information should be released immediately and freely in the public domain","volume":"274","author":"Bentley","year":"1996","journal-title":"Science"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"440","DOI":"10.1038\/nature02622","article-title":"Genomes for medicine","volume":"429","author":"Bentley","year":"2004","journal-title":"Nature"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1038\/85776","article-title":"Variation is the spice of life","volume":"27","author":"Kruglyak","year":"2001","journal-title":"Nat. Genet."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"3573","DOI":"10.1021\/ac991142i","article-title":"Identification of uncommon plant metabolites based on calculation of elemental compositions using gas chromatography and quadrupole mass spectrometry","volume":"72","author":"Fiehn","year":"2000","journal-title":"Anal. Chem."},{"key":"ref_5","unstructured":"Tanaka, N., Tolstikov, V., Weckwerth, W., Fiehn, O., and Fukusaki, H. (2003). Frontier of metabolomic research, Springer-Verlag."},{"key":"ref_6","unstructured":"Ikegami, T., Kobayashi, H., Kimura, H., Tolstikov, V., Fiehn, O., and Tanaka, N. (2005). Metabolomics. The Frontier of Systems Biology, Springer-Verlag."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1273","DOI":"10.1021\/ac034925j","article-title":"Simple and comprehensive two-dimensional reversed-phase HPLC using monolithic silica columns","volume":"76","author":"Tanaka","year":"2004","journal-title":"Anal. Chem."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1007\/s00216-003-1889-y","article-title":"Monolithic columns for liquid chromatography","volume":"376","author":"Tanaka","year":"2003","journal-title":"Anal. Bioanal. Chem."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"420A","DOI":"10.1021\/ac012495w","article-title":"Monolithic LC columns","volume":"73","author":"Tanaka","year":"2001","journal-title":"Anal. Chem."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1007\/978-1-59745-244-1_9","article-title":"Application of liquid chromatography-mass spectrometry analysis in metabolomics: reversed-phase monolithic capillary chromatography and hydrophilic chromatography coupled to electrospray ionization-mass spectrometry","volume":"358","author":"Weckwerth","year":"2007","journal-title":"Metabolomics, Methods in Molecular Biology"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"6737","DOI":"10.1021\/ac034716z","article-title":"Monolithic silica-based capillary reversed-phase liquid chromatography\/electrospray mass spectrometry for plant metabolomics","volume":"75","author":"Tolstikov","year":"2003","journal-title":"Anal. Chem."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"844","DOI":"10.1039\/b501767j","article-title":"A rapid screening approach to metabonomics using UPLC and q-TOF mass spectrometry: application to age, gender and diurnal variation in normal\/Zucker obese rats and black, white and nude mice","volume":"130","author":"Plumb","year":"2005","journal-title":"Analyst"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1784","DOI":"10.1002\/jssc.200600199","article-title":"Hydrophilic interaction chromatography","volume":"29","author":"Hemstrom","year":"2006","journal-title":"J. Sep. Sci."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1016\/0021-9673(95)00328-2","article-title":"Three-dimensional mapping of N-linked oligosaccharides using anion-exchange, hydrophobic and hydrophilic interaction modes of high-performance liquid chromatography","volume":"720","author":"Takahashi","year":"1996","journal-title":"J. Chromatogr. A"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1006\/abio.2001.5513","article-title":"Analysis of highly polar compounds of plant origin: combination of hydrophilic interaction chromatography and electrospray ion trap mass spectrometry","volume":"301","author":"Tolstikov","year":"2002","journal-title":"Anal. Biochem."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1021\/ac070997p","article-title":"Electrostatic repulsion hydrophilic interaction chromatography for isocratic separation of charged solutes and selective isolation of phosphopeptides","volume":"80","author":"Alpert","year":"2008","journal-title":"Anal. Chem."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1016\/S0378-4347(00)00210-3","article-title":"Resolution of allelic and non-allelic variants of histone H1 by cation-exchange-hydrophilic-interaction chromatography","volume":"744","author":"Mizzen","year":"2000","journal-title":"J. Chromatogr. B Biomed. Sci. Appl."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1016\/0021-9673(94)00467-6","article-title":"Hydrophilic-interaction chromatography of complex carbohydrates","volume":"676","author":"Alpert","year":"1994","journal-title":"J. Chromatogr. A"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1016\/0378-4347(92)80546-3","article-title":"Use of hydrophilic interaction chromatography for the study of tyrosine protein kinase specificity","volume":"583","author":"Boutin","year":"1992","journal-title":"J. Chromatogr."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1016\/S0021-9673(00)96972-3","article-title":"Hydrophilic-interaction chromatography for the separation of peptides, nucleic acids and other polar compounds","volume":"499","author":"Alpert","year":"1990","journal-title":"J. Chromatogr."},{"key":"ref_21","unstructured":"Salinas, J., and Sanchez-Serrano, J. J. (2006). Arabidopsis Protocols, Humana Press."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1016\/j.ab.2007.01.028","article-title":"A comprehensive urinary metabolomic approach for identifying kidney cancer","volume":"363","author":"Kind","year":"2007","journal-title":"Anal. Biochem."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1093\/bib\/bbl012","article-title":"Metabolomics technology and bioinformatics","volume":"7","author":"Shulaev","year":"2006","journal-title":"Brief. Bioinform."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1109\/34.824819","article-title":"Statistical pattern recognition: a review","volume":"22","author":"Jain","year":"2000","journal-title":"Trans. Pattern An. Mach. Intell."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"2447","DOI":"10.1093\/bioinformatics\/bth270","article-title":"Metabolite fingerprinting: detecting biological features by independent component analysis","volume":"20","author":"Scholz","year":"2004","journal-title":"Bioinformatics"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"846","DOI":"10.1038\/nbt0807-846b","article-title":"The metabolomics standards initiative","volume":"25","author":"Sansone","year":"2007","journal-title":"Nat. Biotechnol."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"919","DOI":"10.1016\/S0031-9422(02)00722-7","article-title":"Metabolic fingerprinting of salt-stressed tomatoes","volume":"62","author":"Johnson","year":"2003","journal-title":"Phytochemistry"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"859","DOI":"10.1016\/S0031-9422(02)00718-5","article-title":"Chemometric discrimination of unfractionated plant extracts analyzed by electrospray mass spectrometry","volume":"62","author":"Goodacre","year":"2003","journal-title":"Phytochemistry"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"2507","DOI":"10.1093\/bioinformatics\/btm344","article-title":"A review of feature selection techniques in bioinformatics","volume":"23","author":"Saeys","year":"2007","journal-title":"Bioinformatics"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"869","DOI":"10.1016\/j.csda.2004.03.017","article-title":"An extensive comparison of recent classification tools applied to microarray data","volume":"48","author":"Lee","year":"2005","journal-title":"Comput. Stat. Data An."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Zhang, X., Lu, X., Shi, Q., Xu, X.-q., Leung, H.-c., Harris, L., Iglehart, J., Miron, A., Liu, J., and Wong, W. (2006). Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data. BMC Bioinformatics, 7.","DOI":"10.1186\/1471-2105-7-197"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1093\/jxb\/eri043","article-title":"Making sense of the metabolome using evolutionary computation: seeing the wood with the trees","volume":"56","author":"Goodacre","year":"2005","journal-title":"J. Exp. Bot."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"1154","DOI":"10.1093\/bioinformatics\/btl074","article-title":"GALGO: an R package for multivariate variable selection using genetic algorithms","volume":"22","author":"Trevino","year":"2006","journal-title":"Bioinformatics"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1312","DOI":"10.1002\/rcm.3507","article-title":"Probing genetic algorithms for feature selection in comprehensive metabolic profiling approach","volume":"22","author":"Zou","year":"2008","journal-title":"Rapid Commun. Mass Spectrom."},{"key":"ref_35","first-page":"169","article-title":"SetupX--a public study design database for metabolomic projects","volume":"12","author":"Scholz","year":"2007","journal-title":"Pac. Symp. Biocomput."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Fiehn, O., Wohlgemuth, G., and Scholz, M. (2005). Setup and Annotation of Metabolomic Experiments by Integrating Biological and Mass Spectrometric Metadata. Data Integration in the Life Sciences: Second International Workshop, 224\u2013239. DILS.","DOI":"10.1007\/11530084_18"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"887","DOI":"10.1016\/S0031-9422(02)00703-3","article-title":"Construction and application of a mass spectral and retention time index database generated from plant GC\/EI-TOF-MS metabolite profiles","volume":"62","author":"Wagner","year":"2003","journal-title":"Phytochemistry Plant Metabolomics"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"779","DOI":"10.1021\/ac051437y","article-title":"XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification","volume":"78","author":"Smith","year":"2006","journal-title":"Anal. Chem."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"4933","DOI":"10.1021\/ac800110w","article-title":"Dimensionality Reduction and Visualization in Principal Component Analysis","volume":"80","author":"Ivosev","year":"2008","journal-title":"J. Anal. Chem."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1016\/j.jchromb.2008.04.044","article-title":"Instrumental and experimental effects in LC-MS-based metabolomics","volume":"871","author":"Burton","year":"2008","journal-title":"J. Chromatogr. B"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Jeffries, N. O. (2004). Performance of a genetic algorithm for mass spectrometry proteomics. BMC Bioinformatics, 5.","DOI":"10.1186\/1471-2105-5-180"},{"key":"ref_42","unstructured":"Shulaev, V. (\u2013, January September). Metabolic Fingerprinting of Breast Cancer Development. Biomarker Discovery Summit, Philadelphia, PA."},{"key":"ref_43","unstructured":"Tolstikov, V. (\u2013, January September). Mass Spectrometry-Derived Metabolic Biomarkers and Signatures in Diagnostic Development. Biomarker Discovery Summit, Philadelphia, PA."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1017\/S0007114507685365","article-title":"Multivariate techniques and their application in nutrition: a metabolomics case study","volume":"98","author":"Kemsley","year":"2007","journal-title":"Br. J. Nutr."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"738","DOI":"10.1002\/jbm.a.10037","article-title":"A biodegradable electrical bioconductor made of polypyrrole nanoparticle\/poly(D,L-lactide) composite: A preliminary in vitro biostability study","volume":"66","author":"Wang","year":"2003","journal-title":"J. Biomed. Mater. Res. A"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"R80","DOI":"10.1186\/gb-2004-5-10-r80","article-title":"Bioconductor: open software development for computational biology and bioinformatics","volume":"5","author":"Gentleman","year":"2004","journal-title":"Genome Biol."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/2\/2\/638\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T22:10:10Z","timestamp":1760220610000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/2\/2\/638"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,4,3]]},"references-count":46,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2009,6]]}},"alternative-id":["a2020638"],"URL":"https:\/\/doi.org\/10.3390\/a2020638","relation":{},"ISSN":["1999-4893"],"issn-type":[{"type":"electronic","value":"1999-4893"}],"subject":[],"published":{"date-parts":[[2009,4,3]]}}}