{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T10:31:13Z","timestamp":1769509873746,"version":"3.49.0"},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,2,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Low sample size n high-dimensional large p data with n\u226ap are commonly encountered in genomics and statistical genetics. Ill-conditioning of the variance-covariance matrix for such data renders the traditional multivariate data analytical approaches unattractive. On the other side, functional data analysis (FDA) approaches are designed for infinite-dimensional data and therefore may have potential for the analysis of large p data. We herein propose a functional embedding (FEM) technique, which exploits the interface between multivariate and functional data, aiming at borrowing strength across the sample through FDA techniques in order to resolve the difficulties caused by the high dimension p.<\/jats:p>\n               <jats:p>Results: Using pairwise dissimilarities among predictor variables, one obtains a univariate configuration of these covariates. This is interpreted as variable ordination that defines the domain of a suitable function space, thus leading to the FEM of the high-dimensional data. The embedding may then be followed by functional logistic regression for the classification of high-dimensional multivariate data as an example for downstream analysis. The resulting functional classification is evaluated on several published gene expression array datasets and a mass spectrometric data, and is shown to compare favorably with various methods that have been employed previously for the classification of these high-dimensional gene expression profiles.<\/jats:p>\n               <jats:p>Availability: The implementation of FEM and Classification via Functional Embedding (CFEM) as described in this article was done with the PACE package written in Matlab. The latest version of PACE is publicly accessible at http:\/\/anson.ucdavis.edu\/\u223cmueller\/data\/programs.html. An example MATLAB script for FEM is available at http:\/\/www.lehigh.edu\/\u223cpsw205\/psw205.html<\/jats:p>\n               <jats:p>Contact: \u00a0psw205@lehigh.edu; mueller@wald.ucdavis.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp711","type":"journal-article","created":{"date-parts":[[2010,1,7]],"date-time":"2010-01-07T02:54:39Z","timestamp":1262832879000},"page":"509-517","source":"Crossref","is-referenced-by-count":17,"title":["Functional embedding for the classification of gene expression profiles"],"prefix":"10.1093","volume":"26","author":[{"given":"Ping-Shi","family":"Wu","sequence":"first","affiliation":[{"name":"1 Department of Mathematics, Lehigh University, Bethlehem, PA 18015 and 2 Department of Statistics, University of California, Davis, One Shield Avenue Davis, CA 95616, USA"}]},{"given":"Hans-Georg","family":"M\u00fcller","sequence":"additional","affiliation":[{"name":"1 Department of Mathematics, Lehigh University, Bethlehem, PA 18015 and 2 Department of Statistics, University of California, Davis, One Shield Avenue Davis, CA 95616, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,1,6]]},"reference":[{"key":"2023012508030158700_B1","volume-title":"Topics in Stochastic Processes.","author":"Ash","year":"1975"},{"key":"2023012508030158700_B2","volume-title":"Exploration and Analysis of DNA Microarray and Protein Array Data.","author":"Amaratunga","year":"2004"},{"key":"2023012508030158700_B3","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1016\/j.ygyno.2007.11.024","article-title":"Gene expression in early stage cervical cancer","volume":"108","author":"Biewenga","year":"2008","journal-title":"Gynecol. Oncol."},{"key":"2023012508030158700_B4","volume-title":"Modern Multidimensional Scaling","author":"Borg","year":"2005","edition":"2"},{"key":"2023012508030158700_B5","doi-asserted-by":"crossref","first-page":"545","DOI":"10.1021\/ac00029a018","article-title":"Optimal minimal neural interpretation of spectra","volume":"64","author":"Borggaard","year":"1992","journal-title":"Anal. Chem."},{"key":"2023012508030158700_B6","first-page":"1001","article-title":"Sparse boosting","volume":"7","author":"B\u00fchlmann","year":"2006","journal-title":"J. Mach. Learn. Res."},{"key":"2023012508030158700_B7","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1111\/j.1467-9868.2007.00605.x","article-title":"Functional clustering and identifying substructures of longitudinal data","volume":"69","author":"Chiou","year":"2007","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023012508030158700_B8","doi-asserted-by":"crossref","first-page":"1684","DOI":"10.1198\/016214508000000814","article-title":"Correlation-based functional clustering via subspace projection","volume":"103","author":"Chiou","year":"2008","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012508030158700_B9","volume-title":"Multidimensional Scaling","author":"Cox","year":"2000","edition":"2"},{"key":"2023012508030158700_B10","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1198\/016214502753479248","article-title":"Comparison of discrimination methods for the classification of tumors using gene expression data","volume":"97","author":"Dudoit","year":"2002","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012508030158700_B11","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1080\/10485250310001624738","article-title":"Principal component estimation of functional logistic regression: discussion of two different approaches","volume":"16","author":"Escabias","year":"2004","journal-title":"J. Nonparam. Stat."},{"key":"2023012508030158700_B12","volume-title":"Local Polynomial Modeling and its Applications.","author":"Fan","year":"1996"},{"key":"2023012508030158700_B13","doi-asserted-by":"crossref","first-page":"906","DOI":"10.1093\/bioinformatics\/16.10.906","article-title":"Support vector machine classification and validation of cancer tissue sampling using microarray expression data","volume":"16","author":"Furey","year":"2000","journal-title":"Bioinformatics"},{"key":"2023012508030158700_B14","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2023012508030158700_B15","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1093\/biomet\/53.3-4.325","article-title":"Some distance properties of latent root and vector methods in multivariate analysis","volume":"53","author":"Gower","year":"1966","journal-title":"Biometrika"},{"key":"2023012508030158700_B16","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1111\/1467-9868.00342","article-title":"Generalized linear models with functional predictors","volume":"64","author":"James","year":"2002","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023012508030158700_B17","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1007\/BF00290638","article-title":"A quantitative genetic model for growth, shape, reaction norms, and other infinite-dimensional characters","volume":"27","author":"Kirkpatrick","year":"1989","journal-title":"J. Math. Biol."},{"key":"2023012508030158700_B18","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1093\/bioinformatics\/bti742","article-title":"Classification using functional data analysis for temporal gene expression data","volume":"22","author":"Leng","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012508030158700_B19","doi-asserted-by":"crossref","first-page":"223","DOI":"10.1111\/j.1467-9469.2005.00429.x","article-title":"Functional modelling and classification of longitudinal data","volume":"32","author":"M\u00fcller","year":"2005","journal-title":"Scand. J. Stat."},{"key":"2023012508030158700_B20","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1186\/1471-2105-9-60","article-title":"Inferring gene expression dynamics via functional regression analysis","volume":"9","author":"M\u00fcller","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012508030158700_B21","doi-asserted-by":"crossref","first-page":"774","DOI":"10.1214\/009053604000001156","article-title":"Generalized functional linear models","volume":"33","author":"M\u00fcller","year":"2005","journal-title":"Ann. Stat."},{"key":"2023012508030158700_B22","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1093\/bioinformatics\/18.1.39","article-title":"Tumor classification by partial least squares using microarray gene expression data","volume":"18","author":"Nguyen","year":"2002","journal-title":"Bioinformatics"},{"key":"2023012508030158700_B23","doi-asserted-by":"crossref","DOI":"10.1007\/b98886","volume-title":"Applied Functional Data Analysis: Methods and Case Studies.","author":"Ramsay","year":"2002"},{"key":"2023012508030158700_B24","doi-asserted-by":"crossref","DOI":"10.1007\/b98888","volume-title":"Functional Data Analysis","author":"Ramsay","year":"2005","edition":"2"},{"key":"2023012508030158700_B25","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1111\/j.0006-341X.2001.00253.x","article-title":"Nonparametric mixed effects models for unequally sampled noisy curves","volume":"57","author":"Rice","year":"2000","journal-title":"Biometrics"},{"key":"2023012508030158700_B26","doi-asserted-by":"crossref","first-page":"151","DOI":"10.2307\/2986151","article-title":"An analysis of paediatric CD4 counts for Acquired Immune Deficiency Syndrome using flexible random curves","volume":"45","author":"Shi","year":"1996","journal-title":"Appl. Stat."},{"key":"2023012508030158700_B27","volume-title":"Density Estimation for Statistics and Data Analysis.","author":"Silverman","year":"1986"},{"key":"2023012508030158700_B28","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1007\/BF02288916","article-title":"Multidimensional scaling: 1. Theory and method","volume":"17","author":"Torgerson","year":"1952","journal-title":"Psychometrika"},{"key":"2023012508030158700_B29","volume-title":"Theory and Methods of Scaling.","author":"Torgerson","year":"1958"},{"key":"2023012508030158700_B30","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/415530a","article-title":"Gene expression profiling predicts clinical outcome of breast cancer","volume":"415","author":"van't Veer","year":"2002","journal-title":"Nature"},{"key":"2023012508030158700_B31","doi-asserted-by":"crossref","first-page":"11462","DOI":"10.1073\/pnas.201162998","article-title":"Predicting the clinical status of human breast cancer by using gene expression profiles","volume":"98","author":"West","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508030158700_B32","doi-asserted-by":"crossref","first-page":"676","DOI":"10.1111\/1541-0420.00078","article-title":"Shrinkage estimation for functional principal component scores, with application to the population kinetics of plasma folate","volume":"59","author":"Yao","year":"2003","journal-title":"Biometrics"},{"key":"2023012508030158700_B33","doi-asserted-by":"crossref","first-page":"2394","DOI":"10.1093\/bioinformatics\/bti319","article-title":"Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data","volume":"21","author":"Yeung","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012508030158700_B34","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","article-title":"Regularization and variable selection via the elastic net","volume":"67","author":"Zou","year":"2005","journal-title":"J. R. Stat. Soc. Ser. B"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/4\/509\/48855601\/bioinformatics_26_4_509.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/4\/509\/48855601\/bioinformatics_26_4_509.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:04:07Z","timestamp":1674633847000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/4\/509\/244694"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,1,6]]},"references-count":34,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2010,2,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp711","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,2,15]]},"published":{"date-parts":[[2010,1,6]]}}}