{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T19:58:21Z","timestamp":1760299101509},"reference-count":40,"publisher":"Wiley","license":[{"start":{"date-parts":[[2015,1,1]],"date-time":"2015-01-01T00:00:00Z","timestamp":1420070400000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/3.0\/"}],"funder":[{"name":"Council of Higher Education of Turkey"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computational and Mathematical Methods in Medicine"],"published-print":{"date-parts":[[2015]]},"abstract":"<jats:p>Gene expression data typically are large, complex, and highly noisy. Their dimension is high with several thousand genes (i.e., features) but with only a limited number of observations (i.e., samples). Although the classical principal component analysis (PCA) method is widely used as a first standard step in dimension reduction and in supervised and unsupervised classification, it suffers from several shortcomings in the case of data sets involving undersized samples, since the sample covariance matrix degenerates and becomes singular. In this paper we address these limitations within the context of probabilistic PCA (PPCA) by introducing and developing a new and novel approach using maximum entropy covariance matrix and its hybridized smoothed covariance estimators. To reduce the dimensionality of the data and to choose the number of probabilistic PCs (PPCs) to be retained, we further introduce and develop celebrated Akaike\u2019s information criterion (AIC), consistent Akaike\u2019s information criterion (CAIC), and the information theoretic measure of complexity (ICOMP) criterion of Bozdogan. Six publicly available undersized benchmark data sets were analyzed to show the utility, flexibility, and versatility of our approach with hybridized smoothed covariance matrix estimators, which do not degenerate to perform the PPCA to reduce the dimension and to carry out supervised classification of cancer groups in high dimensions.<\/jats:p>","DOI":"10.1155\/2015\/370640","type":"journal-article","created":{"date-parts":[[2015,3,9]],"date-time":"2015-03-09T21:00:59Z","timestamp":1425934859000},"page":"1-14","source":"Crossref","is-referenced-by-count":22,"title":["A Novel Hybrid Dimension Reduction Technique for Undersized High Dimensional Gene Expression Data Sets Using Information Complexity Criterion for Cancer Classification"],"prefix":"10.1155","volume":"2015","author":[{"given":"Esra","family":"Pamuk\u00e7u","sequence":"first","affiliation":[{"name":"Department of Statistics, Faculty of Science, Firat University, 23119 Elazig, Turkey"}]},{"given":"Hamparsum","family":"Bozdogan","sequence":"additional","affiliation":[{"name":"Department of Business Analytics and Statistics, The University of Tennessee, Knoxville, TN 37996, USA"}]},{"given":"Sinan","family":"\u00c7al\u0131k","sequence":"additional","affiliation":[{"name":"Department of Statistics, Faculty of Science, Firat University, 23119 Elazig, Turkey"}]}],"member":"311","reference":[{"key":"2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/17.9.763"},{"key":"3","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btl190"},{"key":"4","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btn458"},{"issue":"5","key":"5","doi-asserted-by":"crossref","first-page":"606","DOI":"10.1093\/bioinformatics\/btp023","volume":"25","year":"2009","journal-title":"Bioinformatics"},{"key":"6","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btp085"},{"key":"7","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-11-571"},{"key":"8","first-page":"309","volume-title":"Kernel PCA for feature extraction with information complexity","year":"2004"},{"key":"9","doi-asserted-by":"publisher","DOI":"10.13176\/11.117"},{"key":"10","series-title":"Springer Series in Statistics","year":"2002"},{"key":"12","doi-asserted-by":"publisher","DOI":"10.1016\/0096-3003(84)90027-4"},{"key":"14","doi-asserted-by":"publisher","DOI":"10.1111\/1467-9868.00196"},{"key":"15","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4612-1694-0_15"},{"key":"16","doi-asserted-by":"publisher","DOI":"10.1007\/bf02294361"},{"key":"17","doi-asserted-by":"publisher","DOI":"10.1080\/03610929008830199"},{"key":"27","year":"2015"},{"key":"28","year":"1961"},{"key":"29","volume-title":"The linear combination of the simplest discriminator and Fisher's one","year":"1983"},{"issue":"2","key":"25","first-page":"211","volume":"5","year":"2012","journal-title":"European Journal of Pure and Applied Mathematics"},{"key":"32","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1176345010"},{"key":"33","first-page":"629","volume-title":"Singular moment matrices in applied econometrics","year":"1980"},{"key":"34","year":"1984"},{"key":"35","doi-asserted-by":"publisher","DOI":"10.1016\/S0927-5398(03)00007-0"},{"key":"36","doi-asserted-by":"publisher","DOI":"10.1016\/S0047-259X(03)00096-4"},{"key":"23","series-title":"Unpublished Lecture Notes","year":"2010"},{"key":"22","first-page":"15","volume-title":"Intelligent statistical data mining with information complexity and genetic algorithms","year":"2004"},{"key":"40","year":"1971"},{"key":"18","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-50974-2_5"},{"key":"19","first-page":"69","volume-title":"Mixture-model cluster analysis using a new informational complexity and model selection criteria","year":"1994"},{"key":"20","doi-asserted-by":"publisher","DOI":"10.1016\/s0167-9473(98)00025-5"},{"key":"21","doi-asserted-by":"publisher","DOI":"10.1006\/jmps.1999.1277"},{"issue":"2","key":"24","first-page":"370","volume":"39","year":"2010","journal-title":"Istanbul University Journal of the School of Business Administration"},{"key":"26"},{"issue":"1","key":"41","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","volume":"39","year":"1977","journal-title":"Journal of the Royal Statistical Society Series B: Methodological"},{"key":"48","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bth447"},{"key":"42","doi-asserted-by":"publisher","DOI":"10.1126\/science.286.5439.531"},{"key":"43","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.96.12.6745"},{"key":"44","doi-asserted-by":"publisher","DOI":"10.1016\/S1535-6108(02)00030-2"},{"key":"45","doi-asserted-by":"publisher","DOI":"10.1038\/35000501"},{"key":"46","doi-asserted-by":"publisher","DOI":"10.1038\/89044"},{"key":"47","doi-asserted-by":"publisher","DOI":"10.1038\/415436a"}],"container-title":["Computational and Mathematical Methods in Medicine"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/cmmm\/2015\/370640.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/cmmm\/2015\/370640.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/cmmm\/2015\/370640.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,7]],"date-time":"2024-06-07T18:20:40Z","timestamp":1717784440000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.hindawi.com\/journals\/cmmm\/2015\/370640\/"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015]]},"references-count":40,"alternative-id":["370640","370640"],"URL":"https:\/\/doi.org\/10.1155\/2015\/370640","relation":{},"ISSN":["1748-670X","1748-6718"],"issn-type":[{"value":"1748-670X","type":"print"},{"value":"1748-6718","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015]]}}}