{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:54:30Z","timestamp":1760144070134,"version":"build-2065373602"},"reference-count":19,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2024,3,19]],"date-time":"2024-03-19T00:00:00Z","timestamp":1710806400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Center for Research and Development in Mathematics and Applications (CIDMA)","award":["UIDB\/04106\/2020","UIDP\/04106\/2020","UID\/BIM\/04501\/2013","POCI\/01\/0145\/FEDER\/022184)"],"award-info":[{"award-number":["UIDB\/04106\/2020","UIDP\/04106\/2020","UID\/BIM\/04501\/2013","POCI\/01\/0145\/FEDER\/022184)"]}]},{"name":"Institute for Biomedicine (iBiMED) at the University of Aveiro","award":["UIDB\/04106\/2020","UIDP\/04106\/2020","UID\/BIM\/04501\/2013","POCI\/01\/0145\/FEDER\/022184)"],"award-info":[{"award-number":["UIDB\/04106\/2020","UIDP\/04106\/2020","UID\/BIM\/04501\/2013","POCI\/01\/0145\/FEDER\/022184)"]}]},{"name":"GenomePT","award":["UIDB\/04106\/2020","UIDP\/04106\/2020","UID\/BIM\/04501\/2013","POCI\/01\/0145\/FEDER\/022184)"],"award-info":[{"award-number":["UIDB\/04106\/2020","UIDP\/04106\/2020","UID\/BIM\/04501\/2013","POCI\/01\/0145\/FEDER\/022184)"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Applied Sciences"],"abstract":"<jats:p>In this work, we aimed to establish a stable and accurate procedure with which to perform feature selection in datasets with a much higher number of predictors than individuals, as in genome-wide association studies. Due to the instability of feature selection where many potential predictors are measured, a variable selection procedure is proposed that combines several replications of shrinkage regression models. A weighted formulation is used to define the final predictors. The procedure is applied for the investigation of single nucleotide polymorphism (SNP) predictors associated with Alzheimer\u2019s disease in the Alzheimer\u2019s Disease Neuroimaging Initiative (ADNI) dataset. Furthermore, the two following data scenarios are investigated: one that solely considers the set of SNPs, and another with the covariates of age, sex, educational level, and \u03b54 allele of the Apolipoprotein E (APOE4) genotype. The SNP rs2075650 and the APOE4 genotype are provided as risk factors for Alzheimer\u2019s disease, which is in line with the literature, and another four new SNPs are indicated, thus cultivating new hypotheses for in vivo analyses. These experiments demonstrate the potential of the new method for stable feature selection.<\/jats:p>","DOI":"10.3390\/app14062572","type":"journal-article","created":{"date-parts":[[2024,3,19]],"date-time":"2024-03-19T09:39:31Z","timestamp":1710841171000},"page":"2572","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Stable Variable Selection Method with Shrinkage Regression Applied to the Selection of Genetic Variants Associated with Alzheimer\u2019s Disease"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1051-8084","authenticated-orcid":false,"given":"Vera","family":"Afreixo","sequence":"first","affiliation":[{"name":"CIDMA\u2014Center for Research & Development in Mathematics and Applications, Department of Mathematics, University of Aveiro, 3810-193 Aveiro, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4632-3561","authenticated-orcid":false,"given":"Ana Helena","family":"Tavares","sequence":"additional","affiliation":[{"name":"CIDMA\u2014Center for Research & Development in Mathematics and Applications, Agueda School of Technology and Management, 3750-127 Agueda, Portugal"}]},{"given":"Vera","family":"Enes","sequence":"additional","affiliation":[{"name":"Genome Medicine Lab, Department of Medical Sciences, iBiMED\u2014Institute of Biomedicine, University of Aveiro, 3810-193 Aveiro, Portugal"}]},{"given":"Miguel","family":"Pinheiro","sequence":"additional","affiliation":[{"name":"Genome Medicine Lab, Department of Medical Sciences, iBiMED\u2014Institute of Biomedicine, University of Aveiro, 3810-193 Aveiro, Portugal"}]},{"given":"Leonor","family":"Rodrigues","sequence":"additional","affiliation":[{"name":"CIDMA\u2014Center for Research & Development in Mathematics and Applications, Department of Mathematics, University of Aveiro, 3810-193 Aveiro, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2243-6123","authenticated-orcid":false,"given":"Gabriela","family":"Moura","sequence":"additional","affiliation":[{"name":"Genome Medicine Lab, Department of Medical Sciences, iBiMED\u2014Institute of Biomedicine, University of Aveiro, 3810-193 Aveiro, Portugal"}]}],"member":"1968","published-online":{"date-parts":[[2024,3,19]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Ridge, P.G., Mukherjee, S., Crane, P.K., and Kauwe, J.S.K. (2013). Alzheimer\u2019s Disease Genetics Consortium Alzheimer\u2019s Disease: Analyzing the Missing Heritability. PLoS ONE, 8.","DOI":"10.1371\/journal.pone.0079771"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13024-019-0333-5","article-title":"The neuropathological diagnosis of Alzheimer\u2019s disease","volume":"14","author":"DeTure","year":"2019","journal-title":"Mol. Neurodegener."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"416","DOI":"10.1111\/j.1469-1809.2010.00597.x","article-title":"Joint Identification of Multiple Genetic Variants via Elastic-Net Variable Selection in a Genome-Wide Association Analysis","volume":"74","author":"Cho","year":"2010","journal-title":"Ann. Hum. Genet."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"837","DOI":"10.1038\/ejhg.2011.57","article-title":"Gene set analysis of SNP data: Benefits, challenges, and future directions","volume":"9","author":"Fridley","year":"2011","journal-title":"Eur. J. Hum. Genet."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"270","DOI":"10.3389\/fgene.2013.00270","article-title":"Evaluation of the lasso and the elastic net in genome-wide association studies","volume":"4","author":"Waldmann","year":"2013","journal-title":"Front. Genet."},{"key":"ref_6","first-page":"242","article-title":"An efficient gene selection method for high-dimensional microarray data based on sparse logistic regression","volume":"10","author":"Algamal","year":"2017","journal-title":"Electron. J. Appl. Stat. Anal."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Cherlin, S., Howey, R., and Cordell, H. (2018). Using penalized regression to predict phenotype from SNP data. BMC Proc., 12.","DOI":"10.1186\/s12919-018-0149-2"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the LASSO","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J. R. Stat. Soc.\u2014Ser. B (Methodol.)"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"753","DOI":"10.1007\/s11634-018-0334-1","article-title":"A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification","volume":"13","author":"Algamal","year":"2019","journal-title":"Adv. Data Anal. Classif."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1080\/00401706.1970.10488634","article-title":"Ridge regression: Biased estimation for nonorthogonal problems","volume":"12","author":"Hoerl","year":"1970","journal-title":"Technometrics"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"3887","DOI":"10.1093\/bioinformatics\/btx522","article-title":"Genome-wide association studies using a penalized moving-window regression","volume":"33","author":"Bao","year":"2017","journal-title":"Bioinformatics"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","article-title":"Regularization and variable selection via the elastic net","volume":"67","author":"Zou","year":"2005","journal-title":"J. R. Stat. Soc.\u2014Ser. B (Stat. Methodol.)"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Gervasi, O., Murgante, B., Rocha, A.M.A.C., Garau, C., Scorza, F., Karaca, Y., and Torre, C.M. (2023). International Conference on Computational Science and Its Applications, Springer Nature.","DOI":"10.1007\/978-3-031-37126-4"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1564","DOI":"10.1038\/nprot.2010.116","article-title":"Data quality control in genetic case-control association studies","volume":"5","author":"Anderson","year":"2010","journal-title":"Nat. Protoc."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1086\/519795","article-title":"Plink: A tool set for whole-genome association and population-based linkage analyses","volume":"81","author":"Purcell","year":"2007","journal-title":"Am. J. Hum. Genet."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Wellcome Trust Case Control Consortium (2007). Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature, 447, 661\u2013678.","DOI":"10.1038\/nature05911"},{"key":"ref_17","first-page":"1","article-title":"glmulti: An R package for easy automated model selection with (generalized) linear models","volume":"34","author":"Calcagno","year":"2019","journal-title":"J. Stat. Softw."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1016\/j.neulet.2016.05.050","article-title":"The tomm40 gene rs2075650 polymorphism contributes to Alzheimer\u2019s disease in caucasian, and asian populations","volume":"628","author":"Huang","year":"2016","journal-title":"Neurosci. Lett."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1038\/s41398-018-0221-8","article-title":"The genetic risk of Alzheimer\u2019s disease beyond APOE 4: Systematic review of Alzheimer\u2019s genetic risk scores","volume":"8","author":"Stocker","year":"2018","journal-title":"Transl. Psychiatry"}],"container-title":["Applied Sciences"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2076-3417\/14\/6\/2572\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T14:16:04Z","timestamp":1760105764000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2076-3417\/14\/6\/2572"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,19]]},"references-count":19,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2024,3]]}},"alternative-id":["app14062572"],"URL":"https:\/\/doi.org\/10.3390\/app14062572","relation":{},"ISSN":["2076-3417"],"issn-type":[{"type":"electronic","value":"2076-3417"}],"subject":[],"published":{"date-parts":[[2024,3,19]]}}}