{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,12]],"date-time":"2024-06-12T00:04:48Z","timestamp":1718150688494},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,2,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: In searching for genetic variants for complex diseases with deep sequencing data, genomic marker sets of high-dimensional genotypic data and sparse functional variants are quite common. Existing sequence association tests are incapable of identifying such marker sets or individual causal loci, although they appeared powerful to identify small marker sets with dense functional variants. In sequence association studies of admixed individuals, cryptic relatedness and population structure are known to confound the association analyses.<\/jats:p><jats:p>Method: We here propose a unified marker wise test (uFineMap) to accurately localize causal loci and a unified high-dimensional set based test (uHDSet) to identify high-dimensional sparse associations in deep sequencing genomic data of multi-ethnic individuals with random relatedness. These two novel tests are based on scaled sparse linear mixed regressions with Lp (0\u2009&amp;lt;\u2009p\u2009&amp;lt;\u20091) norm regularization. They jointly adjust for cryptic relatedness, population structure and other confounders to prevent false discoveries and improve statistical power for identifying promising individual markers and marker sets that harbor functional genetic variants of a complex trait.<\/jats:p><jats:p>Results: With large scale simulation data and real data analyses, the proposed tests appropriately controlled Type I error rates and appeared to be more powerful than several prominent methods. We illustrated their practical utilities by the applications to DNA sequence data of Framingham Heart Study for osteoporosis. The proposed tests identified 11 novel significant genes that were missed by the prominent famSKAT and GEMMA. In particular, four out of six most significant pathways identified by the uHDSet but missed by famSKAT have been reported to be related to BMD or osteoporosis in the literature.<\/jats:p><jats:p>Availability and implementation: The computational toolkit is available for academic use: https:\/\/sites.google.com\/site\/shaolongscode\/home\/uhdset<\/jats:p><jats:p>Contact: \u00a0wyp@tulane.edu<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btv586","type":"journal-article","created":{"date-parts":[[2015,10,13]],"date-time":"2015-10-13T19:11:01Z","timestamp":1444763461000},"page":"330-337","source":"Crossref","is-referenced-by-count":5,"title":["Unified tests for fine-scale mapping and identifying sparse high-dimensional sequence associations"],"prefix":"10.1093","volume":"32","author":[{"given":"Shaolong","family":"Cao","sequence":"first","affiliation":[{"name":"1 Department of Biomedical Engineering,"},{"name":"2 Center for Bioinformatics and Genomics,"}]},{"given":"Huaizhen","family":"Qin","sequence":"additional","affiliation":[{"name":"2 Center for Bioinformatics and Genomics,"},{"name":"3 Department of Biostatistics and Bioinformatics and"}]},{"given":"Alexej","family":"Gossmann","sequence":"additional","affiliation":[{"name":"2 Center for Bioinformatics and Genomics,"},{"name":"4 Department of Mathematics, Tulane University, New Orleans, LA, USA"}]},{"given":"Hong-Wen","family":"Deng","sequence":"additional","affiliation":[{"name":"2 Center for Bioinformatics and Genomics,"},{"name":"3 Department of Biostatistics and Bioinformatics and"}]},{"given":"Yu-Ping","family":"Wang","sequence":"additional","affiliation":[{"name":"1 Department of Biomedical Engineering,"},{"name":"2 Center for Bioinformatics and Genomics,"},{"name":"3 Department of Biostatistics and Bioinformatics and"}]}],"member":"286","published-online":{"date-parts":[[2015,10,12]]},"reference":[{"key":"2023020110305486600_btv586-B1","first-page":"535","article-title":"Robust variance-components approach for assessing genetic linkage in pedigrees","volume":"54","author":"Amos","year":"1994","journal-title":"Am. J. Hum. Genet."},{"key":"2023020110305486600_btv586-B2","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. Ser. B Methodol."},{"key":"2023020110305486600_btv586-B3","doi-asserted-by":"crossref","first-page":"60","DOI":"10.3102\/10769986025001060","article-title":"On the adaptive control of the false discovery rate in multiple testing with independent statistics","volume":"25","author":"Benjamini","year":"2000","journal-title":"J. Educ. Behav. Stat."},{"key":"2023020110305486600_btv586-B4","doi-asserted-by":"crossref","first-page":"1212","DOI":"10.3150\/12-BEJSP11","article-title":"Statistical significance in high-dimensional linear models","volume":"19","author":"B\u00fchlmann","year":"2013","journal-title":"Bernoulli"},{"key":"2023020110305486600_btv586-B5","doi-asserted-by":"crossref","first-page":"671","DOI":"10.1002\/gepi.21849","article-title":"A unified sparse representation for sequence variant identification for complex traits","volume":"38","author":"Cao","year":"2014","journal-title":"Genet. Epidemiol."},{"key":"2023020110305486600_btv586-B6","doi-asserted-by":"crossref","first-page":"196","DOI":"10.1002\/gepi.21703","article-title":"Sequence kernel association test for quantitative traits in family samples","volume":"37","author":"Chen","year":"2013","journal-title":"Genet. Epidemiol."},{"key":"2023020110305486600_btv586-B7","doi-asserted-by":"crossref","first-page":"2832","DOI":"10.1137\/090761471","article-title":"Lower bound theory of nonzero entries in solutions of l_2-lp minimization","volume":"32","author":"Chen","year":"2010","journal-title":"SIAM J. Sci. Comput."},{"key":"2023020110305486600_btv586-B8","doi-asserted-by":"crossref","first-page":"997","DOI":"10.1111\/j.0006-341X.1999.00997.x","article-title":"Genomic control for association studies","volume":"55","author":"Devlin","year":"1999","journal-title":"Biometrics"},{"key":"2023020110305486600_btv586-B9","doi-asserted-by":"crossref","first-page":"250","DOI":"10.3835\/plantgenome2011.08.0024","article-title":"Ridge regression and other kernels for genomic selection with R package rrBLUP","volume":"4","author":"Endelman","year":"2011","journal-title":"Plant Genome"},{"key":"2023020110305486600_btv586-B10","first-page":"2869","article-title":"Confidence intervals and hypothesis testing for high-dimensional regression","volume":"15","author":"Javanmard","year":"2014","journal-title":"J. Mach. Learn. Res."},{"key":"2023020110305486600_btv586-B11","doi-asserted-by":"crossref","first-page":"15474","DOI":"10.3390\/molecules181215474","article-title":"Osteogenic activity of collagen peptide via ERK\/MAPK pathway mediated boosting of collagen synthesis and its therapeutic efficacy in osteoporotic bone by back-scattered electron imaging and microarchitecture analysis","volume":"18","author":"Kim","year":"2013","journal-title":"Molecules"},{"key":"2023020110305486600_btv586-B12","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1016\/j.molmed.2010.01.003","article-title":"Chemokines and chemokine receptors: new insights into cancer-related inflammation","volume":"16","author":"Lazennec","year":"2010","journal-title":"Trends Mol. Med."},{"key":"2023020110305486600_btv586-B13","doi-asserted-by":"crossref","first-page":"1227","DOI":"10.1359\/jbmr.080325","article-title":"Berberine promotes osteoblast differentiation by Runx2 activation with p38 MAPK","volume":"23","author":"Lee","year":"2008","journal-title":"J. Bone Miner. Res."},{"key":"2023020110305486600_btv586-B14","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1111\/j.1467-9868.2010.00740.x","article-title":"Stability selection","volume":"72","author":"Meinshausen","year":"2010","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol."},{"key":"2023020110305486600_btv586-B15","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1038\/nrg2813","article-title":"New approaches to population stratification in genome-wide association studies","volume":"11","author":"Price","year":"2010","journal-title":"Nat. Rev. Genet."},{"key":"2023020110305486600_btv586-B16","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1093\/bioinformatics\/bts669","article-title":"A Lasso multi-marker mixed model for association mapping with population structure correction","volume":"29","author":"Rakitsch","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020110305486600_btv586-B17","doi-asserted-by":"crossref","first-page":"879","DOI":"10.1093\/biomet\/ass043","article-title":"Scaled sparse linear regression","volume":"99","author":"Sun","year":"2012","journal-title":"Biometrika"},{"key":"2023020110305486600_btv586-B18","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J. R. Stat. Soc. Ser. B Methodol."},{"key":"2023020110305486600_btv586-B19","doi-asserted-by":"crossref","first-page":"1166","DOI":"10.1214\/14-AOS1221","article-title":"On asymptotically optimal confidence regions and tests for high-dimensional models","volume":"42","author":"van de Geer","year":"2014","journal-title":"The Annals of Statistics"},{"key":"2023020110305486600_btv586-B20","doi-asserted-by":"crossref","first-page":"e32","DOI":"10.1371\/journal.pgen.0010032","article-title":"Confounding from cryptic relatedness in case-control association studies","volume":"1","author":"Voight","year":"2005","journal-title":"PLoS Genet."},{"key":"2023020110305486600_btv586-B21","doi-asserted-by":"crossref","first-page":"510","DOI":"10.1097\/01.med.0000436249.84273.7b","article-title":"Glucocorticoid-related bone changes from endogenous or exogenous glucocorticoids","volume":"20","author":"Warriner","year":"2013","journal-title":"Curr. Opin. Endocrinol. Diabetes Obes."},{"key":"2023020110305486600_btv586-B22","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1016\/j.ajhg.2011.05.029","article-title":"Rare-variant association testing for sequencing data with the sequence kernel association test","volume":"89","author":"Wu","year":"2011","journal-title":"Am. J. Hum. Genet."},{"key":"2023020110305486600_btv586-B23","first-page":"1225","article-title":"Representative of L1\/2 regularization among Lq (0\u2009&lt;\u2009q\u2009\u2264\u20091) regularizations: an experimental study based on phase diagram","volume":"38","author":"XU","year":"2012","journal-title":"Acta Automatica Sinica"},{"key":"2023020110305486600_btv586-B24","doi-asserted-by":"crossref","first-page":"1876","DOI":"10.1093\/bioinformatics\/btu143","article-title":"FISH: fast and accurate diploid genotype imputation via segmental hidden Markov model","volume":"30","author":"Zhang","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020110305486600_btv586-B25","doi-asserted-by":"crossref","first-page":"821","DOI":"10.1038\/ng.2310","article-title":"Genome-wide efficient mixed-model analysis for association studies","volume":"44","author":"Zhou","year":"2012","journal-title":"Nat. Genet."},{"key":"2023020110305486600_btv586-B26","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","article-title":"Regularization and variable selection via the elastic net","volume":"67","author":"Zou","year":"2005","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/3\/330\/49016838\/bioinformatics_32_3_330.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/3\/330\/49016838\/bioinformatics_32_3_330.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,11]],"date-time":"2024-06-11T19:31:27Z","timestamp":1718134287000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/3\/330\/1743896"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,10,12]]},"references-count":26,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2016,2,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btv586","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2016,2,1]]},"published":{"date-parts":[[2015,10,12]]}}}