{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,1,26]],"date-time":"2023-01-26T05:21:04Z","timestamp":1674710464289},"reference-count":27,"publisher":"Oxford University Press (OUP)","issue":"18","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":1490,"URL":"http:\/\/creativecommons.org\/licenses\/by\/3.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Association tests based on next-generation sequencing data are often under-powered due to the presence of rare variants and large amount of neutral or protective variants. A successful strategy is to aggregate genetic information within meaningful single-nucleotide polymorphism (SNP) sets, e.g. genes or pathways, and test association on SNP sets. Many existing methods for group-wise tests require specific assumptions about the direction of individual SNP effects and\/or perform poorly in the presence of interactions.<\/jats:p>\n               <jats:p>Results: We propose a joint association test strategy based on two key components: a nonlinear supervised dimension reduction approach for effective SNP information aggregation and a novel kernel specially designed for qualitative genotype data. The new test demonstrates superior performance in identifying causal genes over existing methods across a large variety of disease models simulated from sequence data of real genes. In general, the proposed method provides an association test strategy that can (i) detect both rare and common causal variants, (ii) deal with both additive and interaction effect, (iii) handle both quantitative traits and disease dichotomies and (iv) incorporate non-genetic covariates. In addition, the new kernel can potentially boost the power of the entire family of kernel-based methods for genetic data analysis.<\/jats:p>\n               <jats:p>Availability: The method is implemented in MATLAB. Source code is available upon request.<\/jats:p>\n               <jats:p>Contact: \u00a0hongjie.zhu@duke.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts406","type":"journal-article","created":{"date-parts":[[2012,9,7]],"date-time":"2012-09-07T20:35:22Z","timestamp":1347050122000},"page":"i375-i381","source":"Crossref","is-referenced-by-count":3,"title":["Nonlinear dimension reduction with Wright\u2013Fisher kernel for genotype aggregation and association mapping"],"prefix":"10.1093","volume":"28","author":[{"given":"Hongjie","family":"Zhu","sequence":"first","affiliation":[{"name":"1 Department of Psychiatry and Behavior Science, Duke University, Durham, NC 27710"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lexin","family":"Li","sequence":"additional","affiliation":[{"name":"2 Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hua","family":"Zhou","sequence":"additional","affiliation":[{"name":"2 Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2012,9,3]]},"reference":[{"key":"2023012513053819100_B1","doi-asserted-by":"crossref","first-page":"260","DOI":"10.2307\/1426293","article-title":"The latent roots of certain Markov chains arising in genetics: a new approach. I. Haploid models","volume":"6","author":"Cannings","year":"1974","journal-title":"Adv. Appl. Probab."},{"key":"2023012513053819100_B2","doi-asserted-by":"crossref","first-page":"860","DOI":"10.1016\/j.ajhg.2010.04.014","article-title":"Insights into colon cancer etiology via a regularized approach to gene set analysis of gwas data","volume":"86","author":"Chen","year":"2010","journal-title":"Am. J. Hum. Genet."},{"key":"2023012513053819100_B3","doi-asserted-by":"crossref","first-page":"869","DOI":"10.1126\/science.1099870","article-title":"Multiple rare alleles contribute to low plasma levels of HDL cholesterol","volume":"305","author":"Cohen","year":"2004","journal-title":"Science"},{"key":"2023012513053819100_B4","doi-asserted-by":"crossref","DOI":"10.1093\/hmg\/ddn287","article-title":"Genome-wide association studies in cancer","volume":"17","author":"Easton","year":"2008","journal-title":"Hum. Mol. Genet."},{"key":"2023012513053819100_B5","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-21822-9","volume-title":"Mathematical Population Genetics. I, Vol. 27 of Interdisciplinary Applied Mathematics","author":"Ewens","year":"2004","edition":"2nd edn."},{"key":"2023012513053819100_B6","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1038\/nrg2554","article-title":"Human genetic variation and its contribution to complex traits","volume":"10","author":"Frazer","year":"2009","journal-title":"Nat. Rev. Genet."},{"key":"2023012513053819100_B7","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/nature09534","article-title":"A map of human genome variation from population-scale sequencing","volume":"467","author":"1000 Genome Project","year":"2010","journal-title":"Nature"},{"key":"2023012513053819100_B8","doi-asserted-by":"crossref","first-page":"592","DOI":"10.1038\/ng.118","article-title":"Rare independent mutations in renal salt handling genes contribute to blood pressure variation","volume":"40","author":"Ji","year":"2008","journal-title":"Nat. Genet."},{"key":"2023012513053819100_B9","doi-asserted-by":"crossref","first-page":"737","DOI":"10.1214\/08-AAP562","article-title":"Rates of convergence of some multivariate Markov chains with polynomial eigenfunctions","volume":"19","author":"Khare","year":"2009","journal-title":"Ann. Appl. Probab."},{"key":"2023012513053819100_B10","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1016\/j.ajhg.2007.10.010","article-title":"A powerful and flexible multilocus association test for quantitative traits","volume":"82","author":"Kwee","year":"2008","journal-title":"Am. J. Hum. Genet."},{"key":"2023012513053819100_B11","doi-asserted-by":"crossref","DOI":"10.1093\/hmg\/ddn246","article-title":"Autoimmune diseases: insights from genome-wide association studies","volume":"17","author":"Lettre","year":"2008","journal-title":"Hum. Mol. Genet."},{"key":"2023012513053819100_B12","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1016\/j.ajhg.2008.06.024","article-title":"Methods for detecting associations with rare variants for common diseases: Application to analysis of sequence data","volume":"83","author":"Li","year":"2008","journal-title":"AJHG"},{"key":"2023012513053819100_B13","doi-asserted-by":"crossref","first-page":"316","DOI":"10.1080\/01621459.1991.10475035","article-title":"Sliced inverse regression for dimension reduction","volume":"86","author":"Li","year":"1991","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012513053819100_B14","doi-asserted-by":"crossref","first-page":"e1001156","DOI":"10.1371\/journal.pgen.1001156","article-title":"A novel adaptive method for the analysis of next-generation sequencing data to detect complex trait associations with rare variants due to gene main effects and interactions","volume":"6","author":"Liu","year":"2010","journal-title":"PLoS Genet."},{"key":"2023012513053819100_B15","doi-asserted-by":"crossref","first-page":"e1000384","DOI":"10.1371\/journal.pgen.1000384","article-title":"A groupwise association test for rare mutations using a weighted sum statistic","volume":"5","author":"Madsen","year":"2009","journal-title":"PLoS Genet."},{"key":"2023012513053819100_B16","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-68276-1","volume-title":"Inequalities: Theory of Majorization and its Applications","author":"Marshall","year":"2011","edition":"2nd edn."},{"key":"2023012513053819100_B17","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1126\/science.1167728","article-title":"Rare variants of IFIH1, a gene implicated in antiviral responses, protect against type 1 diabetes","volume":"324","author":"Nejentsev","year":"2009","journal-title":"Science"},{"key":"2023012513053819100_B18","doi-asserted-by":"crossref","first-page":"832","DOI":"10.1016\/j.ajhg.2010.04.005","article-title":"Pooled association tests for rare variants in exon-resequencing studies","volume":"86","author":"Price","year":"2010","journal-title":"AJHG"},{"key":"2023012513053819100_B19","doi-asserted-by":"crossref","first-page":"904","DOI":"10.1038\/ng1847","article-title":"Principal components analysis corrects for stratification in genome-wide association studies","volume":"38","author":"Price","year":"2006","journal-title":"Nat. Genet."},{"key":"2023012513053819100_B20","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1159\/000312641","article-title":"Genomic similarity and kernel methods I: advancements by building on mathematical and statistical foundations","volume":"70","author":"Schaid","year":"2010","journal-title":"Hum. Hered."},{"key":"2023012513053819100_B21","doi-asserted-by":"crossref","first-page":"132","DOI":"10.1159\/000312643","article-title":"Genomic similarity and kernel methods II: methods for genomic information","volume":"70","author":"Schaid","year":"2010","journal-title":"Hum. Hered."},{"key":"2023012513053819100_B22","volume-title":"Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond","author":"Scholkopf","year":"2001"},{"key":"2023012513053819100_B23","doi-asserted-by":"crossref","first-page":"792","DOI":"10.1086\/508346","article-title":"Generalized genomic distance-based regression methodology for multilocus association analysis","volume":"79","author":"Wessel","year":"2006","journal-title":"Am. J. Hum. Genet."},{"key":"2023012513053819100_B24","doi-asserted-by":"crossref","first-page":"929","DOI":"10.1016\/j.ajhg.2010.05.002","article-title":"Powerful SNP-set analysis for case-control genome-wide association studies","volume":"86","author":"Wu","year":"2010","journal-title":"Am. J. Hum. Genet."},{"key":"2023012513053819100_B25","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1239\/aap\/1240319585","article-title":"Composition markov chains of multinomial type","volume":"41","author":"Zhou","year":"2009","journal-title":"Adv. Appl. Probab."},{"key":"2023012513053819100_B26","doi-asserted-by":"crossref","first-page":"645","DOI":"10.1198\/jcgs.2010.09014","article-title":"Mm algorithms for some discrete multivariate distributions","volume":"19","author":"Zhou","year":"2010","journal-title":"J. Comput. Graph. Stat."},{"key":"2023012513053819100_B27","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1093\/biostatistics\/kxq081","article-title":"Biological pathway selection through nonlinear dimension reduction","volume":"12","author":"Zhu","year":"2011","journal-title":"Biostatistics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/18\/i375\/48885481\/bioinformatics_28_18_i375.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/18\/i375\/48885481\/bioinformatics_28_18_i375.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T18:56:47Z","timestamp":1674673007000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/18\/i375\/249571"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,9,3]]},"references-count":27,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2012,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts406","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,9,15]]},"published":{"date-parts":[[2012,9,3]]}}}