{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,9]],"date-time":"2026-04-09T05:43:24Z","timestamp":1775713404983,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1010172","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,6,14]],"date-time":"2022-06-14T00:00:00Z","timestamp":1655164800000}}],"reference-count":28,"publisher":"Public Library of Science (PLoS)","issue":"6","license":[{"start":{"date-parts":[[2022,6,2]],"date-time":"2022-06-02T00:00:00Z","timestamp":1654128000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"a budget project of the Institute of Cytology and Genetics","award":["0259-2021-0009 \/ AAAA-A17-117092070032-4 and FWNR-2022-0020"],"award-info":[{"award-number":["0259-2021-0009 \/ AAAA-A17-117092070032-4 and FWNR-2022-0020"]}]},{"DOI":"10.13039\/501100002261","name":"\u0420\u043e\u0441\u0441\u0438\u0439\u0441\u043a\u0438\u0439 \u0424\u043e\u043d\u0434 \u0424\u0443\u043d\u0434\u0430\u043c\u0435\u043d\u0442\u0430\u043b\u044c\u043d\u044b\u0445 \u0418\u0441\u0441\u043b\u0435\u0434\u043e\u0432\u0430\u043d\u0438\u0439","doi-asserted-by":"crossref","award":["20-04-00464"],"award-info":[{"award-number":["20-04-00464"]}],"id":[{"id":"10.13039\/501100002261","id-type":"DOI","asserted-by":"crossref"}]},{"name":"5-100 Best Universities"}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Gene-based association analysis is an effective gene-mapping tool. Many gene-based methods have been proposed recently. However, their power depends on the underlying genetic architecture, which is rarely known in complex traits, and so it is likely that a combination of such methods could serve as a universal approach. 
Several frameworks combining different gene-based methods have been developed. However, they all imply a fixed set of methods, weights and functional annotations. Moreover, most of them use individual phenotypes and genotypes as input data. Here, we introduce sumSTAAR, a framework for gene-based association analysis using summary statistics obtained from genome-wide association studies (GWAS). It is an extended and modified version of the STAAR framework proposed by Li and colleagues in 2020. The sumSTAAR framework offers a wider range of gene-based methods to combine. It allows the user to arbitrarily define a set of these methods, weighting functions and probabilities of genetic variants being causal. The methods used in the framework were adapted to analyse genes with a large number of SNPs to decrease the running time. The framework includes the polygene pruning procedure to guard against the influence of the strong GWAS signals outside the gene. We also present new improved matrices of correlations between the genotypes of variants within genes. 
These matrices estimated on a sample of 265,000 individuals are a state-of-the-art replacement of widely used matrices based on the 1000 Genomes Project data.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1010172","type":"journal-article","created":{"date-parts":[[2022,6,2]],"date-time":"2022-06-02T13:44:21Z","timestamp":1654177461000},"page":"e1010172","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":10,"title":["sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics"],"prefix":"10.1371","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3363-0434","authenticated-orcid":true,"given":"Nadezhda M.","family":"Belonogova","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9478-267X","authenticated-orcid":true,"given":"Gulnara R.","family":"Svishcheva","sequence":"additional","affiliation":[]},{"given":"Anatoly V.","family":"Kirichenko","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2726-0350","authenticated-orcid":true,"given":"Irina V.","family":"Zorkoltseva","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4931-6052","authenticated-orcid":true,"given":"Yakov A.","family":"Tsepilov","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4318-4458","authenticated-orcid":true,"given":"Tatiana I.","family":"Axenovich","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,6,2]]},"reference":[{"issue":"3","key":"pcbi.1010172.ref001","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1016\/j.ajhg.2008.06.024","article-title":"Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data","volume":"83","author":"B Li","year":"2008","journal-title":"Am J Hum 
Genet"},{"issue":"6","key":"pcbi.1010172.ref002","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1038\/nrg2809","article-title":"Missing heritability and strategies for finding the underlying causes of complex disease","volume":"11","author":"EE Eichler","year":"2010","journal-title":"Nature reviews Genetics"},{"issue":"2","key":"pcbi.1010172.ref003","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1016\/j.ajhg.2012.06.007","article-title":"Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies","volume":"91","author":"S Lee","year":"2012","journal-title":"The American Journal of Human Genetics"},{"issue":"3","key":"pcbi.1010172.ref004","doi-asserted-by":"crossref","first-page":"410","DOI":"10.1016\/j.ajhg.2019.01.002","article-title":"ACAT: A Fast and Powerful p Value Combination Method for Rare-Variant Analysis in Sequencing Studies","volume":"104","author":"Y Liu","year":"2019","journal-title":"Am J Hum Genet"},{"issue":"12","key":"pcbi.1010172.ref005","doi-asserted-by":"crossref","first-page":"e1009060","DOI":"10.1371\/journal.pgen.1009060","article-title":"Integrating comprehensive functional annotations to boost power and accuracy in gene-based association analysis","volume":"16","author":"C Quick","year":"2020","journal-title":"PLoS Genet"},{"issue":"9","key":"pcbi.1010172.ref006","doi-asserted-by":"crossref","first-page":"969","DOI":"10.1038\/s41588-020-0676-4","article-title":"Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale","volume":"52","author":"X Li","year":"2020","journal-title":"Nat Genet"},{"issue":"4","key":"pcbi.1010172.ref007","doi-asserted-by":"crossref","first-page":"352","DOI":"10.1002\/gepi.22287","article-title":"Convex combination sequence kernel association test for rare-variant studies","volume":"44","author":"DC 
Posner","year":"2020","journal-title":"Genet Epidemiol"},{"key":"pcbi.1010172.ref008","doi-asserted-by":"crossref","first-page":"437","DOI":"10.3389\/fgene.2020.00437","article-title":"Genome-Wide Gene-Based Multi-Trait Analysis","volume":"11","author":"Y Deng","year":"2020","journal-title":"Front Genet."},{"issue":"1","key":"pcbi.1010172.ref009","doi-asserted-by":"crossref","first-page":"2850","DOI":"10.1038\/s41467-020-16591-0","article-title":"Multi-trait analysis of rare-variant association summary statistics using MTAR","volume":"11","author":"L Luo","year":"2020","journal-title":"Nat Commun."},{"issue":"1","key":"pcbi.1010172.ref010","doi-asserted-by":"crossref","first-page":"5461","DOI":"10.1038\/s41598-019-41827-5","article-title":"A generalized model for combining dependent SNP-level summary statistics and its extensions to statistics of other levels","volume":"9","author":"GR Svishcheva","year":"2019","journal-title":"Sci Rep."},{"issue":"19","key":"pcbi.1010172.ref011","doi-asserted-by":"crossref","first-page":"3701","DOI":"10.1093\/bioinformatics\/btz172","article-title":"Gene-based association tests using GWAS summary statistics","volume":"35","author":"GR Svishcheva","year":"2019","journal-title":"Bioinformatics"},{"issue":"1","key":"pcbi.1010172.ref012","doi-asserted-by":"crossref","first-page":"2484","DOI":"10.1038\/s41598-021-82123-5","article-title":"Gene-based association analysis identifies 190 genes affecting neuroticism","volume":"11","author":"NM Belonogova","year":"2021","journal-title":"Sci Rep."},{"issue":"7726","key":"pcbi.1010172.ref013","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1038\/s41586-018-0579-z","article-title":"The UK Biobank resource with deep phenotyping and genomic data","volume":"562","author":"C Bycroft","year":"2018","journal-title":"Nature"},{"issue":"3","key":"pcbi.1010172.ref014","doi-asserted-by":"crossref","first-page":"e1001779","DOI":"10.1371\/journal.pmed.1001779","article-title":"UK biobank: an 
open access resource for identifying the causes of a wide range of complex diseases of middle and old age","volume":"12","author":"C Sudlow","year":"2015","journal-title":"PLoS medicine."},{"issue":"9","key":"pcbi.1010172.ref015","doi-asserted-by":"crossref","first-page":"1026","DOI":"10.1093\/aje\/kwx246","article-title":"Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population","volume":"186","author":"A Fry","year":"2017","journal-title":"American journal of epidemiology"},{"issue":"1","key":"pcbi.1010172.ref016","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/0191-8869(85)90026-1","article-title":"A revised version of the psychoticism scale","volume":"6","author":"SB Eysenck","year":"1985","journal-title":"Personality and individual differences."},{"issue":"1","key":"pcbi.1010172.ref017","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-018-03242-8","article-title":"Item-level analyses reveal genetic heterogeneity in neuroticism","volume":"9","author":"M Nagel","year":"2018","journal-title":"Nature communications"},{"issue":"4","key":"pcbi.1010172.ref018","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1016\/j.ajhg.2017.08.012","article-title":"Prospects of fine-mapping trait-associated genomic regions by using summary statistics from genome-wide association studies","volume":"101","author":"C Benner","year":"2017","journal-title":"The American Journal of Human Genetics"},{"issue":"2","key":"pcbi.1010172.ref019","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1002\/gepi.20266","article-title":"A principal components regression approach to multilocus genetic association studies.","volume":"32","author":"K Wang","year":"2008","journal-title":"Genetic Epidemiology: The Official Publication of the International Genetic Epidemiology 
Society"},{"issue":"7","key":"pcbi.1010172.ref020","doi-asserted-by":"crossref","first-page":"726","DOI":"10.1002\/gepi.21757","article-title":"Functional linear models for association analysis of quantitative traits","volume":"37","author":"R Fan","year":"2013","journal-title":"Genetic epidemiology"},{"issue":"6","key":"pcbi.1010172.ref021","doi-asserted-by":"crossref","first-page":"e0128999","DOI":"10.1371\/journal.pone.0128999","article-title":"Region-based association test for familial data under functional linear models","volume":"10","author":"GR Svishcheva","year":"2015","journal-title":"PloS one"},{"issue":"1","key":"pcbi.1010172.ref022","doi-asserted-by":"crossref","first-page":"e0190486","DOI":"10.1371\/journal.pone.0190486","article-title":"Weighted functional linear regression models for gene-based association analysis.","volume":"13","author":"NM Belonogova","year":"2018","journal-title":"Plos one"},{"issue":"3","key":"pcbi.1010172.ref023","doi-asserted-by":"crossref","first-page":"883","DOI":"10.1534\/genetics.117.300257","article-title":"COMBAT: a combined association test for genes using summary statistics","volume":"207","author":"M Wang","year":"2017","journal-title":"Genetics"},{"issue":"6","key":"pcbi.1010172.ref024","doi-asserted-by":"crossref","first-page":"516","DOI":"10.1002\/gepi.22136","article-title":"FastSKAT: Sequence kernel association tests for very large sets of markers","volume":"42","author":"T Lumley","year":"2018","journal-title":"Genet Epidemiol"},{"issue":"4","key":"pcbi.1010172.ref025","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1038\/ng.2213","article-title":"Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits","volume":"44","author":"J Yang","year":"2012","journal-title":"Nat Genet"},{"issue":"9","key":"pcbi.1010172.ref026","doi-asserted-by":"crossref","first-page":"1335","DOI":"10.1038\/s41588-018-0184-y","article-title":"Efficiently 
controlling for case-control imbalance and sample relatedness in large-scale genetic association studies","volume":"50","author":"W Zhou","year":"2018","journal-title":"Nature genetics"},{"issue":"11","key":"pcbi.1010172.ref027","doi-asserted-by":"crossref","first-page":"1616","DOI":"10.1038\/s41588-021-00954-4","article-title":"A generalized linear mixed model association tool for biobank-scale data","volume":"53","author":"L Jiang","year":"2021","journal-title":"Nature genetics"},{"issue":"6","key":"pcbi.1010172.ref028","doi-asserted-by":"crossref","first-page":"e0233847","DOI":"10.1371\/journal.pone.0233847","article-title":"A rank-based normalization method with the fully adjusted full-stage procedure in genetic association studies","volume":"15","author":"L-C Chien","year":"2020","journal-title":"PloS one."}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1010172","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,6,14]],"date-time":"2022-06-14T00:00:00Z","timestamp":1655164800000}}],"container-title":["PLOS Computational 
Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010172","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,6,14]],"date-time":"2022-06-14T14:06:17Z","timestamp":1655215577000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010172"}},"subtitle":[],"editor":[{"given":"Andrey","family":"Rzhetsky","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,6,2]]},"references-count":28,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2022,6,2]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1010172","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.10.25.465680","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,6,2]]}}}