{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T07:59:36Z","timestamp":1775289576504,"version":"3.50.1"},"reference-count":39,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2018,11,29]],"date-time":"2018-11-29T00:00:00Z","timestamp":1543449600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"UMR 8071 - Statistique et G\u00e9nome"},{"name":"Bioptimize"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2018,12]]},"DOI":"10.1186\/s12859-018-2475-9","type":"journal-article","created":{"date-parts":[[2018,11,29]],"date-time":"2018-11-29T09:32:16Z","timestamp":1543483936000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["Learning the optimal scale for GWAS through hierarchical SNP aggregation"],"prefix":"10.1186","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6239-869X","authenticated-orcid":false,"given":"Florent","family":"Guinot","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marie","family":"Szafranski","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christophe","family":"Ambroise","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Franck","family":"Samson","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2018,11,29]]},"reference":[{"issue":"1","key":"2475_CR1","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","volume":"57","author":"Y Benjamini","year":"1995","unstructured":"Benjamini Y, Hochberg Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc Ser B. 1995; 57(1):289\u2013300.","journal-title":"J R Stat Soc Ser B"},{"key":"2475_CR2","doi-asserted-by":"publisher","first-page":"660","DOI":"10.1093\/bioinformatics\/bti063","volume":"21","author":"C Dalmasso","year":"2005","unstructured":"Dalmasso C, Bro\u00ebt P, Moreau T. A simple procedure for estimating the false discovery rate. Bioinformatics. 2005; 21:660\u20138.","journal-title":"Bioinformatics"},{"issue":"7218","key":"2475_CR3","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1038\/456018a","volume":"456","author":"B Maher","year":"2008","unstructured":"Maher B. Personal genomes: The case of the missing heritability. Nat News. 2008; 456(7218):18\u201321.","journal-title":"Nat News"},{"issue":"1","key":"2475_CR4","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1016\/j.ajhg.2014.06.009","volume":"95","author":"S Lee","year":"2014","unstructured":"Lee S, Abecasis GR, Boehnke M, Lin X. Rare-variant association analysis: study designs and statistical tests. Am J Hum Genet. 2014; 95(1):5\u201323.","journal-title":"Am J Hum Genet"},{"issue":"7265","key":"2475_CR5","doi-asserted-by":"publisher","first-page":"747","DOI":"10.1038\/nature08494","volume":"461","author":"TA Manolio","year":"2009","unstructured":"Manolio TA, Visscher PM. Finding the missing heritability of complex diseases. Nature. 2009; 461(7265):747\u201353.","journal-title":"Nature"},{"issue":"2","key":"2475_CR6","doi-asserted-by":"publisher","first-page":"84","DOI":"10.1159\/000336982","volume":"73","author":"JL Asimit","year":"2012","unstructured":"Asimit JL, Day-Williams AG, Morris AP, Zeggini E. ARIEL and AMELIA: testing for an accumulation of rare variants using next-generation sequencing data. Hum Hered. 2012; 73(2):84\u201394.","journal-title":"Hum Hered"},{"issue":"1","key":"2475_CR7","doi-asserted-by":"publisher","first-page":"82","DOI":"10.1016\/j.ajhg.2011.05.029","volume":"89","author":"MC Wu","year":"2011","unstructured":"Wu MC, Lee S, Cai T, Li Y, Boehnke M, Lin X. Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet. 2011; 89(1):82\u201393.","journal-title":"Am J Hum Genet"},{"issue":"4","key":"2475_CR8","doi-asserted-by":"publisher","first-page":"762","DOI":"10.1093\/biostatistics\/kxs014","volume":"13","author":"S Lee","year":"2012","unstructured":"Lee S, Wu MC, Lin X. Optimal tests for rare variant effects in sequencing association studies. Biostatistics. 2012; 13(4):762\u201375.","journal-title":"Biostatistics"},{"issue":"2","key":"2475_CR9","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1093\/biomet\/asn007","volume":"95","author":"N Meinshausen","year":"2008","unstructured":"Meinshausen N. Hierarchical testing of variable importance. Biometrika. 2008; 95(2):265\u201378.","journal-title":"Biometrika"},{"issue":"2","key":"2475_CR10","first-page":"65","volume":"6","author":"S Holm","year":"1979","unstructured":"Holm S. A simple sequentially rejective multiple test procedure. Scand J Stat. 1979; 6(2):65\u201370.","journal-title":"Scand J Stat"},{"issue":"4","key":"2475_CR11","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1038\/nrg777","volume":"3","author":"KG Ardlie","year":"2002","unstructured":"Ardlie KG, Kruglyak L, Seielstad M. Patterns of linkage disequilibrium in the human genome. Nat Rev Genet. 2002; 3(4):299\u2013309.","journal-title":"Nat Rev Genet"},{"issue":"5576","key":"2475_CR12","doi-asserted-by":"publisher","first-page":"2225","DOI":"10.1126\/science.1069424","volume":"296","author":"SB Gabriel","year":"2002","unstructured":"Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, L-C SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D. The structure of haplotype blocks in the human genome. Science. 2002; 296(5576):2225\u20139.","journal-title":"Science"},{"issue":"8","key":"2475_CR13","doi-asserted-by":"publisher","first-page":"803","DOI":"10.1002\/gepi.20242","volume":"31","author":"BE Huang","year":"2007","unstructured":"Huang BE, Amos CI, Lin DY. Detecting haplotype effects in genomewide association studies. Genet Epidemiol. 2007; 31(8):803\u201312.","journal-title":"Genet Epidemiol"},{"issue":"6","key":"2475_CR14","doi-asserted-by":"publisher","first-page":"929","DOI":"10.1016\/j.ajhg.2010.05.002","volume":"86","author":"MC Wu","year":"2010","unstructured":"Wu MC, Kraft P, Epstein MP, Taylor DM, Chanock SJ, Hunter DJ, Lin X. Powerful SNP-Set Analysis for Case-Control Genome-wide Association Studies. Am J Hum Genet. 2010; 86(6):929\u201342.","journal-title":"Am J Hum Genet"},{"issue":"12","key":"2475_CR15","doi-asserted-by":"publisher","first-page":"1526","DOI":"10.1093\/bioinformatics\/btt177","volume":"29","author":"J Listgarten","year":"2013","unstructured":"Listgarten J, Lippert C, Kang EY, Xiang J, Kadie CM, Heckerman D. A powerful and efficient set test for genetic markers that handles confounders. Bioinformatics. 2013; 29(12):1526\u201333.","journal-title":"Bioinformatics"},{"key":"2475_CR16","doi-asserted-by":"publisher","unstructured":"Yoo YJ, Sun L, G Poirier J, D Paterson A, B Bull S. Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure: Yoo et al.Genet Epidemiol. 2016; 41. https:\/\/doi.org\/10.1002\/gepi.22024 .","DOI":"10.1002\/gepi.22024"},{"key":"2475_CR17","doi-asserted-by":"publisher","unstructured":"Par\u00e9 G, Asma S, Deng WQ. Contribution of Large Region Joint Associations to Complex Traits Genetics. PLoS Genet. 2015; 11. https:\/\/doi.org\/10.1371\/journal.pgen.1005103 .","DOI":"10.1371\/journal.pgen.1005103"},{"key":"2475_CR18","doi-asserted-by":"publisher","first-page":"1178","DOI":"10.1093\/bioinformatics\/btv719","volume":"32","author":"I-Y Kwak","year":"2016","unstructured":"Kwak I-Y, Pan W. Adaptive gene- and pathway-trait association testing with GWAS summary statistics. Bioinformatics. 2016; 32:1178\u201384. https:\/\/doi.org\/10.1093\/bioinformatics\/btv719 .","journal-title":"Bioinformatics"},{"key":"2475_CR19","doi-asserted-by":"publisher","unstructured":"Petersen A, Alvarez C, DeClaire S, Tintle NL. Assessing Methods for Assigning SNPs to Genes in Gene-Based Tests of Association Using Common Variants. PLOS ONE. 2013; 8. https:\/\/doi.org\/10.1371\/journal.pone.0062161 .","DOI":"10.1371\/journal.pone.0062161"},{"issue":"3","key":"2475_CR20","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1086\/519795","volume":"81","author":"S Purcell","year":"2007","unstructured":"Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, Maller J, Sklar P, De B PIW, Daly MJ, et al. Plink: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007; 81(3):559\u201375.","journal-title":"Am J Hum Genet"},{"issue":"7145","key":"2475_CR21","doi-asserted-by":"publisher","first-page":"661","DOI":"10.1038\/nature05911","volume":"447","author":"WTCCC","year":"2007","unstructured":"WTCCC. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007; 447(7145):661\u201378.","journal-title":"Nature"},{"issue":"7","key":"2475_CR22","doi-asserted-by":"publisher","first-page":"730","DOI":"10.1038\/ng.2667","volume":"45","author":"A Cortes","year":"2013","unstructured":"International Genetics of Ankylosing Spondylitis Consortium (IGAS), Cortes A, Hadler J, Pointon JP, Robinson PC, et al. Identification of multiple risk variants for ankylosing spondylitis through high-density genotyping of immune-related loci. Nat Genet. 2013; 45(7):730\u20138.","journal-title":"Nat Genet"},{"key":"2475_CR23","doi-asserted-by":"publisher","first-page":"148","DOI":"10.1186\/s12859-015-0556-6","volume":"16","author":"A Dehman","year":"2015","unstructured":"Dehman A, Ambroise C, Neuvial P. Performance of a blockwise approach in variable selection using linkage disequilibrium information. BMC Bioinformatics. 2015; 16:148.","journal-title":"BMC Bioinformatics"},{"issue":"301","key":"2475_CR24","doi-asserted-by":"publisher","first-page":"236","DOI":"10.1080\/01621459.1963.10500845","volume":"58","author":"JH Ward","year":"1963","unstructured":"Ward JH. Hierarchical Grouping to Optimize an Objective Function. J Am Stat Assoc. 1963; 58(301):236\u201344.","journal-title":"J Am Stat Assoc"},{"issue":"2","key":"2475_CR25","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1111\/1467-9868.00293","volume":"63","author":"R Tibshirani","year":"2001","unstructured":"Tibshirani R, Walther G, Hastie T. Estimating the number of clusters in a data set via the gap statistic. J R Stat Soc Ser B. 2001; 63(2):411\u201323.","journal-title":"J R Stat Soc Ser B"},{"key":"2475_CR26","unstructured":"Hartigan JA. In: Wiley Series in Probability and Mathematical Statistics, (ed).Clustering algorithms: Wiley; 1975."},{"issue":"1","key":"2475_CR27","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/03610927408827101","volume":"3","author":"T Cali\u0144ski","year":"1974","unstructured":"Cali\u0144ski T, Harabasz J. A dendrite method for cluster analysis. Commun Stat. 1974; 3(1):1\u201327.","journal-title":"Commun Stat"},{"issue":"1","key":"2475_CR28","doi-asserted-by":"publisher","first-page":"23","DOI":"10.2307\/2531893","volume":"44","author":"WJ Krzanowski","year":"1988","unstructured":"Krzanowski WJ, Lai YT. A criterion for determining the number of groups in a data set using sum-of-squares clustering. Biometrics. 1988; 44(1):23\u201334.","journal-title":"Biometrics"},{"key":"2475_CR29","first-page":"499","volume":"2","author":"O Bousquet","year":"2002","unstructured":"Bousquet O, Elisseeff A. Stability and Generalization. J Mach Learn Res. 2002; 2:499\u2013526.","journal-title":"J Mach Learn Res"},{"key":"2475_CR30","doi-asserted-by":"crossref","unstructured":"Bonnet A, Gassiat E, L\u00e9vy-Leduc C. Heritability estimation in high dimensional linear mixed models. ArXiv e-prints. 2014. http:\/\/arxiv.org\/abs\/1404.3397.","DOI":"10.1214\/15-EJS1069"},{"issue":"3","key":"2475_CR31","first-page":"370","volume":"135","author":"JA Nelder","year":"1972","unstructured":"Nelder JA, Wedderburn RWM. Generalized Linear Models. J R Stat Soc: Ser A. 1972; 135(3):370\u201384.","journal-title":"J R Stat Soc: Ser A"},{"issue":"5","key":"2475_CR32","doi-asserted-by":"publisher","first-page":"335","DOI":"10.1038\/nrg3706","volume":"15","author":"PC Sham","year":"2014","unstructured":"Sham PC, Purcell SM. Statistical power and significance testing in large-scale genetic studies. Nat Rev Genet. 2014; 15(5):335\u201346.","journal-title":"Nat Rev Genet"},{"issue":"16","key":"2475_CR33","doi-asserted-by":"publisher","first-page":"2304","DOI":"10.1093\/bioinformatics\/btr341","volume":"27","author":"Z Su","year":"2011","unstructured":"Su Z, Marchini J, Donnelly P. Hapgen2: simulation of multiple disease snps. Bioinformatics. 2011; 27(16):2304.","journal-title":"Bioinformatics"},{"issue":"1","key":"2475_CR34","doi-asserted-by":"publisher","first-page":"14","DOI":"10.1186\/1471-2288-12-14","volume":"1","author":"FA Diaz-Quijano","year":"2012","unstructured":"Diaz-Quijano FA. A simple method for estimating relative risk using logistic regression\u201d. BMC Med Res Methodol. 2012; 1(1):14.","journal-title":"BMC Med Res Methodol"},{"key":"2475_CR35","doi-asserted-by":"publisher","unstructured":"Brzyski D, Peterson CB, Sobczyk P, Cand\u00e8s EJ, Bogdan M, Sabatti C. Controlling the Rate of GWAS False Discoveries. Genetics. 2017; 205. https:\/\/doi.org\/10.1534\/genetics.116.193987 .","DOI":"10.1534\/genetics.116.193987"},{"issue":"1","key":"2475_CR36","doi-asserted-by":"publisher","first-page":"205","DOI":"10.1534\/genetics.114.167817","volume":"199","author":"H Yi","year":"2015","unstructured":"Yi H, Breheny P, Imam N, Liu Y, Hoeschele I. Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits. Genetics. 2015; 199(1):205\u201322.","journal-title":"Genetics"},{"key":"2475_CR37","doi-asserted-by":"publisher","first-page":"904","DOI":"10.1038\/ng1847","volume":"38","author":"AL Price","year":"2006","unstructured":"Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006; 38:904\u20139.","journal-title":"Nat Genet"},{"issue":"6","key":"2475_CR38","doi-asserted-by":"publisher","first-page":"504","DOI":"10.1136\/ard.37.6.504","volume":"37","author":"JC Woodrow","year":"1978","unstructured":"Woodrow JC, Eastmond CJ. HLA B27 and the genetics of ankylosing spondylitis. Ann Rheum Dis. 1978; 37(6):504\u20139.","journal-title":"Ann Rheum Dis"},{"key":"2475_CR39","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1016\/0167-9473(93)90217-H","volume":"15","author":"L Breiman","year":"1993","unstructured":"Breiman L. Fitting additive models to regression data. diagnostics and alternative views. Comput Stat Data Anal. 1993; 15:13\u201346.","journal-title":"Comput Stat Data Anal"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-018-2475-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12859-018-2475-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-018-2475-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T07:13:27Z","timestamp":1775286807000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-018-2475-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,11,29]]},"references-count":39,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2018,12]]}},"alternative-id":["2475"],"URL":"https:\/\/doi.org\/10.1186\/s12859-018-2475-9","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,11,29]]},"assertion":[{"value":"29 September 2017","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 November 2018","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 November 2018","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The dataset regarding ankylosing spondylitis consists of the French subset of the large study of the International Genetics of Ankylosing Spondylitis (IGAS) study [\n                      \n                      ]. For this subset, unrelated cases were recruited through the Rheumatology clinic of Ambroise Par\u00e9 Hospital (Boulogne-Billancourt, France) or through the national self-help patients\u2019 association: \u201cAssociation Fran\u00e7aise des Spondylarthritiques\u201d. Population-matched unrelated controls were obtained from the \u201cCentre d\u2019Etude du Polymorphisme Humain\u201d, or were recruited as healthy spouses of cases. The protocol was reviewed and approved by the Ethics committee of the Ambroise Par\u00e9 hospital. All participants gave their informed consent to the study.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}},{"value":"Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Publisher\u2019s Note"}}],"article-number":"459"}}