{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,16]],"date-time":"2026-05-16T04:00:46Z","timestamp":1778904046867,"version":"3.51.4"},"reference-count":68,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,7,1]],"date-time":"2020-07-01T00:00:00Z","timestamp":1593561600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,7,1]],"date-time":"2020-07-01T00:00:00Z","timestamp":1593561600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Algorithms Mol Biol"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Association studies have been widely used to search for associations between common genetic variants observations and a given phenotype. However, it is now generally accepted that genes and environment must be examined jointly when estimating phenotypic variance. In this work we consider two types of biological markers: genotypic markers, which characterize an observation in terms of inherited genetic information, and metagenomic marker which are related to the environment. Both types of markers are available in their millions and can be used to characterize any observation uniquely.<\/jats:p><\/jats:sec><jats:sec><jats:title>Objective<\/jats:title><jats:p>Our focus is on detecting interactions between groups of genetic and metagenomic markers in order to gain a better understanding of the complex relationship between environment and genome in the expression of a given phenotype.<\/jats:p><\/jats:sec><jats:sec><jats:title>Contributions<\/jats:title><jats:p>We propose a novel approach for efficiently detecting interactions between complementary datasets in a high-dimensional setting with a reduced computational cost. The method, named SICOMORE, reduces the dimension of the search space by selecting a subset of supervariables in the two complementary datasets. These supervariables are given by a weighted group structure defined on sets of variables at different scales. A Lasso selection is then applied on each type of supervariable to obtain a subset of potential interactions that will be explored via linear model testing.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We compare SICOMORE with other approaches in simulations, with varying sample sizes, noise, and numbers of true interactions. SICOMORE exhibits convincing results in terms of recall, as well as competitive performances with respect to running time. The method is also used to detect interaction between genomic markers in<jats:italic>Medicago truncatula<\/jats:italic>and metagenomic markers in its rhizosphere bacterial community.<\/jats:p><\/jats:sec><jats:sec><jats:title>Software availability<\/jats:title><jats:p>An package is available\u00a0[4], along with its documentation and associated scripts, allowing the reader to reproduce the results presented in the paper.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s13015-020-00173-2","type":"journal-article","created":{"date-parts":[[2020,7,1]],"date-time":"2020-07-01T08:36:50Z","timestamp":1593592610000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Fast computation of genome-metagenome interaction effects"],"prefix":"10.1186","volume":"15","author":[{"given":"Florent","family":"Guinot","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9747-9251","authenticated-orcid":false,"given":"Marie","family":"Szafranski","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Julien","family":"Chiquet","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anouk","family":"Zancarini","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christine","family":"Le Signor","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christophe","family":"Mougel","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christophe","family":"Ambroise","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,7,1]]},"reference":[{"issue":"4","key":"173_CR1","doi-asserted-by":"crossref","first-page":"643","DOI":"10.1093\/biomet\/76.4.643","volume":"76","author":"J Aitchison","year":"1989","unstructured":"Aitchison J, Ho CH. The multivariate poisson-log normal distribution. Biometrika. 1989;76(4):643\u201353.","journal-title":"Biometrika"},{"issue":"2","key":"173_CR2","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1111\/j.2517-6161.1982.tb01195.x","volume":"44","author":"J Aitchison","year":"1982","unstructured":"Aitchison J. The statistical analysis of compositional data. J Royal Stat Soc Series B. 1982;44(2):139\u201377.","journal-title":"J Royal Stat Soc Series B"},{"key":"173_CR3","doi-asserted-by":"publisher","unstructured":"Alexa A, Rahnenfuhrer J. topGO:Enrichment Analysis for Gene Ontology. (2019). R package version 3.10. https:\/\/doi.org\/10.18129\/B9.bioc.topGO 2019.","DOI":"10.18129\/B9.bioc.topGO"},{"key":"173_CR4","unstructured":"Ambroise C, Chiquet J, Guinot F, Szafranski M. sicomore: Selection of Interaction Effects in Compressed Multiple Omics Representations. (2020). R package version 0.2.1. http:\/\/julien.cremeriefamily.info\/sicomore-pkg\/ 2020."},{"key":"173_CR5","doi-asserted-by":"crossref","unstructured":"Bach F. Bolasso: model consistent lasso estimation through the bootstrap. In: Proceedings of the 25th Annual International Conference on Machine Learning, 2008;33\u201340.","DOI":"10.1145\/1390156.1390161"},{"key":"173_CR6","unstructured":"Benjamin H, Hothorn T. stabs: Stability Selection with Error Control. (2017). R package version 0.6-3. https:\/\/cran.r-project.org\/package=stabs 2017."},{"issue":"1","key":"173_CR7","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","volume":"57","author":"Y Benjamini","year":"1995","unstructured":"Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc Series B. 1995;57(1):289\u2013300.","journal-title":"J Royal Stat Soc Series B"},{"issue":"8","key":"173_CR8","doi-asserted-by":"crossref","first-page":"478","DOI":"10.1016\/j.tplants.2012.04.001","volume":"17","author":"RL Berendsen","year":"2012","unstructured":"Berendsen RL, Pieterse CMJ, Bakker PAHM. The rhizosphere microbiome and plant health. Trends Plant Sci. 2012;17(8):478\u201386.","journal-title":"Trends Plant Sci"},{"issue":"1","key":"173_CR9","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41598-018-37208-z","volume":"9","author":"J Bergelson","year":"2019","unstructured":"Bergelson J, Mittelstrass J, Horton MW. Characterizing both bacteria and fungi improves understanding of the arabidopsis root microbiome. Sci Rep. 2019;9(1):1\u201311.","journal-title":"Sci Rep"},{"issue":"3","key":"173_CR10","doi-asserted-by":"crossref","first-page":"1111","DOI":"10.1214\/13-AOS1096","volume":"41","author":"J Bien","year":"2013","unstructured":"Bien J, Taylor J, Tibshirani R. A Lasso for hierarchical interactions. Annals of statistics. 2013;41(3):1111.","journal-title":"Annals of statistics"},{"issue":"1462","key":"173_CR11","doi-asserted-by":"crossref","first-page":"1935","DOI":"10.1098\/rstb.2005.1725","volume":"360","author":"M Blaxter","year":"2005","unstructured":"Blaxter M, Mann J, Chapman T, Thomas F, Whitton C, Floyd R, Abebe E. Defining operational taxonomic units using dna barcode data. Philos Trans Royal Soc B. 2005;360(1462):1935\u201343.","journal-title":"Philos Trans Royal Soc B"},{"issue":"4","key":"173_CR12","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1080\/07352680490480734","volume":"23","author":"NJ Brewin","year":"2004","unstructured":"Brewin NJ. Plant cell wall remodelling in the rhizobium-legume symbiosis. Crit Rev Plant Sci. 2004;23(4):293\u2013316.","journal-title":"Crit Rev Plant Sci"},{"issue":"12","key":"173_CR13","doi-asserted-by":"crossref","first-page":"2639","DOI":"10.1038\/ismej.2017.119","volume":"11","author":"BJ Callahan","year":"2017","unstructured":"Callahan BJ, McMurdie PJ, Holmes SP. Exact sequence variants should replace operational taxonomic units in marker-gene data analysis. ISME J. 2017;11(12):2639.","journal-title":"ISME J"},{"key":"173_CR14","doi-asserted-by":"crossref","unstructured":"Chevalier J-A, Salmon J, Thirion B. Statistical Inference with Ensemble of Clustered Desparsified Lasso. arXiv:1806.05829 2018.","DOI":"10.1007\/978-3-030-00928-1_72"},{"issue":"4","key":"173_CR15","doi-asserted-by":"crossref","first-page":"306","DOI":"10.1016\/j.crvi.2007.02.012","volume":"330","author":"J Clavel","year":"2007","unstructured":"Clavel J. Progress in the epidemiological understanding of gene-environment interactions in major diseases: cancer. Comptes rendus biologies. 2007;330(4):306\u201317.","journal-title":"Comptes rendus biologies"},{"key":"173_CR16","doi-asserted-by":"publisher","unstructured":"Clayton D. snpStats:SnpMatrix and XSnpMatrix Classes and Methods. (2019). R package version 3.10. https:\/\/doi.org\/10.18129\/B9.bioc.snpStats 2019.","DOI":"10.18129\/B9.bioc.snpStats"},{"issue":"1","key":"173_CR17","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1186\/s12859-015-0556-6","volume":"16","author":"A Dehman","year":"2015","unstructured":"Dehman A, Ambroise C, Neuvial P. Performance of a blockwise approach in variable selection using linkage disequilibrium information. BMC Bioinformatics. 2015;16(1):148.","journal-title":"BMC Bioinformatics"},{"issue":"11","key":"173_CR18","doi-asserted-by":"crossref","first-page":"4789","DOI":"10.1109\/TIT.2008.929958","volume":"54","author":"DL Donoho","year":"2008","unstructured":"Donoho DL, Tsaig Y. Fast solution of-norm minimization problems when the solution may be sparse. IEEE Trans Inf Theory. 2008;54(11):4789\u2013812.","journal-title":"IEEE Trans Inf Theory"},{"issue":"14","key":"173_CR19","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1093\/bioinformatics\/btx237","volume":"33","author":"M Fischer","year":"2017","unstructured":"Fischer M, Strauch B, Renard BY. Abundance estimation and differential testing on strain level in metagenomics data. Bioinformatics. 2017;33(14):124\u201332.","journal-title":"Bioinformatics"},{"key":"173_CR20","doi-asserted-by":"crossref","first-page":"73","DOI":"10.17713\/ajs.v45i4.122","volume":"45","author":"GB Gloor","year":"2016","unstructured":"Gloor GB, Macklaim JM, Vu M, Fernandes AD. Compositional uncertainty should not be ignored in high-throughput sequencing data analysis. Austrian J Stat. 2016;45:73\u201387.","journal-title":"Austrian J Stat"},{"key":"173_CR21","doi-asserted-by":"crossref","first-page":"2224","DOI":"10.3389\/fmicb.2017.02224","volume":"8","author":"GB Gloor","year":"2017","unstructured":"Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ. Microbiome datasets are compositional: and this is not optional. Front Microbiol. 2017;8:2224.","journal-title":"Front Microbiol"},{"issue":"4","key":"173_CR22","first-page":"584","volume":"26","author":"JJ Goeman","year":"2011","unstructured":"Goeman JJ, Solari A, et al. Multiple testing for exploratory research. Stat Sci. 2011;26(4):584\u201397.","journal-title":"Stat Sci"},{"key":"173_CR23","volume-title":"Classification. Monographs on statistics and applied probability","author":"AD Gordon","year":"1999","unstructured":"Gordon AD, et al. Classification. Monographs on statistics and applied probability. Boca Raton: CRC Press; 1999."},{"issue":"3","key":"173_CR24","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1016\/j.tplants.2014.11.008","volume":"20","author":"B Gourion","year":"2015","unstructured":"Gourion B, Berrabah F, Ratet P, Stacey G. Rhizobium-legume symbioses: the crucial role of plant immunity. Trends Plant Sci. 2015;20(3):186\u201394.","journal-title":"Trends Plant Sci"},{"key":"173_CR25","unstructured":"Grimonprez Q. S\u00e9lection de groupes de variables corr\u00e9l\u00e9es en grande dimension. PhD thesis, Universit\u00e9 de Lille 2016."},{"key":"173_CR26","unstructured":"Grimonprez Q, Blanck S, Celisse A, Marot G, Yang Y, Zou H. MLGL: Multi-Layer Group-Lasso. (2020). R package version 0.6-1. https:\/\/cran.r-project.org\/package=MLGL 2020."},{"issue":"1","key":"173_CR27","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1186\/s12859-018-2475-9","volume":"19","author":"F Guinot","year":"2018","unstructured":"Guinot F, Szafranski M, Ambroise C, Samson F. Learning the optimal scale for GWAS through hierarchical SNP aggregation. BMC Bioinform. 2018;19(1):459\u201372.","journal-title":"BMC Bioinform"},{"key":"173_CR28","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1146\/annurev-phyto-080516-035623","volume":"55","author":"S Hacquard","year":"2017","unstructured":"Hacquard S, Spaepen S, Garrido-Oter R, Schulze-Lefert P. Interplay between innate immunity and the plant microbiota. Annual Rev Phytopathol. 2017;55:565\u201389.","journal-title":"Annual Rev Phytopathol"},{"issue":"1","key":"173_CR29","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1007\/s40471-018-0135-2","volume":"5","author":"SS Han","year":"2018","unstructured":"Han SS, Chatterjee N. Review of statistical methods for gene-environment interaction analysis. Curr Epidemiol Rep. 2018;5(1):39\u201345.","journal-title":"Curr Epidemiol Rep"},{"issue":"6052","key":"173_CR30","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1126\/science.1209244","volume":"334","author":"AM Hancock","year":"2011","unstructured":"Hancock AM, Brachi B, Faure N, Horton MW, Jarymowycz LB, Sperone FG, Toomajian C, Roux F, Bergelson J. Adaptation to climate across the arabidopsis thaliana genome. Science. 2011;334(6052):83\u20136.","journal-title":"Science"},{"issue":"1","key":"173_CR31","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1186\/s40168-018-0445-0","volume":"6","author":"MA Hassani","year":"2018","unstructured":"Hassani MA, Dur\u00e1n P, Hacquard S. Microbial interactions within the plant holobiont. Microbiome. 2018;6(1):58.","journal-title":"Microbiome"},{"key":"173_CR32","doi-asserted-by":"crossref","first-page":"535","DOI":"10.3389\/fgene.2019.00535","volume":"10","author":"JS Hawe","year":"2019","unstructured":"Hawe JS, Theis FJ, Heinig M. Inferring interaction networks from multi-comics data-a review. Front Genet. 2019;10:535.","journal-title":"Front Genet"},{"key":"173_CR33","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1186\/s12859-015-0575-3","volume":"16","author":"B Hofner","year":"2015","unstructured":"Hofner B, Boccuto L, G\u00f6ker M. Controlling false discoveries in high-dimensional situations: boosting with stability selection. BMC Bioinform. 2015;16:144.","journal-title":"BMC Bioinform"},{"issue":"1","key":"173_CR34","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/ncomms6320","volume":"5","author":"MW Horton","year":"2014","unstructured":"Horton MW, Bodenhausen N, Beilsmith K, Meng D, Muegge BD, Subramanian S, Vetter MM, Vilhj\u00e1lmsson BJ, Nordborg M, Gordon JI, et al. Genome-wide association study of arabidopsis thaliana leaf microbial community. Nat Commun. 2014;5(1):1\u20137.","journal-title":"Nat Commun"},{"key":"173_CR35","doi-asserted-by":"crossref","unstructured":"Huang S, Chaudhary K, Garmire LX. More is better: Recent progress in multi-omics data integration methods. Frontiers in Genetics. 2017;8:","DOI":"10.3389\/fgene.2017.00084"},{"issue":"7","key":"173_CR36","doi-asserted-by":"crossref","first-page":"643","DOI":"10.1002\/gepi.21756","volume":"37","author":"CM Hutter","year":"2013","unstructured":"Hutter CM, Mechanic LE, Chatterjee N, Kraft P, Gillanders EM, Tank NG-ET. Gene-environment interactions in cancer epidemiology: a national cancer institute think tank report. Genet Epidemiol. 2013;37(7):643\u201357.","journal-title":"Genet Epidemiol"},{"key":"173_CR37","doi-asserted-by":"crossref","unstructured":"Jacob L, Obozinski G, Vert J-P. Group Lasso with overlap and graph Lasso. In: Proceedings of the 26th Annual International Conference on Machine Learning, 2009;33\u2013440.","DOI":"10.1145\/1553374.1553431"},{"issue":"12","key":"173_CR38","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1186\/s13073-014-0107-1","volume":"6","author":"D Knights","year":"2014","unstructured":"Knights D, Silverberg MS, Weersma RK, Gevers D, Dijkstra G, Huang H, Tyler AD, Van Sommeren S, Imhann F, Stempak JM, et al. Complex host genetics influence the microbiome in inflammatory bowel disease. Genome Med. 2014;6(12):107.","journal-title":"Genome Med"},{"key":"173_CR39","unstructured":"Lee S, G\u00f6rnitz N, Xing EP, Heckerman D, Lippert C. Ensembles of Lasso screening rules. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2017;PP(99):1\u20131."},{"key":"173_CR40","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1146\/annurev-statistics-010814-020351","volume":"2","author":"H Li","year":"2015","unstructured":"Li H. Microbiome, metagenomics, and high-dimensional compositional data analysis. Ann Rev Stat Appl. 2015;2:73\u201394.","journal-title":"Ann Rev Stat Appl"},{"issue":"2","key":"173_CR41","first-page":"325","volume":"19","author":"Y Li","year":"2016","unstructured":"Li Y, Wu F-X, Ngom A. A review on machine learning principles for multi-view biological data integration. Briefings in Bioinformatics. 2016;19(2):325\u201340.","journal-title":"Briefings in Bioinformatics"},{"issue":"3","key":"173_CR42","doi-asserted-by":"crossref","first-page":"627","DOI":"10.1080\/10618600.2014.938812","volume":"24","author":"M Lim","year":"2015","unstructured":"Lim M, Hastie T. Learning interactions via hierarchical group-Lasso regularization. J Comput Graph Stat. 2015;24(3):627\u201354.","journal-title":"J Comput Graph Stat"},{"key":"173_CR43","unstructured":"Lim M, Hastie T. glinternet: Learning Interactions Via Hierarchical Group-Lasso Regularization. (2019). R package version 1.0.10. https:\/\/cran.r-project.org\/package=glinternet 2019."},{"issue":"4","key":"173_CR44","doi-asserted-by":"crossref","first-page":"667","DOI":"10.1093\/biostatistics\/kxt006","volume":"14","author":"X Lin","year":"2013","unstructured":"Lin X, Lee S, Christiani DC, Lin X. Test for interactions between a genetic marker set and environment in generalized linear models. Biostatistics. 2013;14(4):667\u201381.","journal-title":"Biostatistics"},{"key":"173_CR45","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1146\/annurev.micro.62.081307.162918","volume":"63","author":"B Lugtenberg","year":"2009","unstructured":"Lugtenberg B, Kamilova F. Plant-growth-promoting rhizobacteria. Annual review of microbiology. 2009;63:541\u201356.","journal-title":"Annual review of microbiology"},{"issue":"7265","key":"173_CR46","doi-asserted-by":"crossref","first-page":"747","DOI":"10.1038\/nature08494","volume":"461","author":"TA Manolio","year":"2009","unstructured":"Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, Cho JH, Guttmacher AE, Kong A, Kruglyak L, Mardis E, Rotimi CN, Slatkin M, Valle D, Whittemore AS, Boehnke M, Clark AG, Eichler EE, Gibson G, Haines JL, Mackay TFC, McCarroll SA, Visscher PM. Finding the missing heritability of complex diseases. Nature. 2009;461(7265):747\u201353.","journal-title":"Nature"},{"issue":"4","key":"173_CR47","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1111\/j.1467-9868.2010.00740.x","volume":"72","author":"N Meinshausen","year":"2010","unstructured":"Meinshausen N, B\u00fchlmann P. Stability selection. J Royal Stat Soc Series B. 2010;72(4):417\u201373.","journal-title":"J Royal Stat Soc Series B"},{"issue":"2","key":"173_CR48","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1007\/BF02294245","volume":"50","author":"GW Milligan","year":"1985","unstructured":"Milligan GW, Cooper MC. An examination of procedures for determining the number of clusters in a data set. Psychometrika. 1985;50(2):159\u201379.","journal-title":"Psychometrika"},{"key":"173_CR49","doi-asserted-by":"crossref","DOI":"10.1007\/978-94-017-3209-3","volume-title":"Techniques in mycorrhizal studies","author":"KG Mukerji","year":"2002","unstructured":"Mukerji KG, Manoharachary C, Chamola BP. Techniques in mycorrhizal studies. Dordrecht: Springer; 2002."},{"issue":"2","key":"173_CR50","doi-asserted-by":"crossref","first-page":"212","DOI":"10.1093\/biostatistics\/kxl002","volume":"8","author":"MY Park","year":"2007","unstructured":"Park MY, Hastie T, Tibshirani R. Averaged gene expressions for regression. Biostatistics. 2007;8(2):212\u201327.","journal-title":"Biostatistics"},{"key":"173_CR51","first-page":"489","volume":"60","author":"K Pearson","year":"1896","unstructured":"Pearson K. Mathematical contributions to the theory of evolution on a form of spurious correlation which may arise when indices are used in the measurement of organs. Proc Royal Soci London. 1896;60:489\u201398.","journal-title":"Proc Royal Soci London"},{"key":"173_CR52","doi-asserted-by":"crossref","DOI":"10.1201\/9781420005585","volume-title":"The rhizosphere: biochemistry and organic substances at the soil-plant interface","author":"R Pinton","year":"2007","unstructured":"Pinton R, Varanini Z, Nannipieri P. The rhizosphere: biochemistry and organic substances at the soil-plant interface. Boca Raton: CRC Press; 2007."},{"issue":"7418","key":"173_CR53","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1038\/nature11450","volume":"490","author":"J Qin","year":"2012","unstructured":"Qin J, Li Y, Cai Z, Li S, Zhu J, Zhang F, Liang S, Zhang W, Guan Y, Shen D, et al. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature. 2012;490(7418):55\u201360.","journal-title":"Nature"},{"key":"173_CR54","unstructured":"Rau A. Statistical methods and software for the analysis of transcriptomic data. Habilitation \u00e0 diriger des recherches, Universit\u00e9 d\u2019Evry Val d\u2019Essonne 2017."},{"issue":"6","key":"173_CR55","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1186\/gb-2011-12-6-r60","volume":"12","author":"N Segata","year":"2011","unstructured":"Segata N, Izard J, Waldron L, Gevers D, Miropolsky L, Garrett WS, Huttenhower C. Metagenomic biomarker discovery and explanation. Genome Biol. 2011;12(6):60.","journal-title":"Genome Biol"},{"issue":"521","key":"173_CR56","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1080\/01621459.2016.1260470","volume":"113","author":"Y She","year":"2016","unstructured":"She Y, Wang Z, Jiang H. Group regularized estimation under structural hierarchy. J Am Stat Assoc. 2016;113(521):445\u201354.","journal-title":"J Am Stat Assoc"},{"key":"173_CR57","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/ncomms3462","volume":"4","author":"G Srinivas","year":"2013","unstructured":"Srinivas G, M\u00f6ller S, K\u00fcnzel S, Zillikens D, Baines JF, Ibrahim SM. Genome-wide mapping of gene-microbiota interactions in susceptibility to autoimmune skin blistering. Nat Commun. 2013;4:1\u20137.","journal-title":"Nat Commun"},{"issue":"1","key":"173_CR58","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1186\/s12859-017-1488-0","volume":"18","author":"V Stanislas","year":"2017","unstructured":"Stanislas V, Dalmasso C, Ambroise C. Eigen-epistasis for detecting gene-gene interactions. BMC Bioinform. 2017;18(1):54\u201367.","journal-title":"BMC Bioinform"},{"key":"173_CR59","unstructured":"Su Z, Marchini J, Donnelly P. HAPGEN: Version 2. (2011a). Version v2.1.2. https:\/\/mathgen.stats.ox.ac.uk\/genetics_software\/hapgen\/hapgen2.html 2011."},{"issue":"16","key":"173_CR60","doi-asserted-by":"crossref","first-page":"2304","DOI":"10.1093\/bioinformatics\/btr341","volume":"27","author":"Z Su","year":"2011","unstructured":"Su Z, Marchini J, Donnelly P. HAPGEN2: Simulation of multiple disease SNPs. Bioinformatics. 2011b;27(16):2304.","journal-title":"Bioinformatics"},{"issue":"4","key":"173_CR61","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1038\/nrg2764","volume":"11","author":"D Thomas","year":"2010","unstructured":"Thomas D. Gene-environment-wide association studies: emerging approaches. Nat Rev Genet. 2010;11(4):259\u201372.","journal-title":"Nat Rev Genet"},{"key":"173_CR62","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1111\/1467-9868.00293","volume":"63","author":"R Tibshirani","year":"2001","unstructured":"Tibshirani R, Walther G, Hastie T. Estimating the number of clusters in a data set via the gap statistic. J Royal Stat Soc Series B. 2001;63:411\u201323.","journal-title":"J Royal Stat Soc Series B"},{"issue":"10","key":"173_CR63","doi-asserted-by":"crossref","first-page":"3357","DOI":"10.1105\/tpc.109.072827","volume":"22","author":"Y Tu","year":"2010","unstructured":"Tu Y, Rochfort S, Liu Z, Ran Y, Griffith M, Badenhorst P, Louie GV, Bowman ME, Smith KF, Noel JP, Mouradov A, Spangenbergothers G. Functional analyses of caffeic acid o-methyltransferase and cinnamoyl-coa-reductase genes from perennial ryegrass (lolium perenne). Plant Cell. 2010;22(10):3357\u201373.","journal-title":"Plant Cell"},{"key":"173_CR64","doi-asserted-by":"crossref","first-page":"85","DOI":"10.3389\/fpls.2012.00085","volume":"3","author":"W Underwood","year":"2012","unstructured":"Underwood W. The plant cell wall: a dynamic barrier against pathogen invasion. Front Plant Sci. 2012;3:85.","journal-title":"Front Plant Sci"},{"issue":"1","key":"173_CR65","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1016\/J.ENG.2017.01.008","volume":"3","author":"B Wang","year":"2017","unstructured":"Wang B, Yao M, Lv L, Ling Z, Li L. The human microbiota in health and disease. Engineering. 2017;3(1):71\u201382.","journal-title":"Engineering"},{"issue":"8","key":"173_CR66","doi-asserted-by":"crossref","first-page":"508","DOI":"10.1038\/nrmicro.2016.83","volume":"14","author":"J Wang","year":"2016","unstructured":"Wang J, Jia H. Metagenome-wide association studies: fine-mining the microbiome. Nat Rev Microbiol. 2016;14(8):508\u201322.","journal-title":"Nat Rev Microbiol"},{"issue":"11","key":"173_CR67","doi-asserted-by":"crossref","first-page":"1396","DOI":"10.1038\/ng.3695","volume":"48","author":"J Wang","year":"2016","unstructured":"Wang J, Thingholm LB, Skiecevi\u010dien\u0117 J, Rausch P, Kummen M, Kummen M, Hov JR, Degenhardt F, Heinsen FA, R\u00fchlemann MC, Szymczak S. Genome-wide association analysis identifies variation in vitamin D receptor and other host factors influencing the gut microbiota. Nat Genet. 2016;48(11):1396\u2013406.","journal-title":"Nat Genet"},{"issue":"301","key":"173_CR68","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1080\/01621459.1963.10500845","volume":"58","author":"JHJ Ward","year":"1963","unstructured":"Ward JHJ. Hierarchical grouping to optimize an objective function. J Am Stat Assoc. 1963;58(301):236\u201344.","journal-title":"J Am Stat Assoc"}],"container-title":["Algorithms for Molecular Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13015-020-00173-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13015-020-00173-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13015-020-00173-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,8]],"date-time":"2024-08-08T23:55:14Z","timestamp":1723161314000},"score":1,"resource":{"primary":{"URL":"https:\/\/almob.biomedcentral.com\/articles\/10.1186\/s13015-020-00173-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,1]]},"references-count":68,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["173"],"URL":"https:\/\/doi.org\/10.1186\/s13015-020-00173-2","relation":{},"ISSN":["1748-7188"],"issn-type":[{"value":"1748-7188","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,7,1]]},"assertion":[{"value":"15 October 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 June 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 July 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"13"}}