{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,6]],"date-time":"2026-03-06T23:03:24Z","timestamp":1772838204672,"version":"3.50.1"},"reference-count":25,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,2,12]],"date-time":"2021-02-12T00:00:00Z","timestamp":1613088000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2021,2,12]],"date-time":"2021-02-12T00:00:00Z","timestamp":1613088000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>The search for statistically significant relationships between molecular markers and outcomes is challenging when dealing with high-dimensional, noisy and collinear multivariate omics data, such as metabolomic profiles. Permutation procedures allow for the estimation of adjusted significance levels without assuming independence among metabolomic variables. Nevertheless, the complex non-normal structure of metabolic profiles and outcomes may bias the permutation results leading to overly conservative threshold estimates i.e. lower than those from a Bonferroni or Sidak correction.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>Within a univariate permutation procedure we employ parametric simulation methods based on the multivariate (log-)Normal distribution to obtain adjusted significance levels which are consistent across different outcomes while effectively controlling the type I error rate. Next, we derive an alternative closed-form expression for the estimation of the number of non-redundant metabolic variates based on the spectral decomposition of their correlation matrix. The performance of the method is tested for different model parametrizations and across a wide range of correlation levels of the variates using synthetic and real data sets.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Both the permutation-based formulation and the more practical closed form expression are found to give an effective indication of the number of independent metabolic effects exhibited by the system, while guaranteeing that the derived adjusted threshold is stable across outcome measures with diverse properties.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12859-021-03975-2","type":"journal-article","created":{"date-parts":[[2021,2,12]],"date-time":"2021-02-12T10:30:41Z","timestamp":1613125841000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":30,"title":["Multiple-testing correction in metabolome-wide association studies"],"prefix":"10.1186","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2895-0406","authenticated-orcid":false,"given":"Alina","family":"Peluso","sequence":"first","affiliation":[]},{"given":"Robert","family":"Glen","sequence":"additional","affiliation":[]},{"given":"Timothy M. D.","family":"Ebbels","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,2,12]]},"reference":[{"issue":"7193","key":"3975_CR1","doi-asserted-by":"publisher","first-page":"396","DOI":"10.1038\/nature06882","volume":"453","author":"E Holmes","year":"2008","unstructured":"Holmes E, Loo RL, Stamler J, Bictash M, Yap IK, Chan Q, Ebbels T, De Iorio M, Brown IJ, Veselkov KA, et al. Human metabolic phenotype diversity and its association with diet and blood pressure. Nature. 2008;453(7193):396.","journal-title":"Nature"},{"issue":"1","key":"3975_CR2","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","volume":"57","author":"Y Benjamini","year":"1995","unstructured":"Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc: Ser B (Methodol). 1995;57(1):289\u2013300.","journal-title":"J R Stat Soc: Ser B (Methodol)"},{"issue":"4","key":"3975_CR3","doi-asserted-by":"publisher","first-page":"1165","DOI":"10.1214\/aos\/1013699998","volume":"29","author":"Y Benjamini","year":"2001","unstructured":"Benjamini Y, Yekutieli D, et al. The control of the false discovery rate in multiple testing under dependency. Ann Stat. 2001;29(4):1165\u201388.","journal-title":"Ann Stat"},{"key":"3975_CR4","first-page":"3","volume":"8","author":"C Bonferroni","year":"1936","unstructured":"Bonferroni C. Teoria statistica delle classi e calcolo delle probabilit\u00e0. Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commericiali di Firenze. 1936;8:3\u201362.","journal-title":"Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commericiali di Firenze"},{"issue":"318","key":"3975_CR5","first-page":"626","volume":"62","author":"Z \u0160id\u00e1k","year":"1967","unstructured":"\u0160id\u00e1k Z. Rectangular confidence regions for the means of multivariate normal distributions. J Am Stat Assoc. 1967;62(318):626\u201333.","journal-title":"J Am Stat Assoc"},{"issue":"1","key":"3975_CR6","doi-asserted-by":"publisher","first-page":"52","DOI":"10.1046\/j.1365-2540.2001.00901.x","volume":"87","author":"JM Cheverud","year":"2001","unstructured":"Cheverud JM. A simple correction for multiple comparisons in interval mapping genome scans. Heredity. 2001;87(1):52.","journal-title":"Heredity"},{"issue":"4","key":"3975_CR7","doi-asserted-by":"publisher","first-page":"765","DOI":"10.1086\/383251","volume":"74","author":"DR Nyholt","year":"2004","unstructured":"Nyholt DR. A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other. Am J Human Genet. 2004;74(4):765\u20139.","journal-title":"Am J Human Genet"},{"issue":"3","key":"3975_CR8","doi-asserted-by":"publisher","first-page":"221","DOI":"10.1038\/sj.hdy.6800717","volume":"95","author":"J Li","year":"2005","unstructured":"Li J, Ji L. Adjusting multiple testing in multilocus analyses using the eigenvalues of a correlation matrix. Heredity. 2005;95(3):221.","journal-title":"Heredity"},{"issue":"4","key":"3975_CR9","doi-asserted-by":"publisher","first-page":"361","DOI":"10.1002\/gepi.20310","volume":"32","author":"X Gao","year":"2008","unstructured":"Gao X, Starmer J, Martin ER. A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms. Genet Epidemiol. 2008;32(4):361\u20139.","journal-title":"Genet Epidemiol"},{"issue":"7","key":"3975_CR10","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1002\/gepi.20408","volume":"33","author":"NW Galwey","year":"2009","unstructured":"Galwey NW. A new measure of the effective number of tests, a practical tool for comparing families of non-independent significance tests. Genetic Epidemiol. 2009;33(7):559\u201368.","journal-title":"Genetic Epidemiol"},{"issue":"2","key":"3975_CR11","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1111\/1467-9892.00133","volume":"20","author":"E Paparoditis","year":"1999","unstructured":"Paparoditis E, Politis DN. The local bootstrap for periodogram statistics. J Time Ser Anal. 1999;20(2):193\u2013222.","journal-title":"J Time Ser Anal"},{"issue":"2","key":"3975_CR12","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1002\/gepi.20292","volume":"32","author":"CJ Hoggart","year":"2008","unstructured":"Hoggart CJ, Clark TG, De Iorio M, Whittaker JC, Balding DJ. Genome-wide significance for dense snp and resequencing data. Genetic Epidemiol. 2008;32(2):179\u201385.","journal-title":"Genetic Epidemiol"},{"issue":"9","key":"3975_CR13","doi-asserted-by":"publisher","first-page":"4620","DOI":"10.1021\/pr1003449","volume":"9","author":"M Chadeau-Hyam","year":"2010","unstructured":"Chadeau-Hyam M, Ebbels TM, Brown IJ, Chan Q, Stamler J, Huang CC, Daviglus ML, Ueshima H, Zhao L, Holmes E, et al. Metabolic profiling and the metabolome-wide association study: significance level for biomarker identification. J Proteome Res. 2010;9(9):4620\u20137.","journal-title":"J Proteome Res"},{"issue":"10","key":"3975_CR14","doi-asserted-by":"publisher","first-page":"3623","DOI":"10.1021\/acs.jproteome.7b00344","volume":"16","author":"R Castagn\u00e9","year":"2017","unstructured":"Castagn\u00e9 R, Boulang\u00e9 CL, Karaman I, Campanella G, Santos Ferreira DL, Kaluarachchi MR, Lehne B, Moayyeri A, Lewis MR, Spagou K, et al. Improving visualization and interpretation of metabolome-wide association studies: An application in a population-based cohort using untargeted 1H-NMR metabolic profiling. J Proteome Res. 2017;16(10):3623\u201333.","journal-title":"J Proteome Res"},{"issue":"1\u20132","key":"3975_CR15","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1093\/biomet\/49.1-2.93","volume":"49","author":"GE Box","year":"1962","unstructured":"Box GE, Watson GS. Robustness to non-normality of regression tests. Biometrika. 1962;49(1\u20132):93\u2013106.","journal-title":"Biometrika"},{"key":"3975_CR16","doi-asserted-by":"crossref","unstructured":"Sch\u00e4fer J, Strimmer K. A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics. Statistical applications in genetics and molecular biology. 2005;4(1).","DOI":"10.2202\/1544-6115.1175"},{"issue":"5","key":"3975_CR17","doi-asserted-by":"publisher","first-page":"895","DOI":"10.1111\/j.1558-5646.1983.tb05619.x","volume":"37","author":"JM Cheverud","year":"1983","unstructured":"Cheverud JM, Rutledge J, Atchley WR. Quantitative genetics of development: genetic correlations among age-specific trait values and the evolution of ontogeny. Evolution. 1983;37(5):895\u2013905.","journal-title":"Evolution"},{"issue":"1","key":"3975_CR18","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1177\/001316448104100102","volume":"41","author":"S Friedman","year":"1981","unstructured":"Friedman S, Weisberg HF. Interpreting the first eigenvalue of a correlation matrix. Educ Psychol Measur. 1981;41(1):11\u201321.","journal-title":"Educ Psychol Measur"},{"issue":"9","key":"3975_CR19","doi-asserted-by":"publisher","first-page":"871","DOI":"10.1093\/aje\/kwf113","volume":"156","author":"DE Bild","year":"2002","unstructured":"Bild DE, Bluemke DA, Burke GL, Detrano R, Diez Roux AV, Folsom AR, Greenland P, JacobsJr DR, Kronmal R, Liu K, et al. Multi-ethnic study of atherosclerosis: objectives and design. Am J Epidemiol. 2002;156(9):871\u201381.","journal-title":"Am J Epidemiol"},{"issue":"12","key":"3975_CR20","doi-asserted-by":"publisher","first-page":"4188","DOI":"10.1021\/acs.jproteome.6b00125","volume":"15","author":"I Karaman","year":"2016","unstructured":"Karaman I, Ferreira DL, Boulang\u00e9 CL, Kaluarachchi MR, Herrington D, Dona AC, Castagn\u00e9 R, Moayyeri A, Lehne B, Loh M, et al. Workflow for integrated processing of multicohort untargeted 1h nmr metabolomics data in large-scale metabolic epidemiology. J Proteome Res. 2016;15(12):4188\u201394.","journal-title":"J Proteome Res"},{"issue":"11","key":"3975_CR21","doi-asserted-by":"publisher","first-page":"1713","DOI":"10.1002\/sim.2059","volume":"24","author":"R Bender","year":"2005","unstructured":"Bender R, Augustin T, Blettner M. Generating survival times to simulate cox proportional hazards models. Stat Med. 2005;24(11):1713\u201323.","journal-title":"Stat Med"},{"key":"3975_CR22","doi-asserted-by":"crossref","unstructured":"Hastings WK. Monte carlo sampling methods using markov chains and their applications; 1970.","DOI":"10.1093\/biomet\/57.1.97"},{"issue":"3","key":"3975_CR23","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1093\/imanum\/22.3.329","volume":"22","author":"NJ Higham","year":"2002","unstructured":"Higham NJ. Computing the nearest correlation matrix\u2013a problem from finance. IMA J Numer Anal. 2002;22(3):329\u201343.","journal-title":"IMA J Numer Anal"},{"key":"3975_CR24","doi-asserted-by":"publisher","DOI":"10.1088\/0957-0233\/12\/10\/708","volume-title":"Multivariate analysis of quality. An introduction","author":"H Martens","year":"2001","unstructured":"Martens H, Martens M. Multivariate analysis of quality. An introduction. Bristol: IOP Publishing; 2001."},{"key":"3975_CR25","unstructured":"Horizon2020 EC. PhenoMeNal (Phenome and Metabolome aNalysis): Large-scale Computing for Medical Metabolomics (2015-2018). https:\/\/phenomenal-h2020.eu\/."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-021-03975-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12859-021-03975-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-021-03975-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,23]],"date-time":"2024-08-23T23:31:14Z","timestamp":1724455874000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-021-03975-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,12]]},"references-count":25,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["3975"],"URL":"https:\/\/doi.org\/10.1186\/s12859-021-03975-2","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-92732\/v1","asserted-by":"object"},{"id-type":"doi","id":"10.1101\/478370","asserted-by":"object"}]},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,2,12]]},"assertion":[{"value":"12 October 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 January 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 February 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Timothy M D Ebbels is a member of the editorial board. The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"67"}}