{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:27:27Z","timestamp":1760146047694,"version":"build-2065373602"},"reference-count":21,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2024,9,21]],"date-time":"2024-09-21T00:00:00Z","timestamp":1726876800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>Generating vast arrays of genetic markers for evolutionary ecology studies has become routine and cost-effective. However, analyzing data from large numbers of loci associated with a small number of finite chromosomes introduces a challenge: loci on the same chromosome do not assort independently, leading to pseudoreplication. Previous studies have demonstrated that pseudoreplication can substantially reduce precision of genetic analyses (and make confidence intervals wider), such as FST and linkage disequilibrium (LD) measures between pairs of loci. In LD analyses, another type of dependency (overlapping pairs of the same loci) also creates pseudoreplication. Building on previous work, we explore the potential of entropy metrics to improve the status quo, particularly total correlation (TC), to assess pseudoreplication in LD studies. Our simulations, performed on a monoecious population with a range of effective population sizes (Ne) and numbers of loci, attempted to isolate the overlapping-pairs-of-loci effect by considering unlinked loci and using entropy to quantify inter-locus relationships. We hypothesized a positive correlation between TC and the number of loci (L), and a negative correlation between TC and Ne. Results from our statistical models predicting TC demonstrate a strong effect of the number of loci, and muted effects of Ne and other predictors, adding support to the use of entropy-based metrics as a tool for estimating the statistical information of complex genetic datasets. Our results also highlight a challenge regarding scalability; computational limitations arise as the number of loci grows, making our current approach limited to smaller datasets. Despite these challenges, this work further refines our understanding of entropy measures, and offers insights into the complex dynamics of genetic information in evolutionary ecology research.<\/jats:p>","DOI":"10.3390\/e26090805","type":"journal-article","created":{"date-parts":[[2024,9,24]],"date-time":"2024-09-24T10:41:47Z","timestamp":1727174507000},"page":"805","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Potential Benefits and Challenges of Quantifying Pseudoreplication in Genomic Data with Entropy Statistics"],"prefix":"10.3390","volume":"26","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4359-0296","authenticated-orcid":false,"given":"Eric J.","family":"Ward","sequence":"first","affiliation":[{"name":"Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, 2725 Montlake Blvd. East, Seattle, WA 98112, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3362-7590","authenticated-orcid":false,"given":"Robin S.","family":"Waples","sequence":"additional","affiliation":[{"name":"School of Aquatic and Fishery Sciences, University of Washington, Seattle, WA 98195, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,9,21]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1038\/nrg.2017.109","article-title":"Runs of homozygosity: Windows into population history and trait architecture","volume":"19","author":"Ceballos","year":"2018","journal-title":"Nat. Rev. Genet."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1016\/j.tree.2015.10.009","article-title":"Genomics in Conservation: Case Studies and Bridging the Gap between Data and Application","volume":"31","author":"Garner","year":"2016","journal-title":"Trends Ecol. Evol."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1111\/j.1749-6632.2009.04444.x","article-title":"From Conservation Genetics to Conservation Genomics","volume":"1162","author":"Primmer","year":"2009","journal-title":"Ann. N. Y. Acad. Sci."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1901","DOI":"10.1093\/molbev\/msr011","article-title":"Chromosome Size in Diploid Eukaryotic Species Centers on the Average Length with a Conserved Boundary","volume":"28","author":"Li","year":"2011","journal-title":"Mol. Biol. Evol."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"803","DOI":"10.1038\/326803a0","article-title":"Mammalian chiasma frequencies as a test of two theories of recombination","volume":"326","author":"Burt","year":"1987","journal-title":"Nature"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1659","DOI":"10.1073\/pnas.1817482116","article-title":"A rigorous measure of genome-wide genetic shuffling that takes into account crossover positions and Mendel\u2019s second law","volume":"116","author":"Veller","year":"2019","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1086\/321275","article-title":"Linkage Disequilibrium in Humans: Models and Data","volume":"69","author":"Pritchard","year":"2001","journal-title":"Am. J. Hum. Genet."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1007\/BF01245622","article-title":"Linkage disequilibrium in finite populations","volume":"38","author":"Hill","year":"1968","journal-title":"Theor. Appl. Genet."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"378","DOI":"10.32614\/RJ-2017-066","article-title":"glmmTMB Balances Speed and Flexibility Among Packages for Zero-inflated Generalized Linear Mixed Modeling","volume":"9","author":"Brooks","year":"2017","journal-title":"R J."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"503","DOI":"10.1111\/1755-0998.13482","article-title":"Pseudoreplication in genomics-scale datasets","volume":"2","author":"Waples","year":"2022","journal-title":"Mol. Ecol. Resour."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Sherwin, W.B. (2018). Entropy, or Information, Unifies Ecology and Evolution and Beyond. Entropy, 20.","DOI":"10.3390\/e20100727"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"948","DOI":"10.1016\/j.tree.2017.09.012","article-title":"Information Theory Broadens the Spectrum of Molecular Ecology and Evolution","volume":"32","author":"Sherwin","year":"2017","journal-title":"Trends Ecol. Evol."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"584","DOI":"10.1109\/TSMC.2014.2331917","article-title":"Algorithmic Specified Complexity in the Game of Life","volume":"45","author":"Ewert","year":"2015","journal-title":"IEEE Trans. Syst. Man Cybern. Syst."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"8574","DOI":"10.1073\/pnas.0701744104","article-title":"Functional information and the emergence of biocomplexity","volume":"104","author":"Hazen","year":"2007","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1147\/rd.41.0066","article-title":"Information Theoretical Analysis of Multivariate Correlation","volume":"4","author":"Watanabe","year":"1960","journal-title":"IBM J. Res. Dev."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"324","DOI":"10.1016\/j.jmva.2003.10.003","article-title":"Estimation of the entropy of a multivariate normal distribution","volume":"92","author":"Misra","year":"2005","journal-title":"J. Multivar. Anal."},{"key":"ref_17","unstructured":"R Core Team (2023). R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"3335","DOI":"10.1111\/j.1365-294X.2005.02673.x","article-title":"Genetic estimates of contemporary effective population size: To what time periods do the estimates apply?","volume":"14","author":"Waples","year":"2005","journal-title":"Mol. Ecol."},{"key":"ref_19","unstructured":"Qiu, Y., and Mei, J. (2024, August 01). _RSpectra: Solvers for Large-Scale Eigenvalue and SVD Problems_. R package version 0.16-2. Available online: https:\/\/CRAN.R-project.org\/package=RSpectra."},{"key":"ref_20","unstructured":"Akaike, H. (1971, January 2\u20138). Information theory and an extension of the maximum likelihood principle. Proceedings of the 2nd International Symposium on Information Theory, Tsahkadsor, Armenia."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1226","DOI":"10.1109\/TPAMI.2005.159","article-title":"Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy","volume":"27","author":"Peng","year":"2005","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/26\/9\/805\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:01:38Z","timestamp":1760112098000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/26\/9\/805"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,21]]},"references-count":21,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2024,9]]}},"alternative-id":["e26090805"],"URL":"https:\/\/doi.org\/10.3390\/e26090805","relation":{},"ISSN":["1099-4300"],"issn-type":[{"type":"electronic","value":"1099-4300"}],"subject":[],"published":{"date-parts":[[2024,9,21]]}}}