{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,31]],"date-time":"2025-12-31T11:41:55Z","timestamp":1767181315139,"version":"build-2238731810"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1010451","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,9,29]],"date-time":"2022-09-29T00:00:00Z","timestamp":1664409600000}}],"reference-count":38,"publisher":"Public Library of Science (PLoS)","issue":"9","license":[{"start":{"date-parts":[[2022,9,19]],"date-time":"2022-09-19T00:00:00Z","timestamp":1663545600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000054","name":"national cancer institute","doi-asserted-by":"publisher","award":["1U01CA261277-01"],"award-info":[{"award-number":["1U01CA261277-01"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Science Foundation","award":["SMA-2226343"],"award-info":[{"award-number":["SMA-2226343"]}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>\n                    Counting the number of species, items, or genes that are shared between two groups, sets, or communities is a simple calculation when sampling is complete. However, when only partial samples are available, quantifying the overlap between two communities becomes an estimation problem. Furthermore, to calculate normalized measures of\n                    <jats:italic>\u03b2<\/jats:italic>\n                    -diversity, such as the Jaccard and Sorenson-Dice indices, one must also estimate the total sizes of the communities being compared. Previous efforts to address these problems have assumed knowledge of total community sizes and then used Bayesian methods to produce unbiased estimates with quantified uncertainty. Here, we address communities of unknown size and show that this produces systematically better estimates\u2014both in terms of central estimates and quantification of uncertainty in those estimates. We further show how to use species, item, or gene count data to refine estimates of community size in a Bayesian joint model of community size and overlap.\n                  <\/jats:p>","DOI":"10.1371\/journal.pcbi.1010451","type":"journal-article","created":{"date-parts":[[2022,9,19]],"date-time":"2022-09-19T13:43:58Z","timestamp":1663595038000},"page":"e1010451","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":8,"title":["Bayesian estimation of community size and overlap from random subsamples"],"prefix":"10.1371","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4041-7866","authenticated-orcid":true,"given":"Erik K.","family":"Johnson","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5273-5234","authenticated-orcid":true,"given":"Daniel B.","family":"Larremore","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,9,19]]},"reference":[{"issue":"3","key":"pcbi.1010451.ref001","doi-asserted-by":"crossref","first-page":"279","DOI":"10.2307\/1943563","article-title":"Vegetation of the Siskiyou mountains, Oregon and California","volume":"30","author":"RH Whittaker","year":"1960","journal-title":"Ecological Monographs"},{"key":"pcbi.1010451.ref002","first-page":"547","article-title":"\u00c9tude comparative de la distribution florale dans une portion des Alpes et des Jura","volume":"37","author":"P Jaccard","year":"1901","journal-title":"Bull Soc Vaudoise Sci Nat"},{"issue":"3","key":"pcbi.1010451.ref003","doi-asserted-by":"crossref","first-page":"297","DOI":"10.2307\/1932409","article-title":"Measures of the amount of ecologic association between species","volume":"26","author":"LR Dice","year":"1945","journal-title":"Ecology"},{"key":"pcbi.1010451.ref004","first-page":"1","article-title":"A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyses of the vegetation on Danish commons","volume":"5","author":"TA Sorensen","year":"1948","journal-title":"Biol Skar"},{"issue":"3","key":"pcbi.1010451.ref005","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1046\/j.1365-2656.2003.00710.x","article-title":"Measuring beta diversity for presence\u2013absence data","volume":"72","author":"P Koleff","year":"2003","journal-title":"Journal of Animal Ecology"},{"issue":"2","key":"pcbi.1010451.ref006","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1111\/j.1461-0248.2004.00707.x","article-title":"A new statistical approach for assessing similarity of species composition with incidence and abundance data","volume":"8","author":"A Chao","year":"2005","journal-title":"Ecology Letters"},{"issue":"3","key":"pcbi.1010451.ref007","doi-asserted-by":"crossref","first-page":"e1006898","DOI":"10.1371\/journal.pcbi.1006898","article-title":"Bayes-optimal estimation of overlap between populations of fixed size","volume":"15","author":"DB Larremore","year":"2019","journal-title":"PLOS Computational Biology"},{"issue":"4","key":"pcbi.1010451.ref008","first-page":"326","article-title":"An ordination of the upland forest communities of southern Wisconsin","volume":"27","author":"JR Bray","year":"1957","journal-title":"Ecological monographs"},{"key":"pcbi.1010451.ref009","doi-asserted-by":"crossref","first-page":"42","DOI":"10.2307\/1411","article-title":"The relation between the number of species and the number of individuals in a random sample of an animal population","author":"RA Fisher","year":"1943","journal-title":"The Journal of Animal Ecology"},{"issue":"26","key":"pcbi.1010451.ref010","doi-asserted-by":"crossref","first-page":"E1782","DOI":"10.1073\/pnas.1120534109","article-title":"A restricted subset of var genes mediates adherence of Plasmodium falciparum-infected erythrocytes to brain endothelial cells","volume":"109","author":"M Avril","year":"2012","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"26","key":"pcbi.1010451.ref011","doi-asserted-by":"crossref","first-page":"E1772","DOI":"10.1073\/pnas.1120461109","article-title":"A subset of group A-like var genes encodes the malaria parasite ligands for binding to human brain endothelial cells","volume":"109","author":"A Claessens","year":"2012","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"3","key":"pcbi.1010451.ref012","doi-asserted-by":"crossref","first-page":"e14741","DOI":"10.1371\/journal.pone.0014741","article-title":"Specific receptor usage in Plasmodium falciparum cytoadherence is associated with disease outcome","volume":"6","author":"LB Ochola","year":"2011","journal-title":"PLOS One"},{"issue":"129","key":"pcbi.1010451.ref013","doi-asserted-by":"crossref","first-page":"129ra45","DOI":"10.1126\/scitranslmed.3003247","article-title":"Prognostic indicators of life-threatening malaria are associated with distinct parasite variant antigen profiles","volume":"4","author":"GM Warimwe","year":"2012","journal-title":"Science Translational Medicine"},{"issue":"26","key":"pcbi.1010451.ref014","doi-asserted-by":"crossref","first-page":"E1791","DOI":"10.1073\/pnas.1120455109","article-title":"Plasmodium falciparum erythrocyte membrane protein 1 domain cassettes 8 and 13 are associated with severe malaria in children","volume":"109","author":"T Lavstsen","year":"2012","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"6906","key":"pcbi.1010451.ref015","doi-asserted-by":"crossref","first-page":"498","DOI":"10.1038\/nature01097","article-title":"Genome sequence of the human malaria parasite Plasmodium falciparum","volume":"419","author":"MJ Gardner","year":"2002","journal-title":"Nature"},{"issue":"12","key":"pcbi.1010451.ref016","doi-asserted-by":"crossref","first-page":"e1004812","DOI":"10.1371\/journal.pgen.1004812","article-title":"Generation of antigenic diversity in Plasmodium falciparum by structured rearrangement of Var genes during mitosis","volume":"10","author":"A Claessens","year":"2014","journal-title":"PLoS genetics"},{"issue":"5","key":"pcbi.1010451.ref017","doi-asserted-by":"crossref","first-page":"e3000271","DOI":"10.1371\/journal.pbio.3000271","article-title":"Rapid antigen diversification through mitotic recombination in the human malaria parasite Plasmodium falciparum","volume":"17","author":"X Zhang","year":"2019","journal-title":"PLoS biology"},{"key":"pcbi.1010451.ref018","article-title":"Evolutionary analysis of the most polymorphic gene family in falciparum malaria","volume":"4","author":"TD Otto","year":"2019","journal-title":"Wellcome Open Research"},{"issue":"9","key":"pcbi.1010451.ref019","doi-asserted-by":"crossref","first-page":"e1000933","DOI":"10.1371\/journal.pcbi.1000933","article-title":"Plasmodium falciparum erythrocyte membrane protein 1 diversity in seven genomes\u2013divide and conquer","volume":"6","author":"TS Rask","year":"2010","journal-title":"PLoS computational biology"},{"issue":"3","key":"pcbi.1010451.ref020","doi-asserted-by":"crossref","first-page":"e34","DOI":"10.1371\/journal.ppat.0030034","article-title":"Population genomics of the immune evasion (var) genes of Plasmodium falciparum","volume":"3","author":"AE Barry","year":"2007","journal-title":"PLOS Pathogens"},{"issue":"1-2","key":"pcbi.1010451.ref021","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1016\/j.gene.2010.01.001","article-title":"The South American Plasmodium falciparum var gene repertoire is limited, highly shared and possibly lacks several antigenic types","volume":"453","author":"L Albrecht","year":"2010","journal-title":"Gene"},{"issue":"2","key":"pcbi.1010451.ref022","doi-asserted-by":"crossref","first-page":"e16629","DOI":"10.1371\/journal.pone.0016629","article-title":"A molecular epidemiological study of var gene diversity to characterize the reservoir of Plasmodium falciparum in humans in Africa","volume":"6","author":"DS Chen","year":"2011","journal-title":"PLOS One"},{"issue":"1","key":"pcbi.1010451.ref023","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1128\/IAI.01979-14","article-title":"Immune characterization of Plasmodium falciparum parasites with a shared genetic signature in a region of decreasing transmission","volume":"83","author":"AK Bei","year":"2015","journal-title":"Infection and Immunity"},{"key":"pcbi.1010451.ref024","article-title":"Plasmodium falciparum population genetic complexity influences transcriptional profile and immune recognition of highly related genotypic clusters","author":"AK Bei","year":"2020","journal-title":"bioRxiv"},{"issue":"2","key":"pcbi.1010451.ref025","doi-asserted-by":"crossref","first-page":"484","DOI":"10.1111\/mec.13033","article-title":"Phylogeography of var gene repertoires reveals fine-scale geospatial clustering of Plasmodium falciparum populations in a highly endemic area","volume":"24","author":"SK Tessema","year":"2015","journal-title":"Molecular Ecology"},{"issue":"20","key":"pcbi.1010451.ref026","first-page":"E4103","article-title":"Evidence of strain structure in Plasmodium falciparum var gene repertoires in children from Gabon, West Africa","volume":"114","author":"KP Day","year":"2017","journal-title":"Proceedings of the National Academy of Sciences"},{"key":"pcbi.1010451.ref027","doi-asserted-by":"crossref","unstructured":"Childs L, Larremore D. In: Network Models for Malaria: Antigens, Dynamics, and Evolution Over Space and Time; 2020. p. 277\u2013294.","DOI":"10.1016\/B978-0-12-801238-3.11512-0"},{"issue":"1656","key":"pcbi.1010451.ref028","doi-asserted-by":"crossref","first-page":"477","DOI":"10.1098\/rspb.2008.1122","article-title":"Inferring malaria parasite population structure from serological networks","volume":"276","author":"CO Buckee","year":"2009","journal-title":"Proceedings of the Royal Society B: Biological Sciences"},{"issue":"1","key":"pcbi.1010451.ref029","first-page":"1","article-title":"Networks of genetic similarity reveal non-neutral processes shape strain structure in Plasmodium falciparum","volume":"9","author":"Q He","year":"2018","journal-title":"Nature Communications"},{"issue":"6","key":"pcbi.1010451.ref030","doi-asserted-by":"crossref","first-page":"e3000336","DOI":"10.1371\/journal.pbio.3000336","article-title":"Competition for hosts modulates vast antigenic diversity to generate persistent strain structure in Plasmodium falciparum","volume":"17","author":"S Pilosof","year":"2019","journal-title":"PLoS biology"},{"issue":"2","key":"pcbi.1010451.ref031","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1016\/S0166-6851(00)00286-3","article-title":"Var gene diversity in Plasmodium falciparum is generated by frequent recombination events","volume":"110","author":"HM Taylor","year":"2000","journal-title":"Molecular and Biochemical Parasitology"},{"issue":"3","key":"pcbi.1010451.ref032","doi-asserted-by":"crossref","first-page":"e26","DOI":"10.1371\/journal.ppat.0010026","article-title":"Plasmodium falciparum variant surface antigen expression patterns during malaria","volume":"1","author":"PC Bull","year":"2005","journal-title":"PLOS Pathogens"},{"issue":"1","key":"pcbi.1010451.ref033","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1016\/j.molbiopara.2007.03.011","article-title":"An approach to classifying sequence tags sampled from Plasmodium falciparum var genes","volume":"154","author":"PC Bull","year":"2007","journal-title":"Molecular and Biochemical Parasitology"},{"issue":"40","key":"pcbi.1010451.ref034","doi-asserted-by":"crossref","first-page":"15835","DOI":"10.1073\/pnas.0610485104","article-title":"PfEMP1-DBL1\u03b1 amino acid motifs in severe disease states of Plasmodium falciparum malaria","volume":"104","author":"J Normark","year":"2007","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"51","key":"pcbi.1010451.ref035","doi-asserted-by":"crossref","first-page":"21801","DOI":"10.1073\/pnas.0907590106","article-title":"Plasmodium falciparum var gene expression is modified by host immunity","volume":"106","author":"GM Warimwe","year":"2009","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"1","key":"pcbi.1010451.ref036","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/ncomms9368","article-title":"Ape parasite origins of human malaria virulence genes","volume":"6","author":"DB Larremore","year":"2015","journal-title":"Nature communications"},{"issue":"1","key":"pcbi.1010451.ref037","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2164-8-45","article-title":"Patterns of gene recombination shape var gene repertoires in Plasmodium falciparum: comparisons of geographically diverse isolates","volume":"8","author":"SM Kraemer","year":"2007","journal-title":"BMC genomics"},{"key":"pcbi.1010451.ref038","article-title":"Clinical malaria incidence following an outbreak in Ecuador was predominantly associated with Plasmodium falciparum with recombinant variant antigen gene repertoires","author":"S Ruybal-Pes\u00e1ntez","year":"2021","journal-title":"medRxiv"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1010451","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,9,29]],"date-time":"2022-09-29T00:00:00Z","timestamp":1664409600000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010451","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,29]],"date-time":"2022-09-29T15:37:30Z","timestamp":1664465850000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010451"}},"subtitle":[],"editor":[{"given":"Jacopo","family":"Grilli","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,9,19]]},"references-count":38,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2022,9,19]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1010451","relation":{},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,9,19]]}}}