{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,29]],"date-time":"2026-03-29T09:10:47Z","timestamp":1774775447188,"version":"3.50.1"},"reference-count":32,"publisher":"Oxford University Press (OUP)","issue":"18","license":[{"start":{"date-parts":[[2016,10,12]],"date-time":"2016-10-12T00:00:00Z","timestamp":1476230400000},"content-version":"vor","delay-in-days":125,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Motivation: A series of methods in population genetics use multilocus genotype data to assign individuals membership in latent clusters. These methods belong to a broad class of mixed-membership models, such as latent Dirichlet allocation used to analyze text corpora. Inference from mixed-membership models can produce different output matrices when repeatedly applied to the same inputs, and the number of latent clusters is a parameter that is often varied in the analysis pipeline. For these reasons, quantifying, visualizing, and annotating the output from mixed-membership models are bottlenecks for investigators across multiple disciplines from ecology to text data mining.<\/jats:p>\n                  <jats:p>Results: We introduce pong, a network-graphical approach for analyzing and visualizing membership in latent clusters with a native interactive D3.js visualization. pong leverages efficient algorithms for solving the Assignment Problem to dramatically reduce runtime while increasing accuracy compared with other methods that process output from mixed-membership models. 
We apply pong to 225 705 unlinked genome-wide single-nucleotide variants from 2426 unrelated individuals in the 1000 Genomes Project, and identify previously overlooked aspects of global human population structure. We show that pong outpaces current solutions by more than an order of magnitude in runtime while providing a customizable and interactive visualization of population structure that is more accurate than those produced by current tools.<\/jats:p>\n                  <jats:p>Availability and Implementation: pong is freely available and can be installed using the Python package management system pip. pong\u2019s source code is available at https:\/\/github.com\/abehr\/pong.<\/jats:p>\n                  <jats:p>Contact: \u00a0aaron_behr@alumni.brown.edu or sramachandran@brown.edu<\/jats:p>\n                  <jats:p>Supplementary Information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btw327","type":"journal-article","created":{"date-parts":[[2016,6,9]],"date-time":"2016-06-09T20:56:16Z","timestamp":1465505776000},"page":"2817-2823","source":"Crossref","is-referenced-by-count":389,"title":["pong: fast analysis and visualization of latent clusters in population genetic data"],"prefix":"10.1093","volume":"32","author":[{"given":"Aaron A.","family":"Behr","sequence":"first","affiliation":[{"name":"1 Department of Ecology and Evolutionary Biology"},{"name":"2 Department of Computer Science, Brown University, Providence, RI, USA"}]},{"given":"Katherine Z.","family":"Liu","sequence":"additional","affiliation":[{"name":"2 Department of Computer Science, Brown University, Providence, RI, USA"}]},{"given":"Gracie","family":"Liu-Fang","sequence":"additional","affiliation":[{"name":"3 Computer Science Department, Wellesley College, Wellesley, MA, USA"}]},{"given":"Priyanka","family":"Nakka","sequence":"additional","affiliation":[{"name":"1 Department of Ecology and Evolutionary Biology"},{"name":"4 Center for 
Computational Molecular Biology, Brown University, Providence, RI, USA"}]},{"given":"Sohini","family":"Ramachandran","sequence":"additional","affiliation":[{"name":"1 Department of Ecology and Evolutionary Biology"},{"name":"4 Center for Computational Molecular Biology, Brown University, Providence, RI, USA"}]}],"member":"286","published-online":{"date-parts":[[2016,6,9]]},"reference":[{"key":"2023020113390166600_btw327-B1","doi-asserted-by":"crossref","first-page":"1655","DOI":"10.1101\/gr.094052.109","article-title":"Fast model-based estimation of ancestry in unrelated individuals","volume":"19","author":"Alexander","year":"2009","journal-title":"Genome Res"},{"key":"2023020113390166600_btw327-B2","first-page":"993","article-title":"Latent Dirichlet Allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res"},{"key":"2023020113390166600_btw327-B3","doi-asserted-by":"crossref","first-page":"786","DOI":"10.1073\/pnas.0909559107","article-title":"Genome-wide patterns of population structure and admixture in West Africans and African Americans","volume":"107","author":"Bryc","year":"2010","journal-title":"Proc. Natl. Acad. Sci. 
USA"},{"key":"2023020113390166600_btw327-B4","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","article-title":"A global reference for human genetic variation","volume":"526","author":"Consortium, 1000 Genomes Project","year":"2015","journal-title":"Nature"},{"key":"2023020113390166600_btw327-B5","doi-asserted-by":"crossref","first-page":"1567","DOI":"10.1093\/genetics\/164.4.1567","article-title":"Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies","volume":"164","author":"Falush","year":"2003","journal-title":"Genetics"},{"key":"2023020113390166600_btw327-B6","doi-asserted-by":"crossref","first-page":"e1002554.","DOI":"10.1371\/journal.pgen.1002554","article-title":"Development of a panel of genome-wide ancestry informative markers to study admixture throughout the americas","volume":"8","author":"Galanter","year":"2012","journal-title":"PLoS Genet"},{"key":"2023020113390166600_btw327-B7","doi-asserted-by":"crossref","first-page":"e43129.","DOI":"10.1371\/journal.pone.0043129","article-title":"Three decades of farmed escapees in the wild: a spatio-temporal analysis of atlantic salmon population genetic structure throughout Norway","volume":"7","author":"Glover","year":"2012","journal-title":"PLoS One"},{"key":"2023020113390166600_btw327-B8","first-page":"1","article-title":"Great-ape Y-Chromosome and mitochondrial DNA phylogenies reflect sub-species structure and patterns of mating and dispersal","volume":"44","author":"Hallast","year":"2016","journal-title":"Genome Res"},{"key":"2023020113390166600_btw327-B9","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pgen.1005602","article-title":"Genomic insights into the ancestry and demographic history of South America","volume":"11","author":"Homburger","year":"2015","journal-title":"PLoS 
Genet"},{"key":"2023020113390166600_btw327-B10","doi-asserted-by":"crossref","first-page":"1322","DOI":"10.1111\/j.1755-0998.2009.02591.x","article-title":"Inferring weak population structure with the assistance of sample group information","volume":"9","author":"Hubisz","year":"2009","journal-title":"Mol. Ecol. Resour"},{"key":"2023020113390166600_btw327-B11","doi-asserted-by":"crossref","first-page":"55","DOI":"10.4137\/EBO.S6761","article-title":"Structurama: Bayesian inference of population structure","volume":"7","author":"Huelsenbeck","year":"2011","journal-title":"Evol. Bioinform"},{"key":"2023020113390166600_btw327-B12","doi-asserted-by":"crossref","first-page":"1801","DOI":"10.1093\/bioinformatics\/btm233","article-title":"CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure","volume":"23","author":"Jakobsson","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020113390166600_btw327-B13","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1214\/088342305000000016","article-title":"Markov Chain Monte Carlo methods and the label switching problem in Bayesian Mixture Modeling","volume":"20","author":"Jasra","year":"2005","journal-title":"Stat. Sci"},{"key":"2023020113390166600_btw327-B14","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1038\/ng.3215","article-title":"The genomic and phenotypic diversity of Schizosaccharomyces pombe","volume":"47","author":"Jeffares","year":"2015","journal-title":"Nat. Genet"},{"key":"2023020113390166600_btw327-B15","doi-asserted-by":"crossref","first-page":"1179","DOI":"10.1111\/1755-0998.12387","article-title":"CLUMPAK: a program for identifying clustering modes and packaging population structure inferences across K","volume":"15","author":"Kopelman","year":"2015","journal-title":"Mol. Ecol. 
Resour"},{"key":"2023020113390166600_btw327-B16","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1002\/nav.3800020109","article-title":"The Hungarian Method for the assignment problem","volume":"2","author":"Kuhn","year":"1955","journal-title":"Naval Res. Logist. Quart"},{"key":"2023020113390166600_btw327-B17","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1002\/nav.3800030404","article-title":"Variants of the Hungarian method for assignment problems","volume":"3","author":"Kuhn","year":"1956","journal-title":"Naval Res. Logist. Quart"},{"key":"2023020113390166600_btw327-B18","doi-asserted-by":"crossref","first-page":"10147.","DOI":"10.1038\/ncomms10147","article-title":"Local admixture of amplified and diversified secreted pathogenesis determinants shapes mosaic Toxoplasma gondii genomes","volume":"7","author":"Lorenzi","year":"2016","journal-title":"Nat. Commun"},{"key":"2023020113390166600_btw327-B19","volume-title":"Introduction to Algorithms: A Creative Approach","author":"Manber","year":"1989"},{"key":"2023020113390166600_btw327-B20","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1038\/nature16152","article-title":"Genome-wide patterns of selection in 230 ancient Eurasians","volume":"528","author":"Mathieson","year":"2015","journal-title":"Nature"},{"key":"2023020113390166600_btw327-B21","doi-asserted-by":"crossref","first-page":"e95656.","DOI":"10.1371\/journal.pone.0095656","article-title":"Genetic and ecotypic differentiation in a Californian plant polyploid complex (Grindelia, Asteraceae)","volume":"9","author":"Moore","year":"2014","journal-title":"PLoS One"},{"key":"2023020113390166600_btw327-B22","doi-asserted-by":"crossref","first-page":"e1003925.","DOI":"10.1371\/journal.pgen.1003925","article-title":"Reconstructing the population genetic history of the Caribbean","volume":"9","author":"Moreno-Estrada","year":"2013","journal-title":"PLoS 
Genet"},{"key":"2023020113390166600_btw327-B23","doi-asserted-by":"crossref","first-page":"1280","DOI":"10.1126\/science.1251688","article-title":"The genetics of Mexico recapitulates Native American substructure and affects biomedical traits","volume":"344","author":"Moreno-Estrada","year":"2014","journal-title":"Science (New York, N.Y.)"},{"key":"2023020113390166600_btw327-B24","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1137\/0105003","article-title":"Algorithms for the assignment and transportation problems","volume":"5","author":"Munkres","year":"1957","journal-title":"J. Soc. Ind. Appl. Math"},{"key":"2023020113390166600_btw327-B25","doi-asserted-by":"crossref","first-page":"809","DOI":"10.1534\/genetics.114.166264","article-title":"Variations on a common STRUCTURE: new algorithms for a valuable model","volume":"197","author":"Novembre","year":"2014","journal-title":"Genetics"},{"key":"2023020113390166600_btw327-B26","doi-asserted-by":"crossref","first-page":"e190.","DOI":"10.1371\/journal.pgen.0020190","article-title":"Population structure and eigenanalysis","volume":"2","author":"Patterson","year":"2006","journal-title":"PLoS Genet"},{"key":"2023020113390166600_btw327-B27","doi-asserted-by":"crossref","first-page":"904","DOI":"10.1038\/ng1847","article-title":"Principal components analysis corrects for stratification in genome-wide association studies","volume":"38","author":"Price","year":"2006","journal-title":"Nat Genet"},{"key":"2023020113390166600_btw327-B28","doi-asserted-by":"crossref","first-page":"945","DOI":"10.1093\/genetics\/155.2.945","article-title":"Inference of population structure using multilocus genotype data","volume":"155","author":"Pritchard","year":"2000","journal-title":"Genetics"},{"key":"2023020113390166600_btw327-B29","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1534\/genetics.114.164350","article-title":"fastSTRUCTURE: variational inference of population structure in large SNP data 
sets","volume":"197","author":"Raj","year":"2014","journal-title":"Genetics"},{"key":"2023020113390166600_btw327-B30","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1046\/j.1471-8286.2003.00566.x","article-title":"DISTRUCT: A program for the graphical display of population structure","volume":"4","author":"Rosenberg","year":"2004","journal-title":"Mol. Ecol. Notes"},{"key":"2023020113390166600_btw327-B31","doi-asserted-by":"crossref","first-page":"795","DOI":"10.1111\/1467-9868.00265","article-title":"Dealing with label switching in mixture models","volume":"62","author":"Stephens","year":"2000","journal-title":"J. R. Statist. Soc. Series B"},{"key":"2023020113390166600_btw327-B32","doi-asserted-by":"crossref","first-page":"e1004530.","DOI":"10.1371\/journal.pgen.1004530","article-title":"Patterns of admixture and population structure in Native populations of Northwest North America","volume":"10","author":"Verdu","year":"2014","journal-title":"PLoS Genet"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/18\/2817\/49020633\/bioinformatics_32_18_2817.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/18\/2817\/49020633\/bioinformatics_32_18_2817.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T18:46:08Z","timestamp":1675277168000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/18\/2817\/1744074"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,6,9]]},"references-count":32,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2016,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw327","relation":{"has-preprint":[{"id-type":"doi","id":"10
.1101\/031815","asserted-by":"object"}]},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2016,9,15]]},"published":{"date-parts":[[2016,6,9]]}}}