{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:41Z","timestamp":1772138081482,"version":"3.50.1"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"14","license":[{"start":{"date-parts":[[2017,1,17]],"date-time":"2017-01-17T00:00:00Z","timestamp":1484611200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"funder":[{"name":"Danish Council of Independent Research Sapere Aude","award":["12-125062"],"award-info":[{"award-number":["12-125062"]}]},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Structure methods are highly used population genetic methods for classifying individuals in a sample fractionally into discrete ancestry components.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Contribution<\/jats:title>\n                    <jats:p>We introduce a new optimization algorithm for the classical STRUCTURE model in a maximum likelihood framework. Using analyses of real data we show that the new method finds solutions with higher likelihoods than the state-of-the-art method in the same computational time. The optimization algorithm is also applicable to models based on genotype likelihoods, that can account for the uncertainty in genotype-calling associated with Next Generation Sequencing (NGS) data. We also present a new method for estimating population trees from ancestry components using a Gaussian approximation. Using coalescence simulations of diverging populations, we explore the adequacy of the STRUCTURE-style models and the Gaussian assumption for identifying ancestry components correctly and for inferring the correct tree. In most cases, ancestry components are inferred correctly, although sample sizes and times since admixture can influence the results. We show that the popular Gaussian approximation tends to perform poorly under extreme divergence scenarios e.g. with very long branch lengths, but the topologies of the population trees are accurately inferred in all scenarios explored. The new methods are implemented together with appropriate visualization tools in the software package Ohana.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and Implementation<\/jats:title>\n                    <jats:p>Ohana is publicly available at https:\/\/github.com\/jade-cheng\/ohana. In addition to source code and installation instructions, we also provide example work-flows in the project wiki site.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx098","type":"journal-article","created":{"date-parts":[[2017,2,16]],"date-time":"2017-02-16T16:28:08Z","timestamp":1487262488000},"page":"2148-2155","source":"Crossref","is-referenced-by-count":59,"title":["Fast admixture analysis and population tree estimation for SNP and NGS\n          data"],"prefix":"10.1093","volume":"33","author":[{"given":"Jade Yu","family":"Cheng","sequence":"first","affiliation":[{"name":"Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark"},{"name":"Departments of Integrative Biology and Statistics, University of California, Berkeley, Berkeley, CA, USA"},{"name":"Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark"}]},{"given":"Thomas","family":"Mailund","sequence":"additional","affiliation":[{"name":"Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark"}]},{"given":"Rasmus","family":"Nielsen","sequence":"additional","affiliation":[{"name":"Departments of Integrative Biology and Statistics, University of California, Berkeley, Berkeley, CA, USA"},{"name":"Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark"}]}],"member":"286","published-online":{"date-parts":[[2017,1,17]]},"reference":[{"key":"2023051506360360900_btx098-B1","doi-asserted-by":"crossref","first-page":"1655","DOI":"10.1101\/gr.094052.109","article-title":"Fast model-based\n            estimation of ancestry in unrelated individuals","volume":"19","author":"Alexander","year":"2009","journal-title":"Genome\n            Res"},{"key":"2023051506360360900_btx098-B2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-12-246","article-title":"Enhancements to the ADMIXTURE algorithm for\n            individual ancestry estimation","volume":"12","author":"Alexander","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023051506360360900_btx098-B3","doi-asserted-by":"crossref","DOI":"10.1534\/genetics.116.189241","article-title":"Nationwide genomic study in Denmark\n            reveals remarkable population homogeneity","author":"Athanasiadis","year":"2016","journal-title":"Genetics"},{"key":"2023051506360360900_btx098-B4","doi-asserted-by":"crossref","first-page":"1084","DOI":"10.1086\/521987","article-title":"Rapid and accurate haplotype phasing and missing-data\n            inference for whole-genome association studies by use of localized haplotype\n            clustering","volume":"81","author":"Browning","year":"2007","journal-title":"Am. J. Hum. Genet"},{"key":"2023051506360360900_btx098-B5","first-page":"923","volume-title":"Cold Spring Harbor Symposia on Quantitative Biology","author":"Cavalli-Sforza","year":"1964"},{"key":"2023051506360360900_btx098-B6","first-page":"233","article-title":"Phylonative American population\n            historygenetic analysis. Models and estimation procedures","volume":"1","author":"Cavalli-Sforza","year":"1967","journal-title":"Am.\n            J. Hum. Genet"},{"key":"2023051506360360900_btx098-B7","first-page":"81","article-title":"Sur la r\u00e9solution\n            num\u00e9rique des syst\u00e8mes d\u2019\u00e9quations\n            lin\u00e9aires","volume":"39","author":"Cholesky","year":"2005","journal-title":"Bulletin De La Sabix. Soci\u00e9t\u00e9\n            Des Amis De La Biblioth\u00e8que Et De L\u2019Histoire De L\u2019\u00c9cole\n            Polytechnique"},{"key":"2023051506360360900_btx098-B8","doi-asserted-by":"crossref","first-page":"1411","DOI":"10.1534\/genetics.110.114819","article-title":"Using environmental correlations to\n            identify loci underlying local adaptation","volume":"185","author":"Coop","year":"2010","journal-title":"Genetics"},{"key":"2023051506360360900_btx098-B9","doi-asserted-by":"crossref","first-page":"442","DOI":"10.1017\/thg.2012.8","article-title":"Imaging and cognitive genetics: the\n            Norwegian Cognitive NeuroGenetics sample","volume":"15","author":"Espeseth","year":"2012","journal-title":"Twin Res. Hum.\n            Genet"},{"key":"2023051506360360900_btx098-B10","doi-asserted-by":"crossref","first-page":"e1003905","DOI":"10.1371\/journal.pgen.1003905","article-title":"Robust demographic inference from\n            genomic and SNP data","volume":"9","author":"Excoffier","year":"2013","journal-title":"PLoS Genet"},{"key":"2023051506360360900_btx098-B11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1086\/284325","article-title":"Phylogenies and the comparative\n            method","author":"Felsenstein","year":"1985","journal-title":"Am. Nat"},{"key":"2023051506360360900_btx098-B12","doi-asserted-by":"crossref","first-page":"1635","DOI":"10.1534\/genetics.107.072371","article-title":"A Markov chain Monte Carlo approach\n            for joint inference of population structure and inbreeding rates from multilocus\n            genotype data","volume":"176","author":"Gao","year":"2007","journal-title":"Genetics"},{"key":"2023051506360360900_btx098-B13","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1534\/genetics.113.152462","article-title":"Robust identification of local adaptation from allele\n            frequencies","volume":"195","author":"Gunther","year":"2013","journal-title":"Genetics"},{"key":"2023051506360360900_btx098-B14","doi-asserted-by":"crossref","first-page":"1299","DOI":"10.1038\/nature04226","article-title":"A\n            haplotype map of the human genome","volume":"437","author":"International HapMap Consortium","year":"2005","journal-title":"Nature"},{"key":"2023051506360360900_btx098-B15","author":"Karush","year":"1939"},{"key":"2023051506360360900_btx098-B17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-014-0356-4","article-title":"ANGSD: analysis of next generation\n            sequencing data","volume":"15","author":"Korneliussen","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"2023051506360360900_btx098-B18","volume-title":"2nd Berkeley Symposium","author":"Kuhn","year":"1951"},{"key":"2023051506360360900_btx098-B19","doi-asserted-by":"crossref","first-page":"978","DOI":"10.1016\/S1474-4422(10)70184-8","article-title":"Chromosome 9p21 in amyotrophic\n            lateral sclerosis in Finland: a genome-wide association study","volume":"9","author":"Laaksovirta","year":"2010","journal-title":"Lancet Neurol"},{"key":"2023051506360360900_btx098-B20","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1038\/nrg1961","article-title":"Modern computational approaches for analysing\n            molecular genetic variation data","volume":"7","author":"Marjoram","year":"2006","journal-title":"Nat. Rev. Genet"},{"key":"2023051506360360900_btx098-B21","doi-asserted-by":"crossref","first-page":"1387","DOI":"10.1098\/rstb.2005.1673","article-title":"Approximating the coalescent with\n            recombination","volume":"360","author":"McVean","year":"2005","journal-title":"Philos. Trans. R. Soc. Lond. B Biol.\n          Sci"},{"key":"2023051506360360900_btx098-B22","volume-title":"Linear Complementarity, Linear and Nonlinear\n            Programming","author":"Murty","year":"1988"},{"key":"2023051506360360900_btx098-B23","doi-asserted-by":"crossref","first-page":"308","DOI":"10.1093\/comjnl\/7.4.308","article-title":"A simplex method for function\n            minimization","volume":"7","author":"Nelder","year":"1965","journal-title":"Comput. J"},{"key":"2023051506360360900_btx098-B24","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1016\/j.ajhg.2008.08.005","article-title":"The Population Reference Sample,\n            POPRES: a resource for population, disease, and pharmacological genetics\n            research","volume":"83","author":"Nelson","year":"2008","journal-title":"Am. J. Hum. Genet"},{"key":"2023051506360360900_btx098-B28","doi-asserted-by":"crossref","first-page":"e1002967","DOI":"10.1371\/journal.pgen.1002967","article-title":"Inference of population splits and mixtures from\n            genome-wide allele frequency data","volume":"8","author":"Pickrell","year":"2012","journal-title":"PLoS Genet"},{"key":"2023051506360360900_btx098-B29","doi-asserted-by":"crossref","first-page":"945","DOI":"10.1093\/genetics\/155.2.945","article-title":"Inference of population structure\n            using multilocus genotype data","volume":"155","author":"Pritchard","year":"2000","journal-title":"Genetics"},{"key":"2023051506360360900_btx098-B30","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1086\/519795","article-title":"PLINK: a tool set for whole-genome\n            association and population-based linkage analyses","volume":"81","author":"Purcell","year":"2007","journal-title":"Am. J. Hum.\n            Genet"},{"key":"2023051506360360900_btx098-B31","doi-asserted-by":"crossref","first-page":"370","DOI":"10.1038\/nature11258","article-title":"Reconstructing native American\n            population history","volume":"488","author":"Reich","year":"2012","journal-title":"Nature"},{"key":"2023051506360360900_btx098-B32","doi-asserted-by":"crossref","first-page":"1150","DOI":"10.1038\/ng.2742","article-title":"Genome-wide association analysis\n            identifies 13 new risk loci for schizophrenia","volume":"45","author":"Ripke","year":"2013","journal-title":"Nat.\n            Genet"},{"key":"2023051506360360900_btx098-B33","doi-asserted-by":"crossref","first-page":"661","DOI":"10.1016\/j.ajhg.2010.03.011","article-title":"Inferring genetic ancestry:\n            opportunities, challenges, and implications","volume":"86","author":"Royal","year":"2010","journal-title":"Am. J. Hum.\n            Genet"},{"key":"2023051506360360900_btx098-B34","first-page":"406","article-title":"The neighbor-joining method: a new method for\n            reconstructing phylogenetic trees","volume":"4","author":"Saitou","year":"1987","journal-title":"Mol. Biol. Evolut"},{"key":"2023051506360360900_btx098-B35","doi-asserted-by":"crossref","first-page":"629","DOI":"10.1086\/502802","article-title":"A fast and flexible statistical model for large-scale\n            population genotype data: applications to inferring missing genotypes and haplotypic\n            phase","volume":"78","author":"Scheet","year":"2006","journal-title":"Am. J. Hum. Genet"},{"key":"2023051506360360900_btx098-B37","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1534\/genetics.113.154138","article-title":"Estimating individual admixture\n            proportions from next generation sequencing data","volume":"195","author":"Skotte","year":"2013","journal-title":"Genetics"},{"key":"2023051506360360900_btx098-B38","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1002\/gepi.20064","article-title":"Estimation of individual admixture:\n            analytical and study design considerations","volume":"28","author":"Tang","year":"2005","journal-title":"Genet.\n            Epidemiol"},{"key":"2023051506360360900_btx098-B39","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1111\/j.1467-9469.2007.00585.x","article-title":"Simple and globally convergent methods for\n            accelerating the convergence of any EM algorithm","volume":"35","author":"Varadhan","year":"2008","journal-title":"Scand. J.\n            Stat"},{"key":"2023051506360360900_btx098-B40","doi-asserted-by":"crossref","first-page":"703","DOI":"10.1101\/gr.076539.108","article-title":"Non-Darwinian estimation: My ancestors, my\n            genes\u2019 ancestors","volume":"19","author":"Weiss","year":"2009","journal-title":"Genome Res"},{"key":"2023051506360360900_btx098-B41","doi-asserted-by":"crossref","first-page":"397","DOI":"10.1080\/00401706.1978.10489693","article-title":"Cross-validatory estimation of the\n            number of components in factor and principal components models","volume":"20","author":"Wold","year":"1978","journal-title":"Technometrics"},{"key":"2023051506360360900_btx098-B42","doi-asserted-by":"crossref","first-page":"725","DOI":"10.1038\/ng.2285","article-title":"A model-based approach for analysis\n            of spatial structure in genetic data","volume":"44","author":"Yang","year":"2012","journal-title":"Nat. Genet"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/14\/2148\/50314586\/bioinformatics_33_14_2148.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/14\/2148\/50314586\/bioinformatics_33_14_2148.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,15]],"date-time":"2023-05-15T02:36:51Z","timestamp":1684118211000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/14\/2148\/3002763"}},"subtitle":[],"editor":[{"given":"Oliver","family":"Stegle","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,1,17]]},"references-count":37,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2017,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx098","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/071233","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2017,7,15]]},"published":{"date-parts":[[2017,1,17]]}}}