{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:34Z","timestamp":1772138074092,"version":"3.50.1"},"reference-count":57,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2025,4,13]],"date-time":"2025-04-13T00:00:00Z","timestamp":1744502400000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["SBE-BCS 2221920"],"award-info":[{"award-number":["SBE-BCS 2221920"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,6,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Summary<\/jats:title>\n                    <jats:p>Statistical methods that measure the extent of haplotype homozygosity on chromosomes have been highly informative for identifying episodes of recent selection. For example, the integrated haplotype score (iHS) and the extended haplotype homozygosity (EHH) statistics detect long-range haplotype structure around derived and ancestral alleles indicative of classic and soft selective sweeps, respectively. However, to our knowledge, there are currently no publicly available methods that classify ancestral and derived alleles in genomic datasets for the purpose of quantifying the extent of haplotype homozygosity. Here, we introduce the Polaris package, which polarizes chromosomal variants into ancestral and derived alleles and creates corresponding genetic maps for analysis by selscan and HaploSweep, two versatile haplotype-based programs that perform scans for selection. With the input files generated by Polaris, selscan and\/or HaploSweep can produce the appropriate sign (either positive or negative) for outlier iHS statistics, enabling users to distinguish between selection on derived or ancestral alleles. In addition, Polaris can convert the numerical output of these analyses into graphical representations of selective sweeps, increasing the functionality of our software.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>To demonstrate the utility of our approach, we applied the Polaris package to Chromosome 2 in the European Finnish, Middle Eastern Bedouin, and East African Maasai populations. More specifically, we examined the regulatory sequence in intron 13 of the MCM6 gene associated with lactase persistence (i.e. the ability to digest the lactose sugar present in fresh milk), a region of intense interest to human evolutionary geneticists. Our analyses showed that derived alleles (at known enhancers for lactase expression) sit on an extended haplotype background in the Finnish, Bedouin, and Maasai consistent with a classic selective sweep model as determined by iHS and EHH statistics. Importantly, we were able to immediately identify this target allele under selection based on the information generated by our software. We also explored outlier statistics across Chromosome 2 in two distinct datasets from these populations: (i) one containing polarized alleles generated with Polaris and (ii) the other containing unpolarized alleles in the original phased vcf file. Here, we found an excess of outlier statistics on Chromosome 2 in the unpolarized datasets, raising the possibility that a subset of these \u201chits\u201d of selection may be unreliable. Overall, Polaris is a versatile package that enables users to efficiently explore, interpret, and report signals of recent selection in genomic datasets.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The Polaris package is free and open source on GitHub (https:\/\/github.com\/alisi1989\/Polaris) and DropBox (https:\/\/www.dropbox.com\/scl\/fo\/mlxizft5267vem9u62qkn\/AAnM0qX923zPzQBlPX8iteM?rlkey=uezrp4t2waffpj0nmo1evr320&amp;e=1&amp;st=jaodccws&amp;dl=0).<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf171","type":"journal-article","created":{"date-parts":[[2025,4,11]],"date-time":"2025-04-11T07:42:02Z","timestamp":1744357322000},"source":"Crossref","is-referenced-by-count":1,"title":["Polaris: Polarization of ancestral and derived polymorphic alleles for inferences of extended haplotype homozygosity in human populations"],"prefix":"10.1093","volume":"41","author":[{"given":"Alessandro","family":"Lisi","sequence":"first","affiliation":[{"name":"Department of Biological Sciences (Human and Evolutionary Biology Section), University of Southern California , Los Angeles, CA 90089,","place":["United States"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0749-3445","authenticated-orcid":false,"given":"Michael C","family":"Campbell","sequence":"additional","affiliation":[{"name":"Department of Biological Sciences (Human and Evolutionary Biology Section), University of Southern California , Los Angeles, CA 90089,","place":["United States"]}]}],"member":"286","published-online":{"date-parts":[[2025,4,12]]},"reference":[{"key":"2025070408273368000_btaf171-B1","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nature11632","article-title":"An integrated map of genetic variation from 1,092 human genomes","volume":"491","author":"Abecasis","year":"2012","journal-title":"Nature"},{"key":"2025070408273368000_btaf171-B2","doi-asserted-by":"crossref","first-page":"2689","DOI":"10.3390\/nu12092689","article-title":"Genetics of lactose intolerance: an updated review and online interactive world maps of phenotype and genotype frequencies","volume":"12","author":"Anguita-Ruiz","year":"2020","journal-title":"Nutrients"},{"key":"2025070408273368000_btaf171-B3","doi-asserted-by":"crossref","first-page":"e1009562","DOI":"10.1371\/journal.pgen.1009562","article-title":"Patterns of recent natural selection on genetic loci associated with sexually differentiated human body size and shape phenotypes","volume":"17","author":"Arner","year":"2021","journal-title":"PLoS Genet"},{"key":"2025070408273368000_btaf171-B4","doi-asserted-by":"crossref","first-page":"eaay5012","DOI":"10.1126\/science.aay5012","article-title":"Insights into human genetic variation and population history from 929 diverse genomes","volume":"367","author":"Bergstr\u00f6m","year":"2020","journal-title":"Science"},{"key":"2025070408273368000_btaf171-B5","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1186\/s12915-017-0434-y","article-title":"Detecting positive selection in the genome","volume":"15","author":"Booker","year":"2017","journal-title":"BMC Biol"},{"key":"2025070408273368000_btaf171-B6","doi-asserted-by":"crossref","first-page":"1880","DOI":"10.1016\/j.ajhg.2021.08.005","article-title":"Fast two-stage phasing of large-scale sequence data","volume":"108","author":"Browning","year":"2021","journal-title":"Am J Hum Genet"},{"key":"2025070408273368000_btaf171-B7","doi-asserted-by":"crossref","first-page":"667","DOI":"10.1038\/ejhg.2010.257","article-title":"Identification of regions of positive selection using shared genomic segment analysis","volume":"19","author":"Cai","year":"2011","journal-title":"Eur J Hum Genet"},{"key":"2025070408273368000_btaf171-B8","doi-asserted-by":"crossref","first-page":"R98","DOI":"10.1093\/hmg\/ddab027","article-title":"Human adaptation, demography and cattle domestication: an overview of the complexity of lactase persistence in Africa","volume":"30","author":"Campbell","year":"2021","journal-title":"Hum Mol Genet"},{"key":"2025070408273368000_btaf171-B9","first-page":"20140819","article-title":"Neolithic dairy farming at the extreme of agriculture in Northern Europe","volume":"281","author":"Cramp","year":"2014","journal-title":"Proc Biol Sci"},{"key":"2025070408273368000_btaf171-B10","doi-asserted-by":"crossref","first-page":"D988","DOI":"10.1093\/nar\/gkab1049","article-title":"Ensembl 2022","volume":"50","author":"Cunningham","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2025070408273368000_btaf171-B11","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1038\/nmeth.1785","article-title":"A linear complexity phasing method for thousands of genomes","volume":"9","author":"Delaneau","year":"2011","journal-title":"Nat Methods"},{"key":"2025070408273368000_btaf171-B12","doi-asserted-by":"crossref","first-page":"2582","DOI":"10.1093\/molbev\/msab053","article-title":"Adaptive admixture of HLA class I allotypes enhanced genetically determined strength of natural killer cells in East Asians","volume":"38","author":"Deng","year":"2021","journal-title":"Mol Biol Evol"},{"key":"2025070408273368000_btaf171-B13","doi-asserted-by":"crossref","DOI":"10.7554\/eLife.69026","article-title":"Decreased recent adaptation at human mendelian disease genes as a possible consequence of interference between advantageous and deleterious variants","volume":"10","author":"Di","year":"2021","journal-title":"Elife"},{"key":"2025070408273368000_btaf171-B14","doi-asserted-by":"crossref","first-page":"885","DOI":"10.1101\/gr.164822.113","article-title":"Genome-wide signals of positive selection in human evolution","volume":"24","author":"Enard","year":"2014","journal-title":"Genome Res"},{"key":"2025070408273368000_btaf171-B15","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1038\/ng826","article-title":"Identification of a variant associated with adult-type hypolactasia","volume":"30","author":"Enattah","year":"2002","journal-title":"Nat Genet"},{"key":"2025070408273368000_btaf171-B16","doi-asserted-by":"crossref","first-page":"1275","DOI":"10.1093\/molbev\/msu077","article-title":"On detecting incomplete soft or hard selective sweeps using haplotype structure","volume":"31","author":"Ferrer-Admetlla","year":"2014","journal-title":"Mol Biol Evol"},{"key":"2025070408273368000_btaf171-B17","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1038\/ejhg.2012.148","article-title":"Using identity by descent estimation with dense genotype data to detect positive selection","volume":"21","author":"Han","year":"2013","journal-title":"Eur J Hum Genet"},{"key":"2025070408273368000_btaf171-B18","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1086\/499252","article-title":"Screening for recently selected alleles by analysis of human haplotype similarity","volume":"78","author":"Hanchard","year":"2006","journal-title":"Am J Hum Genet"},{"key":"2025070408273368000_btaf171-B19","doi-asserted-by":"crossref","first-page":"1429","DOI":"10.1534\/genetics.118.301502","article-title":"Detection and classification of hard and soft sweeps from unphased genotypes by multilocus genotype identity","volume":"210","author":"Harris","year":"2018","journal-title":"Genetics"},{"key":"2025070408273368000_btaf171-B20","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1186\/s13104-015-1833-1","article-title":"Genetic diversity of lactase persistence in East African populations","volume":"9","author":"Hassan","year":"2016","journal-title":"BMC Res Notes"},{"key":"2025070408273368000_btaf171-B21","doi-asserted-by":"crossref","first-page":"920","DOI":"10.1126\/science.1198878","article-title":"Classic selective sweeps were rare in recent human evolution","volume":"331","author":"Hernandez","year":"2011","journal-title":"Science"},{"key":"2025070408273368000_btaf171-B22","doi-asserted-by":"crossref","first-page":"4809","DOI":"10.1038\/s41598-024-83657-0","article-title":"Natural selection and adaptive traits in the Maniq, a nomadic hunter-gatherer society from Mainland Southeast Asia","volume":"15","author":"Herzog","year":"2025","journal-title":"Sci Rep"},{"key":"2025070408273368000_btaf171-B23","doi-asserted-by":"crossref","first-page":"779","DOI":"10.1007\/s00439-006-0291-1","article-title":"A novel polymorphism associated with lactose tolerance in Africa: multiple causes for lactase persistence?","volume":"120","author":"Ingram","year":"2007","journal-title":"Hum Genet"},{"key":"2025070408273368000_btaf171-B24","doi-asserted-by":"crossref","first-page":"579","DOI":"10.1007\/s00439-008-0593-6","article-title":"Lactose digestion and the evolutionary genetics of lactase persistence","volume":"124","author":"Ingram","year":"2009","journal-title":"Hum Genet"},{"key":"2025070408273368000_btaf171-B25","doi-asserted-by":"crossref","first-page":"789","DOI":"10.1038\/nature02168","article-title":"The international HapMap project","volume":"426","author":"International HapMap Consortium","year":"2003","journal-title":"Nature"},{"key":"2025070408273368000_btaf171-B26","doi-asserted-by":"crossref","first-page":"eaav2461","DOI":"10.1126\/sciadv.aav2461","article-title":"A migration-associated supergene reveals loss of biocomplexity in Atlantic cod","volume":"5","author":"Kess","year":"2019","journal-title":"Sci Adv"},{"key":"2025070408273368000_btaf171-B27","doi-asserted-by":"crossref","first-page":"e0262024","DOI":"10.1371\/journal.pone.0262024","article-title":"Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data","volume":"17","author":"Klassmann","year":"2022","journal-title":"PLoS One"},{"key":"2025070408273368000_btaf171-B28","doi-asserted-by":"crossref","first-page":"3081","DOI":"10.1111\/mec.13671","article-title":"A haplotype method detects diverse scenarios of local adaptation from genomic sequence variation","volume":"25","author":"Lange","year":"2016","journal-title":"Mol Ecol"},{"key":"2025070408273368000_btaf171-B29","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/btae616","article-title":"AncestryGrapher toolkit: Python command-line pipelines to visualize global- and local-ancestry inferences from the RFMIX version 2 software","volume":"40","author":"Lisi","year":"2024","journal-title":"Bioinformatics"},{"key":"2025070408273368000_btaf171-B30","doi-asserted-by":"crossref","first-page":"866","DOI":"10.1016\/j.ajhg.2013.04.021","article-title":"Detecting and characterizing genomic signatures of positive selection in global populations","volume":"92","author":"Liu","year":"2013","journal-title":"Am J Hum Genet"},{"key":"2025070408273368000_btaf171-B31","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1016\/j.tree.2013.08.003","article-title":"Population genomics of rapid adaptation by soft selective sweeps","volume":"28","author":"Messer","year":"2013","journal-title":"Trends Ecol Evol"},{"key":"2025070408273368000_btaf171-B32","doi-asserted-by":"crossref","first-page":"1102","DOI":"10.1086\/421050","article-title":"The T allele of a single-nucleotide polymorphism 13.9 kb upstream of the lactase gene (LCT) (C-13.9kbT) does not predict or cause the lactase-persistence phenotype in Africans","volume":"74","author":"Mulcare","year":"2004","journal-title":"Am J Hum Genet"},{"key":"2025070408273368000_btaf171-B33","doi-asserted-by":"crossref","first-page":"1829","DOI":"10.1101\/gr.076521.108","article-title":"Genome-wide nucleotide-level mammalian ancestor reconstruction","volume":"18","author":"Paten","year":"2008","journal-title":"Genome Res"},{"key":"2025070408273368000_btaf171-B34","doi-asserted-by":"crossref","first-page":"826","DOI":"10.1101\/gr.087577.108","article-title":"Signals of recent positive selection in a worldwide sample of human populations","volume":"19","author":"Pickrell","year":"2009","journal-title":"Genome Res"},{"key":"2025070408273368000_btaf171-B35","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1046\/j.1469-1809.2003.00048.x","article-title":"The causal element for the lactase persistence\/non-persistence polymorphism is located in a 1 Mb region of linkage disequilibrium in Europeans","volume":"67","author":"Poulter","year":"2003","journal-title":"Ann Hum Genet"},{"key":"2025070408273368000_btaf171-B36","doi-asserted-by":"crossref","first-page":"R208","DOI":"10.1016\/j.cub.2009.11.055","article-title":"The genetics of human adaptation: hard sweeps, soft sweeps, and polygenic adaptation","volume":"20","author":"Pritchard","year":"2010","journal-title":"Curr Biol"},{"key":"2025070408273368000_btaf171-B37","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1086\/519795","article-title":"PLINK: a tool set for whole-genome association and population-based linkage analyses","volume":"81","author":"Purcell","year":"2007","journal-title":"Am J Hum Genet"},{"key":"2025070408273368000_btaf171-B38","doi-asserted-by":"crossref","first-page":"496","DOI":"10.1016\/j.ajhg.2014.02.009","article-title":"Genetic origins of lactase persistence and the spread of pastoralism in Africa","volume":"94","author":"Ranciaro","year":"2014","journal-title":"Am J Hum Genet"},{"key":"2025070408273368000_btaf171-B39","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1111\/imr.13329","article-title":"Shaping immunity: the influence of natural selection on population immune diversity","volume":"323","author":"Randolph","year":"2024","journal-title":"Immunol Rev"},{"key":"2025070408273368000_btaf171-B40","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1016\/j.tig.2020.03.006","article-title":"The genomics of human local adaptation","volume":"36","author":"Rees","year":"2020","journal-title":"Trends Genet"},{"key":"2025070408273368000_btaf171-B41","doi-asserted-by":"crossref","first-page":"4320","DOI":"10.1038\/s41598-022-08439-y","article-title":"Understanding signatures of positive natural selection in human zinc transporter genes","volume":"12","author":"Roca-Umbert","year":"2022","journal-title":"Sci Rep"},{"key":"2025070408273368000_btaf171-B42","doi-asserted-by":"crossref","first-page":"5378","DOI":"10.1038\/s41598-022-09566-2","article-title":"Author correction: understanding signatures of positive natural selection in human zinc transporter genes","volume":"12","author":"Roca-Umbert","year":"2022","journal-title":"Sci Rep"},{"key":"2025070408273368000_btaf171-B43","doi-asserted-by":"crossref","first-page":"832","DOI":"10.1038\/nature01140","article-title":"Detecting recent positive selection in the human genome from haplotype structure","volume":"419","author":"Sabeti","year":"2002","journal-title":"Nature"},{"key":"2025070408273368000_btaf171-B44","doi-asserted-by":"crossref","first-page":"1614","DOI":"10.1126\/science.1124309","article-title":"Positive natural selection in the human lineage","volume":"312","author":"Sabeti","year":"2006","journal-title":"Science"},{"key":"2025070408273368000_btaf171-B45","doi-asserted-by":"crossref","first-page":"913","DOI":"10.1038\/nature06250","article-title":"Genome-wide detection and characterization of positive selection in human populations","volume":"449","author":"Sabeti","year":"2007","journal-title":"Nature"},{"key":"2025070408273368000_btaf171-B46","doi-asserted-by":"crossref","first-page":"1863","DOI":"10.1093\/molbev\/msx154","article-title":"Soft sweeps are the dominant mode of adaptation in the human genome","volume":"34","author":"Schrider","year":"2017","journal-title":"Mol Biol Evol"},{"key":"2025070408273368000_btaf171-B47","doi-asserted-by":"crossref","first-page":"849","DOI":"10.1101\/gr.213611.116","article-title":"Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly","volume":"27","author":"Schneider","year":"2017","journal-title":"Genome Res"},{"key":"2025070408273368000_btaf171-B48","doi-asserted-by":"crossref","first-page":"887582","DOI":"10.3389\/fgene.2022.887582","article-title":"Identification of selection signals on the X-chromosome in East Adriatic sheep: a new complementary approach","volume":"13","author":"Shihabi","year":"2022","journal-title":"Front Genet"},{"key":"2025070408273368000_btaf171-B49","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/btae006","article-title":"Selscan 2.0: scanning for sweeps in unphased data","volume":"40","author":"Szpiech","year":"2024","journal-title":"Bioinformatics"},{"key":"2025070408273368000_btaf171-B50","first-page":"2824","author":"Szpiech"},{"key":"2025070408273368000_btaf171-B51","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1038\/ng1946","article-title":"Convergent adaptation of human lactase persistence in Africa and Europe","volume":"39","author":"Tishkoff","year":"2007","journal-title":"Nat Genet"},{"key":"2025070408273368000_btaf171-B52","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1002\/evl3.97","article-title":"An evolutionary compass for detecting signals of polygenic selection and mutational bias","volume":"3","author":"Uricchio","year":"2019","journal-title":"Evol Lett"},{"key":"2025070408273368000_btaf171-B53","doi-asserted-by":"crossref","first-page":"e72","DOI":"10.1371\/journal.pbio.0040072","article-title":"A map of recent positive selection in the human genome","volume":"4","author":"Voight","year":"2006","journal-title":"PLoS Biol"},{"key":"2025070408273368000_btaf171-B54","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1073\/pnas.0509691102","article-title":"Global landscape of recent inferred Darwinian selection for Homo sapiens","volume":"103","author":"Wang","year":"2006","journal-title":"Proc Natl Acad Sci USA"},{"key":"2025070408273368000_btaf171-B55","doi-asserted-by":"crossref","first-page":"2122","DOI":"10.1093\/bioinformatics\/btl365","article-title":"A whole genome long-range haplotype (WGLRH) test for detecting imprints of positive selection in human populations","volume":"22","author":"Zhang","year":"2006","journal-title":"Bioinformatics"},{"key":"2025070408273368000_btaf171-B56","doi-asserted-by":"crossref","DOI":"10.1093\/molbev\/msae192","article-title":"HaploSweep: detecting and distinguishing recent soft and hard selective sweeps through haplotype structure","volume":"41","author":"Zhao","year":"2024","journal-title":"Mol Biol Evol"},{"key":"2025070408273368000_btaf171-B57","doi-asserted-by":"crossref","first-page":"3006","DOI":"10.1093\/molbev\/msx213","article-title":"Detection of regional variation in selection intensity within protein-coding genes using DNA sequence polymorphism and divergence","volume":"34","author":"Zhao","year":"2017","journal-title":"Mol Biol Evol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf171\/62923065\/btaf171.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/6\/btaf171\/62923065\/btaf171.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/6\/btaf171\/62923065\/btaf171.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,4]],"date-time":"2025-07-04T08:27:48Z","timestamp":1751617668000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf171\/8112925"}},"subtitle":[],"editor":[{"given":"Russell","family":"Schwartz","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,4,12]]},"references-count":57,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2025,6,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf171","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2024.12.06.627098","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,6]]},"published":{"date-parts":[[2025,4,12]]},"article-number":"btaf171"}}