{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,28]],"date-time":"2025-10-28T14:35:40Z","timestamp":1761662140361,"version":"3.30.2"},"reference-count":49,"publisher":"Oxford University Press (OUP)","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2005,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Haplotype reconstruction is an essential step in genetic linkage and association studies. Although many methods have been developed to estimate haplotype frequencies and reconstruct haplotypes for a sample of unrelated individuals, haplotype reconstruction in large pedigrees with a large number of genetic markers remains a challenging problem.<\/jats:p><jats:p>Methods: We have developed an efficient computer program, HAPLORE (HAPLOtype REconstruction), to identify all haplotype sets that are compatible with the observed genotypes in a pedigree for tightly linked genetic markers. HAPLORE consists of three steps that can serve different needs in applications. In the first step, a set of logic rules is used to reduce the number of compatible haplotypes of each individual in the pedigree as much as possible. After this step, the haplotypes of all individuals in the pedigree can be completely or partially determined. These logic rules are applicable to completely linked markers and they can be used to impute missing data and check genotyping errors. In the second step, a haplotype-elimination algorithm similar to the genotype-elimination algorithms used in linkage analysis is applied to delete incompatible haplotypes derived from the first step. All superfluous haplotypes of the pedigree members will be excluded after this step. 
In the third step, the expectation-maximization (EM) algorithm combined with the partition and ligation technique is used to estimate haplotype frequencies based on the inferred haplotype configurations through the first two steps. Only compatible haplotype configurations with haplotypes having frequencies greater than a threshold are retained.<\/jats:p><jats:p>Results: We test the effectiveness and the efficiency of HAPLORE using both simulated and real datasets. Our results show that, the rule-based algorithm is very efficient for completely genotyped pedigree. In this case, almost all of the families have one unique haplotype configuration. In the presence of missing data, the number of compatible haplotypes can be substantially reduced by HAPLORE, and the program will provide all possible haplotype configurations of a pedigree under different circumstances, if such multiple configurations exist. These inferred haplotype configurations, as well as the haplotype frequencies estimated by the EM algorithm, can be used in genetic linkage and association studies.<\/jats:p><jats:p>Availability: The program can be downloaded from http:\/\/bioinformatics.med.yale.edu<\/jats:p><jats:p>Contact: \u00a0hongyu.zhao@yale.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/bth388","type":"journal-article","created":{"date-parts":[[2004,7,2]],"date-time":"2004-07-02T00:24:20Z","timestamp":1088727860000},"page":"90-103","source":"Crossref","is-referenced-by-count":80,"title":["HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination"],"prefix":"10.1093","volume":"21","author":[{"given":"Kui","family":"Zhang","sequence":"first","affiliation":[]},{"given":"Fengzhu","family":"Sun","sequence":"additional","affiliation":[]},{"given":"Hongyu","family":"Zhao","sequence":"additional","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2004,7,1]]},"reference":[{"key":"2023013107193539800_B1","doi-asserted-by":"crossref","unstructured":"Akey, 
J., Jin, L., Xiong, M. 2001Haplotypes vs single marker linkage disequilibrium tests: what do we gain?. Eur. J. Hum. Genet.9291\u2013300","DOI":"10.1038\/sj.ejhg.5200619"},{"key":"2023013107193539800_B2","unstructured":"Becker, T. and Knapp, M. 2003Efficiency of haplotype frequency estimation when nuclear family information is included. Hum. Hered.5445\u201353"},{"key":"2023013107193539800_B3","unstructured":"Clark, A.G. 1990Inference of haplotypes from PCR-amplified samples of diploid populations. Mol. Biol. Evol.7111\u2013112"},{"key":"2023013107193539800_B4","doi-asserted-by":"crossref","unstructured":"Cox, R., Bouzekri, N., Martin, S., Southam, L., Hugill, A., Golamaully, M., Cooper, R., Adeyemo, A., Soubrier, F., Ward, R., et al. 2002Angiotensin-1-converting enzyme (ACE) plasma concentration is influenced by multiple ACE-linked quantitative trait nucleotides. Hum. Mol. Genet.112969\u20132977","DOI":"10.1093\/hmg\/11.23.2969"},{"key":"2023013107193539800_B5","doi-asserted-by":"crossref","unstructured":"Daly, M.J., Rioux, J.D., Schaffner, S.F., Hudson, T.J., Lander, E.S. 2001High-resolution haplotype structure in the human genome. Nat. Genet.29229\u2013232","DOI":"10.1038\/ng1001-229"},{"key":"2023013107193539800_B6","doi-asserted-by":"crossref","unstructured":"Douglas, J.A., Boehnke, M., Gillanders, E., Trent, J.M., Gruber, S.B. 2001Experimentally derived haplotypes substantially increase the efficiency of linkage disequilibrium studies. Nat. Genet.28361\u2013364","DOI":"10.1038\/ng582"},{"key":"2023013107193539800_B7","doi-asserted-by":"crossref","unstructured":"Du, F.X., Woodward, B.W., Denise, S.K. 1998Haplotype construction of sires with progeny genotypes based on an exact likelihood. J. Dairy Sci.811462\u20131468","DOI":"10.3168\/jds.S0022-0302(98)75710-8"},{"key":"2023013107193539800_B8","unstructured":"Dudbridge, F., Koeleman, B.P.C., Todd, J.A., Clayton, D.G. 2000Unbiased application of the transmission\/disequilibrium test to multilocus haplotypes. Am. 
J. Hum. Genet.662009\u20132012"},{"key":"2023013107193539800_B9","doi-asserted-by":"crossref","unstructured":"Elston, R.C. and Stewart, J. 1971General model for genetic analysis of pedigree data. Hum. Hered.21523\u2013542","DOI":"10.1159\/000152448"},{"key":"2023013107193539800_B10","unstructured":"Excoffier, L. and Slatkin, M. 1995Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population. Mol. Biol. Evol.12921\u2013927"},{"key":"2023013107193539800_B11","doi-asserted-by":"crossref","unstructured":"Fallin, D. and Schork, N. 2000Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data. Am. J. Hum. Genet.67947\u2013959","DOI":"10.1086\/303069"},{"key":"2023013107193539800_B12","unstructured":"Goldstein, D.B. 2001Islands of linkage disequilibrium. Nat. Genet.29109\u2013211"},{"key":"2023013107193539800_B13","unstructured":"Gusfield, D. 2001Inference of haplotypes from samples of diploid populations: complexity and algorithms. J. Comput. Biol.8305\u2013323"},{"key":"2023013107193539800_B14","unstructured":"Haines, J.L. 1992Chromlook: an interactive program for error detection and mapping in reference linkage data. Genomics14517\u2013519"},{"key":"2023013107193539800_B15","unstructured":"Hawley, M.E. and Kidd, K.K. 1995HAPLO: a program using the EM algorithm to estimate the frequencies of multi-site haplotypes. J. Hered.86409\u2013411"},{"key":"2023013107193539800_B16","doi-asserted-by":"crossref","unstructured":"Hodge, S.E., Boehnke, M., Spence, M.A. 1999Loss of information due to ambiguous haplotyping of SNPs. Nat. Genet.21360\u2013361","DOI":"10.1038\/7687"},{"key":"2023013107193539800_B17","doi-asserted-by":"crossref","unstructured":"Keavney, B., McKenzie, C.A., Connell, J.M.C., Julier, C., Ratcliffe, P.J., Sobel, E., Lathrop, M., Farrall, M. 1998Measured haplotype analysis of the angiotensin-I converting enzyme gene. Hum. Mol. 
Genet.71745\u20131751","DOI":"10.1093\/hmg\/7.11.1745"},{"key":"2023013107193539800_B18","doi-asserted-by":"crossref","unstructured":"Kruglyak, L. 1999Prospects for whole-genome linkage disequilibrium mapping of common disease genes. Nat. Genet.22139\u2013144","DOI":"10.1038\/9642"},{"key":"2023013107193539800_B19","unstructured":"Kruglyak, L., Daly, M.J., Reeve-Daly, M.P., Lander, E.S. 1996Parametric and nonparametric linkage analysis: a unified multipoint approach. Am. J. Hum. Genet.581347\u20131363"},{"key":"2023013107193539800_B20","doi-asserted-by":"crossref","unstructured":"Lander, E.S. and Green, P. 1987Construction of multilocus genetic-linkage maps in humans. Proc. Natl Acad. Sci. USA842363\u20132367","DOI":"10.1073\/pnas.84.8.2363"},{"key":"2023013107193539800_B21","unstructured":"Lange, K. and Boehnke, M. 1983Extensions to pedigree analysis. V. Optimal calculation of Mendelian likelihood. Hum. Hered.33291\u2013301"},{"key":"2023013107193539800_B22","unstructured":"Lange, K. and Goradia, T.M. 1987An algorithm for automatic genotype elimination. Am. J. Hum. Genet.40250\u2013256"},{"key":"2023013107193539800_B23","doi-asserted-by":"crossref","unstructured":"Lange, K. and Weeks, D.E. 1989Efficient computation of LOD scores: genotype elimination, genotype redefinition, and hybrid maximum likelihood algorithms. Ann. Hum. Genet.5367\u201383","DOI":"10.1111\/j.1469-1809.1989.tb01122.x"},{"key":"2023013107193539800_B24","doi-asserted-by":"crossref","unstructured":"Li, J. and Jiang, T. 2003Efficient rule-based haplotyping algorithm for pedigree data. In Miller, W., Vingron, M., Istrail, S., Pevzner, P., Waterman, M. (Eds.). Proceedings of the Seventh Annual International Conference on Research in Computational Molecular Biology (RECOMB03) , New York ACM, pp. 197\u2013206","DOI":"10.1145\/640075.640101"},{"key":"2023013107193539800_B25","unstructured":"Lin, S., Cutler, D.J., Zwick, M.E., Chakravarti, A. 2002Haplotype inference in random population samples. Am. J. 
Hum. Genet.711129\u20131137"},{"key":"2023013107193539800_B26","unstructured":"Lin, S.L. and Speed, T.P. 1997An algorithm for haplotype analysis. J. Comput. Biol.4535\u2013546"},{"key":"2023013107193539800_B27","unstructured":"Long, J.C., Williams, R.C., Urbanek, M. 1995An E-M algorithm and testing strategy for multiple-locus haplotypes. Am. J. Hum. Genet.56799\u2013810"},{"key":"2023013107193539800_B28","doi-asserted-by":"crossref","unstructured":"Michlataos-Beloin, S., Tishkoff, S.A., Bentley, K.L., Kidd, K.K., Ruano, G. 1996Molecular haplotyping of genetic markers 10 kb apart by allelic-specific long-range PCR. Nucleic Acids Res.244841\u20134843","DOI":"10.1093\/nar\/24.23.4841"},{"key":"2023013107193539800_B29","doi-asserted-by":"crossref","unstructured":"Nejati-Javaremi, A. and Smith, C. 1996Assigning linkage haplotypes from parent and progeny genotypes. Genetics1421363\u20131367","DOI":"10.1093\/genetics\/142.4.1363"},{"key":"2023013107193539800_B30","unstructured":"Niu, T., Qin, Z., Xu, X., Liu, J.S. 2002Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms. Am. J. Hum. Genet.70157\u2013159"},{"key":"2023013107193539800_B31","doi-asserted-by":"crossref","unstructured":"O'Connell, J.R. 2000Zero-recombinant haplotyping: applications to fine mapping using SNPs. Genet. Epidemiol.19(Suppl. 1),S64\u2013S70","DOI":"10.1002\/1098-2272(2000)19:1+<::AID-GEPI10>3.0.CO;2-G"},{"key":"2023013107193539800_B32","unstructured":"O'Connell, J.R. and Weeks, D.E. 1999An optimal algorithm for automatic genotype elimination. Am. J. Hum. Genet.651733\u20131740"},{"key":"2023013107193539800_B33","doi-asserted-by":"crossref","unstructured":"Patil, N., Berno, A.J., Hinds, D.A., Barrett, W.A., Doshi, J.M., Hacker, C.R., Kautzer, C.R., Lee, D.H., Marjoribanks, C., McDonough, D.P., et al. 2001Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21. 
Science2941719\u20131723","DOI":"10.1126\/science.1065573"},{"key":"2023013107193539800_B34","unstructured":"Qian, D. and Beckman, L. 2002Minimum-recombinant haplotyping in pedigrees. Am. J. Hum. Genet.701434\u20131445"},{"key":"2023013107193539800_B35","unstructured":"Qin, Z., Niu, T., Liu, J. 2002Partitioning-ligation-expectation-maximization algorithm for haplotype inference with single-nucleotide polymorphisms. Am. J. Hum. Genet.711242\u20131247"},{"key":"2023013107193539800_B36","doi-asserted-by":"crossref","unstructured":"Rohde, K. and Fuerst, R. 2001Haplotyping and estimation of haplotype frequencies for closely linked biallelic multilocus genetic phenotypes including nuclear family information. Hum. Mutat.17289\u2013295","DOI":"10.1002\/humu.26"},{"key":"2023013107193539800_B37","doi-asserted-by":"crossref","unstructured":"Schaid, D.J. 2002Relative efficiency of ambiguous vs. directly measured haplotype frequencies. Genet. Epidemiol.23426\u2013443","DOI":"10.1002\/gepi.10184"},{"key":"2023013107193539800_B38","doi-asserted-by":"crossref","unstructured":"Sobel, E., Lange, K., O'Connell, J.R., Weeks, D.E. 1995Haplotype algorithms. In Speed, T.P. and Waterman, M.S. (Eds.). Genetic Mapping and DNA Sequencing , New York IMA Volumes in Mathematics and Its Applications Springer, pp. 89\u2013110","DOI":"10.1007\/978-1-4612-0751-1_6"},{"key":"2023013107193539800_B39","unstructured":"Stephens, M., Smith, N.J., Donnelly, P. 2001A new statistical method for haplotype reconstruction from population data. Am. J. Hum. Genet.68978\u2013989"},{"key":"2023013107193539800_B40","unstructured":"Tapadar, P., Ghosh, S., Majumder, P.P. 2000Haplotyping in pedigrees via a genetic algorithm. Hum. Hered.5043\u201356"},{"key":"2023013107193539800_B41","doi-asserted-by":"crossref","unstructured":"Tishkoff, S.A., Pakstis, A.J., Ruano, G., Kidd, K.K. 2000The accuracy of statistical methods for estimation of haplotype frequencies: an example from the CD4 locus. Am. J. Hum. 
Genet.67518\u201322","DOI":"10.1086\/303000"},{"key":"2023013107193539800_B42","doi-asserted-by":"crossref","unstructured":"Toivonen, H.T.T., Onkamo, P., Vasko, K., Ollikainen, V., Sevon, P., Mannila, H., Herr, M., Kere, J. 2000Data mining applied to linkage disequilibrium mapping. Am. J. Hum. Genet.67133\u2013145","DOI":"10.1086\/302954"},{"key":"2023013107193539800_B43","doi-asserted-by":"crossref","unstructured":"Wang, N., Akey, J.M., Zhang, K., Chakraborty, K., Jin, L. 2002Distribution of recombination crossovers and the origin of haplotype blocks: the interplay of population history, recombination, and mutation. Am. J. Hum. Genet.711227\u20131234","DOI":"10.1086\/344398"},{"key":"2023013107193539800_B44","unstructured":"Weeks, D.E., Sobel, E., O'Connell, J.R., Lange, K. 1995Computer programs for multilocus haplotyping of general pedigrees. Am. J. Hum. Genet.561506\u20131507"},{"key":"2023013107193539800_B45","unstructured":"Wijsman, E.M. 1987A deductive method of haplotype analysis in pedigrees. Am. J. Hum. Genet.41356\u2013373"},{"key":"2023013107193539800_B46","unstructured":"Wijsman, E.M., Almasy, L., Amos, C.I., Borecki, I., Falk, C.T., King, T.M., Martinez, M.M., Meyers, D., Neuman, R., Olson, J.M., et al. 2001Genetic analysis workshop 12: analysis of complex genetic traits: applications to asthma and simulated data. Genet. Epidemiol.21(Suppl. 1),S1\u2013S853"},{"key":"2023013107193539800_B47","unstructured":"Zhang, S., Pakstis, A.J., Kidd, K.K., Zhao, H. 2001Comparisons of two methods for haplotype reconstruction and haplotype frequency estimates from population data. Am. J. Hum. Genet.69906\u2013912"},{"key":"2023013107193539800_B48","doi-asserted-by":"crossref","unstructured":"Zhang, S., Zhang, K., Li, J., Zhao, H. 2002On a family-based haplotype pattern mining method for linkage disequilibrium mapping. Pac. Symp. 
Biocomput.100\u2013111","DOI":"10.1142\/9789812799623_0010"},{"key":"2023013107193539800_B49","doi-asserted-by":"crossref","unstructured":"Zhao, H., Zhang, S., Merikangas, K.R., Trixler, M., Wildenauer, D.B., Sun, F.Z., Kidd, K.K. 2000Transmission\/disequilibrium tests using multiple tightly linked markers. Am. J. Hum. Genet.67936\u2013946","DOI":"10.1086\/303073"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/21\/1\/90\/48961933\/bioinformatics_21_1_90.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/21\/1\/90\/48961933\/bioinformatics_21_1_90.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,17]],"date-time":"2024-12-17T20:16:36Z","timestamp":1734466596000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/21\/1\/90\/212319"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2004,7,1]]},"references-count":49,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2005,1,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bth388","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2005,1,1]]},"published":{"date-parts":[[2004,7,1]]}}}