{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,13]],"date-time":"2025-09-13T15:51:41Z","timestamp":1757778701808},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"7","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2005,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: GenBank data are at present lacking alpha satellite higher-order repeat (HOR) annotation. Furthermore, exact HOR consensus lengths have not been reported so far. Given the fast growth of sequence databases in the centromeric region, it is of increasing interest to have efficient tools for computational identification and analysis of HORs from known sequences.<\/jats:p>\n               <jats:p>Results: We develop a graphical user interface method, ColorHOR, for fast computational identification of HORs in a given genomic sequence, without requiring a priori information on the composition of the genomic sequence. ColorHOR is based on an extension of the key-string algorithm and provides a color representation of the order and orientation of HORs. For the key string, we use a robust 6 bp string from a consensus alpha satellite and its representative nature is tested. ColorHOR algorithm provides a direct visual identification of HORs (direct and\/or reverse complement). In more detail, we first illustrate the ColorHOR results for human chromosome 1. Using ColorHOR we determine for the first time the HOR annotation of the GenBank sequence of the whole human genome. In addition to some HORs, corresponding to those determined previously biochemically, we find new HORs in chromosomes 4, 8, 9, 10, 11 and 19. For the first time, we determine exact consensus lengths of HORs in 10 chromosomes. We propose that the HOR assignment obtained by using ColorHOR be included into the GenBank database.<\/jats:p>\n               <jats:p>Availability: The program with graphical user interface application for ColorHOR is freely available at http:\/\/www.hazu.hr\/KSA\/colorHOR.html. It can be run on any platform on which wxPython is supported.<\/jats:p>\n               <jats:p>Contact: \u00a0paar@hazu.hr<\/jats:p>\n               <jats:p>Supplementary information: \u00a0http:\/\/www.hazu.hr\/KSA\/colorHOR.html.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bti072","type":"journal-article","created":{"date-parts":[[2004,10,28]],"date-time":"2004-10-28T00:23:01Z","timestamp":1098922981000},"page":"846-852","source":"Crossref","is-referenced-by-count":22,"title":["ColorHOR\u2014novel graphical algorithm for fast scan of alpha satellite higher-order repeats and HOR annotation for GenBank sequence of human genome"],"prefix":"10.1093","volume":"21","author":[{"given":"Vladimir","family":"Paar","sequence":"first","affiliation":[]},{"given":"Nenad","family":"Pavin","sequence":"additional","affiliation":[]},{"given":"Marija","family":"Rosandi\u0107","sequence":"additional","affiliation":[]},{"given":"Matko","family":"Glun\u010di\u0107","sequence":"additional","affiliation":[]},{"given":"Ivan","family":"Basar","sequence":"additional","affiliation":[]},{"given":"Robert","family":"Pezer","sequence":"additional","affiliation":[]},{"given":"Sonja Durajlija","family":"\u017dini\u0107","sequence":"additional","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2004,10,27]]},"reference":[{"key":"2023013107420067000_B1","doi-asserted-by":"crossref","unstructured":"Alexandrov, I.A., Medvedev, L.I., Mashkova, T.D., Kisselev, L.L., Romanova, L.Y., Yurov, Y.B. 1993Definition of a new alpha satellite suprachromosomal family characterized by monomeric organization. Nucleic Acids Res.212209\u20132215","DOI":"10.1093\/nar\/21.9.2209"},{"key":"2023013107420067000_B2","unstructured":"Baldi, P. and Baisnee, P.F. 2000Sequence analysis by additive scales: DNA structure for sequences and repeats of all lengths. Bioinformatics16865\u2013889"},{"key":"2023013107420067000_B3","doi-asserted-by":"crossref","unstructured":"Benson, G. 1999Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res.27573\u2013580","DOI":"10.1093\/nar\/27.2.573"},{"key":"2023013107420067000_B4","doi-asserted-by":"crossref","unstructured":"Benson, G. and Waterman, M.S. 1994A method for fast database search for all k-nucleotide repeats. Nucleic Acids Res.224828\u20134836","DOI":"10.1093\/nar\/22.22.4828"},{"key":"2023013107420067000_B5","unstructured":"Blanchard, M.K., Chiapello, H., Coward, E. 2000Detecting localized repeats in genomic sequences: a new strategy and its applications to Bacillus subtilis and Arabidopsis thaliana sequences. Comput. Chem.2457\u201370"},{"key":"2023013107420067000_B6","doi-asserted-by":"crossref","unstructured":"Bor\u0161tnik, B., Pumpernik, D., Lukman, D., Ugarkovi\u0107, D., Plohl, M. 1994Tandemly repeated pentanucleotides in DNA sequences of eukaryotes. Nucleic Acids Res.223412\u20133417","DOI":"10.1093\/nar\/22.16.3412"},{"key":"2023013107420067000_B7","unstructured":"Castello, A.T., Martins, W., Gao, G.R. 2002TROLL\u2013Tandem Repeat Occurrence locator. Bioinformatics18634\u2013636"},{"key":"2023013107420067000_B8","doi-asserted-by":"crossref","unstructured":"Choo, K.H., Vissel, B., Nagy, A., Earle, E., Kalitsis, P. 1991A survey of the genomic distribution of alpha satellite on all the human chromosomes, and derivation of a new consensus sequence. Nucleic Acids Res.191179\u20131182","DOI":"10.1093\/nar\/19.6.1179"},{"key":"2023013107420067000_B9","unstructured":"Goodstadt, L. and Ponting, C.P. 2001CHROMA: consensus-based coloring of multiple alignments for publication. Bioinformatics17845\u2013846"},{"key":"2023013107420067000_B10","unstructured":"Hauth, A.M. 2002Identification of tandem repeats: simple and complex pattern structures in DNA sequences.   Dissertation University of Wisconsin-Madison"},{"key":"2023013107420067000_B11","unstructured":"Hauth, A.M. and Joseph, D.A. 2002Beyond tandem repeats: complex pattern structures and distant regions of similarity. BioinformaticsS1831\u201337"},{"key":"2023013107420067000_B12","unstructured":"Lee, C., Wevrick, R., Fisher, R.B., Ferguson-Smith, M.A., Lin, C.C. 1997Human centromeric DNAs. Hum. Genet.100291\u2013304"},{"key":"2023013107420067000_B13","doi-asserted-by":"crossref","unstructured":"Maio, J.J. 1971DNA strand reassociation and polyribonucleotide binding in the African green monkey, Cercopithecus aethiops. J. Mol. Biol.56579\u2013595","DOI":"10.1016\/0022-2836(71)90403-7"},{"key":"2023013107420067000_B14","unstructured":"Manuelidis, L. and Wu, J.C. 1978Homology between human and simian repeated DNA. Nature27692\u201394"},{"key":"2023013107420067000_B15","unstructured":"Milosavljevic, A. and Jurka, J. 1993Discovering simple DNA sequences by the algorithmic significance method. Comput. Appl. Biosci.9407\u2013411"},{"key":"2023013107420067000_B16","unstructured":"Puente, A., de la Velasco, E., Perez Jurado, L.A., Hernandez Chico, C., Rijke, F.M., van de Scherer, S.W., Raap, A.K., Cruces, J. 1998Analysis of the monomeric alphoid sequences in the pericentromeric region of human chromosome 7. Cytogenet. Cell Genet.83176\u2013181"},{"key":"2023013107420067000_B17","doi-asserted-by":"crossref","unstructured":"Romanova, L.Y., Deriagin, G.V., Mashkova, T.D., Tumeneva, I.G., Mushegian, A.R., Kisselev, L.L., Alexandrov, I.A. 1996Evidence for selection in evolution of alpha satellite DNA: the central role of CENP-B\/pJ\u03b1 binding region. J. Mol. Biol.261334\u2013340","DOI":"10.1006\/jmbi.1996.0466"},{"key":"2023013107420067000_B18","doi-asserted-by":"crossref","unstructured":"Rosandi\u0107, M., Paar, V., Basar, I. 2003Key-string segmentation algorithm and higher-order repeat 16mer (54 copies) in human alpha satellite DNA in chromosome 7. J. Theor. Biol.22129\u201337","DOI":"10.1006\/jtbi.2003.3165"},{"key":"2023013107420067000_B19","doi-asserted-by":"crossref","unstructured":"Schueler, M.G., Higgins, A.W., Rudd, M.K., Gustashaw, K., Willard, H.F. 2001Genomic and genetic definition of a functional human centromere. Science294109\u2013115","DOI":"10.1126\/science.1065042"},{"key":"2023013107420067000_B20","unstructured":"Taylor, W.R. 1986The classification of amino acid conservation. J. Theor. Biol.119205\u2013218"},{"key":"2023013107420067000_B21","unstructured":"Warburton, P.E. and Willard, H.F. 1996Evolution of centromeric alpha satellite DNA: molecular organization within and between human and primate chromosomes. In Jackson, M., Strachan, T., Dover, G. (Eds.). Human Genome Evolution , Oxford  BIOS,  pp. 121\u2013145"},{"key":"2023013107420067000_B22","doi-asserted-by":"crossref","unstructured":"Warburton, P.E., Waye, J.S., Willard, H.F. 1993Nonrandom localization of recombination events in human alpha satellite repeat unit variants: implications for higher-order structural characteristics within centromeric heterochromatin. Mol. Cell. Biol.136520\u20136529","DOI":"10.1128\/mcb.13.10.6520-6529.1993"},{"key":"2023013107420067000_B23","doi-asserted-by":"crossref","unstructured":"Waye, J.S. and Willard, H.F. 1985Chromosome-specific alpha satellite DNA: nucleotide sequence analysis of the 2.0 kilobasepair repeat from the human X chromosome. Nucleic Acids Res.132731\u20132743","DOI":"10.1093\/nar\/13.8.2731"},{"key":"2023013107420067000_B24","doi-asserted-by":"crossref","unstructured":"Waye, J.S. and Willard, H.F. 1987Nucleotide sequence heterogeneity of alpha satellite repetitive DNA: a survey of alphoid sequences from different human chromosomes. Nucleic Acids Res.157549\u20137569","DOI":"10.1093\/nar\/15.18.7549"},{"key":"2023013107420067000_B25","doi-asserted-by":"crossref","unstructured":"Waye, J.S., England, S.B., Willard, H.F. 1987Genomic organization of alpha satellite DNA on human chromosome 7: evidence for two distinct alphoid domains on a single chromosome. Mol. Cell Biol.7349\u2013356","DOI":"10.1128\/MCB.7.1.349"},{"key":"2023013107420067000_B26","doi-asserted-by":"crossref","unstructured":"Waye, J.S., Durfy, S.J., Pinkel, D., Kenwrick, S., Patterson, M., Davies, K.E., Willard, H.F. 1987Chromosome-specific alpha satellite DNA from human chromosome 1: hierarchical structure and genomic organization of a polymorphic domain spanning several hundred kilobase pairs of centromeric DNA. Genomics143\u201351","DOI":"10.1016\/0888-7543(87)90103-0"},{"key":"2023013107420067000_B27","doi-asserted-by":"crossref","unstructured":"Wevrick, R. and Willard, H.F. 1991Physical map of the centromeric region of human chromosome 7: relationship between two distinct alpha satellite arrays. Nucleic Acids Res.192295\u20132301","DOI":"10.1093\/nar\/19.9.2295"},{"key":"2023013107420067000_B28","doi-asserted-by":"crossref","unstructured":"Wevrick, R., Willard, V.P., Willard, H.F. 1992Structure of DNA near long tandem arrays of alpha satellite DNA at the centromere of human chromosome 7. Genomics14912\u2013923","DOI":"10.1016\/S0888-7543(05)80112-0"},{"key":"2023013107420067000_B29","unstructured":"Willard, H.F. 1985Chromosome-specific organization of human alpha satellite DNA. Am. J. Hum. Genet.37524\u2013532"},{"key":"2023013107420067000_B30","doi-asserted-by":"crossref","unstructured":"Willard, H.F. and Waye, J.S. 1987Hierarchical order in chromosome-specific human alpha satellite DNA. Trends Genet.3192\u2013198","DOI":"10.1016\/0168-9525(87)90232-0"},{"key":"2023013107420067000_B31","doi-asserted-by":"crossref","unstructured":"Willard, H.F. and Waye, J.S. 1987Chromosome-specific subsets of human alpha satellite DNA: analysis of sequence divergence within and between chromosomal subsets and evidence for an ancestral pentameric repeat. J. Mol. Evol.25207\u2013214","DOI":"10.1007\/BF02100014"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/21\/7\/846\/48972342\/bioinformatics_21_7_846.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/21\/7\/846\/48972342\/bioinformatics_21_7_846.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T10:43:58Z","timestamp":1675161838000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/21\/7\/846\/268781"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2004,10,27]]},"references-count":31,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2005,4,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bti072","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2005,4,1]]},"published":{"date-parts":[[2004,10,27]]}}}