{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T22:05:53Z","timestamp":1775945153305,"version":"3.50.1"},"reference-count":27,"publisher":"Oxford University Press (OUP)","issue":"6","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,3,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: One of the main tasks of DNA sequence analysis is identification of repetitive patterns. DNA symbol repetitions play a key role in a number of applications, including prediction of gene and exon locations, identification of diseases, reconstruction of human evolutionary history and DNA forensics.<\/jats:p><jats:p>Results: A new approach towards identification of tandem repeats in DNA sequences is proposed. The approach is a refinement of previously considered method, based on the complex periodicity transform. The refinement is obtained, among others, by mapping of DNA symbols to pure quaternions. This mapping results in an enhanced, symbol-balanced sensitivity of the transform to DNA patterns, and an unambiguous threshold selection criterion. Computational efficiency of the transform is further improved, and coupling of the computation with the period value is removed, thereby facilitating parallel implementation of the algorithm. Additionally, a post-processing stage is inserted into the algorithm, enabling unambiguous display of results in a convenient graphical format. 
Comparison of the quaternionic periodicity transform with two well-known pattern detection techniques shows that the new approach is competitive with these two techniques in detection of exact and approximate repeats.<\/jats:p><jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btl674","type":"journal-article","created":{"date-parts":[[2007,1,20]],"date-time":"2007-01-20T01:12:50Z","timestamp":1169255570000},"page":"694-700","source":"Crossref","is-referenced-by-count":30,"title":["Quaternionic periodicity transform: an algebraic solution to the tandem repeat detection problem"],"prefix":"10.1093","volume":"23","author":[{"given":"Andrzej K.","family":"Brodzik","sequence":"first","affiliation":[{"name":"The MITRE Corporation, Bedford MA 01730"}]}],"member":"286","published-online":{"date-parts":[[2007,1,19]]},"reference":[{"key":"2023041107504796400_","first-page":"8","article-title":"Genomic signal processing,","volume-title":"IEEE Trans. 
SP","author":"Anastassiou","year":"2001"},{"key":"2023041107504796400_","first-page":"439","article-title":"What can we learn with wavelets about DNA sequences?","volume-title":"Physica A","author":"Arneodo","year":"1998"},{"key":"2023041107504796400_","first-page":"573","article-title":"Tandem repeats finder: a program to analyze DNA sequences.","volume-title":"Nucleic Acids Res.","author":"Benson","year":"1999"},{"key":"2023041107504796400_","doi-asserted-by":"crossref","DOI":"10.1109\/ICASSP.2005.1416318","article-title":"Symbol-Balanced Quaternionic Periodicity Transform for Latent Pattern Detection in DNA Sequences","author":"Brodzik","year":"2005"},{"key":"2023041107504796400_","first-page":"413","article-title":"Extrapolation of band-limited signals and the finite Zak transform.","volume-title":"Signal Processing","author":"Brodzik","year":"2000"},{"key":"2023041107504796400_","first-page":"678","article-title":"Location of a major susceptibility locus for familial schizophrenia on chromosome 1q21\u2013q22,","volume-title":"Science","author":"Brzustowicz","year":"2000"},{"key":"2023041107504796400_","first-page":"2280","article-title":"Detection and visualization of tandem repeats in DNA sequences.","volume-title":"IEEE Trans. 
SP","author":"Buchner","year":"2003"},{"key":"2023041107504796400_","volume-title":"Forensic DNA Typing: Biology and Technology Behind STR Markers","author":"Butler","year":"2003"},{"key":"2023041107504796400_","first-page":"1423","article-title":"Friedreich's Ataxia: autosomal recessive disease caused by an intronic GAA triplet repeat expansion.","volume-title":"Science","author":"Campuzano","year":"1996"},{"key":"2023041107504796400_","first-page":"1976","article-title":"Microbial Forensics\u2014cross-examining pathogens,","volume-title":"Science","author":"Cummings","year":"2002"},{"key":"2023041107504796400_","first-page":"1256","article-title":"An unstable triplet repeat in a gene related to myotonic muscular dystrophy,","volume-title":"Science","author":"Fu","year":"1992"},{"key":"2023041107504796400_","first-page":"520","article-title":"Myelin basic protein gene is associated with MS in DR4- and DR5-positive Italians and Russians.","volume-title":"Neurology","author":"Guerini","year":"2003"},{"key":"2023041107504796400_","volume-title":"Elements of Quaternions","author":"Hamilton","year":"1866"},{"key":"2023041107504796400_","first-page":"971","article-title":"A novel gene containing a trinucleotide repeat that is expanded and unstable on Huntington's disease chromosomes.","volume-title":"Cell","author":"Huntington's Disease Collaborative Research Group","year":"1993"},{"key":"2023041107504796400_","doi-asserted-by":"crossref","first-page":"S31","DOI":"10.1093\/bioinformatics\/18.suppl_1.S31","article-title":"Beyond tandem repeats: complex structures and distant regions of similarity,","volume":"18","author":"Hauth","year":"2002","journal-title":"Bioinformatics"},{"key":"2023041107504796400_","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4612-3650-4","volume-title":"Hypercomplex Numbers: An Elementary Introduction to 
Algebras","author":"Kantor","year":"1989"},{"key":"2023041107504796400_","doi-asserted-by":"crossref","first-page":"2702","DOI":"10.1093\/bioinformatics\/bth311","article-title":"Exhaustive whole-genome tandem repeats search,","volume":"20","author":"Krishnan","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041107504796400_","first-page":"860","article-title":"Initial sequencing and analysis of the human genome.","volume-title":"Nature","author":"Lander","year":"2001"},{"key":"2023041107504796400_","first-page":"921","article-title":"Interleukin-6 gene alleles affect the risk of Alzheimer's disease and levels of the cytokine in blood and brain.","volume-title":"Neurobiol. Aging","author":"Licastro","year":"2003"},{"key":"2023041107504796400_","first-page":"1979","article-title":"Fourier transforms of color images using quaternion or hypercomplex numbers.","volume-title":"Electron. Lett.","author":"Sangwine","year":"1996"},{"key":"2023041107504796400_","first-page":"2953","article-title":"Periodicity transforms.","volume-title":"IEEE Trans. 
SP","author":"Sethares","year":"1999"},{"key":"2023041107504796400_","doi-asserted-by":"crossref","first-page":"1405","DOI":"10.1093\/bioinformatics\/bth103","article-title":"Spectral Repeat Finder (SRF): identification of repetitive sequences using Fourier transformation.","volume":"20","author":"Sharma","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041107504796400_","first-page":"1054","article-title":"Nucleic acid-based methods for the detection of cancer,","volume-title":"Science","author":"Sidransky","year":"1997"},{"key":"2023041107504796400_","article-title":"Statistical significance of patterns in biosequences","author":"Stolovitzky"},{"key":"2023041107504796400_","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-662-04621-0","volume-title":"Geometric Computing with Clifford Algebras","author":"Sommer","year":"2001"},{"key":"2023041107504796400_","first-page":"117","article-title":"Some statistical aspects of the primary structure of nucleotide sequences.","volume-title":"Mathematical Methods for DNA Sequences","author":"Tavare","year":"1989"},{"key":"2023041107504796400_","doi-asserted-by":"crossref","first-page":"901","DOI":"10.1086\/303068","article-title":"Short tandem-repeat polymorphism\/alu haplotype variation at the PLAT locus: implications for modern human origins.","volume":"67","author":"Tishkoff","year":"2000","journal-title":"Am. J. Hum. 
Genet."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/6\/694\/49822508\/bioinformatics_23_6_694.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/6\/694\/49822508\/bioinformatics_23_6_694.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,10]],"date-time":"2024-02-10T09:42:49Z","timestamp":1707558169000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/6\/694\/416696"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,1,19]]},"references-count":27,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2007,3,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btl674","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,3,15]]},"published":{"date-parts":[[2007,1,19]]}}}