{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T10:12:30Z","timestamp":1761559950722},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2005,1,17]],"date-time":"2005-01-17T00:00:00Z","timestamp":1105920000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0\/"},{"start":{"date-parts":[[2005,1,17]],"date-time":"2005-01-17T00:00:00Z","timestamp":1105920000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0\/"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                        <jats:title>Background<\/jats:title>\n                        <jats:p>Several problems exist with current methods used to align DNA sequences for comparative sequence analysis. Most dynamic programming algorithms assume that conserved sequence elements are collinear. This assumption appears valid when comparing orthologous protein coding sequences. Functional constraints on proteins provide strong selective pressure against sequence inversions, and minimize sequence duplications and feature shuffling. For non-coding sequences this collinearity assumption is often invalid. For example, enhancers contain clusters of transcription factor binding sites that change in number, orientation, and spacing during evolution yet the enhancer retains its activity. Dot plot analysis is often used to estimate non-coding sequence relatedness. Yet dot plots do not actually align sequences and thus cannot account well for base insertions or deletions. Moreover, they lack an adequate statistical framework for comparing sequence relatedness and are limited to pairwise comparisons. Lastly, dot plots and dynamic programming text outputs fail to provide an intuitive means for visualizing DNA alignments.<\/jats:p>\n                     <\/jats:sec><jats:sec>\n                        <jats:title>Results<\/jats:title>\n                        <jats:p>To address some of these issues, we created a stand alone, platform independent, graphic alignment tool for comparative sequence analysis (GATA <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"http:\/\/gata.sourceforge.net\/\">http:\/\/gata.sourceforge.net\/<\/jats:ext-link>). GATA uses the NCBI-BLASTN program and extensive post-processing to identify all small sub-alignments above a low cut-off score. These are graphed as two shaded boxes, one for each sequence, connected by a line using the coordinate system of their parent sequence. Shading and colour are used to indicate score and orientation. A variety of options exist for querying, modifying and retrieving conserved sequence elements. Extensive gene annotation can be added to both sequences using a standardized General Feature Format (GFF) file.<\/jats:p>\n                     <\/jats:sec><jats:sec>\n                        <jats:title>Conclusions<\/jats:title>\n                        <jats:p>GATA uses the NCBI-BLASTN program in conjunction with post-processing to exhaustively align two DNA sequences. It provides researchers with a fine-grained alignment and visualization tool aptly suited for non-coding, 0\u2013200 kb, pairwise, sequence analysis. It functions independent of sequence feature ordering or orientation, and readily visualizes both large and small sequence inversions, duplications, and segment shuffling. Since the alignment is visual and does not contain gaps, gene annotation can be added to both sequences to create a thoroughly descriptive picture of DNA conservation that is well suited for comparative sequence analysis.<\/jats:p>\n                     <\/jats:sec>","DOI":"10.1186\/1471-2105-6-9","type":"journal-article","created":{"date-parts":[[2005,1,18]],"date-time":"2005-01-18T08:25:11Z","timestamp":1106036711000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":57,"title":["GATA: a graphic alignment tool for comparative sequence analysis"],"prefix":"10.1186","volume":"6","author":[{"given":"David A","family":"Nix","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael B","family":"Eisen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2005,1,17]]},"reference":[{"issue":"1","key":"334_CR1","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1016\/0022-2836(81)90087-5","volume":"147","author":"TF Smith","year":"1981","unstructured":"Smith TF, Waterman MS: Identification of common molecular subsequences. J Mol Biol 1981, 147(1):195\u2013197. 10.1016\/0022-2836(81)90087-5","journal-title":"J Mol Biol"},{"key":"334_CR2","doi-asserted-by":"publisher","first-page":"443","DOI":"10.1016\/0022-2836(70)90057-4","volume":"48","author":"SB Needleman","year":"1970","unstructured":"Needleman SB, Wunsch CD: A general method applicable to the search for similarities in the amino acid sequences of two proteins. J Mol Biol 1970, 48: 443\u2013453.","journal-title":"J Mol Biol"},{"key":"334_CR3","doi-asserted-by":"publisher","first-page":"634","DOI":"10.1016\/S0959-437X(02)00355-6","volume":"12","author":"MZ LudWig","year":"2002","unstructured":"LudWig MZ: Functional evolution of noncoding DNA. Curr Opin Genet Dev 2002, 12: 634\u2013639. 10.1016\/S0959-437X(02)00355-6","journal-title":"Curr Opin Genet Dev"},{"issue":"5","key":"334_CR4","doi-asserted-by":"publisher","first-page":"601","DOI":"10.1016\/S0959-437X(02)00345-3","volume":"12","author":"M Markstein","year":"2002","unstructured":"Markstein M, Levine M: Decoding cis-regulatory DNAs in the Drosophila genome. Curr Opin Genet Dev 2002, 12(5):601\u2013606. 10.1016\/S0959-437X(02)00345-3","journal-title":"Curr Opin Genet Dev"},{"issue":"9","key":"334_CR5","doi-asserted-by":"publisher","first-page":"1377","DOI":"10.1093\/molbev\/msg140","volume":"20","author":"GA Wray","year":"2003","unstructured":"Wray GA, Hahn MW, Abouheif E, Balhoff JP, Pizer M, Rockman MV, Romano LA: The evolution of transcriptional regulation in eukaryotes. Mol Biol Evol 2003, 20(9):1377\u20131419. 10.1093\/molbev\/msg140","journal-title":"Mol Biol Evol"},{"issue":"6","key":"334_CR6","doi-asserted-by":"publisher","first-page":"397","DOI":"10.1046\/j.1525-142X.2001.01043.x","volume":"3","author":"AP McGregor","year":"2001","unstructured":"McGregor AP, Shaw PJ, Hancock JM, Bopp D, Hediger M, Wratten NS, Dover GA: Rapid restructuring of bicoid-dependent hunchback promoters within and between Dipteran species: implications for molecular coevolution. Evol Dev 2001, 3(6):397\u2013407. 10.1046\/j.1525-142X.2001.01043.x","journal-title":"Evol Dev"},{"issue":"5","key":"334_CR7","doi-asserted-by":"crossref","first-page":"949","DOI":"10.1242\/dev.125.5.949","volume":"125","author":"MZ Ludwig","year":"1998","unstructured":"Ludwig MZ, Patel NH, Kreitman M: Functional analysis of eve stripe 2 enhancer evolution in Drosophila: rules governing conservation and change. Development 1998, 125(5):949\u2013958.","journal-title":"Development"},{"issue":"6769","key":"334_CR8","doi-asserted-by":"publisher","first-page":"564","DOI":"10.1038\/35000615","volume":"403","author":"MZ Ludwig","year":"2000","unstructured":"Ludwig MZ, Bergman C, Patel NH, Kreitman M: Evidence for stabilizing selection in a eukaryotic enhancer element. Nature 2000, 403(6769):564\u2013567. 10.1038\/35000615","journal-title":"Nature"},{"issue":"2","key":"334_CR9","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1111\/j.1574-6968.1999.tb13575.x","volume":"174","author":"TA Tatusova","year":"1999","unstructured":"Tatusova TA, Madden TL: BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. FEMS Microbiol Lett 1999, 174(2):247\u2013250. 10.1016\/S0378-1097(99)00149-4","journal-title":"FEMS Microbiol Lett"},{"issue":"3","key":"334_CR10","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","volume":"215","author":"SF Altschul","year":"1990","unstructured":"Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215(3):403\u2013410. 10.1006\/jmbi.1990.9999","journal-title":"J Mol Biol"},{"issue":"12","key":"334_CR11","doi-asserted-by":"publisher","first-page":"research0083.1","DOI":"10.1186\/gb-2002-3-12-research0083","volume":"3","author":"S Misra","year":"2002","unstructured":"Misra S, Crosby MA, Mungall CJ, Matthews BB, Campbell KS, Hradecky P, Huang Y, Kaminker JS, Millburn GH, Prochnik SE, Smith CD, Tupy JL, Whitfied EJ, Bayraktaroglu L, Berman BP, Bettencourt BR, Celniker SE, de Grey AD, Drysdale RA, Harris NL, Richter J, Russo S, Schroeder AJ, Shu SQ, Stapleton M, Yamada C, Ashburner M, Gelbart WM, Rubin GM, Lewis SE: Annotation of the Drosophila melanogaster euchromatic genome: a systematic review. Genome Biol 2002, 3(12):research0083.1\u20130083.22. 10.1186\/gb-2002-3-12-research0083","journal-title":"Genome Biol"},{"key":"334_CR12","unstructured":"Nix DA: FlyCatcher.[http:\/\/rana.lbl.gov\/FlyCatcher\/]"},{"issue":"6","key":"334_CR13","first-page":"507","volume":"12","author":"EL Sonnhammer","year":"1996","unstructured":"Sonnhammer EL, Durbin R: A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. Comput Appl Biosci 1996, 12(6):507\u2013510.","journal-title":"Comput Appl Biosci"},{"issue":"2","key":"334_CR14","doi-asserted-by":"publisher","first-page":"279","DOI":"10.1093\/bioinformatics\/btg406","volume":"20","author":"R Brodie","year":"2004","unstructured":"Brodie R, Roper RL, Upton C: JDotter: a Java interface to multiple dot plots generated by dotter. Bioinformatics 2004, 20(2):279\u2013281. 10.1093\/bioinformatics\/btg406","journal-title":"Bioinformatics"},{"key":"334_CR15","unstructured":"Pagni M, Junier T: Dotlet.[http:\/\/www.isrec.isb-sib.ch\/java\/dotlet\/Dotlet.html]"},{"issue":"1","key":"334_CR16","doi-asserted-by":"publisher","first-page":"86","DOI":"10.1006\/dbio.2002.0619","volume":"246","author":"CT Brown","year":"2002","unstructured":"Brown CT, Rust AG, Clarke PJ, Pan Z, Schilstra MJ, De Buysscher T, Griffin G, Wold BJ, Cameron RA, Davidson EH, Bolouri H: New computational approaches for analysis of cis-regulatory networks. Dev Biol 2002, 246(1):86\u2013102. 10.1006\/dbio.2002.0619","journal-title":"Dev Biol"},{"issue":"10","key":"334_CR17","doi-asserted-by":"publisher","first-page":"944","DOI":"10.1093\/bioinformatics\/16.10.944","volume":"16","author":"K Rutherford","year":"2000","unstructured":"Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream M-A, Barrell B: Artemis: sequence visualisation and annotation. Bioinformatics 2000, 16(10):944\u2013945. 10.1093\/bioinformatics\/16.10.944","journal-title":"Bioinformatics"},{"key":"334_CR18","unstructured":"ACT: Artemis Comparison Tool[http:\/\/www.sanger.ac.uk\/Software\/ACT\/]"},{"issue":"6","key":"334_CR19","first-page":"507","volume":"12","author":"L Duret","year":"1996","unstructured":"Duret L, Gasteiger E, Perriere G: LALNVIEW: a graphical viewer for pairwise sequence alignments. Comput Appl Biosci 1996, 12(6):507\u2013510.","journal-title":"Comput Appl Biosci"},{"key":"334_CR20","unstructured":"Pearson WR: PLALIGN.[http:\/\/fasta.bioch.virginia.edu\/fasta_www\/plalign.htm]"},{"issue":"Suppl 1","key":"334_CR21","doi-asserted-by":"publisher","first-page":"i54","DOI":"10.1093\/bioinformatics\/btg1005","volume":"19","author":"M Brudno","year":"2003","unstructured":"Brudno M, Malde S, Poliakov A, Do CB, Couronne O, Dubchak I, Batzoglou S: Glocal alignment: finding rearrangements during alignment. Bioinformatics 2003, 19(Suppl 1):i54\u201362. 10.1093\/bioinformatics\/btg1005","journal-title":"Bioinformatics"},{"issue":"4","key":"334_CR22","doi-asserted-by":"publisher","first-page":"685","DOI":"10.1101\/gr.2067704","volume":"14","author":"M Brudno","year":"2004","unstructured":"Brudno M, Poliakov A, Salamov A, Cooper GM, Sidow A, Rubin EM, Solovyev V, Batzoglou S, Dubchak I: Automated whole-genome multiple alignment of rat, mouse, and human. Genome Res 2004, 14(4):685\u201392. 10.1101\/gr.2067704","journal-title":"Genome Res"},{"issue":"5","key":"334_CR23","doi-asserted-by":"publisher","first-page":"636","DOI":"10.1093\/bioinformatics\/btg459","volume":"20","author":"N Shah","year":"2004","unstructured":"Shah N, Couronne O, Pennacchio LA, Brudno M, Batzoglou S, Bethel EW, Rubin EM, Hamann B, Dubchak I: Phylo-VISTA: interactive visualization of multiple DNA sequence alignments. Bioinformatics 2004, 20(5):636\u2013643. 10.1093\/bioinformatics\/btg459","journal-title":"Bioinformatics"},{"issue":"4","key":"334_CR24","doi-asserted-by":"publisher","first-page":"716","DOI":"10.1101\/gr.1957004","volume":"14","author":"K Chakrabarti","year":"2004","unstructured":"Chakrabarti K, Pachter L: Visualization of multiple genome annotations and alignments with the K-BROWSER. Genome Res 2004, 14(4):716\u2013720. 10.1101\/gr.1957004","journal-title":"Genome Res"},{"issue":"4","key":"334_CR25","doi-asserted-by":"publisher","first-page":"693","DOI":"10.1101\/gr.1960404","volume":"14","author":"N Bray","year":"2004","unstructured":"Bray N, Pachter L: MAVID: Constrained Ancestral Alignment of Multiple Sequences. Genome Res 2004, 14(4):693\u2013699. 10.1101\/gr.1960404","journal-title":"Genome Res"},{"issue":"7","key":"334_CR26","doi-asserted-by":"publisher","first-page":"1394","DOI":"10.1101\/gr.2289704","volume":"14","author":"AC Darling","year":"2004","unstructured":"Darling AC, Mau B, Blattner FR, Perna NT: Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 2004, 14(7):1394\u20131403. 10.1101\/gr.2289704","journal-title":"Genome Res"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-6-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/1471-2105-6-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-6-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,7]],"date-time":"2024-10-07T12:04:21Z","timestamp":1728302661000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-6-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,1,17]]},"references-count":26,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2005,12]]}},"alternative-id":["334"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-6-9","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2005,1,17]]},"assertion":[{"value":"30 July 2004","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 January 2005","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 January 2005","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"9"}}