{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,20]],"date-time":"2025-10-20T10:14:02Z","timestamp":1760955242169},"reference-count":43,"publisher":"Oxford University Press (OUP)","issue":"24","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,12,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The number of bacterial genomes being sequenced is increasing very rapidly and hence, it is crucial to have procedures for rapid and reliable annotation of their functional elements such as promoter regions, which control the expression of each gene or each transcription unit of the genome. The present work addresses this requirement and presents a generic method applicable across organisms.<\/jats:p>\n               <jats:p>Results: Relative stability of the DNA double helical sequences has been used to discriminate promoter regions from non-promoter regions. Based on the difference in stability between neighboring regions, an algorithm has been implemented to predict promoter regions on a large scale over 913 microbial genome sequences. The average free energy values for the promoter regions as well as their downstream regions are found to differ, depending on their GC content. Threshold values to identify promoter regions have been derived using sequences flanking a subset of translation start sites from all microbial genomes and then used to predict promoters over the complete genome sequences. An average recall value of 72% (which indicates the percentage of protein and RNA coding genes with predicted promoter regions assigned to them) and precision of 56% is achieved over the 913 microbial genome dataset.<\/jats:p>\n               <jats:p>Availability: The binary executable for \u2018PromPredict\u2019 algorithm (implemented in PERL and supported on Linux and MS Windows) and the predicted promoter data for all 913 microbial genomes are available at http:\/\/nucleix.mbu.iisc.ernet.in\/prombase\/.<\/jats:p>\n               <jats:p>Contact: \u00a0mb@mbu.iisc.ernet.in<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq577","type":"journal-article","created":{"date-parts":[[2010,12,1]],"date-time":"2010-12-01T08:08:45Z","timestamp":1291190925000},"page":"3043-3050","source":"Crossref","is-referenced-by-count":44,"title":["High-quality annotation of promoter regions for 913 bacterial genomes"],"prefix":"10.1093","volume":"26","author":[{"given":"Vetriselvi","family":"Rangannan","sequence":"first","affiliation":[{"name":"Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India"}]},{"given":"Manju","family":"Bansal","sequence":"additional","affiliation":[{"name":"Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India"}]}],"member":"286","published-online":{"date-parts":[[2010,10,17]]},"reference":[{"key":"2023012508030829100_B1","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1101\/gr.6991408","article-title":"Generic eukaryotic core promoter prediction using structural features of DNA","volume":"18","author":"Abeel","year":"2008","journal-title":"Genome Res."},{"key":"2023012508030829100_B2","doi-asserted-by":"crossref","first-page":"i24","DOI":"10.1093\/bioinformatics\/btn172","article-title":"ProSOM: core promoter prediction based on unsupervised clustering of DNA physical profiles","volume":"24","author":"Abeel","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508030829100_B3","doi-asserted-by":"crossref","first-page":"10581","DOI":"10.1021\/bi962590c","article-title":"Thermodynamics and NMR of internal G.T mismatches in DNA","volume":"36","author":"Allawi","year":"1997","journal-title":"Biochemistry"},{"key":"2023012508030829100_B4","doi-asserted-by":"crossref","first-page":"e1000057","DOI":"10.1371\/journal.pcbi.1000057","article-title":"Investigations of oligonucleotide usage variance within and between prokaryotes","volume":"4","author":"Bohlin","year":"2008","journal-title":"PLoS Comput. Biol."},{"key":"2023012508030829100_B5","doi-asserted-by":"crossref","first-page":"W259","DOI":"10.1093\/nar\/gkm310","article-title":"SCOPE: a web server for practical de novo motif discovery","volume":"35","author":"Carlson","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012508030829100_B6","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1186\/1471-2105-8-249","article-title":"A novel ensemble learning method for de novo computational identification of DNA binding sites","volume":"8","author":"Chakravarty","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012508030829100_B7","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1016\/S0006-291X(03)00973-2","article-title":"Seven GC-rich microbial genomes adopt similar codon usage patterns regardless of their phylogenetic lineages","volume":"306","author":"Chen","year":"2003","journal-title":"Biochem. Biophys. Res. Commun."},{"key":"2023012508030829100_B8","doi-asserted-by":"crossref","first-page":"1895","DOI":"10.1073\/pnas.58.5.1895","article-title":"Altered base ratios in the DNA of an Escherichia coli mutator strain","volume":"58","author":"Cox","year":"1967","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508030829100_B9","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1016\/S0022-2836(99)80005-9","article-title":"Prediction of rho-independent Escherichia coli transcription terminators. A statistical analysis of their RNA stem-loop structures","volume":"216","author":"d'Aubenton Carafa","year":"1990","journal-title":"J. Mol. Biol."},{"issue":"Suppl. 7","key":"2023012508030829100_B10","doi-asserted-by":"crossref","first-page":"S21","DOI":"10.1186\/1471-2105-8-S7-S21","article-title":"A survey of DNA motif finding algorithms","volume":"8","author":"Das","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012508030829100_B11","doi-asserted-by":"crossref","first-page":"e9841","DOI":"10.1371\/journal.pone.0009841","article-title":"Abundant oligonucleotides common to most bacteria","volume":"5","author":"Davenport","year":"2010","journal-title":"PLoS One"},{"key":"2023012508030829100_B12","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1186\/1471-2105-9-233","article-title":"Triad pattern algorithm for predicting strong promoter candidates in bacterial genomes","volume":"9","author":"Dekhtyar","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012508030829100_B13","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1101\/gr.6905408","article-title":"Genome-wide analysis reveals regulatory role of G4 DNA in gene transcription","volume":"18","author":"Du","year":"2008","journal-title":"Genome Res."},{"key":"2023012508030829100_B14","doi-asserted-by":"crossref","first-page":"1208","DOI":"10.1038\/sj.embor.7400538","article-title":"Environments shape the nucleotide composition of genomes","volume":"6","author":"Foerstner","year":"2005","journal-title":"EMBO Rep."},{"key":"2023012508030829100_B15","doi-asserted-by":"crossref","first-page":"D120","DOI":"10.1093\/nar\/gkm994","article-title":"RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation","volume":"36","author":"Gama-Castro","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012508030829100_B16","doi-asserted-by":"crossref","first-page":"2006","DOI":"10.1093\/bioinformatics\/btp359","article-title":"A pattern-based nearest neighbor search approach for promoter prediction using DNA structural profiles","volume":"5","author":"Gan","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012508030829100_B17","doi-asserted-by":"crossref","first-page":"1964","DOI":"10.1093\/bioinformatics\/btg265","article-title":"Sequence alignment kernel for recognition of promoter regions","volume":"19","author":"Gordon","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012508030829100_B18","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1093\/bioinformatics\/bti771","article-title":"Improved prediction of bacterial transcription start sites","volume":"22","author":"Gordon","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012508030829100_B19","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1016\/S1097-2765(00)80477-3","article-title":"The mechanism of intrinsic transcription termination","volume":"3","author":"Gusarov","year":"1999","journal-title":"Mol. Cell"},{"key":"2023012508030829100_B20","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1007\/s11693-006-9003-3","article-title":"Machine learning for regulatory analysis and transcription factor target prediction in yeast","volume":"1","author":"Holloway","year":"2007","journal-title":"Syst. Synth. Biol."},{"key":"2023012508030829100_B21","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1186\/1471-2105-7-423","article-title":"Detection of prokaryotic promoters from the genomic distribution of hexanucleotide pairs","volume":"7","author":"Jacques","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012508030829100_B22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-6-1","article-title":"A novel method for prokaryotic promoter prediction based on DNA stability","volume":"6","author":"Kanhere","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023012508030829100_B23","doi-asserted-by":"crossref","first-page":"3165","DOI":"10.1093\/nar\/gki627","article-title":"Structural properties of promoters: similarities and differences between prokaryotes and eukaryotes","volume":"33","author":"Kanhere","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012508030829100_B24","doi-asserted-by":"crossref","first-page":"e12","DOI":"10.1093\/nar\/gkl1024","article-title":"A pHMM-ANN based discriminative approach to promoter identification in prokaryote genomic contexts","volume":"35","author":"Mann","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012508030829100_B25","doi-asserted-by":"crossref","first-page":"e7526","DOI":"10.1371\/journal.pone.0007526","article-title":"Genome-wide identification of transcription start sites, promoters and transcription factor binding sites in E. coli","volume":"4","author":"Mendoza-Vargas","year":"2009","journal-title":"PLoS One"},{"key":"2023012508030829100_B26","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1101\/gr.6759507","article-title":"Universal patterns of purifying selection at noncoding positions in bacteria","volume":"18","author":"Molina","year":"2008","journal-title":"Genome Res."},{"key":"2023012508030829100_B27","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1186\/1471-2164-9-335","article-title":"Large gene overlaps in prokaryotic genomes: result of functional constraints or mispredictions?","volume":"9","author":"Palleja","year":"2008","journal-title":"BMC Genomics"},{"key":"2023012508030829100_B28","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1186\/1471-2164-10-281","article-title":"PairWise Neighbours database: overlaps and spacers among prokaryote genomes","volume":"10","author":"Palleja","year":"2009","journal-title":"BMC Genomics"},{"key":"2023012508030829100_B29","doi-asserted-by":"crossref","first-page":"3203","DOI":"10.1128\/JB.00122-09","article-title":"Structure and complexity of a bacterial transcriptome","volume":"191","author":"Passalacqua","year":"2009","journal-title":"J. Bacteriol."},{"key":"2023012508030829100_B30","doi-asserted-by":"crossref","first-page":"851","DOI":"10.1007\/s12038-007-0085-1","article-title":"Identification and annotation of promoter regions in microbial genome sequences on the basis of DNA stability","volume":"32","author":"Rangannan","year":"2007","journal-title":"J. Biosci."},{"key":"2023012508030829100_B31","doi-asserted-by":"crossref","first-page":"1758","DOI":"10.1039\/b906535k","article-title":"Relative stability of DNA as a generic criterion for promoter prediction: whole genome annotation of microbial genomes with varying nucleotide base composition","volume":"5","author":"Rangannan","year":"2009","journal-title":"Mol. Biosyst."},{"key":"2023012508030829100_B32","doi-asserted-by":"crossref","first-page":"644","DOI":"10.1101\/gr.4508806","article-title":"Genome-wide prediction of G4 DNA as regulatory motifs: role in Escherichia coli global regulation","volume":"16","author":"Rawal","year":"2006","journal-title":"Genome Res."},{"key":"2023012508030829100_B33","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1016\/S0097-8485(01)00099-7","article-title":"Application of a time-delay neural network to promoter annotation in the Drosophila melanogaster genome","volume":"26","author":"Reese","year":"2001","journal-title":"Comput. Chem."},{"key":"2023012508030829100_B34","doi-asserted-by":"crossref","first-page":"4264","DOI":"10.1093\/nar\/gkf549","article-title":"Congruent evolution of different classes of non-coding DNA in prokaryotic genomes","volume":"30","author":"Rogozin","year":"2002","journal-title":"Nucleic Acids Res."},{"key":"2023012508030829100_B35","doi-asserted-by":"crossref","first-page":"1460","DOI":"10.1073\/pnas.95.4.1460","article-title":"A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics","volume":"95","author":"SantaLucia","year":"1998","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508030829100_B36","doi-asserted-by":"crossref","first-page":"3332","DOI":"10.1093\/nar\/gkn135","article-title":"Large-scale computational and statistical analyses of high transcription potentialities in 32 prokaryotic genomes","volume":"36","author":"Sinoquet","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012508030829100_B37","doi-asserted-by":"crossref","first-page":"3540","DOI":"10.1093\/nar\/gkg525","article-title":"PromH: Promoters identification using orthologous genomic sequences","volume":"31","author":"Solovyev","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012508030829100_B38","doi-asserted-by":"crossref","first-page":"1757","DOI":"10.1128\/JB.185.6.1757-1767.2003","article-title":"Domain architectures of sigma54-dependent transcriptional activators","volume":"185","author":"Studholme","year":"2003","journal-title":"J. Bacteriol."},{"key":"2023012508030829100_B39","doi-asserted-by":"crossref","first-page":"582","DOI":"10.1073\/pnas.48.4.582","article-title":"On the genetic basis of variation and heterogeneity of DNA base composition","volume":"48","author":"Sueoka","year":"1962","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508030829100_B40","doi-asserted-by":"crossref","first-page":"3907","DOI":"10.1093\/nar\/gki699","article-title":"A-tract clusters may facilitate DNA packaging in bacterial nucleoid","volume":"33","author":"Tolstorukov","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012508030829100_B41","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1186\/1471-2105-7-248","article-title":"Promoter prediction and annotation of microbial genomes based on DNA sequence and structural responses to superhelical stress","volume":"7","author":"Wang","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012508030829100_B42","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1101\/gr.100396.109","article-title":"A single-base resolution map of an archaeal transcriptome","volume":"20","author":"Wurtzel","year":"2010","journal-title":"Genome Res."},{"key":"2023012508030829100_B43","doi-asserted-by":"crossref","first-page":"D381","DOI":"10.1093\/nar\/gkm781","article-title":"QuadBase: genome-wide database of G4 DNA\u2013occurrence and conservation in human, chimpanzee, mouse and rat promoters and 146 microbes","volume":"36","author":"Yadav","year":"2008","journal-title":"Nucleic Acids Res."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/24\/3043\/48854263\/bioinformatics_26_24_3043.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/24\/3043\/48854263\/bioinformatics_26_24_3043.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:05:26Z","timestamp":1674633926000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/24\/3043\/287972"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,10,17]]},"references-count":43,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2010,12,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq577","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,12,15]]},"published":{"date-parts":[[2010,10,17]]}}}