{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,5,10]],"date-time":"2024-05-10T09:10:01Z","timestamp":1715332201850},"reference-count":25,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2024,5,10]],"date-time":"2024-05-10T00:00:00Z","timestamp":1715299200000},"content-version":"vor","delay-in-days":6370,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/3.0\/"}],"funder":[{"name":"Council of Scientific and Industrial Research, Government of India"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2006,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>We have applied concepts from information theory for a comparative analysis of donor (gt) and acceptor (ag) splice site regions in the genes of five different organisms by calculating their mutual information content (relative entropy) over a selected block of nucleotides. A similar pattern that the information content decreases as the block size increases was observed for both regions in all the organisms studied. This result suggests that the information required for splicing might be contained in the consensus of ~6\u20138 nt at both regions. We assume from our study that even though the nucleotides are showing some degrees of conservation in the flanking regions of the splice sites, certain level of variability is still tolerated, which leads the splicing process to occur normally even if the extent of base pairing is not fully satisfied. We also suggest that this variability can be compensated by recognizing different splice sites with different spliceosomal factors.<\/jats:p>","DOI":"10.1016\/s1672-0229(07)60003-5","type":"journal-article","created":{"date-parts":[[2007,5,25]],"date-time":"2007-05-25T05:36:27Z","timestamp":1180071387000},"page":"230-237","source":"Crossref","is-referenced-by-count":1,"title":["Comparative Analysis of Splice Site Regions by Information Content"],"prefix":"10.1093","volume":"4","author":[{"given":"T. Shashi","family":"Rekha","sequence":"first","affiliation":[{"name":"Department of Biochemistry, University of Hyderabad , Hyderabad, 500046 , India"}]},{"given":"Chanchal K.","family":"Mitra","sequence":"additional","affiliation":[{"name":"Department of Biochemistry, University of Hyderabad , Hyderabad, 500046 , India"}]}],"member":"286","published-online":{"date-parts":[[2007,5,23]]},"reference":[{"key":"2024051008215611300_bib1","article-title":"Nuclear splicing","author":"Lewin","year":"2000"},{"key":"2024051008215611300_bib2","doi-asserted-by":"crossref","first-page":"505","DOI":"10.1093\/nar\/12.1Part2.505","article-title":"Computational methods to locate signals in nucleic acid sequences","volume":"12","author":"Staden","year":"1984","journal-title":"Nucleic Acids Res."},{"key":"2024051008215611300_bib3","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1016\/0022-2836(91)90380-O","article-title":"Prediction of human mRNA donor and acceptor sites from the DNA sequence","volume":"220","author":"Brunak","year":"1991","journal-title":"J. Mol. Biol."},{"key":"2024051008215611300_bib4","doi-asserted-by":"crossref","first-page":"1185","DOI":"10.1093\/nar\/29.5.1185","article-title":"GeneSplicer: a new computational method for splice site prediction","volume":"29","author":"Pertea","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2024051008215611300_bib5","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1093\/bioinformatics\/bti025","article-title":"Prediction of splice sites with dependency graphs and their expanded Bayesian networks","volume":"21","author":"Chen","year":"2005","journal-title":"Bioinformatics"},{"key":"2024051008215611300_bib6","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1002\/j.1538-7305.1948.tb01338.x","article-title":"A mathematical theory of communication","volume":"27","author":"Shannon","year":"1948","journal-title":"Bell Syst. Tech. J."},{"key":"2024051008215611300_bib7","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/j.plrev.2004.01.002","article-title":"Information theory in molecular biology","volume":"1","author":"Adami","year":"2004","journal-title":"Phys. Life Rev."},{"key":"2024051008215611300_bib8","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511790492","article-title":"Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids","author":"Durbin","year":"1998"},{"key":"2024051008215611300_bib9","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1016\/0022-2836(86)90165-8","article-title":"Information content of binding sites on nucleotide sequences","volume":"188","author":"Schneider","year":"1986","journal-title":"J. Mol. Biol."},{"key":"2024051008215611300_bib10","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1002\/humu.1380060114","article-title":"Using information content and base frequencies to distinguish mutations from genetic polymorphisms in splice junction recognition sites","volume":"6","author":"Rogan","year":"1995","journal-title":"Hum. Mutat."},{"key":"2024051008215611300_bib11","doi-asserted-by":"crossref","first-page":"6312","DOI":"10.1103\/PhysRevE.58.6312","article-title":"Analysis of correlations between sites in models of protein sequences","volume":"58","author":"Giraud","year":"1998","journal-title":"Phys. Rev. E"},{"key":"2024051008215611300_bib12","article-title":"Predicting protein-protein interactions from sequence data","volume-title":"The Chemical Theatre of Biological Systems. Proceedings of the International Beilstein Workshop","author":"Adami","year":"2005"},{"key":"2024051008215611300_bib13","first-page":"52","article-title":"Combinatorial drug design augmented by information theory","volume":"26","author":"Adami","year":"2002","journal-title":"NASA Tech Briefs"},{"key":"2024051008215611300_bib14","first-page":"137","article-title":"1\/f correlations in viral genomes\u2014a Fast-Fourier Transformation (FFT) study","volume":"43","author":"Rekha","year":"2006","journal-title":"Indian J. Biochem. Biophys."},{"key":"2024051008215611300_bib15","doi-asserted-by":"crossref","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","article-title":"Amino acid substitution matrices from protein blocks","volume":"89","author":"Henikoff","year":"1992","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2024051008215611300_bib16","first-page":"345","article-title":"A model of evolutionary change in proteins","volume-title":"Atlas of Protein Sequence and Structure","author":"Dayhoff","year":"1978"},{"key":"2024051008215611300_bib17","doi-asserted-by":"crossref","first-page":"555","DOI":"10.1016\/0022-2836(91)90193-A","article-title":"Amino acid substitution matrices from an information theoretic perspective","volume":"219","author":"Altschul","year":"1991","journal-title":"J. Mol. Biol."},{"key":"2024051008215611300_bib18","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1016\/j.compbiolchem.2005.10.004","article-title":"Comparative analysis of core promoter region: information content from mono and dinucleotide substitution matrices","volume":"30","author":"Reddy","year":"2006","journal-title":"Comput. Biol. Chem."},{"key":"2024051008215611300_bib19","doi-asserted-by":"crossref","first-page":"1124","DOI":"10.1016\/0022-2836(92)90320-J","article-title":"Features of spliceosome evolution and function inferred from an analysis of the information at human splice sites","volume":"228","author":"Stephens","year":"1992","journal-title":"J. Mol. Biol."},{"key":"2024051008215611300_bib20","doi-asserted-by":"crossref","first-page":"1509","DOI":"10.1093\/nar\/18.6.1509","article-title":"Information content of Caenorhabditis elegans splice site sequences varies with intron length","volume":"18","author":"Fields","year":"1990","journal-title":"Nucleic Acids Res."},{"key":"2024051008215611300_bib21","doi-asserted-by":"crossref","first-page":"4255","DOI":"10.1093\/nar\/20.16.4255","article-title":"Splicing signals in Drosophila: intron size, information content, and consensus sequences","volume":"20","author":"Mount","year":"1992","journal-title":"Nucleic Acids Res."},{"key":"2024051008215611300_bib22","doi-asserted-by":"crossref","first-page":"3955","DOI":"10.1093\/nar\/gkl556","article-title":"Comprehensive splice-site analysis using comparative genomics","volume":"34","author":"Sheth","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2024051008215611300_bib23","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1261\/rna.7221605","article-title":"U2AF binding selects for the high conservation of the C. elegans 3\u2032 splice site","volume":"11","author":"Hollins","year":"2005","journal-title":"RNA"},{"key":"2024051008215611300_bib24","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1093\/nar\/28.1.185","article-title":"EID: the Exon-Intron Database\u2014an exhaustive database of protein-coding intron-containing genes","volume":"28","author":"Saxonov","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2024051008215611300_bib25","article-title":"Elements of Information Theory","author":"Cover","year":"1991"}],"container-title":["Genomics, Proteomics &amp; Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.elsevier.com\/content\/article\/PII:S1672022907600035?httpAccept=text\/xml","content-type":"text\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/api.elsevier.com\/content\/article\/PII:S1672022907600035?httpAccept=text\/plain","content-type":"text\/plain","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/academic.oup.com\/gpb\/article-pdf\/4\/4\/230\/57482746\/gpb_4_4_230.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/gpb\/article-pdf\/4\/4\/230\/57482746\/gpb_4_4_230.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,10]],"date-time":"2024-05-10T08:22:27Z","timestamp":1715329347000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/gpb\/article\/4\/4\/230\/7210663"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,12,1]]},"references-count":25,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2006,12,1]]}},"URL":"https:\/\/doi.org\/10.1016\/s1672-0229(07)60003-5","relation":{},"ISSN":["1672-0229","2210-3244"],"issn-type":[{"value":"1672-0229","type":"print"},{"value":"2210-3244","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2006,12]]},"published":{"date-parts":[[2006,12,1]]}}}