{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T13:51:16Z","timestamp":1768485076826,"version":"3.49.0"},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":2685,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,6,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: G-quadruplexes are stable four-stranded guanine-rich structures that can form in DNA and RNA. They are an important component of human telomeres and play a role in the regulation of transcription and translation. The biological significance of a G-quadruplex is crucially linked with its thermodynamic stability. Hence the prediction of G-quadruplex stability is of vital interest.<\/jats:p>\n               <jats:p>Results: In this article, we present a novel Bayesian prediction framework based on Gaussian process regression to determine the thermodynamic stability of previously unmeasured G-quadruplexes from the sequence information alone. We benchmark our approach on a large G-quadruplex dataset and compare our method to alternative approaches. Furthermore, we propose an active learning procedure which can be used to iteratively acquire data in an optimal fashion. Lastly, we demonstrate the usefulness of our procedure on a genome-wide study of quadruplexes in the human genome.<\/jats:p>\n               <jats:p>Availability: A data table with the training sequences is available as supplementary material. Source code is available online at http:\/\/www.inference.phy.cam.ac.uk\/os252\/projects\/quadruplexes<\/jats:p>\n               <jats:p>Contact: \u00a0os252@cam.ac.uk; jlh29@cam.ac.uk<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp210","type":"journal-article","created":{"date-parts":[[2009,5,28]],"date-time":"2009-05-28T15:48:54Z","timestamp":1243525734000},"page":"i374-i1382","source":"Crossref","is-referenced-by-count":98,"title":["Predicting and understanding the stability of G-quadruplexes"],"prefix":"10.1093","volume":"25","author":[{"given":"Oliver","family":"Stegle","sequence":"first","affiliation":[{"name":"1 Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK and 2Laboratoire de Biophysique, Museum National d'Histoire Naturelle USM503, INSERM U565, CNRS UMR 5153 43 Rue Cuvier, 75231 Paris Cedex 05, France"}]},{"given":"Linda","family":"Payet","sequence":"additional","affiliation":[{"name":"1 Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK and 2Laboratoire de Biophysique, Museum National d'Histoire Naturelle USM503, INSERM U565, CNRS UMR 5153 43 Rue Cuvier, 75231 Paris Cedex 05, France"}]},{"given":"Jean-Louis","family":"Mergny","sequence":"additional","affiliation":[{"name":"1 Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK and 2Laboratoire de Biophysique, Museum National d'Histoire Naturelle USM503, INSERM U565, CNRS UMR 5153 43 Rue Cuvier, 75231 Paris Cedex 05, France"}]},{"given":"David J. C.","family":"MacKay","sequence":"additional","affiliation":[{"name":"1 Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK and 2Laboratoire de Biophysique, Museum National d'Histoire Naturelle USM503, INSERM U565, CNRS UMR 5153 43 Rue Cuvier, 75231 Paris Cedex 05, France"}]},{"given":"Julian Leon","family":"Huppert","sequence":"additional","affiliation":[{"name":"1 Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK and 2Laboratoire de Biophysique, Museum National d'Histoire Naturelle USM503, INSERM U565, CNRS UMR 5153 43 Rue Cuvier, 75231 Paris Cedex 05, France"}]}],"member":"286","published-online":{"date-parts":[[2009,5,27]]},"reference":[{"key":"2023013112021673500_B1","volume-title":"Pattern Recognition and Machine Learning.","author":"Bishop","year":"2006"},{"key":"2023013112021673500_B2","doi-asserted-by":"crossref","first-page":"11094","DOI":"10.1021\/ja0608040","article-title":"Quadruplex-based molecular beacons as tunable DNA probes","volume":"128","author":"Bourdoncle","year":"2006","journal-title":"J. Am. Chem. Soc."},{"key":"2023013112021673500_B3","doi-asserted-by":"crossref","first-page":"689","DOI":"10.1021\/bi701873c","article-title":"A sequence-independent study of the influence of short loop lengths on the stability and topology of intramolecular DNA G-quadruplexes","volume":"47","author":"Bugaut","year":"2008","journal-title":"Biochemistry"},{"key":"2023013112021673500_B4","doi-asserted-by":"crossref","first-page":"5402","DOI":"10.1093\/nar\/gkl655","article-title":"Quadruplex DNA: sequence, topology and structure","volume":"34","author":"Burge","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023013112021673500_B5","doi-asserted-by":"crossref","first-page":"3385","DOI":"10.1093\/bioinformatics\/bti526","article-title":"Biomarker discovery in microarray gene expression data with Gaussian processes","volume":"21","author":"Chu","year":"2005","journal-title":"Bioinformatics"},{"key":"2023013112021673500_B6","first-page":"1889","article-title":"Working set selection using second order information for training support vector machines","volume":"6","author":"Fan","year":"2005","journal-title":"J. Mach. Learn. Res."},{"key":"2023013112021673500_B7","doi-asserted-by":"crossref","first-page":"16405","DOI":"10.1021\/ja045154j","article-title":"Loop-length-dependent folding of G-quadruplexes","volume":"126","author":"Hazel","year":"2004","journal-title":"J. Am. Chem. Soc."},{"key":"2023013112021673500_B8","doi-asserted-by":"crossref","first-page":"1375","DOI":"10.1039\/b702491f","article-title":"Four-stranded nucleic acids: structure, function and targeting of G-quadruplexes","volume":"37","author":"Huppert","year":"2008","journal-title":"Chem. Soc. Rev."},{"key":"2023013112021673500_B9","doi-asserted-by":"crossref","first-page":"2908","DOI":"10.1093\/nar\/gki609","article-title":"Prevalence of quadruplexes in the human genome","volume":"33","author":"Huppert","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023013112021673500_B10","doi-asserted-by":"crossref","first-page":"406","DOI":"10.1093\/nar\/gkl1057","article-title":"G-quadruplexes in promoters throughout the human genome","volume":"35","author":"Huppert","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023013112021673500_B11","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511790423","volume-title":"Probability Theory: The Logic of Science.","author":"Jaynes","year":"2003"},{"key":"2023013112021673500_B12","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1214\/aoms\/1177729694","article-title":"On information and sufficiency","volume":"22","author":"Kullback","year":"1951","journal-title":"Ann. Math. Stat."},{"key":"2023013112021673500_B13","doi-asserted-by":"crossref","first-page":"218","DOI":"10.1038\/nchembio864","article-title":"An RNA G-quadruplex in the 5\u2032 UTR of the NRAS proto-oncogene modulates translation","volume":"3","author":"Kumari","year":"2007","journal-title":"Nat. Chem. Biol."},{"key":"2023013112021673500_B14","article-title":"Approximate inference for robust gaussian process regression","volume-title":"Technical Report 136.","author":"Kuss","year":"2005"},{"key":"2023013112021673500_B15","doi-asserted-by":"crossref","first-page":"5482","DOI":"10.1093\/nar\/gkn517","article-title":"Stability and kinetics of G-quadruplex structures","volume":"36","author":"Lane","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023013112021673500_B16","first-page":"566","article-title":"The spectrum kernel: a string kernel for SVM protein classification","volume":"7","author":"Leslie","year":"2002","journal-title":"Proceedings of the Pacific Symposium on Biocomputing"},{"key":"2023013112021673500_B17","doi-asserted-by":"crossref","first-page":"590","DOI":"10.1162\/neco.1992.4.4.590","article-title":"Information-based objective functions for active data selection","volume":"4","author":"MacKay","year":"1992","journal-title":"Neural Comput."},{"key":"2023013112021673500_B18","volume-title":"Information Theory, Inference and Learning Algorithms.","author":"MacKay","year":"2003"},{"key":"2023013112021673500_B19","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1016\/S0014-5793(98)01043-6","article-title":"Following G-quartet formation by UV-spectroscopy","volume":"435","author":"Mergny","year":"1998","journal-title":"FEBS Lett."},{"key":"2023013112021673500_B20","article-title":"Divergence measures and message passing","volume-title":"Technical report.","author":"Minka","year":"2005"},{"key":"2023013112021673500_B21","volume-title":"Quadruplex Nucleic Acids.","author":"Neidle","year":"2006"},{"key":"2023013112021673500_B22","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1002\/bies.20523","article-title":"Physiological relevance of telomeric G-quadruplex formation: a potential drug target","volume":"29","author":"Oganesian","year":"2007","journal-title":"Bioessays"},{"key":"2023013112021673500_B23","doi-asserted-by":"crossref","first-page":"7429","DOI":"10.1093\/nar\/gkm711","article-title":"Human telomere, oncogenic promoter and 5\u2032-UTR G-quadruplexes: diverse higher order DNA and RNA targets for cancer therapeutics","volume":"35","author":"Patel","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023013112021673500_B24","doi-asserted-by":"crossref","first-page":"1149","DOI":"10.1016\/j.biochi.2008.02.020","article-title":"Structures, folding patterns, and functions of intramolecular DNA G-quadruplexes found in eukaryotic promoter regions","volume":"90","author":"Qin","year":"2008","journal-title":"Biochimie"},{"key":"2023013112021673500_B25","volume-title":"Gaussian Processes for Machine Learning.","author":"Rasmussen","year":"2006"},{"key":"2023013112021673500_B26","doi-asserted-by":"crossref","first-page":"1460","DOI":"10.1073\/pnas.95.4.1460","article-title":"A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics","volume":"95","author":"SantaLucia","year":"1998","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013112021673500_B27","article-title":"Expectation Propagation for exponential families","volume-title":"Technical report.","author":"Seeger","year":"2005"},{"key":"2023013112021673500_B28","article-title":"Gaussian process regression: Active data selection and test point rejection","volume":"3","author":"Seo","year":"2000","journal-title":"Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference"},{"key":"2023013112021673500_B29","doi-asserted-by":"crossref","first-page":"11593","DOI":"10.1073\/pnas.182256799","article-title":"Direct evidence for a G-quadruplex in a promoter region and its targeting with a small molecule to repress c-MYC transcription","volume":"99","author":"Siddiqui-Jain","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013112021673500_B30","first-page":"1257","article-title":"Sparse Gaussian processes using pseudo-inputs","volume":"18","author":"Snelson","year":"2006","journal-title":"Adv. Neural Inform. Process. Sys."},{"key":"2023013112021673500_B31","first-page":"1531","article-title":"Large scale multiple kernel learning","volume":"7","author":"Sonnenburg","year":"2006","journal-title":"J. Mach. Learn. Res."},{"key":"2023013112021673500_B32","doi-asserted-by":"crossref","first-page":"2143","DOI":"10.1109\/TBME.2008.923118","article-title":"Gaussian process robust regression for noisy heart rate data","volume":"55","author":"Stegle","year":"2008","journal-title":"IEEE Trans Biomed. Eng."},{"key":"2023013112021673500_B33","doi-asserted-by":"crossref","first-page":"2901","DOI":"10.1093\/nar\/gki553","article-title":"Highly prevalent putative quadruplex sequence motifs in human DNA","volume":"33","author":"Todd","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023013112021673500_B34","volume-title":"Quadruplex.org.","author":"Wong","year":"2008"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/12\/i374\/48994049\/bioinformatics_25_12_i374.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/12\/i374\/48994049\/bioinformatics_25_12_i374.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T21:10:21Z","timestamp":1675199421000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/12\/i374\/189760"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,5,27]]},"references-count":34,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2009,6,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp210","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,6,15]]},"published":{"date-parts":[[2009,5,27]]}}}