{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,26]],"date-time":"2025-11-26T21:50:33Z","timestamp":1764193833594},"reference-count":82,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2009,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>The computational prediction of DNA methylation has become an important topic in the recent years due to its role in the epigenetic control of normal and cancer-related processes. While previous prediction approaches focused merely on differences between methylated and unmethylated DNA sequences, recent experimental results have shown the presence of much more complex patterns of methylation across tissues and time in the human genome. These patterns are only partially described by a binary model of DNA methylation. In this work we propose a novel approach, based on profile analysis of tissue-specific methylation that uncovers significant differences in the sequences of CpG islands (CGIs) that predispose them to a tissue- specific methylation pattern.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We defined CGI methylation profiles that separate not only between constitutively methylated and unmethylated CGIs, but also identify CGIs showing a differential degree of methylation across tissues and cell-types or a lack of methylation exclusively in sperm. These profiles are clearly distinguished by a number of CGI attributes including their evolutionary conservation, their significance, as well as the evolutionary evidence of prior methylation. Additionally, we assess profile functionality with respect to the different compartments of protein coding genes and their possible use in the prediction of DNA methylation.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>Our approach provides new insights into the biological features that determine if a CGI has a functional role in the epigenetic control of gene expression and the features associated with CGI methylation susceptibility. Moreover, we show that the ability to predict CGI methylation is based primarily on the quality of the biological information used and the relationships uncovered between different sources of knowledge. The strategy presented here is able to predict, besides the constitutively methylated and unmethylated classes, two more tissue specific methylation classes conserving the accuracy provided by leading binary methylation classification methods.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-10-116","type":"journal-article","created":{"date-parts":[[2009,4,21]],"date-time":"2009-04-21T18:13:58Z","timestamp":1240337638000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":29,"title":["Profile analysis and prediction of tissue-specific CpG island methylation classes"],"prefix":"10.1186","volume":"10","author":[{"given":"Christopher","family":"Previti","sequence":"first","affiliation":[]},{"given":"Oscar","family":"Harari","sequence":"additional","affiliation":[]},{"given":"Igor","family":"Zwir","sequence":"additional","affiliation":[]},{"given":"Coral","family":"del Val","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2009,4,21]]},"reference":[{"issue":"Suppl","key":"2846_CR1","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1038\/ng1089","volume":"33","author":"R Jaenisch","year":"2003","unstructured":"Jaenisch R, Bird A: Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet 2003, 33(Suppl):245\u2013254.","journal-title":"Nat Genet"},{"issue":"5532","key":"2846_CR2","doi-asserted-by":"publisher","first-page":"1068","DOI":"10.1126\/science.1063852","volume":"293","author":"PA Jones","year":"2001","unstructured":"Jones PA, Takai D: The role of DNA methylation in mammalian epigenetics. Science 2001, 293(5532):1068\u20131070.","journal-title":"Science"},{"issue":"22","key":"2846_CR3","doi-asserted-by":"publisher","first-page":"3959","DOI":"10.1242\/dev.001131","volume":"134","author":"D Zilberman","year":"2007","unstructured":"Zilberman D, Henikoff S: Genome-wide analysis of DNA methylation patterns. Development 2007, 134(22):3959\u20133965.","journal-title":"Development"},{"issue":"8","key":"2846_CR4","doi-asserted-by":"publisher","first-page":"1647","DOI":"10.1007\/s00018-003-3088-6","volume":"60","author":"F Antequera","year":"2003","unstructured":"Antequera F: Structure, function and evolution of CpG island promoters. Cell Mol Life Sci 2003, 60(8):1647\u20131658.","journal-title":"Cell Mol Life Sci"},{"issue":"24","key":"2846_CR5","doi-asserted-by":"publisher","first-page":"11995","DOI":"10.1073\/pnas.90.24.11995","volume":"90","author":"F Antequera","year":"1993","unstructured":"Antequera F, Bird A: Number of CpG islands and genes in human and mouse. Proc Natl Acad Sci USA 1993, 90(24):11995\u201311999.","journal-title":"Proc Natl Acad Sci USA"},{"issue":"6067","key":"2846_CR6","doi-asserted-by":"publisher","first-page":"209","DOI":"10.1038\/321209a0","volume":"321","author":"AP Bird","year":"1986","unstructured":"Bird AP: CpG-rich islands and the function of DNA methylation. Nature 1986, 321(6067):209\u2013213.","journal-title":"Nature"},{"issue":"12","key":"2846_CR7","doi-asserted-by":"publisher","first-page":"4692","DOI":"10.1073\/pnas.87.12.4692","volume":"87","author":"J Sved","year":"1990","unstructured":"Sved J, Bird A: The expected equilibrium of the CpG dinucleotide in vertebrate genomes under a mutation model. Proc Natl Acad Sci USA 1990, 87(12):4692\u20134696.","journal-title":"Proc Natl Acad Sci USA"},{"issue":"7","key":"2846_CR8","doi-asserted-by":"publisher","first-page":"687","DOI":"10.1093\/hmg\/10.7.687","volume":"10","author":"SB Baylin","year":"2001","unstructured":"Baylin SB, Esteller M, Rountree MR, Bachman KE, Schuebel K, Herman JG: Aberrant patterns of DNA methylation, chromatin formation and gene expression in cancer. Hum Mol Genet 2001, 10(7):687\u2013692.","journal-title":"Hum Mol Genet"},{"issue":"11","key":"2846_CR9","doi-asserted-by":"publisher","first-page":"7327","DOI":"10.1128\/MCB.19.11.7327","volume":"19","author":"C De Smet","year":"1999","unstructured":"De Smet C, Lurquin C, Lethe B, Martelange V, Boon T: DNA methylation is the primary silencing mechanism for a set of germ line- and tumor-specific genes with a CpG-rich promoter. Mol Cell Biol 1999, 19(11):7327\u20137335.","journal-title":"Mol Cell Biol"},{"issue":"5","key":"2846_CR10","doi-asserted-by":"publisher","first-page":"899","DOI":"10.1002\/jcb.10464","volume":"88","author":"M Ehrlich","year":"2003","unstructured":"Ehrlich M: Expression of various genes is controlled by DNA methylation during mammalian development. J Cell Biochem 2003, 88(5):899\u2013910.","journal-title":"J Cell Biochem"},{"issue":"8","key":"2846_CR11","first-page":"3225","volume":"61","author":"M Esteller","year":"2001","unstructured":"Esteller M, Corn PG, Baylin SB, Herman JG: A gene hypermethylation profile of human cancer. Cancer Res 2001, 61(8):3225\u20133229.","journal-title":"Cancer Res"},{"issue":"2","key":"2846_CR12","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1038\/ng886","volume":"31","author":"BW Futscher","year":"2002","unstructured":"Futscher BW, Oshiro MM, Wozniak RJ, Holtan N, Hanigan CL, Duan H, Domann FE: Role for DNA methylation in the control of cell type specific maspin expression. Nat Genet 2002, 31(2):175\u2013179.","journal-title":"Nat Genet"},{"issue":"3","key":"2846_CR13","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1016\/j.ceb.2004.03.005","volume":"16","author":"E Heard","year":"2004","unstructured":"Heard E: Recent advances in X-chromosome inactivation. Curr Opin Cell Biol 2004, 16(3):247\u2013255.","journal-title":"Curr Opin Cell Biol"},{"issue":"1\u20134","key":"2846_CR14","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1159\/000090823","volume":"113","author":"R Holmes","year":"2006","unstructured":"Holmes R, Soloway PD: Regulation of imprinted DNA methylation. Cytogenet Genome Res 2006, 113(1\u20134):122\u2013129.","journal-title":"Cytogenet Genome Res"},{"issue":"12","key":"2846_CR15","doi-asserted-by":"publisher","first-page":"988","DOI":"10.1038\/nrc1507","volume":"4","author":"JP Issa","year":"2004","unstructured":"Issa JP: CpG island methylator phenotype in cancer. Nat Rev Cancer 2004, 4(12):988\u2013993.","journal-title":"Nat Rev Cancer"},{"issue":"3","key":"2846_CR16","doi-asserted-by":"publisher","first-page":"326","DOI":"10.1016\/j.ygeno.2006.11.006","volume":"89","author":"E Kitamura","year":"2007","unstructured":"Kitamura E, Igarashi J, Morohashi A, Hida N, Oinuma T, Nemoto N, Song F, Ghosh S, Held WA, Yoshida-Noro C, et al.: Analysis of tissue-specific differentially methylated regions (TDMs) in humans. Genomics 2007, 89(3):326\u2013337.","journal-title":"Genomics"},{"issue":"5532","key":"2846_CR17","doi-asserted-by":"publisher","first-page":"1089","DOI":"10.1126\/science.1063443","volume":"293","author":"W Reik","year":"2001","unstructured":"Reik W, Dean W, Walter J: Epigenetic reprogramming in mammalian development. Science 2001, 293(5532):1089\u20131093.","journal-title":"Science"},{"issue":"2\u20134","key":"2846_CR18","doi-asserted-by":"publisher","first-page":"325","DOI":"10.1159\/000078205","volume":"105","author":"K Shiota","year":"2004","unstructured":"Shiota K: DNA methylation profiles of CpG islands for cellular differentiation and development in mammals. Cytogenet Genome Res 2004, 105(2\u20134):325\u2013334.","journal-title":"Cytogenet Genome Res"},{"issue":"9","key":"2846_CR19","doi-asserted-by":"publisher","first-page":"3336","DOI":"10.1073\/pnas.0408436102","volume":"102","author":"F Song","year":"2005","unstructured":"Song F, Smith JF, Kimura MT, Morrow AD, Matsuyama T, Nagase H, Held WA: Association of tissue-specific differentially methylated regions (TDMs) with differential gene expression. Proc Natl Acad Sci USA 2005, 102(9):3336\u20133341.","journal-title":"Proc Natl Acad Sci USA"},{"issue":"2","key":"2846_CR20","doi-asserted-by":"publisher","first-page":"132","DOI":"10.1038\/72785","volume":"24","author":"JF Costello","year":"2000","unstructured":"Costello JF, Fruhwald MC, Smiraglia DJ, Rush LJ, Robertson GP, Gao X, Wright FA, Feramisco JD, Peltomaki P, Lang JC, et al.: Aberrant CpG-island methylation has non-random and tumour-type-specific patterns. Nat Genet 2000, 24(2):132\u2013138.","journal-title":"Nat Genet"},{"issue":"2","key":"2846_CR21","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1101\/gr.1351604","volume":"14","author":"Y Yamada","year":"2004","unstructured":"Yamada Y, Watanabe H, Miura F, Soejima H, Uchiyama M, Iwasaka T, Mukai T, Sakaki Y, Ito T: A comprehensive analysis of allelic methylation status of CpG islands on human chromosome 21q. Genome Res 2004, 14(2):247\u2013266.","journal-title":"Genome Res"},{"issue":"3","key":"2846_CR22","doi-asserted-by":"publisher","first-page":"236","DOI":"10.1038\/ng0394-236","volume":"6","author":"SH Cross","year":"1994","unstructured":"Cross SH, Charlton JA, Nan X, Bird AP: Purification of CpG islands using a methylated DNA binding column. Nat Genet 1994, 6(3):236\u2013244.","journal-title":"Nat Genet"},{"issue":"4","key":"2846_CR23","doi-asserted-by":"publisher","first-page":"253","DOI":"10.1038\/nrc1045","volume":"3","author":"PW Laird","year":"2003","unstructured":"Laird PW: The power and the promise of DNA methylation markers. Nat Rev Cancer 2003, 3(4):253\u2013266.","journal-title":"Nat Rev Cancer"},{"key":"2846_CR24","volume-title":"Bioinformatics","author":"C Bock","year":"2007","unstructured":"Bock C, Lengauer T: Computational Epigenetics. Bioinformatics 2007."},{"issue":"3","key":"2846_CR25","doi-asserted-by":"publisher","first-page":"e26","DOI":"10.1371\/journal.pgen.0020026","volume":"2","author":"C Bock","year":"2006","unstructured":"Bock C, Paulsen M, Tierling S, Mikeska T, Lengauer T, Walter J: CpG island methylation in human lymphocytes is highly correlated with DNA sequence, repeats, and predicted DNA structure. PLoS Genet 2006, 2(3):e26.","journal-title":"PLoS Genet"},{"issue":"18","key":"2846_CR26","doi-asserted-by":"publisher","first-page":"2204","DOI":"10.1093\/bioinformatics\/btl377","volume":"22","author":"F Fang","year":"2006","unstructured":"Fang F, Fan S, Zhang X, Zhang MQ: Predicting methylation status of CpG islands in the human brain. Bioinformatics 2006, 22(18):2204\u20132209.","journal-title":"Bioinformatics"},{"issue":"21","key":"2846_CR27","doi-asserted-by":"publisher","first-page":"12253","DOI":"10.1073\/pnas.2037852100","volume":"100","author":"FA Feltus","year":"2003","unstructured":"Feltus FA, Lee EK, Costello JF, Plass C, Vertino PM: Predicting aberrant CpG island methylation. Proc Natl Acad Sci USA 2003, 100(21):12253\u201312258.","journal-title":"Proc Natl Acad Sci USA"},{"issue":"28","key":"2846_CR28","doi-asserted-by":"publisher","first-page":"10713","DOI":"10.1073\/pnas.0602949103","volume":"103","author":"R Das","year":"2006","unstructured":"Das R, Dimitrova N, Xuan Z, Rollins RA, Haghighi F, Edwards JR, Ju J, Bestor TH, Zhang MQ: Computational prediction of methylation status in human genomic sequences. Proc Natl Acad Sci USA 2006, 103(28):10713\u201310716.","journal-title":"Proc Natl Acad Sci USA"},{"issue":"10","key":"2846_CR29","doi-asserted-by":"publisher","first-page":"e55","DOI":"10.1093\/nar\/gkn122","volume":"36","author":"C Bock","year":"2008","unstructured":"Bock C, Walter J, Paulsen M, Lengauer T: Inter-individual variation of DNA methylation and its implications for large-scale epigenome mapping. Nucleic Acids Res 2008, 36(10):e55.","journal-title":"Nucleic Acids Res"},{"issue":"7205","key":"2846_CR30","doi-asserted-by":"crossref","first-page":"766","DOI":"10.1038\/nature07107","volume":"454","author":"A Meissner","year":"2008","unstructured":"Meissner A, Mikkelsen TS, Gu H, Wernig M, Hanna J, Sivachenko A, Zhang X, Bernstein BE, Nusbaum C, Jaffe DB, et al.: Genome-scale DNA methylation maps of pluripotent and differentiated cells. Nature 2008, 454(7205):766\u2013770.","journal-title":"Nature"},{"issue":"12","key":"2846_CR31","doi-asserted-by":"publisher","first-page":"1378","DOI":"10.1038\/ng1909","volume":"38","author":"F Eckhardt","year":"2006","unstructured":"Eckhardt F, Lewin J, Cortese R, Rakyan VK, Attwood J, Burger M, Burton J, Cox TV, Davies R, Down TA, et al.: DNA methylation profiling of human chromosomes 6, 20 and 22. Nat Genet 2006, 38(12):1378\u20131385.","journal-title":"Nat Genet"},{"issue":"6","key":"2846_CR32","doi-asserted-by":"publisher","first-page":"e110","DOI":"10.1371\/journal.pcbi.0030110","volume":"3","author":"C Bock","year":"2007","unstructured":"Bock C, Walter J, Paulsen M, Lengauer T: CpG island mapping by epigenome prediction. PLoS Comput Biol 2007, 3(6):e110.","journal-title":"PLoS Comput Biol"},{"key":"2846_CR33","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/FUZZY.2007.4295540","volume-title":"Fuzzy Systems Conference, 2007 FUZZ-IEEE 2007 IEEE International","author":"C Previti","year":"2007","unstructured":"Previti C, Harari O, del Val C: Mining and Predicting CpGislands. Fuzzy Systems Conference, 2007 FUZZ-IEEE 2007 IEEE International 2007, 1\u20136."},{"issue":"8","key":"2846_CR34","doi-asserted-by":"publisher","first-page":"2862","DOI":"10.1073\/pnas.0408238102","volume":"102","author":"I Zwir","year":"2005","unstructured":"Zwir I, Shin D, Kato A, Nishino K, Latifi T, Solomon F, Hare JM, Huang H, Groisman EA: Dissecting the PhoP regulatory network of Escherichia coli and Salmonella enterica. Proc Natl Acad Sci USA 2005, 102(8):2862\u20132867.","journal-title":"Proc Natl Acad Sci USA"},{"key":"2846_CR35","first-page":"833","volume-title":"SCSC: Proceedings of the 2007 summer computer simulation conference","author":"C Previti","year":"2007","unstructured":"Previti C, Harari O, Zwir I, del Val C: Novel approachesto the prediction of CpG islands and their methylation status. SCSC: Proceedings of the 2007 summer computer simulation conference 2007, 833\u2013840."},{"key":"2846_CR36","doi-asserted-by":"publisher","first-page":"361","DOI":"10.1016\/S0076-6879(06)22018-4","volume":"422","author":"I Zwir","year":"2007","unstructured":"Zwir I, Harari O, Groisman EA: Gene promoter scan methodology for identifying and classifying coregulated promoters. Meth Enzymol 2007, 422: 361\u2013385.","journal-title":"Meth Enzymol"},{"issue":"3","key":"2846_CR37","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1038\/10343","volume":"22","author":"S Tavazoie","year":"1999","unstructured":"Tavazoie S, Hughes JD, Campbell MJ, Cho RJ, Church GM: Systematic determination of genetic network architecture. Nat Genet 1999, 22(3):281\u2013285.","journal-title":"Nat Genet"},{"issue":"1","key":"2846_CR38","doi-asserted-by":"publisher","first-page":"446","DOI":"10.1186\/1471-2105-7-446","volume":"7","author":"M Hackenberg","year":"2006","unstructured":"Hackenberg M, Previti C, Luque-Escamilla P, Carpena P, Martinez-Aroza J, Oliver J: CpGcluster: a distance-based algorithm for CpG-island detection. BMC Bioinformatics 2006, 7(1):446.","journal-title":"BMC Bioinformatics"},{"issue":"2","key":"2846_CR39","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1016\/0022-2836(87)90689-9","volume":"196","author":"M Gardiner-Garden","year":"1987","unstructured":"Gardiner-Garden M, Frommer M: CpG islands in vertebrate genomes. J Mol Biol 1987, 196(2):261\u2013282.","journal-title":"J Mol Biol"},{"issue":"12","key":"2846_CR40","doi-asserted-by":"publisher","first-page":"R263","DOI":"10.1186\/gb-2007-8-12-r263","volume":"8","author":"JR Goni","year":"2007","unstructured":"Goni JR, Perez A, Torrents D, Orozco M: Determining promoter location based on DNA structure first-principles calculations. Genome Biol 2007, 8(12):R263.","journal-title":"Genome Biol"},{"key":"2846_CR41","volume-title":"Fuzzy Models for Pattern Recognition: Methods That Search for Structures in Data","author":"JC Bezdek","year":"1992","unstructured":"Bezdek JC, Pal SK: Fuzzy Models for Pattern Recognition: Methods That Search for Structures in Data. New York, NY: IEEE; 1992."},{"issue":"8","key":"2846_CR42","doi-asserted-by":"publisher","first-page":"1051","DOI":"10.1101\/gr.3642605","volume":"15","author":"DC King","year":"2005","unstructured":"King DC, Taylor J, Elnitski L, Chiaromonte F, Miller W, Hardison RC: Evaluation of regulatory potential and conservation scores for detecting cis-regulatory modules in aligned mammalian genome sequences. Genome Res 2005, 15(8):1051\u20131060.","journal-title":"Genome Res"},{"key":"2846_CR43","doi-asserted-by":"publisher","first-page":"1518","DOI":"10.1101\/gr.077479.108","volume":"18","author":"V Rakyan","year":"2008","unstructured":"Rakyan V, Down T, Thorne N, Flicek P, Kulesha E, Graf S, Tomazou E, Backdahl L, Johnson N, Herberth M, et al.: An integrated resource for genome-wide identification and analysis of human tissue-specific differentially methylated regions (tDMRs). Genome Res 2008, 18: 1518\u20131529.","journal-title":"Genome Res"},{"issue":"4","key":"2846_CR44","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1038\/ng1990","volume":"39","author":"M Weber","year":"2007","unstructured":"Weber M, Hellmann I, Stadler MB, Ramos L, Paabo S, Rebhan M, Schubeler D: Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome. Nat Genet 2007, 39(4):457\u2013466.","journal-title":"Nat Genet"},{"issue":"18","key":"2846_CR45","doi-asserted-by":"publisher","first-page":"2651","DOI":"10.1093\/hmg\/9.18.2651","volume":"9","author":"C Grunau","year":"2000","unstructured":"Grunau C, Hindermann W, Rosenthal A: Large-scale methylation analysis of human genomic DNA reveals tissue-specific differences between the methylation profiles of genes and pseudogenes. Hum Mol Genet 2000, 9(18):2651\u20132663.","journal-title":"Hum Mol Genet"},{"issue":"1","key":"2846_CR46","doi-asserted-by":"publisher","first-page":"e22","DOI":"10.1371\/journal.pbio.0060022","volume":"6","author":"R Illingworth","year":"2008","unstructured":"Illingworth R, Kerr A, Desousa D, Jorgensen H, Ellis P, Stalker J, Jackson D, Clee C, Plumb R, Rogers J, et al.: A novel CpG island set identifies tissue-specific methylation at developmental gene loci. PLoS Biol 2008, 6(1):e22.","journal-title":"PLoS Biol"},{"issue":"5675","key":"2846_CR47","doi-asserted-by":"publisher","first-page":"1321","DOI":"10.1126\/science.1098119","volume":"304","author":"G Bejerano","year":"2004","unstructured":"Bejerano G, Pheasant M, Makunin I, Stephen S, Kent WJ, Mattick JS, Haussler D: Ultraconserved elements in the human genome. Science 2004, 304(5675):1321\u20131325.","journal-title":"Science"},{"key":"2846_CR48","doi-asserted-by":"publisher","first-page":"S4","DOI":"10.1186\/gb-2007-8-s1-s4","volume":"8 Suppl 1","author":"H Kikuta","year":"2007","unstructured":"Kikuta H, Fredman D, Rinkwitz S, Lenhard B, Becker TS: Retroviral enhancer detection insertions in zebrafish combined with comparative genomics reveal genomic regulatory blocks \u2013 a fundamental feature of vertebrate genomes. Genome Biol 2007, 8 Suppl 1: S4.","journal-title":"Genome Biol"},{"issue":"2","key":"2846_CR49","doi-asserted-by":"publisher","first-page":"R34","DOI":"10.1186\/gb-2008-9-2-r34","volume":"9","author":"PG Engstrom","year":"2008","unstructured":"Engstrom PG, Fredman D, Lenhard B: Ancora: a web resource for exploring highly conserved noncoding elements and their association with developmental regulatory genes. Genome Biol 2008, 9(2):R34.","journal-title":"Genome Biol"},{"issue":"30","key":"2846_CR50","doi-asserted-by":"crossref","first-page":"20504","DOI":"10.1016\/S0021-9258(18)54953-X","volume":"266","author":"YC Choi","year":"1991","unstructured":"Choi YC, Chae CB: DNA hypomethylation and germ cell-specific expression of testis-specific H2B histone gene. J Biol Chem 1991, 266(30):20504\u201320511.","journal-title":"J Biol Chem"},{"issue":"3","key":"2846_CR51","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1016\/S0888-7543(03)00129-0","volume":"82","author":"T Sasaki","year":"2003","unstructured":"Sasaki T, Shiohama A, Minoshima S, Shimizu N: Identification of eight members of the Argonaute family in the human genome small star, filled. Genomics 2003, 82(3):323\u2013330.","journal-title":"Genomics"},{"issue":"5","key":"2846_CR52","doi-asserted-by":"publisher","first-page":"631","DOI":"10.1016\/S0092-8674(03)00393-3","volume":"113","author":"K Mitsui","year":"2003","unstructured":"Mitsui K, Tokuzawa Y, Itoh H, Segawa K, Murakami M, Takahashi K, Maruyama M, Maeda M, Yamanaka S: The homeoprotein Nanog is required for maintenance of pluripotency in mouse epiblast and ES cells. Cell 2003, 113(5):631\u2013642.","journal-title":"Cell"},{"issue":"6","key":"2846_CR53","doi-asserted-by":"publisher","first-page":"1231","DOI":"10.1093\/molbev\/msn071","volume":"25","author":"K Okamura","year":"2008","unstructured":"Okamura K, Nakai K: Retrotransposition as a source of new promoters. Mol Biol Evol 2008, 25(6):1231\u20131238.","journal-title":"Mol Biol Evol"},{"issue":"2","key":"2846_CR54","doi-asserted-by":"publisher","first-page":"153","DOI":"10.3324\/haematol.10782","volume":"92","author":"J Roman-Gomez","year":"2007","unstructured":"Roman-Gomez J, Jimenez-Velasco A, Agirre X, Castillejo JA, Navarro G, San Jose-Eneriz E, Garate L, Cordeu L, Cervantes F, Prosper F, et al.: Epigenetic regulation of human cancer\/testis antigen gene, HAGE, in chronic myeloid leukemia. Haematologica 2007, 92(2):153\u2013162.","journal-title":"Haematologica"},{"issue":"10","key":"2846_CR55","doi-asserted-by":"publisher","first-page":"2023","DOI":"10.1371\/journal.pgen.0030181","volume":"3","author":"L Shen","year":"2007","unstructured":"Shen L, Kondo Y, Guo Y, Zhang J, Zhang L, Ahmed S, Shu J, Chen X, Waterland RA, Issa JP: Genome-wide profiling of DNA methylation reveals a class of normally methylated CpG island promoters. PLoS Genet 2007, 3(10):2023\u20132036.","journal-title":"PLoS Genet"},{"issue":"4","key":"2846_CR56","doi-asserted-by":"publisher","first-page":"458","DOI":"10.1634\/stemcells.2004-0245","volume":"23","author":"SK Kim","year":"2005","unstructured":"Kim SK, Suh MR, Yoon HS, Lee JB, Oh SK, Moon SY, Moon SH, Lee JY, Hwang JH, Cho WJ, et al.: Identification of developmental pluripotency associated 5 expression in human pluripotent stem cells. Stem Cells 2005, 23(4):458\u2013462.","journal-title":"Stem Cells"},{"key":"2846_CR57","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1023\/A:1007515423169","volume":"36","author":"E Bauer","year":"1999","unstructured":"Bauer E, Kohavi R: An empirical comparison of voting classification algorithms: Bagging, boosting, and variants. Mach Learn 1999, 36: 105\u2013139.","journal-title":"Mach Learn"},{"issue":"5","key":"2846_CR58","doi-asserted-by":"publisher","first-page":"1827","DOI":"10.1073\/pnas.89.5.1827","volume":"89","author":"M Frommer","year":"1992","unstructured":"Frommer M, McDonald LE, Millar DS, Collis CM, Watt F, Grigg GW, Molloy PL, Paul CL: A genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands. Proc Natl Acad Sci USA 1992, 89(5):1827\u20131831.","journal-title":"Proc Natl Acad Sci USA"},{"key":"2846_CR59","unstructured":"Smit A, Hubley R, Green P: RepeatMasker Open-3.0. 2000\u20132004."},{"issue":"8","key":"2846_CR60","doi-asserted-by":"publisher","first-page":"1034","DOI":"10.1101\/gr.3715005","volume":"15","author":"A Siepel","year":"2005","unstructured":"Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, et al.: Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res 2005, 15(8):1034\u20131050.","journal-title":"Genome Res"},{"key":"2846_CR61","doi-asserted-by":"crossref","unstructured":"Karolchik D, Hinrichs AS, Furey TS, Roskin KM, Sugnet CW, Haussler D, Kent WJ: The UCSC Table Browser data retrieval tool. Nucleic Acids Res 2004, (32 Database):D493\u2013496. [http:\/\/genome.ucsc.edu]","DOI":"10.1093\/nar\/gkh103"},{"issue":"6","key":"2846_CR62","doi-asserted-by":"publisher","first-page":"276","DOI":"10.1016\/S0168-9525(00)02024-2","volume":"16","author":"P Rice","year":"2000","unstructured":"Rice P, Longden I, Bleasby A: EMBOSS: The European Molecular Biology Open Software Suite. Trends Genet 2000, 16(6):276\u2013277.","journal-title":"Trends Genet"},{"issue":"6","key":"2846_CR63","doi-asserted-by":"publisher","first-page":"947","DOI":"10.1101\/gr.6073107","volume":"17","author":"JA Greenbaum","year":"2007","unstructured":"Greenbaum JA, Pang B, Tullius TD: Construction of a genome-scale structural map at single-nucleotide resolution. Genome Res 2007, 17(6):947\u2013953.","journal-title":"Genome Res"},{"issue":"17","key":"2846_CR64","doi-asserted-by":"publisher","first-page":"9738","DOI":"10.1073\/pnas.95.17.9738","volume":"95","author":"B Balasubramanian","year":"1998","unstructured":"Balasubramanian B, Pogozelski WK, Tullius TD: DNA strand breaking by the hydroxyl radical is governed by the accessible surface areas of the hydrogen atoms of the DNA backbone. Proc Natl Acad Sci USA 1998, 95(17):9738\u20139743.","journal-title":"Proc Natl Acad Sci USA"},{"issue":"1","key":"2846_CR65","doi-asserted-by":"publisher","first-page":"229","DOI":"10.1006\/jmbi.2001.4987","volume":"313","author":"WK Olson","year":"2001","unstructured":"Olson WK, Bansal M, Burley SK, Dickerson RE, Gerstein M, Harvey SC, Heinemann U, Lu XJ, Neidle S, Shakked Z, et al.: A standard reference frame for the description of nucleic acid base-pair geometry. J Mol Biol 2001, 313(1):229\u2013237.","journal-title":"J Mol Biol"},{"key":"2846_CR66","volume-title":"Principal Component Analysis","author":"IT Jolliffe","year":"2002","unstructured":"Jolliffe IT: Principal Component Analysis. New York, NY: Springer; 2002."},{"issue":"15","key":"2846_CR67","doi-asserted-by":"publisher","first-page":"550a","DOI":"10.1021\/ac0028797","volume":"72","author":"CL Wilkins","year":"2000","unstructured":"Wilkins CL: Data mining with Spotfire Pro 4.0. Analytical Chemistry 2000, 72(15):550a-550a.","journal-title":"Analytical Chemistry"},{"issue":"1","key":"2846_CR68","doi-asserted-by":"publisher","first-page":"S12","DOI":"10.1186\/gb-2006-7-s1-s12","volume":"7 Suppl 1","author":"D Thierry-Mieg","year":"2006","unstructured":"Thierry-Mieg D, Thierry-Mieg J: AceView: a comprehensive cDNA-supported gene and transcripts annotation. Genome Biol 2006, 7 Suppl 1(1):S12.","journal-title":"Genome Biol"},{"key":"2846_CR69","doi-asserted-by":"crossref","unstructured":"Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL: GenBank. Nucleic Acids Res 2008, (36 Database):D25\u201330.","DOI":"10.1093\/nar\/gkm929"},{"issue":"4","key":"2846_CR70","doi-asserted-by":"publisher","first-page":"332","DOI":"10.1038\/ng0893-332","volume":"4","author":"MS Boguski","year":"1993","unstructured":"Boguski MS, Lowe TM, Tolstoshev CM: dbEST \u2013 database for \"expressed sequence tags\". Nat Genet 1993, 4(4):332\u2013333.","journal-title":"Nat Genet"},{"issue":"1","key":"2846_CR71","doi-asserted-by":"publisher","first-page":"126","DOI":"10.1093\/nar\/28.1.126","volume":"28","author":"DR Maglott","year":"2000","unstructured":"Maglott DR, Katz KS, Sicotte H, Pruitt KD: NCBI's LocusLink and RefSeq. Nucleic Acids Res 2000, 28(1):126\u2013128. [http:\/\/www.ncbi.nih.gov\/RefSeq]","journal-title":"Nucleic Acids Res"},{"issue":"11","key":"2846_CR72","doi-asserted-by":"crossref","first-page":"1854","DOI":"10.1101\/gr.174501","volume":"11","author":"L Ponger","year":"2001","unstructured":"Ponger L, Duret L, Mouchiroud D: Determinants of CpG islands: expression in early embryo and isochore structure. Genome Res 2001, 11(11):1854\u20131860.","journal-title":"Genome Res"},{"key":"2846_CR73","volume-title":"Machine learning","author":"TM Mitchell","year":"1997","unstructured":"Mitchell TM: Machine learning. New York, NY: McGraw-Hill Higher Education; 1997."},{"issue":"3","key":"2846_CR74","doi-asserted-by":"publisher","first-page":"264","DOI":"10.1145\/331499.331504","volume":"31","author":"AK Jain","year":"1999","unstructured":"Jain AK, Murty MN, Flynn PJ: Data Clustering: A Review. ACM Comput Surv 1999, 31(3):264\u2013323.","journal-title":"ACM Comput Surv"},{"key":"2846_CR75","volume-title":"Algorithms for clustering data","author":"AK Jain","year":"1988","unstructured":"Jain AK, Dubes RC: Algorithms for clustering data. Englewood Cliffs, NJ: Prentice-Hall, Inc.; 1988."},{"key":"2846_CR76","volume-title":"Some Methods for Classification and Analysis of Multivariate Observations","author":"JB MacQueen","year":"1967","unstructured":"MacQueen JB: Some Methods for Classification and Analysis of Multivariate Observations. Volume 1. Berkeley, CA: University of California Press; 1967."},{"key":"2846_CR77","volume-title":"MATLAB statistics toolbox: computation, visualization, programming: user's guide","author":"B Jones","year":"1993","unstructured":"Jones B: MATLAB statistics toolbox: computation, visualization, programming: user's guide. Natick, MA: MathWorks; 1993."},{"issue":"3","key":"2846_CR78","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1109\/3477.678624","volume":"28","author":"JC Bezdek","year":"1998","unstructured":"Bezdek JC, Pal NR: Some new indexes of cluster validity. IEEE Trans Syst Man Cybern B Cybern 1998, 28(3):301\u2013315.","journal-title":"IEEE Trans Syst Man Cybern B Cybern"},{"issue":"1","key":"2846_CR79","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","volume":"20","author":"P Rousseeuw","year":"1987","unstructured":"Rousseeuw P: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 1987, 20(1):53\u201365.","journal-title":"J Comput Appl Math"},{"issue":"11","key":"2846_CR80","doi-asserted-by":"publisher","first-page":"1386","DOI":"10.1093\/bioinformatics\/btn178","volume":"24","author":"M Hackenberg","year":"2008","unstructured":"Hackenberg M, Matthiesen R: Annotation-Modules: a tool for finding significant combinations of multisource annotations for gene lists. Bioinformatics 2008, 24(11):1386\u20131393.","journal-title":"Bioinformatics"},{"issue":"25","key":"2846_CR81","doi-asserted-by":"publisher","first-page":"14863","DOI":"10.1073\/pnas.95.25.14863","volume":"95","author":"MB Eisen","year":"1998","unstructured":"Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA 1998, 95(25):14863\u201314868.","journal-title":"Proc Natl Acad Sci USA"},{"key":"2846_CR82","volume-title":"Classification and Regression Trees","author":"L Breiman","year":"1984","unstructured":"Breiman L, Friedman JH, Olshen RA, Stone CJ: Classification and Regression Trees. Belmont, CA: Wadsworth Publishing Company; 1984."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-10-116.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T21:39:28Z","timestamp":1630445968000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-10-116"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,4,21]]},"references-count":82,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2009,12]]}},"alternative-id":["2846"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-10-116","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2009,4,21]]},"assertion":[{"value":"24 October 2008","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 April 2009","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 April 2009","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"116"}}