{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,7]],"date-time":"2025-11-07T08:55:03Z","timestamp":1762505703978},"reference-count":56,"publisher":"Springer Science and Business Media LLC","issue":"8","license":[{"start":{"date-parts":[[2011,2,5]],"date-time":"2011-02-05T00:00:00Z","timestamp":1296864000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Soft Comput"],"published-print":{"date-parts":[[2011,8]]},"DOI":"10.1007\/s00500-011-0692-5","type":"journal-article","created":{"date-parts":[[2011,2,4]],"date-time":"2011-02-04T11:26:50Z","timestamp":1296818810000},"page":"1631-1642","source":"Crossref","is-referenced-by-count":23,"title":["Generalizing and learning protein-DNA binding sequence representations by an evolutionary algorithm"],"prefix":"10.1007","volume":"15","author":[{"given":"Ka-Chun","family":"Wong","sequence":"first","affiliation":[]},{"given":"Chengbin","family":"Peng","sequence":"additional","affiliation":[]},{"given":"Man-Hon","family":"Wong","sequence":"additional","affiliation":[]},{"given":"Kwong-Sak","family":"Leung","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,2,5]]},"reference":[{"issue":"Suppl 2","key":"692_CR1","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1093\/bioinformatics\/btg1052","volume":"19","author":"S Aerts","year":"2003","unstructured":"Aerts S, Van Loo P, Thijs G, Moreau Y, De Moor B (2003) Computational detection of cis-regulatory modules. Bioinformatics 19(Suppl 2):5\u201314","journal-title":"Bioinformatics"},{"key":"692_CR2","doi-asserted-by":"crossref","unstructured":"Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD international conference on management of data, pp 207\u2013216. 
doi: 10.1145\/170035.170072","DOI":"10.1145\/170035.170072"},{"key":"692_CR3","doi-asserted-by":"crossref","unstructured":"Ahmad S, Gromiha MM, Sarai A (2004) Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information. Bioinformatics 20(4):477\u2013486. doi: 10.1093\/bioinformatics\/btg432","DOI":"10.1093\/bioinformatics\/btg432"},{"key":"692_CR4","doi-asserted-by":"crossref","first-page":"5922","DOI":"10.1093\/nar\/gkn573","volume":"36","author":"S Ahmad","year":"2008","unstructured":"Ahmad S, Keskin O, Sarai A, Nussinov R (2008) Protein-DNA interactions: structural, thermodynamic and clustering patterns of conserved residues in DNA-binding proteins. Nucleic Acids Res 36:5922\u20135932","journal-title":"Nucleic Acids Res"},{"key":"692_CR5","unstructured":"Bailey TL, Elkan C (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In: Proceedings of the 2nd international conference on intelligent systems for molecular biology, pp 28\u201336"},{"issue":"Suppl 2","key":"692_CR6","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1093\/bioinformatics\/btg1054","volume":"19","author":"TL Bailey","year":"2003","unstructured":"Bailey TL, Noble WS (2003) Searching for statistically significant regulatory modules. Bioinformatics 19(Suppl 2):16\u201325","journal-title":"Bioinformatics"},{"key":"692_CR7","unstructured":"Banzhaf W, Nordin P, Keller RE, Francone FD (1998) Genetic Programming\u2014an introduction; on the automatic evolution of computer programs and its applications. Morgan Kaufmann, San Francisco"},{"key":"692_CR8","doi-asserted-by":"crossref","first-page":"D138","DOI":"10.1093\/nar\/gkh121","volume":"32","author":"A Bateman","year":"2004","unstructured":"Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer ELL, Studholme DJ, Yeats C, Eddy SR (2004) The pfam protein families database. 
Nucleic Acids Res 32:D138\u2013D141","journal-title":"Nucleic Acids Res"},{"key":"692_CR9","doi-asserted-by":"crossref","unstructured":"Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The protein data bank. Nucleic Acids Res 28(1):235\u2013242. doi: 10.1093\/nar\/28.1.235","DOI":"10.1093\/nar\/28.1.235"},{"key":"692_CR10","doi-asserted-by":"crossref","first-page":"656","DOI":"10.1101\/gr.4866006","volume":"16","author":"M Blanchette","year":"2006","unstructured":"Blanchette M, Bataille AR, Chen X, Poitras C, Laganiere J, Lefebvre C, Deblois G, Giguere V, Ferretti V, Bergeron D, Coulombe B, Robert F (2006) Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression. Genome Res 16:656\u2013668","journal-title":"Genome Res"},{"key":"692_CR11","doi-asserted-by":"crossref","unstructured":"Brin S, Motwani R, Ullman JD, Tsur S (1997) Dynamic itemset counting and implication rules for market basket data. SIGMOD Rec 26(2):255\u2013264. doi: 10.1145\/253262.253325","DOI":"10.1145\/253262.253325"},{"key":"692_CR12","doi-asserted-by":"crossref","first-page":"4516","DOI":"10.1073\/pnas.0737502100","volume":"100","author":"L Coin","year":"2003","unstructured":"Coin L, Bateman A, Durbin R (2003) Enhanced protein domain discovery by using language modeling techniques from speech recognition. Proc Natl Acad Sci USA 100:4516\u20134520","journal-title":"Proc Natl Acad Sci USA"},{"issue":"9","key":"692_CR13","doi-asserted-by":"crossref","first-page":"3157","DOI":"10.1093\/nar\/5.9.3157","volume":"5","author":"DJ Galas","year":"1987","unstructured":"Galas DJ, Schmitz A (1987) DNAse footprinting: a simple method for the detection of protein-DNA binding specificity. 
Nucleic Acids Res 5(9):3157\u20133170","journal-title":"Nucleic Acids Res"},{"issue":"13","key":"692_CR14","doi-asserted-by":"crossref","first-page":"3047","DOI":"10.1093\/nar\/9.13.3047","volume":"9","author":"MM Garner","year":"1981","unstructured":"Garner MM, Revzin A (1981) A gel electrophoresis method for quantifying the binding of proteins to specific DNA regions: application to components of the escherichia coli lactose operon regulatory system. Nucleic Acids Res 9(13):3047\u20133060","journal-title":"Nucleic Acids Res"},{"key":"692_CR15","doi-asserted-by":"crossref","unstructured":"Givant S, Halmos P (2009) Introduction to boolean algebras. Springer, Berlin","DOI":"10.1007\/978-0-387-68436-9"},{"key":"692_CR16","unstructured":"Goldberg DE, Richardson J (1987) Genetic algorithms with sharing for multimodal function optimization. In: Proceedings of the 2nd international conference on genetic algorithms and their application. L. Erlbaum Associates Inc., Hillsdale, pp 41\u201349"},{"key":"692_CR17","first-page":"397","volume":"13","author":"WN Grundy","year":"1997","unstructured":"Grundy WN, Bailey TL, Elkan CP, Baker ME (1997) Meta-MEME: motif-based hidden Markov models of protein families. Comput Appl Biosci 13:397\u2013406","journal-title":"Comput Appl Biosci"},{"key":"692_CR18","volume-title":"Adaptation in natural and artificial systems","author":"JH Holland","year":"1975","unstructured":"Holland JH (1975) Adaptation in natural and artificial systems. University of Michigan Press, Ann Arbor"},{"key":"692_CR19","unstructured":"Hulo N, Bairoch A, Bulliard V, Cerutti L, Cuche BA, de Castro E, Lachaize C, Langendijk-Genevaux PS, Sigrist CJA (2008) The 20\u00a0years of prosite. 
Nucl Acids Res 36(Suppl 1):D245\u2013D249"},{"issue":"1","key":"692_CR20","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1214\/088342304000000107","volume":"19","author":"ST Jensen","year":"2004","unstructured":"Jensen ST, Liu XS, Zhou Q, Liu JS (2004) Computational discovery of gene regulatory binding motifs: a bayesian perspective. Stat Sci 19(1):188\u2013204","journal-title":"Stat Sci"},{"key":"692_CR21","unstructured":"Jong KAD (1975) An analysis of the behavior of a class of genetic adaptive systems. PhD thesis, University of Michigan, Ann Arbor"},{"key":"692_CR22","volume-title":"Evolutionary Computation. A Unified Approach","author":"KAD Jong","year":"2006","unstructured":"Jong KAD (2006) Evolutionary Computation. A Unified Approach. MIT Press, Cambridge, MA"},{"issue":"(I","key":"692_CR23","first-page":"593","volume":"72","author":"M Karnaugh","year":"1953","unstructured":"Karnaugh M (1953) A map method for synthesis of combinational logic circuits. Trans AIEE Commun Electron 72 (I):593\u2013599","journal-title":"Trans AIEE Commun Electron"},{"key":"692_CR24","doi-asserted-by":"crossref","first-page":"R56","DOI":"10.1186\/gb-2004-5-8-r56","volume":"5","author":"M Kato","year":"2004","unstructured":"Kato M, Hata N, Banerjee N, Futcher B, Zhang MQ (2004) Identifying combinatorial regulation of transcription factors and binding motifs. Genome Biol 5:R56","journal-title":"Genome Biol"},{"key":"692_CR25","doi-asserted-by":"crossref","first-page":"332","DOI":"10.1093\/nar\/30.1.332","volume":"30","author":"OV Kel-Margoulis","year":"2002","unstructured":"Kel-Margoulis OV, Kel AE, Reuter I, Deineko IV, Wingender E (2002) TRANSCompel: a database on composite regulatory elements in eukaryotic genes. Nucleic Acids Res 30:332\u2013334","journal-title":"Nucleic Acids Res"},{"key":"692_CR26","doi-asserted-by":"crossref","unstructured":"Kraft D, Petry F, Buckles B, Sadasivan T (1994) The use of genetic programming to build queries for information retrieval. 
In: Evolutionary Computation, 1994. IEEE World Congress on Computational Intelligence. Proceedings of the 1st IEEE conference, vol 1, pp 468\u2013473. doi: 10.1109\/ICEC.1994.349905","DOI":"10.1109\/ICEC.1994.349905"},{"key":"692_CR27","doi-asserted-by":"crossref","first-page":"1559","DOI":"10.1101\/gr.180601","volume":"11","author":"W Krivan","year":"2001","unstructured":"Krivan W, Wasserman WW (2001) A predictive model for regulatory sequences directing liver-specific transcription. Genome Res 11:1559\u20131566","journal-title":"Genome Res"},{"key":"692_CR28","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/0022-2836(82)90515-0","volume":"157","author":"J Kyte","year":"1982","unstructured":"Kyte J, Doolittle RF (1982) A simple method for displaying the hydropathic character of a protein. J Mol Biol 157:105\u2013132","journal-title":"J Mol Biol"},{"key":"692_CR29","doi-asserted-by":"crossref","unstructured":"Leung KS, Wong KC, Chan TM, Wong MH, Lee KH, Lau CK, Tsui SKW (2010) Discovering protein-DNA binding sequence patterns using association rule mining. Nucleic Acids Research (accepted)","DOI":"10.1093\/nar\/gkq500"},{"key":"692_CR30","doi-asserted-by":"crossref","unstructured":"Li JP, Balazs ME, Parks GT, Clarkson PJ (2002) A species conserving genetic algorithm for multimodal function optimization. Evol Comput 10(3):207\u2013234. doi: 10.1162\/106365602760234081","DOI":"10.1162\/106365602760234081"},{"key":"692_CR31","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1038\/nbt717","volume":"20","author":"XS Liu","year":"2002","unstructured":"Liu XS, Brutlag DL, Liu JS (2002) An algorithm for finding protein-DNA binding sites with applications to chromatinimmunoprecipitation microarray experiments. 
Nat Biotechnol 20:835\u2013839","journal-title":"Nat Biotechnol"},{"issue":"5","key":"692_CR32","doi-asserted-by":"crossref","first-page":"991","DOI":"10.1016\/S0022-2836(02)00571-5","volume":"320","author":"NM Luscombe","year":"2002","unstructured":"Luscombe NM, Thornton JM (2002) Protein-DNA interactions: amino acid conservation and the effects of mutations on binding specificity. J Mol Biol 320(5):991\u20131009","journal-title":"J Mol Biol"},{"key":"692_CR33","unstructured":"Luscombe NM, Austin SE, Berman HM, Thornton JM (2000) An overview of the structures of protein-DNA complexes. Genome Biol 1(1):1\u201337"},{"issue":"4","key":"692_CR34","doi-asserted-by":"crossref","first-page":"e36","DOI":"10.1371\/journal.pcbi.0020036","volume":"2","author":"KD MacIsaac","year":"2006","unstructured":"MacIsaac KD, Fraenkel E (2006) Practical strategies for discovering regulatory DNA sequence motifs. PLoS Comput Biol 2(4):e36","journal-title":"PLoS Comput Biol"},{"key":"692_CR35","doi-asserted-by":"crossref","first-page":"D108","DOI":"10.1093\/nar\/gkj143","volume":"34","author":"V Matys","year":"2006","unstructured":"Matys V, Kel-Margoulis O, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K, Voss N, Stegmaier P, Lewicki-Potapov B, Saxel H, Kel A, Wingender E (2006) TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic Acids Res 34:D108\u2013D110","journal-title":"Nucleic Acids Res"},{"key":"692_CR36","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1046\/j.1365-2958.1999.01347.x","volume":"32","author":"AM McGuire","year":"1999","unstructured":"McGuire AM, De Wulf P, Church GM, Lin EC (1999) A weight matrix for binding recognition by the redox-response regulator ArcA-P of Escherichia coli. 
Mol Microbiol 32:219\u2013221","journal-title":"Mol Microbiol"},{"key":"692_CR37","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1007\/s12038-009-0052-0","volume":"34","author":"PM Mohan","year":"2009","unstructured":"Mohan PM, Hosur RV (2009) Structure-function-folding relationships and native energy landscape of dynein light chain protein: nuclear magnetic resonance insights. J Biosci 34:465\u2013479","journal-title":"J Biosci"},{"key":"692_CR38","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1186\/1471-2105-6-21","volume":"6","author":"JL Moreland","year":"2005","unstructured":"Moreland JL, Gramada A, Buzko OV, Zhang Q, Bourne PE (2005) The Molecular Biology Toolkit (MBT): a modular platform for developing molecular visualization applications. BMC Bioinformatics 6:21","journal-title":"BMC Bioinformatics"},{"key":"692_CR39","doi-asserted-by":"crossref","unstructured":"Nelson RJ (1953) A way to simplify truth functions. J Symb Logic 18(3):280\u2013282","DOI":"10.2307\/2267441"},{"key":"692_CR40","volume-title":"Digital logic circuit analysis and design","author":"VP Nelson","year":"1995","unstructured":"Nelson VP, Nagle HT, Carroll BD, Irwin JD (1995) Digital logic circuit analysis and design. Prentice-Hall, Inc., Upper Saddle River"},{"key":"692_CR41","doi-asserted-by":"crossref","unstructured":"Ofran Y, Mysore V, Rost B (2007) Prediction of DNA-binding residues from sequence. Bioinformatics 23(13):i347\u2013i353. doi: 10.1093\/bioinformatics\/btm174","DOI":"10.1093\/bioinformatics\/btm174"},{"key":"692_CR42","unstructured":"Pavlidis P, Furey TS, Liberto M, Haussler D, Grundy WN (2001) Promoter region-based classification of genes. In: Pacific symposium on biocomputing, pp 151\u2013163"},{"key":"692_CR43","doi-asserted-by":"crossref","first-page":"812","DOI":"10.1038\/nsmb820","volume":"11","author":"A Remenyi","year":"2004","unstructured":"Remenyi A, Scholer HR, Wilmanns M (2004) Combinatorial control of gene expression. 
Nat Struct Mol Biol 11:812\u2013815","journal-title":"Nat Struct Mol Biol"},{"key":"692_CR44","doi-asserted-by":"crossref","unstructured":"Rudell RL (1986) Multiple-valued logic minimization for pla synthesis. Tech. Rep. UCB\/ERL M86\/65, EECS Department, University of California, Berkeley. http:\/\/www.eecs.berkeley.edu\/Pubs\/TechRpts\/1986\/734.html","DOI":"10.21236\/ADA606736"},{"issue":"20","key":"692_CR45","doi-asserted-by":"crossref","first-page":"i403","DOI":"10.1093\/bioinformatics\/bti1043","volume":"1","author":"AD Smith","year":"2005","unstructured":"Smith AD, Sumazin P, Das D, Zhang MQ (2005) Mining ChIP-chip data for transcription factor and cofactor binding sites. Bioinformatics Suppl 1(20):i403\u2013i412","journal-title":"Bioinformatics Suppl"},{"key":"692_CR46","unstructured":"Smyth MS, Martin JH (2000) X-ray crystallography. Mol Pathol 53(1):8\u201314"},{"key":"692_CR47","first-page":"241","volume":"17","author":"GD Stormo","year":"1988","unstructured":"Stormo GD (1988) Computer methods for analyzing sequence recognition of nucleic acids. Annu Rev BioChem 17:241\u2013263","journal-title":"Annu Rev BioChem"},{"key":"692_CR48","doi-asserted-by":"crossref","first-page":"e38","DOI":"10.1371\/journal.pbio.0060038","volume":"6","author":"BB Tuch","year":"2008","unstructured":"Tuch BB, Galgoczy DJ, Hernday AD, Li H, Johnson AD (2008) The evolution of combinatorial gene regulation in fungi. PLoS Biol 6:e38","journal-title":"PLoS Biol"},{"key":"692_CR49","doi-asserted-by":"crossref","unstructured":"Veitch EW (1952) A chart method for simplifying truth functions. In: Proceedings of the 1952 ACM national meeting, Pittsburgh. ACM, New York, pp 127\u2013133. doi: 10.1145\/609784.609801","DOI":"10.1145\/609784.609801"},{"key":"692_CR50","doi-asserted-by":"crossref","first-page":"1409","DOI":"10.1093\/nar\/27.6.1409","volume":"27","author":"M Wegner","year":"1999","unstructured":"Wegner M (1999) From head to toes: the multiple facets of Sox proteins. 
Nucleic Acids Res 27:1409\u20131420","journal-title":"Nucleic Acids Res"},{"key":"692_CR51","unstructured":"White RJ (2001) Gene transcription: mechanisms and control. Blackwell, Oxford"},{"key":"692_CR52","doi-asserted-by":"crossref","first-page":"552","DOI":"10.1016\/S0959-437X(98)80010-5","volume":"8","author":"C Wolberger","year":"1998","unstructured":"Wolberger C (1998) Combinatorial transcription factors. Curr Opin Genet Dev 8:552\u2013559","journal-title":"Curr Opin Genet Dev"},{"key":"692_CR53","doi-asserted-by":"crossref","unstructured":"Wong KC, Leung KS, Wong MH (2009) An evolutionary algorithm with species-specific explosion for multimodal optimization. In: Proceedings of the 11th Annual conference on genetic and evolutionary computation. ACM, New York, pp 923\u2013930. doi: 10.1145\/1569901.1570027","DOI":"10.1145\/1569901.1570027"},{"key":"692_CR54","doi-asserted-by":"crossref","unstructured":"Wong KC, Leung KS, Wong MH (2010a) Effect of spatial locality on an evolutionary algorithm for multimodal optimization. In: Applications of Evolutionary Computation, EvoApplications 2010 Part I. Lecture notes in computer science, vol 6024. Springer, Berlin, pp 481\u2013490. doi: 10.1007\/978-3-642-12239-2_50","DOI":"10.1007\/978-3-642-12239-2_50"},{"key":"692_CR55","doi-asserted-by":"crossref","unstructured":"Wong KC, Leung KS, Wong MH (2010b) Protein structure prediction on a lattice model via multimodal optimization techniques. In: Proceedings of the 12th annual conference on genetic and evolutionary computation. ACM, New York, pp 155\u2013162. doi: 10.1145\/1830483.1830513","DOI":"10.1145\/1830483.1830513"},{"key":"692_CR56","doi-asserted-by":"crossref","unstructured":"Zhou Q, Liu JS (2008) Extracting sequence features to predict protein-DNA interactions: a comparative study. Nucleic Acids Res 36(12):4137\u20134148. 
doi: 10.1093\/nar\/gkn361","DOI":"10.1093\/nar\/gkn361"}],"container-title":["Soft Computing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s00500-011-0692-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s00500-011-0692-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s00500-011-0692-5","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,6,8]],"date-time":"2019-06-08T10:16:02Z","timestamp":1559988962000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s00500-011-0692-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,2,5]]},"references-count":56,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2011,8]]}},"alternative-id":["692"],"URL":"https:\/\/doi.org\/10.1007\/s00500-011-0692-5","relation":{},"ISSN":["1432-7643","1433-7479"],"issn-type":[{"value":"1432-7643","type":"print"},{"value":"1433-7479","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,2,5]]}}}