{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,5,21]],"date-time":"2025-05-21T06:55:13Z","timestamp":1747810513256},"reference-count":66,"publisher":"Springer Science and Business Media LLC","issue":"S5","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2007,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Identification of protein interacting sites is an important task in computational molecular biology. As more and more protein sequences are deposited without available structural information, it is strongly desirable to predict protein binding regions by their sequences alone. This paper presents a pattern mining approach to tackle this problem. It is observed that a functional region of protein structures usually consists of several peptide segments linked with large wildcard regions. Thus, the proposed mining technology considers large irregular gaps when growing patterns, in order to find the residues that are simultaneously conserved but largely separated on the sequences. A derived pattern is called a cluster-like pattern since the discovered conserved residues are always grouped into several blocks, which each corresponds to a local conserved region on the protein sequence.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>The experiments conducted in this work demonstrate that the derived long patterns automatically discover the important residues that form one or several hot regions of protein-protein interactions. The methodology is evaluated by conducting experiments on the web server MAGIIC-PRO based on a well known benchmark containing 220 protein chains from 72 distinct complexes. Among the tested 218 proteins, there are 900 sequential blocks discovered, 4.25 blocks per protein chain on average. 
About 92% of the derived blocks are observed to be clustered in space with at least one of the other blocks, and about 66% of the blocks are found to be near the interface of protein-protein interactions. It is summarized that for about 83% of the tested proteins, at least two interacting blocks can be discovered by this approach.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>This work aims to demonstrate that the important residues associated with the interface of protein-protein interactions may be automatically discovered by sequential pattern mining. The detected regions possess high conservation and thus are considered as the computational hot regions. This information would be useful to characterizing protein sequences, predicting protein function, finding potential partners, and facilitating protein docking for drug discovery.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-8-s5-s8","type":"journal-article","created":{"date-parts":[[2007,5,25]],"date-time":"2007-05-25T09:06:22Z","timestamp":1180083982000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":34,"title":["Identification of hot regions in protein-protein interactions by sequential pattern 
mining"],"prefix":"10.1186","volume":"8","author":[{"given":"Chen-Ming","family":"Hsu","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chien-Yu","family":"Chen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Baw-Jhiune","family":"Liu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chih-Chang","family":"Huang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Min-Hung","family":"Laio","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chien-Chieh","family":"Lin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tzung-Lin","family":"Wu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2007,5,24]]},"reference":[{"key":"1921_CR1","doi-asserted-by":"crossref","unstructured":"Hsu CM, Chen CY, Liu BJ: MAGIIC-PRO: detecting functional signatures by efficient discovery of long patterns in protein sequences. Nucleic Acids Res 2006, (34 Web Server):W356-W361. 10.1093\/nar\/gkl309","DOI":"10.1093\/nar\/gkl309"},{"key":"1921_CR2","doi-asserted-by":"publisher","first-page":"957","DOI":"10.1016\/0022-2836(87)90501-8","volume":"195","author":"MJ Zvelvbil","year":"1987","unstructured":"Zvelvbil MJ, Barton GJ, Taylor WR, Sternberg MJ: Prediction of protein secondary structure and active sites using the alignment of homologous sequences. J Mol Biol 1987, 195: 957\u2013961. 
10.1016\/0022-2836(87)90501-8","journal-title":"J Mol Biol"},{"key":"1921_CR3","doi-asserted-by":"publisher","first-page":"589","DOI":"10.1093\/protein\/2.8.589","volume":"2","author":"A Godzik","year":"1989","unstructured":"Godzik A, Sander C: Conservation of residue interactions in a family of Ca-binding proteins. Protein Eng 1989, 2: 589\u2013596. 10.1093\/protein\/2.8.589","journal-title":"Protein Eng"},{"key":"1921_CR4","doi-asserted-by":"publisher","first-page":"227","DOI":"10.1002\/prot.10146","volume":"48","author":"WS Valdar","year":"2002","unstructured":"Valdar WS: Scoring residue conservation. Proteins 2002, 48: 227\u2013241. 10.1002\/prot.10146","journal-title":"Proteins"},{"key":"1921_CR5","first-page":"745","volume":"9","author":"CD Livingstone","year":"1993","unstructured":"Livingstone CD, Barton GJ: Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation. Comput Appl Biosci 1993, 9: 745\u2013756.","journal-title":"Comput Appl Biosci"},{"key":"1921_CR6","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1038\/nsb0295-171","volume":"2","author":"G Casari","year":"1995","unstructured":"Casari G, Sander C, Valencia A: A method to predict functional residues in proteins. Nat Struct Biol 1995, 2: 171\u2013178. 10.1038\/nsb0295-171","journal-title":"Nat Struct Biol"},{"key":"1921_CR7","doi-asserted-by":"publisher","first-page":"447","DOI":"10.1006\/jmbi.2000.4474","volume":"307","author":"A Armon","year":"2001","unstructured":"Armon A, Graur D, Ben-Tal N: ConSurf: an algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information. J Mol Biol 2001, 307: 447\u2013463. 10.1006\/jmbi.2000.4474","journal-title":"J Mol Biol"},{"key":"1921_CR8","doi-asserted-by":"publisher","first-page":"216","DOI":"10.1038\/nature01513","volume":"422","author":"A Sali","year":"2003","unstructured":"Sali A, et al.: From words to literature in structural proteomics. 
Nature 2003, 422: 216\u2013225. 10.1038\/nature01513","journal-title":"Nature"},{"key":"1921_CR9","doi-asserted-by":"publisher","first-page":"951","DOI":"10.1038\/nbt1103","volume":"23","author":"DR Rhodes","year":"2005","unstructured":"Rhodes DR, et al.: Probabilistic model of the human protein-protein interaction network. Nat Biotechnol 2005, 23: 951\u2013959. 10.1038\/nbt1103","journal-title":"Nat Biotechnol"},{"key":"1921_CR10","doi-asserted-by":"publisher","first-page":"30","DOI":"10.1002\/prot.340210105","volume":"21","author":"J Janin","year":"1995","unstructured":"Janin J: Elusive affinities. Proteins 1995, 21: 30\u201339. 10.1002\/prot.340210105","journal-title":"Proteins"},{"key":"1921_CR11","doi-asserted-by":"publisher","first-page":"999","DOI":"10.1093\/protein\/10.9.999","volume":"10","author":"D Xu","year":"1997","unstructured":"Xu D, et al.: Hydrogen bonds and salt bridges across protein-protein interfaces. Protein Eng 1997, 10: 999\u20131012. 10.1093\/protein\/10.9.999","journal-title":"Protein Eng"},{"key":"1921_CR12","doi-asserted-by":"publisher","first-page":"2177","DOI":"10.1006\/jmbi.1998.2439","volume":"285","author":"L Lo Conte","year":"1999","unstructured":"Lo Conte L, Chothia C, Janin J: The atomic structure of protein-protein recognition sites. J Mol Biol 1999, 285: 2177\u20132198. 10.1006\/jmbi.1998.2439","journal-title":"J Mol Biol"},{"key":"1921_CR13","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1016\/S0959-440X(02)00284-1","volume":"12","author":"O Lichtarge","year":"2002","unstructured":"Lichtarge O, Sowa ME: Evolutionary predictions of binding surfaces and interactions. Curr Opin Struct Biol 2002, 12: 21\u201327. 
10.1016\/S0959-440X(02)00284-1","journal-title":"Curr Opin Struct Biol"},{"key":"1921_CR14","doi-asserted-by":"publisher","first-page":"342","DOI":"10.1006\/jmbi.1996.0167","volume":"257","author":"O Lichtarge","year":"1996","unstructured":"Lichtarge O, Bourne HR, Cohen FE: An evolutionary trace method defines binding surfaces common to protein families. J Mol Biol 1996, 257: 342\u2013358. 10.1006\/jmbi.1996.0167","journal-title":"J Mol Biol"},{"issue":"1","key":"1921_CR15","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1006\/jmbi.1998.1843","volume":"280","author":"AA Bogan","year":"1998","unstructured":"Bogan AA, Thorn KS: Anatomy of hot spots in protein interfaces. J Mol Biol 1998, 280(1):1\u20139. 10.1006\/jmbi.1998.1843","journal-title":"J Mol Biol"},{"key":"1921_CR16","doi-asserted-by":"publisher","first-page":"284","DOI":"10.1093\/bioinformatics\/17.3.284","volume":"17","author":"KS Thorn","year":"2001","unstructured":"Thorn KS, Bogan AA: ASEdb: a database of alanine mutations and their effects on the free energy of binding in protein interactions. Bioinformatics 2001, 17: 284\u2013285. 10.1093\/bioinformatics\/17.3.284","journal-title":"Bioinformatics"},{"key":"1921_CR17","doi-asserted-by":"publisher","first-page":"1281","DOI":"10.1016\/j.jmb.2004.10.077","volume":"345","author":"O Keskin","year":"2005","unstructured":"Keskin O, Ma B, Nussinov R: Hot regions in protein-protein interactions: the organization and contribution of structurally conserved hot spot residues. J Mol Biol 2005, 345: 1281\u20131294. 10.1016\/j.jmb.2004.10.077","journal-title":"J Mol Biol"},{"issue":"8","key":"1921_CR18","doi-asserted-by":"publisher","first-page":"3407","DOI":"10.1073\/pnas.88.8.3407","volume":"88","author":"BC Cunningham","year":"1991","unstructured":"Cunningham BC, Wells JA: Rational design of receptor-specific variants of human growth hormone. Proceedings of the National Academy of Sciences of the United States of America 1991, 88(8):3407\u20133411. 
10.1073\/pnas.88.8.3407","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"1921_CR19","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1126\/science.7529940","volume":"267","author":"T Clackson","year":"1995","unstructured":"Clackson T, Wells JA: A hot spot of binding energy in a hormone-receptor interface. Science 1995, 267: 383\u2013386. 10.1126\/science.7529940","journal-title":"Science"},{"key":"1921_CR20","doi-asserted-by":"publisher","first-page":"781","DOI":"10.1016\/j.jmb.2004.09.051","volume":"344","author":"X Li","year":"2004","unstructured":"Li X, Keskin O, Ma B, Nussinov R, Liang J: Protein-protein interactions: hot spots and structurally conserved residues often locate in complemented pockets that pre-organized in the unbound states: implications for docking. J Mol Biol 2004, 344: 781\u2013795. 10.1016\/j.jmb.2004.09.051","journal-title":"J Mol Biol"},{"issue":"10","key":"1921_CR21","doi-asserted-by":"publisher","first-page":"5772","DOI":"10.1073\/pnas.1030237100","volume":"100","author":"B Ma","year":"2003","unstructured":"Ma B, Elkayam T, Wolfson H, Nussinov R: Protein-protein interactions: structurally conserved residues distinguish between binding sites and exposed protein surfaces. Proceedings of the National Academy of Sciences of the United States of America 2003, 100(10):5772\u20135777. 10.1073\/pnas.1030237100","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"1921_CR22","doi-asserted-by":"publisher","first-page":"943","DOI":"10.1016\/j.jmb.2003.12.073","volume":"336","author":"RP Bahadur","year":"2004","unstructured":"Bahadur RP, et al.: A dissecting of specific and non-specific protein-protein interfaces. J Mol Biol 2004, 336: 943\u2013955. 
10.1016\/j.jmb.2003.12.073","journal-title":"J Mol Biol"},{"key":"1921_CR23","doi-asserted-by":"publisher","first-page":"334","DOI":"10.1002\/prot.10085","volume":"47","author":"P Chakrabarti","year":"2002","unstructured":"Chakrabarti P, Janin J: Dissecting protein-protein recognition sites. Proteins 2002, 47: 334\u2013343. 10.1002\/prot.10085","journal-title":"Proteins"},{"key":"1921_CR24","doi-asserted-by":"publisher","first-page":"705","DOI":"10.1038\/256705a0","volume":"256","author":"C Chotia","year":"1975","unstructured":"Chotia C, Janin J: Principles of protein-protein recognition. Nature 1975, 256: 705\u2013708. 10.1038\/256705a0","journal-title":"Nature"},{"issue":"1","key":"1921_CR25","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1073\/pnas.93.1.13","volume":"93","author":"S Jones","year":"1996","unstructured":"Jones S, Thornton JM: Principles of protein-protein interactions. Proceedings of the National Academy of Sciences of the United States of America 1996, 93(1):13\u201320. 10.1073\/pnas.93.1.13","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"5","key":"1921_CR26","doi-asserted-by":"publisher","first-page":"2177","DOI":"10.1006\/jmbi.1998.2439","volume":"285","author":"L Lo Conte","year":"1999","unstructured":"Lo Conte L, et al.: The atomic structure of protein-protein recognition sites. J Mol Biol 1999, 285(5):2177\u20132198. 10.1006\/jmbi.1998.2439","journal-title":"J Mol Biol"},{"key":"1921_CR27","doi-asserted-by":"publisher","first-page":"991","DOI":"10.1016\/S0022-2836(02)01281-0","volume":"325","author":"IMA Nooren","year":"2003","unstructured":"Nooren IMA, Thornton JM: Structural characterization and functional significance of transient protein-protein interactions. J Mol Biol 2003, 325: 991\u20131018. 
10.1016\/S0022-2836(02)01281-0","journal-title":"J Mol Biol"},{"key":"1921_CR28","doi-asserted-by":"publisher","first-page":"377","DOI":"10.1016\/S0022-2836(02)01223-8","volume":"325","author":"Y Ofran","year":"2003","unstructured":"Ofran Y, Rost B: Analysing six types of protein-protein interfaces. J Mol Biol 2003, 325: 377\u2013387. 10.1016\/S0022-2836(02)01223-8","journal-title":"J Mol Biol"},{"key":"1921_CR29","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1006\/jmbi.1997.1234","volume":"272","author":"S Jones","year":"1997","unstructured":"Jones S, Thornton JM: Analysis of protein-protein interaction sites using surface patches. J Mol Biol 1997, 272: 121\u2013132. 10.1006\/jmbi.1997.1234","journal-title":"J Mol Biol"},{"key":"1921_CR30","doi-asserted-by":"publisher","first-page":"133","DOI":"10.1006\/jmbi.1997.1233","volume":"272","author":"S Jones","year":"1997","unstructured":"Jones S, Thornton JM: Prediction of protein-protein interaction site using surface patches. J Mol Biol 1997, 272: 133\u2013143. 10.1006\/jmbi.1997.1233","journal-title":"J Mol Biol"},{"key":"1921_CR31","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1016\/j.jmb.2004.02.040","volume":"338","author":"H Neuvirth","year":"2004","unstructured":"Neuvirth H, Raz R, Schreiber G: ProMate: a structure based prediction program to identify the location of protein-protein binding sites. J Mol Biol 2004, 338: 181\u2013199. 10.1016\/j.jmb.2004.02.040","journal-title":"J Mol Biol"},{"key":"1921_CR32","doi-asserted-by":"publisher","first-page":"1335","DOI":"10.1093\/bioinformatics\/btl079","volume":"22","author":"NJ Burgoyne","year":"2006","unstructured":"Burgoyne NJ, Jackson RM: Predicting protein interaction sites: binding hot-spots in protein-protein and protein-ligand interfaces. Bioinformatics 2006, 22: 1335\u20131342. 
10.1093\/bioinformatics\/btl079","journal-title":"Bioinformatics"},{"key":"1921_CR33","doi-asserted-by":"publisher","first-page":"3698","DOI":"10.1093\/nar\/gkl454","volume":"34","author":"S Liang","year":"2006","unstructured":"Liang S, Zhang C, Song L, Zhou Y: Protein binding site prediction using an empirical scoring function. Nucleic Acids Res 2006, 34: 3698\u20133707. 10.1093\/nar\/gkl454","journal-title":"Nucleic Acids Res"},{"key":"1921_CR34","doi-asserted-by":"publisher","first-page":"1356","DOI":"10.1046\/j.1432-1033.2002.02767.x","volume":"269","author":"P Fariselli","year":"2002","unstructured":"Fariselli P, Pazos F, Valencia A, Casadio R: Prediction of protein-protein interaction sites in heterocomplexes with neural networks. Eur J Biochem 2002, 269: 1356\u20131361. 10.1046\/j.1432-1033.2002.02767.x","journal-title":"Eur J Biochem"},{"issue":"8","key":"1921_CR35","doi-asserted-by":"publisher","first-page":"1487","DOI":"10.1093\/bioinformatics\/bti242","volume":"21","author":"JR Bradford","year":"2005","unstructured":"Bradford JR, Westhead DR: Improved prediction of protein-protein binding sites using a support vector machines approach. Bioinformatics 2005, 21(8):1487\u20131494. 10.1093\/bioinformatics\/bti242","journal-title":"Bioinformatics"},{"key":"1921_CR36","doi-asserted-by":"publisher","first-page":"884","DOI":"10.1110\/ps.03465504","volume":"13","author":"AR Panchenko","year":"2004","unstructured":"Panchenko AR, Kondrashov F, Bryant S: Prediction of functional sites by analysis of sequence and structure conservation. Protein Science 2004, 13: 884\u2013892. 10.1110\/ps.03465504","journal-title":"Protein Science"},{"key":"1921_CR37","doi-asserted-by":"publisher","first-page":"190","DOI":"10.1110\/ps.03323604","volume":"13","author":"DR Caffrey","year":"2004","unstructured":"Caffrey DR, et al.: Are protein-protein interfaces more conserved in sequence than the rest of the protein surface. Protein Science 2004, 13: 190\u2013202. 
10.1110\/ps.03323604","journal-title":"Protein Science"},{"key":"1921_CR38","doi-asserted-by":"publisher","first-page":"331","DOI":"10.1002\/(SICI)1097-0134(20000601)39:4<331::AID-PROT60>3.0.CO;2-A","volume":"39","author":"Z Hu","year":"2000","unstructured":"Hu Z, Ma B, Wolfson H, Nussinov R: Conservation of polar residues as hot spots at protein interfaces. Proteins 2000, 39: 331\u2013342. 10.1002\/(SICI)1097-0134(20000601)39:4<331::AID-PROT60>3.0.CO;2-A","journal-title":"Proteins"},{"key":"1921_CR39","first-page":"401","volume-title":"Pac Symp Biocomput","author":"C Ouzounis","year":"1998","unstructured":"Ouzounis C, Perez-Irratxeta C, Sander C, Valencia A: Are binding residues conserved? Pac Symp Biocomput 1998, 401\u2013412."},{"key":"1921_CR40","doi-asserted-by":"publisher","first-page":"395","DOI":"10.1006\/jmbi.2001.4870","volume":"311","author":"P Aloy","year":"2001","unstructured":"Aloy P, Querol E, Aviles FX, Sternberg MJ: Automated structure-based prediction of functional sites in proteins: applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking. J Mol Biol 2001, 311: 395\u2013408. 10.1006\/jmbi.2001.4870","journal-title":"J Mol Biol"},{"key":"1921_CR41","doi-asserted-by":"publisher","first-page":"2496","DOI":"10.1093\/bioinformatics\/bti340","volume":"21","author":"I Res","year":"2005","unstructured":"Res I, Mihalek I, Lichtarge O: An evolution based classifier for prediction of protein interfaces without using protein structures. Bioinformatics 2005, 21: 2496\u20132501. 10.1093\/bioinformatics\/bti340","journal-title":"Bioinformatics"},{"key":"1921_CR42","doi-asserted-by":"publisher","first-page":"236","DOI":"10.1016\/S0014-5793(03)00456-3","volume":"544","author":"Y Ofran","year":"2003","unstructured":"Ofran Y, Rost B: Predicted protein-protein interaction sites from local sequence information. FEBS Lett 2003, 544: 236\u2013239. 
10.1016\/S0014-5793(03)00456-3","journal-title":"FEBS Lett"},{"issue":"Suppl 1","key":"1921_CR43","doi-asserted-by":"publisher","first-page":"i371","DOI":"10.1093\/bioinformatics\/bth920","volume":"20","author":"C Yan","year":"2004","unstructured":"Yan C, et al.: A two-stage classifier for identification of protein-protein interface residues. Bioinformatics 2004, 20(Suppl 1):i371-i378. 10.1093\/bioinformatics\/bth920","journal-title":"Bioinformatics"},{"issue":"1","key":"1921_CR44","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1006\/jmbi.2001.5327","volume":"316","author":"S Madabushi","year":"2002","unstructured":"Madabushi S, Yao H, Marsh M, Kristensen DM, Philippi A, Sowa ME, Lichtarge O: Structural clusters of evolutionary trace residues are statistically significant and common in proteins. J Mol Biol 2002, 316(1):139\u2013154. 10.1006\/jmbi.2001.5327","journal-title":"J Mol Biol"},{"issue":"4","key":"1921_CR45","doi-asserted-by":"publisher","first-page":"917","DOI":"10.1006\/jmbi.2000.4092","volume":"302","author":"X Gallet","year":"2000","unstructured":"Gallet X, Charloteaux B, Thomas A, Brasseur R: A fast method to predict protein interaction sites from sequences. J Mol Biol 2000, 302(4):917\u2013926. 10.1006\/jmbi.2000.4092","journal-title":"J Mol Biol"},{"key":"1921_CR46","doi-asserted-by":"publisher","first-page":"1424","DOI":"10.1109\/TKDE.2004.77","volume":"16","author":"J Pei","year":"2004","unstructured":"Pei J, Han J, Mortazavi-Asl B, Wang J, Pinto H, Chen Q, Dayal U, Hsu MC: Mining sequential patterns by pattern-growth: the PrefixSpan approach. IEEE Transactions on Knowledge and Data Engineering 2004, 16: 1424\u20131440. 
10.1109\/TKDE.2004.77","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"1921_CR47","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1007\/11731139_62","volume-title":"Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining: 9\u201312 April 2006; Singapore","author":"CM Hsu","year":"2006","unstructured":"Hsu CM, Chen CY, Hsu CC, Liu BJ: Efficient discovery of structural motifs from protein sequences with combination of flexible intra- and inter-block gap constraints. In Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining: 9\u201312 April 2006; Singapore. Volume LNCS 3918. Edited by: Carbonell JG, Siekmann J. Springer Berlin\/Heidelberg; 2006:530\u2013539."},{"key":"1921_CR48","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1093\/bioinformatics\/14.1.55","volume":"14","author":"I Rigoutsos","year":"1998","unstructured":"Rigoutsos I, Floratos A: Combinatorial pattern discovery in biological sequences: the TEIRESIAS algorithm. Bioinformatics 1998, 14: 55\u201367. 10.1093\/bioinformatics\/14.1.55","journal-title":"Bioinformatics"},{"key":"1921_CR49","first-page":"509","volume":"13","author":"I Jonassen","year":"1997","unstructured":"Jonassen I: Efficient discovery of conserved patterns using a pattern graph. Comput Appl Biosci 1997, 13: 509\u2013522.","journal-title":"Comput Appl Biosci"},{"issue":"4","key":"1921_CR50","doi-asserted-by":"publisher","first-page":"341","DOI":"10.1093\/bioinformatics\/16.4.341","volume":"16","author":"A Califano","year":"2000","unstructured":"Califano A: SPLASH: structural pattern localization analysis by sequential histograms. Bioinformatics 2000, 16(4):341\u2013347. 10.1093\/bioinformatics\/16.4.341","journal-title":"Bioinformatics"},{"key":"1921_CR51","volume-title":"Protein structure and function","author":"AP Gregory","year":"2003","unstructured":"Gregory AP, Dagmar R: Protein motifs. In Protein structure and function. 
4th edition. Edited by: Gregory AP, Dagmar R. Waltham, MA: New Science Press; 2003.","edition":"4"},{"key":"1921_CR52","doi-asserted-by":"publisher","first-page":"1487","DOI":"10.1006\/jmbi.2001.4540","volume":"307","author":"R Landgraf","year":"2001","unstructured":"Landgraf R, Xenarios I, Eisenberg D: Three-dimensional cluster analysis identifies interfaces and functional residue clusters in protein. J Mol Biol 2001, 307: 1487\u20131502. 10.1006\/jmbi.2001.4540","journal-title":"J Mol Biol"},{"key":"1921_CR53","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/nar\/28.1.235","volume":"28","author":"HM Berman","year":"2000","unstructured":"Berman HM, et al.: The Protein Data Bank. Nucleic Acids Res 2000, 28: 235\u2013242. 10.1093\/nar\/28.1.235","journal-title":"Nucleic Acids Res"},{"issue":"2","key":"1921_CR54","doi-asserted-by":"publisher","first-page":"214","DOI":"10.1002\/prot.20560","volume":"60","author":"J Mintseris","year":"2005","unstructured":"Mintseris J, Wiehe K, Pierce B, Anderson R, Chen R, Janin J, Weng Z: Protein-Protein Docking Benchmark 2.0: an update. Proteins 2005, 60(2):214\u2013216. 10.1002\/prot.20560","journal-title":"Proteins"},{"key":"1921_CR55","doi-asserted-by":"publisher","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","volume":"22","author":"W Li","year":"2006","unstructured":"Li W, Godzik A: CD-HIT: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22: 1658\u20131659. 10.1093\/bioinformatics\/btl158","journal-title":"Bioinformatics"},{"key":"1921_CR56","unstructured":"Online supplement of this paper[http:\/\/biominer.bime.ntu.edu.tw\/hotregions]"},{"key":"1921_CR57","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1002\/prot.10365","volume":"52","author":"O Schueler-Furman","year":"2003","unstructured":"Schueler-Furman O, Baker D: Conserved residue clustering and protein structure prediction. Proteins 2003, 52: 225\u2013235. 
10.1002\/prot.10365","journal-title":"Proteins"},{"key":"1921_CR58","doi-asserted-by":"publisher","first-page":"479","DOI":"10.1093\/protein\/5.6.479","volume":"5","author":"A Ogiwara","year":"1992","unstructured":"Ogiwara A, Uchiyama I, Yasuhiko S, Kanehisa M: Construction of dictionary of sequence motifs that characterize groups of related proteins. Protein Eng 1992, 5: 479\u2013488. 10.1093\/protein\/5.6.479","journal-title":"Protein Eng"},{"key":"1921_CR59","doi-asserted-by":"crossref","unstructured":"Chakrabarti S, Anand AP, Bhardwaj N, Pugalenthi G, Sowdhamini R: SCANMOT: searching for similar sequences using a simultaneous scan of multiple sequence motifs. Nucleic Acids Res 2005, (33 Web Server):W274-W276. 10.1093\/nar\/gki493","DOI":"10.1093\/nar\/gki493"},{"key":"1921_CR60","unstructured":"Hsu CM, Chen CY, Liu BJ: WildSpan: efficient discovery of functional motifs spanning large wildcard regions from protein sequences. Technical Report [http:\/\/biominer.bime.ntu.edu.tw\/wildspan\/]"},{"key":"1921_CR61","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"SF Altschul","year":"1997","unstructured":"Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997, 25: 3389\u20133402. 10.1093\/nar\/25.17.3389","journal-title":"Nucleic Acids Res"},{"key":"1921_CR62","doi-asserted-by":"crossref","unstructured":"Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Natale DA, O'Donovan C, Redaschi N, Yeh LS: The universal protein resource (UniProt). Nucl Acids Res 2005, (33 Database):D154-D159.","DOI":"10.1093\/nar\/gki070"},{"key":"1921_CR63","doi-asserted-by":"crossref","unstructured":"Pei J, Han J, Wang W: Mining sequential patterns with constraints in large database. 
In Proceedings of the 11th ACM International Conference on Information and Knowledge Management: 4\u20139 November 2002; McLean. ACM Press; 18\u201325.","DOI":"10.1145\/584792.584799"},{"issue":"22","key":"1921_CR64","doi-asserted-by":"publisher","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","volume":"89","author":"S Henikoff","year":"1992","unstructured":"Henikoff S, Henikoff JG: Amino acid substitution matrices from protein blocks. Proceedings of the National Academy of Sciences of the United States of America 1992, 89(22):10915\u201310919. 10.1073\/pnas.89.22.10915","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"1921_CR65","unstructured":"BLAST Database[ftp:\/\/ftp.ncbi.nlm.nih.gov\/blast\/db\/]"},{"key":"1921_CR66","doi-asserted-by":"crossref","unstructured":"Landau M, Mayrose I, Rosenberg Y, Glaser F, Martz E, Pupko T, Ben-Tal N: ConSurf: identification of functional regions in proteins by surface-mapping of phylogenetic information. Nucleic Acids Res 2005, (33 Web Server):W299-W302. 
10.1093\/nar\/gki370","DOI":"10.1093\/nar\/gki370"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-8-S5-S8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,12]],"date-time":"2023-05-12T00:42:55Z","timestamp":1683852175000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-8-S5-S8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,5]]},"references-count":66,"journal-issue":{"issue":"S5","published-print":{"date-parts":[[2007,5]]}},"alternative-id":["1921"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-8-s5-s8","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2007,5]]},"assertion":[{"value":"24 May 2007","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S8"}}