{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,26]],"date-time":"2026-01-26T00:13:21Z","timestamp":1769386401966,"version":"3.49.0"},"reference-count":51,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2010,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>This paper deals with the preprocessing of protein sequences for supervised classification. Motif extraction is one way to address that task. It has been largely used to encode biological sequences into feature vectors to enable using well-known machine-learning classifiers which require this format. However, designing a suitable feature space, for a set of proteins, is not a trivial task. For this purpose, we propose a novel encoding method that uses amino-acid substitution matrices to define similarity between motifs during the extraction step.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>In order to demonstrate the efficiency of such approach, we compare several encoding methods using some machine learning classifiers. The experimental results showed that our encoding method outperforms other ones in terms of classification accuracy and number of generated attributes. We also compared the classifiers in term of accuracy. Results indicated that SVM generally outperforms the other classifiers with any encoding method. We showed that SVM, coupled with our encoding method, can be an efficient protein classification system. In addition, we studied the effect of the substitution matrices variation on the quality of our method and hence on the classification quality. We noticed that our method enables good classification accuracies with all the substitution matrices and that the variances of the obtained accuracies using various substitution matrices are slight. However, the number of generated features varies from a substitution matrix to another. Furthermore, the use of already published datasets allowed us to carry out a comparison with several related works.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>The outcomes of our comparative experiments confirm the efficiency of our encoding method to represent protein sequences in classification tasks.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-11-175","type":"journal-article","created":{"date-parts":[[2010,5,11]],"date-time":"2010-05-11T18:14:37Z","timestamp":1273601677000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":50,"title":["Protein sequences classification by means of feature extraction with substitution matrices"],"prefix":"10.1186","volume":"11","author":[{"given":"Rabie","family":"Saidi","sequence":"first","affiliation":[]},{"given":"Mondher","family":"Maddouri","sequence":"additional","affiliation":[]},{"given":"Engelbert","family":"Mephu Nguifo","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2010,4,8]]},"reference":[{"key":"3632_CR1","doi-asserted-by":"publisher","first-page":"1104","DOI":"10.1016\/j.compbiomed.2005.09.002","volume":"36","author":"H Bhaskar","year":"2005","unstructured":"Bhaskar H, Hoyle DC, Singh S: Machine learning in bioinformatics: A brief survey and recommendations for practitioners. Computers in Biology and Medicine 2005, 36: 1104\u20131125. 10.1016\/j.compbiomed.2005.09.002","journal-title":"Computers in Biology and Medicine"},{"key":"3632_CR2","doi-asserted-by":"publisher","first-page":"492","DOI":"10.1186\/1471-2105-9-492","volume":"9","author":"J Bernardes","year":"2008","unstructured":"Bernardes J, Fernandez J, Vasconcelos A: Structural descriptor database: a new tool for sequence based functional site prediction. BMC Bioinformatics 2008, 9: 492. 10.1186\/1471-2105-9-492","journal-title":"BMC Bioinformatics"},{"key":"3632_CR3","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1016\/0005-2795(75)90289-5","volume":"386","author":"C Klotz","year":"1975","unstructured":"Klotz C, Aumont MC, Leger JJ, Swynghedauw B: Human cardiac myosin ATPase and light subunits: A comparative study. Biochim Biophys 1975, 386: 461\u2013469.","journal-title":"Biochim Biophys"},{"key":"3632_CR4","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1093\/oso\/9780199633968.003.0007","volume-title":"Mechanisms of Protein Folding","author":"NC Price","year":"1994","unstructured":"Price NC: Assembly of multi-subunit structure. In Mechanisms of Protein Folding. Edited by: Pain RH. Oxford University Press, New York; 1994:160\u2013193."},{"key":"3632_CR5","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1016\/S1359-0278(96)00040-5","volume":"1","author":"FJ Corrales","year":"1996","unstructured":"Corrales FJ, Fersht AR: Kinetic significance of GroEL14\u00b7(GroES7)2complexes in molecular chaperone activity. Folding & Design 1996, 1: 265\u2013273.","journal-title":"Folding & Design"},{"key":"3632_CR6","doi-asserted-by":"publisher","first-page":"697","DOI":"10.1002\/(SICI)1096-9888(199808)33:8<697::AID-JMS711>3.0.CO;2-H","volume":"33","author":"BF Terry","year":"1998","unstructured":"Terry BF, Richard MC: Determination of protein-protein interactions by matrix-assisted laser desorption\/ionization mass spectrometry. J Mass Spectrom 1998, 33: 697\u2013704. 10.1002\/(SICI)1096-9888(199808)33:8<697::AID-JMS711>3.0.CO;2-H","journal-title":"J Mass Spectrom"},{"key":"3632_CR7","doi-asserted-by":"publisher","first-page":"517","DOI":"10.1186\/1471-2105-9-517","volume":"9","author":"P Slama","year":"2008","unstructured":"Slama P, Filippis I, Lappe M: Detection of protein catalytic residues at high precision using local network properties. BMC Bioinformatics 2008, 9: 517. 10.1186\/1471-2105-9-517","journal-title":"BMC Bioinformatics"},{"key":"3632_CR8","first-page":"292","volume-title":"ISMB","author":"E Mephu Nguifo","year":"1993","unstructured":"Mephu Nguifo E, Sallantin J: Prediction of Primate Splice Junction Gene Sequences with a Cooperative Knowledge Acquisition System. ISMB 1993, 292\u2013300."},{"key":"3632_CR9","first-page":"290","volume-title":"Pacific Symposium on Biocomputing","author":"E Lemoine","year":"1999","unstructured":"Lemoine E, Merceron D, Sallantin J, Mephu Nguifo E: Improving the Efficiency of a User-Driven Learning System with Reconfigurable Hardware. Application to DNA Splicing. Pacific Symposium on Biocomputing 1999, 290\u2013301."},{"key":"3632_CR10","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1016\/0300-9084(96)84765-1","volume":"78","author":"L Vignal","year":"1997","unstructured":"Vignal L, D'Aubenton-Carafa Y, Lisacek F, Mephu Nguifo E, Rouze P, Quinqueton J, Thermes C: Exon prediction in eucaryotic genomes. Biochimie 1997, 78: 327\u2013334. 10.1016\/0300-9084(96)84765-1","journal-title":"Biochimie"},{"key":"3632_CR11","volume-title":"Data Mining: Concepts and Techniques","author":"J Han","year":"2006","unstructured":"Han J, Kamber M: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers; 2006."},{"key":"3632_CR12","volume-title":"Feature Extraction, Construction and Selection: A Data Mining Perspective. ISBN: 978-0-7923-8196-9","author":"L Huan","year":"1998","unstructured":"Huan L, Motoda H: Feature Extraction, Construction and Selection: A Data Mining Perspective. ISBN: 978\u20130-7923\u20138196\u20139. Kluwer Academic Publishers, Norwell, MA; 1998."},{"issue":"11","key":"3632_CR13","doi-asserted-by":"publisher","first-page":"5865","DOI":"10.1073\/pnas.95.11.5865","volume":"95","author":"CG Nevill-Manning","year":"1998","unstructured":"Nevill-Manning CG, Wu TD, Brutlag DL: Highly specific protein sequence motifs for genome analysis. Proceedings of the National Academy of Sciences of the United States of America 1998, 95(11):5865\u20135871. 10.1073\/pnas.95.11.5865","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"1","key":"3632_CR14","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1007\/BF02944786","volume":"19","author":"M Maddouri","year":"2004","unstructured":"Maddouri M, Elloumi M: Encoding of primary structures of biological macromolecules within a data mining perspective. Journal of Computer Science and Technology (JCST) 2004, 19(1):78\u201388. 10.1007\/BF02944786","journal-title":"Journal of Computer Science and Technology (JCST)"},{"key":"3632_CR15","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1186\/1748-7188-1-21","volume":"1","author":"Z Yongqiang","year":"2006","unstructured":"Yongqiang Z, Zaki MJ: EXMOTIF: efficient structured motif extraction. Journal of Algorithms for Molecular Biology 2006, 1: 21. 10.1186\/1748-7188-1-21","journal-title":"Journal of Algorithms for Molecular Biology"},{"key":"3632_CR16","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1186\/1748-7188-1-22","volume":"1","author":"Z Yongqiang","year":"2006","unstructured":"Yongqiang Z, Zaki MJ: SMOTIF: efficient structured pattern and profile motif search. Journal of Algorithms for Molecular Biology 2006, 1: 22. 10.1186\/1748-7188-1-22","journal-title":"Journal of Algorithms for Molecular Biology"},{"key":"3632_CR17","doi-asserted-by":"publisher","first-page":"451","DOI":"10.1186\/1471-2105-9-451","volume":"9","author":"F Lopes","year":"2008","unstructured":"Lopes F, Martins D, Cesar R: Feature selection environment for genomic applications. BMC Bioinformatics 2008, 9: 451. 10.1186\/1471-2105-9-451","journal-title":"BMC Bioinformatics"},{"key":"3632_CR18","first-page":"224","volume-title":"Intl. conf. on BioInformatics Research and Development (BIRD), TU Berlin, March 12-14, Springer-Verlag, LNBI 4414","author":"R Saidi","year":"2007","unstructured":"Saidi R, Maddouri M, Mephu Nguifo E: Biological Sequences Encoding for Supervised Classification. Intl. conf. on BioInformatics Research and Development (BIRD), TU Berlin, March 12\u201314, Springer-Verlag, LNBI 4414 2007, 224\u2013238."},{"key":"3632_CR19","doi-asserted-by":"publisher","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","volume":"89","author":"S Henikoff","year":"1992","unstructured":"Henikoff S, Henikoff JG: Amino acid substitution matrices from protein blocks. National Academy of Sciences, USA 1992, 89: 10915\u201310919. 10.1073\/pnas.89.22.10915","journal-title":"National Academy of Sciences, USA"},{"issue":"3","key":"3632_CR20","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","volume":"215","author":"SF Altschul","year":"1990","unstructured":"Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. Journal of Molecular Biology 1990, 215(3):403\u2013413.","journal-title":"Journal of Molecular Biology"},{"key":"3632_CR21","first-page":"564","volume-title":"Pacific Symposium on Biocomputing (PSB;)","author":"C Leslie","year":"2002","unstructured":"Leslie C, Eskin E, Noble WS: The spectrum kernel: a string kernel for svm protein classification. Pacific Symposium on Biocomputing (PSB;) 2002, 564\u2013575."},{"issue":"14","key":"3632_CR22","doi-asserted-by":"publisher","first-page":"2769","DOI":"10.1093\/nar\/22.14.2769","volume":"22","author":"JTL Wang","year":"1994","unstructured":"Wang JTL, Marr TG, Shasha D, Shapiro BA, Chirn GW: Discovering active motifs in sets of related protein sequences and using them for classification. Nucleic Acids Research 1994, 22(14):2769\u20132775. 10.1093\/nar\/22.14.2769","journal-title":"Nucleic Acids Research"},{"key":"3632_CR23","doi-asserted-by":"publisher","first-page":"230","DOI":"10.1007\/3-540-56024-6_19","volume":"644","author":"LCK Hui","year":"1992","unstructured":"Hui LCK, Crochemore M, Galil Z, Manber U: Combinatorial Pattern Matching. Lecture Notes in Computer Science in Apostolico, Springer-Verlag 1992, 644: 230\u2013243.","journal-title":"Lecture Notes in Computer Science in Apostolico, Springer-Verlag"},{"key":"3632_CR24","doi-asserted-by":"publisher","first-page":"425","DOI":"10.1093\/protein\/8.5.425","volume":"8","author":"CT Zhang","year":"1995","unstructured":"Zhang CT, Chou KC, Maggiora GM: Predicting protein structural classes from amino acid composition: Application of fuzzy clustering. Protein Eng 1995, 8: 425\u2013435. 10.1093\/protein\/8.5.425","journal-title":"Protein Eng"},{"key":"3632_CR25","doi-asserted-by":"publisher","first-page":"282","DOI":"10.1002\/prot.10500","volume":"53","author":"KC Chou","year":"2003","unstructured":"Chou KC, Cai YD: Predicting protein quaternary structure by pseudo amino acid composition. Proteins 2003, 53: 282\u2013289. 10.1002\/prot.10500","journal-title":"Proteins"},{"key":"3632_CR26","doi-asserted-by":"publisher","first-page":"2390","DOI":"10.1093\/bioinformatics\/btg331","volume":"19","author":"SW Zhang","year":"2003","unstructured":"Zhang SW, Pan Q, Zhang HC, Zhang YL, Wang HY: Classification of protein quaternary structure with support vector machine. Bioinformatics 2003, 19: 2390\u20132396. 10.1093\/bioinformatics\/btg331","journal-title":"Bioinformatics"},{"key":"3632_CR27","doi-asserted-by":"publisher","first-page":"729","DOI":"10.1023\/A:1020713915365","volume":"17","author":"GP Zhou","year":"1998","unstructured":"Zhou GP: An intriguing controversy over protein structural class prediction. J Protein Chem 1998, 17: 729\u2013738. 10.1023\/A:1020713915365","journal-title":"J Protein Chem"},{"key":"3632_CR28","doi-asserted-by":"publisher","first-page":"D281","DOI":"10.1093\/nar\/gkm960","volume":"36","author":"RD Finn","year":"2008","unstructured":"Finn RD, Tate J, Mistry J, Coggill PC, Sammut JS, Hotz HR, Ceric G, Forslund K, Eddy SR, Sonnhammer EL, Bateman A: The Pfam protein families database. Nucleic Acids Research 2008, 36: D281-D288. 10.1093\/nar\/gkm960","journal-title":"Nucleic Acids Research"},{"key":"3632_CR29","first-page":"125","volume-title":"4th Symposium of Theory of Computing","author":"R Karp","year":"1972","unstructured":"Karp R, Miller RE, Rosenberg AL: Rapid Identification of Repeated Patterns in Strings, Trees and Arrays. 4th Symposium of Theory of Computing 1972, 125\u2013136."},{"issue":"7","key":"3632_CR30","doi-asserted-by":"publisher","first-page":"897","DOI":"10.1093\/bioinformatics\/btn052","volume":"24","author":"K Malde","year":"2008","unstructured":"Malde K: The effect of sequence quality on sequence alignment. Bioinformatics 2008, 24(7):897\u2013900. 10.1093\/bioinformatics\/btn052","journal-title":"Bioinformatics"},{"key":"3632_CR31","volume-title":"Data Mining: Practical machine learning tools and techniques","author":"IH Witten","year":"2005","unstructured":"Witten IH, Eibe F: Data Mining: Practical machine learning tools and techniques. 2nd edition. Morgan Kaufmann, San Francisco; 2005.","edition":"2"},{"key":"3632_CR32","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1186\/1471-2105-7-187","volume":"7","author":"X Yu","year":"2006","unstructured":"Yu X, Wang C, Li Y: Classification of protein quaternary structure by functional domain composition. BMC Bioinformatics 2006, 7: 187\u2013192. 10.1186\/1471-2105-7-187","journal-title":"BMC Bioinformatics"},{"key":"3632_CR33","doi-asserted-by":"publisher","first-page":"549","DOI":"10.1007\/978-1-4613-1571-1_12","volume-title":"Prediction of Protein Structure and the Principles of Protein Conformation","author":"PY Chou","year":"1989","unstructured":"Chou PY: Prediction of protein structural classes from amino acid composition. In Prediction of Protein Structure and the Principles of Protein Conformation. Edited by: Fasman GD. Plenum Press, New York; 1989:549\u2013586."},{"key":"3632_CR34","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1093\/oxfordjournals.jbchem.a135454","volume":"99","author":"H Nakashima","year":"1986","unstructured":"Nakashima H, Nishikawa K, Ooi T: The folding type of a protein is relevant to the amino acid composition. J Biochem 1986, 99: 152\u2013162.","journal-title":"J Biochem"},{"key":"3632_CR35","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1471-2105-2-3","volume":"2","author":"YD Cai","year":"2001","unstructured":"Cai YD, Liu XJ, Xu XB, Zhou GP: Support vector machines for predicting protein structural class. BMC Bioinform 2001, 2: 1\u20135. 10.1186\/1471-2105-2-1","journal-title":"BMC Bioinform"},{"key":"3632_CR36","doi-asserted-by":"publisher","first-page":"783","DOI":"10.1016\/S0300-9084(00)01161-5","volume":"82","author":"YD Cai","year":"2000","unstructured":"Cai YD, Zhou GP: Prediction of protein structural classes by neural network. Biochimie 2000, 82: 783\u2013785. 10.1016\/S0300-9084(00)01161-5","journal-title":"Biochimie"},{"key":"3632_CR37","doi-asserted-by":"publisher","first-page":"213","DOI":"10.1016\/j.bbrc.2005.06.075","volume":"334","author":"KY Feng","year":"2005","unstructured":"Feng KY, Cai YD, Chou KC: Boosting classifier for predicting protein domain structural class. Biochem Biophys Res Commun 2005, 334: 213\u2013217. 10.1016\/j.bbrc.2005.06.075","journal-title":"Biochem Biophys Res Commun"},{"key":"3632_CR38","volume-title":"BMC Bioinform","author":"YF Cao","year":"2006","unstructured":"Cao YF, Liu S, Zhang LD, Qin J, Wang J, Tang KX: Prediction of protein structural class with Rough Sets. BMC Bioinform 2006., 7:"},{"issue":"1","key":"3632_CR39","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1093\/nar\/28.1.45","volume":"28","author":"A Bairoch","year":"2000","unstructured":"Bairoch A, Apweiler R: The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Research 2000, 28(1):45\u201348. 10.1093\/nar\/28.1.45","journal-title":"Nucleic Acids Research"},{"key":"3632_CR40","doi-asserted-by":"publisher","first-page":"D226","DOI":"10.1093\/nar\/gkh039","volume":"32","author":"A Andreeva","year":"2004","unstructured":"Andreeva A, Howorth D, Brenner SE, Hubbard TJP, Chothia C, Murzin AG: SCOP database in 2004: refinements integrate structure and sequence family data. Nucl Acid Res 2004, 32: D226-D229. 10.1093\/nar\/gkh039","journal-title":"Nucl Acid Res"},{"key":"3632_CR41","first-page":"345","volume":"5","author":"MO Dayhoff","year":"1978","unstructured":"Dayhoff MO, Schwartz RM, Orcutt BC: A Model of Evolutionary Change in Proteins. National Biomedical Research Foundation, Washington DC 1978, 5: 345\u2013352.","journal-title":"National Biomedical Research Foundation, Washington DC"},{"key":"3632_CR42","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1007\/s00726-006-0263-8","volume":"30","author":"SW Zhang","year":"2006","unstructured":"Zhang SW, Pan Q, Zhang HC, Shao ZC, Shi JY: Prediction protein homooligomer types by pesudo amino acid composition: approached with an improved feature extraction and Naive Bayes feature fusion. Amino Acids 2006, 30: 461\u2013468. 10.1007\/s00726-006-0263-8","journal-title":"Amino Acids"},{"key":"3632_CR43","doi-asserted-by":"publisher","first-page":"1324","DOI":"10.1021\/ci034288y","volume":"44","author":"J Song","year":"2004","unstructured":"Song J, Tang H: Accurate classification of homodimeric vs other homooligomeric proteins using a new measure of information discrepancy. J Chem Inf Comput Sci 2004, 44: 1324\u20131327.","journal-title":"J Chem Inf Comput Sci"},{"key":"3632_CR44","doi-asserted-by":"publisher","first-page":"171","DOI":"10.2174\/1389203003381379","volume":"1","author":"KC Chou","year":"2000","unstructured":"Chou KC: Prediction of protein structural classes and subcellular locations. Curr Protein Peptide Sci 2000, 1: 171\u2013208. 10.2174\/1389203003381379","journal-title":"Curr Protein Peptide Sci"},{"key":"3632_CR45","doi-asserted-by":"publisher","first-page":"1007","DOI":"10.1016\/j.bbrc.2004.07.059","volume":"321","author":"KC Chou","year":"2004","unstructured":"Chou KC, Cai YD: Predicting protein structural class by functional domain composition. Biochem Biophys Res Commun 2004, 321: 1007\u20131009. 10.1016\/j.bbrc.2004.07.059","journal-title":"Biochem Biophys Res Commun"},{"key":"3632_CR46","first-page":"322","volume-title":"Multivariate analysis","author":"KV Mardia","year":"1979","unstructured":"Mardia KV, Kent JT, Bibby JM: Multivariate analysis. London: Academic Press; 1979:322\u2013381."},{"key":"3632_CR47","doi-asserted-by":"publisher","first-page":"116","DOI":"10.1016\/j.ab.2006.07.022","volume":"357","author":"C Chen","year":"2006","unstructured":"Chen C, Zhou XB, Tian YX, Zhou XY, Cai PX: Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network. Anal Biochem 2006, 357: 116\u2013121. 10.1016\/j.ab.2006.07.022","journal-title":"Anal Biochem"},{"key":"3632_CR48","doi-asserted-by":"publisher","first-page":"275","DOI":"10.3109\/10409239509083488","volume":"30","author":"KC Chou","year":"1995","unstructured":"Chou KC, Zhang CT: Review: Prediction of protein structural classes. Crit Rev Biochem Mol Biol 1995, 30: 275\u2013349. 10.3109\/10409239509083488","journal-title":"Crit Rev Biochem Mol Biol"},{"issue":"8","key":"3632_CR49","doi-asserted-by":"crossref","first-page":"561","DOI":"10.1093\/clinchem\/39.4.561","volume":"39","author":"MH Zweig","year":"1993","unstructured":"Zweig MH, Campbell G: Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clinical chemistry 1993, 39(8):561\u2013577.","journal-title":"Clinical chemistry"},{"issue":"1","key":"3632_CR50","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/nar\/28.1.235","volume":"28","author":"HM Berman","year":"2000","unstructured":"Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res 2000, 28(1):235\u2013242. 10.1093\/nar\/28.1.235","journal-title":"Nucleic Acids Res"},{"key":"3632_CR51","doi-asserted-by":"publisher","first-page":"56","DOI":"10.1002\/prot.340090107","volume":"9","author":"C Sander","year":"1991","unstructured":"Sander C, Schneider R: Database of homology derived protein structure and the structural meaning of sequence alignment. Protein: Structure, Function and Genetic 1991, 9: 56\u201368. 10.1002\/prot.340090107","journal-title":"Protein: Structure, Function and Genetic"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-11-175.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,26]],"date-time":"2024-03-26T18:10:09Z","timestamp":1711476609000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-11-175"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,4,8]]},"references-count":51,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2010,12]]}},"alternative-id":["3632"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-11-175","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,4,8]]},"assertion":[{"value":"4 September 2009","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 April 2010","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 April 2010","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"175"}}