{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,3]],"date-time":"2024-06-03T20:36:41Z","timestamp":1717447001466},"reference-count":53,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2009,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Proteins, especially larger ones, are often composed of individual evolutionary units, domains, which have their own function and structural fold. Predicting domains is an important intermediate step in protein analyses, including the prediction of protein structures.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We describe novel systems for the prediction of protein domain boundaries powered by Recursive Neural Networks. The systems rely on a combination of primary sequence and evolutionary information, predictions of structural features such as secondary structure, solvent accessibility and residue contact maps, and structural templates, both annotated for domains (from the SCOP dataset) and unannotated (from the PDB). We gauge the contribution of contact maps, and PDB and SCOP templates independently and for different ranges of template quality. We find that accurately predicted contact maps are informative for the prediction of domain boundaries, while the same is not true for contact maps predicted ab initio. We also find that gap information from PDB templates is informative, but, not surprisingly, less than SCOP annotations. We test both systems trained on templates of all qualities, and systems trained only on templates of marginal similarity to the query (less than 25% sequence identity). While the first batch of systems produces near perfect predictions in the presence of fair to good templates, the second batch outperforms or match ab initio predictors down to essentially any level of template quality.<\/jats:p>\n            <jats:p>We test all systems in 5-fold cross-validation on a large non-redundant set of multi-domain and single domain proteins. The final predictors are state-of-the-art, with a template-less prediction boundary recall of 50.8% (precision 38.7%) within \u00b1 20 residues and a single domain recall of 80.3% (precision 78.1%). The SCOP-based predictors achieve a boundary recall of 74% (precision 77.1%) again within \u00b1 20 residues, and classify single domain proteins as such in over 85% of cases, when we allow a mix of bad and good quality templates. If we only allow marginal templates (max 25% sequence identity to the query) the scores remain high, with boundary recall and precision of 59% and 66.3%, and 80% of all single domain proteins predicted correctly.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>The systems presented here may prove useful in large-scale annotation of protein domains in proteins of unknown structure. The methods are available as public web servers at the address: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/distill.ucd.ie\/shandy\/\" ext-link-type=\"uri\">http:\/\/distill.ucd.ie\/shandy\/<\/jats:ext-link> and we plan on running them on a multi-genomic scale and make the results public in the near future.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-10-195","type":"journal-article","created":{"date-parts":[[2009,6,26]],"date-time":"2009-06-26T18:13:50Z","timestamp":1246040030000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["Ab initio and homology based prediction of protein domains by recursive neural networks"],"prefix":"10.1186","volume":"10","author":[{"given":"Ian","family":"Walsh","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alberto JM","family":"Martin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Catherine","family":"Mooney","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Enrico","family":"Rubagotti","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alessandro","family":"Vullo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gianluca","family":"Pollastri","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2009,6,26]]},"reference":[{"key":"2925_CR1","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1007\/978-1-59745-515-2_1","volume":"369","author":"P Pagel","year":"2007","unstructured":"Pagel P, Strack N, Oesterheld M, Stumpflen V, Frishman D: Computational prediction of domain interactions. Methods Mol Biol 2007, 369: 3\u201315.","journal-title":"Methods Mol Biol"},{"key":"2925_CR2","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1186\/1471-2105-7-269","volume":"7","author":"H Lee","year":"2006","unstructured":"Lee H, Deng M, Sun F, Chen T: An integrated approach to the prediction of domain-domain interactions. BMC Bioinformatics 2006, 7: 269.","journal-title":"BMC Bioinformatics"},{"key":"2925_CR3","first-page":"1","volume-title":"Bioinformatics, sequence, structure and databanks","author":"D Jones","year":"2000","unstructured":"Jones D, Hadley C: Threading methods for protein structure prediction. In Bioinformatics, sequence, structure and databanks. Edited by: Higgins D, Taylor WM. Heidelberg: Springer Verlag; 2000:1\u201313."},{"issue":"26","key":"2925_CR4","doi-asserted-by":"publisher","first-page":"8304","DOI":"10.1021\/bi990202b","volume":"38","author":"J Potts","year":"1999","unstructured":"Potts J, Bright J, Bolton D, Pickford A, Campbell I: Solution Structure of the N-Terminal F1 Module Pair from Human Fibronectin. Biochemistry 1999, 38(26):8304\u20138312.","journal-title":"Biochemistry"},{"key":"2925_CR5","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1016\/S0076-6879(97)76047-6","volume":"276","author":"B Matthews","year":"1997","unstructured":"Matthews B: Recent transformations in structural biology. Methods in Enzymology 1997, 276: 3\u201310.","journal-title":"Methods in Enzymology"},{"key":"2925_CR6","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/nar\/28.1.235","volume":"28","author":"H Berman","year":"2000","unstructured":"Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H, Shindyalov I, Bourne P: The Protein Data Bank. NAR 2000, 28: 235\u2013242.","journal-title":"NAR"},{"issue":"1","key":"2925_CR7","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10618-005-0023-5","volume":"13","author":"J Cheng","year":"2006","unstructured":"Cheng J, Sweredoski M, Baldi P: DOMpro: Protein Domain Prediction Using Profiles Secondary Structure, Relative Solvent Accessibility and Recursive Neural Networks. Data Mining and Knowledge Discovery 2006, 13(1):1\u201310.","journal-title":"Data Mining and Knowledge Discovery"},{"key":"2925_CR8","doi-asserted-by":"publisher","first-page":"354","DOI":"10.1093\/nar\/gkm390","volume":"35","author":"J Cheng","year":"2007","unstructured":"Cheng J: An Accurate, Hybrid Protein Domain Prediction Server. Nucleic Acids Research 2007, 35: 354\u2013356.","journal-title":"Nucleic Acids Research"},{"key":"2925_CR9","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1006\/jmbi.1993.1626","volume":"234","author":"A Sali","year":"1993","unstructured":"Sali A, Blundell T: Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 1993, 234: 779\u2013815.","journal-title":"J Mol Biol"},{"issue":"3","key":"2925_CR10","doi-asserted-by":"publisher","first-page":"429","DOI":"10.1093\/bioinformatics\/btg006","volume":"19","author":"N Alexandrov","year":"2003","unstructured":"Alexandrov N, Shindyalov I: PDP: protein domain parser. Bioinformatics 2003, 19(3):429\u2013430.","journal-title":"Bioinformatics"},{"issue":"7","key":"2925_CR11","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1002\/prot.20737","volume":"61","author":"D Kim","year":"2005","unstructured":"Kim D, Chivian D, Malmstr\u00f6m L, Baker D: Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM. Proteins 2005, 61(7):193\u2013200.","journal-title":"Proteins"},{"issue":"1","key":"2925_CR12","doi-asserted-by":"publisher","first-page":"209","DOI":"10.1006\/jmbi.1997.0959","volume":"268","author":"K Simons","year":"1997","unstructured":"Simons K, Kooperberg C, Huang E, Baker D: Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. J Mol Biol 1997, 268(1):209\u201325.","journal-title":"J Mol Biol"},{"issue":"3","key":"2925_CR13","doi-asserted-by":"publisher","first-page":"203","DOI":"10.1093\/protein\/12.3.203","volume":"12","author":"W Taylor","year":"1999","unstructured":"Taylor W: Protein structural domain identification. Protein Engineering 1999, 12(3):203\u2013216.","journal-title":"Protein Engineering"},{"issue":"2","key":"2925_CR14","doi-asserted-by":"publisher","first-page":"839","DOI":"10.1006\/jmbi.2001.5387","volume":"316","author":"R Georgea","year":"2002","unstructured":"Georgea R, Heringa J: SnapDRAGON: a method to delineate protein structural domains from sequence data. Journal of Molecular Biology 2002, 316(2):839\u2013851.","journal-title":"Journal of Molecular Biology"},{"issue":"7","key":"2925_CR15","doi-asserted-by":"publisher","first-page":"613","DOI":"10.1093\/bioinformatics\/16.7.613","volume":"16","author":"S Wheelan","year":"2000","unstructured":"Wheelan S, Marchler-Bauer A, Bryant S: Domain size distributions can predict domain boundaries. Bioinformatics 2000, 16(7):613\u2013618.","journal-title":"Bioinformatics"},{"key":"2925_CR16","doi-asserted-by":"publisher","first-page":"2814","DOI":"10.1110\/ps.0209902","volume":"11","author":"R Marsden","year":"2002","unstructured":"Marsden R, McGuffin L, Jones D: Rapid protein domain assignment from amino acid sequence using predicted secondary structure. Protein Science 2002, 11: 2814\u20132824.","journal-title":"Protein Science"},{"issue":"8","key":"2925_CR17","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.1016\/S0969-2126(97)00260-8","volume":"5","author":"C Orengo","year":"1997","unstructured":"Orengo C, Michie A, Jones M, Swindells DTand, Thornton J: CATH: A Hierarchic Classification of Protein Domain Structures. Structure 1997, 5(8):1093\u2013108.","journal-title":"Structure"},{"issue":"5","key":"2925_CR18","doi-asserted-by":"publisher","first-page":"1061","DOI":"10.1016\/j.jmb.2005.05.037","volume":"350","author":"M Dumontier","year":"2005","unstructured":"Dumontier M, Yao R, Feldman H, Hogue C: Armadillo: domain boundary prediction by amino acid composition. J Mol Biol 2005, 350(5):1061\u201373.","journal-title":"J Mol Biol"},{"issue":"11","key":"2925_CR19","doi-asserted-by":"publisher","first-page":"871","DOI":"10.1093\/protein\/15.11.871","volume":"15","author":"R George","year":"2002","unstructured":"George R, Heringa J: An analysis of protein domain linkers: their classification and role in protein folding. Protein Engineering 2002, 15(11):871\u2013879.","journal-title":"Protein Engineering"},{"issue":"1","key":"2925_CR20","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1016\/S1367-5931(99)00046-0","volume":"4","author":"R Gokhale","year":"2000","unstructured":"Gokhale R, C K: Role of linkers in communication between protein modules. Current Opinion in Chemical Biology 2000, 4(1):22\u201327.","journal-title":"Current Opinion in Chemical Biology"},{"issue":"11","key":"2925_CR21","doi-asserted-by":"publisher","first-page":"5929","DOI":"10.1073\/pnas.95.11.5929","volume":"95","author":"C Robinson","year":"1998","unstructured":"Robinson C, Sauer R: Optimizing the Stability of Single-Chain Proteins by Linker Length and Composition Mutagenesis. PNAS 1998, 95(11):5929\u20135934.","journal-title":"PNAS"},{"issue":"8","key":"2925_CR22","doi-asserted-by":"publisher","first-page":"2043","DOI":"10.1093\/emboj\/16.8.2043","volume":"16","author":"H van Leeuwen","year":"1997","unstructured":"van Leeuwen H, Strating M, Rensen M, de Laat W, Vliet P: Linker length and composition influence the flexibility of Oct-1 DNA binding. EMBO J 1997, 16(8):2043\u20132053.","journal-title":"EMBO J"},{"key":"2925_CR23","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1186\/1471-2105-7-301","volume":"7","author":"S Montgomerie","year":"2006","unstructured":"Montgomerie S, Sundaraj S, Gallin W, Wishart D: Improving the Accuracy of Protein Secondary Structure Prediction Using Structural Alignment. BMC Bioinformatics 2006, 7: 301.","journal-title":"BMC Bioinformatics"},{"key":"2925_CR24","unstructured":"CASP Home page[http:\/\/predictioncenter.org\/]"},{"issue":"8","key":"2925_CR25","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1002\/prot.21675","volume":"69","author":"M Tress","year":"2007","unstructured":"Tress M, Cheng J, Baldi P, Joo K, Lee J, Seo J, Lee J, Baker D, Chivian D, Kim D, Ezkurdia I: Assessment of predictions submitted for the CASP7 domain prediction category. Proteins 2007, 69(8):137\u201351.","journal-title":"Proteins"},{"key":"2925_CR26","first-page":"536","volume":"247","author":"AG Murzin","year":"1995","unstructured":"Murzin AG, Brenner SE, Hubbard T, Chothia C: SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 1995, 247: 536\u2013540.","journal-title":"J Mol Biol"},{"key":"2925_CR27","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"S Altschul","year":"1997","unstructured":"Altschul S, Madden T, Schaffer A: Gapped Blast and psi-blast: a new generation of protein database search programs. Nucl Acids Res 1997, 25: 3389\u20133402.","journal-title":"Nucl Acids Res"},{"issue":"8","key":"2925_CR28","doi-asserted-by":"publisher","first-page":"1719","DOI":"10.1093\/bioinformatics\/bti203","volume":"21","author":"G Pollastri","year":"2005","unstructured":"Pollastri G, McLysaght A: Porter: a new, accurate server for protein secondary structure prediction. Bioinformatics 2005, 21(8):1719\u201320.","journal-title":"Bioinformatics"},{"issue":"201","key":"2925_CR29","first-page":"12","volume":"8","author":"G Pollastri","year":"2007","unstructured":"Pollastri G, Martin A, Mooney C, Vullo A: Accurate prediction of protein secondary structure and solvent accessibility by consensus combiners of sequence and structure information. BMC Bioinformatics 2007, 8(201):12.","journal-title":"BMC Bioinformatics"},{"issue":"8","key":"2925_CR30","doi-asserted-by":"publisher","first-page":"1489","DOI":"10.1089\/cmb.2006.13.1489","volume":"13","author":"C Mooney","year":"2006","unstructured":"Mooney C, Vullo A, Pollastri G: Protein Structural Motif Prediction in Multidimensional f -Space leads to improved Secondary Structure Prediction. J Comput Biol 2006, 13(8):1489\u20131502.","journal-title":"J Comput Biol"},{"key":"2925_CR31","doi-asserted-by":"publisher","first-page":"180","DOI":"10.1186\/1471-2105-7-180","volume":"7","author":"A Vullo","year":"2006","unstructured":"Vullo A, Walsh I, Pollastri G: A two-stage approach for improved prediction of residue contact maps. BMC Bioinformatics 2006, 7: 180.","journal-title":"BMC Bioinformatics"},{"key":"2925_CR32","doi-asserted-by":"publisher","first-page":"937","DOI":"10.1093\/bioinformatics\/15.11.937","volume":"15","author":"P Baldi","year":"1999","unstructured":"Baldi P, Brunak S, Frasconi P, Soda G, Pollastri G: Exploiting the past and the future in protein secondary structure prediction. Bioinformatics 1999, 15: 937\u2013946.","journal-title":"Bioinformatics"},{"key":"2925_CR33","doi-asserted-by":"publisher","first-page":"584","DOI":"10.1006\/jmbi.1993.1413","volume":"232","author":"B Rost","year":"1997","unstructured":"Rost B, Sander C: Prediction of protein secondary structure at better than 70% accuracy. J Mol Biol 1997, 232: 584\u2013599.","journal-title":"J Mol Biol"},{"issue":"3","key":"2925_CR34","doi-asserted-by":"publisher","first-page":"216","DOI":"10.1002\/prot.340200303","volume":"20","author":"B Rost","year":"1994","unstructured":"Rost B, Sander C, Schhneider R: Conservation and prediction of solvent accessibility in protein families. Proteins 1994, 20(3):216\u201326.","journal-title":"Proteins"},{"issue":"foundations","key":"2925_CR35","first-page":"318","volume":"1","author":"D Rumelhart","year":"1986","unstructured":"Rumelhart D, Hinton G, Williams R: Learning internal representations by error propagation. Parallel distributed processing: explorations in the microstructure of cognition 1986, 1(foundations):318\u201362.","journal-title":"Parallel distributed processing: explorations in the microstructure of cognition"},{"key":"2925_CR36","doi-asserted-by":"publisher","first-page":"142","DOI":"10.1002\/prot.10069","volume":"47","author":"G Pollastri","year":"2002","unstructured":"Pollastri G, Fariselli P, Casadio R, Baldi P: Prediction of Coordination Number and Relative Solvent Accessibility in Proteins. Proteins 2002, 47: 142\u2013235.","journal-title":"Proteins"},{"key":"2925_CR37","doi-asserted-by":"publisher","first-page":"228","DOI":"10.1002\/prot.10082","volume":"47","author":"G Pollastri","year":"2002","unstructured":"Pollastri G, Przybylski D, Rost B, Baldi P: Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles. Proteins 2002, 47: 228\u2013235.","journal-title":"Proteins"},{"issue":"1","key":"2925_CR38","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1016\/S1367-5931(02)00003-0","volume":"7","author":"J Liu","year":"2003","unstructured":"Liu J, Rost B: Domains, motifs and clusters in the protein universe. Curr Opin Chem Biol 2003, 7(1):5\u201311.","journal-title":"Curr Opin Chem Biol"},{"issue":"2","key":"2925_CR39","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1109\/72.279181","volume":"5","author":"Y Bengio","year":"1994","unstructured":"Bengio Y, Frasconi P, Simard P: Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Netwprks 1994, 5(2):157\u201366.","journal-title":"IEEE Trans Neural Netwprks"},{"issue":"8","key":"2925_CR40","doi-asserted-by":"publisher","first-page":"1029","DOI":"10.1016\/j.neunet.2005.07.001","volume":"18","author":"A Ceroni","year":"2005","unstructured":"Ceroni A, Frasconi P, Pollastri G: Learning Protein Secondary Structure from Sequential and Relational Data. Neural Networks 2005, 18(8):1029\u201339.","journal-title":"Neural Networks"},{"key":"2925_CR41","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1186\/1472-6807-9-5","volume":"9","author":"I Walsh","year":"2009","unstructured":"Walsh I, Ba\u00fa D, Mooney C, Vullo A, Pollastri G: Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks. BMC Structural Biology 2009, 9: 5.","journal-title":"BMC Structural Biology"},{"issue":"13","key":"2925_CR42","doi-asserted-by":"publisher","first-page":"3789","DOI":"10.1093\/nar\/gkg620","volume":"31","author":"S Mika","year":"2003","unstructured":"Mika S, Rost B: Creating representative protein sequence sets. Nucleic Acids Research 2003, 31(13):3789\u201391.","journal-title":"Nucleic Acids Research"},{"issue":"1","key":"2925_CR43","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1093\/nar\/24.1.201","volume":"24","author":"R Schneider","year":"1995","unstructured":"Schneider R, Sander C: The HSSP database of protein structure-sequence alignments. Nucleic Acids Research 1995, 24(1):201\u2013205.","journal-title":"Nucleic Acids Research"},{"issue":"2","key":"2925_CR44","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1093\/protein\/15.2.65","volume":"15","author":"D Rigden","year":"2002","unstructured":"Rigden D: Use of covariance analysis for the prediction of structural domain boundaries from multiple protein sequence alignments. Protein Engineering 2002, 15(2):65\u201377.","journal-title":"Protein Engineering"},{"key":"2925_CR45","doi-asserted-by":"publisher","first-page":"522","DOI":"10.1002\/pro.5560030317","volume":"3","author":"U Hobohm","year":"1994","unstructured":"Hobohm U, Sander C: Enlarged representative set of protein structures. Protein Sci 1994, 3: 522\u201324.","journal-title":"Protein Sci"},{"key":"2925_CR46","doi-asserted-by":"publisher","first-page":"402","DOI":"10.1186\/1471-2105-7-402","volume":"7","author":"D Ba\u00fa","year":"2006","unstructured":"Ba\u00fa D, Martin AJM, Mooney C, Vullo A, Walsh I, Pollastri G: Distill: A suite of web servers for the prediction of one-, two- and three-dimensional structural features of proteins. BMC Bioinformatics 2006, 7: 402.","journal-title":"BMC Bioinformatics"},{"issue":"12","key":"2925_CR47","doi-asserted-by":"publisher","first-page":"3522","DOI":"10.1093\/nar\/gkh684","volume":"32","author":"J Liu","year":"2004","unstructured":"Liu J, Rost B: Sequence-based prediction of protein domains. Nucleic Acids Res 2004, 32(12):3522\u20133530.","journal-title":"Nucleic Acids Res"},{"key":"2925_CR48","doi-asserted-by":"publisher","first-page":"562","DOI":"10.1016\/j.jmb.2006.05.060","volume":"361","author":"T Holland","year":"2006","unstructured":"Holland T, Veretnik S, Shindyalov I, Bourne PE: A benchmark for domain assignment from protein 3-dimensional structure and its applications. J Mol Biol 2006, 361: 562\u2013590.","journal-title":"J Mol Biol"},{"issue":"1","key":"2925_CR49","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1093\/nar\/gkg087","volume":"31","author":"A Marchler-Bauer","year":"2003","unstructured":"Marchler-Bauer A, Anderson J, DeWeese-Scott C, Fedorova N, Geer L, He S, Hurwitz D, Jackson J, Jacobs A, Lanczycki C, Liebert C, Liu C, Madej T, Marchler G, Mazumder R, Nikolskaya A, Panchenko A, Rao B, Shoemaker B, Simonyan V, Song J, Thiessen P, Vasudevan S, Wang Y, Yin YamashitaJ, Bryant S: CDD: a curated Entrez database of conserved domain alignments. Nucleic Acids Research 2003, 31(1):383\u2013387.","journal-title":"Nucleic Acids Research"},{"key":"2925_CR50","doi-asserted-by":"publisher","first-page":"300","DOI":"10.1002\/prot.21745","volume":"71","author":"L Ye","year":"2008","unstructured":"Ye L, Liu T, Wu Z, Zhou R: Sequence-based protein domain boundary prediction using BP neural network with various property profiles. Proteins 2008, 71: 300\u2013307.","journal-title":"Proteins"},{"key":"2925_CR51","doi-asserted-by":"publisher","first-page":"627","DOI":"10.1002\/prot.20442","volume":"59","author":"J Sim","year":"2005","unstructured":"Sim J, Kim SY, Lee J: Pprodo: prediction of protein domain boundaries using neural networks. Proteins 2005, 59: 627\u2013632.","journal-title":"Proteins"},{"key":"2925_CR52","doi-asserted-by":"publisher","first-page":"901","DOI":"10.1101\/SQB.1987.052.01.098","volume":"52","author":"W Gilbert","year":"1987","unstructured":"Gilbert W: The exon theory of genes. Cold Spring Harbor symposia on quantitative biology 1987, 52: 901\u20135.","journal-title":"Cold Spring Harbor symposia on quantitative biology"},{"issue":"5169","key":"2925_CR53","first-page":"202","volume":"265","author":"W Gilbert","year":"1992","unstructured":"Gilbert W: Testing the exon theory of genes: the evidence from protein structure. Science 1992, 265(5169):202\u2013207.","journal-title":"Science"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-10-195.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T21:33:20Z","timestamp":1630445600000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-10-195"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,6,26]]},"references-count":53,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2009,12]]}},"alternative-id":["2925"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-10-195","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2009,6,26]]},"assertion":[{"value":"17 October 2008","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 June 2009","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 June 2009","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"195"}}