{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,2]],"date-time":"2025-12-02T15:13:08Z","timestamp":1764688388613},"reference-count":22,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2004,7,22]],"date-time":"2004-07-22T00:00:00Z","timestamp":1090454400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"},{"start":{"date-parts":[[2004,7,22]],"date-time":"2004-07-22T00:00:00Z","timestamp":1090454400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                        <jats:title>Background<\/jats:title>\n                        <jats:p>Defining blocks forming the global protein structure on the basis of local structural regularity is a very fruitful idea, extensively used in description, and prediction of structure from only sequence information. Over many years the secondary structure elements were used as available building blocks with great success. Specially prepared sets of possible structural motifs can be used to describe similarity between very distant, non-homologous proteins. The reason for utilizing the structural information in the description of proteins is straightforward. Structural comparison is able to detect approximately twice as many distant relationships as sequence comparison at the same error rate.<\/jats:p>\n                     <\/jats:sec><jats:sec>\n                        <jats:title>Results<\/jats:title>\n                        <jats:p>Here we provide a new fragment library for Local Structure Segment (LSS) prediction called FRAGlib which is integrated with a previously described segment alignment algorithm SEA. A joined FRAGlib\/SEA server provides easy access to both algorithms, allowing a one stop alignment service using a novel approach to protein sequence alignment based on a network matching approach. The FRAGlib used as secondary structure prediction achieves only 73% accuracy in Q3 measure, but when combined with the SEA alignment, it achieves a significant improvement in pairwise sequence alignment quality, as compared to previous SEA implementation and other public alignment algorithms. The FRAGlib algorithm takes ~2 min. to search over FRAGlib database for a typical query protein with 500 residues. The SEA service align two typical proteins within circa ~5 min. All supplementary materials (detailed results of all the benchmarks, the list of test proteins and the whole fragments library) are available for download on-line at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"http:\/\/ffas.ljcrf.edu\/darman\/results\/\">http:\/\/ffas.ljcrf.edu\/darman\/results\/<\/jats:ext-link>.<\/jats:p>\n                     <\/jats:sec><jats:sec>\n                        <jats:title>Conclusions<\/jats:title>\n                        <jats:p>The joined FRAGlib\/SEA server will be a valuable tool both for molecular biologists working on protein sequence analysis and for bioinformaticians developing computational methods of structure prediction and alignment of proteins.<\/jats:p>\n                     <\/jats:sec>","DOI":"10.1186\/1471-2105-5-98","type":"journal-article","created":{"date-parts":[[2004,7,24]],"date-time":"2004-07-24T06:22:36Z","timestamp":1090650156000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["Integrated web service for improving alignment quality based on segments comparison"],"prefix":"10.1186","volume":"5","author":[{"given":"Dariusz","family":"Plewczynski","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Leszek","family":"Rychlewski","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuzhen","family":"Ye","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lukasz","family":"Jaroszewski","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Adam","family":"Godzik","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2004,7,22]]},"reference":[{"key":"214_CR1","doi-asserted-by":"publisher","first-page":"306","DOI":"10.1093\/bioinformatics\/18.2.306","volume":"18","author":"M Cline","year":"2002","unstructured":"Cline M, Hughey R, Karplus K: Predicting reliable regions in protein sequence alignments.\n                           Bioinformatics 2002, 18: 306\u2013314. 10.1093\/bioinformatics\/18.2.306","journal-title":"Bioinformatics"},{"key":"214_CR2","doi-asserted-by":"publisher","first-page":"947","DOI":"10.1002\/pro.5560050516","volume":"5","author":"D Fischer","year":"1996","unstructured":"Fischer D, Eisenberg D: Protein fold recognition using sequence-derived predictions.\n                           Protein Science 1996, 5: 947\u2013955.","journal-title":"Protein Science"},{"key":"214_CR3","doi-asserted-by":"publisher","first-page":"5913","DOI":"10.1073\/pnas.95.11.5913","volume":"95","author":"M Levitt","year":"1998","unstructured":"Levitt M, Gerstein M: A unified statistical framework for sequence comparison and structure comparison.\n                           Proc Natl Acad Sci 1998, 95: 5913\u20135920. 10.1073\/pnas.95.11.5913","journal-title":"Proc Natl Acad Sci"},{"key":"214_CR4","doi-asserted-by":"publisher","first-page":"1117","DOI":"10.1006\/jmbi.1993.1464","volume":"232","author":"TM Yi","year":"1993","unstructured":"Yi TM, Lander ES: Protein secondary structure prediction using nearest-neighbor methods.\n                           J Mol Biol 1993, 232: 1117\u20131129. 10.1006\/jmbi.1993.1464","journal-title":"J Mol Biol"},{"key":"214_CR5","doi-asserted-by":"publisher","first-page":"1143","DOI":"10.1093\/protein\/10.10.1143","volume":"10","author":"L Rychlewski","year":"1997","unstructured":"Rychlewski L, Godzik A: Secondary structure prediction using segment similarity.\n                           Protein Engineering 1997, 10: 1143\u20131153. 10.1093\/protein\/10.10.1143","journal-title":"Protein Engineering"},{"key":"214_CR6","doi-asserted-by":"publisher","first-page":"750","DOI":"10.1038\/11525","volume":"6","author":"H Xu","year":"1999","unstructured":"Xu H, Aurora R, Rose GD, White RH: Identifying two ancient enzymes in archaea using predicted secondary structure alignment.\n                           Nature Structural Biology 1999, 6: 750\u2013754. 10.1038\/11525","journal-title":"Nature Structural Biology"},{"key":"214_CR7","doi-asserted-by":"publisher","first-page":"565","DOI":"10.1006\/jmbi.1998.1943","volume":"281","author":"C Bystroff","year":"1998","unstructured":"Bystroff C, Baker D: Prediction of local structure in proteins using a library of sequence-structure motifs.\n                           J Mol Biol 1998, 281: 565\u2013577. 10.1006\/jmbi.1998.1943","journal-title":"J Mol Biol"},{"key":"214_CR8","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1002\/(SICI)1097-0134(1999)37:3+<171::AID-PROT21>3.0.CO;2-Z","volume":"37","author":"KT Simons","year":"1999","unstructured":"Simons KT, Bonneau R, Ruczinski II, Baker D: Ab initio protein structure prediction of CASP III targets using ROSETTA.\n                           Proteins 1999, 37: 171\u2013176. 10.1002\/(SICI)1097-0134(1999)37:3+<171::AID-PROT21>3.3.CO;2-Q","journal-title":"Proteins"},{"key":"214_CR9","doi-asserted-by":"publisher","first-page":"742","DOI":"10.1093\/bioinformatics\/btg073","volume":"19","author":"Y Ye","year":"2003","unstructured":"Ye Y, Jaroszewski L, Li W, Godzik A: A segment alignment approach to protein comparison.\n                           Bioinformatics 2003, 19: 742\u2013749. 10.1093\/bioinformatics\/btg073","journal-title":"Bioinformatics"},{"key":"214_CR10","volume-title":"unpublished personal communication","author":"A Godzik","year":"2003","unstructured":"Godzik A: unpublished personal communication. 2003."},{"key":"214_CR11","doi-asserted-by":"publisher","first-page":"260","DOI":"10.1093\/nar\/30.1.260","volume":"30","author":"JM Chandonia","year":"2002","unstructured":"Chandonia JM, Walker NS, Lo Conte L, Koehl P, Levitt M, Brenner SE: ASTRAL compendium enhancements.\n                           Nucleic Acids Research 2002, 30: 260\u2013263. 10.1093\/nar\/30.1.260","journal-title":"Nucleic Acids Research"},{"key":"214_CR12","doi-asserted-by":"publisher","first-page":"254","DOI":"10.1093\/nar\/28.1.254","volume":"28","author":"SE Brenner","year":"2000","unstructured":"Brenner SE, Koehl P, Levitt M: The ASTRAL compendium for sequence and structure analysis.\n                           Nucleic Acids Research 2000, 28: 254\u2013256. 10.1093\/nar\/28.1.254","journal-title":"Nucleic Acids Research"},{"key":"214_CR13","unstructured":"I-sites\/HMMSTR backbone angle regions[http:\/\/www.bioinfo.rpi.edu\/~bystrc\/hmmstr\/rama.html]"},{"key":"214_CR14","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","volume":"215","author":"SF Altschul","year":"1990","unstructured":"Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool.\n                           J Mol Biol 1990, 215: 403\u2013410. 10.1006\/jmbi.1990.9999","journal-title":"J Mol Biol"},{"key":"214_CR15","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"SF Altschul","year":"1997","unstructured":"Altschul SF, Madden TL, Sch\u00e4ffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.\n                           Nucleic Acids Research 1997, 25: 3389\u20133402. 10.1093\/nar\/25.17.3389","journal-title":"Nucleic Acids Research"},{"key":"214_CR16","doi-asserted-by":"publisher","first-page":"232","DOI":"10.1110\/ps.9.2.232","volume":"9","author":"L Rychlewski","year":"2000","unstructured":"Rychlewski L, Jaroszewski L, Li W, Godzik A: Comparison of sequence profiles. Strategies for structural predictions using sequence information.\n                           Protein Science 2000, 9: 232\u2013241.","journal-title":"Protein Science"},{"key":"214_CR17","doi-asserted-by":"publisher","first-page":"330","DOI":"10.1002\/prot.10043","volume":"46","author":"A Elofsson","year":"2002","unstructured":"Elofsson A: A study on protein sequence alignment quality.\n                           Proteins 2002, 46: 330\u2013339. 10.1002\/prot.10043","journal-title":"Proteins"},{"key":"214_CR18","unstructured":"SEgment Alignment (SEA) server (Protein pairwise alignment based on network matching algorithm)[http:\/\/ffas.ljcrf.edu\/Servers\/sea.html]"},{"key":"214_CR19","doi-asserted-by":"publisher","first-page":"1487","DOI":"10.1110\/ps.9.8.1487","volume":"9","author":"L Jaroszewski","year":"2001","unstructured":"Jaroszewski L, Li W, Godzik A: Improving the quality of twilight-zone alignments.\n                           Protein Science 2001, 9: 1487\u20131496.","journal-title":"Protein Science"},{"key":"214_CR20","doi-asserted-by":"publisher","first-page":"536","DOI":"10.1006\/jmbi.1995.0159","volume":"247","author":"AG Murzin","year":"1995","unstructured":"Murzin AG, Brenner SE, Hubbard T, Chothia C: SCOP: a structural classification of proteins database for the investigation of sequences and structures.\n                           J Mol Biol 1995, 247: 536\u2013540. 10.1006\/jmbi.1995.0159","journal-title":"J Mol Biol"},{"key":"214_CR21","doi-asserted-by":"publisher","first-page":"739","DOI":"10.1093\/protein\/11.9.739","volume":"11","author":"IN Shindyalov","year":"1998","unstructured":"Shindyalov IN, Bourne PE: Protein structure alignment by incremental combinatorial extension (CE) of the optimal path.\n                           Protein Engineering 1998, 11: 739\u2013747. 10.1093\/protein\/11.9.739","journal-title":"Protein Engineering"},{"key":"214_CR22","unstructured":"Fragments Library Tool using profile-profile alignments[http:\/\/ffas.ljcrf.edu\/Servers\/frag.html]"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-5-98.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/1471-2105-5-98\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-5-98.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,7]],"date-time":"2024-10-07T12:15:17Z","timestamp":1728303317000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-5-98"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2004,7,22]]},"references-count":22,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2004,12]]}},"alternative-id":["214"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-5-98","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2004,7,22]]},"assertion":[{"value":"30 March 2004","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 July 2004","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 July 2004","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"98"}}