{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:35Z","timestamp":1772138075766,"version":"3.50.1"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2006,2,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Motivation: In recent years, advances have been made in the ability of computational methods to discriminate between homologous and non-homologous proteins in the \u2018twilight zone\u2019 of sequence similarity, where the percent sequence identity is a poor indicator of homology. To make these predictions more valuable to the protein modeler, they must be accompanied by accurate alignments. Pairwise sequence alignments are inferences of orthologous relationships between sequence positions. Evolutionary distance is traditionally modeled using global amino acid substitution matrices. But real differences in the likelihood of substitutions may exist for different structural contexts within proteins, since structural context contributes to the selective pressure.<\/jats:p>\n                  <jats:p>Results: HMMSUM (HMMSTR-based substitution matrices) is a new model for structural context-based amino acid substitution probabilities consisting of a set of 281 matrices, each for a different sequence\u2013structure context. HMMSUM does not require the structure of the protein to be known. Instead, predictions of local structure are made using HMMSTR, a hidden Markov model for local structure. Alignments using the HMMSUM matrices compare favorably to alignments carried out using the BLOSUM matrices or structure-based substitution matrices SDM and HSDM when validated against remote homolog alignments from BAliBASE. HMMSUM has been implemented using local Dynamic Programming and with the Bayesian Adaptive alignment method.<\/jats:p>\n                  <jats:p>Availability: Matrices, source codes and programs are available at .<\/jats:p>\n                  <jats:p>Contact: \u00a0bystrc@rpi.edu, huangy2@rpi.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/bti828","type":"journal-article","created":{"date-parts":[[2005,12,13]],"date-time":"2005-12-13T21:28:47Z","timestamp":1134509327000},"page":"413-422","source":"Crossref","is-referenced-by-count":41,"title":["Improved pairwise alignments of proteins in the Twilight Zone using local structure predictions"],"prefix":"10.1093","volume":"22","author":[{"given":"Yao-ming","family":"Huang","sequence":"first","affiliation":[{"name":"Center for Bioinformatics, Department of Biology, Rensselaer Polytechnic Institute \u00a0 Troy, NY 12180, USA"}]},{"given":"Christopher","family":"Bystroff","sequence":"additional","affiliation":[{"name":"Center for Bioinformatics, Department of Biology, Rensselaer Polytechnic Institute \u00a0 Troy, NY 12180, USA"}]}],"member":"286","published-online":{"date-parts":[[2005,12,13]]},"reference":[{"key":"2023012408513405300_b1","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1016\/S0092-8240(86)90010-8","article-title":"Optimal sequence alignment using affine gap costs","volume":"48","author":"Altschul","year":"1986","journal-title":"Bull. Math. Biol."},{"key":"2023012408513405300_b2","first-page":"403","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Evol."},{"key":"2023012408513405300_b3","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023012408513405300_b4","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1093\/nar\/29.1.323","article-title":"BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations","volume":"29","author":"Bahr","year":"2001","journal-title":"Nucleic Acids Res."},{"key":"2023012408513405300_b5","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1006\/jmbi.1998.1943","article-title":"Prediction of local structure in proteins using a library of sequence-structure motifs","volume":"281","author":"Bystroff","year":"1998","journal-title":"J. Mol. Biol."},{"key":"2023012408513405300_b6","doi-asserted-by":"crossref","first-page":"552","DOI":"10.1002\/prot.10252","article-title":"Helix propensities of short peptides: molecular dynamics versus bioinformatics","volume":"50","author":"Bystroff","year":"2003","journal-title":"Proteins"},{"key":"2023012408513405300_b7","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1006\/jmbi.2000.3837","article-title":"HMMSTR: a hidden Markov model for local sequence-structure correlations in proteins","volume":"301","author":"Bystroff","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023012408513405300_b8","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1089\/cmb.1994.1.271","article-title":"Recent developments in linear-space alignment methods: a survey","volume":"1","author":"Chao","year":"1994","journal-title":"J. Comput. Biol."},{"key":"2023012408513405300_b9","doi-asserted-by":"crossref","first-page":"524","DOI":"10.1016\/S0076-6879(83)91049-2","article-title":"Establishing homologies in protein sequences","volume":"91","author":"Dayhoff","year":"1983","journal-title":"Methods Enzymol."},{"key":"2023012408513405300_b10","first-page":"345","article-title":"A model of evolutionary change in proteins","volume-title":"Atlas of Protein Sequence and structure","author":"Dayhoff","year":"1978"},{"key":"2023012408513405300_b11","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1126\/science.7280687","article-title":"Similar amino acid sequences: chance or common ancestry?","volume":"214","author":"Doolittle","year":"1981","journal-title":"Science"},{"key":"2023012408513405300_b12","doi-asserted-by":"crossref","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","article-title":"Amino acid substitution matrices from protein blocks","volume":"89","author":"Henikoff","year":"1992","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012408513405300_b13","doi-asserted-by":"crossref","first-page":"522","DOI":"10.1002\/pro.5560030317","article-title":"Enlarged representative set of protein structures","volume":"3","author":"Hobohm","year":"1994","journal-title":"Protein Sci."},{"key":"2023012408513405300_b14","doi-asserted-by":"crossref","first-page":"518","DOI":"10.1002\/prot.20221","article-title":"Remote homolog detection using local sequence-structure correlations","volume":"57","author":"Hou","year":"2004","journal-title":"Proteins"},{"key":"2023012408513405300_b15","first-page":"373","article-title":"A space-efficient algorithm for local similarities","volume":"6","author":"Huang","year":"1990","journal-title":"Comput. Appl. Biosci."},{"key":"2023012408513405300_b16","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1016\/0196-8858(91)90017-D","article-title":"A time-efficient, linear-space local similarity algorithm","volume":"12","author":"Huang","year":"1991","journal-title":"Adv. Appl. Math."},{"key":"2023012408513405300_b17","doi-asserted-by":"crossref","first-page":"1702","DOI":"10.1110\/ps.4820102","article-title":"In search for more accurate alignments in the twilight zone","volume":"11","author":"Jaroszewski","year":"2002","journal-title":"Protein Sci."},{"key":"2023012408513405300_b18","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1093\/nar\/12.1Part1.215","article-title":"On the statistical significance of nucleic acid similarities","volume":"12","author":"Lipman","year":"1984","journal-title":"Nucleic Acids Res."},{"key":"2023012408513405300_b19","doi-asserted-by":"crossref","first-page":"1577","DOI":"10.1002\/pro.5560040816","article-title":"The distribution of alpha-helix propensity along the polypeptide chain is not conserved in proteins from the same family","volume":"4","author":"Munoz","year":"1995","journal-title":"Protein Sci."},{"key":"2023012408513405300_b20","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1016\/S0022-2836(02)00445-X","article-title":"Protein folding kinetics beyond the phi value: using multiple amino acid substitutions to investigate the structure of the SH3 domain folding transition state","volume":"320","author":"Northey","year":"2002","journal-title":"J. Mol. Biol."},{"key":"2023012408513405300_b21","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1002\/pro.5560040613","article-title":"Comparison of methods for searching protein sequence databases","volume":"4","author":"Pearson","year":"1995","journal-title":"Protein Sci."},{"key":"2023012408513405300_b22","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1016\/S0076-6879(96)66017-0","article-title":"Effective protein sequence comparison","volume":"266","author":"Pearson","year":"1996","journal-title":"Methods Enzymol."},{"key":"2023012408513405300_b23","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1006\/jmbi.1997.1525","article-title":"Empirical statistical estimates for sequence similarity searches","volume":"276","author":"Pearson","year":"1998","journal-title":"J. Mol. Biol."},{"key":"2023012408513405300_b24","first-page":"185","article-title":"Flexible sequence similarity searching with the FASTA3 program package","volume":"132","author":"Pearson","year":"2000","journal-title":"Methods Mol. Biol."},{"key":"2023012408513405300_b25","doi-asserted-by":"crossref","first-page":"545","DOI":"10.1093\/protein\/13.8.545","article-title":"Structure-derived substitution matrices for alignment of distantly related sequences","volume":"13","author":"Prli\u0107","year":"2000","journal-title":"Protein Eng."},{"key":"2023012408513405300_b26","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1109\/5.18626","article-title":"A tutorial on hidden Markov models and selected applications in speech recognition","volume":"77","author":"Rabiner","year":"1989","journal-title":"Proc. IEEE"},{"key":"2023012408513405300_b27","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1002\/prot.10544","article-title":"Modeling three-dimensional protein structures for CASP5 using the 3D-SHOTGUN meta-predictors","volume":"53","author":"Sasson","year":"2003","journal-title":"Proteins"},{"key":"2023012408513405300_b28","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/0022-2836(81)90087-5","article-title":"Identification of common molecular subsequences","volume":"147","author":"Smith","year":"1981","journal-title":"J. Mol. Biol."},{"key":"2023012408513405300_b29","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1007\/BF01733210","article-title":"Comparative biosequence metrics","volume":"18","author":"Smith","year":"1981","journal-title":"J. Mol. Evol."},{"key":"2023012408513405300_b30","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1093\/bioinformatics\/15.1.87","article-title":"BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs","volume":"15","author":"Thompson","year":"1999","journal-title":"Bioinformatics"},{"key":"2023012408513405300_b31","first-page":"115","article-title":"A fast and sensitive multiple sequence alignment algorithm","volume":"5","author":"Vingron","year":"1989","journal-title":"Comput. Appl. Biosci."},{"key":"2023012408513405300_b32","doi-asserted-by":"crossref","first-page":"723","DOI":"10.1016\/0022-2836(87)90478-5","article-title":"A new algorithm for best subsequence alignments with application to tRNA-rRNA comparisons","volume":"197","author":"Waterman","year":"1987","journal-title":"J. Mol. Biol."},{"key":"2023012408513405300_b33","doi-asserted-by":"crossref","first-page":"80","DOI":"10.2307\/3001968","article-title":"\u2018Individual Comparisons by Ranking Methods\u2019","volume":"1","author":"Wilcoxon","year":"1945","journal-title":"Biometrics"},{"key":"2023012408513405300_b34","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1142\/S0219720003000186","article-title":"RAPTOR: optimal protein threading by linear programming","volume":"1","author":"Xu","year":"2003","journal-title":"J. Bioinform. Comput. Biol."},{"key":"2023012408513405300_b35","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1006\/jmbi.1998.2072","article-title":"Prediction and structural characterization of an independently folding substructure in the src SH3 domain","volume":"283","author":"Yi","year":"1998","journal-title":"J. Mol. Biol."},{"key":"2023012408513405300_b36","doi-asserted-by":"crossref","first-page":"1257","DOI":"10.1006\/jmbi.2001.5293","article-title":"Within the twilight zone: a sensitive profile-profile comparison tool based on information theory","volume":"315","author":"Yona","year":"2002","journal-title":"J. Mol. Biol."},{"key":"2023012408513405300_b37","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1093\/bioinformatics\/14.1.25","article-title":"Bayesian adaptive sequence alignment algorithms","volume":"14","author":"Zhu","year":"1998","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/4\/413\/48840274\/bioinformatics_22_4_413.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/4\/413\/48840274\/bioinformatics_22_4_413.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,24]],"date-time":"2023-01-24T04:23:49Z","timestamp":1674534229000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/22\/4\/413\/184164"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,12,13]]},"references-count":37,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2006,2,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bti828","relation":{"has-review":[{"id-type":"doi","id":"10.3410\/f.1031030.364477","asserted-by":"object"}]},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2006,2,15]]},"published":{"date-parts":[[2005,12,13]]}}}