{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,4]],"date-time":"2025-07-04T12:45:59Z","timestamp":1751633159765},"reference-count":49,"publisher":"Oxford University Press (OUP)","issue":"7","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2005,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Proteins of the same class often share a secondary structure packing arrangement but differ in how the secondary structure units are ordered in the sequence. We find that proteins that share a common core also share local sequence\u2013structure similarities, and these can be exploited to align structures with different topologies. In this study, segments from a library of local sequence\u2013structure alignments were assembled hierarchically, enforcing the compactness and conserved inter-residue contacts but not sequential ordering. Previous structure-based alignment methods often ignore sequence similarity, local structural equivalence and compactness.<\/jats:p><jats:p>Results: The new program, SCALI (Structural Core ALIgnment), can efficiently find conserved packing arrangements, even if they are non-sequentially ordered in space. SCALI alignments conserve remote sequence similarity and contain fewer alignment errors. Clustering of our pairwise non-sequential alignments shows that recurrent packing arrangements exist in topologically different structures. For example, the three-layer sandwich domain architecture may be divided into four structural subclasses based on internal packing arrangements. These subclasses represent an intermediate level of structure classification, more general than topology, but more specific than architecture as defined in CATH. A strategy is presented for developing a set of predictive hidden Markov models based on multiple SCALI alignments.<\/jats:p><jats:p>Availability: An online topology-independent SCALI structure comparison server is available at http:\/\/www.bioinfo.rpi.edu\/~bystrc\/scali.html<\/jats:p><jats:p>Contact: \u00a0bystrc@rpi.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/bti128","type":"journal-article","created":{"date-parts":[[2004,11,6]],"date-time":"2004-11-06T01:14:14Z","timestamp":1099703654000},"page":"1010-1019","source":"Crossref","is-referenced-by-count":39,"title":["Non-sequential structure-based alignments reveal topology-independent core packing arrangements in proteins"],"prefix":"10.1093","volume":"21","author":[{"given":"Xin","family":"Yuan","sequence":"first","affiliation":[{"name":"Department of Biology, Rensselaer Polytechnic Institute Troy, NY 12180, USA"}]},{"given":"Christopher","family":"Bystroff","sequence":"additional","affiliation":[{"name":"Department of Biology, Rensselaer Polytechnic Institute Troy, NY 12180, USA"}]}],"member":"286","published-online":{"date-parts":[[2004,11,5]]},"reference":[{"key":"2023013107271709000_B1","doi-asserted-by":"crossref","unstructured":"Abagyan, R.A. and Maiorov, V.N. 1989An automatic search for similar spatial arrangements of alpha-helices and beta-strands in globular proteins. J. Biomol. Struct. Dyn.61045\u20131060","DOI":"10.1080\/07391102.1989.10506535"},{"key":"2023013107271709000_B2","unstructured":"Alexandrov, N.N. 1996SARFing the PDB. Protein Eng.9727\u2013732"},{"key":"2023013107271709000_B3","doi-asserted-by":"crossref","unstructured":"Alexandrov, N.N. and Fischer, D. 1996Analysis of topological and nontopological structural similarities in the PDB: new examples with old structures. Proteins25354\u2013365","DOI":"10.1002\/(SICI)1097-0134(199607)25:3<354::AID-PROT7>3.0.CO;2-F"},{"key":"2023013107271709000_B4","doi-asserted-by":"crossref","unstructured":"Aloy, P., Stark, A., Hadley, C., Russell, R.B. 2003Predictions without templates: new folds, secondary structure, and contacts in CASP5. Proteins53(Suppl. 6),436\u2013456","DOI":"10.1002\/prot.10546"},{"key":"2023013107271709000_B5","doi-asserted-by":"crossref","unstructured":"Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J. 1997Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res.253389\u20133402","DOI":"10.1093\/nar\/25.17.3389"},{"key":"2023013107271709000_B6","doi-asserted-by":"crossref","unstructured":"Bennett, M.J., Choe, S., Eisenberg, D. 1994Domain swapping: entangling alliances between proteins. Proc. Natl Acad. Sci. USA913127\u20133131","DOI":"10.1073\/pnas.91.8.3127"},{"key":"2023013107271709000_B7","unstructured":"Bernstein, H.J. 2000Recent changes to RasMol, recombining the variants. Trends Biochem. Sci.25453\u2013455"},{"key":"2023013107271709000_B8","unstructured":"Bystroff, C. and Baker, D. 1998Prediction of local structure in proteins using a library of sequence\u2013structure motifs. J. Mol. Biol.281565\u2013577"},{"key":"2023013107271709000_B9","unstructured":"Bystroff, C., Thorsson, V., Baker, D. 2000HMMSTR: a hidden Markov model for local sequence\u2013structure correlations in proteins. J. Mol. Biol.301173\u2013190"},{"key":"2023013107271709000_B10","unstructured":"Eddy, S.R. 1998Profile hidden Markov models. Bioinformatics14755\u2013763"},{"key":"2023013107271709000_B11","unstructured":"Efimov, A.V. 1995Structural similarity between two-layer alpha\/beta and beta-proteins. J. Mol. Biol.245402\u2013415"},{"key":"2023013107271709000_B12","doi-asserted-by":"crossref","unstructured":"Flores, T.P., Orengo, C.A., Moss, D.S., Thornton, J.M. 1993Comparison of conformational characteristics in structurally similar protein pairs. Protein Sci.21811\u20131826","DOI":"10.1002\/pro.5560021104"},{"key":"2023013107271709000_B13","unstructured":"Gibrat, J.F., Madej, T., Bryant, S.H. 1996Surprising similarities in structure comparison. Curr. Opin. Struct. Biol.6377\u2013385"},{"key":"2023013107271709000_B14","doi-asserted-by":"crossref","unstructured":"Gong, W., O\u2019Gara, M., Blumenthal, R.M., Cheng, X. 1997Structure of pvu II DNA- (cytosine N4) methyltransferase, an example of domain permutation and protein fold assignment. Nucleic Acids Res.252702\u20132715","DOI":"10.1093\/nar\/25.14.2702"},{"key":"2023013107271709000_B15","doi-asserted-by":"crossref","unstructured":"Gough, J. and Chothia, C. 2002SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments. Nucleic Acids Res.30268\u2013272","DOI":"10.1093\/nar\/30.1.268"},{"key":"2023013107271709000_B16","unstructured":"Holm, L. and Sander, C. 1993Protein structure comparison by alignment of distance matrices. J. Mol. Biol.233123\u2013138"},{"key":"2023013107271709000_B17","unstructured":"Holm, L. and Sander, C. 1996Mapping the protein universe. Science273595\u2013603"},{"key":"2023013107271709000_B18","unstructured":"Honig, B. 1999Protein folding: from the levinthal paradox to structure prediction. J. Mol. Biol.293283\u2013293"},{"key":"2023013107271709000_B19","unstructured":"Hou, Y., Hsu, W., Lee, M.L., Bystroff, C. 2003Efficient remote homology detection using local structure. Bioinformatics192294\u20132301"},{"key":"2023013107271709000_B20","unstructured":"Iwakura, M., Nakamura, T., Yamane, C., Maki, K. 2000Systematic circular permutation of an entire protein reveals essential folding elements. Nat. Struct. Biol.7580\u2013585"},{"key":"2023013107271709000_B21","unstructured":"Janowski, R., Kozak, M., Jankowska, E., Grzonka, Z., Grubb, A., Abrahamson, M., Jaskolski, M. 2001Human cystatin C, an amyloidogenic protein, dimerizes through three-dimensional domain swapping. Nat. Struct. Biol.8316\u2013320"},{"key":"2023013107271709000_B22","doi-asserted-by":"crossref","unstructured":"Jeltsch, A. 1999Circular permutations in the molecular evolution of DNA methyltransferases. J. Mol. Evol.49161\u2013164","DOI":"10.1007\/PL00006529"},{"key":"2023013107271709000_B23","unstructured":"Jung, J. and Lee, B. 2001Circularly permuted proteins in the protein structure database. Protein Sci.101881\u20131886"},{"key":"2023013107271709000_B24","doi-asserted-by":"crossref","unstructured":"Karplus, K., Barrett, C., Hughey, R. 1998Hidden Markov models for detecting remote protein homologies. Bioinformatics14846\u2013856","DOI":"10.1093\/bioinformatics\/14.10.846"},{"key":"2023013107271709000_B25","doi-asserted-by":"crossref","unstructured":"Khil, P.P., Obmolova, E., Teplyakov, A., Howard, A.J., Gilliland, G.L., Camerini-Otero, R.D. 2004Crystal structure of the Escherichia coli YjiA protein suggests a GTP-dependent regulatory function. Proteins54371\u2013374","DOI":"10.2210\/pdb1nij\/pdb"},{"key":"2023013107271709000_B26","unstructured":"Koehl, P. 2001Protein structure similarities. Curr. Opin. Struct. Biol.11348\u2013353"},{"key":"2023013107271709000_B27","doi-asserted-by":"crossref","unstructured":"Milik, M., Szalma, S., Olszewski, K.A. 2003Common structural cliques: a tool for protein structure and function analysis. Protein Eng.16543\u2013\u2013552","DOI":"10.1093\/protein\/gzg080"},{"key":"2023013107271709000_B28","doi-asserted-by":"crossref","unstructured":"Moult, J., Fidelis, K., Zemla, A., Hubbard, T. 2003Critical assessment of methods of protein structure prediction (CASP)-round V. Proteins53(Suppl. 6),334\u2013339","DOI":"10.1002\/prot.10556"},{"key":"2023013107271709000_B29","unstructured":"Murzin, A.G., Brenner, S.E., Hubbard, T., Chothia, C. 1995SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol.247536\u2013540"},{"key":"2023013107271709000_B30","unstructured":"Orengo, C.A. 1994Classification of protein folds. Curr. Opin. Struct. Biol.4429\u2013440"},{"key":"2023013107271709000_B31","doi-asserted-by":"crossref","unstructured":"Orengo, C.A., Michie, A.D., Jones, S., Jones, D.T., Swindells, M.B., Thornton, J.M. 1997CATH\u2014a hierarchic classification of protein domain structures. Structure51093\u20131108","DOI":"10.1016\/S0969-2126(97)00260-8"},{"key":"2023013107271709000_B32","doi-asserted-by":"crossref","unstructured":"Ortiz, A.R., Strauss, C.E., Olmea, O. 2002MAMMOTH (matching molecular models obtained from theory): an automated method for model comparison. Protein Sci.112606\u20132621","DOI":"10.1110\/ps.0215902"},{"key":"2023013107271709000_B33","unstructured":"Pearl, F.M., Lee, D., Bray, J.E., Sillitoe, I., Todd, A.E., Harrison, A.P., Thornton, J.M., Orengo, C.A. 2000Assigning genomic sequences to CATH. Nucleic Acids Res.28277\u2013282"},{"key":"2023013107271709000_B34","unstructured":"Pearl, F.M., Bennett, C.F., Bray, J.E., Harrison, A.P., Martin, N., Shepherd, A., Sillitoe, I., Thornton, J., Orengo, C.A. 2003The CATH database: an extended protein family resource for structural and functional genomics. Nucleic Acids Res.31452\u2013455"},{"key":"2023013107271709000_B35","doi-asserted-by":"crossref","unstructured":"Rabiner, L.R. 1989A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE77257\u2013286","DOI":"10.1016\/B978-0-08-051584-7.50027-9"},{"key":"2023013107271709000_B36","doi-asserted-by":"crossref","unstructured":"Rost, B. 1997Protein structures sustain evolutionary drift. Fold Des.2S19\u2013S24","DOI":"10.1016\/S1359-0278(97)00059-X"},{"key":"2023013107271709000_B37","unstructured":"Sayle, R.A. and Milner-White, E.J. 1995RASMOL: biomolecular graphics for all. Trends Biochem. Sci.20374"},{"key":"2023013107271709000_B38","doi-asserted-by":"crossref","unstructured":"Schiering, N., Casale, E., Caccia, P., Giordano, P., Battistini, C. 2000Dimer formation through domain swapping in the crystal structure of the Grb2-SH2-Ac-pYVNV complex. Biochemistry3913376\u201313382","DOI":"10.2210\/pdb1fyr\/pdb"},{"key":"2023013107271709000_B39","doi-asserted-by":"crossref","unstructured":"Shao, Y. and Bystroff, C. 2003Predicting interresidue contacts using templates and pathways. Proteins53(Suppl. 6),497\u2013502","DOI":"10.1002\/prot.10539"},{"key":"2023013107271709000_B40","doi-asserted-by":"crossref","unstructured":"Shindyalov, I.N. and Bourne, P.E. 1998Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng.11739\u2013747","DOI":"10.1093\/protein\/11.9.739"},{"key":"2023013107271709000_B41","doi-asserted-by":"crossref","unstructured":"Smith, V.F. and Matthews, C.R. 2001Testing the role of chain connectivity on the stability and structure of dihydrofolate reductase from E. coli: fragment complementation and circular permutation reveal stable, alternatively folded forms. Protein Sci.10116\u2013128","DOI":"10.1110\/ps.26601"},{"key":"2023013107271709000_B42","unstructured":"(Eds.). Introduction to Biostatistics1973, San Francisco, CA W.H. Freeman and company, pp. 220\u2013222"},{"key":"2023013107271709000_B43","unstructured":"Szustakowski, J.D. and Weng, Z. 2000Protein structure alignment using a genetic algorithm. Proteins38428\u2013440"},{"key":"2023013107271709000_B44","doi-asserted-by":"crossref","unstructured":"Szustakowski, J.D. and Weng, Z. 2002Protein structure alignment using evolutionary computing. In Fogel, G. and Corne, D. (Eds.). Evolutionary Computation in Bioinformatics Morgan Kaufman","DOI":"10.1016\/B978-155860797-2\/50006-8"},{"key":"2023013107271709000_B45","doi-asserted-by":"crossref","unstructured":"Taylor, W.R. and Orengo, C.A. 1989Protein structure alignment. J. Mol. Biol.208, pp. 1\u201322","DOI":"10.1016\/0022-2836(89)90084-3"},{"key":"2023013107271709000_B46","doi-asserted-by":"crossref","unstructured":"Viguera, A.R., Blanco, F.J., Serrano, L. 1995The order of secondary structure elements does not determine the structure of a protein but does affect its folding kinetics. J. Mol. Biol.247670\u2013681","DOI":"10.1016\/S0022-2836(05)80146-9"},{"key":"2023013107271709000_B47","doi-asserted-by":"crossref","unstructured":"Westhead, D.R., Slidel, T.W., Flores, T.P., Thornton, J.M. 1999Protein structural topology: automated analysis and diagrammatic representation. Protein Sci.8897\u2013904","DOI":"10.1110\/ps.8.4.897"},{"key":"2023013107271709000_B48","doi-asserted-by":"crossref","unstructured":"Yang, A.S. and Honig, B. 1999Sequence to structure alignment in comparative modeling using PrISM. Proteins Suppl. 3,66\u201372","DOI":"10.1002\/(SICI)1097-0134(1999)37:3+<66::AID-PROT10>3.0.CO;2-K"},{"key":"2023013107271709000_B49","unstructured":"Yang, A.S. and Honig, B. 2000An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance. J. Mol. Biol.301665\u2013678"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/21\/7\/1010\/48967185\/bioinformatics_21_7_1010.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/21\/7\/1010\/48967185\/bioinformatics_21_7_1010.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,1,15]],"date-time":"2024-01-15T00:53:40Z","timestamp":1705280020000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/21\/7\/1010\/268994"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2004,11,5]]},"references-count":49,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2005,4,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bti128","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2005,4,1]]},"published":{"date-parts":[[2004,11,5]]}}}