{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T17:13:12Z","timestamp":1773681192027,"version":"3.50.1"},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"20","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,10,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Fold recognition is an important step in protein structure and function prediction. Traditional sequence comparison methods fail to identify reliable homologies with low sequence identity, while the taxonomic methods are effective alternatives, but their prediction accuracies are around 70%, which are still relatively low for practical usage.<\/jats:p>\n               <jats:p>Results: In this study, a simple and powerful method is presented for taxonomic fold recognition, which combines support vector machine (SVM) with autocross-covariance (ACC) transformation. The evolutionary information represented in the form of position-specific score matrices is converted into a series of fixed-length vectors by ACC transformation and these vectors are then input to a SVM classifier for fold recognition. The sequence-order effect can be effectively captured by this scheme. Experiments are performed on the widely used D-B dataset and the corresponding extended dataset, respectively. The proposed method, called ACCFold, gets an overall accuracy of 70.1% on the D-B dataset, which is higher than major existing taxonomic methods by 2\u201314%. Furthermore, the method achieves an overall accuracy of 87.6% on the extended dataset, which surpasses major existing taxonomic methods by 9\u201317%. Additionally, our method obtains an overall accuracy of 80.9% for 86-folds and 77.2% for 199-folds. These results demonstrate that the ACCFold method provides the state-of-the-art performance for taxonomic fold recognition.<\/jats:p>\n               <jats:p>Availability: The source code for ACC transformation is freely available at http:\/\/www.iipl.fudan.edu.cn\/demo\/accpkg.html.<\/jats:p>\n               <jats:p>Contact: \u00a0sgzhou@fudan.edu.cn<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp500","type":"journal-article","created":{"date-parts":[[2009,8,26]],"date-time":"2009-08-26T03:34:49Z","timestamp":1251257689000},"page":"2655-2662","source":"Crossref","is-referenced-by-count":176,"title":["A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation"],"prefix":"10.1093","volume":"25","author":[{"given":"Qiwen","family":"Dong","sequence":"first","affiliation":[{"name":"1 Shanghai Key Lab of Intelligent Information Processing, 2 School of Computer Science, Fudan University and 3 Department of Computer Science and Technology, Tongji University, Shanghai, China"},{"name":"1 Shanghai Key Lab of Intelligent Information Processing, 2 School of Computer Science, Fudan University and 3 Department of Computer Science and Technology, Tongji University, Shanghai, China"}]},{"given":"Shuigeng","family":"Zhou","sequence":"additional","affiliation":[{"name":"1 Shanghai Key Lab of Intelligent Information Processing, 2 School of Computer Science, Fudan University and 3 Department of Computer Science and Technology, Tongji University, Shanghai, China"},{"name":"1 Shanghai Key Lab of Intelligent Information Processing, 2 School of Computer Science, Fudan University and 3 Department of Computer Science and Technology, Tongji University, Shanghai, China"}]},{"given":"Jihong","family":"Guan","sequence":"additional","affiliation":[{"name":"1 Shanghai Key Lab of Intelligent Information Processing, 2 School of Computer Science, Fudan University and 3 Department of Computer Science and Technology, Tongji University, Shanghai, China"}]}],"member":"286","published-online":{"date-parts":[[2009,8,25]]},"reference":[{"key":"2023013112125993800_B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped blast and PSI-blast: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023013112125993800_B2","doi-asserted-by":"crossref","first-page":"D226","DOI":"10.1093\/nar\/gkh039","article-title":"SCOP database in 2004: refinements integrate structure and sequence family data","volume":"32","author":"Andreeva","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023013112125993800_B3","doi-asserted-by":"crossref","first-page":"785","DOI":"10.1093\/protein\/gzg106","article-title":"Manifold: protein fold recognition based on secondary structure, sequence similarity and enzyme classification","volume":"16","author":"Bindewald","year":"2003","journal-title":"Protein Eng."},{"key":"2023013112125993800_B4","author":"Chang","year":"2009","journal-title":"LIBSVM: a library for support vector machines."},{"key":"2023013112125993800_B5","doi-asserted-by":"crossref","first-page":"2843","DOI":"10.1093\/bioinformatics\/btm475","article-title":"PFRES: protein fold classification by using evolutionary information and predicted secondary structure","volume":"23","author":"Chen","year":"2007","journal-title":"Bioinformatics"},{"key":"2023013112125993800_B6","doi-asserted-by":"crossref","first-page":"1456","DOI":"10.1093\/bioinformatics\/btl102","article-title":"A machine learning information retrieval approach to protein fold recognition","volume":"22","author":"Cheng","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013112125993800_B7","doi-asserted-by":"crossref","first-page":"1264","DOI":"10.1093\/bioinformatics\/btn112","article-title":"Probabilistic multi-class multi-kernel learning: on protein fold recognition and remote homology detection","volume":"24","author":"Damoulas","year":"2008","journal-title":"Bioinformatics"},{"key":"2023013112125993800_B8","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1002\/prot.22324","article-title":"Enhanced protein fold recognition using a structural alphabet","volume":"76","author":"Deschavanne","year":"2009","journal-title":"Proteins Struct. Funct. Bioinform."},{"key":"2023013112125993800_B9","doi-asserted-by":"crossref","first-page":"349","DOI":"10.1093\/bioinformatics\/17.4.349","article-title":"Multi-class protein fold recognition using support vector machines and neural networks","volume":"17","author":"Ding","year":"2001","journal-title":"Bioinformatics"},{"key":"2023013112125993800_B10","doi-asserted-by":"crossref","first-page":"903","DOI":"10.1006\/jmbi.2001.5080","article-title":"Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure","volume":"313","author":"Gough","year":"2001","journal-title":"J. Mol. Biol."},{"key":"2023013112125993800_B11","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1093\/protein\/gzn045","article-title":"A novel hierarchical ensemble classifier for protein fold recognition","volume":"21","author":"Guo","year":"2008","journal-title":"Protein Eng. Des. Sel."},{"key":"2023013112125993800_B12","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1002\/prot.21097","article-title":"Predicting g-protein coupled receptors-g-protein coupling specificity based on autocross-covariance transform","volume":"65","author":"Guo","year":"2006","journal-title":"Proteins"},{"key":"2023013112125993800_B13","doi-asserted-by":"crossref","first-page":"3025","DOI":"10.1093\/nar\/gkn159","article-title":"Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences","volume":"36","author":"Guo","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023013112125993800_B14","doi-asserted-by":"crossref","first-page":"D302","DOI":"10.1093\/nar\/gkj120","article-title":"The RCSB PDB information portal for structural genomics","volume":"34","author":"Kouranov","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023013112125993800_B15","doi-asserted-by":"crossref","first-page":"795","DOI":"10.1110\/ps.2500102","article-title":"Classification of G-protein coupled receptors by alignment-independent extraction of principal chemical properties of primary amino acid sequences","volume":"11","author":"Lapinsh","year":"2002","journal-title":"Protein Sci."},{"key":"2023013112125993800_B16","first-page":"329","article-title":"Multi-class protein fold recognition using adaptive codes","volume-title":"Proceedings of the 22nd International Conference on Machine learning","author":"Le","year":"2005"},{"key":"2023013112125993800_B17","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1006\/jmbi.1999.3377","article-title":"Identification of related proteins on family, superfamily and fold level","volume":"295","author":"Lindahl","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023013112125993800_B18","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1186\/1471-2105-9-389","article-title":"Combining classifiers for improved classification of proteins from sequence or structure","volume":"9","author":"Melvin","year":"2008","journal-title":"BMC Bioinformatics"},{"issue":"Suppl. 8","key":"2023013112125993800_B19","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1002\/prot.21767","article-title":"Critical assessment of methods of protein structure prediction-round VII","volume":"69","author":"Moult","year":"2007","journal-title":"Proteins"},{"key":"2023013112125993800_B20","doi-asserted-by":"crossref","first-page":"4239","DOI":"10.1093\/bioinformatics\/bti687","article-title":"Profile-based direct kernels for remote homology detection and fold recognition","volume":"21","author":"Rangwala","year":"2005","journal-title":"Bioinformatics"},{"key":"2023013112125993800_B21","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1186\/1471-2105-7-455","article-title":"Building multiclass classifiers for remote homology detection and fold recognition","volume":"7","author":"Rangwala","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023013112125993800_B22","doi-asserted-by":"crossref","first-page":"1682","DOI":"10.1093\/bioinformatics\/bth141","article-title":"Protein homology detection using string alignment kernels","volume":"20","author":"Saigo","year":"2004","journal-title":"Bioinformatics"},{"key":"2023013112125993800_B23","doi-asserted-by":"crossref","first-page":"3320","DOI":"10.1093\/bioinformatics\/btm527","article-title":"Support vector machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs","volume":"23","author":"Shamim","year":"2007","journal-title":"Bioinformatics"},{"key":"2023013112125993800_B24","doi-asserted-by":"crossref","first-page":"1717","DOI":"10.1093\/bioinformatics\/btl170","article-title":"Ensemble classifier for protein fold pattern recognition","volume":"22","author":"Shen","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013112125993800_B25","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/0022-2836(81)90087-5","article-title":"Identification of common molecular subsequences","volume":"147","author":"Smith","year":"1981","journal-title":"J. Mol. Biol."},{"key":"2023013112125993800_B26","doi-asserted-by":"crossref","first-page":"W244","DOI":"10.1093\/nar\/gki408","article-title":"The HHpred interactive server for protein homology detection and structure prediction","volume":"33","author":"Soding","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023013112125993800_B27","volume-title":"Statistical Learning Theory.","author":"Vapnik","year":"1998"},{"key":"2023013112125993800_B28","doi-asserted-by":"crossref","first-page":"1612","DOI":"10.1110\/ps.03601504","article-title":"Scoring profile-to-profile sequence alignments","volume":"13","author":"Wang","year":"2004","journal-title":"Protein Sci."},{"key":"2023013112125993800_B29","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1016\/0003-2670(93)80437-P","article-title":"DNA and peptide sequences and chemical processes multivariately modelled by principal component analysis and partial least-squares projections to latent structures","volume":"277","author":"Wold","year":"1993","journal-title":"Anal. Chim. Acta"},{"key":"2023013112125993800_B30","doi-asserted-by":"crossref","first-page":"897","DOI":"10.1006\/jmbi.2000.3786","article-title":"Estimating the number of protein folds and families from complete genome data","volume":"299","author":"Wolf","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023013112125993800_B31","doi-asserted-by":"crossref","first-page":"D187","DOI":"10.1093\/nar\/gkj161","article-title":"The universal protein resource (UniProt): an expanding universe of protein information","volume":"34","author":"Wu","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023013112125993800_B32","doi-asserted-by":"crossref","first-page":"3375","DOI":"10.1093\/nar\/gkm251","article-title":"LOMETS: a local meta-threading-server for protein structure prediction","volume":"35","author":"Wu","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023013112125993800_B33","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1109\/TCBB.2005.24","article-title":"Fold recognition by predicted alignment accuracy","volume":"2","author":"Xu","year":"2005","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"2023013112125993800_B34","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1142\/S0219720003000186","article-title":"RAPTOR: optimal protein threading by linear programming","volume":"1","author":"Xu","year":"2003","journal-title":"J. Bioinform. Comput. Biol."},{"key":"2023013112125993800_B35","doi-asserted-by":"crossref","first-page":"e2325","DOI":"10.1371\/journal.pone.0002325","article-title":"SP5: improving protein fold recognition by using torsion angle profiles and profile-based gap penalty model","volume":"3","author":"Zhang","year":"2008","journal-title":"PLoS ONE"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/20\/2655\/48993656\/bioinformatics_25_20_2655.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/20\/2655\/48993656\/bioinformatics_25_20_2655.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T21:40:13Z","timestamp":1675201213000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/20\/2655\/194053"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,8,25]]},"references-count":35,"journal-issue":{"issue":"20","published-print":{"date-parts":[[2009,10,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp500","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,10,15]]},"published":{"date-parts":[[2009,8,25]]}}}