{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T14:24:56Z","timestamp":1772202296778,"version":"3.50.1"},"reference-count":54,"publisher":"Oxford University Press (OUP)","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The precise prediction of protein secondary structure is of key importance for the prediction of 3D structure and biological function. Although the development of many excellent methods over the last few decades has allowed the achievement of prediction accuracies of up to 80%, progress seems to have reached a bottleneck, and further improvements in accuracy have proven difficult.<\/jats:p>\n               <jats:p>Results: We propose for the first time a structural position-specific scoring matrix (SPSSM), and establish an unprecedented database of 9 million sequences and their SPSSMs. This database, when combined with a purpose-designed BLAST tool, provides a novel prediction tool: SPSSMPred. When the SPSSMPred was validated on a large dataset (10 814 entries), the Q3 accuracy of the protein secondary structure prediction was 93.4%. Our approach was tested on the two latest EVA sets; accuracies of 82.7 and 82.0% were achieved, far higher than can be achieved using other predictors. For further evaluation, we tested our approach on newly determined sequences (141 entries), and obtained an accuracy of 89.6%. For a set of low-homology proteins (40 entries), the SPSSMPred still achieved a Q3 value of 84.6%.<\/jats:p>\n               <jats:p>Availability: The SPSSMPred server is available at http:\/\/cal.tongji.edu.cn\/SPSSMPred\/<\/jats:p>\n               <jats:p>Contact: \u00a0lith@tongji.edu.cn<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr611","type":"journal-article","created":{"date-parts":[[2011,11,8]],"date-time":"2011-11-08T07:42:28Z","timestamp":1320738148000},"page":"32-39","source":"Crossref","is-referenced-by-count":30,"title":["A novel structural position-specific scoring matrix for the prediction of protein secondary structures"],"prefix":"10.1093","volume":"28","author":[{"given":"Dapeng","family":"Li","sequence":"first","affiliation":[{"name":"Department of Chemistry, Tongji University, Shanghai 200092, China"}]},{"given":"Tonghua","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Chemistry, Tongji University, Shanghai 200092, China"}]},{"given":"Peisheng","family":"Cong","sequence":"additional","affiliation":[{"name":"Department of Chemistry, Tongji University, Shanghai 200092, China"}]},{"given":"Wenwei","family":"Xiong","sequence":"additional","affiliation":[{"name":"Department of Chemistry, Tongji University, Shanghai 200092, China"}]},{"given":"Jiangming","family":"Sun","sequence":"additional","affiliation":[{"name":"Department of Chemistry, Tongji University, Shanghai 200092, China"}]}],"member":"286","published-online":{"date-parts":[[2011,11,7]]},"reference":[{"key":"2023061011443117600_B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023061011443117600_B2","doi-asserted-by":"crossref","first-page":"178","DOI":"10.1186\/1471-2105-7-178","article-title":"Protein secondary structure prediction for a single-sequence using hidden semi-Markov models","volume":"7","author":"Aydin","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023061011443117600_B3","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1016\/j.cmpb.2010.04.005","article-title":"Protein secondary structure prediction using modular reciprocal bidirectional recurrent neural networks","volume":"100","author":"Babaei","year":"2010","journal-title":"Comput. Methods Programs Biomed."},{"key":"2023061011443117600_B4","doi-asserted-by":"crossref","first-page":"2069","DOI":"10.1016\/j.talanta.2006.09.015","article-title":"Prediction of protein secondary structure content using support vector machine","volume":"71","author":"Chen","year":"2007","journal-title":"Talanta"},{"key":"2023061011443117600_B5","doi-asserted-by":"crossref","first-page":"27","DOI":"10.2174\/092986609787049420","article-title":"Prediction of protein secondary structure content by using the concept of Chou's pseudo amino acid composition and support vector machine","volume":"16","author":"Chen","year":"2009","journal-title":"Protein Pept. Lett."},{"key":"2023061011443117600_B6","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1109\/tcbb.2007.1055","article-title":"Cascaded bidirectional recurrent neural networks for protein secondary structure prediction","volume":"4","author":"Chen","year":"2007","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"2023061011443117600_B7","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1021\/bi00699a001","article-title":"Conformational parameters for amino acids in helical, beta-sheet, and random coil regions calculated from proteins","volume":"13","author":"Chou","year":"1974","journal-title":"Biochemistry"},{"key":"2023061011443117600_B8","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1021\/bi00699a002","article-title":"Prediction of protein conformation","volume":"13","author":"Chou","year":"1974","journal-title":"Biochemistry"},{"key":"2023061011443117600_B9","doi-asserted-by":"crossref","first-page":"933","DOI":"10.1016\/S0022-2836(02)01379-7","article-title":"Interplay among tertiary contacts, secondary structure formation and side-chain packing in the protein folding mechanism: all-atom representation study of protein L","volume":"326","author":"Clementi","year":"2003","journal-title":"J. Mol. Biol."},{"key":"2023061011443117600_B10","doi-asserted-by":"crossref","first-page":"W197","DOI":"10.1093\/nar\/gkn238","article-title":"The Jpred 3 secondary structure prediction server","volume":"36","author":"Cole","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023061011443117600_B11","doi-asserted-by":"crossref","first-page":"892","DOI":"10.1093\/bioinformatics\/14.10.892","article-title":"JPred: a consensus secondary structure prediction server","volume":"14","author":"Cuff","year":"1998","journal-title":"Bioinformatics"},{"key":"2023061011443117600_B12","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1093\/protein\/1.4.289","article-title":"An algorithm for protein secondary structure prediction based on class prediction","volume":"1","author":"Deleage","year":"1987","journal-title":"Protein Eng."},{"key":"2023061011443117600_B13","first-page":"100","article-title":"Incorporating global information into secondary structure prediction with hidden Markov models of protein folds","volume":"5","author":"Di Francesco","year":"1997","journal-title":"Proc. Int. Conf. Intell. Syst. Mol. Biol."},{"key":"2023061011443117600_B14","doi-asserted-by":"crossref","first-page":"1242","DOI":"10.1093\/bioinformatics\/17.12.1242","article-title":"EVA: continuous automatic evaluation of protein structure prediction servers","volume":"17","author":"Eyrich","year":"2001","journal-title":"Bioinformatics"},{"key":"2023061011443117600_B15","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1002\/(SICI)1097-0134(199703)27:3<329::AID-PROT1>3.0.CO;2-8","article-title":"Seventy-five percent accuracy in protein secondary structure prediction","volume":"27","author":"Frishman","year":"1997","journal-title":"Proteins"},{"key":"2023061011443117600_B16","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1016\/0022-2836(78)90297-8","article-title":"Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins","volume":"120","author":"Garnier","year":"1978","journal-title":"J. Mol. Biol."},{"key":"2023061011443117600_B17","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1093\/protein\/7.2.157","article-title":"SOPM: a self-optimized method for protein secondary structure prediction","volume":"7","author":"Geourjon","year":"1994","journal-title":"Protein Eng."},{"key":"2023061011443117600_B18","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1073\/pnas.86.1.152","article-title":"Protein secondary structure prediction with a neural network","volume":"86","author":"Holley","year":"1989","journal-title":"Proc. Natl. Acad. Sci. U.S.A."},{"key":"2023061011443117600_B19","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1109\/TNB.2004.837906","article-title":"Improved protein secondary structure prediction using support vector machine with a new encoding scheme and an advanced tertiary classifier","volume":"3","author":"Hu","year":"2004","journal-title":"IEEE Trans. Nanobioscience"},{"key":"2023061011443117600_B20","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1006\/jmbi.1999.3091","article-title":"Protein secondary structure prediction based on position-specific scoring matrices","volume":"292","author":"Jones","year":"1999","journal-title":"J. Mol. Biol."},{"key":"2023061011443117600_B21","doi-asserted-by":"crossref","first-page":"2577","DOI":"10.1002\/bip.360221211","article-title":"Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features","volume":"22","author":"Kabsch","year":"1983","journal-title":"Biopolymers"},{"key":"2023061011443117600_B22","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1016\/0014-5793(82)80597-8","article-title":"How good are predictions of protein secondary structure?","volume":"155","author":"Kabsch","year":"1983","journal-title":"FEBS Lett."},{"key":"2023061011443117600_B23","doi-asserted-by":"crossref","first-page":"662","DOI":"10.1038\/181662a0","article-title":"A three-dimensional model of the myoglobin molecule obtained by X-ray analysis","volume":"181","author":"Kendrew","year":"1958","journal-title":"Nature"},{"key":"2023061011443117600_B24","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1016\/S0022-2836(05)80333-X","article-title":"Machine learning approach for the prediction of protein secondary structure","volume":"216","author":"King","year":"1990","journal-title":"J. Mol. Biol."},{"key":"2023061011443117600_B25","doi-asserted-by":"crossref","first-page":"2298","DOI":"10.1002\/pro.5560051116","article-title":"Identification and application of the concepts important for accurate and reliable protein secondary structure prediction","volume":"5","author":"King","year":"1996","journal-title":"Protein Sci."},{"key":"2023061011443117600_B26","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1016\/0022-2836(90)90154-E","article-title":"Improvements in protein secondary structure prediction by an enhanced neural network","volume":"214","author":"Kneller","year":"1990","journal-title":"J. Mol. Biol."},{"key":"2023061011443117600_B27","first-page":"282","article-title":"Conditional random fields: probabilistic models for segmenting and labeling sequence data","author":"Lafferty","year":"2001","journal-title":"Proceedings of the 18th International Conference on Machine Learning."},{"key":"2023061011443117600_B28","doi-asserted-by":"crossref","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","article-title":"Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"Bioinformatics"},{"key":"2023061011443117600_B29","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1016\/0022-2836(74)90404-5","article-title":"Structural principles of the globular organization of protein chains. A stereochemical theory of globular protein secondary structure","volume":"88","author":"Lim","year":"1974","journal-title":"J. Mol. Biol."},{"issue":"Suppl. 4","key":"2023061011443117600_B30","doi-asserted-by":"crossref","first-page":"S4","DOI":"10.1186\/1471-2164-11-S4-S4","article-title":"Improving protein secondary structure prediction based on short subsequences with local structure similarity","volume":"11","author":"Lin","year":"2010","journal-title":"BMC Genomics"},{"key":"2023061011443117600_B31","doi-asserted-by":"crossref","first-page":"3099","DOI":"10.1093\/bioinformatics\/bth370","article-title":"Comparison of probabilistic combination methods for protein secondary structure prediction","volume":"20","author":"Liu","year":"2004","journal-title":"Bioinformatics"},{"key":"2023061011443117600_B32","doi-asserted-by":"crossref","first-page":"2517","DOI":"10.1002\/pro.5560041208","article-title":"A simple and fast approach to prediction of protein secondary structure from multiply aligned sequences with accuracy above 70%","volume":"4","author":"Mehta","year":"1995","journal-title":"Protein Sci."},{"key":"2023061011443117600_B33","doi-asserted-by":"crossref","first-page":"5027","DOI":"10.1073\/pnas.91.11.5027","article-title":"An algorithm to generate low-resolution protein tertiary structures from knowledge of secondary structure","volume":"91","author":"Monge","year":"1994","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023061011443117600_B34","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1186\/1471-2105-7-301","article-title":"Improving the accuracy of protein secondary structure prediction using structural alignment","volume":"7","author":"Montgomerie","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023061011443117600_B35","doi-asserted-by":"crossref","first-page":"633","DOI":"10.1038\/199633a0","article-title":"Structure of haemoglobin: a three-dimensional fourier synthesis of reduced human haemoglobin at 5.5 \u00c5 resolution","volume":"199","author":"Muirhead","year":"1963","journal-title":"Nature"},{"key":"2023061011443117600_B36","doi-asserted-by":"crossref","first-page":"917","DOI":"10.1002\/prot.10507","article-title":"Better prediction of sub-cellular localization by combining evolutionary and structural information","volume":"53","author":"Nair","year":"2003","journal-title":"Proteins"},{"key":"2023061011443117600_B37","first-page":"218","article-title":"Multi-class support vector machines for protein secondary structure prediction","volume":"14","author":"Nguyen","year":"2003","journal-title":"Genome Inform."},{"key":"2023061011443117600_B38","doi-asserted-by":"crossref","first-page":"545","DOI":"10.1002\/prot.20677","article-title":"A new catalog of protein beta-sheets","volume":"61","author":"Parisien","year":"2005","journal-title":"Proteins"},{"key":"2023061011443117600_B39","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1186\/1471-2105-8-201","article-title":"Accurate prediction of protein secondary structure and solvent accessibility by consensus combiners of sequence and structure information","volume":"8","author":"Pollastri","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023061011443117600_B40","doi-asserted-by":"crossref","first-page":"983","DOI":"10.1021\/bi00119a006","article-title":"A segment-based approach to protein secondary structure prediction","volume":"31","author":"Presnell","year":"1992","journal-title":"Biochemistry"},{"key":"2023061011443117600_B41","doi-asserted-by":"crossref","first-page":"D392","DOI":"10.1093\/nar\/gkq1021","article-title":"The RCSB Protein Data Bank: redesigned web site and web services","volume":"39","author":"Rose","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023061011443117600_B42","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1201\/9780203911327.ch8","article-title":"Rising accuracy of protein secondary structure prediction","volume-title":"Protein Structure Determination, Analysis, and Applications for Drug Discovery.","author":"Rost","year":"2003"},{"key":"2023061011443117600_B43","doi-asserted-by":"crossref","first-page":"584","DOI":"10.1006\/jmbi.1993.1413","article-title":"Prediction of protein secondary structure at better than 70% accuracy","volume":"232","author":"Rost","year":"1993","journal-title":"J. Mol. Biol."},{"key":"2023061011443117600_B44","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/S0022-2836(05)80007-5","article-title":"Redefining the goals of protein secondary structure prediction","volume":"235","author":"Rost","year":"1994","journal-title":"J. Mol. Biol."},{"key":"2023061011443117600_B45","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1006\/jmbi.1997.0958","article-title":"Protein secondary structure prediction using local alignments","volume":"268","author":"Salamov","year":"1997","journal-title":"J. Mol. Biol."},{"key":"2023061011443117600_B46","author":"Schrodinger","year":"2010","journal-title":"The PyMOL Molecular Graphics System"},{"key":"2023061011443117600_B47","first-page":"661","article-title":"Predicting alpha-helix and beta-strand segments of globular proteins","volume":"10","author":"Solovyev","year":"1994","journal-title":"Comput. Appl. Biosci."},{"key":"2023061011443117600_B48","doi-asserted-by":"crossref","first-page":"1650","DOI":"10.1093\/bioinformatics\/btg223","article-title":"Secondary structure prediction with support vector machines","volume":"19","author":"Ward","year":"2003","journal-title":"Bioinformatics"},{"key":"2023061011443117600_B49","doi-asserted-by":"crossref","first-page":"D187","DOI":"10.1093\/nar\/gkj161","article-title":"The Universal Protein Resource (UniProt): an expanding universe of protein information","volume":"34","author":"Wu","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023061011443117600_B50","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1016\/j.pep.2010.09.004","article-title":"Expression, purification and functional analysis of an odorant binding protein AaegOBP22 from Aedes aegypti","volume":"75","author":"Yang","year":"2011","journal-title":"Protein Expres. Purif."},{"key":"2023061011443117600_B51","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1002\/(SICI)1097-0134(19990201)34:2<220::AID-PROT7>3.0.CO;2-K","article-title":"A modified definition of Sov, a segment-based measure for protein secondary structure prediction assessment","volume":"34","author":"Zemla","year":"1999","journal-title":"Proteins"},{"key":"2023061011443117600_B52","doi-asserted-by":"crossref","first-page":"420","DOI":"10.1504\/IJBRA.2005.008445","article-title":"Protein secondary structure prediction by combining hidden Markov models and sliding window scores","volume":"1","author":"Zheng","year":"2005","journal-title":"Int. J. Bioinform. Res. Appl."},{"key":"2023061011443117600_B53","doi-asserted-by":"crossref","first-page":"470","DOI":"10.1093\/bioinformatics\/btp679","article-title":"A novel method for accurate one-dimensional protein structure prediction based on fragment matching","volume":"26","author":"Zhou","year":"2010","journal-title":"Bioinformatics"},{"key":"2023061011443117600_B54","doi-asserted-by":"crossref","first-page":"957","DOI":"10.1016\/0022-2836(87)90501-8","article-title":"Prediction of protein secondary structure and active sites using the alignment of homologous sequences","volume":"195","author":"Zvelebil","year":"1987","journal-title":"J. Mol. Biol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/1\/32\/50568203\/bioinformatics_28_1_32.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/1\/32\/50568203\/bioinformatics_28_1_32.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,10]],"date-time":"2023-06-10T11:46:06Z","timestamp":1686397566000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/1\/32\/220472"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,11,7]]},"references-count":54,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,1,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr611","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,1,1]]},"published":{"date-parts":[[2011,11,7]]}}}