{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T14:24:23Z","timestamp":1772202263697,"version":"3.50.1"},"reference-count":28,"publisher":"World Scientific Pub Co Pte Lt","issue":"05","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Bioinform. Comput. Biol."],"published-print":{"date-parts":[[2018,10]]},"abstract":"<jats:p>Secondary structure and solvent accessibility prediction provide valuable information for estimating the three dimensional structure of a protein. As new feature extraction methods are developed the dimensionality of the input feature space increases steadily. Reducing the number of dimensions provides several advantages such as faster model training, faster prediction and noise elimination. In this work, several dimensionality reduction techniques have been employed including various feature selection methods, autoencoders and PCA for protein secondary structure and solvent accessibility prediction. The reduced feature set is used to train a support vector machine at the second stage of a hybrid classifier. Cross-validation experiments on two difficult benchmarks demonstrate that the dimension of the input space can be reduced substantially while maintaining the prediction accuracy. This will enable the incorporation of additional informative features derived for predicting the structural properties of proteins without reducing the accuracy due to overfitting.<\/jats:p>","DOI":"10.1142\/s0219720018500208","type":"journal-article","created":{"date-parts":[[2018,8,3]],"date-time":"2018-08-03T09:52:11Z","timestamp":1533289931000},"page":"1850020","source":"Crossref","is-referenced-by-count":7,"title":["Dimensionality reduction for protein secondary structure and solvent accesibility prediction"],"prefix":"10.1142","volume":"16","author":[{"given":"Zafer","family":"Aydin","sequence":"first","affiliation":[{"name":"Department of Computer Engineering, Abdullah Gul University, Kayseri 38080, Turkey"}]},{"given":"O\u011fuz","family":"Kaynar","sequence":"additional","affiliation":[{"name":"Department of Management Information Systems, Cumhuriyet University, Sivas 58000, Turkey"}]},{"given":"Yasin","family":"G\u00f6rmez","sequence":"additional","affiliation":[{"name":"Department of Management Information Systems, Cumhuriyet University, Sivas 58000, Turkey"}]}],"member":"219","published-online":{"date-parts":[[2018,11,12]]},"reference":[{"key":"S0219720018500208BIB001","doi-asserted-by":"publisher","DOI":"10.1006\/jmbi.2001.4580"},{"key":"S0219720018500208BIB002","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-12-154"},{"key":"S0219720018500208BIB005","doi-asserted-by":"publisher","DOI":"10.2174\/092986612798472875"},{"key":"S0219720018500208BIB006","doi-asserted-by":"crossref","first-page":"1791","DOI":"10.1002\/prot.24074","volume":"80","author":"Joo K","year":"2012","journal-title":"Proteins Struct Funct Bioinforma"},{"key":"S0219720018500208BIB007","doi-asserted-by":"publisher","DOI":"10.1002\/prot.20176"},{"key":"S0219720018500208BIB008","doi-asserted-by":"publisher","DOI":"10.1002\/prot.22193"},{"key":"S0219720018500208BIB009","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btt344"},{"key":"S0219720018500208BIB010","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btr611"},{"key":"S0219720018500208BIB011","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-8-201"},{"key":"S0219720018500208BIB013","volume-title":"Data Mining: Concepts and Techniques","author":"Han J","year":"2011"},{"key":"S0219720018500208BIB014","doi-asserted-by":"publisher","DOI":"10.1016\/j.gene.2017.03.011"},{"key":"S0219720018500208BIB015","first-page":"657","volume":"58","author":"Adamczak R","year":"2009","journal-title":"World Academy of Science Engineering and Technology"},{"key":"S0219720018500208BIB016","doi-asserted-by":"publisher","DOI":"10.1016\/0169-7439(87)80084-9"},{"key":"S0219720018500208BIB019","volume":"4","author":"Ozarkar P","year":"2013","journal-title":"International Journal of Computer Engineering and Technology"},{"key":"S0219720018500208BIB020","doi-asserted-by":"publisher","DOI":"10.1109\/4.293109"},{"key":"S0219720018500208BIB021","doi-asserted-by":"publisher","DOI":"10.1142\/S0219720005001004"},{"key":"S0219720018500208BIB023","doi-asserted-by":"publisher","DOI":"10.1016\/0167-8655(89)90037-8"},{"key":"S0219720018500208BIB024","doi-asserted-by":"publisher","DOI":"10.1109\/72.977291"},{"key":"S0219720018500208BIB026","doi-asserted-by":"publisher","DOI":"10.1002\/bip.360221211"},{"key":"S0219720018500208BIB027","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/25.17.3389"},{"key":"S0219720018500208BIB028","doi-asserted-by":"publisher","DOI":"10.1038\/nmeth.1818"},{"key":"S0219720018500208BIB031","volume-title":"The Nature of Statistical Learning Theory","author":"Vapnik V","year":"2013"},{"key":"S0219720018500208BIB032","doi-asserted-by":"publisher","DOI":"10.1007\/BF00994018"},{"key":"S0219720018500208BIB034","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/5236.001.0001","author":"Rumelhart DE","year":"1986","journal-title":"Parallel Distrib Process"},{"key":"S0219720018500208BIB035","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-0134(19990301)34:4<508::AID-PROT10>3.0.CO;2-4"},{"key":"S0219720018500208BIB036","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkg619"},{"key":"S0219720018500208BIB042","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-0134(19990201)34:2<220::AID-PROT7>3.0.CO;2-K"},{"key":"S0219720018500208BIB043","doi-asserted-by":"publisher","DOI":"10.1016\/0005-2795(75)90109-9"}],"container-title":["Journal of Bioinformatics and Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219720018500208","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,7]],"date-time":"2020-11-07T07:12:48Z","timestamp":1604733168000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0219720018500208"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10]]},"references-count":28,"journal-issue":{"issue":"05","published-online":{"date-parts":[[2018,11,12]]},"published-print":{"date-parts":[[2018,10]]}},"alternative-id":["10.1142\/S0219720018500208"],"URL":"https:\/\/doi.org\/10.1142\/s0219720018500208","relation":{},"ISSN":["0219-7200","1757-6334"],"issn-type":[{"value":"0219-7200","type":"print"},{"value":"1757-6334","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,10]]}}}