{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,31]],"date-time":"2025-10-31T14:20:24Z","timestamp":1761920424200},"reference-count":38,"publisher":"World Scientific Pub Co Pte Lt","issue":"02","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Bioinform. Comput. Biol."],"published-print":{"date-parts":[[2018,4]]},"abstract":"<jats:p>We discuss applicability of principal component analysis (PCA) for protein tertiary structure prediction from amino acid sequence. The algorithm presented in this paper belongs to the category of protein refinement models and involves establishing a low-dimensional space where the sampling (and optimization) is carried out via particle swarm optimizer (PSO). The reduced space is found via PCA performed for a set of low-energy protein models previously found using different optimization techniques. A high frequency term is added into this expansion by projecting the best decoy into the PCA basis set and calculating the residual model. This term is aimed at providing high frequency details in the energy optimization. The goal of this research is to analyze how the dimensionality reduction affects the prediction capability of the PSO procedure. For that purpose, different proteins from the Critical Assessment of Techniques for Protein Structure Prediction experiments were modeled. In all the cases, both the energy of the best decoy and the distance to the native structure have decreased. Our analysis also shows how the predicted backbone structure of native conformation and of alternative low energy states varies with respect to the PCA dimensionality. Generally speaking, the reconstruction can be successfully achieved with 10 principal components and the high frequency term. We also provide a computational analysis of protein energy landscape for the inverse problem of reconstructing structure from the reduced number of principal components, showing that the dimensionality reduction alleviates the ill-posed character of this high-dimensional energy optimization problem. The procedure explained in this paper is very fast and allows testing different PCA expansions. Our results show that PSO improves the energy of the best decoy used in the PCA when the adequate number of PCA terms is considered.<\/jats:p>","DOI":"10.1142\/s0219720018500051","type":"journal-article","created":{"date-parts":[[2018,2,22]],"date-time":"2018-02-22T09:25:58Z","timestamp":1519291558000},"page":"1850005","source":"Crossref","is-referenced-by-count":7,"title":["Principal component analysis in protein tertiary structure prediction"],"prefix":"10.1142","volume":"16","author":[{"given":"\u00d3scar","family":"\u00c1lvarez","sequence":"first","affiliation":[{"name":"Group of Inverse Problems, Optimization and Machine Learning, Department of Mathematics, University of Oviedo, C. Federico Garc\u00eda Lorca, 18, 33007 Oviedo, Spain"}]},{"given":"Juan Luis","family":"Fern\u00e1ndez-Mart\u00ednez","sequence":"additional","affiliation":[{"name":"Group of Inverse Problems, Optimization and Machine Learning, Department of Mathematics, University of Oviedo, C. Federico Garc\u00eda Lorca, 18, 33007 Oviedo, Spain"}]},{"given":"Celia","family":"Fern\u00e1ndez-Brillet","sequence":"additional","affiliation":[{"name":"Group of Inverse Problems, Optimization and Machine Learning, Department of Mathematics, University of Oviedo, C. Federico Garc\u00eda Lorca, 18, 33007 Oviedo, Spain"}]},{"given":"Ana","family":"Cernea","sequence":"additional","affiliation":[{"name":"Group of Inverse Problems, Optimization and Machine Learning, Department of Mathematics, University of Oviedo, C. Federico Garc\u00eda Lorca, 18, 33007 Oviedo, Spain"}]},{"given":"Zulima","family":"Fern\u00e1ndez-Mu\u00f1iz","sequence":"additional","affiliation":[{"name":"Group of Inverse Problems, Optimization and Machine Learning, Department of Mathematics, University of Oviedo, C. Federico Garc\u00eda Lorca, 18, 33007 Oviedo, Spain"}]},{"given":"Andrzej","family":"Kloczkowski","sequence":"additional","affiliation":[{"name":"Batelle Center for Mathematical Medicine, Nationwide Children\u2019s Hospital, Columbus, OH, USA"},{"name":"Department of Pediatrics, The Ohio State University, Columbus, OH, USA"}]}],"member":"219","published-online":{"date-parts":[[2018,5,8]]},"reference":[{"key":"S0219720018500051BIB001","doi-asserted-by":"publisher","DOI":"10.1016\/j.sbi.2008.02.004"},{"key":"S0219720018500051BIB002","doi-asserted-by":"publisher","DOI":"10.1006\/jmbi.1997.0959"},{"key":"S0219720018500051BIB003","doi-asserted-by":"crossref","first-page":"1715","DOI":"10.1002\/prot.24065","volume":"80","author":"Xu D","year":"2012","journal-title":"Proteins Struct Funct Bioinform"},{"key":"S0219720018500051BIB004","doi-asserted-by":"publisher","DOI":"10.1110\/ps.036442.108"},{"key":"S0219720018500051BIB005","doi-asserted-by":"publisher","DOI":"10.1126\/science.1853201"},{"key":"S0219720018500051BIB006","doi-asserted-by":"publisher","DOI":"10.1038\/358086a0"},{"key":"S0219720018500051BIB007","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-9-40"},{"key":"S0219720018500051BIB008","volume-title":"Dynamic Programming","author":"Bellman RE","year":"1957"},{"key":"S0219720018500051BIB009","doi-asserted-by":"publisher","DOI":"10.1190\/tle34091006.1"},{"key":"S0219720018500051BIB010","doi-asserted-by":"publisher","DOI":"10.1002\/prot.20853"},{"key":"S0219720018500051BIB011","doi-asserted-by":"publisher","DOI":"10.1002\/jcc.20827"},{"key":"S0219720018500051BIB012","doi-asserted-by":"publisher","DOI":"10.1021\/ct300962x"},{"key":"S0219720018500051BIB013","doi-asserted-by":"publisher","DOI":"10.1063\/1.4710986"},{"key":"S0219720018500051BIB014","doi-asserted-by":"publisher","DOI":"10.1002\/prot.10552"},{"key":"S0219720018500051BIB015","doi-asserted-by":"publisher","DOI":"10.1016\/S0022-2836(02)00698-8"},{"key":"S0219720018500051BIB016","doi-asserted-by":"publisher","DOI":"10.1002\/prot.10529"},{"key":"S0219720018500051BIB017","volume-title":"Organic and Biological Chemistry","author":"Stoker H","year":"2015"},{"key":"S0219720018500051BIB018","doi-asserted-by":"publisher","DOI":"10.1002\/prot.22229"},{"key":"S0219720018500051BIB019","doi-asserted-by":"publisher","DOI":"10.1126\/science.1065659"},{"key":"S0219720018500051BIB020","doi-asserted-by":"publisher","DOI":"10.1039\/b719351c"},{"key":"S0219720018500051BIB021","doi-asserted-by":"publisher","DOI":"10.1190\/geo2011-0341.1"},{"key":"S0219720018500051BIB022","doi-asserted-by":"publisher","DOI":"10.1007\/s11004-008-9151-y"},{"key":"S0219720018500051BIB024","doi-asserted-by":"publisher","DOI":"10.1016\/j.jappgeo.2010.02.001"},{"key":"S0219720018500051BIB025","doi-asserted-by":"publisher","DOI":"10.1007\/s00894-012-1410-7"},{"key":"S0219720018500051BIB026","doi-asserted-by":"publisher","DOI":"10.1007\/s00894-013-1911-z"},{"key":"S0219720018500051BIB028","volume-title":"Principal Component Analysis","author":"Jolliffe I","year":"2002"},{"key":"S0219720018500051BIB029","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0404703101"},{"key":"S0219720018500051BIB030","doi-asserted-by":"publisher","DOI":"10.1007\/s11721-009-0034-8"},{"key":"S0219720018500051BIB031","doi-asserted-by":"publisher","DOI":"10.1109\/TEVC.2010.2053935"},{"key":"S0219720018500051BIB032","doi-asserted-by":"publisher","DOI":"10.1142\/S0218213012400118"},{"key":"S0219720018500051BIB033","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1016\/j.amc.2014.10.066","volume":"249","author":"Garc\u00eda-Gonzalo E","year":"2014","journal-title":"Appl Math Comput"},{"key":"S0219720018500051BIB034","doi-asserted-by":"publisher","DOI":"10.1016\/j.mcm.2011.07.009"},{"key":"S0219720018500051BIB035","doi-asserted-by":"publisher","DOI":"10.1190\/geo2011-0041.1"},{"key":"S0219720018500051BIB037","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btk037"},{"key":"S0219720018500051BIB038","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btm627"},{"key":"S0219720018500051BIB039","first-page":"15","volume":"22","author":"Gniewek P","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"S0219720018500051BIB041","doi-asserted-by":"publisher","DOI":"10.4018\/jaec.2010070102"},{"key":"S0219720018500051BIB046","doi-asserted-by":"publisher","DOI":"10.1038\/nphys375"}],"container-title":["Journal of Bioinformatics and Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219720018500051","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,10,28]],"date-time":"2020-10-28T12:41:54Z","timestamp":1603888914000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0219720018500051"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,4]]},"references-count":38,"journal-issue":{"issue":"02","published-online":{"date-parts":[[2018,5,8]]},"published-print":{"date-parts":[[2018,4]]}},"alternative-id":["10.1142\/S0219720018500051"],"URL":"https:\/\/doi.org\/10.1142\/s0219720018500051","relation":{},"ISSN":["0219-7200","1757-6334"],"issn-type":[{"value":"0219-7200","type":"print"},{"value":"1757-6334","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,4]]}}}