{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,5,13]],"date-time":"2023-05-13T09:29:46Z","timestamp":1683970186407},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"S1","license":[{"start":{"date-parts":[[2013,1,1]],"date-time":"2013-01-01T00:00:00Z","timestamp":1356998400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2013,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Recently, information derived from correlated mutations in proteins has regained relevance for predicting protein contacts. This is due to new forms of mutual information analysis that have been proven to be more suitable to highlight direct coupling between pairs of residues in protein structures and to the large number of protein chains that are currently available for statistical validation. It was previously discussed that disulfide bond topology in proteins is also constrained by correlated mutations.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>In this paper we exploit information derived from a corrected mutual information analysis and from the inverse of the covariance matrix to address the problem of the prediction of the topology of disulfide bonds in Eukaryotes. Recently, we have shown that Support Vector Regression (SVR) can improve the prediction of disulfide connectivity patterns. Here we show that the inclusion of the correlated mutation information increases the SVR performance by 5 percentage points (from 54% to 59%). 
When this approach is used in combination with a method previously developed by us and scoring at the state of the art in predicting both location and topology of disulfide bonds in Eukaryotes (DisLocate), the per-protein accuracy is 38%, 2 percentage points higher than that previously obtained.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>In this paper we show that the inclusion of information derived from correlated mutations can improve the performance of the state-of-the-art methods for predicting disulfide connectivity patterns in Eukaryotic proteins. Our analysis also provides support to the notion that improving methods to extract evolutionary information from multiple sequence alignments greatly contributes to the scoring performance of predictors suited to detect relevant features from protein chains.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-14-s1-s10","type":"journal-article","created":{"date-parts":[[2019,12,11]],"date-time":"2019-12-11T01:59:19Z","timestamp":1576029559000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations"],"prefix":"10.1186","volume":"14","author":[{"given":"Castrense","family":"Savojardo","sequence":"first","affiliation":[]},{"given":"Piero","family":"Fariselli","sequence":"additional","affiliation":[]},{"given":"Pier Luigi","family":"Martelli","sequence":"additional","affiliation":[]},{"given":"Rita","family":"Casadio","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2013,1,14]]},"reference":[{"issue":"9","key":"5584_CR1","doi-asserted-by":"publisher","first-page":"935","DOI":"10.1111\/j.1365-2443.2010.01434.x","volume":"15","author":"K Inaba","year":"2010","unstructured":"Inaba K: Structural basis of protein 
disulfide bond generation in the cell. Genes Cells. 2010, 15 (9): 935-43. 10.1111\/j.1365-2443.2010.01434.x.","journal-title":"Genes Cells"},{"issue":"12","key":"5584_CR2","doi-asserted-by":"publisher","first-page":"951","DOI":"10.1093\/protein\/15.12.951","volume":"15","author":"PL Martelli","year":"2002","unstructured":"Martelli PL, Fariselli P, Malaguti L, Casadio R: Prediction of the disulfide bonding state of cysteines in proteins with hidden neural networks. Protein Eng. 2002, 15 (12): 951-953. 10.1093\/protein\/15.12.951.","journal-title":"Protein Eng"},{"issue":"3","key":"5584_CR3","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1002\/prot.10047","volume":"46","author":"MH Mucchielli-Giorgi","year":"2002","unstructured":"Mucchielli-Giorgi MH, Hazout S, Tuff\u00e9ry P: Predicting the disulfide bonding state of cysteines using protein descriptors. Proteins. 2002, 46 (3): 243-249. 10.1002\/prot.10047.","journal-title":"Proteins"},{"issue":"4","key":"5584_CR4","doi-asserted-by":"publisher","first-page":"1036","DOI":"10.1002\/prot.20079","volume":"55","author":"YC Chen","year":"2004","unstructured":"Chen YC, Lin YS, Lin CJ, Hwang JK: Prediction of the bonding states of cysteines using the support vector machines based on multiple feature vectors and cysteine state sequences. Proteins. 2004, 55 (4): 1036-1042. 10.1002\/prot.20079.","journal-title":"Proteins"},{"issue":"10","key":"5584_CR5","doi-asserted-by":"publisher","first-page":"957","DOI":"10.1093\/bioinformatics\/17.10.957","volume":"17","author":"P Fariselli","year":"2001","unstructured":"Fariselli P, Casadio R: Prediction of disulfide connectivity in proteins. Bioinformatics. 2001, 17 (10): 957-964. 
10.1093\/bioinformatics\/17.10.957.","journal-title":"Bioinformatics"},{"issue":"5","key":"5584_CR6","doi-asserted-by":"publisher","first-page":"653","DOI":"10.1093\/bioinformatics\/btg463","volume":"20","author":"A Vullo","year":"2004","unstructured":"Vullo A, Frasconi P: Disulfide connectivity prediction using recursive neural networks and evolutionary information. Bioinformatics. 2004, 20 (5): 653-659. 10.1093\/bioinformatics\/btg463.","journal-title":"Bioinformatics"},{"issue":"10","key":"5584_CR7","doi-asserted-by":"publisher","first-page":"2336","DOI":"10.1093\/bioinformatics\/bti328","volume":"21","author":"F Ferr\u00e8","year":"2005","unstructured":"Ferr\u00e8 F, Clote P: Disulfide connectivity prediction using secondary structure information and diresidue frequencies. Bioinformatics. 2005, 21 (10): 2336-2346. 10.1093\/bioinformatics\/bti328.","journal-title":"Bioinformatics"},{"issue":"23","key":"5584_CR8","doi-asserted-by":"publisher","first-page":"3147","DOI":"10.1093\/bioinformatics\/btm505","volume":"23","author":"J Song","year":"2007","unstructured":"Song J, Yuan Z, Tan H, Huber T, Burrage K: Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure. Bioinformatics. 2007, 23 (23): 3147-3154. 10.1093\/bioinformatics\/btm505.","journal-title":"Bioinformatics"},{"issue":"3","key":"5584_CR9","doi-asserted-by":"publisher","first-page":"617","DOI":"10.1002\/prot.20787","volume":"62","author":"J Cheng","year":"2006","unstructured":"Cheng J, Saigo H, Baldi P: Large-scale prediction of disulphide bridges using kernel methods, two-dimensional recursive neural networks, and weighted graph matching. Proteins. 
2006, 62 (3): 617-629.","journal-title":"Proteins"},{"key":"5584_CR10","doi-asserted-by":"publisher","first-page":"896","DOI":"10.1145\/1102351.1102464","volume-title":"Proceedings of the 22nd International Conference on Machine Learning (ICML '05)","author":"B Taskar","year":"2005","unstructured":"Taskar B, Chatalbashev V, Koller D, Guestrin C: Learning structured prediction models: a large margin approach. Proceedings of the 22nd International Conference on Machine Learning (ICML '05). 2005, New York: ACM, 896-903. 10.1145\/1102351.1102464."},{"key":"5584_CR11","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1186\/1471-2105-9-20","volume":"9","author":"M Vincent","year":"2008","unstructured":"Vincent M, Passerini A, Labb\u00e9 M, Frasconi P: A simplified approach to disulfide connectivity prediction from protein sequences. BMC Bioinformatics. 2008, 9: 20-10.1186\/1471-2105-9-20.","journal-title":"BMC Bioinformatics"},{"issue":"16","key":"5584_CR12","doi-asserted-by":"publisher","first-page":"2224","DOI":"10.1093\/bioinformatics\/btr387","volume":"27","author":"C Savojardo","year":"2011","unstructured":"Savojardo C, Fariselli P, Martelli PL, Pierleoni A, Casadio R: Improving the prediction of disulfide bonds in Eukaryotes with machine learning methods and protein subcellular localization. Bioinformatics. 2011, 27 (16): 2224-2230. 10.1093\/bioinformatics\/btr387.","journal-title":"Bioinformatics"},{"key":"5584_CR13","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1002\/prot.340180402","volume":"18","author":"U Gobel","year":"1994","unstructured":"Gobel U, Sander C, Schneider R, Valencia A: Correlated mutations and residue contacts in proteins. Proteins. 1994, 18: 309-317. 
10.1002\/prot.340180402.","journal-title":"Proteins"},{"key":"5584_CR14","doi-asserted-by":"publisher","first-page":"S25","DOI":"10.1016\/S1359-0278(97)00060-6","volume":"2","author":"O Olmea","year":"1997","unstructured":"Olmea O, Valencia A: Improving contact predictions by the combination of correlated mutations and other sources of sequence information. Fold Des. 1997, 2: S25-S32.","journal-title":"Fold Des"},{"issue":"Suppl 5","key":"5584_CR15","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1002\/prot.1173","volume":"45","author":"P Fariselli","year":"2001","unstructured":"Fariselli P, Olmea O, Valencia A, Casadio R: Progress in predicting inter- residue contacts of proteins with neural networks and correlated mutations. Proteins. 2001, 45 (Suppl 5): 157-162.","journal-title":"Proteins"},{"key":"5584_CR16","doi-asserted-by":"publisher","first-page":"1017","DOI":"10.1109\/TCBB.2010.91","volume":"8","author":"P Di Lena","year":"2011","unstructured":"Di Lena P, Fariselli P, Margara L, Vassura M, Casadio R: Is there an optimal substitution matrix for contact prediction with correlated mutations?. IEEE\/ACM Trans Comput Biol Bioinform. 2011, 8: 1017-1028.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"2","key":"5584_CR17","doi-asserted-by":"publisher","first-page":"498","DOI":"10.1093\/bioinformatics\/btm637","volume":"24","author":"R Rubinstein","year":"2008","unstructured":"Rubinstein R, Fiser A: Predicting disulfide bond connectivity in proteins by correlated mutations analysis. Bioinformatics. 2008, 24 (2): 498-504.","journal-title":"Bioinformatics"},{"issue":"3","key":"5584_CR18","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1093\/bioinformatics\/btm604","volume":"24","author":"SD Dunn","year":"2008","unstructured":"Dunn SD, Wahl LM, Gloor GB: Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction. Bioinformatics. 2008, 24 (3): 333-340. 
10.1093\/bioinformatics\/btm604.","journal-title":"Bioinformatics"},{"issue":"12","key":"5584_CR19","doi-asserted-by":"publisher","first-page":"e28766","DOI":"10.1371\/journal.pone.0028766","volume":"6","author":"DS Marks","year":"2011","unstructured":"Marks DS, Colwell LJ, Sheridan R, Hopf TA, Pagnani A, Zecchina R, Sander C: Protein 3D structure computed from evolutionary sequence variation. PLoS ONE. 2011, 6 (12): e28766-10.1371\/journal.pone.0028766.","journal-title":"PLoS ONE"},{"issue":"2","key":"5584_CR20","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1093\/bioinformatics\/btr638","volume":"28","author":"DT Jones","year":"2011","unstructured":"Jones DT, Buchan DWA, Cozzetto D, Pontil M, PSICOV: Precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments. Bioinformatics. 2011, 28 (2): 184-190.","journal-title":"Bioinformatics"},{"issue":"1","key":"5584_CR21","doi-asserted-by":"publisher","first-page":"e1000633","DOI":"10.1371\/journal.pcbi.1000633","volume":"6","author":"L Burger","year":"2010","unstructured":"Burger L, van Nimwegen E: Disentangling direct from indirect co-evolution of residues in protein alignments. PLoS Comput Biol. 2010, 6 (1): e1000633-10.1371\/journal.pcbi.1000633.","journal-title":"PLoS Comput Biol"},{"key":"5584_CR22","first-page":"485","volume":"9","author":"O Banerjee","year":"2008","unstructured":"Banerjee O, El Ghaoui L, d'Aspremont A: Model selection through sparse maximum likelihood estimation for multivariate Gaussian or binary data. Journal of Machine Learning Research. 2008, 9: 485-516. 
[http:\/\/jmlr.csail.mit.edu\/papers\/v9\/banerjee08a.html]","journal-title":"Journal of Machine Learning Research"},{"key":"5584_CR23","doi-asserted-by":"publisher","first-page":"432","DOI":"10.1093\/biostatistics\/kxm045","volume":"9","author":"J Friedman","year":"2008","unstructured":"Friedman J, Hastie T, Tibshirani R: Sparse inverse covariance estimation with the graphical Lasso. Biostatistics. 2008, 9: 432-441. 10.1093\/biostatistics\/kxm045.","journal-title":"Biostatistics"},{"key":"5584_CR24","doi-asserted-by":"crossref","unstructured":"Fariselli P, Savojardo C, Martelli PL, Casadio R: Grammatical-Restrained Hidden Conditional Random Fields for Bioinformatics Applications. Algorithms for Molecular Biology. 2009, 4 (13):","DOI":"10.1186\/1748-7188-4-13"},{"key":"5584_CR25","doi-asserted-by":"crossref","unstructured":"Casbon J, Saqi M: Analysis of superfamily specific profile-profile recognition accuracy. BMC Bioinformatics. 2004, 5 (200):","DOI":"10.1186\/1471-2105-5-200"},{"issue":"15","key":"5584_CR26","doi-asserted-by":"publisher","first-page":"4207","DOI":"10.1021\/bi992922o","volume":"39","author":"WJ Wedemeyer","year":"2000","unstructured":"Wedemeyer WJ, Welker E, Narayan M, Scheraga HA: Disulfide bonds and protein folding. Biochemistry. 2000, 39 (15): 4207-4216. 10.1021\/bi992922o.","journal-title":"Biochemistry"},{"key":"5584_CR27","doi-asserted-by":"publisher","first-page":"363","DOI":"10.1146\/annurev.biochem.77.062906.171838","volume":"77","author":"R Das","year":"2008","unstructured":"Das R, Baker D: Macromolecular modeling with rosetta. Annu Rev Biochem. 2008, 77: 363-382. 10.1146\/annurev.biochem.77.062906.171838.","journal-title":"Annu Rev Biochem"},{"key":"5584_CR28","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1006\/jmbi.1993.1626","volume":"234","author":"A Sali","year":"1993","unstructured":"Sali A, Blundell TL: Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993, 234: 779-815. 
10.1006\/jmbi.1993.1626.","journal-title":"J Mol Biol"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-14-S1-S10.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1471-2105-14-S1-S10\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-14-S1-S10.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T21:11:54Z","timestamp":1630530714000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-14-S1-S10"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,1]]},"references-count":28,"journal-issue":{"issue":"S1","published-print":{"date-parts":[[2013,1]]}},"alternative-id":["5584"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-14-s1-s10","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,1]]},"assertion":[{"value":"14 January 2013","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S10"}}