{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,11]],"date-time":"2025-11-11T22:14:56Z","timestamp":1762899296352,"version":"3.37.3"},"reference-count":56,"publisher":"Oxford University Press (OUP)","issue":"17","license":[{"start":{"date-parts":[[2016,11,10]],"date-time":"2016-11-10T00:00:00Z","timestamp":1478736000000},"content-version":"vor","delay-in-days":73,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["DBI-0845196"],"award-info":[{"award-number":["DBI-0845196"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Due to their high genomic variability, RNA viruses and retroviruses present a unique opportunity for detailed study of molecular evolution. Lentiviruses, with HIV being a notable example, are one of the best studied viral groups: hundreds of thousands of sequences are available together with experimentally resolved three-dimensional structures for most viral proteins. In this work, we use these data to study specific patterns of evolution of the viral proteins, and their relationship to protein interactions and immunogenicity.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We propose a method for identification of two types of surface residues clusters with abnormal conservation: extremely conserved and extremely variable clusters. We identify them on the surface of proteins from HIV and other animal immunodeficiency viruses. Both types of clusters are overrepresented on the interaction interfaces of viral proteins with other proteins, nucleic acids or low molecular-weight ligands, both in the viral particle and between the virus and its host. In the immunodeficiency viruses, the interaction interfaces are not more conserved than the corresponding proteins on an average, and we show that extremely conserved clusters coincide with protein\u2013protein interaction hotspots, predicted as the residues with the largest energetic contribution to the interaction. Extremely variable clusters have been identified here for the first time. In the HIV-1 envelope protein gp120, they overlap with known antigenic sites. These antigenic sites also contain many residues from extremely conserved clusters, hence representing a unique interacting interface enriched both in extremely conserved and in extremely variable clusters of residues. This observation may have important implication for antiretroviral vaccine development.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and Implementation<\/jats:title>\n                  <jats:p>A Python package is available at https:\/\/bioinf.mpi-inf.mpg.de\/publications\/viral-ppi-pred\/<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Contact<\/jats:title>\n                  <jats:p>voitenko@mpi-inf.mpg.de or kalinina@mpi-inf.mpg.de<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btw441","type":"journal-article","created":{"date-parts":[[2016,9,1]],"date-time":"2016-09-01T07:53:39Z","timestamp":1472716419000},"page":"i685-i692","source":"Crossref","is-referenced-by-count":7,"title":["Patterns of amino acid conservation in human and animal immunodeficiency viruses"],"prefix":"10.1093","volume":"32","author":[{"given":"Olga S","family":"Voitenko","sequence":"first","affiliation":[{"name":"Department for Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Campus E1 4, Saarbr\u00fccken 66123, Germany,"},{"name":"Graduate School for Computer Science, Saarland University, Campus E1 3, Saarbr\u00fccken 66123, Germany,"}]},{"given":"Andi","family":"Dhroso","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA"}]},{"given":"Anna","family":"Feldmann","sequence":"additional","affiliation":[{"name":"Department for Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Campus E1 4, Saarbr\u00fccken 66123, Germany,"},{"name":"Graduate School for Computer Science, Saarland University, Campus E1 3, Saarbr\u00fccken 66123, Germany,"}]},{"given":"Dmitry","family":"Korkin","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA"}]},{"given":"Olga V","family":"Kalinina","sequence":"additional","affiliation":[{"name":"Department for Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Campus E1 4, Saarbr\u00fccken 66123, Germany,"}]}],"member":"286","published-online":{"date-parts":[[2016,8,29]]},"reference":[{"key":"2023020113283038700_btw441-B1","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/S0022-2836(02)01036-7","article-title":"Analysis of catalytic residues in enzyme active sites","volume":"324","author":"Bartlett","year":"2002","journal-title":"J. Mol. Biol"},{"key":"2023020113283038700_btw441-B2","doi-asserted-by":"crossref","first-page":"232.","DOI":"10.3389\/fmicb.2014.00232","article-title":"The activity of Nef on HIV-1 infectivity","volume":"5","author":"Basmaciogullari","year":"2014","journal-title":"Front. Microbiol"},{"key":"2023020113283038700_btw441-B3","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The protein data bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023020113283038700_btw441-B4","doi-asserted-by":"crossref","first-page":"1487","DOI":"10.1093\/bioinformatics\/bti242","article-title":"Improved prediction of protein\u2013protein binding sites using a support vector machines approach","volume":"21","author":"Bradford","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020113283038700_btw441-B5","doi-asserted-by":"crossref","first-page":"D204","DOI":"10.1093\/nar\/gku989","article-title":"UniProt: a hub for protein information","volume":"43","author":"Consortium","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023020113283038700_btw441-B6","doi-asserted-by":"crossref","first-page":"e1002251.","DOI":"10.1371\/journal.pbio.1002251","article-title":"Extremely high mutation rate of HIV-1 in vivo","volume":"13","author":"Cuevas","year":"2015","journal-title":"PLoS Biol"},{"key":"2023020113283038700_btw441-B7","doi-asserted-by":"crossref","first-page":"426.","DOI":"10.1186\/1471-2105-10-426","article-title":"Prediction of protein\u2013protein interaction sites using an ensemble method","volume":"10","author":"Deng","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023020113283038700_btw441-B8","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1016\/j.tibs.2011.01.002","article-title":"Analyzing and visualizing residue networks of protein structures","volume":"36","author":"Doncheva","year":"2011","journal-title":"Trends Biochem. Sci"},{"key":"2023020113283038700_btw441-B9","doi-asserted-by":"crossref","first-page":"1667","DOI":"10.1093\/genetics\/148.4.1667","article-title":"Rates of spontaneous mutation","volume":"148","author":"Drake","year":"1998","journal-title":"Genetics"},{"key":"2023020113283038700_btw441-B10","doi-asserted-by":"crossref","first-page":"755","DOI":"10.1093\/bioinformatics\/14.9.755","article-title":"Profile hidden Markov models","volume":"14","author":"Eddy","year":"1998","journal-title":"Bioinformatics"},{"key":"2023020113283038700_btw441-B11","first-page":"226","volume-title":"A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise","author":"Ester","year":"1996"},{"key":"2023020113283038700_btw441-B12","doi-asserted-by":"crossref","first-page":"3150","DOI":"10.1093\/bioinformatics\/bts565","article-title":"CD-HIT: accelerated for clustering the next-generation sequencing data","volume":"28","author":"Fu","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020113283038700_btw441-B13","doi-asserted-by":"crossref","first-page":"2301","DOI":"10.1110\/ps.22901","article-title":"Role of conserved residues in structure and stability: tryptophans of human serum retinol-binding protein, a model for the lipocalin superfamily","volume":"10","author":"Greene","year":"2001","journal-title":"Protein Sci"},{"key":"2023020113283038700_btw441-B14","doi-asserted-by":"crossref","first-page":"15447","DOI":"10.1073\/pnas.0505425102","article-title":"Conservation and relative importance of residues across protein-protein interfaces","volume":"102","author":"Guharoy","year":"2005","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020113283038700_btw441-B15","doi-asserted-by":"crossref","first-page":"286.","DOI":"10.1186\/1471-2105-11-286","article-title":"Conserved residue clusters at protein-protein interfaces and their use in binding site identification","volume":"11","author":"Guharoy","year":"2010","journal-title":"BMC Bioinformatics"},{"volume-title":"NACCESS. Computer Program, Department of Biochemistry and Molecular Biology","year":"1993","author":"Hubbard","key":"2023020113283038700_btw441-B16"},{"key":"2023020113283038700_btw441-B17","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1038\/nature10719","article-title":"Global landscape of HIV\u2013human protein complexes","volume":"481","author":"J\u00e4ger","year":"2011","journal-title":"Nature"},{"key":"2023020113283038700_btw441-B18","first-page":"125","article-title":"Update of the drug resistance mutations in HIV-1: Fall 2006","volume":"14","author":"Johnson","year":"2006","journal-title":"Top. HIV Med"},{"key":"2023020113283038700_btw441-B19","doi-asserted-by":"crossref","first-page":"772","DOI":"10.1093\/molbev\/mst010","article-title":"MAFFT multiple sequence alignment software version 7: improvements in performance and usability","volume":"30","author":"Katoh","year":"2013","journal-title":"Mol. Biol. Evol"},{"key":"2023020113283038700_btw441-B20","doi-asserted-by":"crossref","first-page":"1281","DOI":"10.1016\/j.jmb.2004.10.077","article-title":"Hot regions in protein-protein interactions: the organization and contribution of structurally conserved hot spot residues","volume":"345","author":"Keskin","year":"2005","journal-title":"J. Mol. Biol"},{"key":"2023020113283038700_btw441-B21","doi-asserted-by":"crossref","first-page":"W526","DOI":"10.1093\/nar\/gkh468","article-title":"Protein structure prediction and analysis using the Robetta server","volume":"32","author":"Kim","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023020113283038700_btw441-B22","doi-asserted-by":"crossref","first-page":"2350","DOI":"10.1110\/ps.051571905","article-title":"Localization of protein-binding sites within families of proteins","volume":"14","author":"Korkin","year":"2005","journal-title":"Protein Sci"},{"key":"2023020113283038700_btw441-B23","doi-asserted-by":"crossref","first-page":"10864","DOI":"10.1073\/pnas.93.20.10864","article-title":"Accurate reconstruction of a known HIV-1 transmission history by phylogenetic tree analysis","volume":"93","author":"Leitner","year":"1996","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020113283038700_btw441-B24","doi-asserted-by":"crossref","first-page":"11981","DOI":"10.1128\/JVI.79.18.11981-11989.2005","article-title":"Molecular footprint of drug-selective pressure in a human immunodeficiency virus transmission chain","volume":"79","author":"Lemey","year":"2005","journal-title":"J. Virol"},{"key":"2023020113283038700_btw441-B25","doi-asserted-by":"crossref","first-page":"619","DOI":"10.1007\/s10822-014-9746-y","article-title":"A functional feature analysis on diverse protein\u2013protein interactions: application for the prediction of binding affinity","volume":"28","author":"Luo","year":"2014","journal-title":"J. Comput. Aided Mol. Des"},{"key":"2023020113283038700_btw441-B26","doi-asserted-by":"crossref","first-page":"S11.","DOI":"10.1186\/1477-5956-11-S1-S11","article-title":"The role of electrostatic energy in prediction of obligate protein\u2013protein interactions","volume":"11","author":"Maleki","year":"2013","journal-title":"Proteome Sci"},{"key":"2023020113283038700_btw441-B27","doi-asserted-by":"crossref","first-page":"336","DOI":"10.1038\/nature10696","article-title":"Structure of HIV-1 gp120 V1\/V2 domain with broadly neutralizing antibody PG9","volume":"480","author":"McLellan","year":"2011","journal-title":"Nature"},{"key":"2023020113283038700_btw441-B28","doi-asserted-by":"crossref","first-page":"1255","DOI":"10.2174\/138161212799436412","article-title":"Computational prediction of hot spot residues","volume":"18","author":"Morrow","year":"2012","journal-title":"Curr. Pharm. Des"},{"key":"2023020113283038700_btw441-B29","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1016\/j.jmb.2004.02.040","article-title":"ProMate: a structure based prediction program to identify the location of protein\u2013protein binding sites","volume":"338","author":"Neuvirth","year":"2004","journal-title":"J. Mol. Biol"},{"key":"2023020113283038700_btw441-B30","doi-asserted-by":"crossref","first-page":"486","DOI":"10.1038\/nature11289","article-title":"Viral immune modulators perturb the human molecular network by common and unique strategies","volume":"487","author":"Pichlmair","year":"2012","journal-title":"Nature"},{"key":"2023020113283038700_btw441-B31","doi-asserted-by":"crossref","first-page":"630","DOI":"10.1002\/prot.21248","article-title":"Prediction-based fingerprints of protein-protein interactions","volume":"66","author":"Porollo","year":"2007","journal-title":"Proteins: Struct. Funct. Bioinformatics"},{"key":"2023020113283038700_btw441-B32","doi-asserted-by":"crossref","first-page":"358","DOI":"10.1002\/(SICI)1097-0134(19981115)33:3<358::AID-PROT5>3.0.CO;2-0","article-title":"Sequence and structure conservation in a protein core","volume":"33","author":"Rodionov","year":"1998","journal-title":"Proteins: Struct. Funct. Bioinformatics"},{"key":"2023020113283038700_btw441-B33","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1093\/bioinformatics\/btn141","article-title":"Selecting anti-HIV therapies based on a variety of genomic and clinical factors","volume":"24","author":"Rosen-Zvi","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020113283038700_btw441-B34","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1002\/prot.340200303","article-title":"Conservation and prediction of solvent accessibility in protein families","volume":"20","author":"Rost","year":"1994","journal-title":"Proteins: Struct. Funct. Bioinformatics"},{"key":"2023020113283038700_btw441-B35","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"Silhouettes: a graphical aid to the interpretation and validation of cluster analysis","volume":"20","author":"Rousseeuw","year":"1987","journal-title":"Comput. Appl. Math"},{"key":"2023020113283038700_btw441-B36","doi-asserted-by":"crossref","first-page":"311","DOI":"10.2174\/1568011023354191","article-title":"Protein\u2013protein interactions: lessons learned","volume":"2","author":"Sharma","year":"2002","journal-title":"Curr. Med. Chem. Anticancer Agents"},{"key":"2023020113283038700_btw441-B37","doi-asserted-by":"crossref","first-page":"3240","DOI":"10.1093\/nar\/28.17.3240","article-title":"Functional significance of conserved residues in the phosphohydrolase module of Escherichia coli MutT protein","volume":"28","author":"Shimokawa","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023020113283038700_btw441-B38","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1038\/nrg3905","article-title":"Evolutionary insights into host\u2013pathogen interactions from mammalian sequence data","volume":"16","author":"Sirone","year":"2015","journal-title":"Nat. Rev. Genet"},{"key":"2023020113283038700_btw441-B39","doi-asserted-by":"crossref","first-page":"4059","DOI":"10.1007\/s00894-013-1886-9","article-title":"PPIcons: identification of protein\u2013protein interaction sites in selected organisms","volume":"19","author":"Sriwastava","year":"2013","journal-title":"J. Mol. Model"},{"volume-title":"Introduction to Data Mining","year":"2005","author":"Tan","key":"2023020113283038700_btw441-B40"},{"key":"2023020113283038700_btw441-B41","doi-asserted-by":"crossref","first-page":"4876","DOI":"10.1093\/nar\/25.24.4876","article-title":"The CLUSTALX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools","volume":"25","author":"Thompson","year":"1997","journal-title":"Nucleic Acids Res"},{"key":"2023020113283038700_btw441-B42","doi-asserted-by":"crossref","first-page":"727","DOI":"10.1038\/ismej.2013.215","article-title":"Host immune responses accelerate pathogen evolution","volume":"8","author":"Trivedi","year":"2014","journal-title":"ISME J"},{"key":"2023020113283038700_btw441-B43","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1111\/j.0006-341X.2005.031032.x","article-title":"Tight clustering: a resampling-based approach for identifying stable and tight patterns in data","volume":"61","author":"Tseng","year":"2005","journal-title":"Biometrics"},{"key":"2023020113283038700_btw441-B44","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1002\/prot.10146","article-title":"Scoring residue conservation","volume":"48","author":"Valdar","year":"2002","journal-title":"Proteins: Struct. Funct. Bioinformatics"},{"key":"2023020113283038700_btw441-B45","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1002\/1097-0134(20010101)42:1<108::AID-PROT110>3.0.CO;2-O","article-title":"Protein\u2013protein interfaces: analysis of amino acid conservation in homodimers","volume":"42","author":"Valdar","year":"2001","journal-title":"Proteins: Struct. Funct. Bioinformatics"},{"key":"2023020113283038700_btw441-B46","doi-asserted-by":"crossref","first-page":"728","DOI":"10.1128\/CMR.00009-06","article-title":"Going wild: lessons from naturally occurring T-lymphotropic lentiviruses","volume":"19","author":"VandeWoude","year":"2006","journal-title":"Clin. Microbiol. Rev"},{"key":"2023020113283038700_btw441-B47","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1002\/prot.20842","article-title":"WHISCY: what information does surface conservation yield? Application to data-driven docking","volume":"63","author":"Vries","year":"2006","journal-title":"Proteins: Struct. Funct. Bioinformatics"},{"key":"2023020113283038700_btw441-B48","doi-asserted-by":"crossref","first-page":"466","DOI":"10.1038\/nature10373","article-title":"Broad neutralization coverage of HIV by multiple highly potent antibodies","volume":"477","author":"Walker","year":"2011","journal-title":"Nature"},{"key":"2023020113283038700_btw441-B49","doi-asserted-by":"crossref","first-page":"380","DOI":"10.1016\/j.febslet.2005.11.081","article-title":"Predicting protein interaction sites from residue spatial sequence profile and evolution rate","volume":"580","author":"Wang","year":"2006","journal-title":"Fed Eur. Biochem. Soc. Lett"},{"key":"2023020113283038700_btw441-B50","doi-asserted-by":"crossref","first-page":"e81027.","DOI":"10.1371\/journal.pone.0081027","article-title":"Extreme evolutionary conservation of functionally important regions in H1N1 Influenza proteome","volume":"8","author":"Warren","year":"2013","journal-title":"PLoS One"},{"key":"2023020113283038700_btw441-B51","doi-asserted-by":"crossref","first-page":"10598","DOI":"10.1073\/pnas.1309215110","article-title":"Computational analysis of anti-HIV-1 antibody neutralization panel data to identify potential functional epitope residues","volume":"110","author":"West","year":"2013","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020113283038700_btw441-B52","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1038\/ng1202-569","article-title":"Biological and biomedical implications of the co-evolution of pathogens and their hosts","volume":"32","author":"Woolhouse","year":"2002","journal-title":"Nat. Genet"},{"key":"2023020113283038700_btw441-B53","doi-asserted-by":"crossref","first-page":"1593","DOI":"10.1126\/science.1207532","article-title":"Focused evolution of HIV-1 neutralizing antibodies revealed by structures and deep sequencing","volume":"333","author":"Wu","year":"2011","journal-title":"Science"},{"key":"2023020113283038700_btw441-B54","doi-asserted-by":"crossref","first-page":"10896","DOI":"10.1073\/pnas.1005894107","article-title":"Protein interface conservation across structure space","volume":"107","author":"Zhang","year":"2010","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020113283038700_btw441-B55","doi-asserted-by":"crossref","first-page":"811","DOI":"10.1126\/science.1192819","article-title":"Structural basis for broad and potent neutralization of HIV-1 by antibody VRC01","volume":"329","author":"Zhou","year":"2010","journal-title":"Science"},{"key":"2023020113283038700_btw441-B56","doi-asserted-by":"crossref","first-page":"1745","DOI":"10.1099\/0022-1317-77-8-1745","article-title":"Mutational analysis of the influenza virus A\/Victoria\/3\/75\u2009PA protein: studies of interaction with PB1 protein and identification of a dominant negative mutant","volume":"77","author":"Z\u00fcrcher","year":"1996","journal-title":"J. Gen. Virol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/17\/i685\/49023357\/bioinformatics_32_17_i685.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/17\/i685\/49023357\/bioinformatics_32_17_i685.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T23:31:55Z","timestamp":1675294315000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/17\/i685\/2450771"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,8,29]]},"references-count":56,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2016,9,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw441","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2016,9,1]]},"published":{"date-parts":[[2016,8,29]]}}}