{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T11:11:43Z","timestamp":1772190703108,"version":"3.50.1"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"21","funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["30960090"],"award-info":[{"award-number":["30960090"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["31260203"],"award-info":[{"award-number":["31260203"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["11501306"],"award-info":[{"award-number":["11501306"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["GM083107"],"award-info":[{"award-number":["GM083107"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["GM116960"],"award-info":[{"award-number":["GM116960"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: More than half of proteins require binding of metal and acid radical ions for their structure and function. Identification of the ion-binding locations is important for understanding the biological functions of proteins. 
Due to the small size and high versatility of the metal and acid radical ions, however, computational prediction of their binding sites remains difficult.<\/jats:p>\n               <jats:p>Results: We proposed a new ligand-specific approach devoted to the binding site prediction of the 13 ion ligands that are most frequently seen in protein databases: nine metal ions (Zn2+, Cu2+, Fe2+, Fe3+, Ca2+, Mg2+, Mn2+, Na+, K+) and four acid radical ions (CO3 2\u2212, NO2\u2212, SO4 2\u2212, PO4 3\u2212). A sequence-based ab initio model is first trained on sequence profiles, where a modified AdaBoost algorithm is extended to balance binding and non-binding residue samples. A composite method, IonCom, is then developed to combine the ab initio model with multiple threading alignments to further improve the robustness of the binding site predictions. The pipeline was tested using 5-fold cross-validation on a comprehensive set of 2,100 non-redundant proteins bound with 3,075 small ion ligands. A significant advantage was demonstrated over state-of-the-art ligand-binding methods, including COACH and TargetS, for high-accuracy ion-binding site identification. Detailed data analyses show that the major advantage of IonCom lies in the integration of complementary ab initio and template-based components. 
Ion-specific feature design and binding library selection also contribute to the improvement of small ion ligand binding predictions.<\/jats:p>\n               <jats:p>Availability and Implementation : http:\/\/zhanglab.ccmb.med.umich.edu\/IonCom<\/jats:p>\n               <jats:p>Contact: \u00a0hxz@imut.edu.cn or zhng@umich.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btw396","type":"journal-article","created":{"date-parts":[[2016,7,5]],"date-time":"2016-07-05T01:59:34Z","timestamp":1467683974000},"page":"3260-3269","source":"Crossref","is-referenced-by-count":128,"title":["Recognizing metal and acid radical ion-binding sites by integrating \n            <i>ab initio<\/i>\n             modeling with template-based transferals"],"prefix":"10.1093","volume":"32","author":[{"given":"Xiuzhen","family":"Hu","sequence":"first","affiliation":[{"name":"1 Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA"},{"name":"2 College of Sciences, Inner Mongolia University of Technology, Hohhot 010051, China"}]},{"given":"Qiwen","family":"Dong","sequence":"additional","affiliation":[{"name":"1 Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA"},{"name":"3 Institute for Data Science and Engineering, East China Normal University, Shanghai 200062, China"}]},{"given":"Jianyi","family":"Yang","sequence":"additional","affiliation":[{"name":"1 Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA"},{"name":"4 School of Mathematical Sciences, Nankai University, Tianjin 300071, China"}]},{"given":"Yang","family":"Zhang","sequence":"additional","affiliation":[{"name":"1 Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, 
USA"},{"name":"5 Department of Biological Chemistry, University of Michigan, Ann Arbor, Michigan 48109, USA"}]}],"member":"286","published-online":{"date-parts":[[2016,7,4]]},"reference":[{"key":"2023020201173045800_btw396-B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"key":"2023020201173045800_btw396-B2","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1146\/annurev.bb.19.060190.002201","article-title":"Zinc finger domains: hypotheses and current knowledge","volume":"19","author":"Berg","year":"1990","journal-title":"Annu. Rev. Biophys. Biophys. Chem"},{"key":"2023020201173045800_btw396-B3","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1073\/pnas.0707684105","article-title":"A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation","volume":"105","author":"Brylinski","year":"2008","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020201173045800_btw396-B4","doi-asserted-by":"crossref","first-page":"969","DOI":"10.1016\/S0021-9258(18)71184-8","article-title":"The enzymatic phosphorylation of proteins","volume":"211","author":"Burnett","year":"1954","journal-title":"J. Biol. Chem"},{"key":"2023020201173045800_btw396-B5","doi-asserted-by":"crossref","first-page":"e1000585","DOI":"10.1371\/journal.pcbi.1000585","article-title":"Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure","volume":"5","author":"Capra","year":"2009","journal-title":"PLoS Comput. 
Biol"},{"key":"2023020201173045800_btw396-B6","doi-asserted-by":"crossref","first-page":"1875","DOI":"10.1093\/bioinformatics\/btm270","article-title":"Predicting functionally important residues from sequence conservation","volume":"23","author":"Capra","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020201173045800_btw396-B7","doi-asserted-by":"crossref","first-page":"463","DOI":"10.1006\/jmbi.1993.1599","article-title":"Anion-binding sites in protein structures","volume":"234","author":"Chakrabarti","year":"1993","journal-title":"J. Mol. Biol"},{"key":"2023020201173045800_btw396-B8","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1145\/1961189.1961199","article-title":"LIBSVM. A library for support vector machines","volume":"2","author":"Chang","year":"2011","journal-title":". ACM Trans. Intel. Syst. Technol"},{"key":"2023020201173045800_btw396-B9","doi-asserted-by":"crossref","DOI":"10.1109\/TCBB.2015.2505286","article-title":"A sequence-based dynamic ensemble learning system for protein ligand-binding site prediction","author":"Chen","year":"2016","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2023020201173045800_btw396-B10","doi-asserted-by":"crossref","first-page":"S4.","DOI":"10.1186\/1471-2105-15-S15-S4","article-title":"LigandRFs: random forest ensemble to identify ligand-binding residues from sequence information alone","volume":"15 (Suppl 15)","author":"Chen","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"2023020201173045800_btw396-B11","first-page":"1","volume-title":"Multiple Classifier Systems","author":"Dietterich","year":"2001"},{"key":"2023020201173045800_btw396-B12","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1006\/jcss.1997.1504","article-title":"A decision-theoretic generalization of on-line learning and an application to boosting","volume":"55","author":"Freund","year":"1997","journal-title":"J. Comp. Syst. 
Sci"},{"key":"2023020201173045800_btw396-B13","doi-asserted-by":"crossref","first-page":"bat031","DOI":"10.1093\/database\/bat031","article-title":"The Protein Model Portal\u2013a comprehensive resource for protein structure and model information","volume":"2013","author":"Haas","year":"2013","journal-title":"Database (Oxford)"},{"key":"2023020201173045800_btw396-B14","doi-asserted-by":"crossref","first-page":"359-363","DOI":"10.1016\/S1093-3263(98)00002-3","article-title":"LIGSITE: automatic and efficient detection of potential small molecule-binding sites in proteins","volume":"15","author":"Hendlich","year":"1997","journal-title":"J. Mol. Graph. Model"},{"key":"2023020201173045800_btw396-B15","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1016\/0022-2836(94)90032-9","article-title":"Position-based sequence weights","volume":"243","author":"Henikoff","year":"1994","journal-title":"J. Mol. Biol"},{"key":"2023020201173045800_btw396-B16","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1056\/NEJM199801223380407","article-title":"Respiratory function of hemoglobin","volume":"338","author":"Hsia","year":"1998","journal-title":"N. Engl. J. Med"},{"key":"2023020201173045800_btw396-B17","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1016\/0263-7855(95)00073-9","article-title":"SURFNET: a program for visualizing molecular surfaces, cavities, and intermolecular interactions","volume":"13","author":"Laskowski","year":"1995","journal-title":"J. Mol. Graph"},{"key":"2023020201173045800_btw396-B18","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1109\/18.61115","article-title":"Divergence measures based on the shannon entropy","volume":"37","author":"Lin","year":"1991","journal-title":"IEEE Trans. Inform. 
Theory"},{"key":"2023020201173045800_btw396-B19","doi-asserted-by":"crossref","first-page":"240.","DOI":"10.1186\/1471-2105-6-240","article-title":"Sequence variation in ligand binding sites in proteins","volume":"6","author":"Magliery","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023020201173045800_btw396-B20","doi-asserted-by":"crossref","first-page":"1995","DOI":"10.1073\/pnas.0908044107","article-title":"Protein interactions and ligand binding: from protein subfamilies to functional specificity","volume":"107","author":"Rausell","year":"2010","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020201173045800_btw396-B21","doi-asserted-by":"crossref","first-page":"160.","DOI":"10.1186\/1471-2105-12-160","article-title":"FunFOLD: an improved automated method for the prediction of ligand binding residues using 3D models of proteins","volume":"12","author":"Roche","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023020201173045800_btw396-B22","doi-asserted-by":"crossref","first-page":"W471","DOI":"10.1093\/nar\/gks372","article-title":"COFACTOR: an accurate comparative algorithm for structure-based protein function annotation","volume":"40","author":"Roy","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023020201173045800_btw396-B23","doi-asserted-by":"crossref","first-page":"987","DOI":"10.1016\/j.str.2012.03.009","article-title":"Recognizing protein-ligand binding sites by global structural alignment and local geometry refinement","volume":"20","author":"Roy","year":"2012","journal-title":"Structure"},{"key":"2023020201173045800_btw396-B24","doi-asserted-by":"crossref","first-page":"126","DOI":"10.1002\/prot.23174","article-title":"Assessment of ligand-binding residue predictions in CASP9","volume":"79 Suppl 
10","author":"Schmidt","year":"2011","journal-title":"Proteins"},{"key":"2023020201173045800_btw396-B25","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1093\/bioinformatics\/15.4.327","article-title":"Automated analysis of interatomic contacts in proteins","volume":"15","author":"Sobolev","year":"1999","journal-title":"Bioinformatics"},{"key":"2023020201173045800_btw396-B26","doi-asserted-by":"crossref","first-page":"582","DOI":"10.1016\/0958-1669(91)90084-I","article-title":"Metal-binding sites in proteins","volume":"2","author":"Tainer","year":"1991","journal-title":"Curr. Opin. Biotechnol"},{"key":"2023020201173045800_btw396-B27","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1016\/S1367-5931(98)80056-2","article-title":"Bio-inorganic chemistry","volume":"2","author":"Thomson","year":"1998","journal-title":"Curr. Opin. Chem. Biol"},{"key":"2023020201173045800_btw396-B28","doi-asserted-by":"crossref","first-page":"W469","DOI":"10.1093\/nar\/gkq406","article-title":"3DLigandSite: predicting ligand-binding sites using similar structures","volume":"38","author":"Wass","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2023020201173045800_btw396-B29","doi-asserted-by":"crossref","first-page":"3375","DOI":"10.1093\/nar\/gkm251","article-title":"LOMETS: A local meta-threading-server for protein structure prediction","volume":"35","author":"Wu","year":"2007","journal-title":"Nucl. Acids. Res"},{"key":"2023020201173045800_btw396-B30","doi-asserted-by":"crossref","first-page":"e3400.","DOI":"10.1371\/journal.pone.0003400","article-title":"ANGLOR: a composite machine-learning algorithm for protein backbone torsion angle prediction","volume":"3","author":"Wu","year":"2008","journal-title":"PloS One"},{"key":"2023020201173045800_btw396-B31","doi-asserted-by":"crossref","first-page":"5648","DOI":"10.1073\/pnas.87.15.5648","article-title":"Where metal ions bind in proteins","volume":"87","author":"Yamashita","year":"1990","journal-title":"Proc. 
Natl. Acad. Sci. USA"},{"key":"2023020201173045800_btw396-B32","doi-asserted-by":"crossref","first-page":"2619","DOI":"10.1038\/srep02619","article-title":"A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction","volume":"3","author":"Yan","year":"2013","journal-title":"Sci. Rep"},{"key":"2023020201173045800_btw396-B33","doi-asserted-by":"crossref","first-page":"D1096","DOI":"10.1093\/nar\/gks966","article-title":"BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions","volume":"41","author":"Yang","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023020201173045800_btw396-B34","doi-asserted-by":"crossref","first-page":"2588","DOI":"10.1093\/bioinformatics\/btt447","article-title":"Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment","volume":"29","author":"Yang","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020201173045800_btw396-B35","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1038\/nmeth.3213","article-title":"The I-TASSER Suite: protein structure and function prediction","volume":"12","author":"Yang","year":"2015","journal-title":"Nature Methods"},{"key":"2023020201173045800_btw396-B36","doi-asserted-by":"crossref","first-page":"994","DOI":"10.1109\/TCBB.2013.104","article-title":"Designing template-free predictor for targeting protein-ligand binding sites with classifier ensemble and spatial clustering","volume":"10","author":"Yu","year":"2013","journal-title":"Comput. Biol. Bioinform. IEEE\/ACM Trans"},{"key":"2023020201173045800_btw396-B37","doi-asserted-by":"crossref","first-page":"2302","DOI":"10.1093\/nar\/gki524","article-title":"TM-align: a protein structure alignment algorithm based on the TM-score","volume":"33","author":"Zhang","year":"2005","journal-title":"Nucleic. 
Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/21\/3260\/49022377\/bioinformatics_32_21_3260.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/21\/3260\/49022377\/bioinformatics_32_21_3260.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T03:36:21Z","timestamp":1675308981000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/21\/3260\/2415108"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,7,4]]},"references-count":37,"journal-issue":{"issue":"21","published-print":{"date-parts":[[2016,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw396","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2016,11,1]]},"published":{"date-parts":[[2016,7,4]]}}}