{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T20:04:04Z","timestamp":1760731444921,"version":"3.37.3"},"reference-count":32,"publisher":"Springer Science and Business Media LLC","issue":"S3","license":[{"start":{"date-parts":[[2021,5,1]],"date-time":"2021-05-01T00:00:00Z","timestamp":1619827200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,5,17]],"date-time":"2021-05-17T00:00:00Z","timestamp":1621209600000},"content-version":"vor","delay-in-days":16,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2021,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>DNA-binding hot spots are dominant and fundamental residues that contribute most of the binding free energy yet accounting for a small portion of protein\u2013DNA interfaces. As experimental methods for identifying hot spots are time-consuming and costly, high-efficiency computational approaches are emerging as alternative pathways to experimental methods.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Herein, we present a new computational method, termed inpPDH, for hot spot prediction. To improve the prediction performance, we extract hybrid features which incorporate traditional features and new interfacial neighbor properties. To remove redundant and irrelevant features, feature selection is employed using a two-step feature selection strategy. Finally, a subset of 7 optimal features are chosen to construct the predictor using support vector machine. The results on the benchmark dataset show that this proposed method yields significantly better prediction accuracy than those previously published methods in the literature. Moreover, a user-friendly web server for inpPDH is well established and is freely available at<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"http:\/\/bioinfo.ahu.edu.cn\/inpPDH\">http:\/\/bioinfo.ahu.edu.cn\/inpPDH<\/jats:ext-link>.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>We have developed an accurate improved prediction model, inpPDH, for hot spot residues in protein\u2013DNA binding interfaces by given the structure of a protein\u2013DNA complex. Moreover, we identify a comprehensive and useful feature subset including the proposed interfacial neighbor features that has an important strength for identifying hot spot residues. Our results indicate that these features are more effective than the conventional features considered previously, and that the combination of interfacial neighbor features and traditional features may support the creation of a discriminative feature set for efficient prediction of hot spot residues in protein\u2013DNA complexes.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12859-020-03871-1","type":"journal-article","created":{"date-parts":[[2021,5,17]],"date-time":"2021-05-17T13:03:36Z","timestamp":1621256616000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties"],"prefix":"10.1186","volume":"22","author":[{"given":"Sijia","family":"Zhang","sequence":"first","affiliation":[]},{"given":"Lihua","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Le","family":"Zhao","sequence":"additional","affiliation":[]},{"given":"Menglu","family":"Li","sequence":"additional","affiliation":[]},{"given":"Mengya","family":"Liu","sequence":"additional","affiliation":[]},{"given":"Ke","family":"Li","sequence":"additional","affiliation":[]},{"given":"Yannan","family":"Bin","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3024-1705","authenticated-orcid":false,"given":"Junfeng","family":"Xia","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,5,17]]},"reference":[{"issue":"1","key":"3871_CR1","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1016\/0092-8674(87)90358-8","volume":"48","author":"KA Jones","year":"1987","unstructured":"Jones KA, Kadonaga JT, Rosenfeld PJ, Kelly TJ, Tjian R. A cellular DNA-binding protein that activates eukaryotic transcription and DNA replication. Cell. 1987;48(1):79\u201389.","journal-title":"Cell"},{"issue":"5196","key":"3871_CR2","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1126\/science.7529940","volume":"267","author":"T Clackson","year":"1995","unstructured":"Clackson T, Wells JA. A hot spot of binding energy in a hormone-receptor interface. Science. 1995;267(5196):383\u20136.","journal-title":"Science"},{"issue":"4","key":"3871_CR3","doi-asserted-by":"publisher","first-page":"803","DOI":"10.1002\/prot.21396","volume":"68","author":"IS Moreira","year":"2007","unstructured":"Moreira IS, Fernandes PA, Ramos MJ. Hot spots\u2014a review of the protein\u2013protein interface determinant amino-acid residues. Proteins Struct Funct Bioinform. 2007;68(4):803\u201312.","journal-title":"Proteins Struct Funct Bioinform"},{"issue":"1","key":"3871_CR4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1006\/jmbi.1998.1843","volume":"280","author":"AA Bogan","year":"1998","unstructured":"Bogan AA, Thorn KS. Anatomy of hot spots in protein interfaces. J Mol Biol. 1998;280(1):1\u20139.","journal-title":"J Mol Biol"},{"issue":"2","key":"3871_CR5","doi-asserted-by":"publisher","first-page":"422","DOI":"10.1109\/TCBB.2018.2846599","volume":"17","author":"J Xi","year":"2020","unstructured":"Xi J, Li A, Wang M. HetRCNA: a novel method to identify recurrent copy number alternations from heterogeneous tumor samples based on matrix decomposition framework. IEEE\/ACM Trans Comput Biol Bioinf. 2020;17(2):422\u201334.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"issue":"6","key":"3871_CR6","doi-asserted-by":"crossref","first-page":"1855","DOI":"10.1093\/bioinformatics\/btz793","volume":"36","author":"J Xi","year":"2020","unstructured":"Xi J, Yuan X, Wang M, Li A, Li X, Huang Q. Inferring subgroup-specific driver genes from heterogeneous cancer samples via subspace learning with subgroup indication. Bioinformatics. 2020;36(6):1855\u201363.","journal-title":"Bioinformatics"},{"key":"3871_CR7","doi-asserted-by":"crossref","unstructured":"Wells JA. Systematic mutational analyses of protein\u2013protein interfaces. Methods Enzymol. 1991;202:390\u2013411.","DOI":"10.1016\/0076-6879(91)02020-A"},{"issue":"5","key":"3871_CR8","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1093\/bioinformatics\/btx698","volume":"34","author":"Y Peng","year":"2018","unstructured":"Peng Y, Sun L, Jia Z, Li L, Alexov E. Predicting protein\u2013DNA binding free energy change upon missense mutations using modified MM\/PBSA approach: SAMPDI webserver. Bioinformatics. 2018;34(5):779\u201386.","journal-title":"Bioinformatics"},{"issue":"12","key":"3871_CR9","doi-asserted-by":"publisher","first-page":"e1006615","DOI":"10.1371\/journal.pcbi.1006615","volume":"14","author":"N Zhang","year":"2018","unstructured":"Zhang N, Chen Y, Zhao F, Yang Q, Simonetti FL, Li M. PremPDI estimates and interprets the effects of missense mutations on protein\u2013DNA interactions. PLoS Comput Biol. 2018;14(12):e1006615.","journal-title":"PLoS Comput Biol"},{"issue":"W1","key":"3871_CR10","doi-asserted-by":"publisher","first-page":"W241","DOI":"10.1093\/nar\/gkx236","volume":"45","author":"DE Pires","year":"2017","unstructured":"Pires DE, Ascher DB. mCSM\u2013NA: predicting the effects of mutations on protein\u2013nucleic acids interactions. Nucleic Acids Res. 2017;45(W1):W241\u20136.","journal-title":"Nucleic Acids Res"},{"issue":"3","key":"3871_CR11","doi-asserted-by":"publisher","first-page":"1038","DOI":"10.1093\/bib\/bbz037","volume":"21","author":"S Zhang","year":"2020","unstructured":"Zhang S, Zhao L, Zheng C-H, Xia J. A feature-based approach to predict hot spots in protein\u2013DNA binding interfaces. Brief Bioinform. 2020;21(3):1038\u201346.","journal-title":"Brief Bioinform"},{"issue":"9","key":"3871_CR12","doi-asserted-by":"publisher","first-page":"1473","DOI":"10.1093\/bioinformatics\/btx822","volume":"34","author":"Y Pan","year":"2017","unstructured":"Pan Y, Wang Z, Zhan W, Deng L. Computational identification of binding energy hot spots in protein\u2013RNA complexes using an ensemble approach. Bioinformatics. 2017;34(9):1473\u201380.","journal-title":"Bioinformatics"},{"issue":"1","key":"3871_CR13","doi-asserted-by":"publisher","first-page":"174","DOI":"10.1186\/1471-2105-11-174","volume":"11","author":"J-F Xia","year":"2010","unstructured":"Xia J-F, Zhao X-M, Song J, Huang D-S. APIS: accurate prediction of hot spots in protein interfaces by combining protrusion index with solvent accessibility. BMC Bioinform. 2010;11(1):174.","journal-title":"BMC Bioinform"},{"issue":"9","key":"3871_CR14","doi-asserted-by":"publisher","first-page":"2671","DOI":"10.1002\/prot.23094","volume":"79","author":"X Zhu","year":"2011","unstructured":"Zhu X, Mitchell JC. KFC2: a knowledge-based hot spot prediction method based on interface solvation, atomic density, and plasticity features. Proteins Struct Funct Bioinform. 2011;79(9):2671\u201383.","journal-title":"Proteins Struct Funct Bioinform"},{"issue":"14","key":"3871_CR15","doi-asserted-by":"publisher","first-page":"18065","DOI":"10.18632\/oncotarget.7695","volume":"7","author":"J Xia","year":"2016","unstructured":"Xia J, Yue Z, Di Y, Zhu X, Zheng C-H. Predicting hot spots in protein interfaces based on protrusion index, pseudo hydrophobicity and electron-ion interaction pseudopotential features. Oncotarget. 2016;7(14):18065.","journal-title":"Oncotarget"},{"key":"3871_CR16","doi-asserted-by":"publisher","DOI":"10.1093\/database\/bay034","author":"L Liu","year":"2018","unstructured":"Liu L, Xiong Y, Gao H, Wei D-Q, Mitchell JC, Zhu X. dbAMEPNI: a database of alanine mutagenic effects for protein\u2013nucleic acid interactions. Database. 2018. https:\/\/doi.org\/10.1093\/database\/bay034.","journal-title":"Database"},{"key":"3871_CR17","doi-asserted-by":"publisher","first-page":"223","DOI":"10.1007\/978-1-4939-7717-8_13","volume":"1754","author":"Y Xiong","year":"2018","unstructured":"Xiong Y, Zhu X, Dai H, Wei DQ. Survey of computational approaches for prediction of DNA-binding residues on protein surfaces. Methods Mol Biol. 2018;1754:223\u201334.","journal-title":"Methods Mol Biol"},{"key":"3871_CR18","unstructured":"Hubbard S. NACCESS: program for calculating accessibilities. Department of Biochemistry and Molecular Biology, University College of London; 1992. http:\/\/www.bioinf.manchester.ac.uk\/naccess."},{"issue":"6","key":"3871_CR19","doi-asserted-by":"publisher","first-page":"1419","DOI":"10.1007\/s00726-014-1710-6","volume":"46","author":"W Yan","year":"2014","unstructured":"Yan W, Zhou J, Sun M, Chen J, Hu G, Shen B. The construction of an amino acid network for understanding protein structure and function. Amino Acids. 2014;46(6):1419\u201339.","journal-title":"Amino Acids"},{"issue":"W1","key":"3871_CR20","doi-asserted-by":"publisher","first-page":"W375","DOI":"10.1093\/nar\/gkw383","volume":"44","author":"B Chakrabarty","year":"2016","unstructured":"Chakrabarty B, Parekh N. NAPS: Network analysis of protein structures. Nucleic Acids Res. 2016;44(W1):W375\u201382.","journal-title":"Nucleic Acids Res"},{"issue":"12","key":"3871_CR21","doi-asserted-by":"crossref","first-page":"2577","DOI":"10.1002\/bip.360221211","volume":"22","author":"W Kabsch","year":"1983","unstructured":"Kabsch W, Sander C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolym Orig Res Biomol. 1983;22(12):2577\u2013637.","journal-title":"Biopolym Orig Res Biomol"},{"issue":"18","key":"3871_CR22","doi-asserted-by":"publisher","first-page":"2842","DOI":"10.1093\/bioinformatics\/btx218","volume":"33","author":"R Heffernan","year":"2017","unstructured":"Heffernan R, Yang Y, Paliwal K, Zhou Y. Capturing non-local interactions by long short-term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers and solvent accessibility. Bioinformatics. 2017;33(18):2842\u20139.","journal-title":"Bioinformatics"},{"issue":"5","key":"3871_CR23","doi-asserted-by":"publisher","first-page":"777","DOI":"10.1006\/jmbi.1994.1334","volume":"238","author":"IK McDonald","year":"1994","unstructured":"McDonald IK, Thornton JM. Satisfying hydrogen bonding potential in proteins. J Mol Biol. 1994;238(5):777\u201393.","journal-title":"J Mol Biol"},{"issue":"1\u20133","key":"3871_CR24","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1023\/A:1012487302797","volume":"46","author":"I Guyon","year":"2002","unstructured":"Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn. 2002;46(1\u20133):389\u2013422.","journal-title":"Mach Learn"},{"issue":"3","key":"3871_CR25","doi-asserted-by":"publisher","first-page":"970","DOI":"10.1093\/bib\/bbz047","volume":"21","author":"N Cheng","year":"2020","unstructured":"Cheng N, Li M, Zhao L, Zhang B, Yang Y, Zheng C-H, Xia J. Comparison and integration of computational methods for deleterious synonymous mutation prediction. Brief Bioinform. 2020;21(3):970\u201381.","journal-title":"Brief Bioinform"},{"issue":"11","key":"3871_CR26","doi-asserted-by":"publisher","first-page":"1793","DOI":"10.1016\/j.asr.2008.02.012","volume":"41","author":"M Chi","year":"2008","unstructured":"Chi M, Feng R, Bruzzone L. Classification of hyperspectral remote-sensing data with primal SVM for small-sized training dataset problem. Adv Space Res. 2008;41(11):1793\u20139.","journal-title":"Adv Space Res"},{"issue":"3","key":"3871_CR27","first-page":"27","volume":"2","author":"C-C Chang","year":"2011","unstructured":"Chang C-C, Lin C-J. LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol (TIST). 2011;2(3):27.","journal-title":"ACM Trans Intell Syst Technol (TIST)"},{"issue":"5","key":"3871_CR28","doi-asserted-by":"publisher","first-page":"1595","DOI":"10.1007\/s00726-010-0588-1","volume":"39","author":"J-F Xia","year":"2010","unstructured":"Xia J-F, Zhao X-M, Huang D-S. Predicting protein\u2013protein interactions from protein sequences using meta predictor. Amino Acids. 2010;39(5):1595\u20139.","journal-title":"Amino Acids"},{"key":"3871_CR29","doi-asserted-by":"publisher","first-page":"2274","DOI":"10.3390\/ijms21072274","volume":"21","author":"A Deng","year":"2020","unstructured":"Deng A, Zhang H, Wang W, Zhang J, Fan D, Chen P, Wang B. Developing computational model to predict protein\u2013protein interaction sites based on the XGBoost algorithm. Int J Mol Sci. 2020;21:2274.","journal-title":"Int J Mol Sci"},{"key":"3871_CR30","doi-asserted-by":"publisher","DOI":"10.1109\/TCBB.2019.2953908","author":"B Wang","year":"2019","unstructured":"Wang B, Wang L, Zheng C, Xiong Y. Imbalance data processing strategy for protein interaction sites prediction. IEEE\/ACM Trans Comput Biol Bioinform. 2019. https:\/\/doi.org\/10.1109\/TCBB.2019.2953908.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"issue":"1","key":"3871_CR31","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1109\/TNB.2009.2035284","volume":"9","author":"PA Mundra","year":"2010","unstructured":"Mundra PA, Rajapakse JC. SVM-RFE with MRMR filter for gene selection. IEEE Trans Nanobiosci. 2010;9(1):31\u20137.","journal-title":"IEEE Trans Nanobiosci"},{"issue":"1","key":"3871_CR32","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1186\/s12920-018-0455-6","volume":"12","author":"F Shi","year":"2019","unstructured":"Shi F, Yao Y, Bin Y, Zheng C-H, Xia J. Computational identification of deleterious synonymous variants in human genomes using a feature-based approach. BMC Med Genomics. 2019;12(1):12.","journal-title":"BMC Med Genomics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-03871-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-020-03871-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-03871-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,3]],"date-time":"2023-11-03T20:28:00Z","timestamp":1699043280000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-020-03871-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5]]},"references-count":32,"journal-issue":{"issue":"S3","published-print":{"date-parts":[[2021,5]]}},"alternative-id":["3871"],"URL":"https:\/\/doi.org\/10.1186\/s12859-020-03871-1","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2021,5]]},"assertion":[{"value":"29 October 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 November 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 May 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"253"}}