{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,26]],"date-time":"2025-10-26T15:12:32Z","timestamp":1761491552092},"reference-count":52,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,4,4]],"date-time":"2023-04-04T00:00:00Z","timestamp":1680566400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,4,4]],"date-time":"2023-04-04T00:00:00Z","timestamp":1680566400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Open Fund of Information Materials and Intelligent Sensing Laboratory of Anhui Province","award":["IMIS202009"],"award-info":[{"award-number":["IMIS202009"]}]},{"name":"Anhui Agricultural University Introduction and Stabilization of Talents Research Funding","award":["yj2020-74"],"award-info":[{"award-number":["yj2020-74"]}]},{"name":"Natural Science Research Key Project of Colleges and Universities in Anhui Province","award":["KJ2021A0182"],"award-info":[{"award-number":["KJ2021A0182"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Identification of hot spots in protein\u2013DNA binding interfaces is extremely important for understanding the underlying mechanisms of protein\u2013DNA interactions and drug design. 
Experimental methods for identifying hot spots are time-consuming and expensive, and most existing computational methods predict hot spots from traditional protein\u2013DNA features without making full use of the information those features contain.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>In this work, a method named WTL-PDH is proposed for hot spot prediction. To deal with the unbalanced dataset, we used the Synthetic Minority Over-sampling Technique to generate minority-class samples and balance the dataset. First, we extracted solvent accessible surface area features and structural features, then processed these traditional features using the discrete wavelet transform and the wavelet packet transform to extract wavelet energy and wavelet entropy information, obtaining 175 features in total. To obtain the best feature subset, we systematically evaluated these features under various feature selection strategies. Finally, a light gradient boosting machine (LightGBM) was used to build the model.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>Our method achieved AUC, MCC and F1 scores of 0.838, 0.533 and 0.750, respectively, on the independent test set. WTL-PDH generally achieves better performance than state-of-the-art methods in predicting hot spots. 
The dataset and source code are available at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/chase2555\/WTL-PDH\">https:\/\/github.com\/chase2555\/WTL-PDH<\/jats:ext-link>.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-023-05263-7","type":"journal-article","created":{"date-parts":[[2023,4,4]],"date-time":"2023-04-04T13:04:09Z","timestamp":1680613449000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Prediction of hot spots in protein\u2013DNA binding interfaces based on discrete wavelet transform and wavelet packet transform"],"prefix":"10.1186","volume":"24","author":[{"given":"Yu","family":"Sun","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hongwei","family":"Wu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhengrong","family":"Xu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhenyu","family":"Yue","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ke","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2023,4,4]]},"reference":[{"issue":"1","key":"5263_CR1","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1016\/0092-8674(87)90358-8","volume":"48","author":"KA Jones","year":"1987","unstructured":"Jones KA, Kadonaga JT, Rosenfeld PJ, Kelly TJ, Tjian R. A cellular DNA-binding protein that activates eukaryotic transcription and DNA replication. Cell. 
1987;48(1):79\u201389.","journal-title":"Cell"},{"issue":"6","key":"5263_CR2","doi-asserted-by":"publisher","first-page":"3018","DOI":"10.1021\/acs.jproteome.1c00074","volume":"20","author":"F Cozzolino","year":"2021","unstructured":"Cozzolino F, Iacobucci I, Monaco V, Monti M. Protein\u2013DNA\/RNA interactions: an overview of investigation methods in the -omics era. J Proteome Res. 2021;20(6):3018\u201330.","journal-title":"J Proteome Res"},{"issue":"5196","key":"5263_CR3","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1126\/science.7529940","volume":"267","author":"T Clackson","year":"1995","unstructured":"Clackson T, Wells JA. A hot spot of binding energy in a hormone-receptor interface. Science (New York, NY). 1995;267(5196):383\u20136.","journal-title":"Science (New York, NY)"},{"issue":"4","key":"5263_CR4","doi-asserted-by":"publisher","first-page":"803","DOI":"10.1002\/prot.21396","volume":"68","author":"IS Moreira","year":"2007","unstructured":"Moreira IS, Fernandes PA, Ramos MJ. Hot spots\u2013a review of the protein\u2013protein interface determinant amino-acid residues. Proteins. 2007;68(4):803\u201312.","journal-title":"Proteins"},{"issue":"5","key":"5263_CR5","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1093\/bioinformatics\/btx698","volume":"34","author":"Y Peng","year":"2018","unstructured":"Peng Y, Sun L, Jia Z, Li L, Alexov E. Predicting protein\u2013DNA binding free energy change upon missense mutations using modified MM\/PBSA approach: SAMPDI webserver. Bioinformatics. 2018;34(5):779\u201386.","journal-title":"Bioinformatics"},{"issue":"12","key":"5263_CR6","doi-asserted-by":"publisher","first-page":"e1006615","DOI":"10.1371\/journal.pcbi.1006615","volume":"14","author":"N Zhang","year":"2018","unstructured":"Zhang N, Chen Y, Zhao F, Yang Q, Simonetti FL, Li M. PremPDI estimates and interprets the effects of missense mutations on protein\u2013DNA interactions. PLoS Comput Biol. 
2018;14(12):e1006615.","journal-title":"PLoS Comput Biol"},{"issue":"21","key":"5263_CR7","doi-asserted-by":"publisher","first-page":"3760","DOI":"10.1093\/bioinformatics\/btab567","volume":"37","author":"G Li","year":"2021","unstructured":"Li G, Panday SK, Peng Y, Alexov E. SAMPDI-3D: predicting the effects of protein and DNA mutations on protein\u2013DNA interactions. Bioinformatics. 2021;37(21):3760\u20135.","journal-title":"Bioinformatics"},{"issue":"W1","key":"5263_CR8","doi-asserted-by":"publisher","first-page":"W241","DOI":"10.1093\/nar\/gkx236","volume":"45","author":"DEV Pires","year":"2017","unstructured":"Pires DEV, Ascher DB. mCSM-NA: predicting the effects of mutations on protein-nucleic acids interactions. Nucleic Acids Res. 2017;45(W1):W241\u2013W246.","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"5263_CR9","doi-asserted-by":"publisher","first-page":"lqab109","DOI":"10.1093\/nargab\/lqab109","volume":"3","author":"TB Nguyen","year":"2021","unstructured":"Nguyen TB, Myung Y, de S\u00e1 AGC, Pires DEV, Ascher DB. mmCSM-NA: accurately predicting effects of single and multiple mutations on protein-nucleic acid binding affinity. NAR Genomics Bioinform. 2021;3(4):lqab109.","journal-title":"NAR Genomics Bioinform"},{"issue":"5","key":"5263_CR10","doi-asserted-by":"publisher","first-page":"bbaa373","DOI":"10.1093\/bib\/bbaa373","volume":"22","author":"LC Mei","year":"2021","unstructured":"Mei LC, Wang YL, Wu FX, Wang F, Hao GF, Yang GF. HISNAPI: a bioinformatic tool for dynamic hot spot analysis in nucleic acid-protein interface with a case study. Brief Bioinform. 2021;22(5):bbaa373.","journal-title":"Brief Bioinform"},{"issue":"3","key":"5263_CR11","doi-asserted-by":"publisher","first-page":"1038","DOI":"10.1093\/bib\/bbz037","volume":"21","author":"S Zhang","year":"2019","unstructured":"Zhang S, Zhao L, Zheng C-H, Xia J. A feature-based approach to predict hot spots in protein\u2013DNA binding interfaces. Brief Bioinform. 
2019;21(3):1038\u201346.","journal-title":"Brief Bioinform"},{"key":"5263_CR12","doi-asserted-by":"publisher","first-page":"19","DOI":"10.32614\/RJ-2015-018","volume":"7","author":"R Genuer","year":"2015","unstructured":"Genuer R, Poggi J-M, Tuleau-Malot C. VSURF: an R package for variable selection using random forests. R J. 2015;7:19\u201333.","journal-title":"R J"},{"issue":"4","key":"5263_CR13","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1109\/5254.708428","volume":"13","author":"MA Hearst","year":"1998","unstructured":"Hearst MA, Dumais ST, Osuna E, Platt J, Scholkopf B. Support vector machines. IEEE Intell Syst Appl. 1998;13(4):18\u201328.","journal-title":"IEEE Intell Syst Appl"},{"issue":"3","key":"5263_CR14","doi-asserted-by":"publisher","first-page":"253","DOI":"10.1186\/s12859-020-03871-1","volume":"22","author":"S Zhang","year":"2021","unstructured":"Zhang S, Wang L, Zhao L, Li M, Liu M, Li K, Bin Y, Xia J. An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties. BMC Bioinform. 2021;22(3):253.","journal-title":"BMC Bioinform"},{"issue":"Suppl 13","key":"5263_CR15","doi-asserted-by":"publisher","first-page":"381","DOI":"10.1186\/s12859-020-03683-3","volume":"21","author":"K Li","year":"2020","unstructured":"Li K, Zhang S, Yan D, Bin Y, Xia J. Prediction of hot spots in protein\u2013DNA binding interfaces based on supervised isometric feature mapping and extreme gradient boosting. BMC Bioinform. 2020;21(Suppl 13):381.","journal-title":"BMC Bioinform"},{"issue":"6","key":"5263_CR16","doi-asserted-by":"publisher","first-page":"1098","DOI":"10.1109\/TSMCB.2005.850151","volume":"35","author":"X Geng","year":"2005","unstructured":"Geng X, Zhan D-C, Zhou Z-H. Supervised nonlinear dimensionality reduction for visualization and classification. IEEE Trans Syst Man Cybern Part B (Cybern). 
2005;35(6):1098\u2013107.","journal-title":"IEEE Trans Syst Man Cybern Part B (Cybern)"},{"key":"5263_CR17","doi-asserted-by":"crossref","unstructured":"Chen T, Guestrin C: Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining: 2016. pp. 785\u2013794.","DOI":"10.1145\/2939672.2939785"},{"issue":"1","key":"5263_CR18","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s12539-020-00399-z","volume":"13","author":"L Yao","year":"2021","unstructured":"Yao L, Wang H, Bin Y. Predicting hot spot residues at protein\u2013DNA binding interfaces based on sequence information. Interdiscip Sci: Comput Life Sci. 2021;13(1):1\u201311.","journal-title":"Interdiscip Sci: Comput Life Sci"},{"issue":"13","key":"5263_CR19","doi-asserted-by":"publisher","first-page":"384","DOI":"10.1186\/s12859-020-03675-3","volume":"21","author":"Y Pan","year":"2020","unstructured":"Pan Y, Zhou S, Guan J. Computationally identifying hot spots in protein\u2013DNA binding interfaces using an ensemble approach. BMC Bioinform. 2020;21(13):384.","journal-title":"BMC Bioinform"},{"key":"5263_CR20","doi-asserted-by":"publisher","first-page":"e1008951","DOI":"10.1371\/journal.pcbi.1008951","volume":"17","author":"Y Jiang","year":"2021","unstructured":"Jiang Y, Liu H-F, Liu R. Systematic comparison and prediction of the effects of missense mutations on protein\u2013DNA and protein-RNA interactions. PLoS Comput Biol. 2021;17:e1008951.","journal-title":"PLoS Comput Biol"},{"key":"5263_CR21","doi-asserted-by":"publisher","first-page":"bay034","DOI":"10.1093\/database\/bay034","volume":"2018","author":"L Liu","year":"2018","unstructured":"Liu L, Xiong Y, Gao H, Wei DQ, Mitchell JC, Zhu X. dbAMEPNI: a database of alanine mutagenic effects for protein-nucleic acid interactions. Database: J Biol Databases Curation. 
2018;2018:bay034.","journal-title":"Database: J Biol Databases Curation"},{"key":"5263_CR22","doi-asserted-by":"publisher","first-page":"baab050","DOI":"10.1093\/database\/baab050","volume":"2021","author":"J Liu","year":"2021","unstructured":"Liu J, Liu S, Liu C, Zhang Y, Pan Y, Wang Z, Wang J, Wen T, Deng L. Nabe: an energetic database of amino acid mutations in protein\u2013nucleic acid binding interfaces. Database. 2021;2021:baab050.","journal-title":"Database"},{"issue":"D1","key":"5263_CR23","doi-asserted-by":"publisher","first-page":"D1528","DOI":"10.1093\/nar\/gkab848","volume":"50","author":"K Harini","year":"2022","unstructured":"Harini K, Srivastava A, Kulandaisamy A, Gromiha MM. ProNAB: database for binding affinities of protein-nucleic acid complexes and their mutants. Nucleic Acids Res. 2022;50(D1):D1528\u2013D1534.","journal-title":"Nucleic Acids Res"},{"key":"5263_CR24","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1613\/jair.953","volume":"16","author":"NV Chawla","year":"2002","unstructured":"Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321\u201357.","journal-title":"J Artif Intell Res"},{"key":"5263_CR25","unstructured":"Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu T-Y: LightGBM: a highly efficient gradient boosting decision tree. In: NIPS: 2017."},{"key":"5263_CR26","unstructured":"He H, Bai Y, Garcia EA, Li S: ADASYN: adaptive synthetic sampling approach for imbalanced learning. In: 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence): 2008. IEEE: pp. 1322\u20131328."},{"issue":"4","key":"5263_CR27","doi-asserted-by":"publisher","first-page":"366","DOI":"10.1038\/7603","volume":"6","author":"JM Wojciak","year":"1999","unstructured":"Wojciak JM, Connolly KM, Clubb RT. NMR structure of the Tn916 integrase\u2013DNA complex. Nat Struct Biol. 
1999;6(4):366\u201373.","journal-title":"Nat Struct Biol"},{"issue":"2","key":"5263_CR28","doi-asserted-by":"publisher","first-page":"198","DOI":"10.1016\/j.cell.2011.03.004","volume":"145","author":"SE Tsutakawa","year":"2011","unstructured":"Tsutakawa SE, Classen S, Chapados BR, Arvai AS, Finger LD, Guenther G, Tomlinson CG, Thompson P, Sarker AH, Shen B. Human flap endonuclease structures, DNA double-base flipping, and a unified understanding of the FEN1 superfamily. Cell. 2011;145(2):198\u2013211.","journal-title":"Cell"},{"issue":"13","key":"5263_CR29","doi-asserted-by":"publisher","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","volume":"22","author":"W Li","year":"2006","unstructured":"Li W, Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22(13):1658\u20139.","journal-title":"Bioinformatics"},{"key":"5263_CR30","unstructured":"Hubbard S, Thornton J: NACCESS: program for calculating accessibilities. Department of Biochemistry and Molecular Biology, University College of London; 1992."},{"issue":"12","key":"5263_CR31","doi-asserted-by":"publisher","first-page":"1513","DOI":"10.1093\/bioinformatics\/btp240","volume":"25","author":"N Tuncbag","year":"2009","unstructured":"Tuncbag N, Gursoy A, Keskin O. Identification of computational hot spots in protein interfaces: combining solvent accessibility and inter-residue potentials improves the accuracy. Bioinformatics. 2009;25(12):1513\u201320.","journal-title":"Bioinformatics"},{"issue":"1","key":"5263_CR32","doi-asserted-by":"publisher","first-page":"174","DOI":"10.1186\/1471-2105-11-174","volume":"11","author":"J-F Xia","year":"2010","unstructured":"Xia J-F, Zhao X-M, Song J, Huang D-S. APIS: accurate prediction of hot spots in protein interfaces by combining protrusion index with solvent accessibility. BMC Bioinform. 
2010;11(1):174.","journal-title":"BMC Bioinform"},{"issue":"12","key":"5263_CR33","doi-asserted-by":"publisher","first-page":"2577","DOI":"10.1002\/bip.360221211","volume":"22","author":"W Kabsch","year":"1983","unstructured":"Kabsch W, Sander C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 1983;22(12):2577\u2013637.","journal-title":"Biopolymers"},{"key":"5263_CR34","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1186\/1472-6807-8-21","volume":"8","author":"J Mihel","year":"2008","unstructured":"Mihel J, Sikic M, Tomi\u0107 S, Jeren B, Vlahovi\u010dek K. PSAIA: protein structure and interaction analyzer. BMC Struct Biol. 2008;8:21.","journal-title":"BMC Struct Biol"},{"issue":"12","key":"5263_CR35","doi-asserted-by":"publisher","first-page":"R277","DOI":"10.1016\/S0969-2126(00)88333-1","volume":"7","author":"J Janin","year":"1999","unstructured":"Janin J. Wet and dry interfaces: the role of solvent in protein\u2013protein and protein\u2013DNA recognition. Structure. 1999;7(12):R277\u20139.","journal-title":"Structure"},{"issue":"5","key":"5263_CR36","doi-asserted-by":"publisher","first-page":"777","DOI":"10.1006\/jmbi.1994.1334","volume":"238","author":"IK McDonald","year":"1994","unstructured":"McDonald IK, Thornton JM. Satisfying hydrogen bonding potential in proteins. J Mol Biol. 1994;238(5):777\u201393.","journal-title":"J Mol Biol"},{"key":"5263_CR37","unstructured":"Skodras A: Discrete wavelet transform: an introduction; 2003."},{"key":"5263_CR38","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1007\/978-1-4419-1545-0_5","volume-title":"Wavelets: theory and applications for manufacturing","author":"RX Gao","year":"2011","unstructured":"Gao RX, Yan R. Wavelet packet transform. In: Gao RX, Yan R, editors. Wavelets: theory and applications for manufacturing. Boston: Springer; 2011. p. 
69\u201381."},{"key":"5263_CR39","doi-asserted-by":"crossref","unstructured":"Chakraborty S, Gupta V: DWT based cancer identification using EIIP. In: 2016 second international conference on computational intelligence and communication technology (CICT), IEEE; 2016. pp. 718\u2013723.","DOI":"10.1109\/CICT.2016.148"},{"issue":"8","key":"5263_CR40","doi-asserted-by":"publisher","first-page":"1344","DOI":"10.1002\/jcc.21115","volume":"30","author":"JD Qiu","year":"2009","unstructured":"Qiu JD, Luo SH, Huang JH, Liang RP. Using support vector machines for prediction of protein structural classes based on discrete wavelet transform. J Comput Chem. 2009;30(8):1344\u201350.","journal-title":"J Comput Chem"},{"issue":"3","key":"5263_CR41","doi-asserted-by":"publisher","first-page":"220","DOI":"10.1016\/j.compbiolchem.2005.04.007","volume":"29","author":"Z-N Wen","year":"2005","unstructured":"Wen Z-N, Wang K-L, Li M-L, Nie F-S, Yang Y. Analyzing functional similarity of protein sequences with discrete wavelet transform. Comput Biol Chem. 2005;29(3):220\u20138.","journal-title":"Comput Biol Chem"},{"issue":"18","key":"5263_CR42","doi-asserted-by":"publisher","first-page":"i467","DOI":"10.1093\/bioinformatics\/btq371","volume":"26","author":"A Vo","year":"2010","unstructured":"Vo A, Nguyen N, Huang H. Solenoid and non-solenoid protein recognition using stationary wavelet packet transform. Bioinformatics. 2010;26(18):i467\u201373.","journal-title":"Bioinformatics"},{"key":"5263_CR43","doi-asserted-by":"crossref","unstructured":"Liu G, Luan Y: Identification of protein coding regions in the eukaryotic DNA sequences based on Marple algorithm and wavelet packets transform. In: Abstract and applied analysis, Hindawi; 2014.","DOI":"10.1155\/2014\/402567"},{"key":"5263_CR44","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.sigpro.2013.04.015","volume":"96","author":"R Yan","year":"2014","unstructured":"Yan R, Gao RX, Chen X. 
Wavelets for fault diagnosis of rotary machines: a review with applications. Signal Process. 2014;96:1\u201315.","journal-title":"Signal Process"},{"key":"5263_CR45","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1016\/j.chemolab.2018.08.013","volume":"182","author":"F Ali","year":"2018","unstructured":"Ali F, Kabir M, Arif M, Khan Swati ZN, Khan ZU, Ullah M, Yu D-J. DBPPred-PDSD: machine learning approach for prediction of DNA-binding proteins using discrete wavelet transform and optimized integrated features space. Chemom Intell Lab Syst. 2018;182:21\u201330.","journal-title":"Chemom Intell Lab Syst"},{"issue":"10","key":"5263_CR46","doi-asserted-by":"publisher","first-page":"2464","DOI":"10.1109\/78.157290","volume":"40","author":"MJ Shensa","year":"1992","unstructured":"Shensa MJ. The discrete wavelet transform: wedding the a trous and Mallat algorithms. IEEE Trans Signal Process. 1992;40(10):2464\u201382.","journal-title":"IEEE Trans Signal Process"},{"key":"5263_CR47","unstructured":"R\u00e9nyi A: On measures of entropy and information. In: Proceedings of the fourth Berkeley symposium on mathematical statistics and probability, Berkeley; 1961."},{"issue":"1","key":"5263_CR48","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1016\/S0165-0270(00)00356-3","volume":"105","author":"OA Rosso","year":"2001","unstructured":"Rosso OA, Blanco S, Yordanova J, Kolev V, Figliola A, Sch\u00fcrmann M, Ba\u015far E. Wavelet entropy: a new tool for analysis of short duration brain electrical signals. J Neurosci Methods. 2001;105(1):65\u201375.","journal-title":"J Neurosci Methods"},{"issue":"8","key":"5263_CR49","doi-asserted-by":"publisher","first-page":"1226","DOI":"10.1109\/TPAMI.2005.159","volume":"27","author":"P Hanchuan","year":"2005","unstructured":"Hanchuan P, Fuhui L, Ding C. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Patt Anal Mach Intell. 
2005;27(8):1226\u201338.","journal-title":"IEEE Trans Patt Anal Mach Intell"},{"issue":"1","key":"5263_CR50","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L. Random forests. Mach Learn. 2001;45(1):5\u201332.","journal-title":"Mach Learn"},{"issue":"1","key":"5263_CR51","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1023\/A:1012487302797","volume":"46","author":"I Guyon","year":"2002","unstructured":"Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn. 2002;46(1):389\u2013422.","journal-title":"Mach Learn"},{"issue":"1","key":"5263_CR52","doi-asserted-by":"publisher","first-page":"e86703","DOI":"10.1371\/journal.pone.0086703","volume":"9","author":"W Lou","year":"2014","unstructured":"Lou W, Wang X, Chen F, Chen Y, Jiang B, Zhang H. Sequence based prediction of DNA-binding proteins based on hybrid feature selection using random forest and Gaussian naive Bayes. PLoS ONE. 
2014;9(1):e86703.","journal-title":"PLoS ONE"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05263-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-023-05263-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05263-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,4,5]],"date-time":"2023-04-05T06:52:29Z","timestamp":1680677549000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-023-05263-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,4]]},"references-count":52,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["5263"],"URL":"https:\/\/doi.org\/10.1186\/s12859-023-05263-7","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,4,4]]},"assertion":[{"value":"31 January 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 March 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 April 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not 
applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"129"}}