{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,26]],"date-time":"2025-10-26T21:17:50Z","timestamp":1761513470500},"reference-count":53,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2014,5,27]],"date-time":"2014-05-27T00:00:00Z","timestamp":1401148800000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"published-print":{"date-parts":[[2014,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>The prediction of sites and products of metabolism in xenobiotic compounds is key to the development of new chemical entities, where screening potential metabolites for toxicity or unwanted side-effects is of crucial importance. In this work 2D topological fingerprints are used to encode atomic sites and three probabilistic machine learning methods are applied: Parzen-Rosenblatt Window (PRW), Naive Bayesian (NB) and a novel approach called RASCAL (Random Attribute Subsampling Classification ALgorithm). These are implemented by randomly subsampling descriptor space to alleviate the problem often suffered by data mining methods of having to exactly match fingerprints, and in the case of PRW by measuring a distance between feature vectors rather than exact matching. The classifiers have been implemented in CUDA\/C++ to exploit the parallel architecture of graphical processing units (GPUs) and is freely available in a public repository.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>It is shown that for PRW a SoM (Site of Metabolism) is identified in the top two predictions for 85%, 91% and 88% of the CYP 3A4, 2D6 and 2C9 data sets respectively, with RASCAL giving similar performance of 83%, 91% and 88%, respectively. These results put PRW and RASCAL performance ahead of NB which gave a much lower classification performance of 51%, 73% and 74%, respectively.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>2D topological fingerprints calculated to a bond depth of 4-6 contain sufficient information to allow the identification of SoMs using classifiers based on relatively small data sets. Thus, the machine learning methods outlined in this paper are conceptually simpler and more efficient than other methods tested and the use of simple topological descriptors derived from 2D structure give results competitive with other approaches using more expensive quantum chemical descriptors. The descriptor space subsampling approach and ensemble methodology allow the methods to be applied to molecules more distant from the training data where data mining would be more likely to fail due to the lack of common fingerprints. The RASCAL algorithm is shown to give equivalent classification performance to PRW but at lower computational expense allowing it to be applied more efficiently in the ensemble scheme.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1758-2946-6-29","type":"journal-article","created":{"date-parts":[[2014,5,27]],"date-time":"2014-05-27T11:19:07Z","timestamp":1401189547000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":26,"title":["Cytochrome P450 site of metabolism prediction from 2D topological fingerprints using GPU accelerated probabilistic classifiers"],"prefix":"10.1186","volume":"6","author":[{"given":"Jonathan D","family":"Tyzack","sequence":"first","affiliation":[]},{"given":"Hamse Y","family":"Mussa","sequence":"additional","affiliation":[]},{"given":"Mark J","family":"Williamson","sequence":"additional","affiliation":[]},{"given":"Johannes","family":"Kirchmair","sequence":"additional","affiliation":[]},{"given":"Robert C","family":"Glen","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2014,5,27]]},"reference":[{"key":"604_CR1","doi-asserted-by":"publisher","first-page":"E101","DOI":"10.1208\/aapsj080112","volume":"8","author":"FP Guengerich","year":"2006","unstructured":"Guengerich FP: Cytochrome P450s and other enzymes in drug metabolism and toxicity. AAPS J. 2006, 8: E101-11. 10.1208\/aapsj080112. [http:\/\/link.springer.com\/article\/10.1208\/aapsj080112]","journal-title":"AAPS J"},{"issue":"3","key":"604_CR2","doi-asserted-by":"publisher","first-page":"305","DOI":"10.1517\/phgs.5.3.305.29827","volume":"5","author":"DFV Lewis","year":"2004","unstructured":"Lewis DFV: 57 varieties: the human cytochromes P450. Pharmacogenomics. 2004, 5 (3): 305-18. 10.1517\/phgs.5.3.305.29827. [http:\/\/www.futuremedicine.com\/doi\/abs\/10.1517\/phgs.5.3.305.29827]","journal-title":"Pharmacogenomics"},{"issue":"3","key":"604_CR3","doi-asserted-by":"publisher","first-page":"617","DOI":"10.1021\/ci200542m","volume":"52","author":"J Kirchmair","year":"2012","unstructured":"Kirchmair J, Williamson MJ, Tyzack JD, Tan L, Bond PJ, Bender A, Glen RC: Computational prediction of metabolism: sites, products, SAR, P450 enzyme dynamics, and mechanisms. J Chem Inf Model. 2012, 52 (3): 617-48. 10.1021\/ci200542m. [http:\/\/pubs.acs.org\/doi\/abs\/10.1021\/ci200542m]","journal-title":"J Chem Inf Model"},{"issue":"10-11","key":"604_CR4","doi-asserted-by":"publisher","first-page":"955","DOI":"10.1080\/00498250500354402","volume":"35","author":"SA Kulkarni","year":"2005","unstructured":"Kulkarni SA, Zhu J, Blechinger S: In silico techniques for the study and prediction of xenobiotic metabolism: a review. Xenobiotica; Fate Foreign Compounds Biol Syst. 2005, 35 (10-11): 955-73. 10.1080\/00498250500354402. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/16393855]","journal-title":"Xenobiotica; Fate Foreign Compounds Biol Syst"},{"issue":"3","key":"604_CR5","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1517\/17425255.2011.553599","volume":"7","author":"A Tarcsay","year":"2011","unstructured":"Tarcsay A, Keseru GM: In silico site of metabolism prediction of cytochrome P450-mediated biotransformations. Expert Opin Drug Metab Toxicol. 2011, 7 (3): 299-312. 10.1517\/17425255.2011.553599. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/21291341]","journal-title":"Expert Opin Drug Metab Toxicol"},{"issue":"2","key":"604_CR6","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1517\/17425255.1.2.303","volume":"1","author":"S Ekins","year":"2005","unstructured":"Ekins S, Andreyev S, Ryabov A, Kirillov E, Bugrim A, Nikolskaya T, Rakhmatulin Ea: Computational prediction of human drug metabolism. Expert Opin Drug Metab Toxicol. 2005, 1 (2): 303-24. 10.1517\/17425255.1.2.303. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/16922645]","journal-title":"Expert Opin Drug Metab Toxicol"},{"issue":"7","key":"604_CR7","doi-asserted-by":"publisher","first-page":"851","DOI":"10.1517\/17425255.2010.499123","volume":"6","author":"RJ Vaz","year":"2010","unstructured":"Vaz RJ, Zamora I, Li Y, Reiling S, Shen J, Cruciani G: The challenges of in silico contributions to drug metabolism in lead optimization. Expert Opin Drug Metab Toxicol. 2010, 6 (7): 851-61. 10.1517\/17425255.2010.499123. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/20565339]","journal-title":"Expert Opin Drug Metab Toxicol"},{"issue":"3","key":"604_CR8","doi-asserted-by":"publisher","first-page":"96","DOI":"10.1021\/ml100016x","volume":"1","author":"P Rydberg","year":"2010","unstructured":"Rydberg P, Gloriam DE, Zaretzki J, Breneman C, Olsen L: SMARTCyp: A 2D method for prediction of cytochrome P450-Mediated drug metabolism. ACS Med Chem Lett. 2010, 1 (3): 96-100. 10.1021\/ml100016x. [http:\/\/pubs.acs.org\/doi\/abs\/10.1021\/ml100016x]","journal-title":"ACS Med Chem Lett"},{"issue":"9","key":"604_CR9","doi-asserted-by":"publisher","first-page":"2471","DOI":"10.1021\/ci3003073","volume":"52","author":"V Campagna-Slater","year":"2012","unstructured":"Campagna-Slater V, Pottel J, Therrien E, Cantin LD, Moitessier N: Development of a computational tool to rival experts in the prediction of sites of metabolism of xenobiotics by p450s. J Chem Inf Model. 2012, 52 (9): 2471-83. 10.1021\/ci3003073. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/22916680]","journal-title":"J Chem Inf Model"},{"issue":"6","key":"604_CR10","doi-asserted-by":"publisher","first-page":"1294","DOI":"10.1021\/ci400058s","volume":"53","author":"JD Tyzack","year":"2013","unstructured":"Tyzack JD, Williamson MJ, Torella R, Glen RC: Prediction of cytochrome P450 xenobiotic metabolism: tethered docking and reactivity derived from ligand molecular orbital analysis. J Chem Inf Model. 2013, 53 (6): 1294-305. 10.1021\/ci400058s. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/23701380]","journal-title":"J Chem Inf Model"},{"key":"604_CR11","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1016\/S0022-2836(95)80037-9","volume":"245","author":"G Jones","year":"1995","unstructured":"Jones G, Willett P, Glen RC: Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. J Mol Biol. 1995, 245: 43-53. 10.1016\/S0022-2836(95)80037-9. [http:\/\/www.sciencedirect.com\/science\/article\/pii\/S0022283695800379]","journal-title":"J Mol Biol"},{"key":"604_CR12","unstructured":"MetaPrint2D, (accessed 03-06-2013). [http:\/\/www-metaprint2d.ch.cam.ac.uk\/]"},{"key":"604_CR13","unstructured":"Accelrys Metabolite Database. Accelrys Inc., 10188 Telesis Court, Suite 100, San Diego, CA, 92121, USA. [http:\/\/accelrys.com\/products\/databases\/bioactivity\/metabolite.html]"},{"issue":"7","key":"604_CR14","doi-asserted-by":"publisher","first-page":"1667","DOI":"10.1021\/ci2000488","volume":"51","author":"J Zaretzki","year":"2011","unstructured":"Zaretzki J, Bergeron C, Rydberg P, Huang TW, Bennett KP, Breneman CM: RS-Predictor: a new tool for predicting sites of cytochrome P450-Mediated metabolism applied to CYP 3A4. J Chem Inf Model. 2011, 51 (7): 1667-89. 10.1021\/ci2000488. [http:\/\/pubs.acs.org\/doi\/abs\/10.1021\/ci2000488]","journal-title":"J Chem Inf Model"},{"issue":"6","key":"604_CR15","doi-asserted-by":"publisher","first-page":"1637","DOI":"10.1021\/ci300009z","volume":"52","author":"J Zaretzki","year":"2012","unstructured":"Zaretzki J, Rydberg P, Bergeron C, Bennett KP, Olsen L, Breneman CM: RS-Predictor models augmented with SMARTCyp reactivities: robust metabolic regioselectivity predictions for nine CYP isozymes. J Chem Inf Model. 2012, 52 (6): 1637-59. 10.1021\/ci300009z. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/22524152]","journal-title":"J Chem Inf Model"},{"issue":"12","key":"604_CR16","doi-asserted-by":"publisher","first-page":"3373","DOI":"10.1021\/ci400518g","volume":"53","author":"J Zaretzki","year":"2013","unstructured":"Zaretzki J, Matlock M, Swamidass SJ: XenoSite: accurately predicting CYP-mediated sites of metabolism with neural networks. J Chem Inf Model. 2013, 53 (12): 3373-83. 10.1021\/ci400518g. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/24224933]","journal-title":"J Chem Inf Model"},{"key":"604_CR17","unstructured":"Daylight Chemical Information Systems, Inc. Aliso Viejo, CA. [http:\/\/www.daylight.com\/dayhtml\/doc\/theory\/theory.finger.html]"},{"issue":"3","key":"604_CR18","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1002\/minf.200900086","volume":"29","author":"K Hasegawa","year":"2010","unstructured":"Hasegawa K, Koyama M, Funatsu K: Quantitative prediction of regioselectivity toward cytochrome P450\/3A4 using machine learning approaches. Mol Inform. 2010, 29 (3): 243-249. 10.1002\/minf.200900086. [http:\/\/doi.wiley.com\/10.1002\/minf.200900086]","journal-title":"Mol Inform"},{"issue":"22","key":"604_CR19","doi-asserted-by":"publisher","first-page":"6489","DOI":"10.1021\/jm060551l","volume":"49","author":"L Olsen","year":"2006","unstructured":"Olsen L, Rydberg P, Rod TH, Ryde U: Prediction of activation energies for hydrogen abstraction by cytochrome P450. J Med Chem. 2006, 49 (22): 6489-6499. 10.1021\/jm060551l. [http:\/\/pubs.acs.org\/doi\/abs\/10.1021\/jm060551l]","journal-title":"J Med Chem"},{"issue":"50","key":"604_CR20","doi-asserted-by":"publisher","first-page":"13058","DOI":"10.1021\/jp803854v","volume":"112","author":"P Rydberg","year":"2008","unstructured":"Rydberg P, Ryde U, Olsen L: Prediction of activation energies for aromatic oxidation by cytochrome P450. J Phys Chem A. 2008, 112 (50): 13058-65. 10.1021\/jp803854v. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/18986131]","journal-title":"J Phys Chem A"},{"key":"604_CR21","unstructured":"Molecular Operating Environment (MOE), 2012.10. Chemical Computing Group Inc., 1010 Sherbooke St. West, Suite 910, Montreal, QC, Canada, H3A 2R7, 2012. [https:\/\/www.chemcomp.com\/MOE-Molecular_Operating_Environment.htm]"},{"issue":"11","key":"604_CR22","doi-asserted-by":"publisher","first-page":"1537","DOI":"10.1093\/bioinformatics\/btr177","volume":"27","author":"F Mu","year":"2011","unstructured":"Mu F, Unkefer CJ, Unkefer PJ, Hlavacek WS: Prediction of metabolic reactions based on atomic and molecular properties of small-molecule compounds. Bioinformatics (Oxford, England). 2011, 27 (11): 1537-45. 10.1093\/bioinformatics\/btr177. [http:\/\/bioinformatics.oxfordjournals.org\/content\/27\/11\/1537.short]","journal-title":"Bioinformatics (Oxford, England)"},{"issue":"Database issue","key":"604_CR23","doi-asserted-by":"publisher","first-page":"D109","DOI":"10.1093\/nar\/gkr988","volume":"40","author":"M Kanehisa","year":"2012","unstructured":"Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M: KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012, 40 (Database issue): D109-14. [http:\/\/nar.oxfordjournals.org\/content\/40\/D1\/D109.short]","journal-title":"Nucleic Acids Res"},{"issue":"11","key":"604_CR24","doi-asserted-by":"publisher","first-page":"2896","DOI":"10.1021\/ci400503s","volume":"53","author":"J Kirchmair","year":"2013","unstructured":"Kirchmair J, Williamson MJ, Afzal AM, Tyzack JD, Choy APK, Howlett A, Rydberg P, Glen RC: FAst MEtabolizer (FAME): A rapid and accurate predictor of sites of metabolism in multiple species by endogenous enzymes. J Chem Inf Model. 2013, 53 (11): 2896-907. 10.1021\/ci400503s. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/24219364]","journal-title":"J Chem Inf Model"},{"issue":"17","key":"604_CR25","doi-asserted-by":"publisher","first-page":"2111","DOI":"10.2174\/138161206777585274","volume":"12","author":"C Steinbeck","year":"2006","unstructured":"Steinbeck C, Hoppe C, Kuhn S, Floris M, Guha R, Willighagen E: Recent developments of the chemistry development kit (CDK) - an open-source Java library for Chemo- and Bioinformatics. Curr Pharm Des. 2006, 12 (17): 2111-2120. 10.2174\/138161206777585274. [http:\/\/www.ingentaconnect.com\/content\/ben\/cpd\/2006\/00000012\/00000017\/art00005]","journal-title":"Curr Pharm Des"},{"key":"604_CR26","first-page":"140113114718001","volume-title":"J Chem Inf Model","author":"AV Rudik","year":"2014","unstructured":"Rudik AV, Dmitriev A, Lagunin AA, Filimonov D, Poroikov VV: Metabolism site prediction based on xenobiotic structural formulae and PASS prediction algorithm. J Chem Inf Model. 2014, 140113114718001-[http:\/\/pubs.acs.org\/doi\/abs\/10.1021\/ci400472j]"},{"issue":"4","key":"604_CR27","first-page":"796","volume":"42","author":"L Xing","year":"2002","unstructured":"Xing L, Glen R: Novel Methods for the Prediction of logP, pKa, and logD. J Chem Inf Model. 2002, 42 (4): 796-805. 10.1021\/ci010315d. [http:\/\/pubs.acs.org\/cgi-bin\/doilookup\/?10.1021\/ci010315d]","journal-title":"J Chem Inf Model"},{"issue":"3","key":"604_CR28","doi-asserted-by":"publisher","first-page":"870","DOI":"10.1021\/ci020386s","volume":"43","author":"L Xing","year":"2003","unstructured":"Xing L, Glen RC, Clark RD: Predicting pK(a) by molecular tree structured fingerprints and PLS. J Chem Inf Comput Sci. 2003, 43 (3): 870-879. 10.1021\/ci020386s. [http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/12767145]","journal-title":"J Chem Inf Comput Sci"},{"issue":"10","key":"604_CR29","doi-asserted-by":"publisher","first-page":"56","DOI":"10.1145\/1562764.1562783","volume":"52","author":"K Asanovic","year":"2009","unstructured":"Asanovic K, Wawrzynek J, Wessel D, Yelick K, Bodik R, Demmel J, Keaveny T, Keutzer K, Kubiatowicz J, Morgan N, Patterson D, Sen K: A view of the parallel computing landscape. Commun ACM. 2009, 52 (10): 56-10.1145\/1562764.1562783. [http:\/\/dl.acm.org\/citation.cfm?id=1562783]","journal-title":"Commun ACM"},{"key":"604_CR30","doi-asserted-by":"publisher","first-page":"104","DOI":"10.1145\/1390156.1390170","volume-title":"Proceedings of the 25th international conference on Machine learning - ICML \u201908","author":"B Catanzaro","year":"2008","unstructured":"Catanzaro B, Sundaram N, Keutzer K: Fast support vector machine training and classification on graphics processors. Proceedings of the 25th international conference on Machine learning - ICML \u201908. 2008, New York, USA: ACM Press, 104-111. [http:\/\/dl.acm.org\/citation.cfm?id=1390170]"},{"issue":"4","key":"604_CR31","first-page":"387","volume":"1","author":"Q Li","year":"2011","unstructured":"Li Q, Salman R, Test E, Strack R, Kecman V: GPUSVM: a comprehensive CUDA based support vector machine package. Cent Eur J Comput Sci. 2011, 1 (4): 387-405. 10.2478\/s13537-011-0028-7. [http:\/\/www.springerlink.com\/index\/10.2478\/s13537-011-0028-7]","journal-title":"Cent Eur J Comput Sci"},{"key":"604_CR32","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1145\/1735688.1735692","volume-title":"Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units - GPGPU \u201910","author":"S Herrero-Lopez","year":"2010","unstructured":"Herrero-Lopez S, Williams JR, Sanchez A: Parallel multiclass classification using SVMs on GPUs. Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units - GPGPU \u201910. 2010, New York, USA: ACM Press, 2-2."},{"issue":"6","key":"604_CR33","doi-asserted-by":"publisher","first-page":"1311","DOI":"10.1016\/j.patcog.2004.01.013","volume":"37","author":"KS Oh","year":"2004","unstructured":"Oh KS, Jung K: GPU implementation of neural networks. Pattern Recognit. 2004, 37 (6): 1311-1314. 10.1016\/j.patcog.2004.01.013. [http:\/\/www.sciencedirect.com\/science\/article\/pii\/S0031320304000524]","journal-title":"Pattern Recognit"},{"key":"604_CR34","first-page":"415","volume-title":"2009 IEEE Youth Conference on Information, Computing and Telecommunication","author":"SLCWYLL Jian","year":"2009","unstructured":"Jian SLCWYLL: CUKNN: A parallel implementation of K-nearest neighbor on CUDA-enabled GPU. 2009 IEEE Youth Conference on Information, Computing and Telecommunication. 2009, New York: IEEE, 415-418."},{"key":"604_CR35","doi-asserted-by":"publisher","first-page":"208","DOI":"10.1016\/j.cageo.2011.12.009","volume":"46","author":"S Bernab\u00e9","year":"2012","unstructured":"Bernab\u00e9 S, Plaza A, Reddy Marpu P, Atli Benediktsson J: A new parallel tool for classification of remotely sensed imagery. Comput Geosci. 2012, 46: 208-218. [http:\/\/www.sciencedirect.com\/science\/article\/pii\/S009830041100433X]","journal-title":"Comput Geosci"},{"key":"604_CR36","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1186\/1758-2946-3-33","volume":"3","author":"NM O\u2019Boyle","year":"2011","unstructured":"O\u2019Boyle NM, Banck M, James CA, Morley C, Vandermeersch T, Hutchison GR: Open Babel: an open chemical toolbox. J Cheminformatics. 2011, 3: 33-10.1186\/1758-2946-3-33. [http:\/\/www.jcheminf.com\/content\/3\/1\/33]","journal-title":"J Cheminformatics"},{"key":"604_CR37","unstructured":"SYBYL Molecular Modeling Software:. Tripos Associates Inc., St Louis, MO, USA. [http:\/\/www.certara.com]"},{"issue":"2","key":"604_CR38","doi-asserted-by":"publisher","first-page":"442","DOI":"10.1016\/0005-2795(75)90109-9","volume":"405","author":"B Matthews","year":"1975","unstructured":"Matthews B: Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochimica et Biophysica Acta (BBA) - Protein Struct. 1975, 405 (2): 442-451. 10.1016\/0005-2795(75)90109-9. [http:\/\/www.sciencedirect.com\/science\/article\/pii\/0005279575901099]","journal-title":"Biochimica et Biophysica Acta (BBA) - Protein Struct"},{"key":"604_CR39","unstructured":"NVIDIA Nsight. NVIDIA, Santa Clara, CA, USA. [https:\/\/developer.nvidia.com\/cuda-toolkit]"},{"issue":"1","key":"604_CR40","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1186\/1758-2946-5-37","volume":"5","author":"HY Mussa","year":"2013","unstructured":"Mussa HY, Mitchell JBO, Glen RC: Full \u201cLaplacianised\u201d posterior naive Bayesian algorithm. J Cheminformatics. 2013, 5 (1): 37-43. 10.1186\/1758-2946-5-37.","journal-title":"J Cheminformatics"},{"key":"604_CR41","volume-title":"Pattern Classification and Scene Analysis","author":"RO Duda","year":"1973","unstructured":"Duda RO, Hart PE: Pattern Classification and Scene Analysis. 1973, New York, NY: John Wiley and Sons Ltd"},{"key":"604_CR42","doi-asserted-by":"publisher","DOI":"10.1002\/0470854774","volume-title":"Statistical Pattern Recognition","author":"AR Webb","year":"2002","unstructured":"Webb AR: Statistical Pattern Recognition. 2002, New York: Wiley\u2013Blackwell"},{"key":"604_CR43","volume-title":"Classification, Estimation and Pattern Recognition","author":"T Young","year":"1974","unstructured":"Young T, Calvert TW: Classification, Estimation and Pattern Recognition. 1974, New York: Elsevier"},{"key":"604_CR44","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511812651","volume-title":"Pattern Recognition and Neural Networks","author":"BD Ripley","year":"1996","unstructured":"Ripley BD: Pattern Recognition and Neural Networks. 1996, Cambridge, UK: Cambridge University Press"},{"key":"604_CR45","volume-title":"Discrimination and classification","author":"DJ Hand","year":"1981","unstructured":"Hand DJ: Discrimination and classification. 1981, New York: Wiley"},{"key":"604_CR46","volume-title":"Neural Networks for Pattern Recognition","author":"CM Bishop","year":"1996","unstructured":"Bishop CM: Neural Networks for Pattern Recognition. 1996, New York: Oxford University Press"},{"issue":"5","key":"604_CR47","first-page":"832","volume":"20","author":"TK Ho","year":"1998","unstructured":"Ho TK: The random subspace method for constructing decision forests. IEEE Tran Pat Anal Mach Intel. 1998, 20 (5): 832-844.","journal-title":"IEEE Tran Pat Anal Mach Intel"},{"key":"604_CR48","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L: Random forests. Mach Learn. 2001, 45: 5-32. 10.1023\/A:1010933404324.","journal-title":"Mach Learn"},{"key":"604_CR49","first-page":"238","volume":"2001","author":"KNC Oza","year":"2096","unstructured":"Oza KNC: Tumer: Input decimation ensembles: decorrelation through dimensionality reduction. Proc Intl Workshop Multiple Classifier Syst. 2096, 2001: 238-247.","journal-title":"Proc Intl Workshop Multiple Classifier Syst"},{"issue":"2","key":"604_CR50","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1007\/s100440200011","volume":"5","author":"Skurichina, RPW M","year":"2002","unstructured":"Skurichina, RPW M: Duin: Bagging, boosting and the random subspace method for linear classifiers. Pattern Anal Appl. 2002, 5 (2): 121-135. 10.1007\/s100440200011.","journal-title":"Pattern Anal Appl"},{"issue":"3","key":"604_CR51","doi-asserted-by":"publisher","first-page":"1065","DOI":"10.1214\/aoms\/1177704472","volume":"33","author":"E Parzen","year":"1962","unstructured":"Parzen E: On estimation of a probability density function and mode. Annal Math Stat. 1962, 33 (3): 1065-1076. 10.1214\/aoms\/1177704472.","journal-title":"Annal Math Stat"},{"issue":"19","key":"604_CR52","doi-asserted-by":"publisher","first-page":"2149","DOI":"10.1093\/bioinformatics\/btn409","volume":"24","author":"L Jacob","year":"2008","unstructured":"Jacob L, Vert JP: Protein-ligand interaction prediction: an improved chemogenomics approach. Bioinformatics (Oxford, England). 2008, 24 (19): 2149-56. 10.1093\/bioinformatics\/btn409. [http:\/\/bioinformatics.oxfordjournals.org\/content\/24\/19\/2149.short]","journal-title":"Bioinformatics (Oxford, England)"},{"key":"604_CR53","first-page":"114","volume":"5","author":"HY Mussa","year":"2013","unstructured":"Mussa HY, Tyzack JD, Glen RC: Note on the Rademacher-Walsh polynomial basis functions. J Math Res. 2013, 5: 114-121. [http:\/\/www.ccsenet.org\/journal\/index.php\/jmr\/article\/view\/24995]","journal-title":"J Math Res"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1758-2946-6-29\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1758-2946-6-29.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1758-2946-6-29.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,2]],"date-time":"2021-09-02T03:58:14Z","timestamp":1630555094000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/1758-2946-6-29"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,5,27]]},"references-count":53,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2014,12]]}},"alternative-id":["604"],"URL":"https:\/\/doi.org\/10.1186\/1758-2946-6-29","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,5,27]]},"assertion":[{"value":"19 February 2014","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 May 2014","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 May 2014","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"29"}}