{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,21]],"date-time":"2025-12-21T06:23:36Z","timestamp":1766298216337,"version":"3.37.3"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,2,6]],"date-time":"2024-02-06T00:00:00Z","timestamp":1707177600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,2,6]],"date-time":"2024-02-06T00:00:00Z","timestamp":1707177600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Neural Process Lett"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Multiple Kernel Learning (MKL) is a conventional way to learn the kernel function in kernel-based methods. MKL algorithms enhance the performance of kernel methods. However, these methods have a lower complexity compared to deep models and are inferior to them regarding recognition accuracy. Deep learning models can learn complex functions by applying nonlinear transformations to data through several layers. In this paper, we show that a typical MKL algorithm can be interpreted as a one-layer neural network with linear activation functions. By this interpretation, we propose a Neural Generalization of Multiple Kernel Learning (NGMKL), which extends the conventional MKL framework to a multi-layer neural network with nonlinear activation functions. Our experiments show that the proposed method, which has a higher complexity than traditional MKL methods, leads to higher recognition accuracy on several benchmarks.<\/jats:p>","DOI":"10.1007\/s11063-024-11516-0","type":"journal-article","created":{"date-parts":[[2024,2,6]],"date-time":"2024-02-06T21:19:38Z","timestamp":1707254378000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Neural Generalization of Multiple Kernel Learning"],"prefix":"10.1007","volume":"56","author":[{"given":"Ahmad Navid","family":"Ghanizadeh","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6043-1820","authenticated-orcid":false,"given":"Kamaledin","family":"Ghiasi-Shirazi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Reza","family":"Monsefi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mohammadreza","family":"Qaraei","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,2,6]]},"reference":[{"key":"11516_CR1","doi-asserted-by":"crossref","unstructured":"Bach FR, Lanckriet GR, Jordan MI (2004) Multiple kernel learning, conic duality, and the smo algorithm. In: Proceedings of the twenty-first international conference on Machine learning, p\u00a06","DOI":"10.1145\/1015330.1015424"},{"key":"11516_CR2","unstructured":"Belkin M, Ma S, Mandal S (2018) To understand deep learning we need to understand kernel learning. arXiv preprint arXiv:1802.01396"},{"issue":"5","key":"11516_CR3","first-page":"1","volume":"34","author":"Y Bengio","year":"2007","unstructured":"Bengio Y, LeCun Y et al (2007) Scaling learning algorithms towards AI. Large-scale kernel Mach 34(5):1\u201341","journal-title":"Large-scale kernel Mach"},{"issue":"7","key":"11516_CR4","first-page":"1354","volume":"36","author":"SS Bucak","year":"2013","unstructured":"Bucak SS, Jin R, Jain AK (2013) Multiple kernel learning for visual object recognition: a review. IEEE Trans Pattern Anal Mach Intell 36(7):1354\u20131369","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"11516_CR5","unstructured":"Chapelle O, Rakotomamonjy A (2008) Second order optimization of kernel parameters. In: Proceedings of the NIPS Workshop on Kernel Learning: Automatic Selection of Optimal Kernels, vol\u00a019, p\u00a087"},{"issue":"10","key":"11516_CR6","doi-asserted-by":"publisher","first-page":"2678","DOI":"10.1162\/NECO_a_00018","volume":"22","author":"Y Cho","year":"2010","unstructured":"Cho Y, Saul LK (2010) Large-margin classification in infinite neural networks. Neural Comput 22(10):2678\u20132697","journal-title":"Neural Comput"},{"key":"11516_CR7","unstructured":"Cortes C, Mohri M, Rostamizadeh A (2009) Learning non-linear combinations of kernels. Adv Neural Inf Process Syst 396\u2013404"},{"key":"11516_CR8","unstructured":"Daniely A, Frostig R, Singer Y (2016) Toward deeper understanding of neural networks: The power of initialization and a dual view on expressivity. Adv Neural Inf Process Syst 2253\u20132261"},{"key":"11516_CR9","first-page":"2211","volume":"12","author":"M G\u00f6nen","year":"2011","unstructured":"G\u00f6nen M, Alpayd\u0131n E (2011) Multiple kernel learning algorithms. J Mach Learn Res 12:2211\u20132268","journal-title":"J Mach Learn Res"},{"key":"11516_CR10","doi-asserted-by":"crossref","unstructured":"Grauman K, Darrell T (2005) The pyramid match kernel: Discriminative classification with sets of image features. In: Tenth IEEE International conference on computer vision (ICCV\u201905) Volume 1, IEEE, vol\u00a02, pp 1458\u20131465","DOI":"10.1109\/ICCV.2005.239"},{"key":"11516_CR11","unstructured":"Hazan T, Jaakkola T (2015) Steps toward deep kernel methods from infinite neural networks. arXiv preprint arXiv:1508.05133"},{"key":"11516_CR12","unstructured":"Jacot A, Gabriel F, Hongler C (2018) Neural tangent kernel: convergence and generalization in neural networks. Adv Neural Inf Process Syst 8571\u20138580"},{"key":"11516_CR13","doi-asserted-by":"crossref","unstructured":"Jiu M, Sahbi H (2016) Deep kernel map networks for image annotation. In: 2016 IEEE international conference on acoustics, speech and signal processing (ICASSP), IEEE, pp 1571\u20131575","DOI":"10.1109\/ICASSP.2016.7471941"},{"issue":"4","key":"11516_CR14","doi-asserted-by":"publisher","first-page":"1820","DOI":"10.1109\/TIP.2017.2666038","volume":"26","author":"M Jiu","year":"2017","unstructured":"Jiu M, Sahbi H (2017) Nonlinear deep kernel learning for image annotation. IEEE Trans Image Process 26(4):1820\u20131832","journal-title":"IEEE Trans Image Process"},{"key":"11516_CR15","doi-asserted-by":"publisher","first-page":"447","DOI":"10.1016\/j.patcog.2018.12.005","volume":"88","author":"M Jiu","year":"2019","unstructured":"Jiu M, Sahbi H (2019) Deep representation design from deep kernel networks. Pattern Recogn 88:447\u2013457","journal-title":"Pattern Recogn"},{"key":"11516_CR16","first-page":"953","volume":"12","author":"M Kloft","year":"2011","unstructured":"Kloft M, Brefeld U, Sonnenburg S, Zien A (2011) LP-norm multiple kernel learning. J Mach Learn Res 12:953\u2013997","journal-title":"J Mach Learn Res"},{"key":"11516_CR17","first-page":"27","volume":"5","author":"GR Lanckriet","year":"2004","unstructured":"Lanckriet GR, Cristianini N, Bartlett P, Ghaoui LE, Jordan MI (2004) Learning the kernel matrix with semidefinite programming. J Mach Learn Res 5:27\u201372","journal-title":"J Mach Learn Res"},{"key":"11516_CR18","unstructured":"Lee J, Bahri Y, Novak R, Schoenholz SS, Pennington J, Sohl-Dickstein J (2017) Deep neural networks as gaussian processes. arXiv preprint arXiv:1711.00165"},{"issue":"4","key":"11516_CR19","doi-asserted-by":"publisher","first-page":"748","DOI":"10.1109\/TASL.2008.2012193","volume":"17","author":"C Longworth","year":"2009","unstructured":"Longworth C, Gales MJ (2009) Combining derivative and parametric kernels for speaker verification. IEEE Trans Audio Speech Lang Process 17(4):748\u2013757","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"11516_CR20","unstructured":"Mairal J (2016) End-to-end kernel learning with supervised convolutional kernel networks. Adv Neural Inf Process Syst 1399\u20131407"},{"key":"11516_CR21","unstructured":"Mairal J, Koniusz P, Harchaoui Z, Schmid C (2014) Convolutional kernel networks. Adv Neural Inf Process Syst 2627\u20132635"},{"issue":"12","key":"11516_CR22","doi-asserted-by":"publisher","first-page":"e39","DOI":"10.1016\/j.ijmedinf.2009.04.010","volume":"78","author":"M Miwa","year":"2009","unstructured":"Miwa M, S\u00e6tre R, Miyao Y, Tsujii J (2009) Protein-protein interaction extraction by leveraging multiple kernels and parsers. Int J Med Informat 78(12):e39\u2013e46","journal-title":"Int J Med Informat"},{"key":"11516_CR23","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1016\/j.patrec.2018.09.016","volume":"116","author":"MR Mohammadnia-Qaraei","year":"2018","unstructured":"Mohammadnia-Qaraei MR, Monsefi R, Ghiasi-Shirazi K (2018) Convolutional kernel networks based on a convex combination of cosine kernels. Pattern Recogn Lett 116:127\u2013134","journal-title":"Pattern Recogn Lett"},{"key":"11516_CR24","unstructured":"Pandey G, Dukkipati A (2014a) Learning by stretching deep networks. In: International conference on machine learning 1719\u20131727"},{"key":"11516_CR25","unstructured":"Pandey G, Dukkipati A (2014b) To go deep or wide in learning? In: Artificial Intelligence and Statistics, PMLR, pp 724\u2013732"},{"key":"11516_CR26","unstructured":"Rahimi A, Recht B (2008) Random features for large-scale kernel machines. Adv Neural Inf Process Syst 1177\u20131184"},{"key":"11516_CR27","first-page":"2491","volume":"9","author":"A Rakotomamonjy","year":"2008","unstructured":"Rakotomamonjy A, Bach FR, Canu S, Grandvalet Y (2008) Simplemkl. J Mach Learn Res 9:2491\u20132521","journal-title":"J Mach Learn Res"},{"issue":"3","key":"11516_CR28","doi-asserted-by":"publisher","first-page":"287","DOI":"10.1023\/A:1007618119488","volume":"42","author":"G R\u00e4tsch","year":"2001","unstructured":"R\u00e4tsch G, Onoda T, M\u00fcller KR (2001) Soft margins for adaboost. Mach Learn 42(3):287\u2013320","journal-title":"Mach Learn"},{"issue":"8","key":"11516_CR29","doi-asserted-by":"publisher","first-page":"2305","DOI":"10.1007\/s00521-015-2066-x","volume":"27","author":"I Rebai","year":"2016","unstructured":"Rebai I, BenAyed Y, Mahdi W (2016) Deep multilayer multiple kernel learning. Neural Comput Appl 27(8):2305\u20132314","journal-title":"Neural Comput Appl"},{"issue":"5","key":"11516_CR30","doi-asserted-by":"publisher","first-page":"1063","DOI":"10.1162\/089976604773135104","volume":"16","author":"L Rosasco","year":"2004","unstructured":"Rosasco L, Vito ED, Caponnetto A, Piana M, Verri A (2004) Are loss functions all the same? Neural Comput 16(5):1063\u20131076","journal-title":"Neural Comput"},{"key":"11516_CR31","unstructured":"Sahbi H (2019) Totally deep support vector machines. arXiv preprint arXiv:1912.05864"},{"issue":"11","key":"11516_CR32","doi-asserted-by":"publisher","first-page":"5528","DOI":"10.1109\/TNNLS.2018.2804895","volume":"29","author":"H Song","year":"2018","unstructured":"Song H, Thiagarajan JJ, Sattigeri P, Spanias A (2018) Optimizing kernel machines using deep learning. IEEE Trans Neural Netw Learn Syst 29(11):5528\u20135540","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"key":"11516_CR33","unstructured":"Sonnenburg S, R\u00e4tsch G, Sch\u00e4fer C (2006a) A general and efficient multiple kernel learning algorithm. Adv Neural Inf Process Syst 1273\u20131280"},{"key":"11516_CR34","first-page":"1531","volume":"7","author":"S Sonnenburg","year":"2006","unstructured":"Sonnenburg S, R\u00e4tsch G, Sch\u00e4fer C, Sch\u00f6lkopf B (2006) Large scale multiple kernel learning. J Mach Learn Res 7:1531\u20131565","journal-title":"J Mach Learn Res"},{"key":"11516_CR35","doi-asserted-by":"crossref","unstructured":"Varma M, Babu BR (2009) More generality in efficient multiple kernel learning. In: Proceedings of the 26th annual international conference on machine learning, pp 1065\u20131072","DOI":"10.1145\/1553374.1553510"},{"key":"11516_CR36","doi-asserted-by":"crossref","unstructured":"Vedaldi A, Gulshan V, Varma M, Zisserman A (2009) Multiple kernels for object detection. In: 2009 IEEE 12th international conference on computer vision, IEEE, pp 606\u2013613","DOI":"10.1109\/ICCV.2009.5459183"},{"key":"11516_CR37","unstructured":"Williams CK, Seeger M (2001) Using the nystr\u00f6m method to speed up kernel machines. Adv Neural Inf Process Syst 682\u2013688"},{"key":"11516_CR38","unstructured":"Wilson AG, Hu Z, Salakhutdinov R, Xing EP (2016) Deep kernel learning. In: Artificial intelligence and statistics, pp 370\u2013378"},{"issue":"7","key":"11516_CR39","doi-asserted-by":"publisher","first-page":"1574","DOI":"10.1109\/TKDE.2012.89","volume":"25","author":"H Xia","year":"2012","unstructured":"Xia H, Hoi SC (2012) Mkboost: a framework of multiple kernel boosting. IEEE Trans Knowl Data Eng 25(7):1574\u20131586","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"11516_CR40","unstructured":"Zhuang J, Tsang IW, Hoi SC (2011) Two-layer multiple kernel learning. In: Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp 909\u2013917"}],"container-title":["Neural Processing Letters"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11063-024-11516-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11063-024-11516-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11063-024-11516-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,29]],"date-time":"2024-02-29T20:09:37Z","timestamp":1709237377000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11063-024-11516-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2,6]]},"references-count":40,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,2]]}},"alternative-id":["11516"],"URL":"https:\/\/doi.org\/10.1007\/s11063-024-11516-0","relation":{},"ISSN":["1573-773X"],"issn-type":[{"type":"electronic","value":"1573-773X"}],"subject":[],"published":{"date-parts":[[2024,2,6]]},"assertion":[{"value":"21 November 2023","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 February 2024","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"12"}}