{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,15]],"date-time":"2025-07-15T03:44:28Z","timestamp":1752551068913,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":22,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,17]],"date-time":"2022-10-17T00:00:00Z","timestamp":1665964800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Wallenberg AI, Autonomous Systems and Software Program (WASP)","award":["37200022"],"award-info":[{"award-number":["37200022"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,17]]},"DOI":"10.1145\/3511808.3557237","type":"proceedings-article","created":{"date-parts":[[2022,10,16]],"date-time":"2022-10-16T01:29:57Z","timestamp":1665883797000},"page":"1615-1624","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Analysis of Knowledge Transfer in Kernel Regime"],"prefix":"10.1145","author":[{"given":"Ashkan","family":"Panahi","sequence":"first","affiliation":[{"name":"Chalmers University of Technology, Gothenburg, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Arman","family":"Rahbar","sequence":"additional","affiliation":[{"name":"Chalmers University of Technology, Gothenburg, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chiranjib","family":"Bhattacharyya","sequence":"additional","affiliation":[{"name":"Indian Institute of Science, Bangalore, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Devdatt","family":"Dubhashi","sequence":"additional","affiliation":[{"name":"Chalmers University of Technology, Gothenburg, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Morteza","family":"Haghir Chehreghani","sequence":"additional","affiliation":[{"name":"Chalmers University of Technology, Gothenburg, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,10,17]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Proceedings of the 36th International Conference on Machine Learning, ICML 2019","author":"Arora Sanjeev","year":"2019","unstructured":"Sanjeev Arora , Simon S. Du , Wei Hu , Zhiyuan Li , and Ruosong Wang . 2019 . Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks . In Proceedings of the 36th International Conference on Machine Learning, ICML 2019 , 9-15 June 2019, Long Beach, California, USA. 322--332. Sanjeev Arora, Simon S. Du, Wei Hu, Zhiyuan Li, and Ruosong Wang. 2019. Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks. In Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. 322--332."},{"key":"e_1_3_2_2_2_1","volume-title":"Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems","author":"Cao Yuan","year":"2019","unstructured":"Yuan Cao and Quanquan Gu. 2019. Generalization Bounds of Stochastic Gradient Descent for Wide and Deep Neural Networks . In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019 , NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada . 10835--10845. Yuan Cao and Quanquan Gu. 2019. Generalization Bounds of Stochastic Gradient Descent for Wide and Deep Neural Networks. In Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada. 10835--10845."},{"key":"e_1_3_2_2_3_1","unstructured":"Guobin Chen Wongun Choi Xiang Yu Tony Han and Manmohan Chandraker. 2017. Learning efficient object detection models with knowledge distillation. In Advances in Neural Information Processing Systems. 742--751.  Guobin Chen Wongun Choi Xiang Yu Tony Han and Manmohan Chandraker. 2017. Learning efficient object detection models with knowledge distillation. In Advances in Neural Information Processing Systems. 742--751."},{"key":"e_1_3_2_2_4_1","volume-title":"Mach. Learn. Res.","volume":"13","author":"Cortes Corinna","year":"2012","unstructured":"Corinna Cortes , Mehryar Mohri , and Afshin Rostamizadeh . 2012 . Algorithms for Learning Kernels Based on Centered Alignment. J . Mach. Learn. Res. , Vol. 13 (March 2012), 795--828. Corinna Cortes, Mehryar Mohri, and Afshin Rostamizadeh. 2012. Algorithms for Learning Kernels Based on Centered Alignment. J. Mach. Learn. Res., Vol. 13 (March 2012), 795--828."},{"key":"e_1_3_2_2_5_1","volume-title":"Proceedings of the 36th International Conference on Machine Learning, ICML 2019","author":"Simon","year":"2019","unstructured":"Simon S. Du and Wei Hu. 2019. Width Provably Matters in Optimization for Deep Linear Neural Networks . In Proceedings of the 36th International Conference on Machine Learning, ICML 2019 , 9-15 June 2019 , Long Beach, California, USA. 1655--1664. Simon S. Du and Wei Hu. 2019. Width Provably Matters in Optimization for Deep Linear Neural Networks. In Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. 1655--1664."},{"key":"e_1_3_2_2_6_1","volume-title":"Gradient descent provably optimizes over-parameterized neural networks. arXiv preprint arXiv:1810.02054","author":"Du Simon S","year":"2018","unstructured":"Simon S Du , Xiyu Zhai , Barnabas Poczos , and Aarti Singh . 2018. Gradient descent provably optimizes over-parameterized neural networks. arXiv preprint arXiv:1810.02054 ( 2018 ). Simon S Du, Xiyu Zhai, Barnabas Poczos, and Aarti Singh. 2018. Gradient descent provably optimizes over-parameterized neural networks. arXiv preprint arXiv:1810.02054 (2018)."},{"key":"e_1_3_2_2_7_1","volume-title":"Mach. Learn. Res.","volume":"12","author":"G\u00f6nen Mehmet","year":"2011","unstructured":"Mehmet G\u00f6nen and Ethem Alpayd . 2011 . Multiple Kernel Learning Algorithms. J . Mach. Learn. Res. , Vol. 12 (July 2011), 2211--2268. Mehmet G\u00f6nen and Ethem Alpayd. 2011. Multiple Kernel Learning Algorithms. J. Mach. Learn. Res., Vol. 12 (July 2011), 2211--2268."},{"key":"e_1_3_2_2_8_1","volume-title":"Distilling the Knowledge in a Neural Network. CoRR","author":"Hinton Geoffrey E.","year":"2015","unstructured":"Geoffrey E. Hinton , Oriol Vinyals , and Jeffrey Dean . 2015. Distilling the Knowledge in a Neural Network. CoRR , Vol. abs\/ 1503 .02531 ( 2015 ). Geoffrey E. Hinton, Oriol Vinyals, and Jeffrey Dean. 2015. Distilling the Knowledge in a Neural Network. CoRR, Vol. abs\/1503.02531 (2015)."},{"volume-title":"Advances in Neural Information Processing Systems 31","author":"Jacot Arthur","key":"e_1_3_2_2_9_1","unstructured":"Arthur Jacot , Franck Gabriel , and Clement Hongler . 2018. Neural Tangent Kernel: Convergence and Generalization in Neural Networks . In Advances in Neural Information Processing Systems 31 , S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.). Curran Associates, Inc. , 8571--8580. Arthur Jacot, Franck Gabriel, and Clement Hongler. 2018. Neural Tangent Kernel: Convergence and Generalization in Neural Networks. In Advances in Neural Information Processing Systems 31, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.). Curran Associates, Inc., 8571--8580."},{"key":"e_1_3_2_2_10_1","first-page":"16","volume-title":"Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics","author":"Kim Yoon","year":"1865","unstructured":"Yoon Kim and Alexander M. Rush . 2016. Sequence-Level Knowledge Distillation . In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics , Austin, Texas, 1317--1327. https:\/\/doi.org\/10. 1865 3\/v1\/D 16 - 1139 10.18653\/v1 Yoon Kim and Alexander M. Rush. 2016. Sequence-Level Knowledge Distillation. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Austin, Texas, 1317--1327. https:\/\/doi.org\/10.18653\/v1\/D16-1139"},{"key":"e_1_3_2_2_11_1","volume-title":"4th International Conference on Learning Representations, ICLR 2016, San Juan, Puerto Rico, May 2-4, 2016, Conference Track Proceedings.","author":"Lopez-Paz David","year":"2016","unstructured":"David Lopez-Paz , L\u00e9 on Bottou , Bernhard Sch\u00f6 lkopf, and Vladimir Vapnik . 2016 . Unifying distillation and privileged information . In 4th International Conference on Learning Representations, ICLR 2016, San Juan, Puerto Rico, May 2-4, 2016, Conference Track Proceedings. David Lopez-Paz, L\u00e9 on Bottou, Bernhard Sch\u00f6 lkopf, and Vladimir Vapnik. 2016. Unifying distillation and privileged information. In 4th International Conference on Learning Representations, ICLR 2016, San Juan, Puerto Rico, May 2-4, 2016, Conference Track Proceedings."},{"key":"e_1_3_2_2_12_1","volume-title":"Mean-field theory of two-layers neural networks: dimension-free bounds and kernel limit. arXiv preprint arXiv:1902.06015","author":"Mei Song","year":"2019","unstructured":"Song Mei , Theodor Misiakiewicz , and Andrea Montanari . 2019. Mean-field theory of two-layers neural networks: dimension-free bounds and kernel limit. arXiv preprint arXiv:1902.06015 ( 2019 ). Song Mei, Theodor Misiakiewicz, and Andrea Montanari. 2019. Mean-field theory of two-layers neural networks: dimension-free bounds and kernel limit. arXiv preprint arXiv:1902.06015 (2019)."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2018.12.025"},{"key":"e_1_3_2_2_14_1","volume-title":"Proceedings of the 23rd International Conference on Neural Information Processing Systems -","volume":"2","author":"Pechyony Dmitry","year":"2010","unstructured":"Dmitry Pechyony and Vladimir Vapnik . 2010 . On the Theory of Learning with Privileged Information . In Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 2 (Vancouver, British Columbia, Canada) (NIPS'10). Curran Associates Inc., Red Hook, NY, USA , 1894--1902. Dmitry Pechyony and Vladimir Vapnik. 2010. On the Theory of Learning with Privileged Information. In Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 2 (Vancouver, British Columbia, Canada) (NIPS'10). Curran Associates Inc., Red Hook, NY, USA, 1894--1902."},{"key":"e_1_3_2_2_15_1","volume-title":"Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research","volume":"5151","author":"Phuong Mary","year":"2019","unstructured":"Mary Phuong and Christoph Lampert . 2019 . Towards Understanding Knowledge Distillation . In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research , Vol. 97),, Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, Long Beach, California, USA, 5142-- 5151 . Mary Phuong and Christoph Lampert. 2019. Towards Understanding Knowledge Distillation. In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 97),, Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, Long Beach, California, USA, 5142--5151."},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.5555\/2789272.2886814"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10472-017-9538-x"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2009.06.042"},{"key":"e_1_3_2_2_19_1","volume-title":"Williams and Matthias Seeger","author":"Christopher K.","year":"2001","unstructured":"Christopher K. I. Williams and Matthias Seeger . 2001 . Using the Nystr\u00f6m Method to Speed Up Kernel Machines. In Advances in Neural Information Processing Systems 13, T. K. Leen, T. G. Dietterich, and V. Tresp (Eds.). MIT Press , 682--688. Christopher K. I. Williams and Matthias Seeger. 2001. Using the Nystr\u00f6m Method to Speed Up Kernel Machines. In Advances in Neural Information Processing Systems 13, T. K. Leen, T. G. Dietterich, and V. Tresp (Eds.). MIT Press, 682--688."},{"key":"e_1_3_2_2_20_1","volume-title":"Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control. In 34th Conference on Neural Information Processing Systems (NeurIPS","author":"Xu Zhiyuan","year":"2020","unstructured":"Zhiyuan Xu , Kun Wu , Zhengping Che , Jian Tang , and Jieping Ye . 2020 . Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control. In 34th Conference on Neural Information Processing Systems (NeurIPS 2020). Zhiyuan Xu, Kun Wu, Zhengping Che, Jian Tang, and Jieping Ye. 2020. Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control. In 34th Conference on Neural Information Processing Systems (NeurIPS 2020)."},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.754"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.121"}],"event":{"name":"CIKM '22: The 31st ACM International Conference on Information and Knowledge Management","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Atlanta GA USA","acronym":"CIKM '22"},"container-title":["Proceedings of the 31st ACM International Conference on Information &amp; Knowledge Management"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3511808.3557237","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3511808.3557237","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:07Z","timestamp":1750182547000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3511808.3557237"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,17]]},"references-count":22,"alternative-id":["10.1145\/3511808.3557237","10.1145\/3511808"],"URL":"https:\/\/doi.org\/10.1145\/3511808.3557237","relation":{},"subject":[],"published":{"date-parts":[[2022,10,17]]},"assertion":[{"value":"2022-10-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}