{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T10:35:12Z","timestamp":1776854112543,"version":"3.51.2"},"reference-count":73,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2022,9,8]],"date-time":"2022-09-08T00:00:00Z","timestamp":1662595200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"crossref","award":["2021ZD0110303"],"award-info":[{"award-number":["2021ZD0110303"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001809","name":"National Science Foundation of China","doi-asserted-by":"crossref","award":["62006244"],"award-info":[{"award-number":["62006244"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Young Elite Scientist Sponsorship Program of China Association for Science and Technology","award":["YESS20200140"],"award-info":[{"award-number":["YESS20200140"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2022,12,31]]},"abstract":"<jats:p>\n            Regularization that incorporates the linear combination of empirical loss and explicit regularization terms as the loss function has been frequently used for many machine learning tasks. The explicit regularization term is designed in different types, depending on its applications. While regularized learning often boost the performance with higher accuracy and faster convergence, the regularization would sometimes hurt the empirical loss minimization and lead to poor performance. To deal with such issues in this work, we propose a novel strategy, namely\n            <jats:italic>\n              <jats:underline>Gr<\/jats:underline>\n              adients\n              <jats:underline>O<\/jats:underline>\n              rthogonal\n              <jats:underline>D<\/jats:underline>\n              ecomposition\n            <\/jats:italic>\n            (\n            <jats:monospace\/>\n            <jats:bold>GrOD<\/jats:bold>\n            ), that improves the training procedure of regularized deep learning. Instead of linearly combining gradients of the two terms,\n            <jats:monospace\/>\n            <jats:bold>GrOD<\/jats:bold>\n            re-estimates a new direction for iteration that does not hurt the empirical loss minimization while preserving the regularization affects, through orthogonal decomposition. We have performed extensive experiments to use\n            <jats:monospace\/>\n            <jats:bold>GrOD<\/jats:bold>\n            improving the commonly used algorithms of transfer learning\u00a0[\n            <jats:xref ref-type=\"bibr\">2<\/jats:xref>\n            ], knowledge distillation\u00a0[\n            <jats:xref ref-type=\"bibr\">3<\/jats:xref>\n            ], and adversarial learning\u00a0[\n            <jats:xref ref-type=\"bibr\">4<\/jats:xref>\n            ]. The experiment results based on large datasets, including Caltech 256\u00a0[\n            <jats:xref ref-type=\"bibr\">5<\/jats:xref>\n            ], MIT indoor 67\u00a0[\n            <jats:xref ref-type=\"bibr\">6<\/jats:xref>\n            ], CIFAR-10\u00a0[\n            <jats:xref ref-type=\"bibr\">7<\/jats:xref>\n            ], and ImageNet\u00a0[\n            <jats:xref ref-type=\"bibr\">8<\/jats:xref>\n            ], show significant improvement made by\n            <jats:monospace\/>\n            <jats:bold>GrOD<\/jats:bold>\n            for all three algorithms in all cases.\n          <\/jats:p>","DOI":"10.1145\/3530836","type":"journal-article","created":{"date-parts":[[2022,4,18]],"date-time":"2022-04-18T11:38:40Z","timestamp":1650281920000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":17,"title":["<b>GrOD<\/b>\n            : Deep Learning with Gradients Orthogonal Decomposition for Knowledge Transfer, Distillation, and Adversarial Training"],"prefix":"10.1145","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5451-3253","authenticated-orcid":false,"given":"Haoyi","family":"Xiong","sequence":"first","affiliation":[{"name":"Baidu, Inc., Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2115-6621","authenticated-orcid":false,"given":"Ruosi","family":"Wan","sequence":"additional","affiliation":[{"name":"Peking University, and Baidu, Inc., Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3508-756X","authenticated-orcid":false,"given":"Jian","family":"Zhao","sequence":"additional","affiliation":[{"name":"Institute of North Electronic Equipment, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6286-0581","authenticated-orcid":false,"given":"Zeyu","family":"Chen","sequence":"additional","affiliation":[{"name":"Baidu, Inc., Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8073-7552","authenticated-orcid":false,"given":"Xingjian","family":"Li","sequence":"additional","affiliation":[{"name":"Baidu, Inc., Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2141-6553","authenticated-orcid":false,"given":"Zhanxing","family":"Zhu","sequence":"additional","affiliation":[{"name":"Peking University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7020-1604","authenticated-orcid":false,"given":"Jun","family":"Huan","sequence":"additional","affiliation":[{"name":"Baidu, Inc., Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,9,8]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2019.00068"},{"key":"e_1_3_2_3_2","article-title":"Explicit inductive bias for transfer learning with convolutional networks","author":"Li Xuhong","year":"2018","unstructured":"Xuhong Li, Yves Grandvalet, and Franck Davoine. 2018. Explicit inductive bias for transfer learning with convolutional networks. In Proceedings of the 35th International Conference on Machine Learning.","journal-title":"Proceedings of the 35th International Conference on Machine Learning"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.754"},{"key":"e_1_3_2_5_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Madry Aleksander","year":"2018","unstructured":"Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, and Adrian Vladu. 2018. Towards deep learning models resistant to adversarial attacks. In Proceedings of the International Conference on Learning Representations. Retrieved from https:\/\/openreview.net\/forum?id=rJzIBfZAb."},{"key":"e_1_3_2_6_2","unstructured":"Gregory Griffin Alex Holub and Pietro Perona. 2007. Caltech-256 object category dataset. Retrieved on 28 July 2022 https:\/\/authors.library.caltech.edu\/7694\/1\/CNS-TR-2007-001.pdf."},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206537"},{"key":"e_1_3_2_8_2","unstructured":"Alex Krizhevsky Vinod Nair and Geoffrey Hinton. 2014. The CIFAR-10 dataset. Retrieved 28 July 2022 from http:\/\/www.cs.toronto.edu\/kriz\/cifar.html."},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_10_2","doi-asserted-by":"crossref","unstructured":"Qingzhong Wang Pengfei Zhang Haoyi Xiong and Jian Zhao. 2022. Face. evolve: A high-performance face recognition library. Neurocomputing 494 (2022) 443\u2013445.","DOI":"10.1016\/j.neucom.2022.04.118"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00235"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-019-01181-5"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/WACVW54805.2022.00026"},{"key":"e_1_3_2_14_2","article-title":"Distance-aware molecule graph attention network for drug-target binding affinity prediction","author":"Zhou Jingbo","year":"2020","unstructured":"Jingbo Zhou, Shuangli Li, Liang Huang, Haoyi Xiong, Fan Wang, Tong Xu, Hui Xiong, and Dejing Dou. 2020. Distance-aware molecule graph attention network for drug-target binding affinity prediction. arXiv:2012.09624. Retrieved from https:\/\/arxiv.org\/abs\/2012.09624.","journal-title":"arXiv:2012.09624"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1145\/3447548.3467311"},{"key":"e_1_3_2_16_2","first-page":"4892","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"35","author":"Xiao Congxi","year":"2021","unstructured":"Congxi Xiao, Jingbo Zhou, Jizhou Huang, An Zhuo, Ji Liu, Haoyi Xiong, and Dejing Dou. 2021. C-Watcher: A framework for early detection of high-risk neighborhoods ahead of COVID-19 outbreak. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 4892\u20134900."},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2022.3149815"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.5555\/3086952"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/72.788640"},{"key":"e_1_3_2_20_2","article-title":"What makes ImageNet good for transfer learning?","author":"Huh Minyoung","year":"2016","unstructured":"Minyoung Huh, Pulkit Agrawal, and Alexei A. Efros. 2016. What makes ImageNet good for transfer learning? arXiv:1608.08614. Retrieved from https:\/\/arxiv.org\/abs\/1608.08614.","journal-title":"arXiv:1608.08614"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1611835114"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2773081"},{"key":"e_1_3_2_23_2","unstructured":"Geoffrey Hinton Oriol Vinyals and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 2 7 (2015)."},{"key":"e_1_3_2_24_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Song Yang","year":"2018","unstructured":"Yang Song, Taesup Kim, Sebastian Nowozin, Stefano Ermon, and Nate Kushman. 2018. PixelDefend: Leveraging generative models to understand and defend against adversarial examples. In Proceedings of the International Conference on Learning Representations. Retrieved from https:\/\/openreview.net\/forum?id=rJUYGxbCW."},{"key":"e_1_3_2_25_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Li Xingjian","year":"2019","unstructured":"Xingjian Li, Haoyi Xiong, Hanchao Wang, Yuxuan Rao, Liping Liu, and Jun Huan. 2019. DELTA: Deep learning transfer using feature map with attention for convolutional networks. In Proceedings of the International Conference on Learning Representations. Retrieved from https:\/\/openreview.net\/forum?id=rkgbwsAcYm."},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2009.191"},{"key":"e_1_3_2_27_2","first-page":"6467","volume-title":"Proceedings of the 31st International Conference on Neural Information Processing Systems","author":"Lopez-Paz David","year":"2017","unstructured":"David Lopez-Paz et\u00a0al. 2017. Gradient episodic memory for continual learning. In Proceedings of the 31st International Conference on Neural Information Processing Systems. 6467\u20136476."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01287"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v29i1.9179"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2016.2644866"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2019.2903448"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2873305"},{"key":"e_1_3_2_33_2","first-page":"647","volume-title":"Proceedings of the 31st International Conference on International Conference on Machine Learning","author":"Donahue Jeff","year":"2014","unstructured":"Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, and Trevor Darrell. 2014. DeCAF: A deep convolutional activation feature for generic visual recognition. In Proceedings of the 31st International Conference on International Conference on Machine Learning. 647\u2013655."},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00432"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v31i1.10801"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2017.309"},{"key":"e_1_3_2_37_2","doi-asserted-by":"crossref","unstructured":"Yinghua Zhang Yu Zhang and Qiang Yang. 2018. Parameter transfer unit for deep neural networks. In Pacific-Asia Conference on Knowledge Discovery and Data Mining . Springer 82\u201395.","DOI":"10.1007\/978-3-030-16145-3_7"},{"key":"e_1_3_2_38_2","first-page":"3320","volume-title":"Proceedings of the 27th International Conference on Neural Information Processing Systems","author":"Yosinski Jason","year":"2014","unstructured":"Jason Yosinski, Jeff Clune, Yoshua Bengio, and Hod Lipson. 2014. How transferable are features in deep neural networks? In Proceedings of the 27th International Conference on Neural Information Processing Systems. 3320\u20133328."},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.9"},{"key":"e_1_3_2_40_2","article-title":"Manifold transfer learning via discriminant regression analysis","author":"Lu Yuwu","year":"2020","unstructured":"Yuwu Lu, Wenjing Wang, Chun Yuan, Xuelong Li, and Zhihui Lai. 2020. Manifold transfer learning via discriminant regression analysis. IEEE Transactions on Multimedia 23 (2020), 2056\u20132070.","journal-title":"IEEE Transactions on Multimedia"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2019.2951463"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007379606734"},{"key":"e_1_3_2_43_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Polino Antonio","year":"2018","unstructured":"Antonio Polino, Razvan Pascanu, and Dan Alistarh. 2018. Model compression via distillation and quantization. In Proceedings of the International Conference on Learning Representations. Retrieved from https:\/\/openreview.net\/forum?id=S1XolQbRW."},{"key":"e_1_3_2_44_2","unstructured":"Adriana Romero Nicolas Ballas Samira Ebrahimi Kahou Antoine Chassang Carlo Gatta and Yoshua Bengio. 2014. Fitnets: Hints for thin deep nets. arXiv preprint arXiv:1412.6550."},{"key":"e_1_3_2_45_2","unstructured":"Sergey Zagoruyko and Nikos Komodakis. 2017. Paying more attention to attention: Improving the performance of convolutional neural networks via attention transfer. In International Conference on Learning Representations . https:\/\/openreview.net\/forum?id=Sks9_ajex."},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1145\/1081870.1081950"},{"key":"e_1_3_2_47_2","unstructured":"Ian J. Goodfellow Jonathon Shlens and Christian Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572."},{"key":"e_1_3_2_48_2","unstructured":"Alexey Kurakin Ian Goodfellow and Samy Bengio. 2017. Adversarial machine learning at scale. In International Conference on Learning Representations . https:\/\/openreview.net\/forum?id=BJm4T4Kgx."},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/3128572.3140448"},{"key":"e_1_3_2_50_2","unstructured":"Haishan Ye Zhichao Huang Cong Fang Chris Junchi Li and Tong Zhang. 2018. Hessian-aware zeroth-order optimization for black-box adversarial attack. arXiv preprint arXiv:1812.11377."},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2016.41"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/3052973.3053009"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1145\/3134599"},{"key":"e_1_3_2_54_2","first-page":"2204","volume-title":"Proceedings of the 27th International Conference on Neural Information Processing Systems","author":"Mnih Volodymyr","year":"2014","unstructured":"Volodymyr Mnih, Nicolas Heess, Alex Graves, and Koray Kavukcuoglu. 2014. Recurrent models of visual attention. In Proceedings of the 27th International Conference on Neural Information Processing Systems. 2204\u20132212."},{"key":"e_1_3_2_55_2","first-page":"2048","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Xu Kelvin","year":"2015","unstructured":"Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, and Yoshua Bengio. 2015. Show, attend and tell: Neural image caption generation with visual attention. In Proceedings of the International Conference on Machine Learning. 2048\u20132057."},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.10"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_58_2","article-title":"Places: A 10 million image database for scene recognition","author":"Zhou Bolei","year":"2017","unstructured":"Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba. 2017. Places: A 10 million image database for scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 6 (2017), 1452\u20131464.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_2_59_2","first-page":"1","volume-title":"Proceedings of the CVPR Workshop on Fine-Grained Visual Categorization","volume":"2","author":"Khosla Aditya","year":"2011","unstructured":"Aditya Khosla, Nityananda Jayadevaprakash, Bangpeng Yao, and Fei-Fei Li. 2011. Novel dataset for fine-grained image categorization: Stanford dogs. In Proceedings of the CVPR Workshop on Fine-Grained Visual Categorization, Vol. 2. 1."},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICVGIP.2008.47"},{"key":"e_1_3_2_61_2","unstructured":"Han Xiao Kashif Rasul and Roland Vollgraf. 2017. Fashion-MNIST: A novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:1708.07747."},{"key":"e_1_3_2_62_2","first-page":"1139","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Sutskever Ilya","year":"2013","unstructured":"Ilya Sutskever, James Martens, George Dahl, and Geoffrey Hinton. 2013. On the importance of initialization and momentum in deep learning. In Proceedings of the International Conference on Machine Learning. 1139\u20131147."},{"key":"e_1_3_2_63_2","unstructured":"Benjamin Recht Rebecca Roelofs Ludwig Schmidt and Vaishaal Shankar. 2018. Do CIFAR-10 classifiers generalize to CIFAR-10? arXiv preprint arXiv:1806.00451."},{"key":"e_1_3_2_64_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Zhang Guodong","year":"2018","unstructured":"Guodong Zhang, Chaoqi Wang, Bowen Xu, and Roger B. Grosse. 2018. Three mechanisms of weight decay regularization. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_2_65_2","first-page":"10677","volume-title":"Proceedings of the 33rd International Conference on Neural Information Processing Systems","volume":"32","author":"Golatkar Aditya Sharad","year":"2019","unstructured":"Aditya Sharad Golatkar, Alessandro Achille, and Stefano Soatto. 2019. Time matters in regularizing deep networks: Weight decay and data augmentation affect early learning dynamics, matter little near convergence. In Proceedings of the 33rd International Conference on Neural Information Processing Systems, Vol. 32. 10677\u201310687."},{"key":"e_1_3_2_66_2","first-page":"10367","volume-title":"Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research)","volume":"119","author":"Wu Jingfeng","year":"2020","unstructured":"Jingfeng Wu, Wenqing Hu, Haoyi Xiong, Jun Huan, Vladimir Braverman, and Zhanxing Zhu. 2020. On the noisy gradient descent that generalizes as SGD. In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research). Hal Daum\u00e9 III and Aarti Singh (Eds.), Vol. 119, PMLR, 10367\u201310376. Retrieved from http:\/\/proceedings.mlr.press\/v119\/wu20c.html."},{"key":"e_1_3_2_67_2","doi-asserted-by":"publisher","DOI":"10.1145\/3406325.3465355"},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1145\/3473912"},{"key":"e_1_3_2_69_2","first-page":"Vol. 32, 1908\u20131","article-title":"Catastrophic forgetting meets negative transfer: Batch spectral shrinkage for safe transfer learning","author":"Chen Xinyang","year":"2019","unstructured":"Xinyang Chen, Sinan Wang, Bo Fu, Mingsheng Long, and Jianmin Wang. 2019. Catastrophic forgetting meets negative transfer: Batch spectral shrinkage for safe transfer learning. In Proceedings of the 33rd International Conference on Neural Information Processing Systems. Vol. 32, 1908\u20131918.","journal-title":"Proceedings of the 33rd International Conference on Neural Information Processing Systems"},{"key":"e_1_3_2_70_2","article-title":"Co-tuning for transfer learning","author":"You Kaichao","year":"2020","unstructured":"Kaichao You, Zhi Kou, Mingsheng Long, and Jianmin Wang. 2020. Co-tuning for transfer learning. In Proceedings of the 34th International Conference on Neural Information Processing Systems. Vol. 33.","journal-title":"Proceedings of the 34th International Conference on Neural Information Processing Systems"},{"key":"e_1_3_2_71_2","doi-asserted-by":"publisher","DOI":"10.1145\/3472668"},{"key":"e_1_3_2_72_2","doi-asserted-by":"crossref","unstructured":"Yihang Yin Qingzhong Wang Siyu Huang Haoyi Xiong and Xiang Zhang. 2022. AutoGCL: Automated graph contrastive learning via learnable view generators. Proceedings of the AAAI Conference on Artificial Intelligence 36 8 (2022) 8892\u20138900.","DOI":"10.1609\/aaai.v36i8.20871"},{"key":"e_1_3_2_73_2","article-title":"Interpretable deep learning: Interpretation, interpretability, trustworthiness, and beyond","author":"Li Xuhong","year":"2021","unstructured":"Xuhong Li, Haoyi Xiong, Xingjian Li, Xuanyu Wu, Xiao Zhang, Ji Liu, Jiang Bian, and Dejing Dou. 2021. Interpretable deep learning: Interpretation, interpretability, trustworthiness, and beyond. arXiv:2103.10689. Retrieved from https:\/\/arxiv.org\/abs\/2103.10689.","journal-title":"arXiv:2103.10689"},{"key":"e_1_3_2_74_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2021\/467"}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3530836","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3530836","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:09:26Z","timestamp":1750183766000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3530836"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,8]]},"references-count":73,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2022,12,31]]}},"alternative-id":["10.1145\/3530836"],"URL":"https:\/\/doi.org\/10.1145\/3530836","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,9,8]]},"assertion":[{"value":"2021-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-09-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}