{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T17:58:29Z","timestamp":1775325509178,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":54,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Exploratory Research Project","award":["2022PG0AN01"],"award-info":[{"award-number":["2022PG0AN01"]}]},{"name":"Shanghai Science and Technology Program","award":["No. 21JC1400600"],"award-info":[{"award-number":["No. 21JC1400600"]}]},{"name":"Shuguang Program","award":["No. 20SG01"],"award-info":[{"award-number":["No. 20SG01"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3548313","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:43:12Z","timestamp":1665416592000},"page":"3224-3233","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Mix-DANN and Dynamic-Modal-Distillation for Video Domain Adaptation"],"prefix":"10.1145","author":[{"given":"Yuehao","family":"Yin","sequence":"first","affiliation":[{"name":"Fudan University, Shanghai, China"}]},{"given":"Bin","family":"Zhu","sequence":"additional","affiliation":[{"name":"City University of Hong Kong, Hong Kong, China"}]},{"given":"Jingjing","family":"Chen","sequence":"additional","affiliation":[{"name":"Fudan University, Shanghai, China"}]},{"given":"Lechao","family":"Cheng","sequence":"additional","affiliation":[{"name":"Zhejiang Lab, Hangzhou, China"}]},{"given":"Yu-Gang","family":"Jiang","sequence":"additional","affiliation":[{"name":"Fudan University, 
Shanghai, China"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00676"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7953145"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01172"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00233"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.502"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00642"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00189"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58610-2_40"},{"key":"e_1_3_2_2_9_1","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV). 720--736","author":"Damen Dima","year":"2018","unstructured":"Dima Damen, Hazel Doughty, Giovanni Maria Farinella, Sanja Fidler, Antonino Furnari, Evangelos Kazakos, Davide Moltisanti, Jonathan Munro, Toby Perrett, Will Price, et al. 2018. Scaling egocentric vision: The epic-kitchens dataset. In Proceedings of the European Conference on Computer Vision (ECCV). 720--736."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2020.2991965"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00630"},{"key":"e_1_3_2_2_12_1","volume-title":"Domain-adversarial training of neural networks. 
The journal of machine learning research 17, 1","author":"Ganin Yaroslav","year":"2016","unstructured":"Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, Fran\u00e7ois Laviolette, Mario Marchand, and Victor Lempitsky. 2016. Domain-adversarial training of neural networks. The journal of machine learning research 17, 1 (2016), 2096--2030."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-13560-1_76"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46493-0_36"},{"key":"e_1_3_2_2_15_1","volume-title":"2020 International Joint Conference on Neural Networks (IJCNN). IEEE, 1--8.","author":"Granger Eric","year":"2020","unstructured":"Eric Granger, Madhu Kiran, Jose Dolz, Louis-Antoine Blais-Morin, et al. 2020. Joint progressive knowledge distillation and unsupervised domain adaptation. In 2020 International Joint Conference on Neural Networks (IJCNN). IEEE, 1--8."},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00100"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00677"},{"key":"e_1_3_2_2_18_1","unstructured":"Geoffrey Hinton, Oriol Vinyals, Jeff Dean, et al. 2015. Distilling the knowledge in a neural network. 
arXiv preprint arXiv:1503.02531 2 7 (2015)."},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.167"},{"key":"e_1_3_2_2_20_1","first-page":"5","article-title":"Deep Domain Adaptation in Action Space","volume":"2","author":"Jamal Arshad","year":"2018","unstructured":"Arshad Jamal, Vinay P Namboodiri, Dipti Deodhare, and KS Venkatesh. 2018. Deep Domain Adaptation in Action Space. In BMVC, Vol. 2. 5.","journal-title":"BMVC"},{"key":"e_1_3_2_2_21_1","volume-title":"3D convolutional neural networks for human action recognition","author":"Ji Shuiwang","year":"2012","unstructured":"Shuiwang Ji, Wei Xu, Ming Yang, and Kai Yu. 2012. 3D convolutional neural networks for human action recognition. IEEE transactions on pattern analysis and machine intelligence 35, 1 (2012), 221--231."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.223"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01336"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126543"},{"key":"e_1_3_2_2_25_1","volume-title":"Large-scale domain adaptation via teacher-student learning. arXiv preprint arXiv:1708.05466","author":"Li Jinyu","year":"2017","unstructured":"Jinyu Li, Michael L Seltzer, Xi Wang, Rui Zhao, and Yifan Gong. 2017. Large-scale domain adaptation via teacher-student learning. 
arXiv preprint arXiv:1708.05466 (2017)."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2018.03.005"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00320"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475660"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00020"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00115"},{"key":"e_1_3_2_2_31_1","volume-title":"OR 2.0 Context-Aware Operating Theaters and Machine Learning in Clinical Neuroimaging","author":"Orbes-Arteainst Mauricio","unstructured":"Mauricio Orbes-Arteainst, Jorge Cardoso, Lauge S\u00f8rensen, Christian Igel, Sebastien Ourselin, Marc Modat, Mads Nielsen, and Akshay Pai. 2019. Knowledge distillation for semi-supervised domain adaptation. In OR 2.0 Context-Aware Operating Theaters and Machine Learning in Clinical Neuroimaging. Springer, 68--76."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6854"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240508.3240633"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.590"},{"key":"e_1_3_2_2_35_1","volume-title":"Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing. Advances in Neural Information Processing Systems 34","author":"Sahoo Aadarsh","year":"2021","unstructured":"Aadarsh Sahoo , Rutav Shah , Rameswar Panda , Kate Saenko , and Abir Das . 2021. Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing. 
Advances in Neural Information Processing Systems 34 ( 2021 ). Aadarsh Sahoo, Rutav Shah, Rameswar Panda, Kate Saenko, and Abir Das. 2021. Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing. Advances in Neural Information Processing Systems 34 (2021)."},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00392"},{"key":"e_1_3_2_2_37_1","volume-title":"Two-stream convolutional networks for action recognition in videos. Advances in neural information processing systems 27","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Two-stream convolutional networks for action recognition in videos. Advances in neural information processing systems 27 (2014)."},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00966"},{"key":"e_1_3_2_2_39_1","volume-title":"Amir Roshan Zamir, and Mubarak Shah","author":"Soomro Khurram","year":"2012","unstructured":"Khurram Soomro, Amir Roshan Zamir, and Mubarak Shah. 2012. UCF101: A dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402 (2012)."},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-49409-8_35"},{"key":"e_1_3_2_2_41_1","volume-title":"Unsupervised domain adaptation through self-supervision. arXiv preprint arXiv:1909.11825","author":"Sun Yu","year":"2019","unstructured":"
Yu Sun, Eric Tzeng, Trevor Darrell, and Alexei A Efros. 2019. Unsupervised domain adaptation through self-supervision. arXiv preprint arXiv:1909.11825 (2019)."},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.510"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00675"},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.316"},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2018.05.083"},{"key":"e_1_3_2_2_46_1","volume-title":"Bevt: Bert pretraining of video transformers. arXiv preprint arXiv:2112.01529","author":"Wang Rui","year":"2021","unstructured":"Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, and Lu Yuan. 2021. Bevt: Bert pretraining of video transformers. arXiv preprint arXiv:2112.01529 (2021)."},{"key":"e_1_3_2_2_47_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition. 7794--7803","author":"Girshick Ross","year":"2018","unstructured":"Xiaolong Wang, Ross Girshick, Abhinav Gupta, and Kaiming He. 2018. Non-local neural networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7794--7803."},{"key":"e_1_3_2_2_48_1","volume-title":"A dynamic frame selection framework for fast video recognition","author":"Li Hengduo","year":"2020","unstructured":"Zuxuan Wu, Hengduo Li , Caiming Xiong , Yu-Gang Jiang , and Larry Steven Davis . 2020. A dynamic frame selection framework for fast video recognition . 
IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2020 ). Zuxuan Wu, Hengduo Li, Caiming Xiong, Yu-Gang Jiang, and Larry Steven Davis. 2020. A dynamic frame selection framework for fast video recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (2020)."},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-021-01508-1"},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01267-0_19"},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475272"},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.547"},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2016.7472666"},{"key":"e_1_3_2_2_54_1","volume-title":"Twenty-Fourth International Joint Conference on Artificial Intelligence.","author":"Zhuang Fuzhen","year":"2015","unstructured":"Fuzhen Zhuang, Xiaohu Cheng, Ping Luo, Sinno Jialin Pan, and Qing He. 2015. Supervised representation learning: Transfer learning with deep autoencoders. 
In Twenty-Fourth International Joint Conference on Artificial Intelligence."}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","location":"Lisboa Portugal","acronym":"MM '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548313","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3548313","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:43Z","timestamp":1750186843000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548313"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":54,"alternative-id":["10.1145\/3503161.3548313","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3548313","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}