{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,22]],"date-time":"2025-10-22T23:14:19Z","timestamp":1761174859981,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,10,29]],"date-time":"2023-10-29T00:00:00Z","timestamp":1698537600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,10,29]]},"DOI":"10.1145\/3607865.3613181","type":"proceedings-article","created":{"date-parts":[[2023,10,17]],"date-time":"2023-10-17T18:12:36Z","timestamp":1697566356000},"page":"13-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["First-order Multi-label Learning with Cross-modal Interactions for Multimodal Emotion Recognition"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0009-4431-2886","authenticated-orcid":false,"given":"Yunrui","family":"Cai","sequence":"first","affiliation":[{"name":"Tsinghua University, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-2050-263X","authenticated-orcid":false,"given":"Jingran","family":"Xie","sequence":"additional","affiliation":[{"name":"Tsinghua University, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-6786-4806","authenticated-orcid":false,"given":"Boshi","family":"Tang","sequence":"additional","affiliation":[{"name":"Tsinghua University, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-8766-3118","authenticated-orcid":false,"given":"Yuanyuan","family":"Wang","sequence":"additional","affiliation":[{"name":"Tsinghua University, Shenzhen, 
China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7201-1989","authenticated-orcid":false,"given":"Jun","family":"Chen","sequence":"additional","affiliation":[{"name":"Tsinghua University, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7318-9682","authenticated-orcid":false,"given":"Haiwei","family":"Xue","sequence":"additional","affiliation":[{"name":"Tsinghua University, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8533-0524","authenticated-orcid":false,"given":"Zhiyong","family":"Wu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Shenzhen, China"}]}],"member":"320","published-online":{"date-parts":[[2023,10,29]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Mer 2023: Multi-label learning, modality robustness, and semi-supervised learning. arXiv preprint arXiv:2304.08981","author":"Lian Zheng","year":"2023","unstructured":"Zheng Lian , Haiyang Sun , Licai Sun , Jinming Zhao , Ye Liu , Bin Liu , Jiangyan Yi , Meng Wang , Erik Cambria , Guoying Zhao , Mer 2023: Multi-label learning, modality robustness, and semi-supervised learning. arXiv preprint arXiv:2304.08981 , 2023 . Zheng Lian, Haiyang Sun, Licai Sun, Jinming Zhao, Ye Liu, Bin Liu, Jiangyan Yi, Meng Wang, Erik Cambria, Guoying Zhao, et al. Mer 2023: Multi-label learning, modality robustness, and semi-supervised learning. 
arXiv preprint arXiv:2304.08981, 2023."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3551876.3554805"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3129340"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2017.02.003"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2018.2882362"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3475957.3484456"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3475957.3484457"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3475957.3484454"},{"key":"e_1_3_2_1_9_1","volume-title":"A review on multi-label learning algorithms","author":"Zhang Min-Ling","year":"2013","unstructured":"Min-Ling Zhang and Zhi-Hua Zhou . A review on multi-label learning algorithms . IEEE transactions on knowledge and data engineering, 26(8): 1819 --1837, 2013. Min-Ling Zhang and Zhi-Hua Zhou. A review on multi-label learning algorithms. IEEE transactions on knowledge and data engineering, 26(8):1819--1837, 2013."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2004.03.009"},{"key":"e_1_3_2_1_11_1","volume-title":"A kernel method for multi-labelled classification. Advances in neural information processing systems, 14","author":"Elisseeff Andr\u00e9","year":"2001","unstructured":"Andr\u00e9 Elisseeff and Jason Weston . A kernel method for multi-labelled classification. Advances in neural information processing systems, 14 , 2001 . Andr\u00e9 Elisseeff and Jason Weston. A kernel method for multi-labelled classification. 
Advances in neural information processing systems, 14, 2001."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1099554.1099591"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/3091529.3091560"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","first-page":"995","DOI":"10.1109\/ICDM.2008.74","volume-title":"Multi-label classification using ensembles of pruned sets. In 2008 eighth IEEE international conference on data mining","author":"Read Jesse","year":"2008","unstructured":"Jesse Read , Bernhard Pfahringer , and Geoff Holmes . Multi-label classification using ensembles of pruned sets. In 2008 eighth IEEE international conference on data mining , pages 995 -- 1000 . IEEE , 2008 . Jesse Read, Bernhard Pfahringer, and Geoff Holmes. Multi-label classification using ensembles of pruned sets. In 2008 eighth IEEE international conference on data mining, pages 995--1000. IEEE, 2008."},{"key":"e_1_3_2_1_15_1","first-page":"896","volume-title":"Workshop on challenges in representation learning, ICML","volume":"3","author":"Dong-Hyun","unstructured":"Dong-Hyun Lee et al. Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks . In Workshop on challenges in representation learning, ICML , volume 3 , page 896 . Atlanta, 2013. Dong-Hyun Lee et al. Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks. In Workshop on challenges in representation learning, ICML, volume 3, page 896. Atlanta, 2013."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i14.17558"},{"key":"e_1_3_2_1_17_1","volume-title":"Semi-supervised relation extraction via incremental meta self-training. arXiv preprint arXiv:2010.16410","author":"Hu Xuming","year":"2020","unstructured":"Xuming Hu , Chenwei Zhang , Fukun Ma , Chenyao Liu , Lijie Wen , and Philip S Yu . Semi-supervised relation extraction via incremental meta self-training. 
arXiv preprint arXiv:2010.16410 , 2020 . Xuming Hu, Chenwei Zhang, Fukun Ma, Chenyao Liu, Lijie Wen, and Philip S Yu. Semi-supervised relation extraction via incremental meta self-training. arXiv preprint arXiv:2010.16410, 2020."},{"key":"e_1_3_2_1_18_1","volume-title":"Learning with pseudoensembles. Advances in neural information processing systems, 27","author":"Bachman Philip","year":"2014","unstructured":"Philip Bachman , Ouais Alsharif , and Doina Precup . Learning with pseudoensembles. Advances in neural information processing systems, 27 , 2014 . Philip Bachman, Ouais Alsharif, and Doina Precup. Learning with pseudoensembles. Advances in neural information processing systems, 27, 2014."},{"key":"e_1_3_2_1_19_1","volume-title":"Specaugment: A simple data augmentation method for automatic speech recognition. arXiv preprint arXiv:1904.08779","author":"Park Daniel S","year":"2019","unstructured":"Daniel S Park , William Chan , Yu Zhang , Chung-Cheng Chiu , Barret Zoph , Ekin D Cubuk , and Quoc V Le . Specaugment: A simple data augmentation method for automatic speech recognition. arXiv preprint arXiv:1904.08779 , 2019 . Daniel S Park, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin D Cubuk, and Quoc V Le. Specaugment: A simple data augmentation method for automatic speech recognition. arXiv preprint arXiv:1904.08779, 2019."},{"key":"e_1_3_2_1_20_1","volume-title":"Mixtext: Linguistically-informed interpolation of hidden space for semi-supervised text classification. arXiv preprint arXiv:2004.12239","author":"Chen Jiaao","year":"2020","unstructured":"Jiaao Chen , Zichao Yang , and Diyi Yang . Mixtext: Linguistically-informed interpolation of hidden space for semi-supervised text classification. arXiv preprint arXiv:2004.12239 , 2020 . Jiaao Chen, Zichao Yang, and Diyi Yang. Mixtext: Linguistically-informed interpolation of hidden space for semi-supervised text classification. 
arXiv preprint arXiv:2004.12239, 2020."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01220"},{"key":"e_1_3_2_1_22_1","volume-title":"Attention is all you need. Advances in neural information processing systems, 30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N Gomez , \u0141ukasz Kaiser, and Illia Polosukhin . Attention is all you need. Advances in neural information processing systems, 30 , 2017 . Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. Attention is all you need. Advances in neural information processing systems, 30, 2017."},{"key":"e_1_3_2_1_23_1","volume-title":"J. Zico Kolter, Louis-Philippe Morency, and Ruslan Salakhutdinov. Multimodal transformer for unaligned multimodal language sequences. CoRR, abs\/1906.00295","author":"Hubert Tsai Yao-Hung","year":"2019","unstructured":"Yao-Hung Hubert Tsai , Shaojie Bai , Paul Pu Liang , J. Zico Kolter, Louis-Philippe Morency, and Ruslan Salakhutdinov. Multimodal transformer for unaligned multimodal language sequences. CoRR, abs\/1906.00295 , 2019 . Yao-Hung Hubert Tsai, Shaojie Bai, Paul Pu Liang, J. Zico Kolter, Louis-Philippe Morency, and Ruslan Salakhutdinov. Multimodal transformer for unaligned multimodal language sequences. CoRR, abs\/1906.00295, 2019."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1002\/widm.1249"},{"key":"e_1_3_2_1_25_1","volume-title":"USA","author":"Kumar Alok","year":"2020","unstructured":"Alok Kumar and J Mayank . Ensemble learning for ai developers. BApress: Berkeley, CA , USA , 2020 . Alok Kumar and J Mayank. Ensemble learning for ai developers. 
BApress: Berkeley, CA, USA, 2020."},{"key":"e_1_3_2_1_26_1","first-page":"1","volume-title":"2018 First Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia)","author":"Li Ya","year":"2017","unstructured":"Ya Li , Jianhua Tao , Bj\u00f6rn Schuller , Shiguang Shan , Dongmei Jiang , and Jia Jia . Mec 2017 : Multimodal emotion recognition challenge . In 2018 First Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia) , pages 1 -- 5 . IEEE, 2018. Ya Li, Jianhua Tao, Bj\u00f6rn Schuller, Shiguang Shan, Dongmei Jiang, and Jia Jia. Mec 2017: Multimodal emotion recognition challenge. In 2018 First Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia), pages 1--5. IEEE, 2018."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3093397"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.277"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-42051-1_16"},{"key":"e_1_3_2_1_31_1","volume-title":"wav2vec: Unsupervised pre-training for speech recognition. arXiv preprint arXiv:1904.05862","author":"Schneider Steffen","year":"2019","unstructured":"Steffen Schneider , Alexei Baevski , Ronan Collobert , and Michael Auli . wav2vec: Unsupervised pre-training for speech recognition. arXiv preprint arXiv:1904.05862 , 2019 . Steffen Schneider, Alexei Baevski, Ronan Collobert, and Michael Auli. wav2vec: Unsupervised pre-training for speech recognition. arXiv preprint arXiv:1904.05862, 2019."},{"key":"e_1_3_2_1_32_1","volume-title":"wav2vec 2.0: A framework for self-supervised learning of speech representations. Advances in neural information processing systems, 33:12449--12460","author":"Baevski Alexei","year":"2020","unstructured":"Alexei Baevski , Yuhao Zhou , Abdelrahman Mohamed , and Michael Auli . 
wav2vec 2.0: A framework for self-supervised learning of speech representations. Advances in neural information processing systems, 33:12449--12460 , 2020 . Alexei Baevski, Yuhao Zhou, Abdelrahman Mohamed, and Michael Auli. wav2vec 2.0: A framework for self-supervised learning of speech representations. Advances in neural information processing systems, 33:12449--12460, 2020."},{"key":"e_1_3_2_1_33_1","volume-title":"Hubert: Self-supervised speech representation learning by masked prediction of hidden units","author":"Hsu Wei-Ning","year":"2021","unstructured":"Wei-Ning Hsu , Benjamin Bolte , Yao-Hung Hubert Tsai , Kushal Lakhotia, Ruslan Salakhutdinov, and Abdelrahman Mohamed. Hubert: Self-supervised speech representation learning by masked prediction of hidden units . IEEE\/ACM Transactions on Audio, Speech, and Language Processing , 29:3451--3460, 2021 . Wei-Ning Hsu, Benjamin Bolte, Yao-Hung Hubert Tsai, Kushal Lakhotia, Ruslan Salakhutdinov, and Abdelrahman Mohamed. Hubert: Self-supervised speech representation learning by masked prediction of hidden units. 
IEEE\/ACM Transactions on Audio, Speech, and Language Processing, 29:3451--3460, 2021."}],"event":{"name":"MM '23: The 31st ACM International Conference on Multimedia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Ottawa ON Canada","acronym":"MM '23"},"container-title":["Proceedings of the 1st International Workshop on Multimodal and Responsible Affective Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3607865.3613181","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3607865.3613181","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:06Z","timestamp":1750178226000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3607865.3613181"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,29]]},"references-count":33,"alternative-id":["10.1145\/3607865.3613181","10.1145\/3607865"],"URL":"https:\/\/doi.org\/10.1145\/3607865.3613181","relation":{},"subject":[],"published":{"date-parts":[[2023,10,29]]},"assertion":[{"value":"2023-10-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}