{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,31]],"date-time":"2025-10-31T08:05:00Z","timestamp":1761897900772},"publisher-location":"California","reference-count":0,"publisher":"International Joint Conferences on Artificial Intelligence Organization","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,8]]},"abstract":"<jats:p>Audio visual segmentation (AVS) aims to segment the sounding objects for each frame of a given video. To distinguish the sounding objects from silent ones, both audio-visual semantic correspondence and temporal interaction are required. The previous method applies multi-frame cross-modal attention to conduct pixel-level interactions between audio features and visual features of multiple frames simultaneously, which is both redundant and implicit. In this paper, we propose an Audio-Queried Transformer architecture, AQFormer, where we define a set of object queries conditioned on audio information and associate each of them to particular sounding objects. Explicit object-level semantic correspondence between audio and visual modalities is established by gathering object information from visual features with predefined audio queries. Besides, an Audio-Bridged Temporal Interaction module is proposed to exchange sounding object-relevant information among multiple frames with the bridge of audio features. Extensive experiments are conducted on two AVS benchmarks to show that our method achieves state-of-the-art performances, especially 7.1% M_J and 7.6% M_F gains on the MS3 setting.<\/jats:p>","DOI":"10.24963\/ijcai.2023\/97","type":"proceedings-article","created":{"date-parts":[[2023,8,11]],"date-time":"2023-08-11T08:31:30Z","timestamp":1691742690000},"page":"875-883","source":"Crossref","is-referenced-by-count":22,"title":["Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation"],"prefix":"10.24963","author":[{"given":"Shaofei","family":"Huang","sequence":"first","affiliation":[{"name":"Institute of Information Engineering, Chinese Academy of Sciences"},{"name":"School of Cyber Security, University of Chinese Academy of Sciences"}]},{"given":"Han","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Beihang University"}]},{"given":"Yuqing","family":"Wang","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Hongji","family":"Zhu","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Jiao","family":"Dai","sequence":"additional","affiliation":[{"name":"Institute of Information Engineering, Chinese Academy of Sciences"},{"name":"School of Cyber Security, University of Chinese Academy of Sciences"}]},{"given":"Jizhong","family":"Han","sequence":"additional","affiliation":[{"name":"Institute of Information Engineering, Chinese Academy of Sciences"},{"name":"School of Cyber Security, University of Chinese Academy of Sciences"}]},{"given":"Wenge","family":"Rong","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Beihang University"}]},{"given":"Si","family":"Liu","sequence":"additional","affiliation":[{"name":"Institute of Artificial Intelligence, Beihang University"},{"name":"Hangzhou Innovation Institute, Beihang University"}]}],"member":"10584","event":{"number":"32","sponsor":["International Joint Conferences on Artificial Intelligence Organization (IJCAI)"],"acronym":"IJCAI-2023","name":"Thirty-Second International Joint Conference on Artificial Intelligence {IJCAI-23}","start":{"date-parts":[[2023,8,19]]},"theme":"Artificial Intelligence","location":"Macau, SAR China","end":{"date-parts":[[2023,8,25]]}},"container-title":["Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence"],"original-title":[],"deposited":{"date-parts":[[2023,8,11]],"date-time":"2023-08-11T08:35:13Z","timestamp":1691742913000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.ijcai.org\/proceedings\/2023\/97"}},"subtitle":[],"proceedings-subject":"Artificial Intelligence Research Articles","short-title":[],"issued":{"date-parts":[[2023,8]]},"references-count":0,"URL":"https:\/\/doi.org\/10.24963\/ijcai.2023\/97","relation":{},"subject":[],"published":{"date-parts":[[2023,8]]}}}