{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T05:33:04Z","timestamp":1772861584456,"version":"3.50.1"},"reference-count":79,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2023,9,18]],"date-time":"2023-09-18T00:00:00Z","timestamp":1694995200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Shenzhen Science and Technology Program","award":["JCYJ20220818103401003, JCYJ20220530172403007"],"award-info":[{"award-number":["JCYJ20220818103401003, JCYJ20220530172403007"]}]},{"name":"General Program of Natural Science Foundation of Guangdong Province","award":["2022A1515010170"],"award-info":[{"award-number":["2022A1515010170"]}]},{"name":"Free Exploration of Basic Research Project, Local Science and Technology Development Fund Guided by the Central Government of China","award":["2021Szvup060"],"award-info":[{"award-number":["2021Szvup060"]}]},{"name":"Research Grant entitled \u201cSelf-Supervised Learning for Medical Images\u201d\u2019","award":["871228"],"award-info":[{"award-number":["871228"]}]},{"name":"Shenzhen University-Lingnan University Joint Research Programme","award":["SZU-LU006\/ 2122"],"award-info":[{"award-number":["SZU-LU006\/ 2122"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2024,1,31]]},"abstract":"<jats:p>\n            Exploiting long-range semantic contexts and geometric information is crucial to infer salient objects from RGB and depth features. However, existing methods mainly focus on excavating local features within fixed regions by continuously feeding forward networks. In this article, we introduce Dynamic Message Propagation (DMP) to dynamically learn context information within more flexible regions. We integrate DMP into a Siamese-based network to process the RGB image and depth map separately and design a multi-level feature fusion module to explore cross-level information between refined RGB and depth features. Extensive experiments show clear improvements of our method over 17 methods on six benchmark datasets for RGB-D salient object detection (SOD). Additionally, our method outperforms its competitors for the video SOD task. Code is available at\n            <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"url\" xlink:href=\"https:\/\/github.com\/chenbaian-cs\/DMPNet\">https:\/\/github.com\/chenbaian-cs\/DMPNet<\/jats:ext-link>\n            .\n          <\/jats:p>","DOI":"10.1145\/3597612","type":"journal-article","created":{"date-parts":[[2023,5,19]],"date-time":"2023-05-19T09:48:12Z","timestamp":1684489692000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["Dynamic Message Propagation Network for RGB-D and Video Salient Object Detection"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1447-8991","authenticated-orcid":false,"given":"Baian","family":"Chen","sequence":"first","affiliation":[{"name":"Shenzhen Research Institute, Nanjing University of Aeronautics and Astronautics, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3853-4046","authenticated-orcid":false,"given":"Zhilei","family":"Chen","sequence":"additional","affiliation":[{"name":"Shenzhen Research Institute, Nanjing University of Aeronautics and Astronautics, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5708-7018","authenticated-orcid":false,"given":"Xiaowei","family":"Hu","sequence":"additional","affiliation":[{"name":"Shanghai AI Laboratory, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1602-538X","authenticated-orcid":false,"given":"Jun","family":"Xu","sequence":"additional","affiliation":[{"name":"Nankai University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0965-3617","authenticated-orcid":false,"given":"Haoran","family":"Xie","sequence":"additional","affiliation":[{"name":"Lingnan University, Hong Kong SAR"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2961-0860","authenticated-orcid":false,"given":"Jing","family":"Qin","sequence":"additional","affiliation":[{"name":"The Hong Kong Polytechnic University, Hong Kong SAR"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0429-490X","authenticated-orcid":false,"given":"Mingqiang","family":"Wei","sequence":"additional","affiliation":[{"name":"Shenzhen Research Institute, Nanjing University of Aeronautics and Astronautics, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,9,18]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206596"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2487833"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3068644"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2934350"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2891104"},{"key":"e_1_3_1_7_2","first-page":"6821","volume-title":"Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems","author":"Chen Hao","year":"2018","unstructured":"Hao Chen, You-Fu Li, and Dan Su. 2018. Attention-aware cross-modal cross-level fusion network for rgb-d salient object detection. In Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems. 6821\u20136826."},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2813165"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2020.3028289"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/2632856.2632866"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2022.3216198"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.89"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/IGARSS39084.2020.9324530"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jvcir.2019.03.019"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.487"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.5555\/3304415.3304515"},{"issue":"8","key":"e_1_3_1_17_2","first-page":"4339","article-title":"Re-thinking co-salient object detection","volume":"44","author":"Fan Deng-Ping","year":"2021","unstructured":"Deng-Ping Fan, Tengpeng Li, Zheng Lin, Ge-Peng Ji, Dingwen Zhang, Ming-Ming Cheng, Huazhu Fu, and Jianbing Shen. 2021. Re-thinking co-salient object detection. IEEE Trans. Pattern Anal. Mach. Intell. 44, 8 (2021), 4339\u20134354.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2020.2996406"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00875"},{"key":"e_1_3_1_20_2","first-page":"2343","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Feng David","year":"2016","unstructured":"David Feng, Nick Barnes, Shaodi You, and Chris McCarthy. 2016. Local background enclosure for RGB-d salient object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2343\u20132350."},{"key":"e_1_3_1_21_2","first-page":"3052","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Fu Keren","year":"2020","unstructured":"Keren Fu, Deng-Ping Fan, Ge-Peng Ji, and Qijun Zhao. 2020. JL-DCF: Joint learning and densely-cooperative fusion framework for RGB-d salient object detection. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 3052\u20133062."},{"key":"e_1_3_1_22_2","first-page":"1263","volume-title":"International Conference on Machine Learning","author":"Gilmer Justin","year":"2017","unstructured":"Justin Gilmer, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, and George E. Dahl. 2017. Neural message passing for quantum chemistry. In International Conference on Machine Learning. PMLR, 1263\u20131272."},{"key":"e_1_3_1_23_2","first-page":"10869","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Gu Yuchao","year":"2020","unstructured":"Yuchao Gu, Lijuan Wang, Ziqin Wang, Yun Liu, Ming-Ming Cheng, and Shao-Ping Lu. 2020. Pyramid constrained self-attention network for fast video salient object detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 10869\u201310876."},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2017.2761775"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.243"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00488"},{"key":"e_1_3_1_28_2","first-page":"9471","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Ji Wei","year":"2021","unstructured":"Wei Ji, Jingjing Li, Shuang Yu, Miao Zhang, Yongri Piao, Shunyu Yao, Qi Bi, Kai Ma, Yefeng Zheng, Huchuan Lu, et\u00a0al. 2021. Calibrated RGB-D salient object detection. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 9471\u20139481."},{"key":"e_1_3_1_29_2","first-page":"52","volume-title":"European Conference on Computer Vision","author":"Ji Wei","year":"2020","unstructured":"Wei Ji, Jingjing Li, Miao Zhang, Yongri Piao, and Huchuan Lu. 2020. Accurate RGB-D salient object detection via collaborative learning. In European Conference on Computer Vision. Springer, 52\u201369."},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3060167"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2014.7025222"},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1145\/2000486.2000488"},{"key":"e_1_3_1_33_2","first-page":"630","volume-title":"European Conference on Computer Vision","author":"Lee Minhyeok","year":"2022","unstructured":"Minhyeok Lee, Chaewon Park, Suhwan Cho, and Sangyoun Lee. 2022. SPSN: Superpixel prototype sampling network for rgb-d salient object detection. In European Conference on Computer Vision. Springer, 630\u2013647."},{"key":"e_1_3_1_34_2","first-page":"225","volume-title":"European Conference on Computer Vision","author":"Li Chongyi","year":"2020","unstructured":"Chongyi Li, Runmin Cong, Yongri Piao, Qianqian Xu, and Chen Change Loy. 2020. RGB-D salient object detection with cross-modality modulation and selection. In European Conference on Computer Vision. Springer, 225\u2013241."},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2020.2976689"},{"key":"e_1_3_1_36_2","first-page":"665","volume-title":"European Conference on Computer Vision","author":"Li Gongyang","year":"2020","unstructured":"Gongyang Li, Zhi Liu, Linwei Ye, Yang Wang, and Haibin Ling. 2020. Cross-modal weighting network for RGB-D salient object detection. In European Conference on Computer Vision. Springer, 665\u2013681."},{"key":"e_1_3_1_37_2","first-page":"3243","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Li Guanbin","year":"2018","unstructured":"Guanbin Li, Yuan Xie, Tianhao Wei, Keze Wang, and Liang Lin. 2018. Flow guided recurrent neural encoder for video salient object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3243\u20133252."},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2762594"},{"key":"e_1_3_1_39_2","first-page":"207","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Li Siyang","year":"2018","unstructured":"Siyang Li, Bryan Seybold, Alexey Vorobyov, Xuejing Lei, and C.-C. Jay Kuo. 2018. Unsupervised video object segmentation with motion-based bilateral networks. In Proceedings of the European Conference on Computer Vision. 207\u2013223."},{"issue":"7","key":"e_1_3_1_40_2","first-page":"4486","article-title":"SwinNet: Swin transformer drives edge-aware RGB-D and RGB-T salient object detection","volume":"32","author":"Liu Zhengyi","year":"2021","unstructured":"Zhengyi Liu, Yacheng Tan, Qian He, and Yun Xiao. 2021. SwinNet: Swin transformer drives edge-aware RGB-D and RGB-T salient object detection. IEEE Trans. Circ. Syst. Vid. Technol. 32, 7 (2021), 4486\u20134497.","journal-title":"IEEE Trans. Circ. Syst. Vid. Technol."},{"key":"e_1_3_1_41_2","first-page":"454","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Niu Yuzhen","year":"2012","unstructured":"Yuzhen Niu, Yujie Geng, Xueqing Li, and Feng Liu. 2012. Leveraging stereopsis for saliency analysis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 454\u2013461."},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.242"},{"key":"e_1_3_1_43_2","first-page":"235","volume-title":"European Conference on Computer Vision","author":"Pang Youwei","year":"2020","unstructured":"Youwei Pang, Lihe Zhang, Xiaoqi Zhao, and Huchuan Lu. 2020. Hierarchical dynamic filtering network for RGB-d salient object detection. In European Conference on Computer Vision. Springer, 235\u2013252."},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10578-9_7"},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.85"},{"key":"e_1_3_1_46_2","first-page":"7253","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Piao Yongri","year":"2019","unstructured":"Yongri Piao, Wei Ji, Jingjing Li, Miao Zhang, and Huchuan Lu. 2019. Depth-induced multi-scale recurrent attention network for saliency detection. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 7253\u20137262."},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1145\/3419439"},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2682981"},{"key":"e_1_3_1_49_2","first-page":"25","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops","author":"Ren Jianqiang","year":"2015","unstructured":"Jianqiang Ren, Xiaojin Gong, Lu Yu, Wenhui Zhou, and Michael Ying Yang. 2015. Exploiting global priors for RGB-D saliency detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 25\u201332."},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/3079765"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"e_1_3_1_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNN.2008.2005605"},{"key":"e_1_3_1_53_2","first-page":"1","volume-title":"International Conference on Learning Representations","author":"Simonyan K.","year":"2015","unstructured":"K. Simonyan and A. Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In International Conference on Learning Representations. Computational and Biological Learning Society, 1\u201314."},{"key":"e_1_3_1_54_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2711277"},{"key":"e_1_3_1_55_2","first-page":"1407","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Sun Peng","year":"2021","unstructured":"Peng Sun, Wenhu Zhang, Huanyu Wang, Songyuan Li, and Xi Li. 2021. Deep RGB-D saliency detection with depth-sensitive attention and automatic multi-modal fusion. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 1407\u20131417."},{"key":"e_1_3_1_56_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"issue":"7","key":"e_1_3_1_57_2","doi-asserted-by":"crossref","first-page":"1973","DOI":"10.1109\/TCSVT.2018.2859773","article-title":"Weakly supervised salient object detection with spatiotemporal cascade neural networks","volume":"29","author":"Tang Yi","year":"2018","unstructured":"Yi Tang, Wenbin Zou, Zhi Jin, Yuhuan Chen, Yang Hua, and Xia Li. 2018. Weakly supervised salient object detection with spatiotemporal cascade neural networks. IEEE Trans. Circ. Syst. Vid. Technol. 29, 7 (2018), 1973\u20131984.","journal-title":"IEEE Trans. Circ. Syst. Vid. Technol."},{"key":"e_1_3_1_58_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58536-5_24"},{"key":"e_1_3_1_59_2","first-page":"454","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Wang Li","year":"2021","unstructured":"Li Wang, Liang Du, Xiaoqing Ye, Yanwei Fu, Guodong Guo, Xiangyang Xue, Jianfeng Feng, and Li Zhang. 2021. Depth-conditioned dynamic message propagation for monocular 3d object detection. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 454\u2013463."},{"key":"e_1_3_1_60_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2913107"},{"key":"e_1_3_1_61_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3139232"},{"key":"e_1_3_1_62_2","first-page":"211","volume-title":"Proceedings of the IEEE 5th International Conference on Image, Vision and Computing","author":"Xu Xu","year":"2020","unstructured":"Xu Xu, Zhou Ruan, and Lei Yang. 2020. Facial expression recognition based on graph neural network. In Proceedings of the IEEE 5th International Conference on Image, Vision and Computing. 211\u2013214."},{"key":"e_1_3_1_63_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00738"},{"key":"e_1_3_1_64_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2022.3144852"},{"key":"e_1_3_1_65_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3116793"},{"key":"e_1_3_1_66_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3116793"},{"key":"e_1_3_1_67_2","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475364"},{"key":"e_1_3_1_68_2","first-page":"8579","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhang Jing","year":"2020","unstructured":"Jing Zhang, Deng-Ping Fan, Yuchao Dai, Saeed Anwar, Fatemeh Sadat Saleh, Tong Zhang, and Nick Barnes. 2020. UC-Net: Uncertainty inspired RGB-d saliency detection via conditional variational autoencoders. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8579\u20138588."},{"key":"e_1_3_1_69_2","first-page":"3723","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhang Li","year":"2020","unstructured":"Li Zhang, Dan Xu, Anurag Arnab, and Philip H. S. Torr. 2020. Dynamic graph message passing networks. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 3723\u20133732."},{"key":"e_1_3_1_70_2","first-page":"374","volume-title":"European Conference on Computer Vision","author":"Zhang Miao","year":"2020","unstructured":"Miao Zhang, Sun X. Fei, Jie Liu, Shuang Xu, Yongri Piao, and Huchuan Lu. 2020. Asymmetric two-stream architecture for accurate rgb-d saliency detection. In European Conference on Computer Vision. Springer, 374\u2013390."},{"key":"e_1_3_1_71_2","first-page":"3469","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhang Miao","year":"2020","unstructured":"Miao Zhang, Weisong Ren, Yongri Piao, Zhengkun Rong, and Huchuan Lu. 2020. Select, supplement and focus for RGB-d saliency detection. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 3469\u20133478."},{"key":"e_1_3_1_72_2","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475240"},{"key":"e_1_3_1_73_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3049959"},{"key":"e_1_3_1_74_2","first-page":"3922","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhao Jia-Xing","year":"2019","unstructured":"Jia-Xing Zhao, Yang Cao, Deng-Ping Fan, Ming-Ming Cheng, Xuan-Yi Li, and Le Zhang. 2019. Contrast prior and fluid pyramid integration for rgbd salient object detection. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 3922\u20133931."},{"key":"e_1_3_1_75_2","first-page":"646","volume-title":"European Conference on Computer Vision","author":"Zhao Xiaoqi","year":"2020","unstructured":"Xiaoqi Zhao, Lihe Zhang, Youwei Pang, Huchuan Lu, and Lei Zhang. 2020. A single stream network for robust and real-time RGB-D salient object detection. In European Conference on Computer Vision. Springer, 646\u2013662."},{"key":"e_1_3_1_76_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3108412"},{"key":"e_1_3_1_77_2","first-page":"270","volume-title":"European Conference on Computer Vision","author":"Zhou Jiayuan","year":"2022","unstructured":"Jiayuan Zhou, Lijun Wang, Huchuan Lu, Kaining Huang, Xinchu Shi, and Bocong Liu. 2022. Mvsalnet: Multi-view augmentation for RGB-d salient object detection. In European Conference on Computer Vision. Springer, 270\u2013287."},{"issue":"4","key":"e_1_3_1_78_2","first-page":"862","article-title":"Unsupervised object class discovery via saliency-guided multiple class learning","volume":"37","author":"Zhu Jun-Yan","year":"2014","unstructured":"Jun-Yan Zhu, Jiajun Wu, Yan Xu, Eric Chang, and Zhuowen Tu. 2014. Unsupervised object class discovery via saliency-guided multiple class learning. IEEE Trans. Pattern Anal. Mach. Intell. 37, 4 (2014), 862\u2013875.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"e_1_3_1_79_2","first-page":"9300","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhu Xizhou","year":"2019","unstructured":"Xizhou Zhu, Han Hu, Stephen Lin, and Jifeng Dai. 2019. Deformable convNets V2: More deformable, better results. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 9300\u20139308."},{"key":"e_1_3_1_80_2","doi-asserted-by":"publisher","DOI":"10.1145\/3491228"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3597612","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3597612","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:50:13Z","timestamp":1750287013000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3597612"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,18]]},"references-count":79,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,1,31]]}},"alternative-id":["10.1145\/3597612"],"URL":"https:\/\/doi.org\/10.1145\/3597612","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,18]]},"assertion":[{"value":"2022-12-08","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-05-09","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-09-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}