{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T03:09:37Z","timestamp":1773976177116,"version":"3.50.1"},"reference-count":60,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2023,2,25]],"date-time":"2023-02-25T00:00:00Z","timestamp":1677283200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61873145"],"award-info":[{"award-number":["61873145"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Natural Science Foundation of Shandong Province for Excellent Young Scholars","award":["ZR2017JL029"],"award-info":[{"award-number":["ZR2017JL029"]}]},{"name":"Science and Technology Innovation Program for Distinguished Young Scholars of Shandong Province Higher Education Institutions","award":["2019KJN045"],"award-info":[{"award-number":["2019KJN045"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2023,8,31]]},"abstract":"<jats:p>Feature refinement and feature fusion are two key steps in convolutional neural networks\u2013based salient object detection (SOD). In this article, we investigate how to utilize multiple guidance mechanisms to better refine and fuse extracted multi-level features and propose a novel multi-guidance SOD model dubbed as MGuid-Net. Since boundary information is beneficial for locating and sharpening salient objects, edge features are utilized in our network together with saliency features for SOD. Specifically, a self-guidance module is applied to multi-level saliency features and edge features, respectively, which aims to gradually guide the refinement of lower-level features by higher-level features. After that, a cross-guidance module is devised to mutually refine saliency features and edge features via the complementarity between them. Moreover, to better integrate refined multi-level features, we also present an accumulative guidance module, which exploits multiple high-level features to guide the fusion of different features in a hierarchical manner. Finally, a pixelwise contrast loss function is adopted as an implicit guidance to help our network retain more details in salient objects. Extensive experiments on five benchmark datasets demonstrate our model can identify salient regions of an image more effectively compared to most of state-of-the-art models.<\/jats:p>","DOI":"10.1145\/3570507","type":"journal-article","created":{"date-parts":[[2022,11,3]],"date-time":"2022-11-03T11:33:27Z","timestamp":1667475207000},"page":"1-19","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":29,"title":["Multi-Guidance CNNs for Salient Object Detection"],"prefix":"10.1145","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0382-3112","authenticated-orcid":false,"given":"Shuaixiong","family":"Hui","sequence":"first","affiliation":[{"name":"School of Computer Scienceand Technology, Shandong University of Finance and Economics, and Shandong Provincial Key Laboratory of Digital Media Technology, East Erhuan Road, Jinan, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4219-3528","authenticated-orcid":false,"given":"Qiang","family":"Guo","sequence":"additional","affiliation":[{"name":"School of Computer Scienceand Technology, Shandong University of Finance and Economics, and Shandong Provincial Key Laboratory of Digital Media Technology, East Erhuan Road, Jinan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2441-7754","authenticated-orcid":false,"given":"Xiaoyu","family":"Geng","sequence":"additional","affiliation":[{"name":"School of Computer Scienceand Technology, Shandong University of Finance and Economics, and Shandong Provincial Key Laboratory of Digital Media Technology, East Erhuan Road, Jinan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0217-1543","authenticated-orcid":false,"given":"Caiming","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Software, Shandong University, Shunhua Road, Jinan, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,2,25]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206596"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2020.2965989"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-021-01490-8"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2345401"},{"key":"e_1_3_1_6_2","first-page":"1529","volume-title":"Proceedings of the IEEE International Conference on Computer Vision","author":"Cheng Ming-Ming","year":"2013","unstructured":"Ming-Ming Cheng, Jonathan Warrell, Wen-Yan Lin, Shuai Zheng, Vibhav Vineet, and Nigel Crook. 2013. Efficient salient region detection with soft image abstraction. In Proceedings of the IEEE International Conference on Computer Vision. 1529\u20131536."},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2021.01.034"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-009-0275-4"},{"key":"e_1_3_1_10_2","first-page":"4548","volume-title":"Proceedings of the IEEE International Conference on Computer Vision","author":"Fan Deng-Ping","year":"2017","unstructured":"Deng-Ping Fan, Ming-Ming Cheng, Yun Liu, Tao Li, and Ali Borji. 2017. Structure-measure: A new way to evaluate foreground maps. In Proceedings of the IEEE International Conference on Computer Vision. 4548\u20134557."},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.5555\/3304415.3304515"},{"key":"e_1_3_1_12_2","first-page":"1623","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Feng Mengyang","year":"2019","unstructured":"Mengyang Feng, Huchuan Lu, and Errui Ding. 2019. Attentive feedback network for boundary-aware salient object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1623\u20131632."},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2018.2881835"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2815688"},{"key":"e_1_3_1_16_2","first-page":"6943","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Hu Xiaowei","year":"2018","unstructured":"Xiaowei Hu, Lei Zhu, Jing Qin, Chi-Wing Fu, and Pheng-Ann Heng. 2018. Recurrently aggregating deep features for salient object detection. In Proceedings of the AAAI Conference on Artificial Intelligence. 6943\u20136950."},{"key":"e_1_3_1_17_2","first-page":"2214","volume-title":"Proceedings of the IEEE International Conference on Computer Vision","author":"Klein Dominik A.","year":"2011","unstructured":"Dominik A. Klein and Simone Frintrop. 2011. Center-surround divergence of feature statistics for salient object detection. In Proceedings of the IEEE International Conference on Computer Vision. 2214\u20132219."},{"key":"e_1_3_1_18_2","first-page":"3668","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Kuen Jason","year":"2016","unstructured":"Jason Kuen, Zhenhua Wang, and Gang Wang. 2016. Recurrent attentional networks for saliency detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3668\u20133677."},{"key":"e_1_3_1_19_2","first-page":"660","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Lee Gayoung","year":"2016","unstructured":"Gayoung Lee, Yu-Wing Tai, and Junmo Kim. 2016. Deep saliency with encoded low level distance map and high level features. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 660\u2013668."},{"key":"e_1_3_1_20_2","first-page":"5455","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Li Guanbin","year":"2015","unstructured":"Guanbin Li and Yizhou Yu. 2015. Visual saliency based on multiscale deep features. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5455\u20135463."},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2019.2921543"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.43"},{"key":"e_1_3_1_23_2","first-page":"24362444","volume-title":"Proceedings of the 28th ACM International Conference on Multimedia","author":"Liao Guibiao","year":"2020","unstructured":"Guibiao Liao, Wei Gao, Qiuping Jiang, Ronggang Wang, and Ge Li. 2020. MMNet: Multi-stage and multi-scale fusion network for RGB-D salient object detection. In Proceedings of the 28th ACM International Conference on Multimedia. 24362444."},{"issue":"3","key":"e_1_3_1_24_2","first-page":"81","article-title":"Residual refinement network with attribute guidance for precise saliency detection","volume":"17","author":"Lin Feng","year":"2021","unstructured":"Feng Lin, Wengang Zhou, Jiajun Deng, Bin Li, Yan Lu, and Houqiang Li. 2021. Residual refinement network with attribute guidance for precise saliency detection. ACM Trans. Multimedia Comput. Commun. Appl. 17, 3, Article 81 (2021), 19 pages.","journal-title":"ACM Trans. Multimedia Comput. Commun. Appl."},{"key":"e_1_3_1_25_2","first-page":"3912","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Liu Jiang-Jiang","year":"2019","unstructured":"Jiang-Jiang Liu, Qibin Hou, Ming-Ming Cheng, Jiashi Feng, and Jianmin Jiang. 2019. A simple pooling-based design for real-time salient object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3912\u20133921."},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3122093"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.80"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00326"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2021.3051350"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3017512"},{"key":"e_1_3_1_32_2","first-page":"9410","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Pang Youwei","year":"2020","unstructured":"Youwei Pang, Xiaoqi Zhao, Lihe Zhang, and Huchuan Lu. 2020. Multi-scale interactive network for salient object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 9410\u20139419."},{"key":"e_1_3_1_33_2","first-page":"733","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Perazzi Federico","year":"2012","unstructured":"Federico Perazzi, Philipp Kr\u00e4henb\u00fchl, Yael Pritch, and Alexander Hornung. 2012. Saliency filters: Contrast based filtering for salient region detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 733\u2013740."},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00766"},{"issue":"3","key":"e_1_3_1_35_2","first-page":"73","article-title":"Objective object segmentation visual quality evaluation: Quality measure and pooling method","volume":"18","author":"Shi Ran","year":"2022","unstructured":"Ran Shi, Jing Ma, King Ngi Ngan, Jian Xiong, and Tong Qiao. 2022. Objective object segmentation visual quality evaluation: Quality measure and pooling method. ACM Trans. Multimedia Comput. Commun. Appl. 18, 3, Article 73 (2022), 19 pages.","journal-title":"ACM Trans. Multimedia Comput. Commun. Appl."},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.256"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-016-0977-3"},{"key":"e_1_3_1_38_2","first-page":"136","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Wang Lijun","year":"2017","unstructured":"Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, Dong Wang, Baocai Yin, and Xiang Ruan. 2017. Learning to detect salient objects with image-level supervision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 136\u2013145."},{"issue":"7","key":"e_1_3_1_39_2","doi-asserted-by":"crossref","first-page":"1734","DOI":"10.1109\/TPAMI.2018.2846598","article-title":"Salient object detection with recurrent fully convolutional networks","volume":"41","author":"Wang Linzhao","year":"2018","unstructured":"Linzhao Wang, Lijun Wang, Huchuan Lu, Pingping Zhang, and Xiang Ruan. 2018. Salient object detection with recurrent fully convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 41, 7 (2018), 1734\u20131746.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00330"},{"key":"e_1_3_1_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2021.3051099"},{"key":"e_1_3_1_42_2","first-page":"5968","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Wang Wenguan","year":"2019","unstructured":"Wenguan Wang, Jianbing Shen, Ming-Ming Cheng, and Ling Shao. 2019. An iterative and cooperative top-down and bottom-up inference network for salient object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5968\u20135977."},{"issue":"8","key":"e_1_3_1_43_2","doi-asserted-by":"crossref","first-page":"1913","DOI":"10.1109\/TPAMI.2019.2905607","article-title":"Inferring salient objects from human fixations","volume":"42","author":"Wang Wenguan","year":"2019","unstructured":"Wenguan Wang, Jianbing Shen, Xingping Dong, Ali Borji, and Ruigang Yang. 2019. Inferring salient objects from human fixations. IEEE Trans. Pattern Anal. Mach. Intell. 42, 8 (2019), 1913\u20131927.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"e_1_3_1_44_2","first-page":"1448","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Wang Wenguan","year":"2019","unstructured":"Wenguan Wang, Shuyang Zhao, Jianbing Shen, Steven C. H. Hoi, and Ali Borji. 2019. Salient object detection with pyramid attention and salient edges. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1448\u20131457."},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2891055"},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3123548"},{"key":"e_1_3_1_47_2","first-page":"7264","volume-title":"Proceedings of the IEEE International Conference on Computer Vision","author":"Wu Zhe","year":"2019","unstructured":"Zhe Wu, Li Su, and Qingming Huang. 2019. Stacked cross refinement network for edge-aware salient object detection. In Proceedings of the IEEE International Conference on Computer Vision. 7264\u20137273."},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5539970"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.153"},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.407"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3113794"},{"issue":"3","key":"e_1_3_1_52_2","first-page":"32","article-title":"Saliency detection on light field: A multi-cue approach","volume":"13","author":"Zhang Jun","year":"2017","unstructured":"Jun Zhang, Meng Wang, Liang Lin, Xun Yang, Jun Gao, and Yong Rui. 2017. Saliency detection on light field: A multi-cue approach. ACM Trans. Multimedia Comput. Commun. Appl. 13, 3, Article 32 (2017), 22 pages.","journal-title":"ACM Trans. Multimedia Comput. Commun. Appl."},{"key":"e_1_3_1_53_2","first-page":"1741","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Zhang Lu","year":"2018","unstructured":"Lu Zhang, Ju Dai, Huchuan Lu, You He, and Gang Wang. 2018. A bi-directional message passing model for salient object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1741\u20131750."},{"key":"e_1_3_1_54_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.31"},{"key":"e_1_3_1_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00081"},{"key":"e_1_3_1_56_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00405"},{"key":"e_1_3_1_57_2","first-page":"8778","volume-title":"Proceedings of the IEEE International Conference on Computer Vision","author":"Zhao Jia-Xing","year":"2019","unstructured":"Jia-Xing Zhao, Jiang-Jiang Liu, Deng-Ping Fan, Yang Cao, Jufeng Yang, and Ming-Ming Cheng. 2019. EGNet: Edge guidance network for salient object detection. In Proceedings of the IEEE International Conference on Computer Vision. 8778\u20138787."},{"key":"e_1_3_1_58_2","first-page":"35","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Zhao Xiaoqi","year":"2020","unstructured":"Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, and Lei Zhang. 2020. Suppress and balance: A simple gated network for salient object detection. In Proceedings of the European Conference on Computer Vision. 35\u201351."},{"key":"e_1_3_1_59_2","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475494"},{"key":"e_1_3_1_60_2","first-page":"199","volume-title":"Proceedings of the IEEE International Conference on Multimedia and Expo","author":"Zhu Chunbiao","year":"2019","unstructured":"Chunbiao Zhu, Xing Cai, Kan Huang, Thomas H Li, and Ge Li. 2019. PDNet: Prior-model guided depth-enhanced network for salient object detection. In Proceedings of the IEEE International Conference on Multimedia and Expo. 199\u2013204."},{"key":"e_1_3_1_61_2","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2018.2875586"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3570507","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3570507","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:46:17Z","timestamp":1750178777000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3570507"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,25]]},"references-count":60,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,8,31]]}},"alternative-id":["10.1145\/3570507"],"URL":"https:\/\/doi.org\/10.1145\/3570507","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,25]]},"assertion":[{"value":"2022-04-30","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-10-30","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-02-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}