{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,9]],"date-time":"2025-09-09T22:17:51Z","timestamp":1757456271203,"version":"3.41.0"},"reference-count":62,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2024,3,8]],"date-time":"2024-03-08T00:00:00Z","timestamp":1709856000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Science Foundation of China","doi-asserted-by":"crossref","award":["61702289"],"award-info":[{"award-number":["61702289"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Henan Province University Science and Technology Innovation Talent Support Program","award":["No21HASTIT032"],"award-info":[{"award-number":["No21HASTIT032"]}]},{"name":"scientific and technological project in Henan Province of China","award":["No212102310304"],"award-info":[{"award-number":["No212102310304"]}]},{"name":"Cultivating Fund Project of the National Science Foundation","award":["2023PY009"],"award-info":[{"award-number":["2023PY009"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2024,6,30]]},"abstract":"<jats:p>Unsupervised salient object detection is an important task in many real-world scenarios where pixel-wise label information is of scarce availability. Despite its significance, this problem remains rarely explored, with a few works that consider unsupervised salient object detection methods based on the fused graph from the sum fusion of multiple deep feature similarity matrices. However, these methods ignore the interrelation of the low-level feature similarity matrices and the high-level semantic similarity matrice, which degrades the quality of the fused graph. In this article, we propose a semantic-consistency-guided multi-graph fusion learning algorithm for unsupervised saliency detection, where the consistency and inconsistency between multiple low-level feature similarity matrices and the high-level semantic similarity matrice are explored to promote the robustness and quality of the fused graph. In the first stage, a semantic-consistency-guided multi-graph fusion learning method is proposed to exploit consistency and inconsistency of multiple low-level deep features and the high-level semantic feature. The semantic-consistency-guided similarity matrices are computed for preliminary saliency ranking. In the following saliency refinement stage, the semantic-enhanced similarity matrices are built by the cross diffusion to fuse the multiple low-level deep features and the high semantic deep feature. Based on the semantic-enhanced similarity matrices, the refinement saliency maps are calculated in a semantic-enhanced cellular automata manner. Furthermore, the final ensemble stage of the large margin semi-supervised classification views the preliminary ranking results and refinement results as features, adopts the large margin graphs for saliency ensemble. Extensive evaluations over four benchmark datasets show that the proposed unsupervised method performs favorably against the state-of-the-art approaches and is competitive with some supervised deep learning-based methods.<\/jats:p>","DOI":"10.1145\/3640816","type":"journal-article","created":{"date-parts":[[2024,1,22]],"date-time":"2024-01-22T12:30:49Z","timestamp":1705926649000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Semantic-Consistency-guided Learning on Deep Features for Unsupervised Salient Object Detection"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2704-7314","authenticated-orcid":false,"given":"Ying Ying","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Physics and Electronic Engineering, Nanyang Normal University, Henan Engineering Research Center for Radio Frequency Front End and Antenna of Millimeter Wave Wireless Communication System, Nan Yang, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4622-0669","authenticated-orcid":false,"given":"Shuo","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2915-8656","authenticated-orcid":false,"given":"Ming","family":"Hui","sequence":"additional","affiliation":[{"name":"School of Physics and Electronic Engineering, Nanyang Normal University, Henan Engineering Research Center for Radio Frequency Front End and Antenna of Millimeter Wave Wireless Communication System, Nan Yang, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,3,8]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2487833"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2345401"},{"key":"e_1_3_1_4_2","first-page":"473","volume-title":"Computer Vision and Pattern Recognition","author":"Duan Lijuan","year":"2011","unstructured":"Lijuan Duan, Chunpeng Wu, Jun Miao, and Laiyun Qing. 2011. Visual saliency detection by spatially weighted dissimilarity. In Computer Vision and Pattern Recognition. 473\u2013480."},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.487"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11432-021-3384-y"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298868"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2815688"},{"key":"e_1_3_1_9_2","first-page":"478","volume-title":"Computer Vision and Pattern Recognition","author":"Itti L.","year":"2012","unstructured":"L. Itti and A. Borji. 2012. Exploiting local and global patch rarities for saliency detection. In Computer Vision and Pattern Recognition. 478\u2013485."},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/34.730558"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2020.09.003"},{"key":"e_1_3_1_12_2","first-page":"5455","volume-title":"Computer Vision and Pattern Recognition","author":"Li Guanbin","year":"2015","unstructured":"Guanbin Li and Yizhou Yu. 2015. Visual saliency based on multiscale deep features. In Computer Vision and Pattern Recognition. 5455\u20135463."},{"key":"e_1_3_1_13_2","first-page":"478","volume-title":"Computer Vision and Pattern Recognition","author":"Li Guanbin","year":"2016","unstructured":"Guanbin Li and Yizhou Yu. 2016. Deep contrast learning for salient object detection. In Computer Vision and Pattern Recognition. 478\u2013487."},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2440755"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2016.2524198"},{"key":"e_1_3_1_16_2","first-page":"280","volume-title":"Computer Vision and Pattern Recognition","author":"Li Yin","year":"2014","unstructured":"Yin Li, Xiaodi Hou, Christof Koch, James M. Rehg, and Alan L. Yuille. 2014. The secrets of salient object segmentation. In Computer Vision and Pattern Recognition. 280\u2013287."},{"key":"e_1_3_1_17_2","first-page":"1725","volume-title":"International Joint Conference on Artificial Intelligence","author":"Li Yufeng","year":"2016","unstructured":"Yufeng Li, Shao Bo Wang, and Zhihua Zhou. 2016. Graph quality judgement: A large margin expedition. In International Joint Conference on Artificial Intelligence. 1725\u20131731."},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2019.00148"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2021.3140168"},{"key":"e_1_3_1_20_2","first-page":"2097","volume-title":"Computer Vision and Pattern Recognition","author":"Liu Mingyu","year":"2011","unstructured":"Mingyu Liu, O. Tuzel, S. Ramalingam, and R. Chellappa. 2011. Entropy rate superpixel segmentation. In Computer Vision and Pattern Recognition. 2097\u20132104."},{"key":"e_1_3_1_21_2","first-page":"678","volume-title":"Computer Vision and Pattern Recognition","author":"Liu Nian.","year":"2016","unstructured":"Nian. Liu and Junwei. Han. 2016. DHSNet: Deep hierarchical saliency network for salient object detection. In Computer Vision and Pattern Recognition. 678\u2013686."},{"key":"e_1_3_1_22_2","first-page":"3866","volume-title":"Computer Vision and Pattern Recognition","author":"Liu Risheng","year":"2014","unstructured":"Risheng Liu, Junjie Cao, Zhouchen Lin, and Shiguang Shan. 2014. Adaptive partial differential equation learning for visual saliency detection. In Computer Vision and Pattern Recognition. 3866\u20133873."},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3065239"},{"issue":"4","key":"e_1_3_1_24_2","first-page":"640","article-title":"Fully convolutional networks for semantic segmentation","volume":"39","author":"Long Jonathan","year":"2015","unstructured":"Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 4 (2015), 640\u2013651.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_1_25_2","first-page":"6593","volume-title":"Computer Vision and Pattern Recognition","author":"Luo Zhiming","year":"2017","unstructured":"Zhiming Luo, Akshaya Mishra, Andrew Achkar, Justin Eichel, Shaozi Li, and Pierre Marc Jodoin. 2017. Non-local deep features for salient object detection. In Computer Vision and Pattern Recognition. 6593\u20136601."},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2754939"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.178"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-017-1062-2"},{"key":"e_1_3_1_29_2","first-page":"110","volume-title":"Computer Vision and Pattern Recognition","author":"Qin Yao","year":"2015","unstructured":"Yao Qin, Huchuan Lu, Yiqun Xu, and He Wang. 2015. Saliency detection via cellular automata. In Computer Vision and Pattern Recognition. 110\u2013119."},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2013.2280096"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2863028"},{"key":"e_1_3_1_32_2","first-page":"13","volume-title":"Computer Vision and Pattern Recognition","author":"Sugano Yusuke","year":"2010","unstructured":"Yusuke Sugano, Yasuyuki Matsushita, and Yoichi Sato. 2010. Calibration-free gaze sensing using saliency maps. In Computer Vision and Pattern Recognition. 13\u201318."},{"key":"e_1_3_1_33_2","first-page":"2997","volume-title":"Computer Vision and Pattern Recognition","author":"Wang Bo","year":"2012","unstructured":"Bo Wang, Jiayan Jiang, Zhihua Wang, Weiand Zhou, and Zhuowen Tu. 2012. Unsupervised metric fusion by cross diffusion. In Computer Vision and Pattern Recognition. 2997\u20133004."},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-016-0977-3"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.433"},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2021.3051099"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2840724"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2662005"},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2015.2498149"},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01304"},{"key":"e_1_3_1_41_2","first-page":"29","volume-title":"European Conference on Computer Vision","author":"Wei Yichen","year":"2017","unstructured":"Yichen Wei, Fang Wen, Wangjiang Zhu, and Jian Sun. 2017. Geodesic saliency using background priors. In European Conference on Computer Vision. 29\u201342."},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v36i3.20206"},{"key":"e_1_3_1_43_2","first-page":"1155","volume-title":"Computer Vision and Pattern Recognition","author":"Yan Qiong","year":"2013","unstructured":"Qiong Yan, Li Xu, Jianping Shi, and Jiaya Jia. 2013. Hierarchical saliency detection. In Computer Vision and Pattern Recognition. 1155\u20131162."},{"key":"e_1_3_1_44_2","first-page":"3166","volume-title":"Computer Vision and Pattern Recognition","author":"Yang Chuan","year":"2013","unstructured":"Chuan Yang, Lihe Zhang, Huchuan Lu, Ruan Xiang, and Ming Hsuan Yang. 2013. Saliency detection via graph-based manifold ranking. In Computer Vision and Pattern Recognition. 3166\u20133173."},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2838761"},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2877335"},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2017.2751646"},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2019.2900649"},{"key":"e_1_3_1_49_2","volume-title":"The Conference on Neural Information Processing Systems","author":"Zhang Dingwen","year":"2021","unstructured":"Dingwen Zhang, Haibin Tian, and Jungong Han. 2021. Few-cost salient object detection with adversarial-paced learning. In The Conference on Neural Information Processing Systems."},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11432-020-3181-9"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58520-4_21"},{"key":"e_1_3_1_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2017.2766787"},{"key":"e_1_3_1_53_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.31"},{"key":"e_1_3_1_54_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.32"},{"key":"e_1_3_1_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00081"},{"key":"e_1_3_1_56_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2021.04.028"},{"key":"e_1_3_1_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2942796"},{"key":"e_1_3_1_58_2","first-page":"1265","volume-title":"Computer Vision and Pattern Recognition","author":"Zhao Rui","year":"2015","unstructured":"Rui Zhao, Wanli Ouyang, Hongsheng Li, and Xiaogang Wang. 2015. Saliency detection by multi-context deep learning. In Computer Vision and Pattern Recognition. 1265\u20131274."},{"key":"e_1_3_1_59_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2022.3203595"},{"key":"e_1_3_1_60_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2018.2845667"},{"key":"e_1_3_1_61_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11704-016-6906-3"},{"key":"e_1_3_1_62_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.360"},{"key":"e_1_3_1_63_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2017.2725263"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3640816","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3640816","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:50:40Z","timestamp":1750287040000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3640816"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,8]]},"references-count":62,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2024,6,30]]}},"alternative-id":["10.1145\/3640816"],"URL":"https:\/\/doi.org\/10.1145\/3640816","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2024,3,8]]},"assertion":[{"value":"2023-04-14","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-01-08","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-03-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}