{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T08:03:51Z","timestamp":1769587431360,"version":"3.49.0"},"reference-count":20,"publisher":"SAGE Publications","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IFS"],"published-print":{"date-parts":[[2024,2,14]]},"abstract":"<jats:p>Most existing RGB-D salient object detection (SOD) methods extract features of both modalities in parallel or adopt depth features as supplementary information for unidirectional interaction from depth modality to RGB modality in the encoder stage. These methods ignore the influence of low-quality depth maps, and there is still room for improvement in effectively fusing RGB features and depth features. To address the above problems, this paper proposes a Feature Interaction Network (FINet), which performs bi-directional interaction through feature interaction module (FIM) in the encoder stage. The feature interaction module is divided into two parts: depth enhancement module (DEM) filters the noise in the depth features through the attention mechanism; and cross enhancement module (CEM) effectively interacts RGB features and depth features. In addition, this paper proposes a two-stage cross-modal fusion strategy: high-level fusion adopts the semantic information of high level for coarse localization of salient regions, and low-level fusion makes full use of the detailed information of low level through boundary fusion, and then we progressively refine high-level and low-level cross-modal features to obtain the final saliency prediction map. Extensive experiments show that the proposed model achieves better performance than eight state-of-the-art models on five standard datasets.<\/jats:p>","DOI":"10.3233\/jifs-233225","type":"journal-article","created":{"date-parts":[[2023,8,18]],"date-time":"2023-08-18T11:31:32Z","timestamp":1692358292000},"page":"4543-4556","source":"Crossref","is-referenced-by-count":3,"title":["Feature interaction and two-stage cross-modal fusion for RGB-D salient object detection"],"prefix":"10.1177","volume":"46","author":[{"given":"Ming","family":"Yu","sequence":"first","affiliation":[{"name":"School of Artificial Intelligence, Hebei University of Technology, Tianjin, China"}]},{"given":"Jiali","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence, Hebei University of Technology, Tianjin, China"}]},{"given":"Yi","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence, Hebei University of Technology, Tianjin, China"}]},{"given":"Gang","family":"Yan","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence, Hebei University of Technology, Tianjin, China"}]}],"member":"179","reference":[{"key":"10.3233\/JIFS-233225_ref2","doi-asserted-by":"crossref","unstructured":"Gao Yue , Wang Meng , Tao Dacheng , et al. 3-D object retrieval and recognition with hypergraph analysis. [J], IEEE transactions on image processing: a publication of the IEEE Signal Processing Society 21(9) (2012).","DOI":"10.1109\/TIP.2012.2199502"},{"issue":"2","key":"10.3233\/JIFS-233225_ref6","doi-asserted-by":"crossref","first-page":"3147","DOI":"10.3233\/JIFS-189353","article-title":"Component identification and defect detection in transmission lines based on deep learning [J]","volume":"40","author":"Zheng","year":"2021","journal-title":"Journal of Intelligent & Fuzzy Systems"},{"key":"10.3233\/JIFS-233225_ref7","doi-asserted-by":"crossref","first-page":"4873","DOI":"10.1109\/TIP.2020.2976689","article-title":"Icnet: Information conversion network for RGB-D based salient object detection","volume":"29","author":"Li","year":"2020","journal-title":"IEEE Trans. Image Process"},{"key":"10.3233\/JIFS-233225_ref8","doi-asserted-by":"publisher","DOI":"10.1109\/ICME51207.2021.9428263"},{"key":"10.3233\/JIFS-233225_ref9","doi-asserted-by":"publisher","first-page":"3376","DOI":"10.1109\/TIP.2021.3060167","article-title":"CDNet: Complementary Depth Network for RGB-D Salient Object Detection","volume":"30","author":"Jin","year":"2021","journal-title":"in IEEE Transactions on Image Processing"},{"issue":"2","key":"10.3233\/JIFS-233225_ref10","doi-asserted-by":"crossref","first-page":"2503","DOI":"10.3233\/JIFS-182769","article-title":"LHRNet: Lateral hierarchically refining network for salient object detection [J]","volume":"37","author":"Zheng","year":"2019","journal-title":"Journal of Intelligent & Fuzzy Systems"},{"key":"10.3233\/JIFS-233225_ref14","unstructured":"Qu L. , He S. , Zhang J. , et al. RGBD Salient Object Detection via Deep Fusion [J], IEEE Transactions on Image Processing (2016), PP(99)."},{"key":"10.3233\/JIFS-233225_ref23","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1007\/s41095-020-0199-z","article-title":"RGB-D salient object detection: A survey [J]","volume":"7","author":"Zhou","year":"2021","journal-title":"Computational Visual Media"},{"issue":"Oct.21","key":"10.3233\/JIFS-233225_ref24","first-page":"46","article-title":"Salient object detection for RGB-Dimage by single stream recurrent convolution neural network [J]","volume":"363","author":"Liu","year":"2019","journal-title":"Neurocomputing"},{"key":"10.3233\/JIFS-233225_ref25","doi-asserted-by":"crossref","first-page":"55277","DOI":"10.1109\/ACCESS.2019.2913107","article-title":"Adaptive Fusion for RGB-D Salient Object Detection [J]","volume":"7","author":"Wang","year":"2019","journal-title":"IEEE Access"},{"key":"10.3233\/JIFS-233225_ref26","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.jvcir.2019.03.019","article-title":"Depth-aware saliency detection using convolutional neural networks [J]","volume":"61","author":"Ding","year":"2019","journal-title":"Journal of Visual Communication and Image Representation"},{"key":"10.3233\/JIFS-233225_ref27","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1016\/j.patcog.2018.08.007","article-title":"Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for RGB-D salient object detection [J]","volume":"86","author":"Chen","year":"2019","journal-title":"Pattern Recognition"},{"issue":"0","key":"10.3233\/JIFS-233225_ref30","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1016\/j.neucom.2020.04.032","article-title":", CACNet: Salient object detection via context aggregation and contrast embedding [J]","volume":"403","author":"Guang Feng","year":"2020","journal-title":"Neurocomputing"},{"key":"10.3233\/JIFS-233225_ref33","doi-asserted-by":"publisher","DOI":"10.1145\/2632856.2632866"},{"key":"10.3233\/JIFS-233225_ref37","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/97"},{"key":"10.3233\/JIFS-233225_ref39","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2012.6247743"},{"key":"10.3233\/JIFS-233225_ref40","doi-asserted-by":"crossref","first-page":"3528","DOI":"10.1109\/TIP.2021.3062689","article-title":"Hierarchical alternate inter-action network for RGB-D salient object detection [J]","volume":"30","author":"Li","year":"2021","journal-title":"IEEE Transactions on Image Processing"},{"key":"10.3233\/JIFS-233225_ref41","doi-asserted-by":"crossref","first-page":"132","DOI":"10.1016\/j.neucom.2022.03.029","article-title":"Multi-modal interactive attention and dual progressive decoding network for RGB-D\/T salient object detection [J]","volume":"490","author":"Liang","year":"2022","journal-title":"Neurocomputing"},{"issue":"10","key":"10.3233\/JIFS-233225_ref42","doi-asserted-by":"crossref","first-page":"7547","DOI":"10.1007\/s00521-021-06845-3","article-title":"CFIDNet: Cascaded feature interaction decoder for RGB-D salient object detection [J]","volume":"34","author":"Chen","year":"2022","journal-title":"Neural Computing and Applications"},{"key":"10.3233\/JIFS-233225_ref44","doi-asserted-by":"crossref","first-page":"109194","DOI":"10.1016\/j.patcog.2022.109194","article-title":"Cross-modal hierarchical interaction network for RGB-D salient object detection [J]","volume":"136","author":"Bi","year":"2023","journal-title":"Pattern Recognition"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/JIFS-233225","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T18:51:36Z","timestamp":1769539896000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/JIFS-233225"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2,14]]},"references-count":20,"journal-issue":{"issue":"2"},"URL":"https:\/\/doi.org\/10.3233\/jifs-233225","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,2,14]]}}}