{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T21:15:38Z","timestamp":1760217338794,"version":"build-2065373602"},"reference-count":31,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2015,8,27]],"date-time":"2015-08-27T00:00:00Z","timestamp":1440633600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004608","name":"Natural Science Foundation of Jiangsu Province","doi-asserted-by":"publisher","award":["BK20130451"],"award-info":[{"award-number":["BK20130451"]}],"id":[{"id":"10.13039\/501100004608","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100018615","name":"University Science Research Project of Jiangsu Province","doi-asserted-by":"publisher","award":["13KJB520025"],"award-info":[{"award-number":["13KJB520025"]}],"id":[{"id":"10.13039\/501100018615","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Open Fund of Jiangsu Province Key Laboratory for Remote Measuring and Control","award":["YCCK201402, YCCK201502"],"award-info":[{"award-number":["YCCK201402, YCCK201502"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>This paper studies the problem of detecting unknown objects within indoor environments in an active and natural manner. The visual saliency scheme utilizing both color and depth cues is proposed to arouse the interests of the machine system for detecting unknown objects at salient positions in a 3D scene. The 3D points at the salient positions are selected as seed points for generating object hypotheses using the 3D shape. We perform multi-class labeling on a Markov random field (MRF) over the voxels of the 3D scene, combining cues from object hypotheses and 3D shape. The results from MRF are further refined by merging the labeled objects, which are spatially connected and have high correlation between color histograms. Quantitative and qualitative evaluations on two benchmark RGB-D datasets illustrate the advantages of the proposed method. The experiments of object detection and manipulation performed on a mobile manipulator validate its effectiveness and practicability in robotic applications.<\/jats:p>","DOI":"10.3390\/s150921054","type":"journal-article","created":{"date-parts":[[2015,8,27]],"date-time":"2015-08-27T10:03:37Z","timestamp":1440669817000},"page":"21054-21074","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":19,"title":["Saliency-Guided Detection of Unknown Objects in RGB-D Indoor Scenes"],"prefix":"10.3390","volume":"15","author":[{"given":"Jiatong","family":"Bao","sequence":"first","affiliation":[{"name":"Department of Hydraulic, Energy and Power Engineering, Yangzhou University, Yangzhou 225000, China"},{"name":"Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48824, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yunyi","family":"Jia","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48824, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yu","family":"Cheng","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48824, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ning","family":"Xi","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48824, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2015,8,27]]},"reference":[{"doi-asserted-by":"crossref","unstructured":"Lai, K., Bo, L., Ren, X., and Fox, D. (2012, January 14\u201318). Detection-based object labeling in 3D scenes. Proceedings of the IEEE International Conference on Robotics and Automation, Saint Paul, MN, USA.","key":"ref_1","DOI":"10.1109\/ICRA.2012.6225316"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1284","DOI":"10.1177\/0278364911401765","article-title":"The MOPED framework: Object recognition and pose estimation for manipulation","volume":"30","author":"Collet","year":"2011","journal-title":"Int. J. Robot. Res."},{"doi-asserted-by":"crossref","unstructured":"Johnson-Roberson, M., Bohg, J., Bjorkman, M., and Kragic, D. (2010, January 18\u201322). Attention-based active 3D point cloud segmentation. Proceedings of the IEEE International Conference on Intelligent Robots and Systems, Taipei, Taiwan.","key":"ref_3","DOI":"10.1109\/IROS.2010.5649872"},{"doi-asserted-by":"crossref","unstructured":"Bjorkman, M., and Kragic, D. (2010, January 3\u20137). Active 3D scene segmentation and detection of unknown objects. Proceedings of the IEEE International Conference on Robotics and Automation, Anchorage, AK, USA.","key":"ref_4","DOI":"10.1109\/ROBOT.2010.5509973"},{"unstructured":"Mishra, A.K., Srivastav, A., and Aloimonos, Y. (2012, January 14\u201318). Segmenting \u201cSimple\u201d Objects Using RGB-D. Proceedings of the IEEE International Conference on Robotics and Automation, Saint Paul, MN, USA.","key":"ref_5"},{"unstructured":"Potapova, E., Varadarajan, K.M., Richtsfeld, A., Zillich, M., and Vincze, M. (June, January 31). Attention-driven object detection and segmentation of cluttered table scenes using 2.5D symmetry. Proceedings of the IEEE International Conference on Robotics and Automation, Hong Kong, China.","key":"ref_6"},{"doi-asserted-by":"crossref","unstructured":"Richtsfeld, A., Morwald, T., Prankl, J., Zillich, M., and Vincze, M. (2012, January 7\u201312). Segmentation of unknown objects in indoor environments. Proceedings of the IEEE International Conference on Intelligent Robots and Systems, Vilamoura, Portugal.","key":"ref_7","DOI":"10.1109\/IROS.2012.6385661"},{"unstructured":"Sanford, K. Smoothing Kinect Depth Frames in Real-Time. Available online: http:\/\/www.codeproject.com\/Articles\/317974\/KinectDepthSmoothing.","key":"ref_8"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1007\/BF00133571","article-title":"Active Vision","volume":"1","author":"Aloimonos","year":"1988","journal-title":"IJCV"},{"key":"ref_10","first-page":"219","article-title":"Shifts in selective visual attention: towards the underlying neural circuitry","volume":"4","author":"Koch","year":"1985","journal-title":"Hum. Neurbiol."},{"key":"ref_11","first-page":"10","article-title":"The iLab Neuromorphic Vision C++ Toolkit: Free tools for the next generation of vision algorithms","volume":"1","author":"Itti","year":"2004","journal-title":"Neuromorphic Eng."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1915","DOI":"10.1109\/TPAMI.2011.272","article-title":"Context-aware saliency detection","volume":"34","author":"Goferman","year":"2012","journal-title":"IEEE PAMI"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1109\/TPAMI.2014.2345401","article-title":"Global Contrast based Salient Region Detection","volume":"37","author":"Cheng","year":"2015","journal-title":"IEEE TPAMI"},{"doi-asserted-by":"crossref","unstructured":"Zhang, J., and Sclaroff, S. (2013, January 1\u20138). Saliency detection: A boolean map approach. Proceedings of the IEEE International Conference on Computer Vision, Sydney, Australia.","key":"ref_14","DOI":"10.1109\/ICCV.2013.26"},{"doi-asserted-by":"crossref","unstructured":"Ju, R., Ge, L., Geng, W., Ren, T., and Wu, G. (2014, January 27\u201330). Depth saliency based on anisotropic center-surround difference. Proceedings of the IEEE International Conference on Image Processing, Paris, France.","key":"ref_15","DOI":"10.1109\/ICIP.2014.7025222"},{"doi-asserted-by":"crossref","unstructured":"Desingh, K., Krishna, K.M., Rajan, D., and Jawahar, C.V. (2013, January 9\u201313). Depth really matters: Improving visual salient region detection with depth. Proceedings of the British Machine Vision Conference, Bristol, UK.","key":"ref_16","DOI":"10.5244\/C.27.98"},{"doi-asserted-by":"crossref","unstructured":"Bharath, R., Nicholas, L., and Cheng, X. (2013, January 12\u201314). Scalable scene understanding using saliency-guided object localization. Proceedings of the IEEE International Conference on Control and Automation, Hangzhou, China.","key":"ref_17","DOI":"10.1109\/ICCA.2013.6565074"},{"unstructured":"Gupta, S., Arbelaez, P., and Malik, J. Perceptual organization and recognition of indoor scenes from RGB-D images. Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, Portland, OR, USA.","key":"ref_18"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1222","DOI":"10.1109\/34.969114","article-title":"Fast approximate energy minimization via graph cuts","volume":"23","author":"Boykov","year":"2001","journal-title":"IEEE TPAMI"},{"doi-asserted-by":"crossref","unstructured":"Lai, K., Bo, L., Ren, X., and Fox, D. (2011, January 9\u201313). A large-scale hierarchical multi-view RGB-D object dataset. Proceedings of the International Conference on Robotics and Automation, Shanghai, China.","key":"ref_20","DOI":"10.1109\/ICRA.2011.5980382"},{"unstructured":"Judd, T., Ehinger, K., Durand, F., and Torralba, A. (October, January 29). Learning to predict where humans look. Proceedings of the 12th International Conference on Computer Vision, Kyoto, Japan.","key":"ref_21"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1167\/9.3.5","article-title":"Saliency, attention, and visual search: An information theoretic approach","volume":"9","author":"Bruce","year":"2009","journal-title":"J. Vis."},{"doi-asserted-by":"crossref","unstructured":"Kootstra, G., Nederveen, A., and Boer, B. (2008, January 1\u20134). Paying attention to symmetry. Proceedings of the British Machine Vision Conference, Leeds, UK.","key":"ref_23","DOI":"10.5244\/C.22.111"},{"doi-asserted-by":"crossref","unstructured":"Borji, A., Tavakoli, H., Sihite, D., and Itti, L. (2013, January 1\u20138). Analysis of scores, datasets, and models in visual saliency prediction. Proceedings of the 12th International Conference on Computer Vision, Sydney, Australia.","key":"ref_24","DOI":"10.1109\/ICCV.2013.118"},{"unstructured":"Bylinskii, Z., Judd, T., Durand, F., Oliva, A., and Torralba, A. MIT Saliency Benchmark. Available online: http:\/\/saliency.mit.edu\/.","key":"ref_25"},{"key":"ref_26","first-page":"545","article-title":"Graph-based visual saliency","volume":"19","author":"Harel","year":"2007","journal-title":"NIPS"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1109\/TPAMI.2011.146","article-title":"Image signature: Highlighting sparse salient regions","volume":"34","author":"Hou","year":"2012","journal-title":"PAMI"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1167\/8.7.32","article-title":"SUN: A Bayesian framework for saliency using natural statistics","volume":"8","author":"Zhang","year":"2008","journal-title":"J. Vis."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1109\/TPAMI.2004.1273918","article-title":"Learning to Detect Natural Image Boundaries Using Local Brightness, Color, and Texture Cues","volume":"26","author":"Martin","year":"2004","journal-title":"TPAMI"},{"unstructured":"Rodriguez, S., Burrus, N., and Abderrahim, M. (2013, January 21\u201324). 3D object reconstruction with a single RGB-Depth image. Proceedings of the International Conference on Computer Vision Theory and Applications, Barcelona, Spain.","key":"ref_30"},{"doi-asserted-by":"crossref","unstructured":"She, L., Cheng, Y., Chai, J.Y., Jia, Y., and Xi, N. (2014, January 25\u201329). Teaching robots new actions through natural language instructions. Proceedings of the IEEE International Symposium on Robot and Human Interactive Communication, Edinburgh, UK.","key":"ref_31","DOI":"10.1109\/ROMAN.2014.6926362"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/15\/9\/21054\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T20:47:30Z","timestamp":1760215650000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/15\/9\/21054"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,8,27]]},"references-count":31,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2015,9]]}},"alternative-id":["s150921054"],"URL":"https:\/\/doi.org\/10.3390\/s150921054","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2015,8,27]]}}}