{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,25]],"date-time":"2026-04-25T09:20:59Z","timestamp":1777108859739,"version":"3.51.4"},"reference-count":70,"publisher":"Tsinghua University Press","issue":"2","license":[{"start":{"date-parts":[[2020,6,1]],"date-time":"2020-06-01T00:00:00Z","timestamp":1590969600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,6,10]],"date-time":"2020-06-10T00:00:00Z","timestamp":1591747200000},"content-version":"vor","delay-in-days":9,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Comp. Visual. Med."],"published-print":{"date-parts":[[2020,6]]},"DOI":"10.1007\/s41095-020-0173-9","type":"journal-article","created":{"date-parts":[[2020,6,10]],"date-time":"2020-06-10T07:02:46Z","timestamp":1591772566000},"page":"191-204","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["S4Net: Single stage salient-instance segmentation"],"prefix":"10.26599","volume":"6","author":[{"given":"Ruochen","family":"Fan","sequence":"first","affiliation":[{"name":"BNRist, Tsinghua University, Beijing 100086, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ming-Ming","family":"Cheng","sequence":"additional","affiliation":[{"name":"Nankai University, Tianjin 300071, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qibin","family":"Hou","sequence":"additional","affiliation":[{"name":"Nankai University, Tianjin 300071, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tai-Jiang","family":"Mu","sequence":"additional","affiliation":[{"name":"BNRist, Tsinghua University, Beijing 100086, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jingdong","family":"Wang","sequence":"additional","affiliation":[{"name":"MSRA, Beijing 100086, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shi-Min","family":"Hu","sequence":"additional","affiliation":[{"name":"BNRist, Tsinghua University, Beijing 100086, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"11138","reference":[{"issue":"14","key":"173_CR1","doi-asserted-by":"publisher","first-page":"9596","DOI":"10.1073\/pnas.092277599","volume":"99","author":"F F Li","year":"2002","unstructured":"Li, F. F.; VanRullen, R.; Koch, C.; Perona, P. Rapid natural scene categorization in the near absence of attention. Proceedings of the National Academy of Sciences of the United States of America Vol. 99, No. 14, 9596\u20139601, 2002.","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"3","key":"173_CR2","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1167\/8.3.3","volume":"8","author":"L Elazary","year":"2008","unstructured":"Elazary, L.; Itti, L. Interesting objects are visually salient. Journal of Vision Vol. 8, No. 3, 3, 2008.","journal-title":"Journal of Vision"},{"issue":"4","key":"173_CR3","volume":"29","year":"2010","unstructured":"Cheng, M.-M.; Zhang, F.-L.; Mitra, N. J.; Huang, X.; Hu, S.-M. RepFinder: Finding approximately repeated scene elements for image editing. ACM Transactions on Graphics Vol. 29, No. 4, Article No. 83, 2010.","journal-title":"ACM Transactions on Graphics"},{"issue":"6","key":"173_CR4","volume":"29","year":"2010","unstructured":"Wu, H. S.; Wang, Y. S.; Feng, K. C.; Wong, T. T.; Lee, T. Y.; Heng, P. A. Resizing by symmetrysummarization. ACM Transactions on Graphics Vol. 29, No. 6, Article No. 159, 2010.","journal-title":"ACM Transactions on Graphics"},{"issue":"5","key":"173_CR5","volume":"28","year":"2009","unstructured":"Chen, T.; Cheng, M.-M.; Tan, P.; Shamir, A.; Hu, S.-M. Sketch2photo: Internet image montage. ACM Transactions on Graphics Vol. 28, No. 5, Article No. 124, 2009.","journal-title":"ACM Transactions on Graphics"},{"key":"173_CR6","volume-title":"Proceedings of the Robotics: Science and Systems","author":"C Wu","year":"2014","unstructured":"Wu, C.; Lenz, I.; Saxena, A. Hierarchical semantic labeling for task-relevant RGB-D perception. In: Proceedings of the Robotics: Science and Systems, 2014."},{"issue":"2","key":"173_CR7","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1007\/s41095-019-0149-9","volume":"5","author":"A Borji","year":"2019","unstructured":"Borji, A.; Cheng, M.-M.; Hou, Q.; Jiang, H.; Li, J. Salient object detection: A survey. Computational Visual Media Vol. 5, No. 2, 117\u2013150, 2019.","journal-title":"Computational Visual Media"},{"issue":"3","key":"173_CR8","doi-asserted-by":"publisher","first-page":"740","DOI":"10.1109\/TPAMI.2018.2815601","volume":"41","author":"Z Bylinskii","year":"2019","unstructured":"Bylinskii, Z.; Judd, T.; Oliva, A.; Torralba, A.; Durand, F. What do different evaluation metrics tell us about saliency models? IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 41, No. 3, 740\u2013757, 2019.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"173_CR9","first-page":"2386","volume-title":"Instance-level salient object segmentation","author":"G Li","year":"2017","unstructured":"Li, G.; Xie, Y.; Lin, L.; Yu, Y. Instance-level salient object segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2386\u20132395, 2017."},{"issue":"6","key":"173_CR10","doi-asserted-by":"publisher","first-page":"495","DOI":"10.1038\/nrn1411","volume":"5","author":"J M Wolfe","year":"2004","unstructured":"Wolfe, J. M.; Horowitz, T. S. What attributes guide the deployment of visual attention and how do they do it? Nature Reviews Neuroscience Vol. 5, No. 6, 495\u2013501, 2004.","journal-title":"Nature Reviews Neuroscience"},{"issue":"1","key":"173_CR11","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1146\/annurev.ne.18.030195.001205","volume":"18","author":"R Desimone","year":"1995","unstructured":"Desimone, R.; Duncan, J. Neural mechanisms of selective visual attention. Annual Review of Neuroscience Vol. 18, No. 1, 193\u2013222, 1995.","journal-title":"Annual Review of Neuroscience"},{"issue":"6","key":"173_CR12","doi-asserted-by":"publisher","first-page":"R247","DOI":"10.1016\/j.cub.2009.02.020","volume":"19","author":"S K Mannan","year":"2009","unstructured":"Mannan, S. K.; Kennard, C.; Husain, M. The role of visual salience in directing eye movements in visual object agnosia. Current Biology Vol. 19, No. 6, R247\u2013R248, 2009.","journal-title":"Current Biology"},{"issue":"11","key":"173_CR13","doi-asserted-by":"publisher","first-page":"1254","DOI":"10.1109\/34.730558","volume":"20","author":"L Itti","year":"1998","unstructured":"Itti, L.; Koch, C.; Niebur, E. A model of saliencybased visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 20, No. 11, 1254\u20131259, 1998.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"3","key":"173_CR14","doi-asserted-by":"publisher","first-page":"194","DOI":"10.1038\/35058500","volume":"2","author":"L Itti","year":"2001","unstructured":"Itti, L.; Koc, C. Computational modeling of visual attention. Nature Reviews Neuroscience Vol. 2, No. 3, 194\u2013203, 2001.","journal-title":"Nature Reviews Neuroscience"},{"issue":"3","key":"173_CR15","doi-asserted-by":"publisher","first-page":"569","DOI":"10.1109\/TPAMI.2014.2345401","volume":"37","author":"M M Cheng","year":"2015","unstructured":"Cheng, M. M.; Mitra, N. J.; Huang, X. L.; Torr, P. H. S.; Hu, S. M. Global contrast based salient region detection. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 37, No. 3, 569\u2013582, 2015.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"173_CR16","first-page":"2083","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"H Z Jiang","year":"2013","unstructured":"Jiang, H. Z.; Wang, J. D.; Yuan, Z. J.; Wu, Y.; Zheng, N. N.; Li, S. P. Salient object detection: A discriminative regional feature integration approach. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2083\u20132090, 2013."},{"key":"173_CR17","first-page":"2814","volume-title":"Saliency optimization from robust background detection","author":"W Zhu","year":"2014","unstructured":"Zhu, W.; Liang, S.; Wei, Y.; Sun, J. Saliency optimization from robust background detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2814\u20132821, 2014."},{"issue":"3","key":"173_CR18","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1145\/1015706.1015720","volume":"23","author":"C Rother","year":"2004","unstructured":"Rother, C.; Kolmogorov, V.; Blake A. \u201cGrabCut\u201d: Interactive foreground extraction using iterated graph cuts. ACM Transactions on Graphics Vol. 23, No. 3, 309\u2013314, 2004.","journal-title":"ACM Transactions on Graphics"},{"issue":"4","key":"173_CR19","doi-asserted-by":"publisher","first-page":"815","DOI":"10.1109\/TPAMI.2018.2815688","volume":"41","author":"Q Hou","year":"2019","unstructured":"Hou, Q.; Cheng, M.-M.; Hu, X.; Borji, A.; Tu, Z.; Torr, P. H. S. Deeply supervised salient object detection with short connections. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 41, No. 4, 815\u2013828, 2019.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"173_CR20","first-page":"478","volume-title":"Deep contrast learning for salient object detection","author":"G Li","year":"2016","unstructured":"Li, G.; Yu, Y. Deep contrast learning for salient object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 478\u2013487, 2016."},{"key":"173_CR21","first-page":"3183","volume-title":"Deep networks for saliency detection via local estimation and global search","author":"L Wang","year":"2015","unstructured":"Wang, L.; Lu, H.; Ruan, X.; Yang, M.-H. Deep networks for saliency detection via local estimation and global search. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3183\u20133192, 2015."},{"key":"173_CR22","first-page":"3992","volume-title":"Convolutional feature masking for joint object and stuff segmentation","author":"J Dai","year":"2015","unstructured":"Dai, J.; He, K.; Sun, J. Convolutional feature masking for joint object and stuff segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3992\u20134000, 2015."},{"key":"173_CR23","first-page":"297","volume-title":"Computer Vision\u2013ECCV 2014. Lecture Notes in Computer Science, Vol. 8695","author":"B Hariharan","year":"2014","unstructured":"Hariharan, B.; Arbel\u00b4aez, P.; Girshick, R.; Malik, J. Simultaneous detection and segmentation. In: Computer Vision\u2013ECCV 2014. Lecture Notes in Computer Science, Vol. 8695. Fleet, D.; Pajdla, T.; Schiele, B.; Tuytelaars, T. Eds. Springer Cham, 297\u2013312, 2014."},{"key":"173_CR24","first-page":"447","volume-title":"Hypercolumns for object segmentation and fine-grained localization","author":"B Hariharan","year":"2015","unstructured":"Hariharan, B.; Arbelaez, P.; Girshick, R.; Malik, J. Hypercolumns for object segmentation and fine-grained localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 447\u2013456, 2015."},{"key":"173_CR25","first-page":"580","volume-title":"Rich feature hierarchies for accurate object detection and semantic segmentation","author":"R Girshick","year":"2014","unstructured":"Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 580\u2013587, 2014."},{"issue":"6","key":"173_CR26","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S Q Ren","year":"2017","unstructured":"Ren, S. Q.; He, K. M.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 39, No. 6, 1137\u20131149, 2017.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"173_CR27","volume-title":"Object detection via region-based fully convolutional networks","author":"J Dai","year":"2016","unstructured":"Dai, J.; Li, Y.; He, K.; Sun, J. R-FCN: Object detection via region-based fully convolutional networks. In: Proceedings of the Advances in Neural Information Processing Systems 29, 2016."},{"key":"173_CR28","first-page":"1440","volume-title":"Fast R-CNN","author":"R Girshick","year":"2015","unstructured":"Girshick, R. Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, 1440\u20131448, 2015."},{"issue":"9","key":"173_CR29","doi-asserted-by":"publisher","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","volume":"37","author":"K M He","year":"2015","unstructured":"He, K. M.; Zhang, X. Y.; Ren, S. Q.; Sun, J. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 37, No. 9, 1904\u20131916, 2015.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"173_CR30","first-page":"534","volume-title":"Computer Vision\u2013ECCV 2016. Lecture Notes in Computer Science, Vol. 9910","author":"J F Dai","year":"2016","unstructured":"Dai, J. F.; He, K. M.; Li, Y.; Ren, S. Q.; Sun, J. Instance-sensitive fully convolutional networks. In: Computer Vision\u2013ECCV 2016. Lecture Notes in Computer Science, Vol. 9910. Leibe, B.; Matas, J.; Sebe, N.; Welling, M. Eds. Springer Cham, 534\u2013549, 2016."},{"key":"173_CR31","first-page":"2961","volume-title":"Mask R-CNN","author":"K He","year":"2017","unstructured":"He, K.; Gkioxari, G.; Doll\u00b4ar, P.; Girshick, R. Mask R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, 2961\u20132969, 2017."},{"key":"173_CR32","first-page":"2117","volume-title":"Feature pyramid networks for object detection","author":"T-Y Lin","year":"2017","unstructured":"Lin, T.-Y.; Doll\u00b4ar, P.; Girshick, R. B.; He, K.; Hariharan, B.; Belongie, S. J. Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2117\u20132125, 2017."},{"issue":"11","key":"173_CR33","doi-asserted-by":"publisher","first-page":"2314","DOI":"10.1109\/TPAMI.2016.2636150","volume":"39","author":"Y C Wei","year":"2017","unstructured":"Wei, Y. C.; Liang, X. D.; Chen, Y. P.; Shen, X. H.; Cheng, M. M.; Feng, J. S.; Zhao, Y.; Yan, S. STC: A simple to complex framework for weakly-supervised semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 39, No. 11, 2314\u20132320, 2017.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"173_CR34","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1007\/978-3-319-78199-0_18","volume-title":"Energy Minimization Methods in Computer Vision and Pattern Recognition. Lecture Notes in Computer Science, Vol. 10746","author":"Q B Hou","year":"2018","unstructured":"Hou, Q. B.; Massiceti, D.; Dokania, P. K.; Wei, Y. C.; Cheng, M. M.; Torr, P. H. S. Bottom-up top-down cues for weakly-supervised semantic segmentation. In: Energy Minimization Methods in Computer Vision and Pattern Recognition. Lecture Notes in Computer Science, Vol. 10746. Pelillo, M.; Hancock, E. Eds. Springer Cham, 263\u2013277, 2018."},{"key":"173_CR35","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","volume":"115","author":"O Russakovsky","year":"2015","unstructured":"Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M. et al. ImageNet large scale visual recognition challenge International Journal of Computer Vision Vol. 115, 211\u2013252, 2015.","journal-title":"International Journal of Computer Vision"},{"issue":"1","key":"173_CR36","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1007\/s11263-014-0733-5","volume":"111","author":"M Everingham","year":"2015","unstructured":"Everingham, M.; Eslami, S. M. A.; van Gool, L.; Williams, C. K. I.; Winn, J.; Zisserman, A. The pascal visual object classes challenge: A retrospective. International Journal of Computer Vision Vol. 111, No. 1, 98\u2013136, 2015.","journal-title":"International Journal of Computer Vision"},{"key":"173_CR37","first-page":"5733","volume-title":"Unconstrained salient object detection via proposal subset optimization","author":"J M Zhang","year":"2016","unstructured":"Zhang, J. M.; Sclaroff, S.; Lin, Z.; Shen, X. H.; Price, B.; Mech, R. Unconstrained salient object detection via proposal subset optimization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 5733\u20135742, 2016."},{"issue":"1","key":"173_CR38","doi-asserted-by":"publisher","first-page":"128","DOI":"10.1109\/TPAMI.2016.2537320","volume":"39","author":"J Pont-Tuset","year":"2017","unstructured":"Pont-Tuset, J.; Arbelaez, P.; Barron, J. T.; Marques, F.; Malik, J. Multiscale combinatorial grouping for image segmentation and object proposal generation. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 39, No. 1, 128\u2013140, 2017.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"4","key":"173_CR39","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1007\/s41095-015-0028-y","volume":"1","author":"W Qi","year":"2015","unstructured":"Qi, W.; Cheng, M. M.; Borji, A.; Lu, H. C.; Bai, L. F. SaliencyRank: Two-stage manifold ranking for salient object detection. Computational Visual Media Vol. 1, No. 4, 309\u2013320, 2015.","journal-title":"Computational Visual Media"},{"issue":"12","key":"173_CR40","doi-asserted-by":"publisher","first-page":"5706","DOI":"10.1109\/TIP.2015.2487833","volume":"24","author":"A Borji","year":"2015","unstructured":"Borji, A.; Cheng, M. M.; Jiang, H. Z.; Li, J. Salient object detection: A benchmark. IEEE Transactions on Image Processing Vol. 24, No. 12, 5706\u20135722, 2015.","journal-title":"IEEE Transactions on Image Processing"},{"issue":"11","key":"173_CR41","doi-asserted-by":"publisher","first-page":"2274","DOI":"10.1109\/TPAMI.2012.120","volume":"34","author":"R Achanta","year":"2012","unstructured":"Achanta, R.; Shaji, A.; Smith, K.; Lucchi, A.; Fua, P.; S\u00fcsstrunk, S. SLIC superpixels compared to stateof- the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 34, No. 11, 2274\u20132282, 2012.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"2","key":"173_CR42","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1023\/B:VISI.0000022288.19776.77","volume":"59","author":"P F Felzenszwalb","year":"2004","unstructured":"Felzenszwalb, P. F.; Huttenlocher, D. P. Efficient graphbased image segmentation. International Journal of Computer Vision Vol. 59, No. 2, 167\u2013181, 2004.","journal-title":"International Journal of Computer Vision"},{"issue":"8","key":"173_CR43","doi-asserted-by":"publisher","first-page":"888","DOI":"10.1109\/34.868688","volume":"22","author":"J B Shi","year":"2000","unstructured":"Shi, J. B.; Malik, J. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 22, No. 8, 888\u2013905, 2000.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"2","key":"173_CR44","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1007\/s11263-016-0977-3","volume":"123","author":"J D Wang","year":"2017","unstructured":"Wang, J. D.; Jiang, H. Z.; Yuan, Z. J.; Cheng, M. M.; Hu, X. W.; Zheng, N. N. Salient object detection: A discriminative regional feature integration approach. International Journal of Computer Vision Vol. 123, No. 2, 251\u2013268, 2017.","journal-title":"International Journal of Computer Vision"},{"key":"173_CR45","first-page":"1265","volume-title":"S4Net: Single stage salient-instance segmentation 203 Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"R Zhao","year":"2015","unstructured":"Zhao, R.; Ouyang, W.; Li, H.; Wang, X. Saliency detection by multi-context deep learning. In: S4Net: Single stage salient-instance segmentation 203 Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1265\u20131274, 2015."},{"key":"173_CR46","first-page":"660","volume-title":"Deep saliency with encoded low level distance map and high level features","author":"G Lee","year":"2016","unstructured":"Lee, G.; Tai, Y.-W.; Kim, J. Deep saliency with encoded low level distance map and high level features. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 660\u2013668, 2016."},{"key":"173_CR47","first-page":"5455","volume-title":"Visual saliency based on multiscale deep features","author":"G Li","year":"2015","unstructured":"Li, G.; Yu, Y. Visual saliency based on multiscale deep features. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 5455\u20135463, 2015."},{"issue":"2","key":"173_CR48","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","volume":"60","author":"D G Lowe","year":"2004","unstructured":"Lowe, D. G. Distinctive image features from scaleinvariant keypoints. International Journal of Computer Vision Vol. 60, No. 2, 91\u2013110, 2004.","journal-title":"International Journal of Computer Vision"},{"issue":"3","key":"173_CR49","doi-asserted-by":"publisher","first-page":"346","DOI":"10.1016\/j.cviu.2007.09.014","volume":"110","author":"H Bay","year":"2008","unstructured":"Bay, H.; Ess, A.; Tuytelaars, T.; Van Gool, L. Speededup robust features (SURF). Computer Vision and Image Understanding Vol. 110, No. 3, 346\u2013359, 2008.","journal-title":"Computer Vision and Image Understanding"},{"key":"173_CR50","first-page":"886","volume":"1","author":"N Dalal","year":"2005","unstructured":"Dalal N.; Triggs, B. Histograms of oriented gradients for human detection. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 1, 886\u2013893, 2005.","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"173_CR51","volume-title":"arXiv preprint","author":"P Sermanet","year":"2013","unstructured":"Sermanet, P.; Eigen, D.; Zhang, X.; Mathieu, M.; Fergus, R.; LeCun, Y. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229, 2013."},{"issue":"2","key":"173_CR52","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","volume":"104","author":"J R Uijlings","year":"2013","unstructured":"Uijlings, J. R.; Van De Sande, K. E.; Gevers, T.; Smeulders, A. W. Selective search for object recognition. International Journal of Computer Vision Vol. 104, No. 2, 154\u2013171, 2013.","journal-title":"International Journal of Computer Vision"},{"key":"173_CR53","first-page":"3286","volume-title":"BING: Binarized normed gradients for objectness estimation at 300fps","author":"M-M Cheng","year":"2014","unstructured":"Cheng, M.-M.; Zhang, Z.; Lin, W.-Y.; Torr, P. BING: Binarized normed gradients for objectness estimation at 300fps. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3286\u20133293, 2014."},{"key":"173_CR54","volume-title":"Learning to segment object candidates","author":"P O Pinheiro","year":"2015","unstructured":"Pinheiro, P. O.; Collobert, R.; Doll\u00b4ar, P. Learning to segment object candidates. In: Proceedings of the Advances in Neural Information Processing Systems 28, 2015."},{"issue":"5","key":"173_CR55","doi-asserted-by":"publisher","first-page":"898","DOI":"10.1109\/TPAMI.2010.161","volume":"33","author":"P Arbel\u00b4aez","year":"2011","unstructured":"Arbel\u00b4aez, P.; Maire, M.; Fowlkes, C.; Malik, J. Contour detection and hierarchical image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 33, No. 5, 898\u2013916, 2011.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"173_CR56","first-page":"2359","volume-title":"Fully convolutional instance-aware semantic segmentation","author":"Y Li","year":"2017","unstructured":"Li, Y.; Qi, H.; Dai, J.; Ji, X.; Wei, Y. Fully convolutional instance-aware semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2359\u20132367, 2017."},{"key":"173_CR57","first-page":"770","volume-title":"Deep residual learning for image recognition","author":"K He","year":"2016","unstructured":"He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770\u2013778, 2016."},{"key":"173_CR58","first-page":"2980","volume-title":"Focal loss for dense object detection","author":"T-Y Lin","year":"2017","unstructured":"Lin, T.-Y.; Goyal, P.; Girshick, R.; He, K.; Doll\u00b4ar, P. Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, 2980\u20132988, 2017."},{"key":"173_CR59","unstructured":"Yosinski, J.; Clune, J.; Nguyen, A.; Fuchs, T.; Lipson, H. Understanding neural networks through deep visualization. arXiv preprint arXiv:1506.06579, 2015."},{"key":"173_CR60","first-page":"2881","volume-title":"Pyramid scene parsing network","author":"H Zhao","year":"2017","unstructured":"Zhao, H.; Shi, J.; Qi, X.; Wang, X.; Jia, J. Pyramid scene parsing network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2881\u20132890, 2017."},{"key":"173_CR61","volume-title":"arXiv preprint","author":"M Abadi","year":"2016","unstructured":"Abadi, M.; Agarwal, A.; Barham, P.; Brevdo, E.; Chen, Z.; Citro, C.; Corrado, G. S.; Davis, A.; Dean, J.; Devin, M. et al. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467, 2016."},{"key":"173_CR62","first-page":"740","volume-title":"Computer Vision\u2013ECCV 2014. Lecture Notes in Computer Science, Vol. 8693","author":"T Y Lin","year":"2014","unstructured":"Lin, T. Y.; Maire, M.; Belongie, S.; Hays, J.; Perona, P.; Ramanan, D.; Doll\u00b4ar, P.; Zitnick, C. L. Microsoft COCO: Common objects in context. In: Computer Vision\u2013ECCV 2014. Lecture Notes in Computer Science, Vol. 8693. Fleet, D.; Pajdla, T.; Schiele, B.; Tuytelaars, T. Eds. Springer Cham, 740\u2013755, 2014."},{"key":"173_CR63","volume-title":"arXiv preprint","author":"A G Howard","year":"2017","unstructured":"Howard, A. G.; Zhu, M.; Chen, B.; Kalenichenko, D.; Wang, W.; Weyand, T.; Andreetto, M.; Adam, H. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861, 2017."},{"key":"173_CR64","volume-title":"arXiv preprint","author":"K Simonyan","year":"2014","unstructured":"Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014."},{"key":"173_CR65","doi-asserted-by":"publisher","first-page":"196","DOI":"10.1007\/978-3-030-01267-0_12","volume-title":"Computer Vision\u2013ECCV 2018. Lecture Notes in Computer Science, Vol. 11219","author":"D P Fan","year":"2018","unstructured":"Fan, D. P.; Cheng, M. M.; Liu, J. J.; Gao, S. H.; Hou, Q. B.; Borji, A. Salient objects in clutter: Bringing salient object detection to the foreground. In: Computer Vision\u2013ECCV 2018. Lecture Notes in Computer Science, Vol. 11219. Ferrari, V.; Hebert, M.; Sminchisescu, C.; Weiss, Y. Eds. Springer Cham, 196\u2013212, 2018."},{"key":"173_CR66","first-page":"678","volume-title":"DHSNet: Deep hierarchical saliency network for salient object detection","author":"N Liu","year":"2016","unstructured":"Liu, N.; Han, J. DHSNet: Deep hierarchical saliency network for salient object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 678\u2013686, 2016."},{"key":"173_CR67","first-page":"695","volume-title":"Computer Vision\u2013ECCV 2016. Lecture Notes in Computer Science, Vol. 9908","author":"A Kolesnikov","year":"2016","unstructured":"Kolesnikov, A.; Lampert, C. H. Seed, expand and constrain: Three principles for weakly-supervised image segmentation. In: Computer Vision\u2013ECCV 2016. Lecture Notes in Computer Science, Vol. 9908. Leibe, B.; Matas, J.; Sebe, N.; Welling, M. Eds. Springer Cham, 695\u2013711, 2016."},{"key":"173_CR68","first-page":"6488","volume-title":"Object region mining with adversarial erasing: A simple classification to semantic segmentation approach","author":"Y C Wei","year":"2017","unstructured":"Wei, Y. C.; Feng, J. S.; Liang, X. D.; Cheng, M. M.; Zhao, Y.; Yan, S. C. Object region mining with adversarial erasing: A simple classification to semantic segmentation approach. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 6488\u20136496, 2017."},{"issue":"4","key":"173_CR69","doi-asserted-by":"publisher","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","volume":"40","author":"L C Chen","year":"2018","unstructured":"Chen, L. C.; Papandreou, G.; Kokkinos, I.; Murphy, K.; Yuille, A. L. DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and 204 R. Fan, M.-M. Cheng, Q. Hou, et al. fully connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 40, No. 4, 834\u2013848, 2018.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"173_CR70","first-page":"543","volume-title":"Computer Vision\u2013ECCV 2016. Lecture Notes in Computer Science, Vol. 9908","author":"J M Zhang","year":"2016","unstructured":"Zhang, J. M.; Lin, Z.; Brandt, J.; Shen, X. H.; Sclaroff, S. Top-down neural attention by excitation backprop. In: Computer Vision\u2013ECCV 2016. Lecture Notes in Computer Science, Vol. 9908. Leibe, B.; Matas, J.; Sebe, N.; Welling, M. Eds. Springer Cham, 543\u2013559, 2016."}],"container-title":["Computational Visual Media"],"original-title":[],"link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s41095-020-0173-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s41095-020-0173-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s41095-020-0173-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"},{"URL":"http:\/\/xplorestaging.ieee.org\/ielx8\/10750449\/10897466\/10897473.pdf?arnumber=10897473","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,5]],"date-time":"2025-11-05T18:38:30Z","timestamp":1762367910000},"score":1,"resource":{"primary":{"URL":"https:\/\/ieeexplore.ieee.org\/document\/10897473\/"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6]]},"references-count":70,"journal-issue":{"issue":"2"},"URL":"https:\/\/doi.org\/10.1007\/s41095-020-0173-9","relation":{},"ISSN":["2096-0662","2096-0433"],"issn-type":[{"value":"2096-0662","type":"electronic"},{"value":"2096-0433","type":"print"}],"subject":[],"published":{"date-parts":[[2020,6]]},"assertion":[{"value":"16 September 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 April 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 June 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}