{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T00:48:49Z","timestamp":1772498929817,"version":"3.50.1"},"reference-count":49,"publisher":"Association for Computing Machinery (ACM)","issue":"1s","license":[{"start":{"date-parts":[[2021,1,31]],"date-time":"2021-01-31T00:00:00Z","timestamp":1612051200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Science Foundation of China","doi-asserted-by":"crossref","award":["61902236"],"award-info":[{"award-number":["61902236"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"CERNET Innovation Project","award":["NGII20180617"],"award-info":[{"award-number":["NGII20180617"]}]},{"DOI":"10.13039\/501100004609","name":"Foundation of Henan Educational Committee","doi-asserted-by":"crossref","award":["19A520005"],"award-info":[{"award-number":["19A520005"]}],"id":[{"id":"10.13039\/501100004609","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"crossref","award":["2020YFB1006003"],"award-info":[{"award-number":["2020YFB1006003"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2021,1,31]]},"abstract":"<jats:p>Weakly supervised semantic segmentation under image-level annotations is effectiveness for real-world applications. The small and sparse discriminative regions obtained from an image classification network that are typically used as the important initial location of semantic segmentation also form the bottleneck. Although deep convolutional neural networks (DCNNs) have exhibited promising performances for single-label image classification tasks, images of the real-world usually contain multiple categories, which is still an open problem. So, the problem of obtaining high-confidence discriminative regions from multi-label classification networks remains unsolved. To solve this problem, this article proposes an innovative three-step framework within the perspective of multi-object proposal generation. First, an image is divided into candidate boxes using the object proposal method. The candidate boxes are sent to a single-classification network to obtain the discriminative regions. Second, the discriminative regions are aggregated to obtain a high-confidence seed map. Third, the seed cues grow on the feature maps of high-level semantics produced by a backbone segmentation network. Experiments are carried out on the PASCAL VOC 2012 dataset to verify the effectiveness of our approach, which is shown to outperform other baseline image segmentation methods.<\/jats:p>","DOI":"10.1145\/3419842","type":"journal-article","created":{"date-parts":[[2021,4,1]],"date-time":"2021-04-01T01:53:55Z","timestamp":1617242035000},"page":"1-19","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":59,"title":["A Weakly Supervised Semantic Segmentation Network by Aggregating Seed Cues: The Multi-Object Proposal Generation Perspective"],"prefix":"10.1145","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7781-856X","authenticated-orcid":false,"given":"Junsheng","family":"Xiao","sequence":"first","affiliation":[{"name":"School of Computer Engineering and Science, Shanghai University, China"}]},{"given":"Huahu","family":"Xu","sequence":"additional","affiliation":[{"name":"School of Computer Engineering and Science, Shanghai University, China"}]},{"given":"Honghao","family":"Gao","sequence":"additional","affiliation":[{"name":"School of Computer Engineering and Science, Shanghai University, China"}]},{"given":"Minjie","family":"Bian","sequence":"additional","affiliation":[{"name":"School of Computer Engineering and Science, Shanghai University, China"}]},{"given":"Yang","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer Engineering and Science, Shanghai University, China"}]}],"member":"320","published-online":{"date-parts":[[2021,3,31]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2019.2947482"},{"key":"e_1_2_1_2_1","first-page":"2932058","article-title":"Hierarchical deep click feature prediction for fine-grained image recognition","volume":"2019","author":"Yu Jun","year":"2019","unstructured":"Jun Yu , Min Tan , Hongyuan Zhang , Dacheng Tao , and Yong Rui . 2019 . Hierarchical deep click feature prediction for fine-grained image recognition . IEEE Transactions on Pattern Analysis and Machine Intelligence. https:\/\/doi.org\/10.1109\/TPAMI. 2019 . 2932058 10.1109\/TPAMI.2019.2932058 Jun Yu, Min Tan, Hongyuan Zhang, Dacheng Tao, and Yong Rui. 2019. Hierarchical deep click feature prediction for fine-grained image recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence. https:\/\/doi.org\/10.1109\/TPAMI.2019.2932058","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence. https:\/\/doi.org\/10.1109\/TPAMI."},{"key":"e_1_2_1_3_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201915)","author":"Long J.","unstructured":"J. Long , E. Shelhamer , and T. Darrell . 2015. Fully convolutional networks for semantic segmentation . In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201915) . 3431\u20133440. J. Long, E. Shelhamer, and T. Darrell. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201915). 3431\u20133440."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/2986459.2986472"},{"key":"e_1_2_1_5_1","unstructured":"L. C. Chen G. Papandreou I. Kokkinos K. Murphy and A. L. Yuille. 2014. Semantic image segmentation with deep convolutional nets and fully connected CRFs. arXiv preprint arXiv:1412.7062.  L. C. Chen G. Papandreou I. Kokkinos K. Murphy and A. L. Yuille. 2014. Semantic image segmentation with deep convolutional nets and fully connected CRFs. arXiv preprint arXiv:1412.7062."},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201916)","author":"Zhou B.","unstructured":"B. Zhou , A. Khosla , A. Lapedriza , A. Oliva , and A. Torralba . 2016. Learning deep features for discriminative localization . In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201916) . 2921\u20132929. B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, and A. Torralba. 2016. Learning deep features for discriminative localization. In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201916). 2921\u20132929."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.295913"},{"key":"e_1_2_1_8_1","unstructured":"Yunchao Gong et al. 2013. Deep convolutional ranking for multilabel image annotation. arXiv preprint arXiv:1312.4894.  Yunchao Gong et al. 2013. Deep convolutional ranking for multilabel image annotation. arXiv preprint arXiv:1312.4894."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00170"},{"key":"e_1_2_1_10_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201917)","author":"Jin B.","unstructured":"B. Jin , M. V. O. Segovia , and S. Susstrunk . 2017. Webly supervised semantic segmentation . In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201917) . 1705\u20131714. B. Jin, M. V. O. Segovia, and S. Susstrunk. 2017. Webly supervised semantic segmentation. In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201917). 1705\u20131714."},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201918)","author":"Shen T.","unstructured":"T. Shen , G. Lin , C. Shen , and R. Ian . 2018. Bootstrapping the performance of Webly supervised semantic segmentation . In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201918) . 1363\u20131371. T. Shen, G. Lin, C. Shen, and R. Ian. 2018. Bootstrapping the performance of Webly supervised semantic segmentation. In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR\u201918). 1363\u20131371."},{"key":"e_1_2_1_12_1","first-page":"1901","article-title":"HCP: A flexible CNN framework for multi-label image classification","volume":"38","author":"Wei Y.","year":"2015","unstructured":"Y. Wei , W. Xia , M. Lin , 2015 . HCP: A flexible CNN framework for multi-label image classification . IEEE Transactions on Software Engineering 38 , 9 (2015), 1901 \u2013 1907 . Y. Wei, W. Xia, M. Lin, et al. 2015. HCP: A flexible CNN framework for multi-label image classification. IEEE Transactions on Software Engineering 38, 9 (2015), 1901\u20131907.","journal-title":"IEEE Transactions on Software Engineering"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2016.01.015"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2017.2787986"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIE.2017.2739691"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-014-0733-5"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1093\/nsr\/nwx106"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-013-0620-5"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.191"},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition. 3159\u20133167","author":"Lin D.","unstructured":"D. Lin , J. Dai , J. Jia , K. He , and J. Sun . 2016. ScribbleSup: Scribble-supervised convolutional networks for semantic segmentation . In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 3159\u20133167 . D. Lin, J. Dai, J. Jia, K. He, and J. Sun. 2016. ScribbleSup: Scribble-supervised convolutional networks for semantic segmentation. In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 3159\u20133167."},{"key":"e_1_2_1_21_1","volume-title":"Proceedings of the International Conference on ECCV. 549\u2013565","author":"Bearman A.","unstructured":"A. Bearman , O. Russakovsky , V. Ferrari , and L. Fei-Fei . 2016. What's the point: Semantic segmentation with point supervision . In Proceedings of the International Conference on ECCV. 549\u2013565 . A. Bearman, O. Russakovsky, V. Ferrari, and L. Fei-Fei. 2016. What's the point: Semantic segmentation with point supervision. In Proceedings of the International Conference on ECCV. 549\u2013565."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.203"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition. 3781\u20133790","author":"Xu J.","unstructured":"J. Xu , A. G. Schwing , and R. Urtasun . 2015. Learning to segment under various forms of weak supervision . In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 3781\u20133790 . J. Xu, A. G. Schwing, and R. Urtasun. 2015. Learning to segment under various forms of weak supervision. In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 3781\u20133790."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2018.08.007"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition. 7104\u20137023","author":"Huang Z.","unstructured":"Z. Huang , X. Wang , J. Wang , W. Liu , and J. Wang . 2018. Weakly supervised semantic segmentation network with deep seeded region growing . In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 7104\u20137023 . Z. Huang, X. Wang, J. Wang, W. Liu, and J. Wang. 2018. Weakly supervised semantic segmentation network with deep seeded region growing. In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 7104\u20137023."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the International Conference on ECCV. 695\u2013711","author":"Kolesnikov A.","unstructured":"A. Kolesnikov and C. H. Lampert . 2016. Seed, expand and constrain: Three principles for weakly-supervised image segmentation . In Proceedings of the International Conference on ECCV. 695\u2013711 . A. Kolesnikov and C. H. Lampert. 2016. Seed, expand and constrain: Three principles for weakly-supervised image segmentation. In Proceedings of the International Conference on ECCV. 695\u2013711."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00147"},{"key":"e_1_2_1_28_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition. 1354\u20131362","author":"Wei Y.","unstructured":"Y. Wei , J. Feng , X. Liang , M.-M. Cheng , Y. Zhao , and S. Yan . 2017. Object region mining with adversarial erasing: A simple classification to semantic segmentation approach . In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 1354\u20131362 . Y. Wei, J. Feng, X. Liang, M.-M. Cheng, Y. Zhao, and S. Yan. 2017. Object region mining with adversarial erasing: A simple classification to semantic segmentation approach. In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 1354\u20131362."},{"key":"e_1_2_1_29_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition. 549\u2013559","author":"Hou Q.","year":"2018","unstructured":"Q. Hou , P. T. Jiang , Y. Wei , 2018 . Self-erasing network for integral object attention . In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 549\u2013559 . Q. Hou, P. T. Jiang, Y. Wei, et al. 2018. Self-erasing network for integral object attention. In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 549\u2013559."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126456"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.49"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.414"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the International Conference on the 13th European Conference on Computer VisIon. 391\u2013405","author":"Zitnick C.","unstructured":"C. Zitnick and P. Dollar . 2014. Edge boxes: Locating object proposals from edges . In Proceedings of the International Conference on the 13th European Conference on Computer VisIon. 391\u2013405 . C. Zitnick and P. Dollar. 2014. Edge boxes: Locating object proposals from edges. In Proceedings of the International Conference on the 13th European Conference on Computer VisIon. 391\u2013405."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1111\/coin.12202"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCBB.2016.2615606"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2020.2976573"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.868688"},{"key":"e_1_2_1_38_1","volume-title":"Proceedings of the International Conference on ECCV. 90\u2013105","author":"Qi X.","unstructured":"X. Qi , Z. Liu , J. Shi , H. Zhao , and J. Jia . 2016. Augmented feedback in semantic segmentation under image level supervision . In Proceedings of the International Conference on ECCV. 90\u2013105 . X. Qi, Z. Liu, J. Shi, H. Zhao, and J. Jia. 2016. Augmented feedback in semantic segmentation under image level supervision. In Proceedings of the International Conference on ECCV. 90\u2013105."},{"key":"e_1_2_1_39_1","unstructured":"K. Simonyan and A. Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.  K. Simonyan and A. Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2699184"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.271"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.image.2019.115648"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2019.04.095"},{"key":"e_1_2_1_44_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TCBB.2020.2991173","article-title":"A transfer learning based super-resolution microscopy for biopsy slice images: The joint methods perspective","volume":"99","author":"Chen Jintai","year":"2020","unstructured":"Jintai Chen , Haochao Ying , Xuechen Liu , Jingjing Gu , Ruiwei Feng , Tingting Chen , Honghao Gao , and Jian Wu . 2020 . A transfer learning based super-resolution microscopy for biopsy slice images: The joint methods perspective . IEEE\/ACM Transactions on Computational Biology and Bioinformatics (TCBB). 99 (2020), 1 \u2013 1 . https:\/\/doi.org\/10.1109\/TCBB.2020.2991173 10.1109\/TCBB.2020.2991173 Jintai Chen, Haochao Ying, Xuechen Liu, Jingjing Gu, Ruiwei Feng, Tingting Chen, Honghao Gao, and Jian Wu. 2020. A transfer learning based super-resolution microscopy for biopsy slice images: The joint methods perspective. IEEE\/ACM Transactions on Computational Biology and Bioinformatics (TCBB). 99 (2020), 1\u20131. https:\/\/doi.org\/10.1109\/TCBB.2020.2991173","journal-title":"IEEE\/ACM Transactions on Computational Biology and Bioinformatics (TCBB)."},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.209"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.203"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2636150"},{"key":"e_1_2_1_48_1","volume-title":"Proceedings of the International Conference on ECCV. 218\u2013234","author":"Shimoda W.","unstructured":"W. Shimoda and K. Yanai . 2016. Distinct class-specific saliency maps for weakly supervised semantic segmentation . In Proceedings of the International Conference on ECCV. 218\u2013234 . W. Shimoda and K. Yanai. 2016. Distinct class-specific saliency maps for weakly supervised semantic segmentation. In Proceedings of the International Conference on ECCV. 218\u2013234."},{"key":"e_1_2_1_49_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition. 3529\u20133538","author":"Roy A.","unstructured":"A. Roy and S. Todorovic . 2017. Combining bottom-up, top-down, and smoothness cues for weakly supervised image segmentation . In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 3529\u20133538 . A. Roy and S. Todorovic. 2017. Combining bottom-up, top-down, and smoothness cues for weakly supervised image segmentation. In Proceedings of the International Conference on Computer Vision and Pattern Recognition. 3529\u20133538."}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3419842","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3419842","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:32:03Z","timestamp":1750195923000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3419842"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,31]]},"references-count":49,"journal-issue":{"issue":"1s","published-print":{"date-parts":[[2021,1,31]]}},"alternative-id":["10.1145\/3419842"],"URL":"https:\/\/doi.org\/10.1145\/3419842","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,31]]},"assertion":[{"value":"2020-03-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-03-31","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}