{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T18:37:25Z","timestamp":1775932645182,"version":"3.50.1"},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,10,16]],"date-time":"2021-10-16T00:00:00Z","timestamp":1634342400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,10,16]],"date-time":"2021-10-16T00:00:00Z","timestamp":1634342400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["Grant No.61671412"],"award-info":[{"award-number":["Grant No.61671412"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004731","name":"Natural Science Foundation of Zhejiang Province","doi-asserted-by":"publisher","award":["Grant No.LY19F010002"],"award-info":[{"award-number":["Grant No.LY19F010002"]}],"id":[{"id":"10.13039\/501100004731","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004731","name":"Natural Science Foundation of Zhejiang Province","doi-asserted-by":"publisher","award":["Grant No.LY21F010014"],"award-info":[{"award-number":["Grant No.LY21F010014"]}],"id":[{"id":"10.13039\/501100004731","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100007834","name":"Natural Science Foundation of Ningbo","doi-asserted-by":"publisher","award":["Grant No.2018A610053"],"award-info":[{"award-number":["Grant No.2018A610053"]}],"id":[{"id":"10.13039\/100007834","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100007834","name":"Natural Science Foundation of Ningbo","doi-asserted-by":"publisher","award":["Grant No.202003N4323"],"award-info":[{"award-number":["Grant No.202003N4323"]}],"id":[{"id":"10.13039\/100007834","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Ningbo Municipal Projects for Leading and Top Talents","award":["Grant No.NBLJ201801006"],"award-info":[{"award-number":["Grant No.NBLJ201801006"]}]},{"name":"General Scientific Research Project of Zhejiang Education Department","award":["Grant No.Y201941122"],"award-info":[{"award-number":["Grant No.Y201941122"]}]},{"name":"Commonweal Projects of Zhejiang Province","award":["Grant No.LGN20F010001"],"award-info":[{"award-number":["Grant No.LGN20F010001"]}]},{"name":"General Project of Zhejiang Education Department","award":["Grant No.Y201940951"],"award-info":[{"award-number":["Grant No.Y201940951"]}]},{"name":"the School Level Scientific Research and Innovation Team Project"},{"name":"Fundamental Research Funds for Zhejiang Provincial Colleges and Universities"},{"name":"College Students Innovative Training Project"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["EURASIP J. Adv. Signal Process."],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Text detection is a key technique and plays an important role in computer vision applications, but efficient and precise text detection is still challenging. In this paper, an efficient scene text detection scheme is proposed based on the Progressive Scale Expansion Network (PSENet). A Mixed Pooling Module (MPM) is designed to effectively capture the dependence of text information at different distances, where different pooling operations are employed to better extract information of text shape. The backbone network is optimized by combining two extensions of the Residual Network (ResNet), i.e., ResNeXt and Res2Net, to enhance feature extraction effectiveness. Experimental results show that the precision of our scheme is improved more than by 5% compared with the original PSENet.<\/jats:p>","DOI":"10.1186\/s13634-021-00808-5","type":"journal-article","created":{"date-parts":[[2021,10,16]],"date-time":"2021-10-16T17:20:09Z","timestamp":1634404809000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["PSENet-based efficient scene text detection"],"prefix":"10.1186","volume":"2021","author":[{"given":"Guanglong","family":"Liao","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4095-7128","authenticated-orcid":false,"given":"Zhongjie","family":"Zhu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yongqiang","family":"Bai","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tingna","family":"Liu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhibo","family":"Xie","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,10,16]]},"reference":[{"issue":"2","key":"808_CR1","doi-asserted-by":"publisher","first-page":"1264","DOI":"10.1109\/TITS.2020.2967316","volume":"22","author":"W Kazmi","year":"2021","unstructured":"W. Kazmi, I. Nabney, G. Vogiatzis, P. Rose, A. Codd, An efficient industrial system for vehicle tyre (Tire) detection and text recognition using deep learning. IEEE Trans. Intell. Transp. Syst. 22(2), 1264\u20131275 (2021). https:\/\/doi.org\/10.1109\/TITS.2020.2967316","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"808_CR2","doi-asserted-by":"publisher","first-page":"2918","DOI":"10.1109\/TIP.2019.2954218","volume":"29","author":"Y Liu","year":"2020","unstructured":"Y. Liu, L. Jin, C. Fang, Arbitrarily shaped scene text detection with a mask tightness text detector. IEEE Trans. Image Process. 29, 2918\u20132930 (2020). https:\/\/doi.org\/10.1109\/TIP.2019.2954218","journal-title":"IEEE Trans. Image Process."},{"key":"808_CR3","doi-asserted-by":"publisher","unstructured":"P. Cheng, Y. Cai, W. Wang, \u201cA direct regression scene text detector with position-sensitive segmentation. IEEE Trans. Circuits Syst. Video Technol., 30(11): 4171\u20134181 (2020). https:\/\/doi.org\/10.1109\/TCSVT.2019.2947475","DOI":"10.1109\/TCSVT.2019.2947475"},{"key":"808_CR4","doi-asserted-by":"publisher","unstructured":"P. N. C. a. w. P. Shivakumara, R. Raghavendra, S. Nag, U. Pal, T. Lu, D. Lopresti, \"An episodic learning network for text detection on human bodies in sports images,\" In IEEE Transactions on Circuits and Systems for Video Technology, 1\u20131 (2021). https:\/\/doi.org\/10.1109\/TCSVT.2021.3092713","DOI":"10.1109\/TCSVT.2021.3092713"},{"issue":"6","key":"808_CR5","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/tpami.2016.2577031","volume":"39","author":"S Ren","year":"2017","unstructured":"S. Ren, K. He, R. Girshick, J. Sun, Faster RCNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell 39(6), 1137\u20131149 (2017). https:\/\/doi.org\/10.1109\/tpami.2016.2577031","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"808_CR6","doi-asserted-by":"publisher","unstructured":"W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Y. Fu, A.C. Berg, \u201cSSD: single shot multibox detector,\u201d In European Conference on Computer Vision (ECCV), 21\u201337 (2016). https:\/\/doi.org\/10.1007\/978-3-319-46448-0_2","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"808_CR7","doi-asserted-by":"publisher","unstructured":"J. Redmon, S. Divvala, R. Girshick, A. Farhadi, \u201cYou only look once: unified, real-time object detection,\u201d In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 779\u2013788 (2016). https:\/\/doi.org\/10.1109\/CVPR.2016.91","DOI":"10.1109\/CVPR.2016.91"},{"key":"808_CR8","unstructured":"L. Huang, Y. Yang, Y. Deng, Y. Yu, \u201cDenseBox: unifying landmark localization with end to end object detection,\u201d arXiv preprint arXiv:1509.04874(2015)"},{"key":"808_CR9","unstructured":"M. Liao, B. Shi, X. Bai, X. Wang, and W. Liu, \u201cTextBoxes: A Fast Text Detector with a Single Deep Neural Network,\u201d In The National Conference on Artificial Intelligence (AAAI), 4161\u20134167 (2017)."},{"key":"808_CR10","doi-asserted-by":"publisher","unstructured":"M. Liao, B. Shi, X. Bai, \u201cTextBoxes++: a single-shot oriented scene text detector,\u201d IEEE Trans. Image Process., 3676\u20133690 (2018). http:\/\/dx.doi.org\/https:\/\/doi.org\/10.1109\/TIP.2018.2825107.","DOI":"10.1109\/TIP.2018.2825107"},{"key":"808_CR11","doi-asserted-by":"publisher","unstructured":"X. Zhou, C. Yao, H. Wen, Y. Wang, S. Zhou, W. He, J. Liang, \u201cEAST: an efficient and accurate scene text detector,\u201d In the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2642\u20132651 (2017). https:\/\/doi.org\/10.1109\/CVPR.2017.283","DOI":"10.1109\/CVPR.2017.283"},{"issue":"11","key":"808_CR12","doi-asserted-by":"publisher","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","volume":"20","author":"J Ma","year":"2018","unstructured":"J. Ma, W. Shao, H. Ye, L. Wang, H. Wang, Y. Zheng, X. Xue, Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimedia 20(11), 3111\u20133122 (2018). https:\/\/doi.org\/10.1109\/TMM.2018.2818020","journal-title":"IEEE Trans. Multimedia"},{"key":"808_CR13","doi-asserted-by":"crossref","unstructured":"L. Tychsen-Smith, L. Petersson, \u201cDeNet: scalable realtime object detection with directed sparse sampling,\u201d In IEEE International Conference on Computer Vision (ICCV), 428\u2013436 (2017)","DOI":"10.1109\/ICCV.2017.54"},{"key":"808_CR14","doi-asserted-by":"publisher","unstructured":"L. Pengyuan et al., \u201cMulti-oriented scene text detection via corner localization and region segmentation,\u201d In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 7553\u20137563 (2018). https:\/\/doi.org\/10.1109\/CVPR.2018.00788","DOI":"10.1109\/CVPR.2018.00788"},{"key":"808_CR15","unstructured":"X. Wang, K. Chen, Z. Huang, C. Yao, W. Liu, \u201cPoint linking network for object detection,\u201d arXiv preprint arXiv: 1706.03646 (2017)"},{"key":"808_CR16","doi-asserted-by":"crossref","unstructured":"D. Deng, H. Liu, X. Li, D. Cai, \u201cPixelLink: detecting scene text via instance segmentation,\u201d In The National Conference on Artificial Intelligence (AAAI), 6773\u20136780 (2018)","DOI":"10.1609\/aaai.v32i1.12269"},{"key":"808_CR17","doi-asserted-by":"publisher","unstructured":"Z. Zhang, C. Zhang, W. Shen, C. Yao, W. Liu, X. Bai, \u201cMulti-oriented text detection with fully convolutional networks,\u201d In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4159\u20134167 (2016). https:\/\/doi.org\/10.1109\/CVPR.2016.451","DOI":"10.1109\/CVPR.2016.451"},{"key":"808_CR18","doi-asserted-by":"crossref","unstructured":"X. Enze et al., \u201cScene text detection with supervised pyramid context network,\u201d In The National Conference on Artificial Intelligence (AAAI), 9038\u20139045 (2019)","DOI":"10.1609\/aaai.v33i01.33019038"},{"key":"808_CR19","doi-asserted-by":"publisher","unstructured":"L. Shangbang et al., \u201cTextSnake: a flexible representation for detecting text of arbitrary shapes,\u201d In European Conference on Computer Vision (ECCV), 20\u201336 (2018). https:\/\/doi.org\/10.1007\/978-3-030-01216-8_2","DOI":"10.1007\/978-3-030-01216-8_2"},{"key":"808_CR20","doi-asserted-by":"publisher","unstructured":"W. Wenhai et al., \u201cShape robust text detection with progressive scale expansion network,\u201d In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 9336\u20139345 (2019). https:\/\/doi.org\/10.1109\/CVPR.2019.00956","DOI":"10.1109\/CVPR.2019.00956"},{"key":"808_CR21","doi-asserted-by":"publisher","unstructured":"H. Qibin et al., \u201cStrip pooling: rethinking spatial pooling for scene parsing,\u201d In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4003\u20134012 (2020). https:\/\/doi.org\/10.1109\/CVPR42600.2020.00406","DOI":"10.1109\/CVPR42600.2020.00406"},{"key":"808_CR22","doi-asserted-by":"publisher","unstructured":"S. Xie, R. Girshick, P. Doll\u00e1r, Z. Tu, K. He, \u201cAggregated residual transformations for deep neural networks,\u201d In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 5987\u20135995 (2017). https:\/\/doi.org\/10.1109\/CVPR.2017.634","DOI":"10.1109\/CVPR.2017.634"},{"issue":"2","key":"808_CR23","doi-asserted-by":"publisher","first-page":"652","DOI":"10.1109\/TPAMI.2019.2938758","volume":"43","author":"S Gao","year":"2021","unstructured":"S. Gao, M. Cheng, K. Zhao et al., Res2Net: a new multi-scale backbone architecture. IEEE Trans. Pattern Anal. Mach. Intell. 43(2), 652\u2013662 (2021). https:\/\/doi.org\/10.1109\/TPAMI.2019.2938758","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"808_CR24","doi-asserted-by":"publisher","unstructured":"K. He, X. Zhang, S. Ren, and J. Sun, \u201cDeep residual learning for image recognition,\u201d In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770\u2013778 (2016). https:\/\/doi.org\/10.1109\/CVPR.2016.90","DOI":"10.1109\/CVPR.2016.90"},{"key":"808_CR25","doi-asserted-by":"publisher","unstructured":"C. Szegedy, W. Liu, Y. Jia et al., \u201cgoing deeper with convolutions,\u201d In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1\u20139 (2015). https:\/\/doi.org\/10.1109\/CVPR.2015.7298594","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"808_CR26","doi-asserted-by":"publisher","unstructured":"T. Zhi, W. Huang, H. Tong, et al., \u201cDetecting text in natural image with connectionist text proposal network,\u201d In European Conference on Computer Vision (ECCV), 56\u201372 (2016). https:\/\/doi.org\/10.1007\/978-3-319-46484-8_4","DOI":"10.1007\/978-3-319-46484-8_4"},{"key":"808_CR27","doi-asserted-by":"publisher","unstructured":"B. Shi, X. Bai, S. Belongie, \u201cDetecting oriented text in natural images by linking segments,\u201d In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 3482\u20133490 (2017). https:\/\/doi.org\/10.1109\/CVPR.2017.371","DOI":"10.1109\/CVPR.2017.371"},{"key":"808_CR28","doi-asserted-by":"publisher","unstructured":"H. Hu, C. Zhang, Y. Luo, Y. Wang, J. Han, E. Ding, \u201cWordSup: exploiting word annotations for character based text detection,\u201d In IEEE International Conference on Computer Vision (ICCV), 4950\u20134959 (2017). https:\/\/doi.org\/10.1109\/ICCV.2017.529","DOI":"10.1109\/ICCV.2017.529"},{"key":"808_CR29","doi-asserted-by":"crossref","unstructured":"H. Zhao, J. Shi, X. Qi, X. Wang, J. Jia, \u201cPyramid scene parsing network,\u201d In CVPR, 6230\u20136239 (2017)","DOI":"10.1109\/CVPR.2017.660"},{"key":"808_CR30","doi-asserted-by":"crossref","unstructured":"J. He, Z. Deng, L. Zhou, Y. Wang, Y. Qiao, \u201cAdaptive pyramid context network for semantic segmentation,\u201d In CVPR, 7519\u20137528 (2019)","DOI":"10.1109\/CVPR.2019.00770"},{"key":"808_CR31","doi-asserted-by":"publisher","unstructured":"Y. Liu, J. Yan, Y. Xiang, \u201cResearch on license plate recognition algorithm based on ABCNet,\u201d In IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE), 465\u2013469 (2020). https:\/\/doi.org\/10.1109\/ICISCAE51034.2020.9236855","DOI":"10.1109\/ICISCAE51034.2020.9236855"},{"issue":"2","key":"808_CR32","doi-asserted-by":"publisher","first-page":"532","DOI":"10.1109\/TPAMI.2019.2937086","volume":"43","author":"M Liao","year":"2021","unstructured":"M. Liao, P. Lyu, M. He, C. Yao, W. Wu, X. Bai, Mask TextSpotter: an end-to-end trainable neural network for spotting text with arbitrary shapes. IEEE Trans. Pattern Anal. Mach. Intell. 43(2), 532\u2013548 (2021). https:\/\/doi.org\/10.1109\/TPAMI.2019.2937086","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"808_CR33","doi-asserted-by":"publisher","unstructured":"W. Feng, W. He, F. Yin, X. Zhang, C. Liu, \u201cTextDragon: an end-to-end framework for arbitrary shaped text spotting,\u201d In IEEE International Conference on Computer Vision (ICCV), 9075\u20139084 (2019), https:\/\/doi.org\/10.1109\/ICCV.2019.00917","DOI":"10.1109\/ICCV.2019.00917"}],"container-title":["EURASIP Journal on Advances in Signal Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13634-021-00808-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13634-021-00808-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13634-021-00808-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,12]],"date-time":"2023-01-12T13:30:37Z","timestamp":1673530237000},"score":1,"resource":{"primary":{"URL":"https:\/\/asp-eurasipjournals.springeropen.com\/articles\/10.1186\/s13634-021-00808-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,16]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["808"],"URL":"https:\/\/doi.org\/10.1186\/s13634-021-00808-5","relation":{},"ISSN":["1687-6180"],"issn-type":[{"value":"1687-6180","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,10,16]]},"assertion":[{"value":"30 April 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 October 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 October 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"97"}}