{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:16:01Z","timestamp":1750220161859,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":57,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,12,13]],"date-time":"2022-12-13T00:00:00Z","timestamp":1670889600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U21A20514, 61872307, 62071404"],"award-info":[{"award-number":["U21A20514, 61872307, 62071404"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003392","name":"Natural Science Foundation of Fujian Province","doi-asserted-by":"publisher","award":["2020J01001"],"award-info":[{"award-number":["2020J01001"]}],"id":[{"id":"10.13039\/501100003392","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,12,13]]},"DOI":"10.1145\/3551626.3564980","type":"proceedings-article","created":{"date-parts":[[2022,12,7]],"date-time":"2022-12-07T00:55:45Z","timestamp":1670374545000},"page":"1-7","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["An End-to-End Scene Text Detector with Dynamic Attention"],"prefix":"10.1145","author":[{"given":"Jingyu","family":"Lin","sequence":"first","affiliation":[{"name":"Xiamen University, Xiamen, China"}]},{"given":"Yan","family":"Yan","sequence":"additional","affiliation":[{"name":"Xiamen University, Xiamen, China"}]},{"given":"Hanzi","family":"Wang","sequence":"additional","affiliation":[{"name":"Xiamen University, Xiamen, China"}]}],"member":"320","published-online":{"date-parts":[[2022,12,13]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"Youngmin Baek Bado Lee Dongyoon Han Sangdoo Yun and Hwalsuk Lee. 2019. Character region awareness for text detection. In CVPR.  Youngmin Baek Bado Lee Dongyoon Han Sangdoo Yun and Hwalsuk Lee. 2019. Character region awareness for text detection. In CVPR.","DOI":"10.1109\/CVPR.2019.00959"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Nicolas Carion Francisco Massa Gabriel Synnaeve Nicolas Usunier Alexander Kirillov and Sergey Zagoruyko. 2020. End-to-end object detection with transformers. In ECCV.  Nicolas Carion Francisco Massa Gabriel Synnaeve Nicolas Usunier Alexander Kirillov and Sergey Zagoruyko. 2020. End-to-end object detection with transformers. In ECCV.","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"crossref","unstructured":"Yinpeng Chen Xiyang Dai Mengchen Liu Dongdong Chen Lu Yuan and Zicheng Liu. 2020. Dynamic ReLU. In ECCV.  Yinpeng Chen Xiyang Dai Mengchen Liu Dongdong Chen Lu Yuan and Zicheng Liu. 2020. Dynamic ReLU. In ECCV.","DOI":"10.1007\/978-3-030-58529-7_21"},{"key":"e_1_3_2_1_4_1","volume-title":"Total-text: A comprehensive dataset for scene text detection and recognition. In ICDAR.","author":"Ch'ng Chee Kheng","year":"2017","unstructured":"Chee Kheng Ch'ng and Chee Seng Chan . 2017 . Total-text: A comprehensive dataset for scene text detection and recognition. In ICDAR. Chee Kheng Ch'ng and Chee Seng Chan. 2017. Total-text: A comprehensive dataset for scene text detection and recognition. In ICDAR."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Jifeng Dai Haozhi Qi Yuwen Xiong Yi Li Guodong Zhang Han Hu and Yichen Wei. 2017. Deformable convolutional networks. In ICCV.  Jifeng Dai Haozhi Qi Yuwen Xiong Yi Li Guodong Zhang Han Hu and Yichen Wei. 2017. Deformable convolutional networks. In ICCV.","DOI":"10.1109\/ICCV.2017.89"},{"key":"e_1_3_2_1_6_1","unstructured":"Xiyang Dai Yinpeng Chen Bin Xiao Dongdong Chen Mengchen Liu Lu Yuan and Lei Zhang. 2021. Dynamic head: unifying object detection heads with attentions. CVPR.  Xiyang Dai Yinpeng Chen Bin Xiao Dongdong Chen Mengchen Liu Lu Yuan and Lei Zhang. 2021. Dynamic head: unifying object detection heads with attentions. CVPR."},{"key":"e_1_3_2_1_7_1","unstructured":"Xiyang Dai Yinpeng Chen Jianwei Yang Pengchuan Zhang Lu Yuan and Lei Zhang. 2021. Dynamic DETR: End-to-end object detection with dynamic attention. In ICCV.  Xiyang Dai Yinpeng Chen Jianwei Yang Pengchuan Zhang Lu Yuan and Lei Zhang. 2021. Dynamic DETR: End-to-end object detection with dynamic attention. In ICCV."},{"key":"e_1_3_2_1_8_1","unstructured":"Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai Thomas Unterthiner Mostafa Dehghani Matthias Minderer Georg Heigold Sylvain Gelly Jakob Uszkoreit and Neil Houlsby. 2021. An image is worth 16x16 words: Transformers for image recognition at scale. In ICLR.  Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai Thomas Unterthiner Mostafa Dehghani Matthias Minderer Georg Heigold Sylvain Gelly Jakob Uszkoreit and Neil Houlsby. 2021. An image is worth 16x16 words: Transformers for image recognition at scale. In ICLR."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Ankush Gupta Andrea Vedaldi and Andrew Zisserman. 2016. Synthetic data for text localisation in natural images. In CVPR.  Ankush Gupta Andrea Vedaldi and Andrew Zisserman. 2016. Synthetic data for text localisation in natural images. In CVPR.","DOI":"10.1109\/CVPR.2016.254"},{"key":"e_1_3_2_1_10_1","unstructured":"Kaiming He Georgia Gkioxari Piotr Doll\u00e1r and Ross Girshick. 2017. Mask R-cnn. In ICCV.  Kaiming He Georgia Gkioxari Piotr Doll\u00e1r and Ross Girshick. 2017. Mask R-cnn. In ICCV."},{"key":"e_1_3_2_1_11_1","unstructured":"Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR.  Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR."},{"key":"e_1_3_2_1_12_1","volume-title":"MOST: A multi-oriented scene text detector with localization refinement. In CVPR.","author":"He Minghang","year":"2021","unstructured":"Minghang He , Minghui Liao , Zhibo Yang , Humen Zhong , Jun Tang , Wenqing Cheng , Cong Yao , Yongpan Wang , and Xiang Bai . 2021 . MOST: A multi-oriented scene text detector with localization refinement. In CVPR. Minghang He, Minghui Liao, Zhibo Yang, Humen Zhong, Jun Tang, Wenqing Cheng, Cong Yao, Yongpan Wang, and Xiang Bai. 2021. MOST: A multi-oriented scene text detector with localization refinement. In CVPR."},{"key":"e_1_3_2_1_13_1","unstructured":"Wenhao He Xu-Yao Zhang Fei Yin and Cheng-Lin Liu. 2017. Deep direct regression for multi-oriented scene text detection. In ICCV.  Wenhao He Xu-Yao Zhang Fei Yin and Cheng-Lin Liu. 2017. Deep direct regression for multi-oriented scene text detection. In ICCV."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","unstructured":"Jie Hu Li Shen Samuel Albanie Gang Sun and Enhua Wu. 2018. Squeeze-and-excitation networks. In CVPR.  Jie Hu Li Shen Samuel Albanie Gang Sun and Enhua Wu. 2018. Squeeze-and-excitation networks. In CVPR.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2015.7333942"},{"key":"e_1_3_2_1_16_1","volume-title":"The Hungarian method for the assignment problem. Naval Research Logistics Quarterly","author":"Kuhn Harold W.","year":"1955","unstructured":"Harold W. Kuhn . 1955. The Hungarian method for the assignment problem. Naval Research Logistics Quarterly ( 1955 ). Harold W. Kuhn. 1955. The Hungarian method for the assignment problem. Naval Research Logistics Quarterly (1955)."},{"key":"e_1_3_2_1_17_1","first-page":"3676","article-title":"Textboxes++: A single-shot oriented scene text detector","volume":"27","author":"Liao Minghui","year":"2018","unstructured":"Minghui Liao , Baoguang Shi , and Xiang Bai . 2018 . Textboxes++: A single-shot oriented scene text detector . IEEE TIP 27 , 8 (2018), 3676 -- 3690 . Minghui Liao, Baoguang Shi, and Xiang Bai. 2018. Textboxes++: A single-shot oriented scene text detector. IEEE TIP 27, 8 (2018), 3676--3690.","journal-title":"IEEE TIP"},{"key":"e_1_3_2_1_18_1","volume-title":"Textboxes: A fast text detector with a single deep neural network. In AAAI.","author":"Liao Minghui","year":"2017","unstructured":"Minghui Liao , Baoguang Shi , Xiang Bai , Xinggang Wang , and Wenyu Liu . 2017 . Textboxes: A fast text detector with a single deep neural network. In AAAI. Minghui Liao, Baoguang Shi, Xiang Bai, Xinggang Wang, and Wenyu Liu. 2017. Textboxes: A fast text detector with a single deep neural network. In AAAI."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Minghui Liao Zhaoyi Wan Cong Yao Kai Chen and Xiang Bai. 2020. Real-Time Scene Text Detection with Differentiable Binarization.. In AAAI.  Minghui Liao Zhaoyi Wan Cong Yao Kai Chen and Xiang Bai. 2020. Real-Time Scene Text Detection with Differentiable Binarization.. In AAAI.","DOI":"10.1609\/aaai.v34i07.6812"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"crossref","unstructured":"Minghui Liao Zhen Zhu Baoguang Shi Gui-song Xia and Xiang Bai. 2018. Rotation-sensitive regression for oriented scene text detection. In CVPR.  Minghui Liao Zhen Zhu Baoguang Shi Gui-song Xia and Xiang Bai. 2018. Rotation-sensitive regression for oriented scene text detection. In CVPR.","DOI":"10.1109\/CVPR.2018.00619"},{"key":"e_1_3_2_1_21_1","unstructured":"Tsung-Yi Lin Piotr Doll\u00e1r Ross Girshick Kaiming He Bharath Hariharan and Serge Belongie. 2017. Feature pyramid networks for object detection. In CVPR.  Tsung-Yi Lin Piotr Doll\u00e1r Ross Girshick Kaiming He Bharath Hariharan and Serge Belongie. 2017. Feature pyramid networks for object detection. In CVPR."},{"key":"e_1_3_2_1_22_1","volume-title":"Ssd: Single shot multibox detector. In ECCV.","author":"Liu Wei","year":"2016","unstructured":"Wei Liu , Dragomir Anguelov , Dumitru Erhan , Christian Szegedy , Scott Reed , Cheng-Yang Fu , and Alexander C Berg . 2016 . Ssd: Single shot multibox detector. In ECCV. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. 2016. Ssd: Single shot multibox detector. In ECCV."},{"key":"e_1_3_2_1_23_1","volume-title":"Abcnet: Real-time scene text spotting with adaptive bezier-curve network. In CVPR.","author":"Liu Yuliang","year":"2020","unstructured":"Yuliang Liu , Hao Chen , Chunhua Shen , Tong He , Lianwen Jin , and Liangwei Wang . 2020 . Abcnet: Real-time scene text spotting with adaptive bezier-curve network. In CVPR. Yuliang Liu, Hao Chen, Chunhua Shen, Tong He, Lianwen Jin, and Liangwei Wang. 2020. Abcnet: Real-time scene text spotting with adaptive bezier-curve network. In CVPR."},{"key":"e_1_3_2_1_24_1","unstructured":"Yuliang Liu and Lianwen Jin. 2017. Deep matching prior network: Toward tighter multi-oriented text detection. In CVPR. 1962--1969.  Yuliang Liu and Lianwen Jin. 2017. Deep matching prior network: Toward tighter multi-oriented text detection. In CVPR. 1962--1969."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"crossref","unstructured":"Ze Liu Yutong Lin Yue Cao Han Hu Yixuan Wei Zheng Zhang Stephen Lin and Baining Guo. 2021. Swin transformer: Hierarchical vision transformer using shifted windows. In ICCV.  Ze Liu Yutong Lin Yue Cao Han Hu Yixuan Wei Zheng Zhang Stephen Lin and Baining Guo. 2021. Swin transformer: Hierarchical vision transformer using shifted windows. In ICCV.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"e_1_3_2_1_26_1","volume-title":"Textsnake: A flexible representation for detecting text of arbitrary shapes. In ECCV.","author":"Long Shangbang","year":"2018","unstructured":"Shangbang Long , Jiaqiang Ruan , Wenjie Zhang , Xin He , Wenhao Wu , and Cong Yao . 2018 . Textsnake: A flexible representation for detecting text of arbitrary shapes. In ECCV. Shangbang Long, Jiaqiang Ruan, Wenjie Zhang, Xin He, Wenhao Wu, and Cong Yao. 2018. Textsnake: A flexible representation for detecting text of arbitrary shapes. In ECCV."},{"key":"e_1_3_2_1_27_1","unstructured":"Ilya Loshchilov and Frank Hutter. 2019. Decoupled weight decay regularization. In ICLR.  Ilya Loshchilov and Frank Hutter. 2019. Decoupled weight decay regularization. In ICLR."},{"key":"e_1_3_2_1_28_1","unstructured":"Pengyuan Lyu Cong Yao Wenhao Wu Shuicheng Yan and Xiang Bai. 2018. Multi-oriented scene text detection via corner localization and region segmentation. In CVPR.  Pengyuan Lyu Cong Yao Wenhao Wu Shuicheng Yan and Xiang Bai. 2018. Multi-oriented scene text detection via corner localization and region segmentation. In CVPR."},{"key":"e_1_3_2_1_29_1","volume-title":"Robust wide-baseline stereo from maximally stable extremal regions. Image and Vision Computing","author":"Matas Jiri","year":"2004","unstructured":"Jiri Matas , Ondrej Chum , Martin Urban , and Tomas Pajdla . 2004. Robust wide-baseline stereo from maximally stable extremal regions. Image and Vision Computing ( 2004 ). Jiri Matas, Ondrej Chum, Martin Urban, and Tomas Pajdla. 2004. Robust wide-baseline stereo from maximally stable extremal regions. Image and Vision Computing (2004)."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Depu Meng Xiaokang Chen Zejia Fan Gang Zeng Houqiang Li Yuhui Yuan Lei Sun and Jingdong Wang. 2021. Conditional detr for fast training convergence. In ICCV.  Depu Meng Xiaokang Chen Zejia Fan Gang Zeng Houqiang Li Yuhui Yuan Lei Sun and Jingdong Wang. 2021. Conditional detr for fast training convergence. In ICCV.","DOI":"10.1109\/ICCV48922.2021.00363"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"crossref","unstructured":"Nibal Nayef Fei Yin Imen Bizid Hyunsoo Choi Yuan Feng Dimosthenis Karatzas Zhenbo Luo Umapada Pal Christophe Rigaud Joseph Chazalon etal 2017. Icdar2017 robust reading challenge on multi-lingual scene text detection and script identification-rrc-mlt. In ICDAR.  Nibal Nayef Fei Yin Imen Bizid Hyunsoo Choi Yuan Feng Dimosthenis Karatzas Zhenbo Luo Umapada Pal Christophe Rigaud Joseph Chazalon et al. 2017. Icdar2017 robust reading challenge on multi-lingual scene text detection and script identification-rrc-mlt. In ICDAR.","DOI":"10.1109\/ICDAR.2017.237"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Alexander Neubeck and Luc Van Gool. 2006. Efficient non-maximum suppression. In ICPR.  Alexander Neubeck and Luc Van Gool. 2006. Efficient non-maximum suppression. In ICPR.","DOI":"10.1109\/ICPR.2006.479"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW53098.2021.00353"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2577031"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"crossref","unstructured":"Hamid Rezatofighi Nathan Tsoi JunYoung Gwak Amir Sadeghian Ian Reid and Silvio Savarese. 2019. Generalized intersection over union: A metric and a loss for bounding box regression. In CVPR.  Hamid Rezatofighi Nathan Tsoi JunYoung Gwak Amir Sadeghian Ian Reid and Silvio Savarese. 2019. Generalized intersection over union: A metric and a loss for bounding box regression. In CVPR.","DOI":"10.1109\/CVPR.2019.00075"},{"key":"e_1_3_2_1_36_1","unstructured":"Baoguang Shi Xiang Bai and Serge Belongie. 2017. Detecting oriented text in natural images by linking segments. In CVPR.  Baoguang Shi Xiang Bai and Serge Belongie. 2017. Detecting oriented text in natural images by linking segments. In CVPR."},{"key":"e_1_3_2_1_37_1","unstructured":"Zhiqing Sun Shengcao Cao Yiming Yang and Kris M Kitani. 2021. Rethinking transformer-based set prediction for object detection. In ICCV.  Zhiqing Sun Shengcao Cao Yiming Yang and Kris M Kitani. 2021. Rethinking transformer-based set prediction for object detection. In ICCV."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Satoshi Suzuki et al. 1985. Topological structural analysis of digitized binary images by border following. Computer vision graphics and image processing 30 1 (1985) 32--46.  Satoshi Suzuki et al. 1985. Topological structural analysis of digitized binary images by border following. Computer vision graphics and image processing 30 1 (1985) 32--46.","DOI":"10.1016\/0734-189X(85)90016-7"},{"key":"e_1_3_2_1_39_1","first-page":"106954","article-title":"Seglink++: Detecting dense and arbitrary-shaped scene text by instance-aware component grouping","volume":"96","author":"Tang Jun","year":"2019","unstructured":"Jun Tang , Zhibo Yang , Yongpan Wang , Qi Zheng , Yongchao Xu , and Xiang Bai . 2019 . Seglink++: Detecting dense and arbitrary-shaped scene text by instance-aware component grouping . PR 96 (2019), 106954 . Jun Tang, Zhibo Yang, Yongpan Wang, Qi Zheng, Yongchao Xu, and Xiang Bai. 2019. Seglink++: Detecting dense and arbitrary-shaped scene text by instance-aware component grouping. PR 96 (2019), 106954.","journal-title":"PR"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"crossref","unstructured":"Zhuotao Tian Michelle Shu Pengyuan Lyu Ruiyu Li Chao Zhou Xiaoyong Shen and Jiaya Jia. 2019. Learning shape-aware embedding for scene text detection. In CVPR.  Zhuotao Tian Michelle Shu Pengyuan Lyu Ruiyu Li Chao Zhou Xiaoyong Shen and Jiaya Jia. 2019. Learning shape-aware embedding for scene text detection. In CVPR.","DOI":"10.1109\/CVPR.2019.00436"},{"key":"e_1_3_2_1_41_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems.  Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"crossref","unstructured":"Wenhai Wang Enze Xie Xiang Li Wenbo Hou Tong Lu Gang Yu and Shuai Shao. 2019. Shape robust text detection with progressive scale expansion network. In CVPR.  Wenhai Wang Enze Xie Xiang Li Wenbo Hou Tong Lu Gang Yu and Shuai Shao. 2019. Shape robust text detection with progressive scale expansion network. In CVPR.","DOI":"10.1109\/CVPR.2019.00956"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"crossref","unstructured":"Wenhai Wang Enze Xie Xiaoge Song Yuhang Zang Wenjia Wang Tong Lu Gang Yu and Chunhua Shen. 2019. Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. In ICCV.  Wenhai Wang Enze Xie Xiaoge Song Yuhang Zang Wenjia Wang Tong Lu Gang Yu and Chunhua Shen. 2019. Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. In ICCV.","DOI":"10.1109\/ICCV.2019.00853"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"crossref","unstructured":"Xinjiang Wang Shilong Zhang Zhuoran Yu Litong Feng and Wayne Zhang. 2020. Scale-equalizing pyramid convolution for object detection. In CVPR.  Xinjiang Wang Shilong Zhang Zhuoran Yu Litong Feng and Wayne Zhang. 2020. Scale-equalizing pyramid convolution for object detection. In CVPR.","DOI":"10.1109\/CVPR42600.2020.01337"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"crossref","unstructured":"Yuxin Wang Hongtao Xie Zheng-Jun Zha Mengting Xing Zilong Fu and Yongdong Zhang. 2020. ContourNet: Taking a further step toward accurate arbitrary-shaped scene text detection. In CVPR.  Yuxin Wang Hongtao Xie Zheng-Jun Zha Mengting Xing Zilong Fu and Yongdong Zhang. 2020. ContourNet: Taking a further step toward accurate arbitrary-shaped scene text detection. In CVPR.","DOI":"10.1109\/CVPR42600.2020.01177"},{"key":"e_1_3_2_1_46_1","first-page":"5566","article-title":"Textfield: Learning a deep direction field for irregular scene text detection","volume":"28","author":"Xu Yongchao","year":"2019","unstructured":"Yongchao Xu , Yukang Wang , Wei Zhou , Yongpan Wang , Zhibo Yang , and Xiang Bai . 2019 . Textfield: Learning a deep direction field for irregular scene text detection . IEEE TIP 28 , 11 (2019), 5566 -- 5579 . Yongchao Xu, Yukang Wang, Wei Zhou, Yongpan Wang, Zhibo Yang, and Xiang Bai. 2019. Textfield: Learning a deep direction field for irregular scene text detection. IEEE TIP 28, 11 (2019), 5566--5579.","journal-title":"IEEE TIP"},{"key":"e_1_3_2_1_47_1","unstructured":"Chuhui Xue Shijian Lu and Fangneng Zhan. 2018. Accurate scene text detection through border semantics awareness and bootstrapping.. In CVPR.  Chuhui Xue Shijian Lu and Fangneng Zhan. 2018. Accurate scene text detection through border semantics awareness and bootstrapping.. In CVPR."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"crossref","unstructured":"Cong Yao Xiang Bai Wenyu Liu Yi Ma and Zhuowen Tu. 2012. Detecting texts of arbitrary orientations in natural images. In CVPR.  Cong Yao Xiang Bai Wenyu Liu Yi Ma and Zhuowen Tu. 2012. Detecting texts of arbitrary orientations in natural images. In CVPR.","DOI":"10.1109\/CVPR.2012.6247787"},{"key":"e_1_3_2_1_49_1","unstructured":"Changqian Yu Jingbo Wang Chao Peng Changxin Gao Gang Yu and Nong Sang. 2018. Learning a discriminative feature network for semantic segmentation. CVPR.  Changqian Yu Jingbo Wang Chao Peng Changxin Gao Gang Yu and Nong Sang. 2018. Learning a discriminative feature network for semantic segmentation. CVPR."},{"key":"e_1_3_2_1_50_1","volume-title":"Detecting curve text in the wild: New dataset and new solution. arXiv preprint arXiv:1712.02170","author":"Yuliang Liu","year":"2017","unstructured":"Liu Yuliang , Jin Lianwen , Zhang Shuaitao , and Zhang Sheng . 2017. Detecting curve text in the wild: New dataset and new solution. arXiv preprint arXiv:1712.02170 ( 2017 ). Liu Yuliang, Jin Lianwen, Zhang Shuaitao, and Zhang Sheng. 2017. Detecting curve text in the wild: New dataset and new solution. arXiv preprint arXiv:1712.02170 (2017)."},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"crossref","unstructured":"Chengquan Zhang Borong Liang Zuming Huang Mengyi En Junyu Han Errui Ding and Xinghao Ding. 2019. Look more than once: An accurate detector for text of arbitrary shapes. In CVPR.  Chengquan Zhang Borong Liang Zuming Huang Mengyi En Junyu Han Errui Ding and Xinghao Ding. 2019. Look more than once: An accurate detector for text of arbitrary shapes. In CVPR.","DOI":"10.1109\/CVPR.2019.01080"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"crossref","unstructured":"Shi-Xue Zhang Xiaobin Zhu Jie-Bo Hou Chang Liu Chun Yang Hongfa Wang and Xu-Cheng Yin. 2020. Deep relational reasoning graph network for arbitrary shape text detection. In CVPR.  Shi-Xue Zhang Xiaobin Zhu Jie-Bo Hou Chang Liu Chun Yang Hongfa Wang and Xu-Cheng Yin. 2020. Deep relational reasoning graph network for arbitrary shape text detection. In CVPR.","DOI":"10.1109\/CVPR42600.2020.00972"},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"crossref","unstructured":"Shi-Xue Zhang Xiaobin Zhu Chun Yang Hongfa Wang and Xu-Cheng Yin. 2021. Adaptive boundary proposal network for arbitrary shape text detection. In ICCV.  Shi-Xue Zhang Xiaobin Zhu Chun Yang Hongfa Wang and Xu-Cheng Yin. 2021. Adaptive boundary proposal network for arbitrary shape text detection. In ICCV.","DOI":"10.1109\/ICCV48922.2021.00134"},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"crossref","unstructured":"Zheng Zhang Chengquan Zhang Wei Shen Cong Yao Wenyu Liu and Xiang Bai. 2016. Multi-oriented text detection with fully convolutional networks. CVPR.  Zheng Zhang Chengquan Zhang Wei Shen Cong Yao Wenyu Liu and Xiang Bai. 2016. Multi-oriented text detection with fully convolutional networks. CVPR.","DOI":"10.1109\/CVPR.2016.451"},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"crossref","unstructured":"Xinyu Zhou Cong Yao He Wen Yuzhi Wang Shuchang Zhou Weiran He and Jiajun Liang. 2017. East: an efficient and accurate scene text detector. In CVPR.  Xinyu Zhou Cong Yao He Wen Yuzhi Wang Shuchang Zhou Weiran He and Jiajun Liang. 2017. East: an efficient and accurate scene text detector. In CVPR.","DOI":"10.1109\/CVPR.2017.283"},{"key":"e_1_3_2_1_56_1","unstructured":"Xizhou Zhu Weijie Su Lewei Lu Bin Li Xiaogang Wang and Jifeng Dai. 2021. Deformable detr: Deformable transformers for end-to-end object detection. In ICLR.  Xizhou Zhu Weijie Su Lewei Lu Bin Li Xiaogang Wang and Jifeng Dai. 2021. Deformable detr: Deformable transformers for end-to-end object detection. In ICLR."},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"crossref","unstructured":"Yiqin Zhu Jianyong Chen Lingyu Liang Zhanghui Kuang Lianwen Jin and Wayne Zhang. 2021. Fourier contour embedding for arbitrary-shaped text detection. In CVPR.  Yiqin Zhu Jianyong Chen Lingyu Liang Zhanghui Kuang Lianwen Jin and Wayne Zhang. 2021. Fourier contour embedding for arbitrary-shaped text detection. In CVPR.","DOI":"10.1109\/CVPR46437.2021.00314"}],"event":{"name":"MMAsia '22: ACM Multimedia Asia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Tokyo Japan","acronym":"MMAsia '22"},"container-title":["Proceedings of the 4th ACM International Conference on Multimedia in Asia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3551626.3564980","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3551626.3564980","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:26Z","timestamp":1750186826000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3551626.3564980"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,13]]},"references-count":57,"alternative-id":["10.1145\/3551626.3564980","10.1145\/3551626"],"URL":"https:\/\/doi.org\/10.1145\/3551626.3564980","relation":{},"subject":[],"published":{"date-parts":[[2022,12,13]]},"assertion":[{"value":"2022-12-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}