{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,18]],"date-time":"2026-02-18T00:02:39Z","timestamp":1771372959413,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":41,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,8,24]],"date-time":"2021-08-24T00:00:00Z","timestamp":1629763200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"the Fundamental Research Funds for the Central Universities under Grant","award":["WK3480000011"],"award-info":[{"award-number":["WK3480000011"]}]},{"name":"the National Key Research and Development Program of China","award":["2018YFB0804203"],"award-info":[{"award-number":["2018YFB0804203"]}]},{"name":"the Youth Innovation Promotion Association Chinese Academy of Sciences","award":["2017209"],"award-info":[{"award-number":["2017209"]}]},{"name":"the National Nature Science Foundation of China","award":["62022076, U1936210, 62032006"],"award-info":[{"award-number":["62022076, U1936210, 62032006"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,8,24]]},"DOI":"10.1145\/3460426.3463674","type":"proceedings-article","created":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T22:50:28Z","timestamp":1630536628000},"page":"638-644","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["Look Back Again"],"prefix":"10.1145","author":[{"given":"Zilong","family":"Fu","sequence":"first","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}]},{"given":"Hongtao","family":"Xie","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}]},{"given":"Guoqing","family":"Jin","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Communication Content Cognition &amp; People's Daily Online, Beijing, China"}]},{"given":"Junbo","family":"Guo","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Communication Content Cognition &amp; People's Daily Online, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2021,9]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Kiss: Keeping it simple for scene text recognition. arXiv preprint arXiv:1911.08400","author":"Bartz Christian","year":"2019","unstructured":"Christian Bartz , Joseph Bethge , Haojin Yang , and Christoph Meinel . 2019 . Kiss: Keeping it simple for scene text recognition. arXiv preprint arXiv:1911.08400 (2019). Christian Bartz, Joseph Bethge, Haojin Yang, and Christoph Meinel. 2019. Kiss: Keeping it simple for scene text recognition. arXiv preprint arXiv:1911.08400 (2019)."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.543"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143891"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.254"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v30i1.10465"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6735"},{"key":"e_1_3_2_1_9_1","volume-title":"Synthetic data and artificial neural networks for natural scene text recognition. arXiv preprint arXiv:1406.2227","author":"Jaderberg Max","year":"2014","unstructured":"Max Jaderberg , Karen Simonyan , Andrea Vedaldi , and Andrew Zisserman . 2014. Synthetic data and artificial neural networks for natural scene text recognition. arXiv preprint arXiv:1406.2227 ( 2014 ). Max Jaderberg, Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. 2014. Synthetic data and artificial neural networks for natural scene text recognition. arXiv preprint arXiv:1406.2227 (2014)."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0823-z"},{"key":"e_1_3_2_1_11_1","volume-title":"Spatial transformer networks. arXiv preprint arXiv:1506.02025","author":"Jaderberg Max","year":"2015","unstructured":"Max Jaderberg , Karen Simonyan , Andrew Zisserman , and Koray Kavukcuoglu . 2015. Spatial transformer networks. arXiv preprint arXiv:1506.02025 ( 2015 ). Max Jaderberg, Karen Simonyan, Andrew Zisserman, and Koray Kavukcuoglu. 2015. Spatial transformer networks. arXiv preprint arXiv:1506.02025 (2015)."},{"key":"e_1_3_2_1_12_1","volume-title":"ICDAR 2015 competition on robust reading. In 2015 13th International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1156--1160","author":"Karatzas Dimosthenis","year":"2015","unstructured":"Dimosthenis Karatzas , Lluis Gomez-Bigorda , Anguelos Nicolaou , Suman Ghosh , Andrew Bagdanov , Masakazu Iwamura , Jiri Matas , Lukas Neumann , Vijay Ramaseshan Chandrasekhar , Shijian Lu , 2015 . ICDAR 2015 competition on robust reading. In 2015 13th International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1156--1160 . Dimosthenis Karatzas, Lluis Gomez-Bigorda, Anguelos Nicolaou, Suman Ghosh, Andrew Bagdanov, Masakazu Iwamura, Jiri Matas, Lukas Neumann, Vijay Ramaseshan Chandrasekhar, Shijian Lu, et al. 2015. ICDAR 2015 competition on robust reading. In 2015 13th International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1156--1160."},{"key":"e_1_3_2_1_13_1","volume-title":"ICDAR 2013 robust reading competition. In 2013 12th International Conference on Document Analysis and Recognition. IEEE, 1484--1493","author":"Karatzas Dimosthenis","year":"2013","unstructured":"Dimosthenis Karatzas , Faisal Shafait , Seiichi Uchida , Masakazu Iwamura , Lluis Gomez i Bigorda , Sergi Robles Mestre , Joan Mas , David Fernandez Mota , Jon Almazan Almazan , and Lluis Pere De Las Heras . 2013 . ICDAR 2013 robust reading competition. In 2013 12th International Conference on Document Analysis and Recognition. IEEE, 1484--1493 . Dimosthenis Karatzas, Faisal Shafait, Seiichi Uchida, Masakazu Iwamura, Lluis Gomez i Bigorda, Sergi Robles Mestre, Joan Mas, David Fernandez Mota, Jon Almazan Almazan, and Lluis Pere De Las Heras. 2013. ICDAR 2013 robust reading competition. In 2013 12th International Conference on Document Analysis and Recognition. IEEE, 1484--1493."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33018610"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.472"},{"key":"e_1_3_2_1_16_1","volume-title":"Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes","author":"Liao Minghui","year":"2019","unstructured":"Minghui Liao , Pengyuan Lyu , Minghang He , Cong Yao , Wenhao Wu , and Xiang Bai . 2019. Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes . IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2019 ). Minghui Liao, Pengyuan Lyu, Minghang He, Cong Yao, Wenhao Wu, and Xiang Bai. 2019. Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence (2019)."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33018714"},{"key":"e_1_3_2_1_18_1","first-page":"7","article-title":"STAR-Net: a spatial attention residue network for scene text recognition","volume":"2","author":"Liu Wei","year":"2016","unstructured":"Wei Liu , Chaofeng Chen , Kwan-Yee K Wong , Zhizhong Su , and Junyu Han . 2016 . STAR-Net: a spatial attention residue network for scene text recognition .. In BMVC , Vol. 2. 7 . Wei Liu, Chaofeng Chen, Kwan-Yee K Wong, Zhizhong Su, and Junyu Han. 2016. STAR-Net: a spatial attention residue network for scene text recognition.. In BMVC, Vol. 2. 7.","journal-title":"BMVC"},{"key":"e_1_3_2_1_19_1","volume-title":"2d attentional irregular scene text recognizer. arXiv preprint arXiv:1906.05708","author":"Lyu Pengyuan","year":"2019","unstructured":"Pengyuan Lyu , Zhicheng Yang , Xinhang Leng , Xiaojun Wu , Ruiyu Li , and Xiaoyong Shen . 2019. 2d attentional irregular scene text recognizer. arXiv preprint arXiv:1906.05708 ( 2019 ). Pengyuan Lyu, Zhicheng Yang, Xinhang Leng, Xiaojun Wu, Ruiyu Li, and Xiaoyong Shen. 2019. 2d attentional irregular scene text recognizer. arXiv preprint arXiv:1906.05708 (2019)."},{"key":"e_1_3_2_1_20_1","volume-title":"Pointer sentinel mixture models. arXiv preprint arXiv:1609.07843","author":"Merity Stephen","year":"2016","unstructured":"Stephen Merity , Caiming Xiong , James Bradbury , and Richard Socher . 2016. Pointer sentinel mixture models. arXiv preprint arXiv:1609.07843 ( 2016 ). Stephen Merity, Caiming Xiong, James Bradbury, and Richard Socher. 2016. Pointer sentinel mixture models. arXiv preprint arXiv:1609.07843 (2016)."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.5244\/C.26.127"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.76"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01354"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2014.07.008"},{"key":"e_1_3_2_1_25_1","volume-title":"An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition","author":"Shi Baoguang","year":"2016","unstructured":"Baoguang Shi , Xiang Bai , and Cong Yao . 2016. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition . IEEE transactions on pattern analysis and machine intelligence 39, 11 ( 2016 ), 2298--2304. Baoguang Shi, Xiang Bai, and Cong Yao. 2016. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE transactions on pattern analysis and machine intelligence 39, 11 (2016), 2298--2304."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.452"},{"key":"e_1_3_2_1_27_1","volume-title":"Attention is all you need. arXiv preprint arXiv:1706.03762","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N Gomez , Lukasz Kaiser , and Illia Polosukhin . 2017. Attention is all you need. arXiv preprint arXiv:1706.03762 ( 2017 ). Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. arXiv preprint arXiv:1706.03762 (2017)."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6891"},{"key":"e_1_3_2_1_29_1","volume-title":"2d-ctc for scene text recognition. arXiv preprint arXiv:1907.09705","author":"Wan Zhaoyi","year":"2019","unstructured":"Zhaoyi Wan , Fengming Xie , Yibo Liu , Xiang Bai , and Cong Yao . 2019. 2d-ctc for scene text recognition. arXiv preprint arXiv:1907.09705 ( 2019 ). Zhaoyi Wan, Fengming Xie, Yibo Liu, Xiang Bai, and Cong Yao. 2019. 2d-ctc for scene text recognition. arXiv preprint arXiv:1907.09705 (2019)."},{"key":"e_1_3_2_1_30_1","volume-title":"2011 International Conference on Computer Vision. IEEE, 1457-- 1464","author":"Wang Kai","year":"2011","unstructured":"Kai Wang , Boris Babenko , and Serge Belongie . 2011 . End-to-end scene text recognition . In 2011 International Conference on Computer Vision. IEEE, 1457-- 1464 . Kai Wang, Boris Babenko, and Serge Belongie. 2011. End-to-end scene text recognition. In 2011 International Conference on Computer Vision. IEEE, 1457-- 1464."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6903"},{"key":"e_1_3_2_1_32_1","volume-title":"Deep attention-based spatially recursive networks for fine-grained visual recognition","author":"Li Xue","year":"2018","unstructured":"LinWu, YangWang, Xue Li , and Junbin Gao . 2018. Deep attention-based spatially recursive networks for fine-grained visual recognition . IEEE transactions on cybernetics 49, 5 ( 2018 ), 1791--1802. LinWu, YangWang, Xue Li, and Junbin Gao. 2018. Deep attention-based spatially recursive networks for fine-grained visual recognition. IEEE transactions on cybernetics 49, 5 (2018), 1791--1802."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2940684"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3343031.3350929"},{"key":"e_1_3_2_1_35_1","unstructured":"Rui Yan and Yaohong Huang. [n.d.]. PlugNet: Degradation Aware Scene Text Recognition Supervised by a Pluggable Super-Resolution Unit. ([n. d.]).  Rui Yan and Yaohong Huang. [n.d.]. PlugNet: Degradation Aware Scene Text Recognition Supervised by a Pluggable Super-Resolution Unit. ([n. d.])."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00924"},{"key":"e_1_3_2_1_37_1","first-page":"3","article-title":"Learning to Read Irregular Text with Attention Mechanisms","volume":"1","author":"Yang Xiao","year":"2017","unstructured":"Xiao Yang , Dafang He , Zihan Zhou , Daniel Kifer , and C Lee Giles . 2017 . Learning to Read Irregular Text with Attention Mechanisms .. In IJCAI , Vol. 1. 3 . Xiao Yang, Dafang He, Zihan Zhou, Daniel Kifer, and C Lee Giles. 2017. Learning to Read Irregular Text with Attention Mechanisms.. In IJCAI, Vol. 1. 3.","journal-title":"IJCAI"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01213"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2020.107791"},{"key":"e_1_3_2_1_40_1","volume-title":"RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition. In European Conference on Computer Vision. Springer, 135--151","author":"Yue Xiaoyu","year":"2020","unstructured":"Xiaoyu Yue , Zhanghui Kuang , Chenhao Lin , Hongbin Sun , and Wayne Zhang . 2020 . RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition. In European Conference on Computer Vision. Springer, 135--151 . Xiaoyu Yue, Zhanghui Kuang, Chenhao Lin, Hongbin Sun, and Wayne Zhang. 2020. RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition. In European Conference on Computer Vision. Springer, 135--151."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00216"}],"event":{"name":"ICMR '21: International Conference on Multimedia Retrieval","location":"Taipei Taiwan","acronym":"ICMR '21","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 2021 International Conference on Multimedia Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3460426.3463674","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3460426.3463674","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:04Z","timestamp":1750191424000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3460426.3463674"}},"subtitle":["Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition"],"short-title":[],"issued":{"date-parts":[[2021,8,24]]},"references-count":41,"alternative-id":["10.1145\/3460426.3463674","10.1145\/3460426"],"URL":"https:\/\/doi.org\/10.1145\/3460426.3463674","relation":{},"subject":[],"published":{"date-parts":[[2021,8,24]]},"assertion":[{"value":"2021-09-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}