{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T13:19:23Z","timestamp":1753881563152,"version":"3.41.2"},"reference-count":54,"publisher":"World Scientific Pub Co Pte Ltd","issue":"01","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62371144"],"award-info":[{"award-number":["62371144"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61966003"],"award-info":[{"award-number":["61966003"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004607","name":"Natural Science Foundation of Guangxi Province","doi-asserted-by":"publisher","award":["2020GXNSFAA159171"],"award-info":[{"award-number":["2020GXNSFAA159171"]}],"id":[{"id":"10.13039\/501100004607","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2024,1]]},"abstract":"<jats:p> The task of scene text recognition involves processing information from two modalities: images and text, thereby requiring models to have the ability to extract features from images and model sequences simultaneously. Although linguistic knowledge greatly aids scene text recognition tasks, the extensive use of language models in sequence modeling and model prediction stages in recent years has made model architectures increasingly complex and inefficient. In this paper, we propose LCSTR, a pure convolutional visual model that can complete text recognition without the need for attention mechanisms or language models. 
This approach applies large kernels to text recognition tasks for the first time, extracting word-level text information through large text-aware blocks, capturing long-range dependencies between characters, and using small text-aware blocks to obtain local features within characters. Experiments show that this model strikes a good trade-off between accuracy and speed, achieving notable results on seven public benchmarks, validating the generalizability and effectiveness of this method. Furthermore, owing to the absence of a language module, this model demonstrates remarkable accuracy even in limited sample scenarios, and the lightweight and low computational overhead features make it suitable for engineering applications. <\/jats:p>","DOI":"10.1142\/s021800142353004x","type":"journal-article","created":{"date-parts":[[2023,11,23]],"date-time":"2023-11-23T09:09:54Z","timestamp":1700730594000},"source":"Crossref","is-referenced-by-count":1,"title":["LCSTR: Scene Text Recognition with Large Convolutional Kernels"],"prefix":"10.1142","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-3213-8167","authenticated-orcid":false,"given":"Jiale","family":"Wang","sequence":"first","affiliation":[{"name":"Guangxi Key Laboratory of Multimedia Communications and Network Technology, School of Computer, Electronics and Information, Guangxi University, Nanning 530004, P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3489-1887","authenticated-orcid":false,"given":"Lina","family":"Yang","sequence":"additional","affiliation":[{"name":"Guangxi Key Laboratory of Multimedia Communications and Network Technology, School of Computer, Electronics and Information, Guangxi University, Nanning 530004, 
P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-4449-377X","authenticated-orcid":false,"given":"Jing","family":"Wang","sequence":"additional","affiliation":[{"name":"Guangxi Key Laboratory of Multimedia Communications and Network Technology, School of Computer, Electronics and Information, Guangxi University, Nanning 530004, P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8769-9736","authenticated-orcid":false,"given":"Haoyan","family":"Yang","sequence":"additional","affiliation":[{"name":"Guangxi Key Laboratory of Multimedia Communications and Network Technology, School of Computer, Electronics and Information, Guangxi University, Nanning 530004, P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8311-1694","authenticated-orcid":false,"given":"Lin","family":"Bai","sequence":"additional","affiliation":[{"name":"Guangxi Key Laboratory of Multimedia Communications and Network Technology, School of Computer, Electronics and Information, Guangxi University, Nanning 530004, P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9336-3155","authenticated-orcid":false,"given":"Patrick Shen-Pei","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Computer and Information Science, Northeastern University, Boston, MA 02115, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-1921-1651","authenticated-orcid":false,"given":"Xichun","family":"Li","sequence":"additional","affiliation":[{"name":"Guangxi Normal University for Nationalities, Chongzuo 532200, 
P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5908-5518","authenticated-orcid":false,"given":"Huiwu","family":"Luo","sequence":"additional","affiliation":[{"name":"Changsha Xingshen Intelligent Technology Co., Ltd, Changsha 410100, P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-2848-9900","authenticated-orcid":false,"given":"Huafu","family":"Xu","sequence":"additional","affiliation":[{"name":"Guangxi Key Laboratory of Digital Infrastructure, Guangxi Information Center, Nanning 530200, P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2024,2,9]]},"reference":[{"key":"S021800142353004XBIB001","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW54120.2021.00181"},{"key":"S021800142353004XBIB002","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-86549-8_21"},{"key":"S021800142353004XBIB003","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00481"},{"key":"S021800142353004XBIB004","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3219861"},{"key":"S021800142353004XBIB005","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2020.3045602"},{"key":"S021800142353004XBIB006","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01166"},{"key":"S021800142353004XBIB008","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00702"},{"key":"S021800142353004XBIB009","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3051485"},{"key":"S021800142353004XBIB010","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-25069-9_24"},{"key":"S021800142353004XBIB011","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143891"},{"key":"S021800142353004XBIB012","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.254"},{"key":"S021800142353004XBIB013","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2022.109512"},{"key":"S0218001423
53004XBIB014","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v30i1.10465"},{"key":"S021800142353004XBIB015","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"S021800142353004XBIB016","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00140"},{"key":"S021800142353004XBIB017","doi-asserted-by":"publisher","DOI":"10.1007\/s11760-022-02388-9"},{"volume-title":"Workshop on Deep Learning, NIPS","year":"2014","author":"Jaderberg M.","key":"S021800142353004XBIB018"},{"key":"S021800142353004XBIB019","first-page":"2017","volume-title":"Proceedings of the 28th International Conference on Neural Information Processing Systems, Vol. 2, NIPS\u201915","author":"Jaderberg M.","year":"2015"},{"key":"S021800142353004XBIB020","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2013.221"},{"key":"S021800142353004XBIB021","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2015.7333942"},{"key":"S021800142353004XBIB022","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3478328"},{"key":"S021800142353004XBIB023","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.245"},{"key":"S021800142353004XBIB024","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW50498.2020.00281"},{"key":"S021800142353004XBIB026","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33018610"},{"first-page":"7","volume-title":"Proc. British Machine Vision Conf.","author":"Liu W.","key":"S021800142353004XBIB027"},{"volume-title":"International Conference on Learning Representations","year":"2023","author":"Liu S.","key":"S021800142353004XBIB028"},{"first-page":"10012","volume-title":"Proc. IEEE\/CVF Int. Conf. 
Computer Vision","author":"Liu Z.","key":"S021800142353004XBIB029"},{"key":"S021800142353004XBIB030","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01167"},{"key":"S021800142353004XBIB031","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v36i2.20062"},{"key":"S021800142353004XBIB032","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-004-0134-3"},{"key":"S021800142353004XBIB033","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2019.01.020"},{"key":"S021800142353004XBIB034","doi-asserted-by":"publisher","DOI":"10.1109\/ICME55011.2023.00276"},{"key":"S021800142353004XBIB035","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01376"},{"key":"S021800142353004XBIB036","doi-asserted-by":"publisher","DOI":"10.5244\/C.26.127"},{"key":"S021800142353004XBIB037","first-page":"4353","volume-title":"Proc. IEEE Conf. Computer Vision and Pattern Recognition","author":"Peng C.","year":"2017"},{"key":"S021800142353004XBIB038","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.76"},{"key":"S021800142353004XBIB039","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2014.07.008"},{"key":"S021800142353004XBIB040","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2019.00130"},{"key":"S021800142353004XBIB041","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2646371"},{"key":"S021800142353004XBIB042","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.452"},{"key":"S021800142353004XBIB043","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2848939"},{"key":"S021800142353004XBIB044","first-page":"24261","volume-title":"Advances in Neural Information Processing Systems","author":"Tolstikhin I. O.","year":"2021"},{"volume-title":"31 Annual Conf. 
Advances in Neural Information Processing Systems","year":"2017","author":"Vaswani A.","key":"S021800142353004XBIB045"},{"key":"S021800142353004XBIB046","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126402"},{"key":"S021800142353004XBIB047","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.01385"},{"key":"S021800142353004XBIB048","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-19815-1_20"},{"volume-title":"Advances in Neural Information Processing Systems","year":"2017","author":"Wang J.","key":"S021800142353004XBIB049"},{"key":"S021800142353004XBIB050","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01393"},{"key":"S021800142353004XBIB051","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2022.3197981"},{"key":"S021800142353004XBIB052","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2022.3146779"},{"key":"S021800142353004XBIB053","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-19815-1_18"},{"key":"S021800142353004XBIB054","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2020.07.010"},{"key":"S021800142353004XBIB055","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01213"},{"key":"S021800142353004XBIB056","first-page":"1","volume-title":"IEEE Trans. Neural Netw. Learn. 
Syst.","author":"Zhang H.","year":"2023"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S021800142353004X","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,29]],"date-time":"2024-02-29T07:52:46Z","timestamp":1709193166000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S021800142353004X"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1]]},"references-count":54,"journal-issue":{"issue":"01","published-print":{"date-parts":[[2024,1]]}},"alternative-id":["10.1142\/S021800142353004X"],"URL":"https:\/\/doi.org\/10.1142\/s021800142353004x","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"type":"print","value":"0218-0014"},{"type":"electronic","value":"1793-6381"}],"subject":[],"published":{"date-parts":[[2024,1]]},"article-number":"2353004"}}