{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T08:47:06Z","timestamp":1774860426927,"version":"3.50.1"},"reference-count":29,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T00:00:00Z","timestamp":1774828800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"},{"start":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T00:00:00Z","timestamp":1774828800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int. J. Mach. Learn. & Cyber."],"published-print":{"date-parts":[[2026,5]]},"DOI":"10.1007\/s13042-026-03043-2","type":"journal-article","created":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T07:53:08Z","timestamp":1774857188000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Rotation-invariant scene text extraction using transformer networks"],"prefix":"10.1007","volume":"17","author":[{"given":"Vanitha Sivagami","family":"Sivasankaravel","sequence":"first","affiliation":[]},{"given":"Sreenivasan","family":"S R","sequence":"additional","affiliation":[]},{"given":"Manoj","family":"R","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2026,3,30]]},"reference":[{"key":"3043_CR1","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser \u0141 (2017) Attention is all you need. 
In: 31st Conference on neural information processing systems (NIPS 2017), Long Beach, CA, USA"},{"key":"3043_CR2","unstructured":"Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S, Uszkoreit J, Houlsby N (2021) An image is worth 16x16 words: Transformers for image recognition at scale. In: ICLR 2021"},{"key":"3043_CR3","doi-asserted-by":"publisher","first-page":"2127","DOI":"10.1007\/s13042-022-01750-0","volume":"14","author":"X Zhang","year":"2023","unstructured":"Zhang X, Zhang Y (2023) Conv-pvt, a fusion architecture of convolution and pyramid vision transformer. Int J Mach Learn Cybern 14:2127\u20132136. https:\/\/doi.org\/10.1007\/s13042-022-01750-0","journal-title":"Int J Mach Learn Cybern"},{"issue":"3552","key":"3043_CR4","first-page":"3566","volume":"32","author":"P Shivakumara","year":"2023","unstructured":"Shivakumara P, Banerjee A, Pal U, Nandanwar L, Liu C-L (2023) A new language-independent deep CNN for scene text detection and style transfer in social media images. IEEE Trans Image Process 32 :(3552-3566)","journal-title":"IEEE Trans Image Process"},{"key":"3043_CR5","doi-asserted-by":"publisher","first-page":"1341","DOI":"10.1109\/TIP.2023.3237002","volume":"32","author":"J Ma","year":"2023","unstructured":"Ma J, Guo S, Zhang L (2023) Text prior guided scene text image. IEEE Trans Image Process 32:1341\u2013 1353","journal-title":"IEEE Trans Image Process"},{"key":"3043_CR6","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2024.123771","volume":"249","author":"D Zhong","year":"2024","unstructured":"Zhong D, Zhan H, Lyu S, Liu C, Yin B, Shivakumara P, Pal U, Lu Y (2024) Ndorder: exploring a novel decoding order for scene text recognition. 
Expert Syst Appl 249:123771","journal-title":"Expert Syst Appl"},{"key":"3043_CR7","first-page":"8226","volume":"133","author":"A Banerjee","year":"2024","unstructured":"Banerjee A, Palaiahnakote S, Pal U, Antonacopoulos A, Lu T, Canet JL (2024) Tts: Hilbert transform-based generative adversarial network for tattoo and scene text spotting. IEEE Trans Multimed 26:8226\u20138241","journal-title":"Eng Appl Artif Intell"},{"issue":"3","key":"3043_CR8","doi-asserted-by":"publisher","first-page":"1638","DOI":"10.1109\/TPAMI.2020.3018491","volume":"44","author":"X Rong","year":"2022","unstructured":"Rong X, Yi C, Tian Y (2022) Unambiguous text localization, retrieval, and recognition for cluttered scenes. IEEE Trans Pattern Anal Mach Intell 44(3):1638\u20131652","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"3043_CR9","doi-asserted-by":"crossref","unstructured":"Murthy JS, Shekar AC, Bhattacharya D, Namratha R, Sripriya D (2021) A novel framework for multimodal twitter sentiment analysis using feature learning. In: Singh M, Tyagi V, Gupta PK, Flusser J, \u00d6ren T, Sonawane VR (eds) Advances in computing and data sciences. Springer, Cham, pp 252\u2013261","DOI":"10.1007\/978-3-030-88244-0_24"},{"key":"3043_CR10","doi-asserted-by":"publisher","first-page":"2404","DOI":"10.1109\/TMM.2022.3146779","volume":"25","author":"L Wu","year":"2023","unstructured":"Wu L, Xu Y, Hou J, Chen CLP, Liu C-L (2023) A two-level rectification attention network for scene text recognition. IEEE Trans Multimed 25:2404\u20132414","journal-title":"IEEE Trans Multimed"},{"issue":"11","key":"3043_CR11","doi-asserted-by":"publisher","first-page":"8048","DOI":"10.1109\/TITS.2021.3075225","volume":"44","author":"Y Liu","year":"2022","unstructured":"Liu Y, Shen C, Jin L, He T, Chen P, Liu C, Chen H (2022) Abcnet v2: Adaptive Bezier-curve network for real-time end-to-end text spotting. 
IEEE Trans Pattern Anal Mach Intell 44(11):8048\u201364","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"2","key":"3043_CR12","doi-asserted-by":"publisher","first-page":"758","DOI":"10.1109\/TCSVT.2021.3068133","volume":"32","author":"M Cao","year":"2022","unstructured":"Cao M, Zhang C, Yang D, Zou Y (2022) All you need is a second look: towards arbitrary-shaped text detection. IEEE Trans Circuits Syst Video Technol 32(2):758\u201367","journal-title":"IEEE Trans Circuits Syst Video Technol"},{"key":"3043_CR13","doi-asserted-by":"publisher","first-page":"4142","DOI":"10.1109\/TIP.2023.3294822","volume":"32","author":"H Bi","year":"2023","unstructured":"Bi H, Xu C, Shi C, Liu G, Zhang H, Li Y, Dong J (2023) Hgr-net: hierarchical graph reasoning network for arbitrary shape scene text detection. IEEE Trans Image Process 32:4142\u20134155","journal-title":"IEEE Trans Image Process"},{"key":"3043_CR14","doi-asserted-by":"publisher","first-page":"2864","DOI":"10.1109\/TIP.2022.3141844","volume":"31","author":"C Yang","year":"2022","unstructured":"Yang C, Chen M, Xiong Z, Yuan Y, Wang Q (2022) Cm-net: concentric mask based arbitrary-shaped text detection. IEEE Trans Image Process 31:2864\u20132877","journal-title":"IEEE Trans Image Process"},{"issue":"9","key":"3043_CR15","doi-asserted-by":"crossref","first-page":"5349","DOI":"10.1109\/TPAMI.2021.3072422","volume":"44","author":"W Wang","year":"2022","unstructured":"Wang W, Xie E, Li X, Liu X, Liang D, Yang Z, Lu T, Shen C (2022) Pan++: Towards efficient and accurate end-to-end spotting of arbitrarily-shaped text. IEEE Trans Pattern Anal Mach Intell 44(9):5349\u20135367","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"3043_CR16","doi-asserted-by":"publisher","first-page":"4718","DOI":"10.1109\/TMM.2022.3181448","volume":"25","author":"Q Wang","year":"2023","unstructured":"Wang Q, Fu B, Li M, He J, Peng X, Qiao Y (2023) Region-aware arbitrary-shaped text detection with progressive fusion. 
IEEE Trans Multimed 25:4718\u20134729","journal-title":"IEEE Trans Multimed"},{"key":"3043_CR17","doi-asserted-by":"publisher","DOI":"10.1007\/s13042-024-02448-1","author":"M Moitra","year":"2024","unstructured":"Moitra M, Saha SK (2024) A review on handwritten text segmentation in Indian languages. Int J Mach Learn Cybern. https:\/\/doi.org\/10.1007\/s13042-024-02448-1","journal-title":"Int J Mach Learn Cybern"},{"issue":"1","key":"3043_CR18","doi-asserted-by":"publisher","first-page":"103","DOI":"10.37385\/jaets.v6i1.5958","volume":"6","author":"P Golda Jeyasheeli","year":"2024","unstructured":"Golda Jeyasheeli P, Athinarayanan B, Manish T, Mohamad Umar M (2024) Scene text detection and recognition using maximally stable extremal region. J Applied Eng Technol Sci (JAETS) 6(1):103\u2013114. https:\/\/doi.org\/10.37385\/jaets.v6i1.5958","journal-title":"J Applied Eng Technol Sci (JAETS)"},{"key":"3043_CR19","doi-asserted-by":"publisher","first-page":"1272","DOI":"10.1109\/LSP.2022.3175667","volume":"29","author":"Y Zhou","year":"2022","unstructured":"Zhou Y, Xie H, Fang S, Zhang Y (2022) Semi-supervised text detection with accurate pseudo-labels. IEEE Signal Process Lett 29:1272\u201376","journal-title":"IEEE Signal Process Lett"},{"issue":"10","key":"3043_CR20","doi-asserted-by":"publisher","first-page":"7266","DOI":"10.1109\/TPAMI.2021.3095916","volume":"44","author":"P Wang","year":"2022","unstructured":"Wang P, Li H, Shen C (2022) Towards end-to-end text spotting in natural scenes. IEEE Trans Pattern Anal Mach Intell 44(10):7266\u20137281","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"9","key":"3043_CR21","doi-asserted-by":"publisher","first-page":"6040","DOI":"10.1109\/TPAMI.2024.3379828","volume":"46","author":"W Yu","year":"2024","unstructured":"Yu W, Liu Y, Zhu X, Cao H, Sun X, Bai X (2024) Turning a clip model into a scene text spotter. 
IEEE Trans Pattern Anal Mach Intell 46(9):6040\u20136054","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"3043_CR22","doi-asserted-by":"publisher","first-page":"825","DOI":"10.1109\/TIP.2024.3352399","volume":"33","author":"S-X Zhang","year":"2024","unstructured":"Zhang S-X, Yang C, Zhu X, Zhou H, Wang H, Yin X-C (2024) Inverse-like antagonistic scene text spotting via reading-order estimation and dynamic sampling. IEEE Trans Image Process 33:825\u2013839","journal-title":"IEEE Trans Image Process"},{"issue":"1","key":"3043_CR23","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/1911\/1\/012019","volume":"1911","author":"TG Nisia","year":"2021","unstructured":"Nisia TG, Rajesh S (2021) Extraction of high-level and low-level feature for classification of image using Ridgelet and CNN based image classification. J Phys: Conf Ser 1911(1):012019. https:\/\/doi.org\/10.1088\/1742-6596\/1911\/1\/012019","journal-title":"J Phys: Conf Ser"},{"key":"3043_CR24","doi-asserted-by":"crossref","unstructured":"Graves A, Fern\u00e1ndez S, Gomez F, Schmidhuber J (2006) Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks. In: Proceedings of the 23rd international conference on machine learning, Pittsburgh, PA","DOI":"10.1145\/1143844.1143891"},{"key":"3043_CR25","unstructured":"Zeng Y-X, Hsieh J-W, Li X, Chang M-C (2023) MixNet: Toward accurate detection of challenging scene text in the wild. arXiv:2308.12817"},{"key":"3043_CR26","doi-asserted-by":"crossref","unstructured":"Bu Q, Park S, Khang M, Cheng Y (2023) SRFormer: text detection transformer with incorporated segmentation and regression. arXiv:2308.10531","DOI":"10.1609\/aaai.v38i2.27844"},{"key":"3043_CR27","doi-asserted-by":"crossref","unstructured":"Ye M, Zhang J, Zhao S, Liu J, Du B, Tao D (2022) DPText-DETR: towards better scene text detection with dynamic points in transformer. 
arXiv:2207.04491v2","DOI":"10.1609\/aaai.v37i3.25430"},{"key":"3043_CR28","doi-asserted-by":"crossref","unstructured":"Ye J, Chen Z, Liu J, Du B (2020) Textfusenet: Scene text detection with richer fused features. In: Proceedings of the twenty-ninth international joint conference on artificial intelligence (IJCAI-20)","DOI":"10.24963\/ijcai.2020\/72"},{"key":"3043_CR29","doi-asserted-by":"publisher","DOI":"10.3390\/info14070369","author":"W Yu","year":"2023","unstructured":"Yu W, Ibrayim M, Hamdulla A (2023) Scene text recognition based on improved CRNN. Information. https:\/\/doi.org\/10.3390\/info14070369","journal-title":"Information"}],"container-title":["International Journal of Machine Learning and Cybernetics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13042-026-03043-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s13042-026-03043-2","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13042-026-03043-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T07:53:11Z","timestamp":1774857191000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s13042-026-03043-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,30]]},"references-count":29,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2026,5]]}},"alternative-id":["3043"],"URL":"https:\/\/doi.org\/10.1007\/s13042-026-03043-2","relation":{},"ISSN":["1868-8071","1868-808X"],"issn-type":[{"value":"1868-8071","type":"print"},{"value":"1868-808X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3,30]]},"assertion":[{"value":"22 April 
2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 February 2026","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 March 2026","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"235"}}