{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,30]],"date-time":"2025-12-30T15:35:55Z","timestamp":1767108955093},"reference-count":45,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2021,1,30]],"date-time":"2021-01-30T00:00:00Z","timestamp":1611964800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,1,30]],"date-time":"2021-01-30T00:00:00Z","timestamp":1611964800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Projekt DEAL"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Parallel Prog"],"published-print":{"date-parts":[[2021,4]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In recent years,<jats:inline-formula><jats:alternatives><jats:tex-math>$$\\hbox {optical character recognition (OCR)}$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mrow><mml:mtext>optical character recognition (OCR)<\/mml:mtext><\/mml:mrow><\/mml:math><\/jats:alternatives><\/jats:inline-formula>systems have been used to digitally preserve historical archives. To transcribe historical archives into a machine-readable form, first, the documents are scanned, then an<jats:inline-formula><jats:alternatives><jats:tex-math>$$\\hbox {OCR}$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mtext>OCR<\/mml:mtext><\/mml:math><\/jats:alternatives><\/jats:inline-formula>is applied. In order to digitize documents without the need to remove them from where they are archived, it is valuable to have a portable device that combines scanning and<jats:inline-formula><jats:alternatives><jats:tex-math>$$\\hbox {OCR}$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mtext>OCR<\/mml:mtext><\/mml:math><\/jats:alternatives><\/jats:inline-formula>capabilities. Nowadays, there exist many commercial and open-source document digitization techniques, which are optimized for contemporary documents. However, they fail to give sufficient text recognition accuracy for transcribing historical documents due to the severe quality degradation\u00a0of such documents. On the contrary, the anyOCR system, which is designed to mainly digitize historical documents, provides high accuracy. However, this comes at a cost of high computational complexity resulting in long runtime and high power consumption. To tackle these challenges, we propose a low power energy-efficient accelerator with real-time capabilities called iDocChip, which is a configurable hybrid hardware-software programmable<jats:inline-formula><jats:alternatives><jats:tex-math>$$\\hbox {System-on-Chip (SoC)}$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mrow><mml:mtext>System-on-Chip (SoC)<\/mml:mtext><\/mml:mrow><\/mml:math><\/jats:alternatives><\/jats:inline-formula>based on anyOCR\u00a0for digitizing historical documents. In this paper, we focus on one of the most crucial processing steps in the anyOCR system:<jats:italic>Text and Image Segmentation<\/jats:italic>, which makes use of a multi-resolution morphology-based algorithm. Moreover, an optimized<jats:inline-formula><jats:alternatives><jats:tex-math>$$\\hbox {FPGA}$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mtext>FPGA<\/mml:mtext><\/mml:math><\/jats:alternatives><\/jats:inline-formula>-based hybrid architecture of this anyOCR step along with its\u00a0optimized software implementations are presented. We demonstrate our results on multiple embedded and general-purpose platforms with respect to runtime and power consumption. The resulting hardware accelerator outperforms the existing anyOCR by<jats:bold><jats:italic>6.2<\/jats:italic><\/jats:bold><jats:inline-formula><jats:alternatives><jats:tex-math>$$\\times$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mo>\u00d7<\/mml:mo><\/mml:math><\/jats:alternatives><\/jats:inline-formula>, while achieving<jats:bold><jats:italic>207<\/jats:italic><\/jats:bold><jats:inline-formula><jats:alternatives><jats:tex-math>$$\\times$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mo>\u00d7<\/mml:mo><\/mml:math><\/jats:alternatives><\/jats:inline-formula>higher energy-efficiency and maintaining\u00a0its high accuracy.<\/jats:p>","DOI":"10.1007\/s10766-020-00690-y","type":"journal-article","created":{"date-parts":[[2021,1,30]],"date-time":"2021-01-30T20:02:41Z","timestamp":1612036961000},"page":"253-284","update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["iDocChip: A Configurable Hardware Architecture for Historical Document Image Processing"],"prefix":"10.1007","volume":"49","author":[{"given":"Menbere Kina","family":"Tekleyohannes","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vladimir","family":"Rybalkin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Muhammad Mohsin","family":"Ghaffar","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Javier Alejandro","family":"Varela","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Norbert","family":"Wehn","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andreas","family":"Dengel","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,1,30]]},"reference":[{"key":"690_CR1","unstructured":"ABBYY. https:\/\/www.abbyy.com\/en-eu\/. Accessed 24 Apr 2020"},{"key":"690_CR2","unstructured":"Omnipage. https:\/\/www.kofax.com\/Products\/omnipage?source=nuance. Accessed 24 Apr 2020"},{"key":"690_CR3","unstructured":"OCRopus. https:\/\/github.com\/tmbarchive\/ocropy. Accessed: 2020-04-24"},{"key":"690_CR4","unstructured":"Tesseract. https:\/\/github.com\/tesseract-ocr. Accessed 24 Apr 2020"},{"key":"690_CR5","unstructured":"Bukhari, S. S., Kadi, A, Jouneh, M. A., Mir, F. M., Dengel, A: anyocr: An open-source ocr system for historical archives. In: 2017 14th IAPR international conference on document analysis and recognition (ICDAR), vol. 1 , pp. 305\u2013310. IEEE, (2017)"},{"key":"690_CR6","unstructured":"Narrenschif. http:\/\/kallimachos.de\/kallimachos\/index.php\/Narragonien. Accessed 24 Apr 2020"},{"key":"690_CR7","doi-asserted-by":"crossref","unstructured":"Tekleyohannes, M. K., Rybalkin, V., Bukhari, S. S., Ghaffar, M. M., Varela, J. A., Wehn, N. D., Andreas: idocchip - a configurable hardware architecture for historical document image processing: multiresolution morphology-based text and image segmentation. In: The 6th international embedded systems symposium (IESS), (2019)","DOI":"10.1109\/ReConFig48160.2019.8994761"},{"key":"690_CR8","unstructured":"Brugger, C., Dal\u2019Aqua, L., Varela, J. A., De\u00a0Schryver, C., Sadri, M., Wehn, N., Klein, M., Siegrist, Michael: a quantitative cross-architecture study of morphological image processing on cpus, gpus, and fpgas. In: 2015 IEEE symposium on computer applications and industrial electronics (ISCAIE), pp. 201\u2013206. IEEE, (2015)"},{"key":"690_CR9","unstructured":"Qasaimeh, M., Denolf, K., Lo, J., Vissers, K., Zambreno, J., Jones, P. H.: Comparing energy efficiency of cpu, gpu and fpga implementations for vision kernels. In: 2019 IEEE international conference on embedded software and systems (ICESS), pp. 1\u20138. IEEE, (2019)"},{"key":"690_CR10","unstructured":"Page, A., Mohsenin, T.: An efficient and reconfigurable fpga and asic implementation of a spectral doppler ultrasound imaging system. In: 2013 IEEE 24th international conference on application-specific systems, architectures and processors, pp. 198\u2013202. IEEE, (2013)"},{"key":"690_CR11","doi-asserted-by":"crossref","unstructured":"Jiang, S., He, D., Yang, C., Xu, C., Luo, G., Chen, Y., Liu, Y., Jiang, J.: Accelerating mobile applications at the network edge with software-programmable fpgas. In: IEEE INFOCOM 2018-IEEE conference on computer communications, pp. 55\u201362. IEEE, (2018)","DOI":"10.1109\/INFOCOM.2018.8485850"},{"issue":"9","key":"690_CR12","doi-asserted-by":"crossref","first-page":"1648","DOI":"10.1002\/cta.2508","volume":"46","author":"R Bonamy","year":"2018","unstructured":"Bonamy, R., Bilavarn, S., Muller, F., Duhem, F., Heywood, S., Millet, P., Lemonnier, F.: Energy efficient mapping on manycore with dynamic and partial reconfiguration: Application to a smart camera. Int. J. Circuit Theory Appl 46(9), 1648\u20131662 (2018)","journal-title":"Int. J. Circuit Theory Appl"},{"key":"690_CR13","unstructured":"Xilinx, inc. zynq\u00ae-7000 All Programmable SoC. https:\/\/www.xilinx.com\/products\/silicon-devices\/soc\/zynq-7000.html. Accessed 24 Apr 2020"},{"key":"690_CR14","unstructured":"Baidu\u2019s apollo driverless platform. https:\/\/www.electronicdesign.com\/markets\/automotive\/article\/21119589\/xilinx-soc-fpga-powers-baidus-apollo-driverless-platform. Accessed 24 Apr 2020"},{"key":"690_CR15","unstructured":"Topic embedded systems. https:\/\/topic.nl\/en\/products. Accessed 24 Apr 2020"},{"key":"690_CR16","unstructured":"AXIOM beta: a professional digital cinema camera. https:\/\/apertus.org\/axiom. Accessed 24 Apr 2020"},{"key":"690_CR17","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1016\/j.nima.2017.11.033","volume":"912","author":"S Ishikawa","year":"2018","unstructured":"Ishikawa, S., Takahashi, T., Watanabe, S., Narukage, N., Miyazaki, S., Orita, T., Takeda, S., Nomachi, M., Fujishiro, I., Hodoshima, F.: High-speed x-ray imaging spectroscopy system with zynq soc for solar observations. Nucl. Instrum. Methods in Phys. Res. Sect. A: Accel, Spectrometers, Detectors and Associated Equipment 912, 191\u2013194 (2018)","journal-title":"Nucl. Instrum. Methods in Phys. Res. Sect. A: Accel, Spectrometers, Detectors and Associated Equipment"},{"issue":"18","key":"690_CR18","doi-asserted-by":"publisher","first-page":"4011","DOI":"10.3390\/s19184011","volume":"19","author":"\u00d3 Mata-Carballeira","year":"2019","unstructured":"Mata-Carballeira, \u00d3., Guti\u00e9rrez-Zaballa, J., del Campo, I., Mart\u00ednez, V.: An fpga-based neuro-fuzzy sensor for personalized driving assistance. Sensors 19(18), 4011 (2019)","journal-title":"Sensors"},{"issue":"1","key":"690_CR19","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1109\/TCAD.2017.2705069","volume":"37","author":"K Guo","year":"2017","unstructured":"Guo, K., Sui, L., Qiu, J., Jincheng, Yu., Wang, J., Yao, S., Song Han, Yu., Wang, and Huazhong Yang., : Angel-eye: A complete design flow for mapping cnn onto embedded fpga. IEEE Trans. Computer-Aided Des. Integr. Circuits Syst. 37(1), 35\u201347 (2017)","journal-title":"IEEE Trans. Computer-Aided Des. Integr. Circuits Syst."},{"key":"690_CR20","unstructured":"Afroge, S., Ahmed, B., Mahmud, F.: Optical character recognition using back propagation neural network. In: 2016 2nd international conference on electrical, computer and telecommunication engineering (ICECTE), pp. 1\u20134. IEEE, (2016)"},{"key":"690_CR21","doi-asserted-by":"crossref","unstructured":"Wei, T. C., Sheikh UU, Ab\u00a0Rahman, A.A.H.: Improved optical character recognition with deep neural network. In: 2018 IEEE 14th international colloquium on signal processing and its applications (CSPA), pp. 245\u2013249. IEEE, (2018)","DOI":"10.1109\/CSPA.2018.8368720"},{"key":"690_CR22","doi-asserted-by":"crossref","unstructured":"Clausner, C., Hayes, J., Antonacopoulos, A., Pletschacher, S.: Creating a complete workflow for digitising historical census documents: considerations and evaluation. In: Proceedings of the 4th international workshop on historical document imaging and processing, pp. 83\u201388, (2017)","DOI":"10.1145\/3151509.3151525"},{"key":"690_CR23","doi-asserted-by":"crossref","unstructured":"Das T. K., Tripathy, A. K., Mishra, A. K.: Optical character recognition using artificial neural network. In: 2017 international conference on computer communication and informatics (ICCCI), pp. 1\u20134. IEEE, (2017)","DOI":"10.1109\/ICCCI.2017.8117703"},{"key":"690_CR24","unstructured":"Awel, M. A., Abidi, A. I.: Review on optical character recognition. no. June, pp. 3666\u20133669, (2019)"},{"key":"690_CR25","doi-asserted-by":"crossref","unstructured":"Moysset, B., Kermorvant, C., Wolf, C., Louradour, J\u00e9r\u00f4me: Paragraph text segmentation into lines with recurrent neural networks. In: 2015 13th international conference on document analysis and recognition (ICDAR), pp. 456\u2013460. IEEE, (2015)","DOI":"10.1109\/ICDAR.2015.7333803"},{"key":"690_CR26","doi-asserted-by":"crossref","unstructured":"Murdock, M., Reid, S., Hamilton, B., Reese, J.: Icdar 2015 competition on text line detection in historical documents. In: 2015 13th international conference on document analysis and recognition (ICDAR), pp. 1171\u20131175. IEEE, (2015)","DOI":"10.1109\/ICDAR.2015.7333945"},{"key":"690_CR27","doi-asserted-by":"publisher","first-page":"112916","DOI":"10.1016\/j.eswa.2019.112916","volume":"140","author":"S Kundu","year":"2020","unstructured":"Kundu, S., Paul, S., Bera, S. K., Abraham, A., Sarkar, R.: Text-line extraction from handwritten document images using gan. Expert Syst Appl 140, 112916 (2020)","journal-title":"Expert Syst Appl"},{"issue":"1\u20132","key":"690_CR28","first-page":"1","volume":"21","author":"S Bhowmik","year":"2018","unstructured":"Bhowmik, S., Sarkar, R., Nasipuri, M., Doermann, D.: Text and non-text separation in offline document images: a survey. Int. J. Doc. Anal. Recognit(IJDAR) 21(1\u20132), 1\u201320 (2018)","journal-title":"Int. J. Doc. Anal. Recognit(IJDAR)"},{"key":"690_CR29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.patcog.2016.10.023","volume":"64","author":"S Eskenazi","year":"2017","unstructured":"Eskenazi, S., Gomez-Kr\u00e4mer, P., Ogier, J.-M.: A comprehensive survey of mostly textual document segmentation algorithms since 2008. Pattern Recognit. 64, 1\u201314 (2017)","journal-title":"Pattern Recognit."},{"key":"690_CR30","unstructured":"Mukarambi, G., Gaikwadl, H., Dhandra, B. V.: Segmentation and text extraction from document images: Survey. In: 2019 innovations in power and advanced computing technologies (i-PACT), vol. 1, pp. 1\u20135. IEEE, (2019)"},{"issue":"1","key":"690_CR31","doi-asserted-by":"publisher","first-page":"198","DOI":"10.1007\/s10489-015-0753-z","volume":"45","author":"OK Oyedotun","year":"2016","unstructured":"Oyedotun, O.K., Khashman, A.: Document segmentation using textural features summarization and feedforward neural network. Appl. Intel. 45(1), 198\u2013212 (2016)","journal-title":"Appl. Intel."},{"key":"690_CR32","unstructured":"Moll, M. A., Baird, H. S.: Segmentation-based retrieval of document images from diverse collections. In: Document Recognition and Retrieval XV, volume 6815, p 68150L. International Society for Optics and Photonics, (2008)"},{"key":"690_CR33","doi-asserted-by":"crossref","unstructured":"Bukhari, S. S.,Al Azawi, M. I. A., Shafait, F., Breuel, T.M.: Document image segmentation using discriminative learning over connected components. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pages 183\u2013190. ACM, (2010)","DOI":"10.1145\/1815330.1815354"},{"key":"690_CR34","unstructured":"Chowdhury, S., Mandal, S., Das, A., Chanda, B.: Segmentation of text and graphics from document images. In: Ninth international conference on document analysis and recognition (ICDAR 2007), vol. 2, pp. 619\u2013623. IEEE, (2007)"},{"key":"690_CR35","unstructured":"Bloomberg, D. S.: Multiresolution morphological approach to document image analysis. In: Proc. of the international conference on document analysis and recognition, Saint-Malo, France, (1991)"},{"key":"690_CR36","unstructured":"Bukhari, S. S., Shafait, F., Breuel, T. M.: Improved document image segmentation algorithm using multiresolution morphology. In: Document recognition and retrieval XVIII, vol. 7874, p 78740D. International society for optics and photonics, (2011)"},{"key":"690_CR37","doi-asserted-by":"publisher","first-page":"1015","DOI":"10.1016\/j.procs.2015.07.512","volume":"57","author":"A Kumar","year":"2015","unstructured":"Kumar, A., Rastogi, P., Srivastava, P.: Design and fpga implementation of dwt, image text extraction technique. Procedia Comput. Sci. 57, 1015\u20131025 (2015)","journal-title":"Procedia Comput. Sci."},{"key":"690_CR38","doi-asserted-by":"publisher","first-page":"437","DOI":"10.1016\/j.patcog.2016.12.005","volume":"66","author":"X Bai","year":"2017","unstructured":"Bai, X., Shi, B., Zhang, C., Cai, X., Qi, L.: Text\/non-text image classification in the wild with convolutional neural networks. Pattern Recognit. 66, 437\u2013446 (2017)","journal-title":"Pattern Recognit."},{"key":"690_CR39","doi-asserted-by":"crossref","unstructured":"Rybalkin, V., Bukhari, S. S., Ghaffar, M. M., Ghafoor, A., Wehn, N., Dengel, A.: idocchip: A configurable hardware architecture for historical document image processing: Percentile based binarization. In: Proceedings of the ACM symposium on document engineering 2018, p. 24. ACM, (2018)","DOI":"10.1145\/3209280.3209538"},{"key":"690_CR40","doi-asserted-by":"crossref","unstructured":"Tekleyohannes, M. K., Rybalkin, V., Ghaffar, M.M., Wehn, N., Dengel, A.: idocchip-a configurable hardware architecture for historical document image processing: Text line extraction. In: 2019 International conference on reconfigurable computing and FPGAs (ReConFig), pp. 1\u20138. IEEE, (2019)","DOI":"10.1109\/ReConFig48160.2019.8994761"},{"key":"690_CR41","unstructured":"Rybalkin, V., Wehn, N., Yousefi, M. R., Stricker, D.: Hardware architecture of bidirectional long short-term memory neural network for optical character recognition. In: Proceedings of the conference on design, automation and test in Europe, pp. 1394\u20131399. European design and automation association, (2017)"},{"key":"690_CR42","unstructured":"Chang, A. X. M., Martini, B., Culurciello, E.: Recurrent neural networks hardware implementation on fpga. nov (2015)"},{"key":"690_CR43","doi-asserted-by":"crossref","unstructured":"Tekleyohannes, M. K., Weis, C., Wehn, N., Klein, M., Siegrist, M.: A reconfigurable accelerator for morphological operations. In: 2018 IEEE international parallel and distributed processing symposium workshops (IPDPSW), pp. 186\u2013193. IEEE, (2018)","DOI":"10.1109\/IPDPSW.2018.00035"},{"key":"690_CR44","unstructured":"Tekleyohannes, M., Sadri, M., Weis, C., Wehn, N., Klein, M., Siegrist, M.: An advanced embedded architecture for connected component analysis in industrial applications. In: Design, automation and test in Europe conference and exhibition (DATE), 2017, pp. 734\u2013735. IEEE, (2017)"},{"key":"690_CR45","unstructured":"Multi-dimensional image processing (scipy.ndimage). https:\/\/docs.scipy.org\/doc\/scipy-0.14.0\/reference\/ndimage.html. Accessed 24 Apr 2020"}],"container-title":["International Journal of Parallel Programming"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10766-020-00690-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10766-020-00690-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10766-020-00690-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,19]],"date-time":"2023-10-19T20:27:20Z","timestamp":1697747240000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10766-020-00690-y"}},"subtitle":["Multiresolution Morphology-based Text and Image Segmentation"],"short-title":[],"issued":{"date-parts":[[2021,1,30]]},"references-count":45,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2021,4]]}},"alternative-id":["690"],"URL":"https:\/\/doi.org\/10.1007\/s10766-020-00690-y","relation":{},"ISSN":["0885-7458","1573-7640"],"issn-type":[{"value":"0885-7458","type":"print"},{"value":"1573-7640","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,30]]},"assertion":[{"value":"28 October 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 December 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 January 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}