{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,5]],"date-time":"2026-02-05T12:19:14Z","timestamp":1770293954756,"version":"3.49.0"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2023,9,30]],"date-time":"2023-09-30T00:00:00Z","timestamp":1696032000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,9,30]],"date-time":"2023-09-30T00:00:00Z","timestamp":1696032000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["52061020"],"award-info":[{"award-number":["52061020"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61971208"],"award-info":[{"award-number":["61971208"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100018531","name":"Major Science and Technology Projects in Yunnan Province","doi-asserted-by":"publisher","award":["202302AG050009"],"award-info":[{"award-number":["202302AG050009"]}],"id":[{"id":"10.13039\/501100018531","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Yunnan Fundamental Research Projects","award":["202301AV070003"],"award-info":[{"award-number":["202301AV070003"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2024,4]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Locating tables in document images is the first step to extracting table information, and high location precision is required. The dominant approach of table detection is based on an object detection algorithm, and the detector defines the prediction task as a regression problem, which inevitably leads to positioning errors. To address this issue, this paper presents an approach called Border Line Correction (BLC) to refine the rough prediction results of the original detector through the table boundary lines extracted from the document image. Our approach transforms the regression task into a classification problem, thus avoiding the inherent regression error of the object detection algorithm. Traditional annotation methods are inadequate for table detection tasks as they fail to capture the completeness and purity of the detection results. Therefore, this study treats the correct position of a table as a tolerance region. Additionally, to overcome the limitations of existing datasets in the materials domain, we collected 1183 samples from scientific literature in the materials field and created the MatTab dataset, annotating the tables with tolerance regions. This paper use Cascade RCNN with Swin Transformer as baseline models, and BLC is utilized to optimize the detection results. Experimental results demonstrate significant improvements with BLC at an IOU of 0.95 on the MatTab, ICDAR2019, and ICDAR2017 datasets. In MatTab, the percentage of correctly detected complete and pure tables increased from 72.3% to 82.1%.<\/jats:p>","DOI":"10.1007\/s40747-023-01235-9","type":"journal-article","created":{"date-parts":[[2023,9,30]],"date-time":"2023-09-30T08:01:31Z","timestamp":1696060891000},"page":"1703-1714","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Improving table detection for document images using boundary"],"prefix":"10.1007","volume":"10","author":[{"given":"Yingli","family":"Liu","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jianfeng","family":"Zheng","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Guangtao","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tao","family":"Shen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2023,9,30]]},"reference":[{"key":"1235_CR1","doi-asserted-by":"publisher","unstructured":"Itonori K (1993) Table structure recognition based on textblock arrangement and ruled line position. In: Proceedings of 2nd international conference on document analysis and recognition (ICDAR \u201993), pp. 765\u2013768. https:\/\/doi.org\/10.1109\/ICDAR.1993.395625","DOI":"10.1109\/ICDAR.1993.395625"},{"key":"1235_CR2","doi-asserted-by":"publisher","unstructured":"Kieninger T (1998) Table structure recognition based on robust block segmentation. In: DOCUMENT RECOGNITION V, vol. 3305, pp. 22\u201332. https:\/\/doi.org\/10.1117\/12.304642","DOI":"10.1117\/12.304642"},{"key":"1235_CR3","doi-asserted-by":"publisher","unstructured":"Chandran S, Kasturi R Structural recognition of tabulated data. In: Proceedings of 2nd international conference on document analysis and recognition (ICDAR \u201993), pp. 516\u2013519. https:\/\/doi.org\/10.1109\/ICDAR.1993.395683","DOI":"10.1109\/ICDAR.1993.395683"},{"key":"1235_CR4","doi-asserted-by":"publisher","unstructured":"Ottoni Andr\u00e9 LC, Nepomuceno MSdO Erivelton G (2022) Reinforcement learning for the traveling salesman problem with refueling. Complex Intell Syst 8:2001\u20132015. https:\/\/doi.org\/10.1016\/j.engappai.2020.103551","DOI":"10.1016\/j.engappai.2020.103551"},{"key":"1235_CR5","doi-asserted-by":"publisher","unstructured":"Li Y, Gao L, Tang Z, Yan Q, Huang Y (2019) A gan-based feature generator for table detection. In: 2019 International conference on document analysis and recognition (ICDAR), pp. 763\u2013768. https:\/\/doi.org\/10.1109\/ICDAR.2019.00127","DOI":"10.1109\/ICDAR.2019.00127"},{"key":"1235_CR6","doi-asserted-by":"publisher","unstructured":"Gilani A, Qasim SR, Malik I, Shafait F (2017) Table detection using deep learning. In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR), 01:771\u2013776. https:\/\/doi.org\/10.1109\/ICDAR.2017.131","DOI":"10.1109\/ICDAR.2017.131"},{"key":"1235_CR7","doi-asserted-by":"publisher","unstructured":"Casado-Garc\u00eda A, Dominguez C, Heras J, Mata E, Pascual V (2020) Chapter 15. The Benefits of close-domain fine-tuning for table detection in document images, pp. 199\u2013215. https:\/\/doi.org\/10.1007\/978-3-030-57058-3_15","DOI":"10.1007\/978-3-030-57058-3_15"},{"key":"1235_CR8","doi-asserted-by":"publisher","unstructured":"Nazir D, Hashmi KA, Pagani A, Liwicki M, Stricker D, Afzal MZ (2021) Hybridtabnet: Towards better table detection in scanned document images. Appl Sci 11(18) https:\/\/doi.org\/10.3390\/app11188396","DOI":"10.3390\/app11188396"},{"issue":"1","key":"1235_CR9","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10032-021-00390-4","volume":"25","author":"D-D Nguyen","year":"2022","unstructured":"Nguyen D-D (2022) Tablesegnet: a fully convolutional network for table detection and segmentation in document images. Int J Document Anal Recognit (IJDAR) 25(1):1\u201314. https:\/\/doi.org\/10.1007\/s10032-021-00390-4","journal-title":"Int J Document Anal Recognit (IJDAR)"},{"key":"1235_CR10","doi-asserted-by":"publisher","unstructured":"Prasad D, Gadpal A, Kapadni K, Visave M, Sultanpure K (2020) Cascadetabnet: An approach for end to end table detection and structure recognition from image-based documents. In: 2020 IEEE\/CVF conference on computer vision and pattern recognition workshops (CVPRW), pp. 2439\u20132447. https:\/\/doi.org\/10.1109\/CVPRW50498.2020.00294","DOI":"10.1109\/CVPRW50498.2020.00294"},{"key":"1235_CR11","doi-asserted-by":"publisher","unstructured":"Dang Qianlong, GM, Gao Weifeng (2022) Multiobjective multitasking optimization assisted by multidirectional prediction method. Complex & Intell Syst 8:1663\u20131679. https:\/\/doi.org\/10.1007\/s40747-021-00624-2","DOI":"10.1007\/s40747-021-00624-2"},{"issue":"4","key":"1235_CR12","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1016\/0031-3203(91)90073-E","volume":"24","author":"N Kiryati","year":"1991","unstructured":"Kiryati N, Eldar Y, Bruckstein AM (1991) A probabilistic hough transform. Pattern Recognit 24(4):303\u2013316. https:\/\/doi.org\/10.1016\/0031-3203(91)90073-E","journal-title":"Pattern Recognit"},{"key":"1235_CR13","doi-asserted-by":"publisher","unstructured":"Lu X, Yao J, Li K, Li L Cannylines: A parameter-free line segment detector. In: 2015 IEEE international conference on image processing (ICIP), pp. 507\u2013511. https:\/\/doi.org\/10.1109\/ICIP.2015.7350850","DOI":"10.1109\/ICIP.2015.7350850"},{"key":"1235_CR14","doi-asserted-by":"crossref","unstructured":"Kieninger T, Dengel A (1999) The T-Recs table recognition and analysis system, vol. 1655, pp. 255\u2013269. Springer, Berlin","DOI":"10.1007\/3-540-48172-9_21"},{"key":"1235_CR15","doi-asserted-by":"publisher","unstructured":"Kieninger T, Dengel A Applying the t-recs table recognition system to the business letter domain. In: Proceedings of Sixth International conference on document analysis and recognition, pp. 518\u2013522. https:\/\/doi.org\/10.1109\/ICDAR.2001.953843","DOI":"10.1109\/ICDAR.2001.953843"},{"key":"1235_CR16","doi-asserted-by":"publisher","unstructured":"Kasar T, Barlas P, Adam S, Chatelain C, Paquet T (2013) Learning to detect tables in scanned document images using line information. In: 2013 12th international conference on document analysis and recognition, pp. 1185\u20131189. https:\/\/doi.org\/10.1109\/ICDAR.2013.240","DOI":"10.1109\/ICDAR.2013.240"},{"key":"1235_CR17","doi-asserted-by":"publisher","unstructured":"Kieninger T, Dengel A (2005) An approach towards benchmarking of table structure recognition results. In: Eighth international conference on document analysis and recognition (ICDAR\u201905), pp. 1232\u201312362. https:\/\/doi.org\/10.1109\/ICDAR.2005.47","DOI":"10.1109\/ICDAR.2005.47"},{"issue":"6","key":"1235_CR18","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S Ren","year":"2017","unstructured":"Ren S, He K, Girshick R, Sun J (2017) Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137\u20131149. https:\/\/doi.org\/10.1109\/TPAMI.2016.2577031","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1235_CR19","unstructured":"Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative Adversarial Nets"},{"key":"1235_CR20","doi-asserted-by":"publisher","unstructured":"Agarwal M, Mondal A, Jawahar CV Cdec-net: Composite deformable cascade network for table detection in document images. In: 2020 25th international conference on pattern recognition (ICPR), pp. 9491\u20139498. https:\/\/doi.org\/10.1109\/ICPR48806.2021.9411922","DOI":"10.1109\/ICPR48806.2021.9411922"},{"key":"1235_CR21","doi-asserted-by":"publisher","unstructured":"Schreiber S, Agne S, Wolf I, Dengel A, Ahmed S (2017) Deepdesrt: Deep learning for detection and structure recognition of tables in document images. In: 2017 14th IAPR international conference on document analysis and recognition (ICDAR), 01:1162\u20131167. https:\/\/doi.org\/10.1109\/ICDAR.2017.192","DOI":"10.1109\/ICDAR.2017.192"},{"key":"1235_CR22","doi-asserted-by":"publisher","unstructured":"Hashmi KA, Pagani A, Liwicki M, Stricker D, Afzal MZ (2021) Castabdetectors: Cascade network for table detection in document images with recursive feature pyramid and switchable atrous convolution. J Imag 7(10) https:\/\/doi.org\/10.3390\/jimaging7100214","DOI":"10.3390\/jimaging7100214"},{"key":"1235_CR23","doi-asserted-by":"publisher","unstructured":"Sun N, Zhu Y, Hu X (2019) Faster r-cnn based table detection combining corner locating. In: 2019 international conference on document analysis and recognition (ICDAR), pp. 1314\u20131319. https:\/\/doi.org\/10.1109\/ICDAR.2019.00212","DOI":"10.1109\/ICDAR.2019.00212"},{"key":"1235_CR24","doi-asserted-by":"publisher","unstructured":"Kara E, Traquair M, Simsek M, Kantarci B, Khan S (2020) Holistic design for deep learning-based discovery of tabular structures in datasheet images. Eng Appl Artificial Intell 90 https:\/\/doi.org\/10.1016\/j.engappai.2020.103551","DOI":"10.1016\/j.engappai.2020.103551"},{"key":"1235_CR25","doi-asserted-by":"publisher","unstructured":"Gao L, Huang Y, D\u00e9jean H, Meunier J-L, Yan Q, Fang Y, Kleber F, Lang E (2019) Icdar 2019 competition on table detection and recognition (ctdar). In: 2019 International conference on document analysis and recognition (ICDAR), pp. 1510\u20131515. https:\/\/doi.org\/10.1109\/ICDAR.2019.00243","DOI":"10.1109\/ICDAR.2019.00243"},{"key":"1235_CR26","doi-asserted-by":"publisher","unstructured":"G\u00f6bel M, Hassan T, Oro E, Orsi G (2013) Icdar 2013 table competition. In: 2013 12th international conference on document analysis and recognition, pp. 1449\u20131453. https:\/\/doi.org\/10.1109\/ICDAR.2013.292","DOI":"10.1109\/ICDAR.2013.292"},{"key":"1235_CR27","doi-asserted-by":"publisher","unstructured":"Pablo MAW, Jackson Nicholas E (2019) New frontiers for the materials genome initiative. npj Comput Mater 5, 41 https:\/\/doi.org\/10.1038\/s41524-019-0173-4","DOI":"10.1038\/s41524-019-0173-4"},{"key":"1235_CR28","doi-asserted-by":"publisher","unstructured":"Gao L, Yi X, Jiang Z, Hao L, Tang Z (2017) Icdar2017 competition on page object detection. In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR), 01:1417\u20131422. https:\/\/doi.org\/10.1109\/ICDAR.2017.231","DOI":"10.1109\/ICDAR.2017.231"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-023-01235-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-023-01235-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-023-01235-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,30]],"date-time":"2024-03-30T15:15:28Z","timestamp":1711811728000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-023-01235-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,30]]},"references-count":28,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2024,4]]}},"alternative-id":["1235"],"URL":"https:\/\/doi.org\/10.1007\/s40747-023-01235-9","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,30]]},"assertion":[{"value":"13 April 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 September 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 September 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflicts of interest"}}]}}