{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,12]],"date-time":"2026-06-12T16:08:14Z","timestamp":1781280494896,"version":"3.54.1"},"reference-count":148,"publisher":"Association for Computing Machinery (ACM)","issue":"12","license":[{"start":{"date-parts":[[2024,10,3]],"date-time":"2024-10-03T00:00:00Z","timestamp":1727913600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2024,12,31]]},"abstract":"<jats:p>Tables\u00a0are everywhere, from scientific journals, articles, websites, and newspapers all the way to items we buy at the supermarket. Detecting them is thus of utmost importance to automatically understanding the content of a document. The performance of table detection has substantially increased thanks to the rapid development of deep learning networks. The goals of this survey are to provide a profound comprehension of the major developments in the field of Table\u00a0Detection, offer insight into the different methodologies, and provide a systematic taxonomy of the different approaches. Furthermore, we provide an analysis of both classic and new applications in the field. Lastly, the datasets and source code of the existing models are organized to provide the reader with a compass on this vast literature. Finally, we go over the architecture of utilizing various object detection and table structure recognition methods to create an effective and efficient system, as well as a set of development trends to keep up with state-of-the-art algorithms and future research. We have also set up a public GitHub repository where we will be updating the most recent publications, open data, and source code. The GitHub repository is available at https:\/\/github.com\/abdoelsayed2016\/table-detection-structure-recognition.<\/jats:p>","DOI":"10.1145\/3657281","type":"journal-article","created":{"date-parts":[[2024,4,10]],"date-time":"2024-04-10T12:25:49Z","timestamp":1712751949000},"page":"1-41","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":19,"title":["Deep Learning for Table Detection and Structure Recognition: A Survey"],"prefix":"10.1145","volume":"56","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8513-570X","authenticated-orcid":false,"given":"Mahmoud","family":"Salaheldin Kasem","sequence":"first","affiliation":[{"name":"Faculty of Computer and Information, Assuit University, Assuit, Egypt and Chungbuk National University, Cheongju, Republic of Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8747-4927","authenticated-orcid":false,"given":"Abdelrahman","family":"Abdallah","sequence":"additional","affiliation":[{"name":"Faculty of Computer and Information, Assuit University, Assuit Egypt and Ca' Foscari University of Venice, Venezia, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2150-8287","authenticated-orcid":false,"given":"Alexander","family":"Berendeyev","sequence":"additional","affiliation":[{"name":"Satbayev University, Almaty, Kazakhstan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7248-8880","authenticated-orcid":false,"given":"Ebrahem","family":"Elkady","sequence":"additional","affiliation":[{"name":"Faculty of Computer and Information, Assuit University, Assuit, Egypt"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6764-8969","authenticated-orcid":false,"given":"Mohamed","family":"Mahmoud","sequence":"additional","affiliation":[{"name":"Faculty of Computer and Information, Assuit University, Assuit, Egypt and College of Electrical and Computer Engineering, Chungbuk National University, Cheongju, Korea (the Republic of)"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4270-2268","authenticated-orcid":false,"given":"Mahmoud","family":"Abdalla","sequence":"additional","affiliation":[{"name":"Information Technology Institute, Alexandria, Egypt"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0442-3663","authenticated-orcid":false,"given":"Mohamed","family":"Hamada","sequence":"additional","affiliation":[{"name":"Department of Information System, International IT University, Almaty, Kazakhstan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7855-1641","authenticated-orcid":false,"given":"Sebastiano","family":"Vascon","sequence":"additional","affiliation":[{"name":"Ca' Foscari University of Venice, Venezia, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1073-4254","authenticated-orcid":false,"given":"Daniyar","family":"Nurseitov","sequence":"additional","affiliation":[{"name":"JSC NC KazMunayGas, Astana, Kazakhstan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3028-6751","authenticated-orcid":false,"given":"Islam","family":"Taj-Eddin","sequence":"additional","affiliation":[{"name":"Faculty of Computer and Information, Assuit University, Assuit, Egypt"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2024,10,3]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","unstructured":"Abdelrahman Abdallah Alexander Berendeyev Islam Nuradin and Daniyar Nurseitov. 2022. TNCR: Table net detection and classification dataset. Neurocomputing (2022) 79\u201397. DOI:10.1016\/j.neucom.2021.11.101","DOI":"10.1016\/j.neucom.2021.11.101"},{"key":"e_1_3_1_3_2","unstructured":"Abdelrahman Abdallah Daniel Eberharter Zoe Pfister and Adam Jatowt. 2024. Transformers and language models in form understanding: A comprehensive review of scanned document analysis. arXiv:2403.04080. Retrieved from https:\/\/arxiv.org\/abs\/2403.04080"},{"key":"e_1_3_1_4_2","unstructured":"Abdelrahman Abdallah and Adam Jatowt. 2023. Generator-retriever-generator: A novel approach to open-domain question answering. arXiv:2307.11278. Retrieved from https:\/\/arxiv.org\/abs\/2307.11278"},{"key":"e_1_3_1_5_2","doi-asserted-by":"crossref","unstructured":"Abdelrahman Abdallah Mahmoud Kasem Mahmoud Abdalla Mohamed Mahmoud Mohamed Elkasaby Yasser Elbendary and Adam Jatowt. 2024. ArabicaQA: A comprehensive dataset for arabic question answering. arXiv:2403.17848. Retrieved from https:\/\/arxiv.org\/abs\/2403.17848","DOI":"10.1145\/3626772.3657889"},{"key":"e_1_3_1_6_2","doi-asserted-by":"crossref","first-page":"9491","DOI":"10.1109\/ICPR48806.2021.9411922","volume-title":"Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR)","author":"Agarwal Madhav","year":"2021","unstructured":"Madhav Agarwal, Ajoy Mondal, and CV Jawahar. 2021. Cdec-net: Composite deformable cascade network for table detection in document images. In Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR). IEEE, 9491\u20139498."},{"issue":"6","key":"e_1_3_1_7_2","doi-asserted-by":"crossref","first-page":"4701","DOI":"10.1007\/s00521-022-07992-x","article-title":"Customer decision-making analysis based on big social data using machine learning: A case study of hotels in mecca","volume":"35","author":"Alsayat Ahmed","year":"2023","unstructured":"Ahmed Alsayat. 2023. Customer decision-making analysis based on big social data using machine learning: A case study of hotels in mecca. Neural Computing and Applications 35, 6 (2023), 4701\u20134722.","journal-title":"Neural Computing and Applications"},{"key":"e_1_3_1_8_2","first-page":"1","volume-title":"Proceedings of the 2018 Digital Image Computing: Techniques and Applications (DICTA)","author":"Arif Saman","year":"2018","unstructured":"Saman Arif and Faisal Shafait. 2018. Table detection in document images using foreground and background features. In Proceedings of the 2018 Digital Image Computing: Techniques and Applications (DICTA). IEEE, 1\u20138."},{"key":"e_1_3_1_9_2","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1109\/SEAA.2018.00018","volume-title":"Proceedings of the 2018 44th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","author":"Arpteg Anders","year":"2018","unstructured":"Anders Arpteg, Bj\u00f6rn Brinne, Luka Crnkovic-Friis, and Jan Bosch. 2018. Software engineering challenges of deep learning. In Proceedings of the 2018 44th Euromicro Conference on Software Engineering and Advanced Applications (SEAA). IEEE, 50\u201359."},{"issue":"8","key":"e_1_3_1_10_2","doi-asserted-by":"crossref","first-page":"1798","DOI":"10.1109\/TPAMI.2013.50","article-title":"Representation learning: A review and new perspectives","volume":"35","author":"Bengio Yoshua","year":"2013","unstructured":"Yoshua Bengio, Aaron Courville, and Pascal Vincent. 2013. Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 8 (2013), 1798\u20131828.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_1_11_2","first-page":"213","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Carion Nicolas","year":"2020","unstructured":"Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, and Sergey Zagoruyko. 2020. End-to-end object detection with transformers. In Proceedings of the European Conference on Computer Vision. Springer, 213\u2013229."},{"key":"e_1_3_1_12_2","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1007\/978-3-030-57058-3_15","volume-title":"Proceedings of the International Workshop on Document Analysis Systems","author":"Casado-Garc\u00eda \u00c1ngela","year":"2020","unstructured":"\u00c1ngela Casado-Garc\u00eda, C\u00e9sar Dom\u00ednguez, J\u00f3nathan Heras, Eloy Mata, and Vico Pascual. 2020. The benefits of close-domain fine-tuning for table detection in document images. In Proceedings of the International Workshop on Document Analysis Systems. Springer, 199\u2013215."},{"key":"e_1_3_1_13_2","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1109\/ICPR.2002.1047838","volume-title":"Proceedings of the Object Recognition Supported by User Interaction for Service Robots","author":"Cesarini Francesca","year":"2002","unstructured":"Francesca Cesarini, Simone Marinai, L Sarti, and Giovanni Soda. 2002. Trainable table location in document images. In Proceedings of the Object Recognition Supported by User Interaction for Service Robots. IEEE, 236\u2013240."},{"key":"e_1_3_1_14_2","doi-asserted-by":"crossref","first-page":"516","DOI":"10.1109\/ICDAR.1993.395683","volume-title":"Proceedings of the 2nd International Conference on Document Analysis and Recognition (ICDAR\u201993)","author":"Chandran Surekha","year":"1993","unstructured":"Surekha Chandran and Rangachar Kasturi. 1993. Structural recognition of tabulated data. In Proceedings of the 2nd International Conference on Document Analysis and Recognition (ICDAR\u201993). IEEE, 516\u2013519."},{"key":"e_1_3_1_15_2","unstructured":"Zewen Chi Heyan Huang Heng-Da Xu Houjin Yu Wanxuan Yin and Xian-Ling Mao. 2019. Complicated table structure recognition. arXiv:1908.04729. Retrieved from https:\/\/arxiv.org\/abs\/1908.04729"},{"key":"e_1_3_1_16_2","doi-asserted-by":"crossref","unstructured":"Bertrand Co\u00fcasnon and Aur\u00e9lie Lemaitre. 2014. Recognition of tables and forms.","DOI":"10.1007\/978-0-85729-859-1_20"},{"key":"e_1_3_1_17_2","doi-asserted-by":"crossref","first-page":"894","DOI":"10.1109\/ICDAR.2019.00148","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Deng Yuntian","year":"2019","unstructured":"Yuntian Deng, David Rosenberg, and Gideon Mann. 2019. Challenges in end-to-end neural scientific table recognition. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 894\u2013901."},{"key":"e_1_3_1_18_2","first-page":"69","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","author":"Dong Haoyu","year":"2019","unstructured":"Haoyu Dong, Shijie Liu, Shi Han, Zhouyu Fu, and Dongmei Zhang. 2019. Tablesense: Spreadsheet table detection with convolutional neural networks. In Proceedings of the AAAI Conference on Artificial Intelligence. 69\u201376."},{"key":"e_1_3_1_19_2","doi-asserted-by":"crossref","first-page":"843","DOI":"10.1109\/ICDAR.2009.185","volume-title":"Proceedings of the 2009 10th International Conference on Document Analysis and Recognition","author":"Silva Ana Costa e","year":"2009","unstructured":"Ana Costa e Silva. 2009. Learning rich hidden Markov models in document analysis: Table location. In Proceedings of the 2009 10th International Conference on Document Analysis and Recognition. IEEE, 843\u2013847."},{"issue":"2","key":"e_1_3_1_20_2","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1007\/s10032-006-0017-x","article-title":"Table-processing paradigms: A research survey","volume":"8","author":"Embley David W.","year":"2006","unstructured":"David W. Embley, Matthew Hurst, Daniel Lopresti, and George Nagy. 2006. Table-processing paradigms: A research survey. International Journal of Document Analysis and Recognition 8, 2 (2006), 66\u201386.","journal-title":"International Journal of Document Analysis and Recognition"},{"key":"e_1_3_1_21_2","first-page":"3937","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Fakoor Rasool","year":"2013","unstructured":"Rasool Fakoor, Faisal Ladhak, Azade Nazi, and Manfred Huber. 2013. Using deep learning to enhance cancer diagnosis and classification. In Proceedings of the International Conference on Machine Learning. ACM, New York, USA, 3937\u20133949."},{"key":"e_1_3_1_22_2","unstructured":"Miao Fan and Doo Soon Kim. 2015. Table region detection on large-scale PDF files without labeled data. arXiv:1506.08891. Retrieved from https:\/\/arxiv.org\/abs\/1506.08891"},{"key":"e_1_3_1_23_2","volume-title":"Proceedings of the 26th AAAI Conference on Artificial Intelligence","author":"Fang Jing","year":"2012","unstructured":"Jing Fang, Prasenjit Mitra, Zhi Tang, and C Lee Giles. 2012. Table header detection and classification. In Proceedings of the 26th AAAI Conference on Artificial Intelligence."},{"key":"e_1_3_1_24_2","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1109\/DAS.2012.29","volume-title":"Proceedings of the 2012 10th IAPR International Workshop on Document Analysis Systems","author":"Fang Jing","year":"2012","unstructured":"Jing Fang, Xin Tao, Zhi Tang, Ruiheng Qiu, and Ying Liu. 2012. Dataset, ground-truth and performance metrics for table detection evaluation. In Proceedings of the 2012 10th IAPR International Workshop on Document Analysis Systems. IEEE, 445\u2013449."},{"key":"e_1_3_1_25_2","first-page":"95","volume-title":"Proceedings of the KI 2021: Advances in Artificial Intelligence: 44th German Conference on AI, Virtual Event, September 27\u2013October 1, 2021","author":"Fischer Pascal","year":"2021","unstructured":"Pascal Fischer, Alen Smajic, Giuseppe Abrami, and Alexander Mehler. 2021. Multi-type-td-tsr\u2013extracting tables from document images using a multi-stage pipeline for table detection and table structure recognition: From ocr to structured table representations. In Proceedings of the KI 2021: Advances in Artificial Intelligence: 44th German Conference on AI, Virtual Event, September 27\u2013October 1, 2021. Springer, 95\u2013108."},{"key":"e_1_3_1_26_2","first-page":"1510","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Gao Liangcai","year":"2019","unstructured":"Liangcai Gao, Yilun Huang, Herv\u00e9 D\u00e9jean, Jean-Luc Meunier, Qinqin Yan, Yu Fang, Florian Kleber, and Eva Lang. 2019. ICDAR 2019 competition on table detection and recognition (cTDaR). In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1510\u20131515."},{"key":"e_1_3_1_27_2","first-page":"1417","volume-title":"Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","author":"Gao Liangcai","year":"2017","unstructured":"Liangcai Gao, Xiaohan Yi, Zhuoren Jiang, Leipeng Hao, and Zhi Tang. 2017. ICDAR2017 competition on page object detection. In Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1417\u20131422."},{"key":"e_1_3_1_28_2","first-page":"1","volume-title":"Proceedings of the 2022 IEEE 27th International Conference on Emerging Technologies and Factory Automation (ETFA)","author":"Chowdhury Arnab Ghosh","year":"2022","unstructured":"Arnab Ghosh Chowdhury, Martin ben Ahmed, and Martin Atzmueller. 2022. Towards tabular data extraction from richly-structured documents using supervised and weakly-supervised learning. In Proceedings of the 2022 IEEE 27th International Conference on Emerging Technologies and Factory Automation (ETFA). IEEE, 1\u20134."},{"key":"e_1_3_1_29_2","first-page":"771","volume-title":"Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","author":"Gilani Azka","year":"2017","unstructured":"Azka Gilani, Shah Rukh Qasim, Imran Malik, and Faisal Shafait. 2017. Table detection using deep learning. In Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). IEEE, 771\u2013776."},{"key":"e_1_3_1_30_2","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1145\/2361354.2361365","volume-title":"Proceedings of the 2012 ACM Symposium on Document Engineering","author":"G\u00f6bel Max","year":"2012","unstructured":"Max G\u00f6bel, Tamir Hassan, Ermelinda Oro, and Giorgio Orsi. 2012. A methodology for evaluating algorithms for table understanding in PDF documents. In Proceedings of the 2012 ACM Symposium on Document Engineering. 45\u201348."},{"key":"e_1_3_1_31_2","doi-asserted-by":"crossref","first-page":"1449","DOI":"10.1109\/ICDAR.2013.292","volume-title":"Proceedings of the 2013 12th International Conference on Document Analysis and Recognition","author":"G\u00f6bel Max","year":"2013","unstructured":"Max G\u00f6bel, Tamir Hassan, Ermelinda Oro, and Giorgio Orsi. 2013. ICDAR 2013 table competition. In Proceedings of the 2013 12th International Conference on Document Analysis and Recognition. IEEE, 1449\u20131453."},{"key":"e_1_3_1_32_2","volume-title":"Deep Learning","author":"Goodfellow Ian","year":"2016","unstructured":"Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. MIT Press."},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","unstructured":"A. A. Gurav and Manisha J. Nene. 2020. Weakly supervised learning-based table detection. SN Comput. Sci. 1 2 (2020) 90. DOI:10.1007\/S42979-020-0113-X","DOI":"10.1007\/S42979-020-0113-X"},{"key":"e_1_3_1_34_2","unstructured":"Mrinal Haloi Shashank Shekhar Nikhil Fande Siddhant Swaroop Dash and Sanjay G. 2022. Table detection in the wild: A novel diverse table detection dataset and method. arXiv preprint arXiv:2209.09207 (2022)."},{"key":"e_1_3_1_35_2","first-page":"1","volume-title":"Proceedings of the 2021 IEEE International Conference on Smart Information Systems and Technologies (SIST)","author":"Hamada Mohamed A","year":"2021","unstructured":"Mohamed A Hamada, Abdelrahman Abdallah, Mahmoud Kasem, and Mohamed Abokhalil. 2021. Neural network estimation model to optimize timing and schedule of software projects. In Proceedings of the 2021 IEEE International Conference on Smart Information Systems and Technologies (SIST). IEEE, 1\u20137."},{"key":"e_1_3_1_36_2","first-page":"287","volume-title":"Proceedings of the 2016 12th IAPR Workshop on Document Analysis Systems (DAS)","author":"Hao Leipeng","year":"2016","unstructured":"Leipeng Hao, Liangcai Gao, Xiaohan Yi, and Zhi Tang. 2016. A table detection method for pdf documents based on convolutional neural networks. In Proceedings of the 2016 12th IAPR Workshop on Document Analysis Systems (DAS). IEEE, 287\u2013292."},{"key":"e_1_3_1_37_2","first-page":"1","volume-title":"Proceedings of the 8th Indian Conference on Computer Vision, Graphics, and Image Processing","author":"Harit Gaurav","year":"2012","unstructured":"Gaurav Harit and Anukriti Bansal. 2012. Table detection in document images using header and trailer patterns. In Proceedings of the 8th Indian Conference on Computer Vision, Graphics, and Image Processing. 1\u20138."},{"key":"e_1_3_1_38_2","first-page":"991","volume-title":"Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR)","author":"Harley Adam W","year":"2015","unstructured":"Adam W Harley, Alex Ufkes, and Konstantinos G Derpanis. 2015. Evaluation of deep convolutional nets for document image classification and retrieval. In Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR). IEEE, 991\u2013995."},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","unstructured":"Khurram Azeem Hashmi Didier Stricker Marcus Liwicki Muhammad Noman Afzal and Muhammad Zeshan Afzal. 2021. Guided table structure recognition through anchor optimization. IEEE Access 9 (2021) 113521\u2013113534. DOI:10.1109\/ACCESS.2021.3103413","DOI":"10.1109\/ACCESS.2021.3103413"},{"key":"e_1_3_1_40_2","first-page":"1143","volume-title":"Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007)","author":"Hassan Tamir","year":"2007","unstructured":"Tamir Hassan and Robert Baumgartner. 2007. Table recognition and understanding from pdf files. In Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007). IEEE, 1143\u20131147."},{"key":"e_1_3_1_41_2","first-page":"254","volume-title":"Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","author":"He Dafang","year":"2017","unstructured":"Dafang He, Scott Cohen, Brian Price, Daniel Kifer, and C Lee Giles. 2017. Multi-scale multi-task fcn for semantic page segmentation and table detection. In Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). IEEE, 254\u2013261."},{"key":"e_1_3_1_42_2","article-title":"Mask R-CNN","author":"He Kaiming","year":"2017","unstructured":"Kaiming He, Georgia Gkioxari, Piotr Dollar, and Ross Girshick. 2017. Mask R-CNN. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV).","journal-title":"In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_1_43_2","doi-asserted-by":"crossref","unstructured":"Jonathan Herzig Pawe\u0142 Krzysztof Nowak Thomas M\u00fcller Francesco Piccinno and Julian Martin Eisenschlos. 2020. TaPas: Weakly supervised table parsing via pre-training. arXiv:2004.02349. Retrieved from https:\/\/arxiv.org\/abs\/2004.02349","DOI":"10.18653\/v1\/2020.acl-main.398"},{"key":"e_1_3_1_44_2","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1109\/ICDARW.2019.40098","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW)","author":"Hole\u010dek Martin","year":"2019","unstructured":"Martin Hole\u010dek, Anton\u00edn Hoskovec, Petr Baudi\u0161, and Pavel Klinger. 2019. Table understanding in structured documents. In Proceedings of the 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW). IEEE, 158\u2013164."},{"issue":"3","key":"e_1_3_1_45_2","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1007\/s100320200074","article-title":"Evaluating the performance of table processing algorithms","volume":"4","author":"Hu Jianying","year":"2002","unstructured":"Jianying Hu, Ramanujan S Kashi, Daniel Lopresti, and Gordon T Wilfong. 2002. Evaluating the performance of table processing algorithms. International Journal on Document Analysis and Recognition 4, 3 (2002), 140\u2013153.","journal-title":"International Journal on Document Analysis and Recognition"},{"key":"e_1_3_1_46_2","unstructured":"Yuan-Ting Hu Jia-Bin Huang and Alexander G. Schwing. 2017. MaskRNN: Instance level video object segmentation. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017 December 4-9 2017 Long Beach CA USA 325\u2013334. Retrieved from https:\/\/proceedings.neurips.cc\/paper\/2017\/hash\/6c9882bbac1c7093bd25041881277658-Abstract.html"},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","unstructured":"Zilong Hu Jinshan Tang Ziming Wang Kai Zhang Ling Zhang and Qingling Sun. 2018. Deep learning for image-based cancer detection and diagnosis- A survey. Pattern Recognit. 83 (2018) 134\u2013149. DOI:10.1016\/J.PATCOG.2018.05.014","DOI":"10.1016\/J.PATCOG.2018.05.014"},{"key":"e_1_3_1_48_2","doi-asserted-by":"crossref","first-page":"813","DOI":"10.1109\/ICDAR.2019.00135","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Huang Yilun","year":"2019","unstructured":"Yilun Huang, Qinqin Yan, Yibo Li, Yifan Chen, Xiong Wang, Liangcai Gao, and Zhi Tang. 2019. A YOLO-based table detection method. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 813\u2013818."},{"key":"e_1_3_1_49_2","doi-asserted-by":"crossref","first-page":"765","DOI":"10.1109\/ICDAR.1993.395625","volume-title":"Proceedings of the 2nd International Conference on Document Analysis and Recognition (ICDAR\u201993)","author":"Itonori Katsuhiko","year":"1993","unstructured":"Katsuhiko Itonori. 1993. Table structure recognition based on textblock arrangement and ruled line position. In Proceedings of the 2nd International Conference on Document Analysis and Recognition (ICDAR\u201993). IEEE, 765\u2013768."},{"key":"e_1_3_1_50_2","first-page":"1","volume-title":"Proceedings of the 7th International Conference on Information and Automation for Sustainability","author":"Jahan MAC Akmal","year":"2014","unstructured":"MAC Akmal Jahan and Roshan G Ragel. 2014. Locating tables in scanned documents for reconstructing and republishing. In Proceedings of the 7th International Conference on Information and Automation for Sustainability. IEEE, 1\u20136."},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","unstructured":"Arushi Jain Shubham Paliwal Monika Sharma and Lovekesh Vig. 2021. TSR-DSAW: Table structure recognition via deep spatial association of words. In 29th European Symposium on Artificial Neural Networks Computational Intelligence and Machine Learning ESANN 2021 Online event (Bruges Belgium) October 6-8 2021. DOI:10.14428\/ESANN\/2021.ES2021-109","DOI":"10.14428\/ESANN\/2021.ES2021-109"},{"key":"e_1_3_1_52_2","first-page":"844","volume-title":"Proceedings of the 6th International Conference on Document Analysis and Recognition","author":"Jain K","year":"2001","unstructured":"K Jain, Anoop M Namboodiri, and Jayashree Subrahmonia. 2001. Structure in on-line documents. In Proceedings of the 6th International Conference on Document Analysis and Recognition. IEEE, 844\u2013848."},{"key":"e_1_3_1_53_2","doi-asserted-by":"publisher","unstructured":"Ertugrul Kara Mark Traquair Murat Simsek Burak Kantarci and Shahzad Khan. 2020. Holistic design for deep learning-based discovery of tabular structures in datasheet images. Eng. Appl. Artif. Intell. 90 (2020) 103551. DOI:10.1016\/J.ENGAPPAI.2020.103551","DOI":"10.1016\/J.ENGAPPAI.2020.103551"},{"key":"e_1_3_1_54_2","doi-asserted-by":"crossref","first-page":"1185","DOI":"10.1109\/ICDAR.2013.240","volume-title":"Proceedings of the 2013 12th International Conference on Document Analysis and Recognition","author":"Kasar Thotreingam","year":"2013","unstructured":"Thotreingam Kasar, Philippine Barlas, Sebastien Adam, Cl\u00e9ment Chatelain, and Thierry Paquet. 2013. Learning to detect tables in scanned document images using line information. In Proceedings of the 2013 12th International Conference on Document Analysis and Recognition. IEEE, 1185\u20131189."},{"key":"e_1_3_1_55_2","doi-asserted-by":"crossref","unstructured":"Mahmoud SalahEldin Kasem Mohamed Hamada and Islam Taj-Eddin. 2024. Customer profiling segmentation and sales prediction using AI in direct marketing. Neural Computing and Applications 36 9 (2024) 4995\u20135005.","DOI":"10.1007\/s00521-023-09339-6"},{"key":"e_1_3_1_56_2","unstructured":"Mahmoud SalahEldin Kasem Mohamed Mahmoud and Hyun-Soo Kang. 2023. Advancements and challenges in Arabic optical character recognition: A comprehensive survey. arXiv:2312.11812. Retrieved from https:\/\/arxiv.org\/abs\/2312.11812"},{"key":"e_1_3_1_57_2","first-page":"292","volume-title":"Proceedings of the International Conference on Image Analysis and Processing","author":"Kavasidis Isaak","year":"2019","unstructured":"Isaak Kavasidis, Carmelo Pino, Simone Palazzo, Francesco Rundo, Daniela Giordano, P. Messina, and Concetto Spampinato. 2019. A saliency-based convolutional neural network for table and chart detection in digitized documents. In Proceedings of the International Conference on Image Analysis and Processing. Springer, 292\u2013302."},{"key":"e_1_3_1_58_2","doi-asserted-by":"crossref","first-page":"1366","DOI":"10.1109\/ICDAR.2019.00220","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Khan Saqib Ali","year":"2019","unstructured":"Saqib Ali Khan, Syed Muhammad Daniyal Khalid, Muhammad Ali Shahzad, and Faisal Shafait. 2019. Table structure extraction with bi-directional gated recurrent unit networks. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1366\u20131371."},{"issue":"1","key":"e_1_3_1_59_2","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1177\/0165551514551903","article-title":"On methods and tools of table detection, extraction and annotation in PDF documents","volume":"41","author":"Khusro Shah","year":"2015","unstructured":"Shah Khusro, Asima Latif, and Irfan Ullah. 2015. On methods and tools of table detection, extraction and annotation in PDF documents. Journal of Information Science 41, 1 (2015), 41\u201357.","journal-title":"Journal of Information Science"},{"key":"e_1_3_1_60_2","first-page":"255","volume-title":"Proceedings of the International Workshop on Document Analysis Systems","author":"Kieninger Thomas","year":"1998","unstructured":"Thomas Kieninger and Andreas Dengel. 1998. The t-recs table recognition and analysis system. In Proceedings of the International Workshop on Document Analysis Systems. Springer, 255\u2013270."},{"issue":"5","key":"e_1_3_1_61_2","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1016\/j.csi.2007.08.006","article-title":"Extracting logical structures from HTML tables","volume":"30","author":"Kim Yeon-Seok","year":"2008","unstructured":"Yeon-Seok Kim and Kyong-Ho Lee. 2008. Extracting logical structures from HTML tables. Computer Standards and Interfaces 30, 5 (2008), 296\u2013308.","journal-title":"Computer Standards and Interfaces"},{"issue":"11","key":"e_1_3_1_62_2","first-page":"7","article-title":"A comparison of two unsupervised table recognition methods from digital scientific articles","volume":"20","author":"Klampfl Stefan","year":"2014","unstructured":"Stefan Klampfl, Kris Jack, and Roman Kern. 2014. A comparison of two unsupervised table recognition methods from digital scientific articles. D-Lib Magazine 20, 11 (2014), 7.","journal-title":"D-Lib Magazine"},{"key":"e_1_3_1_63_2","first-page":"139","volume-title":"Proceedings of the 2018 13th IAPR International Workshop on Document Analysis Systems (DAS)","author":"Koci Elvis","year":"2018","unstructured":"Elvis Koci, Maik Thiele, Wolfgang Lehner, and Oscar Romero. 2018. Table recognition in spreadsheets via a graph representation. In Proceedings of the 2018 13th IAPR International Workshop on Document Analysis Systems (DAS). IEEE, 139\u2013144."},{"key":"e_1_3_1_64_2","doi-asserted-by":"crossref","first-page":"1280","DOI":"10.1109\/ICDAR.2019.00207","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Koci Elvis","year":"2019","unstructured":"Elvis Koci, Maik Thiele, Josephine Rehak, Oscar Romero, and Wolfgang Lehner. 2019. DECO: A dataset of annotated spreadsheets for layout and table recognition. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1280\u20131285."},{"key":"e_1_3_1_65_2","doi-asserted-by":"crossref","first-page":"1274","DOI":"10.1109\/ICDAR.2019.00206","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Koci Elvis","year":"2019","unstructured":"Elvis Koci, Maik Thiele, Oscar Romero, and Wolfgang Lehner. 2019. A genetic-based search for adaptive table recognition in spreadsheets. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1274\u20131279."},{"key":"e_1_3_1_66_2","unstructured":"Tarun Kumar and Himanshu Sharad Bhatt. 2022. Evaluating table structure recognition: A new perspective. arXiv:2208.00385. Retrieved from https:\/\/arxiv.org\/abs\/2208.00385"},{"key":"e_1_3_1_67_2","doi-asserted-by":"crossref","unstructured":"Yann LeCun Yoshua Bengio and Geoffrey Hinton. 2015. Deep learning. Nature 521 7553 (2015) 436\u2013444.","DOI":"10.1038\/nature14539"},{"key":"e_1_3_1_68_2","first-page":"2256","volume-title":"Proceedings of the 2017 IEEE International Conference on Big Data (Big Data)","author":"Lee Benjamin Charles Germain","year":"2017","unstructured":"Benjamin Charles Germain Lee. 2017. Line detection in binary document scans: A case study with the international tracing service archives. In Proceedings of the 2017 IEEE International Conference on Big Data (Big Data). IEEE, 2256\u20132261."},{"key":"e_1_3_1_69_2","first-page":"587","volume-title":"Proceedings of the International Conference on Database Systems for Advanced Applications","author":"Li Huichao","year":"2022","unstructured":"Huichao Li, Lingze Zeng, Weiyu Zhang, Jianing Zhang, Ju Fan, and Meihui Zhang. 2022. A two-phase approach for recognizing tables with complex structures. In Proceedings of the International Conference on Database Systems for Advanced Applications. Springer, 587\u2013595."},{"key":"e_1_3_1_70_2","doi-asserted-by":"publisher","unstructured":"Junlong Li Yiheng Xu Tengchao Lv Lei Cui Cha Zhang and Furu Wei. 2022. DiT: Self-supervised pre-training for document image transformer. In MM\u201922: The 30th ACM International Conference on Multimedia Lisboa Portugal October 10-14 2022 ACM 3530\u20133539. DOI:10.1145\/3503161.3547911","DOI":"10.1145\/3503161.3547911"},{"key":"e_1_3_1_71_2","first-page":"1918","volume-title":"Proceedings of the 12th Language Resources and Evaluation Conference","author":"Li Minghao","year":"2020","unstructured":"Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou, and Zhoujun Li. 2020. Tablebank: Table benchmark for image-based table detection and recognition. In Proceedings of the 12th Language Resources and Evaluation Conference. 1918\u20131925."},{"key":"e_1_3_1_72_2","first-page":"951","volume-title":"Proceedings of the 2019 Chinese Automation Congress (CAC)","author":"Li Shun","year":"2019","unstructured":"Shun Li, WeiDong Liu, and GongBing Xiao. 2019. Detection of srew nut images based on deep transfer learning network. In Proceedings of the 2019 Chinese Automation Congress (CAC). IEEE, 951\u2013955."},{"key":"e_1_3_1_73_2","first-page":"763","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Li Yibo","year":"2019","unstructured":"Yibo Li, Liangcai Gao, Zhi Tang, Qinqin Yan, and Yilun Huang. 2019. A GAN-based feature generator for table detection. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 763\u2013768."},{"key":"e_1_3_1_74_2","doi-asserted-by":"publisher","unstructured":"Geert Litjens Thijs Kooi Babak Ehteshami Bejnordi Arnaud Arindra Adiyoso Setio Francesco Ciompi Mohsen Ghafoorian Jeroen A. W. M. van der Laak Bram van Ginneken and Clara I. S\u00e1nchez. 2017. A survey on deep learning in medical image analysis. Medical Image Anal. 42 (2017) 60\u201388. DOI:10.1016\/J.MEDIA.2017.07.005","DOI":"10.1016\/J.MEDIA.2017.07.005"},{"issue":"2","key":"e_1_3_1_75_2","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1007\/s11263-019-01247-4","article-title":"Deep learning for generic object detection: A survey","volume":"128","author":"Liu Li","year":"2020","unstructured":"Li Liu, Wanli Ouyang, Xiaogang Wang, Paul Fieguth, Jie Chen, Xinwang Liu, and Matti Pietik\u00e4inen. 2020. Deep learning for generic object detection: A survey. International Journal of Computer Vision 128, 2 (2020), 261\u2013318.","journal-title":"International Journal of Computer Vision"},{"key":"e_1_3_1_76_2","first-page":"3741","volume-title":"Proceedings of the 29th International Conference on Computational Linguistics","author":"Liu Ruixue","year":"2022","unstructured":"Ruixue Liu, Shaozu Yuan, Aijun Dai, Lei Shen, Tiangang Zhu, Meng Chen, and Xiaodong He. 2022. Few-shot table understanding: A benchmark dataset and pre-training baseline. In Proceedings of the 29th International Conference on Computational Linguistics. 3741\u20133752."},{"key":"e_1_3_1_77_2","first-page":"944","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Long Rujiao","year":"2021","unstructured":"Rujiao Long, Wen Wang, Nan Xue, Feiyu Gao, Zhibo Yang, Yongpan Wang, and Gui-Song Xia. 2021. Parsing table structures in the wild. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 944\u2013952."},{"key":"e_1_3_1_78_2","unstructured":"Nam Tuan Ly Atsuhiro Takasu Phuc Nguyen and Hideaki Takeda. 2023. Rethinking image-based table recognition using weakly supervised methods. arXiv:2303.07641. Retrieved from https:\/\/arxiv.org\/abs\/2303.07641"},{"key":"e_1_3_1_79_2","doi-asserted-by":"publisher","unstructured":"Chixiang Ma Weihong Lin Lei Sun and Qiang Huo. 2023. Robust table detection and structure recognition from heterogeneous document images. Pattern Recognit. 133 (2023) 109006. DOI:10.1016\/J.PATCOG.2022.109006","DOI":"10.1016\/J.PATCOG.2022.109006"},{"issue":"16","key":"e_1_3_1_80_2","doi-asserted-by":"crossref","first-page":"7094","DOI":"10.3390\/s23167094","article-title":"GANMasker: A two-stage generative adversarial network for high-quality face mask removal","volume":"23","author":"Mahmoud Mohamed","year":"2023","unstructured":"Mohamed Mahmoud and Hyun-Soo Kang. 2023. GANMasker: A two-stage generative adversarial network for high-quality face mask removal. Sensors 23, 16 (2023), 7094.","journal-title":"Sensors"},{"key":"e_1_3_1_81_2","first-page":"1","volume-title":"Proceedings of the 2022 International Telecommunications Conference (ITC-Egypt)","author":"Mahmoud Mohamed","year":"2022","unstructured":"Mohamed Mahmoud, Mahmoud Kasem, Abdelrahman Abdallah, and Hyun Soo Kang. 2022. AE-LSTM: Autoencoder with LSTM-based intrusion detection in IoT. In Proceedings of the 2022 International Telecommunications Conference (ITC-Egypt). IEEE, 1\u20136."},{"issue":"3","key":"e_1_3_1_82_2","doi-asserted-by":"crossref","first-page":"1096","DOI":"10.1016\/j.patcog.2013.08.009","article-title":"KHATT: An open Arabic offline handwritten text database","volume":"47","author":"Mahmoud Sabri A","year":"2014","unstructured":"Sabri A Mahmoud, Irfan Ahmad, Wasfi G Al-Khatib, Mohammad Alshayeb, Mohammad Tanvir Parvez, Volker M\u00e4rgner, and Gernot A Fink. 2014. KHATT: An open Arabic offline handwritten text database. Pattern Recognition 47, 3 (2014), 1096\u20131112.","journal-title":"Pattern Recognition"},{"key":"e_1_3_1_83_2","doi-asserted-by":"publisher","unstructured":"Song Mao Azriel Rosenfeld and Tapas Kanungo. 2003. Document structure analysis algorithms: A literature survey. In Document Recognition and Retrieval X Santa Clara California USA January 22-23 2003 Proceedings (SPIE Proceedings) SPIE 197\u2013207. DOI:10.1117\/12.476326","DOI":"10.1117\/12.476326"},{"key":"e_1_3_1_84_2","first-page":"1","volume-title":"Proceedings of the 2018 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","author":"Masita Katleho L","year":"2018","unstructured":"Katleho L Masita, Ali N Hasan, and Satyakama Paul. 2018. Pedestrian detection using R-CNN object detector. In Proceedings of the 2018 IEEE Latin American Conference on Computational Intelligence (LA-CCI). IEEE, 1\u20136."},{"key":"e_1_3_1_85_2","doi-asserted-by":"crossref","first-page":"923","DOI":"10.1109\/GlobalSIP.2017.8309095","volume-title":"Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","author":"Minaee Shervin","year":"2017","unstructured":"Shervin Minaee and Zhu Liu. 2017. Automatic question-answering using a deep similarity neural network. In Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP). IEEE, 923\u2013927."},{"key":"e_1_3_1_86_2","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1007\/978-3-030-57058-3_16","volume-title":"Proceedings of the International Workshop on Document Analysis Systems","author":"Mondal Ajoy","year":"2020","unstructured":"Ajoy Mondal, Peter Lipps, and CV Jawahar. 2020. IIIT-AR-13K: A new dataset for graphical object detection in documents. In Proceedings of the International Workshop on Document Analysis Systems. Springer, 216\u2013230."},{"key":"e_1_3_1_87_2","first-page":"27","volume-title":"Proceedings of the VISIGRAPP (4: VISAPP)","author":"Namysl Marcin","year":"2022","unstructured":"Marcin Namysl, Alexander M Esser, Sven Behnke, and Joachim K\u00f6hler. 2022. Flexible table recognition and semantic interpretation system. In Proceedings of the VISIGRAPP (4: VISAPP). 27\u201337."},{"issue":"3","key":"e_1_3_1_88_2","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1007\/s42979-022-01659-z","article-title":"Flexible hybrid table recognition and semantic interpretation system","volume":"4","author":"Namys\u0142 Marcin","year":"2023","unstructured":"Marcin Namys\u0142, Alexander M Esser, Sven Behnke, and Joachim K\u00f6hler. 2023. Flexible hybrid table recognition and semantic interpretation system. SN Computer Science 4, 3 (2023), 246.","journal-title":"SN Computer Science"},{"key":"e_1_3_1_89_2","first-page":"4614","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Nassar Ahmed","year":"2022","unstructured":"Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, and Peter Staar. 2022. TableFormer: Table structure understanding with transformers. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4614\u20134623."},{"issue":"1","key":"e_1_3_1_90_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s10032-021-00390-4","article-title":"TableSegNet: A fully convolutional network for table detection and segmentation in document images","volume":"25","author":"Nguyen Duc-Dung","year":"2022","unstructured":"Duc-Dung Nguyen. 2022. TableSegNet: A fully convolutional network for table detection and segmentation in document images. International Journal on Document Analysis and Recognition 25, 1 (2022), 1\u201314.","journal-title":"International Journal on Document Analysis and Recognition"},{"key":"e_1_3_1_91_2","volume-title":"Algorithmic Extraction of Data in Tables in PDF Documents","author":"Nurminen Anssi","year":"2013","unstructured":"Anssi Nurminen. 2013. Algorithmic Extraction of Data in Tables in PDF Documents. Master\u2019s thesis."},{"issue":"21","key":"e_1_3_1_92_2","doi-asserted-by":"crossref","first-page":"33075","DOI":"10.1007\/s11042-021-11399-6","article-title":"Handwritten Kazakh and Russian (HKR) database for text recognition","volume":"80","author":"Nurseitov Daniyar","year":"2021","unstructured":"Daniyar Nurseitov, Kairat Bostanbekov, Daniyar Kurmankhojayev, Anel Alimova, Abdelrahman Abdallah, and Rassul Tolegenov. 2021. Handwritten Kazakh and Russian (HKR) database for text recognition. Multimedia Tools and Applications 80, 21 (2021), 33075\u201333097.","journal-title":"Multimedia Tools and Applications"},{"issue":"11","key":"e_1_3_1_93_2","doi-asserted-by":"crossref","first-page":"1162","DOI":"10.1109\/34.244677","article-title":"The document spectrum for page layout analysis","volume":"15","author":"O\u2019Gorman Lawrence","year":"1993","unstructured":"Lawrence O\u2019Gorman. 1993. The document spectrum for page layout analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 15, 11 (1993), 1162\u20131173.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_1_94_2","first-page":"906","volume-title":"Proceedings of the 2009 10th International Conference on Document Analysis and Recognition","author":"Oro Ermelinda","year":"2009","unstructured":"Ermelinda Oro and Massimo Ruffolo. 2009. TREX: An approach for recognizing and extracting tables from PDF documents. In Proceedings of the 2009 10th International Conference on Document Analysis and Recognition. IEEE, 906\u2013910."},{"key":"e_1_3_1_95_2","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1109\/ICDAR.2019.00029","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Paliwal Shubham Singh","year":"2019","unstructured":"Shubham Singh Paliwal, D Vishwanath, Rohit Rahul, Monika Sharma, and Lovekesh Vig. 2019. Tablenet: Deep learning model for end-to-end table detection and tabular data extraction from scanned document images. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 128\u2013133."},{"key":"e_1_3_1_96_2","first-page":"311","volume-title":"Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics","author":"Papineni Kishore","year":"2002","unstructured":"Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. Bleu: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. 311\u2013318."},{"key":"e_1_3_1_97_2","unstructured":"Ihsin Tsaiyun Phillips. 1996. User\u2019s reference manual for the UW english\/technical document image database III. UW-III English\/Technical Document Image database Manual (1996)."},{"key":"e_1_3_1_98_2","first-page":"572","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops","author":"Prasad Devashish","year":"2020","unstructured":"Devashish Prasad, Ayan Gadpal, Kshitij Kapadni, Manish Visave, and Kavita Sultanpure. 2020. CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops. 572\u2013573."},{"key":"e_1_3_1_99_2","doi-asserted-by":"crossref","unstructured":"P Pyreddy and WB Croft. 1997. Tinti: A system for retrieval in text tables title2.","DOI":"10.1145\/263690.263816"},{"key":"e_1_3_1_100_2","doi-asserted-by":"publisher","unstructured":"Shah Rukh Qasim Hassan Mahmood and Faisal Shafait. 2019. Rethinking table recognition using graph neural networks. In 2019 International Conference on Document Analysis and Recognition ICDAR 2019 Sydney Australia September 20-25 2019 IEEE 142\u2013147. DOI:10.1109\/ICDAR.2019.00031","DOI":"10.1109\/ICDAR.2019.00031"},{"key":"e_1_3_1_101_2","first-page":"99","volume-title":"Proceedings of the International Conference on Document Analysis and Recognition","author":"Qiao Liang","year":"2021","unstructured":"Liang Qiao, Zaisheng Li, Zhanzhan Cheng, Peng Zhang, Shiliang Pu, Yi Niu, Wenqi Ren, Wenming Tan, and Fei Wu. 2021. Lgpma: Complicated table structure recognition with local and global pyramid mask alignment. In Proceedings of the International Conference on Document Analysis and Recognition. Springer, 99\u2013114."},{"key":"e_1_3_1_102_2","first-page":"70","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Raja Sachin","year":"2020","unstructured":"Sachin Raja, Ajoy Mondal, and CV Jawahar. 2020. Table structure recognition using top-down and bottom-up cues. In Proceedings of the European Conference on Computer Vision. Springer, 70\u201386."},{"key":"e_1_3_1_103_2","first-page":"2299","volume-title":"Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision","author":"Raja Sachin","year":"2022","unstructured":"Sachin Raja, Ajoy Mondal, and CV Jawahar. 2022. Visual understanding of complex table structures from document images. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision. 2299\u20132308."},{"key":"e_1_3_1_104_2","unstructured":"Susie Xi Rao Johannes Rausch Peter H. Egger and Ce Zhang. 2022. TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets. In Proceedings of the Workshop on Scientific Document Understanding Co-Located with 36th AAAI Conference on Artificial Inteligence SDU@AAAI 2022 Virtual Event March 1 2022 (CEUR Workshop Proceedings) CEUR-WS.org. Retrieved from https:\/\/ceur-ws.org\/Vol-3164\/paper8.pdf"},{"key":"e_1_3_1_105_2","first-page":"777","volume-title":"Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","author":"Rashid Sheikh Faisal","year":"2017","unstructured":"Sheikh Faisal Rashid, Abdullah Akmal, Muhammad Adnan, Ali Adnan Aslam, and Andreas Dengel. 2017. Table recognition in heterogeneous documents using machine learning. In Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). IEEE, 777\u2013782."},{"key":"e_1_3_1_106_2","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1109\/ICDARW.2019.40097","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW)","author":"Reza Mohammad Mohsin","year":"2019","unstructured":"Mohammad Mohsin Reza, Syed Saqib Bukhari, Martin Jenckel, and Andreas Dengel. 2019. Table localization and segmentation using GAN and CNN. In Proceedings of the 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW). IEEE, 152\u2013157."},{"key":"e_1_3_1_107_2","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1109\/ICDAR.2019.00028","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Riba Pau","year":"2019","unstructured":"Pau Riba, Anjan Dutta, Lutz Goldmann, Alicia Forn\u00e9s, Oriol Ramos, and Josep Llad\u00f3s. 2019. Table detection in invoice documents by graph neural networks. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 122\u2013127."},{"key":"e_1_3_1_108_2","doi-asserted-by":"publisher","unstructured":"Pau Riba Lutz Goldmann Oriol Ramos Terrades Diede Rusticus Alicia Forn\u00e9s and Josep Llad\u00f3s. 2022. Table detection in business document images by message passing networks. Pattern Recognit. 127 (2022) 108641. DOI:10.1016\/J.PATCOG.2022.108641","DOI":"10.1016\/J.PATCOG.2022.108641"},{"key":"e_1_3_1_109_2","first-page":"339","volume-title":"Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10-15, 2021","author":"Samari Arash","year":"2021","unstructured":"Arash Samari, Andrew Piper, Alison Hedley, and Mohamed Cheriet. 2021. Weakly supervised bounding box extraction for unlabeled data in table detection. In Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10-15, 2021. Springer, 339\u2013352."},{"key":"e_1_3_1_110_2","first-page":"1162","volume-title":"Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","author":"Schreiber Sebastian","year":"2017","unstructured":"Sebastian Schreiber, Stefan Agne, Ivo Wolf, Andreas Dengel, and Sheraz Ahmed. 2017. Deepdesrt: Deep learning for detection and structure recognition of tables in document images. In Proceedings of the 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1162\u20131167."},{"issue":"1","key":"e_1_3_1_111_2","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1007\/s10032-014-0226-7","article-title":"Junction-based table detection in camera-captured document images","volume":"18","author":"Seo Wonkyo","year":"2015","unstructured":"Wonkyo Seo, Hyung Il Koo, and Nam Ik Cho. 2015. Junction-based table detection in camera-captured document images. International Journal on Document Analysis and Recognition 18, 1 (2015), 47\u201357.","journal-title":"International Journal on Document Analysis and Recognition"},{"key":"e_1_3_1_112_2","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1145\/1815330.1815339","volume-title":"Proceedings of the 9th IAPR International Workshop on Document Analysis Systems","author":"Shafait Faisal","year":"2010","unstructured":"Faisal Shafait and Ray Smith. 2010. Table detection in heterogeneous documents. In Proceedings of the 9th IAPR International Workshop on Document Analysis Systems. 65\u201372."},{"key":"e_1_3_1_113_2","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1145\/1815330.1815345","volume-title":"Proceedings of the 9th IAPR International Workshop on Document Analysis Systems","author":"Shahab Asif","year":"2010","unstructured":"Asif Shahab, Faisal Shafait, Thomas Kieninger, and Andreas Dengel. 2010. An open approach towards the benchmarking of table structure recognition systems. In Proceedings of the 9th IAPR International Workshop on Document Analysis Systems. 113\u2013120."},{"key":"e_1_3_1_114_2","first-page":"51","volume-title":"Proceedings of the International Conference on Document Analysis and Recognition","author":"Shehzadi Tahira","year":"2023","unstructured":"Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Marcus Liwicki, and Muhammad Zeshan Afzal. 2023. Towards end-to-end semi-supervised table detection with deformable transformer. In Proceedings of the International Conference on Document Analysis and Recognition. Springer, 51\u201376."},{"key":"e_1_3_1_115_2","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1109\/ICTC55111.2022.9778621","volume-title":"Proceedings of the 2022 3rd Information Communication Technologies Conference (ICTC)","author":"Shen Xinyi","year":"2022","unstructured":"Xinyi Shen, Lingjun Kong, Yunchao Bao, Yaowei Zhou, and Weiguang Liu. 2022. RCANet: A rows and columns aggregated network for table structure recognition. In Proceedings of the 2022 3rd Information Communication Technologies Conference (ICTC). IEEE, 112\u2013116."},{"key":"e_1_3_1_116_2","doi-asserted-by":"crossref","first-page":"1403","DOI":"10.1109\/ICDAR.2019.00226","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Siddiqui Shoaib Ahmed","year":"2019","unstructured":"Shoaib Ahmed Siddiqui, Imran Ali Fateh, Syed Tahseen Raza Rizvi, Andreas Dengel, and Sheraz Ahmed. 2019. DeepTabStR: Deep learning based table structure recognition. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1403\u20131409."},{"key":"e_1_3_1_117_2","doi-asserted-by":"crossref","first-page":"1397","DOI":"10.1109\/ICDAR.2019.00225","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Siddiqui Shoaib Ahmed","year":"2019","unstructured":"Shoaib Ahmed Siddiqui, Pervaiz Iqbal Khan, Andreas Dengel, and Sheraz Ahmed. 2019. Rethinking semantic segmentation for table structure recognition in documents. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1397\u20131402."},{"key":"e_1_3_1_118_2","doi-asserted-by":"publisher","unstructured":"Shoaib Ahmed Siddiqui Muhammad Imran Malik Stefan Agne Andreas Dengel and Sheraz Ahmed. 2018. DeCNT: Deep deformable CNN for table detection. IEEE Access 6 (2018) 74151\u201374161. DOI:10.1109\/ACCESS.2018.2880211","DOI":"10.1109\/ACCESS.2018.2880211"},{"key":"e_1_3_1_119_2","first-page":"1","volume-title":"Proceedings of the 2015 Annual Conference of the North American Fuzzy Information Processing Society (NAFIPS) Held Jointly with 2015 5th World Conference on Soft Computing (WConSC)","author":"Sidorov Grigori","year":"2015","unstructured":"Grigori Sidorov, Helena G\u00f3mez-Adorno, Ilia Markov, David Pinto, and Nahun Loya. 2015. Computing text similarity using tree edit distance. In Proceedings of the 2015 Annual Conference of the North American Fuzzy Information Processing Society (NAFIPS) Held Jointly with 2015 5th World Conference on Soft Computing (WConSC). 1\u20134. DOI:10.1109\/NAFIPS-WConSC.2015.7284129"},{"key":"e_1_3_1_120_2","doi-asserted-by":"crossref","first-page":"223","DOI":"10.1145\/3197026.3197040","volume-title":"Proceedings of the 18th ACM\/IEEE on Joint Conference on Digital Libraries","author":"Siegel Noah","year":"2018","unstructured":"Noah Siegel, Nicholas Lourie, Russell Power, and Waleed Ammar. 2018. Extracting scientific figures with distantly supervised neural networks. In Proceedings of the 18th ACM\/IEEE on Joint Conference on Digital Libraries. 223\u2013232."},{"key":"e_1_3_1_121_2","first-page":"535","volume-title":"Proceedings of the International Conference on Document Analysis and Recognition","author":"Smock Brandon","year":"2023","unstructured":"Brandon Smock, Rohith Pesala, and Robin Abraham. 2023. GriTS: Grid table similarity metric for table structure recognition. In Proceedings of the International Conference on Document Analysis and Recognition. Springer, 535\u2013549."},{"key":"e_1_3_1_122_2","doi-asserted-by":"crossref","unstructured":"Brandon Smock Rohith Pesala and Robin Abraham. 2022. PubTables-1M: Towards comprehensive table extraction from unstructured documents. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 4634\u20134642.","DOI":"10.1109\/CVPR52688.2022.00459"},{"key":"e_1_3_1_123_2","first-page":"1314","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Sun Ningning","year":"2019","unstructured":"Ningning Sun, Yuanping Zhu, and Xiaoming Hu. 2019. Faster R-CNN based table detection combining corner locating. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 1314\u20131319."},{"key":"e_1_3_1_124_2","volume-title":"Computer Vision: Algorithms and Applications","author":"Szeliski Richard","year":"2010","unstructured":"Richard Szeliski. 2010. Computer Vision: Algorithms and Applications. Springer Science and Business Media."},{"key":"e_1_3_1_125_2","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1109\/ICDAR.2019.00027","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Tensmeyer Chris","year":"2019","unstructured":"Chris Tensmeyer, Vlad I Morariu, Brian Price, Scott Cohen, and Tony Martinez. 2019. Deep splitting and merging for table structure decomposition. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 114\u2013121."},{"key":"e_1_3_1_126_2","doi-asserted-by":"publisher","unstructured":"Nazgul Toiganbayeva Mahmoud SalahEldin Kasem Galymzhan Abdimanap Kairat Bostanbekov Abdelrahman Abdallah Anel Alimova and Daniyar B. Nurseitov. 2022. KOHTD: Kazakh offline handwritten text dataset. Signal Process. Image Commun. 108 (2022) 116827. DOI:10.1016\/J.IMAGE.2022.116827","DOI":"10.1016\/J.IMAGE.2022.116827"},{"key":"e_1_3_1_127_2","first-page":"1","volume-title":"Proceedings of the 2019 IEEE Symposium on Computers and Communications (ISCC)","author":"Traquair Mark","year":"2019","unstructured":"Mark Traquair, Ertugrul Kara, Burak Kantarci, and Shahzad Khan. 2019. Deep learning for the detection of tabular information from electronic component datasheets. In Proceedings of the 2019 IEEE Symposium on Computers and Communications (ISCC). IEEE, 1\u20136."},{"key":"e_1_3_1_128_2","unstructured":"Scott Tupaj Zhongwen Shi C. Hwa Chang and Hassan Alam. 1996. Extracting tabular information from text files. EECS Department Tufts University Medford USA 1 (1996)."},{"key":"e_1_3_1_129_2","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1145\/511446.511478","volume-title":"Proceedings of the 11th International Conference on World Wide Web","author":"Wang Yalin","year":"2002","unstructured":"Yalin Wang and Jianying Hu. 2002. A machine learning based approach for table detection on the web. In Proceedings of the 11th International Conference on World Wide Web. 242\u2013250."},{"key":"e_1_3_1_130_2","first-page":"528","volume-title":"Proceedings of the 6th International Conference on Document Analysis and Recognition","author":"Wangt Yalin","year":"2001","unstructured":"Yalin Wangt, Ihsin T Phillipst, and Robert Haralick. 2001. Automatic table ground truth generation and a background-analysis-based table structure extraction method. In Proceedings of the 6th International Conference on Document Analysis and Recognition. IEEE, 528\u2013532."},{"key":"e_1_3_1_131_2","doi-asserted-by":"publisher","unstructured":"Shengkai Wu Jinrong Yang Xinggang Wang and Xiaoping Li. 2022. IoU-Balanced loss functions for single-stage object detection. Pattern Recognit. Lett. 156 (2022) 96\u2013103. DOI:10.1016\/J.PATREC.2022.01.021","DOI":"10.1016\/J.PATREC.2022.01.021"},{"key":"e_1_3_1_132_2","unstructured":"Bin Xiao Murat Simsek Burak Kantarci and Ala Abu Alkheir. 2022. Table structure recognition with conditional attention. arXiv:2203.03819. Retrieved from https:\/\/arxiv.org\/abs\/2203.03819"},{"key":"e_1_3_1_133_2","doi-asserted-by":"crossref","unstructured":"Bin Xiao Murat Simsek Burak Kantarci and Ala Abu Alkheir. 2023. Revisiting table detection datasets for visually rich documents. arXiv:2305.04833. Retrieved from https:\/\/arxiv.org\/abs\/2305.04833","DOI":"10.1016\/j.knosys.2023.111080"},{"key":"e_1_3_1_134_2","doi-asserted-by":"publisher","unstructured":"Wen Xu Julian Jang-Jaccard Amardeep Singh Yuanyuan Wei and Fariza Sabrina. 2021. Improving performance of autoencoder-based network anomaly detection on NSL-KDD dataset. IEEE Access 9 (2021) 140136\u2013140146. DOI:10.1109\/ACCESS.2021.3116612","DOI":"10.1109\/ACCESS.2021.3116612"},{"key":"e_1_3_1_135_2","first-page":"749","volume-title":"Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR)","author":"Xue Wenyuan","year":"2019","unstructured":"Wenyuan Xue, Qingyong Li, and Dacheng Tao. 2019. ReS2TIM: Reconstruct syntactic structures from table images. In Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 749\u2013755."},{"issue":"1","key":"e_1_3_1_136_2","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1038\/s41597-023-01985-8","article-title":"A large-scale dataset for end-to-end table recognition in the wild","volume":"10","author":"Yang Fan","year":"2023","unstructured":"Fan Yang, Lei Hu, Xinwu Liu, Shuangping Huang, and Zhenghui Gu. 2023. A large-scale dataset for end-to-end table recognition in the wild. Scientific Data 10, 1 (2023), 110.","journal-title":"Scientific Data"},{"issue":"3","key":"e_1_3_1_137_2","doi-asserted-by":"crossref","first-page":"28","DOI":"10.3390\/a11030028","article-title":"Modified convolutional neural network based on dropout and the stochastic gradient descent optimizer","volume":"11","author":"Yang Jing","year":"2018","unstructured":"Jing Yang and Guanci Yang. 2018. Modified convolutional neural network based on dropout and the stochastic gradient descent optimizer. Algorithms 11, 3 (2018), 28.","journal-title":"Algorithms"},{"issue":"3","key":"e_1_3_1_138_2","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1109\/MCI.2018.2840738","article-title":"Recent trends in deep learning based natural language processing","volume":"13","author":"Young Tom","year":"2018","unstructured":"Tom Young, Devamanyu Hazarika, Soujanya Poria, and Erik Cambria. 2018. Recent trends in deep learning based natural language processing. IEEE Computational Intelligence Magazine 13, 3 (2018), 55\u201375.","journal-title":"IEEE Computational Intelligence Magazine"},{"key":"e_1_3_1_139_2","first-page":"4471","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Yu Jiahui","year":"2019","unstructured":"Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, and Thomas S Huang. 2019. Free-form image inpainting with gated convolution. In Proceedings of the IEEE\/CVF International Conference on Computer Vision. 4471\u20134480."},{"issue":"1","key":"e_1_3_1_140_2","first-page":"1","article-title":"A survey of table recognition","volume":"7","author":"Zanibbi Richard","year":"2004","unstructured":"Richard Zanibbi, Dorothea Blostein, and James R Cordy. 2004. A survey of table recognition. Document Analysis and Recognition 7, 1 (2004), 1\u201316.","journal-title":"Document Analysis and Recognition"},{"key":"e_1_3_1_141_2","doi-asserted-by":"publisher","unstructured":"Daqian Zhang Ruibin Mao Runting Guo Yang Jiang and Jing Zhu. 2023. YOLO-table: Disclosure document table detection with involution. Int. J. Document Anal. Recognit. 26 1 (2023) 1\u201314. DOI:10.1007\/S10032-022-00400-Z","DOI":"10.1007\/S10032-022-00400-Z"},{"issue":"7","key":"e_1_3_1_142_2","doi-asserted-by":"crossref","first-page":"1855","DOI":"10.1016\/j.patcog.2006.05.029","article-title":"Extraction and segmentation of tables from Chinese ink documents based on a matrix model","volume":"40","author":"Zhang Xi-wen","year":"2007","unstructured":"Xi-wen Zhang, Michael R Lyu, and Guo-zhong Dai. 2007. Extraction and segmentation of tables from Chinese ink documents based on a matrix model. Pattern Recognition 40, 7 (2007), 1855\u20131867.","journal-title":"Pattern Recognition"},{"issue":"5","key":"e_1_3_1_143_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3178115","article-title":"Deep learning for environmentally robust speech recognition: An overview of recent developments","volume":"9","author":"Zhang Zixing","year":"2018","unstructured":"Zixing Zhang, J\u00fcrgen Geiger, Jouni Pohjalainen, Amr El-Desoky Mousa, Wenyu Jin, and Bj\u00f6rn Schuller. 2018. Deep learning for environmentally robust speech recognition: An overview of recent developments. ACM Transactions on Intelligent Systems and Technology 9, 5 (2018), 1\u201328.","journal-title":"ACM Transactions on Intelligent Systems and Technology"},{"key":"e_1_3_1_144_2","doi-asserted-by":"publisher","unstructured":"Zhenrong Zhang Jianshu Zhang Jun Du and Fengren Wang. 2022. Split Embed and Merge: An accurate table structure recognizer. Pattern Recognit. 126 (2022) 108565. DOI:10.1016\/J.PATCOG.2022.108565","DOI":"10.1016\/J.PATCOG.2022.108565"},{"key":"e_1_3_1_145_2","article-title":"Global table extractor (GTE): A framework for joint table identification and cell structure recognition using visual context","author":"Zheng Xinyi","year":"2021","unstructured":"Xinyi Zheng, Doug Burdick, Lucian Popa, Peter Zhong, and Nancy Xin Ru Wang. 2021. Global table extractor (GTE): A framework for joint table identification and cell structure recognition using visual context. In Proceedings of the IEEE\/CVF Winter Conference for Applications in Computer Vision (WACV).","journal-title":"In Proceedings of the IEEE\/CVF Winter Conference for Applications in Computer Vision (WACV)."},{"key":"e_1_3_1_146_2","first-page":"697","volume-title":"Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision","author":"Zheng Xinyi","year":"2021","unstructured":"Xinyi Zheng, Douglas Burdick, Lucian Popa, Xu Zhong, and Nancy Xin Ru Wang. 2021. Global table extractor (GTE): A framework for joint table identification and cell structure recognition using visual context. In Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision. 697\u2013706."},{"key":"e_1_3_1_147_2","first-page":"564","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Zhong Xu","year":"2020","unstructured":"Xu Zhong, Elaheh ShafieiBavani, and Antonio Jimeno Yepes. 2020. Image-based table recognition: Data, model, and evaluation. In Proceedings of the European Conference on Computer Vision. Springer, 564\u2013580."},{"key":"e_1_3_1_148_2","first-page":"274","volume-title":"Proceedings of the 2020 15th IEEE International Conference on Signal Processing (ICSP)","author":"Zou Yajun","year":"2020","unstructured":"Yajun Zou and Jinwen Ma. 2020. A deep semantic segmentation model for image-based table structure recognition. In Proceedings of the 2020 15th IEEE International Conference on Signal Processing (ICSP). IEEE, 274\u2013280."},{"issue":"4","key":"e_1_3_1_149_2","doi-asserted-by":"crossref","first-page":"1765","DOI":"10.1007\/s11036-021-01759-9","article-title":"ClusTi: Clustering method for table structure recognition in scanned images","volume":"26","author":"Zucker Arthur","year":"2021","unstructured":"Arthur Zucker, Younes Belkada, Hanh Vu, and Van Nam Nguyen. 2021. ClusTi: Clustering method for table structure recognition in scanned images. Mobile Networks and Applications 26, 4 (2021), 1765\u20131776.","journal-title":"Mobile Networks and Applications"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3657281","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3657281","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T01:17:38Z","timestamp":1750295858000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3657281"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,3]]},"references-count":148,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2024,12,31]]}},"alternative-id":["10.1145\/3657281"],"URL":"https:\/\/doi.org\/10.1145\/3657281","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,10,3]]},"assertion":[{"value":"2022-12-13","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-04-02","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-10-03","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}