{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T13:36:57Z","timestamp":1753882617798,"version":"3.41.2"},"reference-count":23,"publisher":"World Scientific Pub Co Pte Ltd","issue":"02","funder":[{"name":"National Science and Technology Council, Taiwan","award":["NSTC 112-2221-E-032-041"],"award-info":[{"award-number":["NSTC 112-2221-E-032-041"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2024,2]]},"abstract":"<jats:p> This paper proposes an efficient method for rectifying distorted document images via deep learning, ultimately improving the legibility of graphics and text in documents. The framework comprises two interconnected U-Nets, working in tandem to predict a 3D coordinate map and a forward map for the input distorted document image, respectively. At the beginning of the process, a page mask is predicted and used as input to both U-Nets to help improve the performance of their tasks. In the last step, the predicted forward map is transformed into a corresponding backward map, which is utilized to rectify the distorted image. The experimental results not only reveal that the predicted page masks and 3D coordinate maps significantly enhance the accuracy of predicting forward maps for subsequent rectification but also demonstrate satisfactory results both globally and locally. 
<\/jats:p>","DOI":"10.1142\/s0218001423510230","type":"journal-article","created":{"date-parts":[[2024,2,18]],"date-time":"2024-02-18T04:46:48Z","timestamp":1708231608000},"source":"Crossref","is-referenced-by-count":1,"title":["Effective Document Image Rectification via a Deep Learning Framework"],"prefix":"10.1142","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-7874-1605","authenticated-orcid":false,"given":"Hsiau-Wen","family":"Lin","sequence":"first","affiliation":[{"name":"Department of Information Management, Chihlee University of Technology, Taipei, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1539-233X","authenticated-orcid":false,"given":"Hwei Jen","family":"Lin","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Information Engineering, Tamkang University, Taipei, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-7034-5095","authenticated-orcid":false,"given":"Yihjia","family":"Tsai","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Information Engineering, Tamkang University, Taipei, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-7123-5183","authenticated-orcid":false,"given":"Yoshimasa","family":"Tokuyama","sequence":"additional","affiliation":[{"name":"Department of Media and Image Technology, Faculty of Engineering, Tokyo Polytechnic University, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-9395-4150","authenticated-orcid":false,"given":"Chou-Wei","family":"Kong","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Information Engineering, Tamkang University, Taipei, 
Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2024,4,5]]},"reference":[{"key":"S0218001423510230BIB002","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2006.871082"},{"key":"S0218001423510230BIB003","first-page":"228","volume-title":"Proceedings Ninth IEEE International Conference on Computer Vision","volume":"1","author":"Cao H."},{"key":"S0218001423510230BIB004","first-page":"71","volume-title":"Proc. Seventh Int. Conf. Document Analysis and Recognition","author":"Cao H.","year":"2003"},{"key":"S0218001423510230BIB005","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00022"},{"key":"S0218001423510230BIB006","doi-asserted-by":"publisher","DOI":"10.1145\/3103010.3121030"},{"key":"S0218001423510230BIB008","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2013.88"},{"key":"S0218001423510230BIB009","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2017.146"},{"key":"S0218001423510230BIB010","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-15552-9_31"},{"key":"S0218001423510230BIB011","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2001.958227"},{"issue":"6","key":"S0218001423510230BIB012","first-page":"1","volume":"38","author":"Li X.","year":"2019","journal-title":"ACM Trans. Graphics"},{"first-page":"4700","volume-title":"2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Ma K.","key":"S0218001423510230BIB013"},{"key":"S0218001423510230BIB014","doi-asserted-by":"publisher","DOI":"10.1007\/11553595_131"},{"key":"S0218001423510230BIB015","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2006.40"},{"first-page":"1","volume-title":"2007 IEEE Conference on Computer Vision and Pattern Recognition","author":"Tsoi Y. 
C.","key":"S0218001423510230BIB016"},{"key":"S0218001423510230BIB017","first-page":"1001","volume-title":"Eighth International Conference on Document Analysis and Recognition (ICDAR\u201905)","volume":"2","author":"Ulges A."},{"key":"S0218001423510230BIB018","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007906904009"},{"key":"S0218001423510230BIB019","first-page":"1398","volume-title":"The Thirty-Seventh Asilomar Conference on Signals, Systems & Computers, 2003","volume":"2","author":"Wang Z."},{"key":"S0218001423510230BIB020","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-70659-3_36"},{"key":"S0218001423510230BIB021","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR56361.2022.9956331"},{"key":"S0218001423510230BIB022","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2675980"},{"key":"S0218001423510230BIB023","doi-asserted-by":"publisher","DOI":"10.1109\/ICIAP.2007.4362769"},{"key":"S0218001423510230BIB024","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2004.1315007"},{"key":"S0218001423510230BIB025","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2009.03.025"}],"container-title":["International Journal of Pattern Recognition and Artificial 
Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001423510230","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,5]],"date-time":"2024-04-05T06:22:30Z","timestamp":1712298150000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001423510230"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2]]},"references-count":23,"journal-issue":{"issue":"02","published-print":{"date-parts":[[2024,2]]}},"alternative-id":["10.1142\/S0218001423510230"],"URL":"https:\/\/doi.org\/10.1142\/s0218001423510230","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"type":"print","value":"0218-0014"},{"type":"electronic","value":"1793-6381"}],"subject":[],"published":{"date-parts":[[2024,2]]},"article-number":"2351023"}}