{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,22]],"date-time":"2026-06-22T07:13:02Z","timestamp":1782112382750,"version":"3.54.5"},"reference-count":50,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2023,11,24]],"date-time":"2023-11-24T00:00:00Z","timestamp":1700784000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,4,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>We present a novel approach for automatically classifying illustrations from historical Chinese local gazetteers using modern deep learning techniques. Our goal is to facilitate the digital organization and study of a large quantity of digitized local gazetteers. We evaluate the performance of eight state-of-the-art deep neural networks on a dataset of 4,309 manually labeled and organized images of Chinese local gazetteer illustrations, grouped into three coarse categories and nine fine classes according to their contents. Our experiments show that DaViT achieved the highest classification accuracy of 93.9 per cent and F1-score of 90.6 per cent. Our results demonstrate the effectiveness of deep learning models in accurately recognizing and categorizing historical local gazetteer illustrations. We also developed a user-friendly web service to enable researchers easy access to the developed models. The potential for extending this method to other collections of scanned documents beyond Chinese local gazetteers makes a significant contribution to the study of visual materials in the arts and history in the digital humanities field. The dataset used in this study is publicly available and can be used for further research in the field.<\/jats:p>","DOI":"10.1093\/llc\/fqad065","type":"journal-article","created":{"date-parts":[[2023,11,25]],"date-time":"2023-11-25T06:43:00Z","timestamp":1700894580000},"page":"61-73","source":"Crossref","is-referenced-by-count":8,"title":["Image classification for historical documents: a study on Chinese local gazetteers"],"prefix":"10.1093","volume":"39","author":[{"given":"Jhe-An","family":"Chen","sequence":"first","affiliation":[{"name":"Center for Geographic Information Science, Research Center for Humanities and Social Sciences, Academia Sinica , Taipei 115201, Taiwan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jen-Chien","family":"Hou","sequence":"additional","affiliation":[{"name":"Center for Geographic Information Science, Research Center for Humanities and Social Sciences, Academia Sinica , Taipei 115201, Taiwan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0513-107X","authenticated-orcid":false,"given":"Richard Tzong-Han","family":"Tsai","sequence":"additional","affiliation":[{"name":"Center for Geographic Information Science, Research Center for Humanities and Social Sciences, Academia Sinica , Taipei 115201, Taiwan"},{"name":"Computer Science and Information Engineering Department, National Central University , Taoyuan 320317, Taiwan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hsiung-Ming","family":"Liao","sequence":"additional","affiliation":[{"name":"Center for Geographic Information Science, Research Center for Humanities and Social Sciences, Academia Sinica , Taipei 115201, Taiwan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5930-4736","authenticated-orcid":false,"given":"Shih-Pei","family":"Chen","sequence":"additional","affiliation":[{"name":"Max Planck Institute for the History of Science , Berlin 14195, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ming-Ching","family":"Chang","sequence":"additional","affiliation":[{"name":"Center for Geographic Information Science, Research Center for Humanities and Social Sciences, Academia Sinica , Taipei 115201, Taiwan"},{"name":"Computer Science Department, University at Albany, State University of New York , Albany, NY 12222, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2023,11,24]]},"reference":[{"key":"2024040210381158700_fqad065-B1","first-page":"pp. 1459","author":"Antonacopoulos","year":"2013"},{"key":"2024040210381158700_fqad065-B2","doi-asserted-by":"crossref","DOI":"10.1515\/MFIR.2003.86","volume-title":"Copyright Issues Relevant to the Creation of a Digital Archive","author":"Besek","year":"2003"},{"key":"2024040210381158700_fqad065-B3","doi-asserted-by":"crossref","first-page":"4522","DOI":"10.1093\/bioinformatics\/btz259","article-title":"Biomedical Image Augmentation Using Augmentor\u2019,","volume":"35","author":"Bloice","year":"2019","journal-title":"Bioinformatics"},{"key":"2024040210381158700_fqad065-B4","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1007\/BF00058655","article-title":"Bagging Predictors\u2019,","volume":"24","author":"Breiman","year":"1996","journal-title":"Machine Learning"},{"key":"2024040210381158700_fqad065-B5","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random Forests\u2019,","volume":"45","author":"Breiman","year":"2001","journal-title":"Machine Learning"},{"key":"2024040210381158700_fqad065-B6","doi-asserted-by":"crossref","first-page":"125","DOI":"10.3390\/info11020125","article-title":"Albumentations: Fast and Flexible Image Augmentations\u2019,","volume":"11","author":"Buslaev","year":"2020","journal-title":"Information"},{"key":"2024040210381158700_fqad065-B7","doi-asserted-by":"crossref","first-page":"544","DOI":"10.1017\/jch.2020.26","article-title":"Local Gazetteers Research Tools: Overview and Research Application\u2019,","volume":"4","author":"Chen","year":"2020","journal-title":"Journal of Chinese History \u4e2d\u570b\u6b77\u53f2\u5b78\u520a"},{"key":"2024040210381158700_fqad065-B8","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1007\/s42803-022-00048-5","article-title":"Treating a Genre as a Database: A Digital Research Methodology for Studying Chinese Local Gazetteers\u2019,","volume":"4","author":"Chen","year":"2023","journal-title":"International Journal of Digital Humanities"},{"key":"2024040210381158700_fqad065-B9","first-page":"1251","author":"Chollet","year":"2017"},{"key":"2024040210381158700_fqad065-B10","first-page":"pp. 248","author":"Deng","year":"2009"},{"key":"2024040210381158700_fqad065-B11","volume-title":"Writing, Publishing, and Reading Local Gazetteers in Imperial China, 1100\u20131700","author":"Dennis","year":"2015"},{"key":"2024040210381158700_fqad065-B12","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1007\/978-3-031-20053-3_5","volume-title":"Computer Vision\u2013ECCV 2022: 17th European Conference, Proceedings, Part XXIV","author":"Ding","year":"2022"},{"key":"2024040210381158700_fqad065-B13","author":"Dosovitskiy","year":"2020"},{"key":"2024040210381158700_fqad065-B14","author":"Du","year":"2020"},{"key":"2024040210381158700_fqad065-B15","first-page":"1612","article-title":"A Short Introduction to Boosting\u2019,","volume":"14","author":"Freund","year":"1999","journal-title":"Journal-Japanese Society for Artificial Intelligence"},{"key":"2024040210381158700_fqad065-B16","author":"Granet","year":"2018"},{"key":"2024040210381158700_fqad065-B17","first-page":"398","author":"Guoxin","year":"2019"},{"key":"2024040210381158700_fqad065-B18","first-page":"770","author":"He","year":"2016"},{"key":"2024040210381158700_fqad065-B19","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1109\/5254.708428","article-title":"Support Vector Machines\u2019,","volume":"13","author":"Hearst","year":"1998","journal-title":"IEEE Intelligent Systems and their Applications"},{"key":"2024040210381158700_fqad065-B20","first-page":"4700","author":"Huang","year":"2017"},{"key":"2024040210381158700_fqad065-B21","doi-asserted-by":"crossref","first-page":"5867","DOI":"10.1007\/s11042-021-11754-7","article-title":"Deep Learning for Historical Books: Classification of Printing Technology for Digitized Images\u2019,","volume":"81","author":"Im","year":"2022","journal-title":"Multimedia Tools and Applications"},{"key":"2024040210381158700_fqad065-B22","author":"Jocher","year":"2020"},{"key":"2024040210381158700_fqad065-B23","author":"Kingma","year":"2014"},{"key":"2024040210381158700_fqad065-B24","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"ImageNet Classification with Deep Convolutional Neural Networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Communications of the ACM"},{"key":"2024040210381158700_fqad065-B25","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based Learning Applied to Document Recognition\u2019,","volume":"86","author":"LeCun","year":"1998","journal-title":"Proceedings of the IEEE"},{"key":"2024040210381158700_fqad065-B26","doi-asserted-by":"crossref","first-page":"3403","DOI":"10.3390\/plants11233403","article-title":"Exploring the Rice Cultivars in Large-scale Chinese Local Gazetteers: A Computational Approach\u2019,","volume":"11","author":"Li","year":"2022","journal-title":"Plants"},{"key":"2024040210381158700_fqad065-B27","doi-asserted-by":"crossref","first-page":"81","DOI":"10.3366\/ijhac.2020.0246","article-title":"Displaying Spatial Epistemologies on Web GIS: Using Visual Materials from the Chinese Local Gazetteers as an Example\u2019,","volume":"14","author":"Lin","year":"2020","journal-title":"International Journal of Humanities and Arts Computing"},{"key":"2024040210381158700_fqad065-B28","first-page":"pp. 1629","author":"Liu","year":"2015"},{"key":"2024040210381158700_fqad065-B29","first-page":"pp. 87","author":"Liu","year":"2015"},{"key":"2024040210381158700_fqad065-B30","first-page":"10012","author":"Liu","year":"2021"},{"key":"2024040210381158700_fqad065-B31","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1007\/s42803-022-00059-2","article-title":"Automatic Biographical Information Extraction from Local Gazetteers with Bi-lstm-crf Model and Bert\u2019,","volume":"4","author":"Liu","year":"2023","journal-title":"International Journal of Digital Humanities"},{"key":"2024040210381158700_fqad065-B32","author":"Luo","year":"2016"},{"key":"2024040210381158700_fqad065-B33","volume-title":"Foundations of Statistical Natural Language Processing","author":"Manning","year":"1999"},{"key":"2024040210381158700_fqad065-B34","doi-asserted-by":"crossref","first-page":"17209","DOI":"10.1007\/s00521-020-04910-x","article-title":"Building an Efficient OCR System for Historical Documents with Little Training Data\u2019,","volume":"32","author":"Mart\u00ednek","year":"2020","journal-title":"Neural Computing and Applications"},{"key":"2024040210381158700_fqad065-B35","first-page":"243","author":"Mohammed","year":"2020"},{"key":"2024040210381158700_fqad065-B36","doi-asserted-by":"crossref","first-page":"1029","DOI":"10.1109\/5.156468","article-title":"Historical Review of OCR Research and Development\u2019,","volume":"80","author":"Mori","year":"1992","journal-title":"Proceedings of the IEEE"},{"key":"2024040210381158700_fqad065-B37","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1353\/late.1993.0001","article-title":"Extraction of Climate Information from Chinese Historical Writings\u2019,","volume":"14","author":"Peiyuan","year":"1993","journal-title":"Late Imperial China"},{"key":"2024040210381158700_fqad065-B38","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-3-031-02146-6","article-title":"Natural Language Processing for Historical Texts\u2019,","volume":"5","author":"Piotrowski","year":"2012","journal-title":"Synthesis Lectures on Human Language Technologies"},{"key":"2024040210381158700_fqad065-B39","author":"Ridnik","year":"2021"},{"key":"2024040210381158700_fqad065-B40","first-page":"pp. 185","author":"Roullet","year":"2021"},{"key":"2024040210381158700_fqad065-B41","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","article-title":"ImageNet Large Scale Visual Recognition Challenge\u2019,","volume":"115","author":"Russakovsky","year":"2015","journal-title":"International Journal of Computer Vision"},{"key":"2024040210381158700_fqad065-B42","doi-asserted-by":"crossref","first-page":"i156","DOI":"10.1093\/llc\/fqz055","article-title":"Automatic Detection and Visualization of Garment Color in Western Portrait Paintings\u2019,","volume":"34","author":"Sar\u0131","year":"2019","journal-title":"Digital Scholarship in the Humanities"},{"key":"2024040210381158700_fqad065-B43","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1109\/TSMCA.2009.2029559","article-title":"Rusboost: A Hybrid Approach to Alleviating Class Imbalance\u2019,","volume":"40","author":"Seiffert","year":"2009","journal-title":"IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans"},{"key":"2024040210381158700_fqad065-B44","author":"Simonyan","year":"2014"},{"key":"2024040210381158700_fqad065-B45","doi-asserted-by":"crossref","first-page":"37","DOI":"10.5120\/ijca2015906677","article-title":"A Survey on Methods for Solving Data Imbalance Problem for Classification\u2019,","volume":"127","author":"Singh","year":"2015","journal-title":"International Journal of Computer Applications"},{"key":"2024040210381158700_fqad065-B46","first-page":"1","author":"Szegedy","year":"2015"},{"key":"2024040210381158700_fqad065-B47","first-page":"6105","author":"Tan","year":"2019"},{"key":"2024040210381158700_fqad065-B48","doi-asserted-by":"crossref","first-page":"30174","DOI":"10.1109\/ACCESS.2018.2840218","article-title":"Dense and Tight Detection of Chinese Characters in Historical Documents: Datasets and a Recognition Guided Detector\u2019,","volume":"6","author":"Yang","year":"2018","journal-title":"IEEE Access"},{"key":"2024040210381158700_fqad065-B49","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1109\/JPROC.2020.3004555","article-title":"A Comprehensive Survey on Transfer Learning\u2019,","volume":"109","author":"Zhuang","year":"2020","journal-title":"Proceedings of the IEEE"},{"key":"2024040210381158700_fqad065-B50","author":"Zhuang","year":"1985"}],"container-title":["Digital Scholarship in the Humanities"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/dsh\/article-pdf\/39\/1\/61\/57134519\/fqad065.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/dsh\/article-pdf\/39\/1\/61\/57134519\/fqad065.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,2]],"date-time":"2024-04-02T13:56:57Z","timestamp":1712066217000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/dsh\/article\/39\/1\/61\/7450448"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11,24]]},"references-count":50,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,11,24]]},"published-print":{"date-parts":[[2024,4,2]]}},"URL":"https:\/\/doi.org\/10.1093\/llc\/fqad065","relation":{},"ISSN":["2055-7671","2055-768X"],"issn-type":[{"value":"2055-7671","type":"print"},{"value":"2055-768X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,4,1]]},"published":{"date-parts":[[2023,11,24]]}}}