{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,15]],"date-time":"2026-04-15T15:21:54Z","timestamp":1776266514919,"version":"3.50.1"},"reference-count":40,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2024,12,7]],"date-time":"2024-12-07T00:00:00Z","timestamp":1733529600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Comput. Cult. Herit."],"published-print":{"date-parts":[[2024,12,31]]},"abstract":"<jats:p>The rapid development of generative artificial intelligence has brought significant opportunities for the advancement of digital humanities research. Intelligent processing of ancient texts, as an essential part of digital humanities, is also undergoing a transformation in research methodologies in the wave of AIGC. The integration of generative pre-trained models with Chinese ancient texts, a vital carrier of Chinese culture, allows for deep mining of the content of these texts and provides services that make ancient texts more understandable and accessible to the general public. In this research, we propose a method that combines the most renowned Chinese anthology, the \u201cSiku Quanshu,\u201d with generative pre-trained models. We developed the SikuGPT model, a generative model for ancient text processing tasks, based on GPT-type language models by continued pretraining. This model was tested on two typical tasks of ancient text processing: translation between classical and modern Chinese, and classification of ancient texts. The findings reveal that our model achieves advantages in understanding and generating scenarios of ancient texts. The capability of SikuGPT in processing traditional Chinese texts helps to promote the organization of ancient information and knowledge services, and advances the international dissemination of traditional Chinese culture.<\/jats:p>","DOI":"10.1145\/3676969","type":"journal-article","created":{"date-parts":[[2024,7,15]],"date-time":"2024-07-15T16:21:44Z","timestamp":1721060504000},"page":"1-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["SikuGPT: A Generative Pre-trained Model for Intelligent Information Processing of Ancient Texts from the Perspective of Digital Humanities"],"prefix":"10.1145","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9048-4552","authenticated-orcid":false,"given":"Chang","family":"Liu","sequence":"first","affiliation":[{"name":"College of Information Management, Nanjing Agricultural University, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5946-3489","authenticated-orcid":false,"given":"Dongbo","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Information Management, Nanjing Agricultural University, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-1992-4296","authenticated-orcid":false,"given":"Zhixiao","family":"Zhao","sequence":"additional","affiliation":[{"name":"College of Information Management, Nanjing Agricultural University, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-6730-450X","authenticated-orcid":false,"given":"Die","family":"Hu","sequence":"additional","affiliation":[{"name":"College of Information Management, Nanjing Agricultural University, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-6424-6875","authenticated-orcid":false,"given":"Mengcheng","family":"Wu","sequence":"additional","affiliation":[{"name":"College of Information Management, Nanjing Agricultural University, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8287-936X","authenticated-orcid":false,"given":"Litao","family":"Lin","sequence":"additional","affiliation":[{"name":"School of Information Management, Nanjing University, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7268-7313","authenticated-orcid":false,"given":"Jiangfeng","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Information Management, Nanjing University, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3698-5385","authenticated-orcid":false,"given":"Hai","family":"Zhang","sequence":"additional","affiliation":[{"name":"College of Information Management, Nanjing Agricultural University, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-3554-4307","authenticated-orcid":false,"given":"Si","family":"Shen","sequence":"additional","affiliation":[{"name":"Group of Science and Technology Full-text Knowledge Mining, School of Economics and Management, Nanjing University of Science and Technology, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7328-9947","authenticated-orcid":false,"given":"Bin","family":"Li","sequence":"additional","affiliation":[{"name":"College of Liberal Art, Nanjing Normal University, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-0642-8138","authenticated-orcid":false,"given":"Lianzhen","family":"Zhao","sequence":"additional","affiliation":[{"name":"School of Foreign Languages, China Pharmaceutical University, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,12,7]]},"reference":[{"key":"e_1_3_1_2_1","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-022-04448-z"},{"key":"e_1_3_1_3_1","first-page":"1877","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","volume":"33","author":"Brown T.","year":"2020","unstructured":"T. Brown, B. Mann, N. Ryder, M. Subbiah, J. D. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. Herbert-Voss, G. Krueger, T. Henighan, R. Child, A. Ramesh, D. Ziegler, J. Wu, C. Winter, C. Hesse, M. Chen, E. Sigler, M. Litwin, S. Gray, B. Chess, J. Clark, C. Berner, S. McCandlish, A. Radford, I. Sutskever, and D. Amodei. (2020). Language models are few-shot learners. In Proceedings of the Advances in Neural Information Processing Systems Vol. 33. 1877\u20131901. Retrieved from https:\/\/proceedings.neurips.cc\/paper\/2020\/hash\/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html"},{"key":"e_1_3_1_4_1","doi-asserted-by":"publisher","unstructured":"E. Chang Y.-T. Shiue H.-S. Yeh and V. Demberg. (2021). Time-aware ancient chinese text translation and inference arXiv:2107.03179). Retrieved from 10.48550\/arXiv.2107.03179","DOI":"10.48550\/arXiv.2107.03179"},{"key":"e_1_3_1_5_1","unstructured":"ChatGPT. (2022). Retrieved May 17 2024 from https:\/\/openai.com\/chatgpt\/"},{"key":"e_1_3_1_6_1","doi-asserted-by":"publisher","unstructured":"H. W. Chung L. Hou S. Longpre B. Zoph Y. Tay W. Fedus Y. Li X. Wang M. Dehghani S. Brahma A. Webson S. S. Gu Z. Dai M. Suzgun X. Chen A. Chowdhery A. Castro-Ros M. Pellat K. Robinson D. Valter S. Narang G. Mishra A. Yu V. Zhao Y. Huang A. Dai H. Yu S. Petrov Ed H. Chi J. Dean J. Devlin A. Roberts D. Zhou Quoc V. Le and J. Wei (2022). Scaling instruction-finetuned language models. arXiv:2210.11416. Retrieved from 10.48550\/arXiv.2210.11416","DOI":"10.48550\/arXiv.2210.11416"},{"key":"e_1_3_1_7_1","unstructured":"Ckiplab\/gpt2-base-chinese \u00b7 Hugging Face. (2021). Retrieved April 8 2023 from https:\/\/huggingface.co\/ckiplab\/gpt2-base-chinese"},{"key":"e_1_3_1_8_1","doi-asserted-by":"publisher","unstructured":"J. Devlin M.-W. Chang K. Lee and K. Toutanova. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805. Retrieved from 10.48550\/arXiv.1810.04805","DOI":"10.48550\/arXiv.1810.04805"},{"key":"e_1_3_1_9_1","first-page":"32","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Dong L.","year":"2019","unstructured":"L. Dong, N. Yang, W. Wang, F. Wei, X. Liu, Y. Wang, J. Gao, M. Zhou, and H.-W. Hon. (2019). Unified language model pre-training for natural language understanding and generation. In Proceedings of the Advances in Neural Information Processing Systems. 32. Retrieved from https:\/\/proceedings.neurips.cc\/paper\/2019\/hash\/c20bb2d9a50d5ac1f713f8b34d9aac5a-Abstract.html"},{"key":"e_1_3_1_10_1","unstructured":"Ethan. (2023). Ethan-yt\/guwenbert. Retrieved from https:\/\/github.com\/Ethan-yt\/guwenbert (Original work published 2020)"},{"key":"e_1_3_1_11_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.2003794117"},{"key":"e_1_3_1_12_1","unstructured":"General Catalog of Chinese Ancient Books. (2016). Retrieved May 13 2024 from https:\/\/bibliographical.guji.cn\/"},{"key":"e_1_3_1_13_1","doi-asserted-by":"publisher","unstructured":"J. Hu and M. Sun. (2020). Generating major types of chinese classical poetry in a uniformed framework. arXiv:2003.11528. Retrieved from 10.48550\/arXiv.2003.11528","DOI":"10.48550\/arXiv.2003.11528"},{"issue":"12","key":"e_1_3_1_14_1","first-page":"138","article-title":"Automatic text classification of \u201cZi\u201d part of siku quanshu from the perspective of digital humanities: Based on SikuBERT and SikuRoBERTa pre-trained models","volume":"42","author":"Hu H.","year":"2022","unstructured":"H. Hu, Y. Zhang, S. Deng, D. Wang, M. Feng, L. Liu, and B. Li. (2022). Automatic text classification of \u201cZi\u201d part of siku quanshu from the perspective of digital humanities: Based on SikuBERT and SikuRoBERTa pre-trained models. LIBRARY TRIBUNE 42, 12 (2022), 138\u2013148.","journal-title":"LIBRARY TRIBUNE"},{"key":"e_1_3_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-06703-7_37"},{"key":"e_1_3_1_16_1","doi-asserted-by":"publisher","unstructured":"M. Lewis Y. Liu N. Goyal M. Ghazvininejad A. Mohamed O. Levy V. Stoyanov and L. Zettlemoyer. (2019). BART: Denoising sequence-to-sequence pre-training for natural language generation translation and comprehension. arXiv:1910.13461. Retrieved from 10.48550\/arXiv.1910.13461","DOI":"10.48550\/arXiv.1910.13461"},{"key":"e_1_3_1_17_1","doi-asserted-by":"publisher","unstructured":"Y. Liao Y. Wang Q. Liu and X. Jiang. (2019). GPT-based generation for classical chinese poetry. arXiv:1907.00151. Retrieved from 10.48550\/arXiv.1907.00151","DOI":"10.48550\/arXiv.1907.00151"},{"key":"e_1_3_1_18_1","doi-asserted-by":"publisher","DOI":"10.19809\/j.cnki.kjqbyj.2023.01.006"},{"key":"e_1_3_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3560815"},{"key":"e_1_3_1_20_1","doi-asserted-by":"publisher","unstructured":"M. Moradi K. Blagec F. Haberl and M. Samwald. (2022). GPT-3 models are poor few-shot learners in the biomedical domain. arXiv:2109.02555. Retrieved from 10.48550\/arXiv.2109.02555","DOI":"10.48550\/arXiv.2109.02555"},{"key":"e_1_3_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICMLA52953.2021.00252"},{"key":"e_1_3_1_22_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.eacl-main.273"},{"key":"e_1_3_1_23_1","unstructured":"A. Radford K. Narasimhan T. Salimans and I. Sutskever. (2018). Improving Language Understanding by Generative Pre-Training. OpenAI. Retrieved from https:\/\/cdn.openai.com\/research-covers\/language-unsupervised\/language_understanding_paper.pdf"},{"key":"e_1_3_1_24_1","unstructured":"A. Radford J. Wu R. Child D. Luan D. Amodei and I. Sutskever. (2019). Language Models are Unsupervised Multitask Learners. OpenAI. Retrieved from https:\/\/cdn.openai.com\/better-language-models\/language_models_are_unsupervised_multitask_learners.pdf"},{"issue":"1","key":"e_1_3_1_25_1","first-page":"140:5485","article-title":"Exploring the limits of transfer learning with a unified text-to-text transformer","volume":"21","author":"Raffel C.","year":"2020","unstructured":"C. Raffel, N. Shazeer, A. Roberts, K. Lee, S. Narang, M. Matena, Y. Zhou, W. Li, and P. J. Liu, (2020). Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research 21, 1 (2020), 140:5485\u2013140:5551.","journal-title":"The Journal of Machine Learning Research"},{"key":"e_1_3_1_26_1","doi-asserted-by":"publisher","unstructured":"V. Sanh A. Webson C. Raffel S. H. Bach L. Sutawika Z. Alyafeai A. Chaffin A. Stiegler T. L. Scao A. Raja M. Dey M. S. Bari C. Xu U. Thakker S. S. Sharma E. Szczechla T. Kim G. Chhablani N. Nayak D. Datta J. Chang Mike T.-J. Jiang H. Wang M. Manica S. Shen Zheng X. Yong H. Pandey R. Bawden T. Wang T. Neeraj J. Rozen A. Sharma A. Santilli T. Fevry Jason A. Fries R. Teehan T. Bers S. Biderman L. Gao T. Wolf and A. M. Rush (2022). Multitask prompted training enables zero-shot task generalization. arXiv:2110.08207. Retrieved from 10.48550\/arXiv.2110.08207","DOI":"10.48550\/arXiv.2110.08207"},{"key":"e_1_3_1_27_1","doi-asserted-by":"publisher","unstructured":"K. Song X. Tan T. Qin J. Lu and T.-Y. Liu. (2019). MASS: Masked sequence to sequence pre-training for language generation. arXiv:1905.02450. Retrieved from 10.48550\/arXiv.1905.02450","DOI":"10.48550\/arXiv.1905.02450"},{"key":"e_1_3_1_28_1","unstructured":"D. Sturgeon (2006). Chinese Text Project. Retrieved May 13 2024 from https:\/\/ctext.org\/"},{"key":"e_1_3_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN52387.2021.9534342"},{"key":"e_1_3_1_30_1","unstructured":"Tongyi. (2023). Retrieved May 17 2024 from https:\/\/tongyi.aliyun.com\/qianwen\/"},{"key":"e_1_3_1_31_1","unstructured":"Uer\/gpt2-chinese-ancient \u00b7 Hugging Face. (2021). Retrieved April 8 2023 from https:\/\/huggingface.co\/uer\/gpt2-chinese-ancient"},{"key":"e_1_3_1_32_1","first-page":"30","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Vaswani A.","year":"2017","unstructured":"A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, \u0141. Kaiser, and I. Polosukhin. (2017). Attention is all you need. In Proceedings of the Advances in Neural Information Processing Systems. 30. Retrieved from https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2017\/hash\/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html"},{"issue":"6","key":"e_1_3_1_33_1","first-page":"31","article-title":"Construction and application of pre-trained models of siku quanshu in orientation to digital humanities","volume":"42","author":"Wang D.","year":"2022","unstructured":"D. Wang, C. Liu, Z. Zhu, J. Liu, H. Hu, S. Si, and B. Li. (2022). Construction and application of pre-trained models of siku quanshu in orientation to digital humanities. Library Tribune 42, 6 (2022), 31\u201343.","journal-title":"Library Tribune"},{"key":"e_1_3_1_34_1","doi-asserted-by":"publisher","DOI":"10.1186\/s40494-023-00882-y"},{"key":"e_1_3_1_35_1","unstructured":"Wenxinyiyan. (2023). Retrieved May 17 2024 from https:\/\/yiyan.baidu.com\/welcome"},{"key":"e_1_3_1_36_1","unstructured":"White Paper on Artificial Intelligence-Generated Content (AIGC) (2022) \u2013 China Information and Communication Academy. Retrieved March 28 2023 from http:\/\/www.caict.ac.cn\/sytj\/202209\/t20220913_408835.htm"},{"key":"e_1_3_1_37_1","unstructured":"Xujiacheng127\/anchi-bert \u00b7 Hugging Face. (2022). Retrieved April 8 2023 from https:\/\/huggingface.co\/xujiacheng127\/anchi-bert"},{"key":"e_1_3_1_38_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-88480-2_10"},{"key":"e_1_3_1_39_1","doi-asserted-by":"publisher","unstructured":"Z. Zhang X. Han Z. Liu X. Jiang M. Sun and Q. Liu. (2019). ERNIE: Enhanced language representation with informative entities. arXiv:1905.07129. Retrieved from 10.48550\/arXiv.1905.07129","DOI":"10.48550\/arXiv.1905.07129"},{"key":"e_1_3_1_40_1","doi-asserted-by":"publisher","unstructured":"Z. Zhao H. Chen J. Zhang X. Zhao T. Liu W. Lu X. Chen H. Deng Q. Ju and X. Du. (2019). UER: An open-source toolkit for pre-training models. arXiv:1909.05658. Retrieved from 10.48550\/arXiv.1909.05658","DOI":"10.48550\/arXiv.1909.05658"},{"key":"e_1_3_1_41_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-3005"}],"container-title":["Journal on Computing and Cultural Heritage"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3676969","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3676969","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T01:19:13Z","timestamp":1750295953000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3676969"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,7]]},"references-count":40,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,12,31]]}},"alternative-id":["10.1145\/3676969"],"URL":"https:\/\/doi.org\/10.1145\/3676969","relation":{},"ISSN":["1556-4673","1556-4711"],"issn-type":[{"value":"1556-4673","type":"print"},{"value":"1556-4711","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,7]]},"assertion":[{"value":"2024-01-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-06-21","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-12-07","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}