{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,23]],"date-time":"2025-11-23T15:03:27Z","timestamp":1763910207294,"version":"3.41.2"},"reference-count":32,"publisher":"Emerald","issue":"2","license":[{"start":{"date-parts":[[2020,3,25]],"date-time":"2020-03-25T00:00:00Z","timestamp":1585094400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["AJIM"],"published-print":{"date-parts":[[2020,3,25]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>The purpose of this paper is to explore the task design and assignment of full-text generation on mass Chinese historical archives (CHAs) by crowdsourcing, with special attention paid to how to best divide full-text generation tasks into smaller ones assigned to crowdsourced volunteers and to improve the digitization of mass CHAs and the data-oriented processing of the digital humanities.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>This paper starts from the complexities of character recognition of mass CHAs, takes Sheng Xuanhuai archives crowdsourcing project of Shanghai Library as a case study, and makes use of the theories of archival science, including diplomatics of Chinese archival documents, and the historical approach of Chinese archival traditions as the theoretical basis and analysis methods. The results are generated through the comprehensive research.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>This paper points out that volunteer tasks of full-text generation include transcription, punctuation, proofreading, metadata description, segmentation, and attribute annotation in digital humanities and provides a metadata element set for volunteers to use in creating or revising metadata descriptions and also provides an attribute tag set. The two sets can be used across the humanities to construct overall observations about texts and the archives of which they are a part. Along these lines, this paper presents significant insights for application in outlining the principles, methods, activities, and procedures of crowdsourced full-text generation for mass CHAs.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>This study is the first to explore and identify the effective design and allocation of tasks for crowdsourced volunteers completing full-text generation on CHAs in digital humanities.<\/jats:p><\/jats:sec>","DOI":"10.1108\/ajim-09-2019-0245","type":"journal-article","created":{"date-parts":[[2020,3,25]],"date-time":"2020-03-25T06:00:37Z","timestamp":1585116037000},"page":"262-286","source":"Crossref","is-referenced-by-count":10,"title":["Task design and assignment of full-text generation on mass Chinese historical archives in digital humanities"],"prefix":"10.1108","volume":"72","author":[{"given":"Jihong","family":"Liang","sequence":"first","affiliation":[]},{"given":"Hao","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Xiaojing","family":"Li","sequence":"additional","affiliation":[]}],"member":"140","reference":[{"issue":"5895","key":"key2020042013533620200_ref001","doi-asserted-by":"crossref","first-page":"1465","DOI":"10.1126\/science.1160379","article-title":"reCAPTCHA: human-based character recognition via Web security measures","volume":"321","year":"2008","journal-title":"Science"},{"issue":"2","key":"key2020042013533620200_ref002","doi-asserted-by":"crossref","first-page":"383","DOI":"10.17723\/aarc.72.2.g54085061q586416","article-title":"Envisioning the archival commons","volume":"72","year":"2009","journal-title":"American Archivist"},{"issue":"6","key":"key2020042013533620200_ref003","article-title":"Moving the crowd at istockphoto: the composition of the crowd and motivations for participation in A crowdsourcing application","volume":"13","year":"2008","journal-title":"First Monday"},{"unstructured":"Causer, T. and Terras, M. (2014), \u201c\u2018Many hands make light work. many hands together make merry work\u2019: transcribe Bentham and crowdsourcing manuscript collections\u201d, in Ridge, M. (Ed.), Crowdsourcing Our Cultural Heritage, Ashgate Publishing, Farnham, pp. 57-88.","key":"key2020042013533620200_ref004"},{"unstructured":"Chen, S., Du, X. and Xiang, J. (2011), \u201cThe document processing workflow of THDL\u201d, in Xiang, J. (Ed.), From Preservation to Knowledge Creation: The Way to Digital Humanities, National Taiwan University Press, Taibei, pp. 51-66.","key":"key2020042013533620200_ref005"},{"key":"key2020042013533620200_ref006","first-page":"148","article-title":"Arrangement of Chinese historical materials","volume":"1","year":"1929","journal-title":"History Annual"},{"unstructured":"Deutsche Forschungsgemeinschaft (DFG) (2009), \u201cDFG practical guidelines on digitisation\u201d, available at: https:\/\/www.dfg.de\/formulare\/12_151\/12_151_en.pdf (accessed 2 August, 2019).","key":"key2020042013533620200_ref007"},{"key":"key2020042013533620200_ref008","first-page":"5","article-title":"Ancient books entering the age of 3.0: gulian Company launching ancient books arrangement and publication system platform","year":"2018","journal-title":"Guangming Daily"},{"unstructured":"Duranti, L. (2015), \u201cArchival bond\u201d, in Duranti, L. and Franks, P.C. (Ed.), Encyclopedia of Archival Science, Rowman & Littlefield, Lanham, p. 28.","key":"key2020042013533620200_ref009"},{"issue":"2","key":"key2020042013533620200_ref010","doi-asserted-by":"crossref","first-page":"387","DOI":"10.17723\/aarc.70.2.d157t6667g54536g","article-title":"Archives of the people, by the people, for the people","volume":"70","year":"2007","journal-title":"American Archivist"},{"key":"key2020042013533620200_ref011","first-page":"84","article-title":"A preliminary study of readjustment on archives","year":"1936","journal-title":"Bulletin of the National Palace Museum of Peiping"},{"issue":"3","key":"key2020042013533620200_ref012","first-page":"51","article-title":"Research on quality control of archival crowdsourcing in China","year":"2019","journal-title":"Archives Science Bulletin"},{"issue":"4","key":"key2020042013533620200_ref013","article-title":"Measuring the correctness of double-keying: error classification and quality control in a large corpus of TEI- annotated historical text","year":"2013","journal-title":"Journal of the Text Encoding Initiative"},{"issue":"3","key":"key2020042013533620200_ref014","first-page":"73","article-title":"Influence factors of task performance on crowdsourcing transcription platform in digital humanity domain: perspectives of task complexity and domain knowledge","year":"2019","journal-title":"Library and Information"},{"issue":"6","key":"key2020042013533620200_ref015","first-page":"1","article-title":"The rise of crowdsourcing","volume":"14","year":"2006","journal-title":"Wired Magazine"},{"issue":"2","key":"key2020042013533620200_ref016","first-page":"93","article-title":"Research on the problems existed in digitizing description of historical archives and its solutions","year":"2017","journal-title":"Archives Science Study"},{"volume-title":"Socialization of Archival Resource: Historic Change in Archival Resource Structure","year":"2019","first-page":"208","key":"key2020042013533620200_ref017"},{"issue":"5","key":"key2020042013533620200_ref018","first-page":"88","article-title":"Archives digitization and public service of the second historical archives of China","year":"2016","journal-title":"Archives Science Study"},{"issue":"4","key":"key2020042013533620200_ref019","first-page":"1","article-title":"The framework and approaches of crowdsourcing in archives information resources construction","year":"2019","journal-title":"Archives Science Bulletin"},{"volume-title":"Written on Bamboo and Silk: The Beginnings of Chinese Books and Inscriptions","year":"2004","first-page":"204","key":"key2020042013533620200_ref020"},{"volume-title":"Modern Archives: Principles and Techniques","year":"1996","first-page":"185","key":"key2020042013533620200_ref021"},{"issue":"3","key":"key2020042013533620200_ref022","first-page":"2","article-title":"Big? Smart? Clean? Messy? Data in the humanities","volume":"2","year":"2013","journal-title":"Journal of Digital Humanities"},{"volume-title":"Selection of Sheng Xuanhuai","year":"2014","first-page":"1","key":"key2020042013533620200_ref023"},{"key":"key2020042013533620200_ref024","first-page":"9","article-title":"Profile on archival arrangement of archives of the Palace museum","volume-title":"Proceedings of Chinese Museum Society","year":"1935"},{"key":"key2020042013533620200_ref025","first-page":"198","article-title":"Proceedings of the School of sinological research at the national university of peking","volume-title":"The Journal of Sinological Studies","year":"1923"},{"key":"key2020042013533620200_ref200","first-page":"78","article-title":"Danxin archives of Taiwan created in the late Qing dynasty and their arrangement","volume-title":"The Journal of Chinese Social and Economic History","year":"2017"},{"issue":"2","key":"key2020042013533620200_ref026","first-page":"161","article-title":"Research on massive historical documents digitalization processing method based on network crowdsourcing mode","volume":"39","year":"2019","journal-title":"Journal of Modern Information"},{"volume-title":"An Illustrated Survey of Archives in the Ming and Qing Dynasty","year":"2000","first-page":"5","key":"key2020042013533620200_ref027"},{"issue":"5","key":"key2020042013533620200_ref028","first-page":"30","article-title":"Application research on crowdsourcing in the construction of ancient books database in China","volume":"46","year":"2016","journal-title":"Library Research"},{"volume-title":"Wenshi Tongyi","year":"1994","first-page":"587","key":"key2020042013533620200_ref029"},{"issue":"3","key":"key2020042013533620200_ref030","first-page":"21","article-title":"Exploring participants' motivations in sustained stage of citizen science projects in digital humanities domain: a case study on Transcribe Sheng","year":"2018","journal-title":"Documentation, Information and Knowledge"},{"unstructured":"Shanghai Library. (2019), \u201cHistorical document crowdsourcing system\u201d, available at: http:\/\/zb.library.sh.cn\/frontProject.jspx?completeType=11 (accessed 31 July 2019).","key":"key2020042013533620200_ref031"}],"container-title":["Aslib Journal of Information Management"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/AJIM-09-2019-0245\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/AJIM-09-2019-0245\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T23:01:15Z","timestamp":1753398075000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/ajim\/article\/72\/2\/262-286\/21014"}},"subtitle":["A crowdsourcing approach"],"short-title":[],"issued":{"date-parts":[[2020,3,25]]},"references-count":32,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,3,25]]}},"alternative-id":["10.1108\/AJIM-09-2019-0245"],"URL":"https:\/\/doi.org\/10.1108\/ajim-09-2019-0245","relation":{},"ISSN":["2050-3806"],"issn-type":[{"type":"print","value":"2050-3806"}],"subject":[],"published":{"date-parts":[[2020,3,25]]}}}