{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,5,13]],"date-time":"2025-05-13T22:00:23Z","timestamp":1747173623596,"version":"3.40.5"},"reference-count":51,"publisher":"Cambridge University Press (CUP)","issue":"4","license":[{"start":{"date-parts":[[2022,12,22]],"date-time":"2022-12-22T00:00:00Z","timestamp":1671667200000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":["cambridge.org"],"crossmark-restriction":true},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2023,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Understanding various historical entity information (e.g., persons, locations, and time) plays a very important role in reasoning about the developments of historical events. With the increasing concern about the fields of digital humanities and natural language processing, named entity recognition (NER) provides a feasible solution for automatically extracting these entities from historical texts, especially in Chinese historical research. However, previous approaches are domain-specific, ineffective with relatively low accuracy, and non-interpretable, which hinders the development of NER in Chinese history. In this paper, we propose a new hybrid deep learning model called \u201csubword-based ensemble network\u201d (SEN), by incorporating subword information and a novel attention fusion mechanism. The experiments on a massive self-built Chinese historical corpus CMAG show that SEN has achieved the best with 93.87% for F1-micro and 89.70% for F1-macro, compared with other advanced models. Further investigation reveals that SEN has a strong generalization ability of NER on Chinese historical texts, which is not only relatively insensitive to the categories with fewer annotation labels (e.g., OFI) but can also accurately capture diverse local and global semantic relations. 
Our research demonstrates the effectiveness of the integration of subword information and attention fusion, which provides an inspiring solution for the practical use of entity extraction in the Chinese historical domain.<\/jats:p>","DOI":"10.1017\/s1351324922000493","type":"journal-article","created":{"date-parts":[[2022,12,22]],"date-time":"2022-12-22T07:59:30Z","timestamp":1671695970000},"page":"1043-1065","update-policy":"https:\/\/doi.org\/10.1017\/policypage","source":"Crossref","is-referenced-by-count":1,"title":["SEN: A subword-based ensemble network for Chinese historical entity extraction"],"prefix":"10.1017","volume":"29","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1128-550X","authenticated-orcid":false,"given":"Chengxi","family":"Yan","sequence":"first","affiliation":[]},{"given":"Ruojia","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Xiaoke","family":"Fang","sequence":"additional","affiliation":[]}],"member":"56","published-online":{"date-parts":[[2022,12,22]]},"reference":[{"key":"S1351324922000493_ref15","doi-asserted-by":"publisher","DOI":"10.1109\/PROC.1973.9030"},{"key":"S1351324922000493_ref27","unstructured":"Li, L. , Mao, T. , Huang, D. and Yang, Y. (2006). Hybrid models for Chinese named entity recognition. In Proceedings of the 5th SIGHAN Workshop on Chinese Language Processing. Stroudsburg, PA, USA: Association for Computational Linguistics, pp. 72\u201378."},{"key":"S1351324922000493_ref39","unstructured":"Vaswani, A. , Shazeer, N. , Parmar, N. , Uszkoreit, J. , Jones, L. , Gomez, A.N. , Kaiser, \u0141. and Polosukhin, I. (2017). Attention is all you need. In Proceedings of the 31st Annual Conference on Neural Information Processing Systems. NeurIPS, pp. 5998\u20136008."},{"key":"S1351324922000493_ref24","unstructured":"Leong, K.S. , Wong, F. , Li, Y. and Dong, M.C. (2008). Chinese tagging based on maximum entropy model. In Proceedings of the 6th SIGHAN Workshop on Chinese Language Processing, pp. 
138\u2013142."},{"key":"S1351324922000493_ref8","unstructured":"Chen, A. , Peng, F. , Shan, R. and Sun, G. (2006). Chinese named entity recognition with conditional probabilistic models. In Proceedings of the 5th SIGHAN Workshop on Chinese Language Processing. Stroudsburg, PA, USA: Association for Computational Linguistics, pp. 173\u2013176."},{"key":"S1351324922000493_ref32","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1101"},{"key":"S1351324922000493_ref19","doi-asserted-by":"publisher","DOI":"10.1145\/3325730.3325736"},{"key":"S1351324922000493_ref29","doi-asserted-by":"publisher","DOI":"10.1109\/BigData.2015.7363931"},{"key":"S1351324922000493_ref3","unstructured":"Botha, J. and Blunsom, P. (2014). Compositional morphology for word representations and language modelling. In Proceedings of the 31st International Conference on Machine Learning, (ICML), pp. 1899\u20131907."},{"key":"S1351324922000493_ref34","unstructured":"Mikolov, T. , Sutskever, I. , Chen, K. , Corrado, G.S. and Dean, J. (2013). Distributed representations of words and phrases and their compositionality. In Proceedings of the 26th International Conference on Neural Information Processing Systems. NeurIPS, pp. 3111\u20133119."},{"key":"S1351324922000493_ref23","unstructured":"Lafferty, J.D. , McCallum, A. and Pereira, F.C. (2001). Conditional Random Fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of 18th International Conference on Machine Learning, (ICML), pp. 282\u2013289."},{"key":"S1351324922000493_ref26","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2020.2981314"},{"key":"S1351324922000493_ref4","doi-asserted-by":"publisher","DOI":"10.1109\/ICSC.2007.107"},{"key":"S1351324922000493_ref6","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.12029"},{"key":"S1351324922000493_ref50","doi-asserted-by":"publisher","DOI":"10.1145\/3308560.3317704"},{"key":"S1351324922000493_ref51","unstructured":"Zhu, Y. 
and Wang, G. (2019). CAN-NER: Convolutional attention network for Chinese named entity recognition. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics. Stroudsburg, PA, USA: Association for Computational Linguistics, pp. 3384\u20133393."},{"key":"S1351324922000493_ref43","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-14331-6_31"},{"key":"S1351324922000493_ref5","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v31i1.10993"},{"key":"S1351324922000493_ref10","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2013.03.002"},{"key":"S1351324922000493_ref16","doi-asserted-by":"publisher","DOI":"10.1007\/s11432-020-2982-y"},{"key":"S1351324922000493_ref13","unstructured":"Devlin, J. , Chang, M.W. , Lee, K. and Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg, PA, USA: Association for Computational Linguistics, pp. 4171\u20134186."},{"key":"S1351324922000493_ref7","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1366"},{"key":"S1351324922000493_ref14","doi-asserted-by":"publisher","DOI":"10.1145\/3132847.3133088"},{"key":"S1351324922000493_ref31","unstructured":"Luong, M.-T. , Socher, R. and Manning, C.D. (2013). Better word representations with recursive neural networks for morphology. In Proceedings of the 17th Conference on Computational Natural Language Learning. Stroudsburg, PA, USA: Association for Computational Linguistics, pp. 
104\u2013113."},{"key":"S1351324922000493_ref36","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1162"},{"key":"S1351324922000493_ref37","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-12640-1_34"},{"key":"S1351324922000493_ref38","first-page":"65","article-title":"Mencius: A Chinese named entity recognizer using the maximum entropy-based hybrid model","volume":"9","author":"Tsai","year":"2004","journal-title":"International Journal of Computational Linguistics and Chinese Language Processing"},{"key":"S1351324922000493_ref40","doi-asserted-by":"publisher","DOI":"10.1525\/ae.1986.13.4.02a00020"},{"key":"S1351324922000493_ref25","unstructured":"Levow, G.A. (2006). The third international Chinese language processing bakeoff: Word segmentation and named entity recognition. In Proceedings of the 5th SIGHAN Workshop on Chinese Language Processing. Stroudsburg, PA, USA: Association for Computational Linguistics, pp. 108\u2013117."},{"key":"S1351324922000493_ref47","doi-asserted-by":"crossref","unstructured":"Yu, P. and Wang, X. (2021). BERT-based named entity recognition in Chinese Twenty-Four Histories. In International Conference on Web Information Systems and Applications. Cham, Switzerland: Springer, pp. 289\u2013301.","DOI":"10.1007\/978-3-030-60029-7_27"},{"key":"S1351324922000493_ref41","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.121"},{"key":"S1351324922000493_ref45","doi-asserted-by":"publisher","DOI":"10.1109\/BigData50022.2020.9378009"},{"key":"S1351324922000493_ref30","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-49508-8_34"},{"key":"S1351324922000493_ref35","unstructured":"Peng, W. , Cheng, H. and Chen, S.-P. (2018). From text to data: Extracting posting data from Chinese local gazetteers. In Proceedings of the 9th International Conference of Digital Archives and Digital Humanities. DADH, pp. 
79\u2013125."},{"key":"S1351324922000493_ref44","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3358117"},{"key":"S1351324922000493_ref12","doi-asserted-by":"publisher","DOI":"10.1017\/jch.2020.23"},{"key":"S1351324922000493_ref28","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6338"},{"key":"S1351324922000493_ref20","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-16-6471-7_24"},{"key":"S1351324922000493_ref49","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-68636-1_10"},{"key":"S1351324922000493_ref9","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00104"},{"key":"S1351324922000493_ref22","unstructured":"Kingma, D.P. and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980."},{"key":"S1351324922000493_ref46","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-99495-6_16"},{"key":"S1351324922000493_ref33","unstructured":"Meng, Y. , Wu, W. , Wang, F. , Li, X. , Nie, P. , Yin, F. , Li, M. , Han, Q. , Sun, X. and Li, J. (2019). Glyce: Glyph-vectors for Chinese character representations. In Proceedings of the 33rd Conference on Neural Information Processing Systems. NeurIPS, pp. 2746\u20132757."},{"key":"S1351324922000493_ref17","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2019\/692"},{"key":"S1351324922000493_ref21","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2942433"},{"key":"S1351324922000493_ref11","unstructured":"Dauphin, Y.N. , Fan, A. , Auli, M. and Grangier, D. (2017). Language modeling with gated convolutional networks. In Proceedings of the 34th International Conference on Machine Learning, (ICML), pp. 933\u2013941."},{"key":"S1351324922000493_ref1","doi-asserted-by":"publisher","DOI":"10.1186\/s40655-015-0007-3"},{"key":"S1351324922000493_ref48","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3358005"},{"key":"S1351324922000493_ref2","unstructured":"Bhojanapalli, S. , Yun, C. , Rawat, A.S. , Reddi, S. and Kumar, S. (2020). 
Low-rank bottleneck in multi-head attention models. In Proceedings of the 37th International Conference on Machine Learning, (ICML), pp. 864\u2013873."},{"key":"S1351324922000493_ref18","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"S1351324922000493_ref42","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01716-3_8"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324922000493","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,7,19]],"date-time":"2023-07-19T08:59:19Z","timestamp":1689757159000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324922000493\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,22]]},"references-count":51,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,7]]}},"alternative-id":["S1351324922000493"],"URL":"https:\/\/doi.org\/10.1017\/s1351324922000493","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"type":"print","value":"1351-3249"},{"type":"electronic","value":"1469-8110"}],"subject":[],"published":{"date-parts":[[2022,12,22]]},"assertion":[{"value":"\u00a9 The Author(s), 2022. Published by Cambridge University Press","name":"copyright","label":"Copyright","group":{"name":"copyright_and_licensing","label":"Copyright and Licensing"}}]}}