{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,2]],"date-time":"2026-07-02T18:59:38Z","timestamp":1783018778048,"version":"3.54.6"},"publisher-location":"New York, NY, USA","reference-count":21,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,8,25]],"date-time":"2023-08-25T00:00:00Z","timestamp":1692921600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,8,25]]},"DOI":"10.1145\/3604951.3605514","type":"proceedings-article","created":{"date-parts":[[2023,8,1]],"date-time":"2023-08-01T17:20:33Z","timestamp":1690910433000},"page":"85-90","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Enhancing Named Entity Recognition for Holocaust Testimonies through Pseudo Labelling and Transformer-based Models"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6283-3460","authenticated-orcid":false,"given":"Isuri","family":"Anuradha Nanomi Arachchige","sequence":"first","affiliation":[{"name":"Faculty of Science &amp; Engineering\/ School of computer science &amp; Mathematics, University of Wolverhampton, United Kingdom"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9203-0957","authenticated-orcid":false,"given":"Le","family":"Ha","sequence":"additional","affiliation":[{"name":"Faculty of Science &amp; Engineering\/ School of computer science &amp; Mathematics, University of Wolverhampton, United Kingdom"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2522-066X","authenticated-orcid":false,"given":"Ruslan","family":"Mitkov","sequence":"additional","affiliation":[{"name":"Computing and Communications, Lancastser University, United Kingdom"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3822-6008","authenticated-orcid":false,"given":"Johannes-Dieter","family":"Steinert","sequence":"additional","affiliation":[{"name":"Faculty of Social Science\/Modern European History and Migration Studies, University of Wolverhampton, United Kingdom"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2023,8,25]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages. 68\u201372","author":"Aprosio Alessio\u00a0Palmero","year":"2022","unstructured":"Alessio\u00a0Palmero Aprosio , Stefano Menini , and Sara Tonelli . 2022 . BERToldo, the Historical BERT for Italian . In Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages. 68\u201372 . Alessio\u00a0Palmero Aprosio, Stefano Menini, and Sara Tonelli. 2022. BERToldo, the Historical BERT for Italian. In Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages. 68\u201372."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00146-021-01368-w"},{"key":"e_1_3_2_1_3_1","volume-title":"Named-entity recognition in Turkish legal texts. Natural Language Engineering","author":"\u00c7etinda\u011f Can","year":"2022","unstructured":"Can \u00c7etinda\u011f , Berkay Yaz\u0131c\u0131o\u011flu , and Aykut Ko\u00e7 . 2022. Named-entity recognition in Turkish legal texts. Natural Language Engineering ( 2022 ), 1\u201328. Can \u00c7etinda\u011f, Berkay Yaz\u0131c\u0131o\u011flu, and Aykut Ko\u00e7. 2022. Named-entity recognition in Turkish legal texts. Natural Language Engineering (2022), 1\u201328."},{"key":"e_1_3_2_1_4_1","volume-title":"Unsupervised cross-lingual representation learning at scale. arXiv preprint arXiv:1911.02116","author":"Conneau Alexis","year":"2019","unstructured":"Alexis Conneau , Kartikay Khandelwal , Naman Goyal , Vishrav Chaudhary , Guillaume Wenzek , Francisco Guzm\u00e1n , Edouard Grave , Myle Ott , Luke Zettlemoyer , and Veselin Stoyanov . 2019. Unsupervised cross-lingual representation learning at scale. arXiv preprint arXiv:1911.02116 ( 2019 ). Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzm\u00e1n, Edouard Grave, Myle Ott, Luke Zettlemoyer, and Veselin Stoyanov. 2019. Unsupervised cross-lingual representation learning at scale. arXiv preprint arXiv:1911.02116 (2019)."},{"key":"e_1_3_2_1_5_1","volume-title":"Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-13643-6_26"},{"key":"e_1_3_2_1_7_1","unstructured":"Helena Hubkov\u00e1. 2019. Named-entity recognition in Czech historical texts: Using a CNN-BiLSTM neural network model.  Helena Hubkov\u00e1. 2019. Named-entity recognition in Czech historical texts: Using a CNN-BiLSTM neural network model."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2020.2981314"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1075\/li.30.1.03nad"},{"key":"e_1_3_2_1_10_1","volume-title":"Text chunking using transformation-based learning. Natural language processing using very large corpora","author":"Ramshaw A","year":"1999","unstructured":"Lance\u00a0 A Ramshaw and Mitchell\u00a0 P Marcus . 1999. Text chunking using transformation-based learning. Natural language processing using very large corpora ( 1999 ), 157\u2013176. Lance\u00a0A Ramshaw and Mitchell\u00a0P Marcus. 1999. Text chunking using transformation-based learning. Natural language processing using very large corpora (1999), 157\u2013176."},{"key":"e_1_3_2_1_11_1","volume-title":"hmbert: Historical multilingual language models for named entity recognition. arXiv preprint arXiv:2205.15575","author":"Schweter Stefan","year":"2022","unstructured":"Stefan Schweter , Luisa M\u00e4rz , Katharina Schmid , and Erion \u00c7ano . 2022. hmbert: Historical multilingual language models for named entity recognition. arXiv preprint arXiv:2205.15575 ( 2022 ). Stefan Schweter, Luisa M\u00e4rz, Katharina Schmid, and Erion \u00c7ano. 2022. hmbert: Historical multilingual language models for named entity recognition. arXiv preprint arXiv:2205.15575 (2022)."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2022.3206539"},{"key":"e_1_3_2_1_13_1","volume-title":"Deep learning approaches for question answering system. Procedia computer science 132","author":"Sharma Yashvardhan","year":"2018","unstructured":"Yashvardhan Sharma and Sahil Gupta . 2018. Deep learning approaches for question answering system. Procedia computer science 132 ( 2018 ), 785\u2013794. Yashvardhan Sharma and Sahil Gupta. 2018. Deep learning approaches for question answering system. Procedia computer science 132 (2018), 785\u2013794."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMPTELIX.2017.8003957"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jcim.9b00470"},{"key":"e_1_3_2_1_16_1","volume-title":"Xlnet: Generalized autoregressive pretraining for language understanding. Advances in neural information processing systems 32","author":"Yang Zhilin","year":"2019","unstructured":"Zhilin Yang , Zihang Dai , Yiming Yang , Jaime Carbonell , Russ\u00a0 R Salakhutdinov , and Quoc\u00a0 V Le . 2019 . Xlnet: Generalized autoregressive pretraining for language understanding. Advances in neural information processing systems 32 (2019). Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Russ\u00a0R Salakhutdinov, and Quoc\u00a0V Le. 2019. Xlnet: Generalized autoregressive pretraining for language understanding. Advances in neural information processing systems 32 (2019)."},{"key":"e_1_3_2_1_17_1","volume-title":"Collabonet: collaboration of deep neural networks for biomedical named entity recognition. BMC bioinformatics 20, 10","author":"Yoon Wonjin","year":"2019","unstructured":"Wonjin Yoon , Chan\u00a0Ho So , Jinhyuk Lee , and Jaewoo Kang . 2019. Collabonet: collaboration of deep neural networks for biomedical named entity recognition. BMC bioinformatics 20, 10 ( 2019 ), 55\u201365. Wonjin Yoon, Chan\u00a0Ho So, Jinhyuk Lee, and Jaewoo Kang. 2019. Collabonet: collaboration of deep neural networks for biomedical named entity recognition. BMC bioinformatics 20, 10 (2019), 55\u201365."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1002\/widm.1253"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.3390\/e22020252"},{"key":"e_1_3_2_1_20_1","volume-title":"Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence.","author":"Zhong Zexuan","year":"2018","unstructured":"Zexuan Zhong , Jiaqi Guo , Wei Yang , Tao Xie , Jian-Guang Lou , Ting Liu , and Dongmei Zhang . 2018 . Generating regular expressions from natural language specifications: Are we there yet? . In Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence. Zexuan Zhong, Jiaqi Guo, Wei Yang, Tao Xie, Jian-Guang Lou, Ting Liu, and Dongmei Zhang. 2018. Generating regular expressions from natural language specifications: Are we there yet?. In Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence."},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the 20th chinese national conference on computational linguistics. 1218\u20131227","author":"Zhuang Liu","year":"2021","unstructured":"Liu Zhuang , Lin Wayne , Shi Ya , and Zhao Jun . 2021 . A robustly optimized BERT pre-training approach with post-training . In Proceedings of the 20th chinese national conference on computational linguistics. 1218\u20131227 . Liu Zhuang, Lin Wayne, Shi Ya, and Zhao Jun. 2021. A robustly optimized BERT pre-training approach with post-training. In Proceedings of the 20th chinese national conference on computational linguistics. 1218\u20131227."}],"event":{"name":"HIP '23: 7th International Workshop on Historical Document Imaging and Processing","location":"San Jose CA USA","acronym":"HIP '23"},"container-title":["Proceedings of the 7th International Workshop on Historical Document Imaging and Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3604951.3605514","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3604951.3605514","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:46:37Z","timestamp":1750178797000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3604951.3605514"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,25]]},"references-count":21,"alternative-id":["10.1145\/3604951.3605514","10.1145\/3604951"],"URL":"https:\/\/doi.org\/10.1145\/3604951.3605514","relation":{},"subject":[],"published":{"date-parts":[[2023,8,25]]},"assertion":[{"value":"2023-08-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}