{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T14:19:52Z","timestamp":1753885192852,"version":"3.41.2"},"reference-count":37,"publisher":"World Scientific Pub Co Pte Ltd","issue":"01","funder":[{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program","doi-asserted-by":"crossref","award":["2021YFB3300400"],"award-info":[{"award-number":["2021YFB3300400"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62173017"],"award-info":[{"award-number":["62173017"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Model. Simul. Sci. Comput."],"published-print":{"date-parts":[[2024,2]]},"abstract":"<jats:p> Distant supervision has been proven to be an efficient way of generating labeled instances for Named Entity Recognition (NER). However, it suffers from dictionary biases and ambiguous entities, resulting in noisy and incomplete labels. To overcome this drawback, this paper proposes a template augmented distant supervision framework, which generates high-quality labeled training data with minimal human effort. Specifically, we use distant supervision to extract sentences that contain entities and apply a pre-trained language model to encode these sentences. The encoded sentences are clustered and then for each cluster, three sentences are sampled out to form a seed template pool. The seed templates are calibrated and decomposed to decouple the connection between different parts. Finally, the seed templates and entity dictionary are combined with pre-trained language model to generate semantically coherent and precisely labeled training data. Experimental results on the EC and NEWS datasets and a practical electronic after-sale Q&amp;A dataset with multiple pre-trained language models demonstrate that the proposed framework is able to improve the F1 score of the distantly supervised NER models by 7.9%\u201312.9%. <\/jats:p>","DOI":"10.1142\/s1793962324500181","type":"journal-article","created":{"date-parts":[[2023,12,4]],"date-time":"2023-12-04T02:47:32Z","timestamp":1701658052000},"source":"Crossref","is-referenced-by-count":0,"title":["A template augmented distant supervision framework for Chinese named entity recognition"],"prefix":"10.1142","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-8519-9951","authenticated-orcid":false,"given":"Chengwen","family":"Qi","sequence":"first","affiliation":[{"name":"School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, P. R. China"},{"name":"Zhongguancun Laboratory, Beijing 100094, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3834-4602","authenticated-orcid":false,"given":"Yuanjun","family":"Laili","sequence":"additional","affiliation":[{"name":"School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, P. R. China"},{"name":"Zhongguancun Laboratory, Beijing 100094, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6346-6930","authenticated-orcid":false,"given":"Lei","family":"Ren","sequence":"additional","affiliation":[{"name":"School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, P. R. China"},{"name":"Zhongguancun Laboratory, Beijing 100094, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1989-6102","authenticated-orcid":false,"given":"Lin","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, P. R. China"},{"name":"Zhongguancun Laboratory, Beijing 100094, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9444-7652","authenticated-orcid":false,"given":"Bowen","family":"Li","sequence":"additional","affiliation":[{"name":"Shanghai AI Laboratory West Bund, AI Center No. 701 Yunjin Road, Xuhui District Shanghai City, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2024,1,19]]},"reference":[{"key":"S1793962324500181BIB001","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.490"},{"key":"S1793962324500181BIB002","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.428"},{"key":"S1793962324500181BIB003","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.434"},{"key":"S1793962324500181BIB004","first-page":"203","volume-title":"Proc. 2nd Workshop on Noisy User-generated Text","author":"Mishra S.","year":"2016"},{"key":"S1793962324500181BIB005","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2020.11.017"},{"key":"S1793962324500181BIB006","doi-asserted-by":"publisher","DOI":"10.1075\/li.30.1.03nad"},{"key":"S1793962324500181BIB007","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.498"},{"key":"S1793962324500181BIB008","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1231"},{"key":"S1793962324500181BIB009","doi-asserted-by":"publisher","DOI":"10.1109\/BigData50022.2020.9378052"},{"key":"S1793962324500181BIB010","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403149"},{"key":"S1793962324500181BIB011","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.813"},{"key":"S1793962324500181BIB012","doi-asserted-by":"publisher","DOI":"10.21105\/joss.00205"},{"key":"S1793962324500181BIB013","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2020.2981314"},{"key":"S1793962324500181BIB014","first-page":"827","volume-title":"Proc. 2013 Conf. Empirical Methods in Natural Language Processing","author":"Chiticariu L.","year":"2013"},{"key":"S1793962324500181BIB015","doi-asserted-by":"publisher","DOI":"10.1016\/j.autcon.2021.104108"},{"key":"S1793962324500181BIB016","first-page":"282","volume-title":"Proc. 18th Int. Conf. Machine Learning","author":"Lafferty J.","year":"2001"},{"key":"S1793962324500181BIB018","series-title":"Association for Computational Linguistics","first-page":"4171","volume-title":"Proc. 2019 Conf. North American Chapter of the Association for Computational Linguistics: Human Language Technologies","volume":"1","author":"Devlin J.","year":"2019"},{"key":"S1793962324500181BIB019","doi-asserted-by":"publisher","DOI":"10.1109\/ICDSCA53499.2021.9650256"},{"key":"S1793962324500181BIB020","doi-asserted-by":"publisher","DOI":"10.1109\/CISP-BMEI48845.2019.8965823"},{"volume-title":"Int. Conf. Learning Representations","year":"2020","author":"Lan Z.","key":"S1793962324500181BIB021"},{"key":"S1793962324500181BIB023","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.372"},{"key":"S1793962324500181BIB024","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2020\/341"},{"key":"S1793962324500181BIB025","doi-asserted-by":"publisher","DOI":"10.3115\/1690219.1690287"},{"key":"S1793962324500181BIB026","first-page":"893","volume-title":"Proc. 2012 Joint Conf. Empirical Methods in Natural Language Processing and Computational Natural Language Learning","author":"Lin T.","year":"2012"},{"key":"S1793962324500181BIB027","first-page":"94","volume-title":"26th AAAI Conf. Artificial Intelligence","author":"Ling X.","year":"2012"},{"key":"S1793962324500181BIB028","first-page":"1488","volume-title":"Proc. 51st Annual Meeting of the Association for Computational Linguistics","volume":"1","author":"Nakashole N.","year":"2013"},{"key":"S1793962324500181BIB029","first-page":"2159","volume-title":"Proc. 27th Int. Conf. Computational Linguistics","author":"Yang Y.","year":"2018"},{"key":"S1793962324500181BIB030","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1230"},{"key":"S1793962324500181BIB031","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.371"},{"key":"S1793962324500181BIB032","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.519"},{"key":"S1793962324500181BIB033","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6358"},{"key":"S1793962324500181BIB034","first-page":"108","volume-title":"Proc. 5th SIGHAN Workshop Chinese Language Processing","author":"Levow G.","year":"2006"},{"key":"S1793962324500181BIB035","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-demos.12"},{"key":"S1793962324500181BIB036","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"S1793962324500181BIB037","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2016.09.123"},{"key":"S1793962324500181BIB038","doi-asserted-by":"publisher","DOI":"10.1145\/3357713.3384307"},{"key":"S1793962324500181BIB039","doi-asserted-by":"publisher","DOI":"10.3115\/977035.977059"}],"container-title":["International Journal of Modeling, Simulation, and Scientific Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S1793962324500181","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,27]],"date-time":"2024-03-27T08:20:34Z","timestamp":1711527634000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S1793962324500181"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,19]]},"references-count":37,"journal-issue":{"issue":"01","published-print":{"date-parts":[[2024,2]]}},"alternative-id":["10.1142\/S1793962324500181"],"URL":"https:\/\/doi.org\/10.1142\/s1793962324500181","relation":{},"ISSN":["1793-9623","1793-9615"],"issn-type":[{"type":"print","value":"1793-9623"},{"type":"electronic","value":"1793-9615"}],"subject":[],"published":{"date-parts":[[2024,1,19]]},"article-number":"2450018"}}