{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,18]],"date-time":"2025-12-18T20:00:42Z","timestamp":1766088042823,"version":"3.37.3"},"reference-count":46,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2023,4,25]],"date-time":"2023-04-25T00:00:00Z","timestamp":1682380800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,4,25]],"date-time":"2023-04-25T00:00:00Z","timestamp":1682380800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100005063","name":"Excellent Young Talents Fund Program of Higher Education Institutions of Anhui Province","doi-asserted-by":"publisher","award":["gxyq2020031"],"award-info":[{"award-number":["gxyq2020031"]}],"id":[{"id":"10.13039\/501100005063","id-type":"DOI","asserted-by":"publisher"}]},{"name":"University Natural Science Research Projects of Anhui Province","award":["KJ2021A0516","2022AH050972"],"award-info":[{"award-number":["KJ2021A0516","2022AH050972"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2023,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Entity synonyms play a significant role in entity-based tasks. Previous approaches use linguistic syntax, distributional, and semantic features to expand entity synonym sets from text corpora. Due to the flexibility and complexity of the Chinese language expression, the aforementioned approaches are still difficult to expand entity synonym sets robustly from Chinese text, because these approaches fail to track holistic semantics among entities and suffer from error propagation. This paper introduces an approach for expanding Chinese entity synonym sets based on bilateral context and filtering strategy. Specifically, the approach consists of two novel components. First, a bilateral-context-based Siamese network classifier is proposed to determine whether a new entity should be inserted into the existing entity synonym set. The classifier tracks the holistic semantics of bilateral contexts and is capable of imposing soft holistic semantic constraints to improve synonym prediction. Second, a filtering-strategy-based set expansion algorithm is presented to generate Chinese entity synonym sets. The filtering strategy enhances semantic and domain consistencies to filter out wrong synonym entities, thereby mitigating error propagation. Experimental results on two Chinese real-world datasets demonstrate that the proposed approach is effective and outperforms the selected existing state-of-the-art approaches to the Chinese entity synonym set expansion task.<\/jats:p>","DOI":"10.1007\/s40747-023-01064-w","type":"journal-article","created":{"date-parts":[[2023,4,25]],"date-time":"2023-04-25T17:04:13Z","timestamp":1682442253000},"page":"6065-6085","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion"],"prefix":"10.1007","volume":"9","author":[{"given":"Subin","family":"Huang","sequence":"first","affiliation":[]},{"given":"Yu","family":"Xiu","sequence":"additional","affiliation":[]},{"given":"Jun","family":"Li","sequence":"additional","affiliation":[]},{"given":"Sanmin","family":"Liu","sequence":"additional","affiliation":[]},{"given":"Chao","family":"Kong","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,4,25]]},"reference":[{"key":"1064_CR1","unstructured":"Mahdisoltani F, Biega J, Suchanek FM (2015) YAGO3: a knowledge base from multilingual wikipedias. In: Seventh biennial conference on innovative data systems research, CIDR 2015, Asilomar, CA, USA, January 4\u20137, 2015"},{"key":"1064_CR2","doi-asserted-by":"crossref","unstructured":"Xu B, Xu Y, Liang J, Xie C, Liang B, Cui W, Xiao Y (2017) Cn-dbpedia: a never-ending Chinese knowledge extraction system. In: Advances in artificial intelligence: from theory to practice\u201430th international conference on industrial engineering and other applications of applied intelligent systems. IEA\/AIE 2017, Arras, France, June 27\u201330, part II, vol 10351, pp 428\u2013438","DOI":"10.1007\/978-3-319-60045-1_44"},{"key":"1064_CR3","doi-asserted-by":"crossref","unstructured":"Qi F, Chang L, Sun M, Ouyang S, Liu Z (2020) Towards building a multilingual sememe knowledge base: Predicting sememes for BabelNet synsets. In: The thirty-fourth AAAI conference on artificial intelligence, AAAI 2020, the thirty-second innovative applications of artificial intelligence conference, IAAI 2020, the tenth AAAI symposium on educational advances in artificial intelligence, EAAI 2020, New York, NY, USA, February 7\u201312, pp 8624\u20138631","DOI":"10.1609\/aaai.v34i05.6386"},{"key":"1064_CR4","doi-asserted-by":"crossref","unstructured":"Rios-Alvarado AB, Martinez-Rodriguez JL, Garcia-Perez AG, Guerrero-Melendez TY, Lopez-Arevalo I, Gonzalez-Compean JL (2022) Exploiting lexical patterns for knowledge graph construction from unstructured text in Spanish. Complex Intell Syst 9:1281\u20131297","DOI":"10.1007\/s40747-022-00805-7"},{"key":"1064_CR5","doi-asserted-by":"crossref","unstructured":"Gupta A, Lebret R, Harkous H, Aberer K (2017) Taxonomy induction using hypernym subsequences. In: Proceedings of the 2017 ACM on conference on information and knowledge management. CIKM 2017, Singapore, November 06\u201310, pp 1329\u20131338","DOI":"10.1145\/3132847.3133041"},{"key":"1064_CR6","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2019.07.032","volume":"182","author":"S Huang","year":"2019","unstructured":"Huang S, Luo X, Huang J, Guo Y, Gu S (2019) An unsupervised approach for learning a Chinese IS-A taxonomy from an unstructured corpus. Knowl Based Syst 182:104861","journal-title":"Knowl Based Syst"},{"issue":"14","key":"1064_CR7","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.5696","volume":"32","author":"S Huang","year":"2020","unstructured":"Huang S, Luo X, Huang J, Wang H, Gu S, Guo Y (2020) Improving taxonomic relation learning via incorporating relation descriptions into word embeddings. Concurr Comput: Pract Exp 32(14):e5696","journal-title":"Concurr Comput: Pract Exp"},{"key":"1064_CR8","doi-asserted-by":"crossref","unstructured":"Shen J, Shen Z, Xiong C, Wang C, Wang K, Han J (2020) TaxoExpan: self-supervised taxonomy expansion with position-enhanced graph neural network. In: Huang Y, King I, Liu T, van Steen M (eds) WWW \u201920: the web conference 2020, Taipei, Taiwan, April 20\u201324, pp 486\u2013497","DOI":"10.1145\/3366423.3380132"},{"issue":"1","key":"1064_CR9","doi-asserted-by":"publisher","DOI":"10.1111\/exsy.12603","volume":"38","author":"S Gu","year":"2021","unstructured":"Gu S, Luo X, Wang H, Huang J, Wei Q, Huang S (2021) Improving answer selection with global features. Expert Syst: J Knowl Eng 38(1):e12603","journal-title":"Expert Syst: J Knowl Eng"},{"key":"1064_CR10","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.107626","volume":"235","author":"M Bakhshi","year":"2022","unstructured":"Bakhshi M, Nematbakhsh M, Mohsenzadeh M, Rahmani AM (2022) SParseQA: sequential word reordering and parsing for answering complex natural language questions over knowledge graphs. Knowl Based Syst 235:107626","journal-title":"Knowl Based Syst"},{"key":"1064_CR11","doi-asserted-by":"publisher","first-page":"851","DOI":"10.1007\/s40747-021-00448-0","volume":"8","author":"X Li","year":"2022","unstructured":"Li X, Alazab M, Li Q, Yu K, Yin Q (2022) Question-aware memory network for multi-hop question answering in human\u2013robot interaction. Complex Intell Syst 8:851\u2013861","journal-title":"Complex Intell Syst"},{"key":"1064_CR12","doi-asserted-by":"crossref","unstructured":"Shen J, Qiu W, Shang J, Vanni M, Ren X, Han J (2020) Synsetexpan: an iterative framework for joint entity set expansion and synonym discovery. In: Proceedings of the 2020 conference on empirical methods in natural language processing EMNLP 2020, Online, November 16\u201320, pp 8292\u20138307","DOI":"10.18653\/v1\/2020.emnlp-main.666"},{"key":"1064_CR13","doi-asserted-by":"crossref","unstructured":"Huang S, Luo X, Huang J, Qin W, Gu S (2020a) Neural entity synonym set generation using association information and entity constraint. In: 2020 IEEE international conference on knowledge graph, ICKG 2020, Online, August 9\u201311, pp 321\u2013328","DOI":"10.1109\/ICBK50248.2020.00053"},{"key":"1064_CR14","doi-asserted-by":"crossref","unstructured":"Yang Y, Yin X, Yang H, Fei X, Peng H, Zhou K, Lai K, Shen J (2021) KGSynNet: a novel entity synonyms discovery framework with knowledge graph. In: Database systems for advanced applications\u201426th international conference. DASFAA 2021, Taipei, China, April 11\u201314, part I, vol 12681, pp 174\u2013190","DOI":"10.1007\/978-3-030-73194-6_13"},{"key":"1064_CR15","doi-asserted-by":"crossref","unstructured":"Shen J, Lyu R, Ren X, Vanni M, Sadler BM, Han J (2019) Mining entity synonyms with efficient neural set generation. In: The thirty-third AAAI conference on artificial intelligence, AAAI 2019, the thirty-first innovative applications of artificial intelligence conference, IAAI 2019, the ninth AAAI symposium on educational advances in artificial intelligence, EAAI 2019, Honolulu, HI, USA, January 27\u2013February 1, pp 249\u2013256","DOI":"10.1609\/aaai.v33i01.3301249"},{"key":"1064_CR16","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1186\/1471-2105-9-159","volume":"9","author":"JP McCrae","year":"2008","unstructured":"McCrae JP, Collier N (2008) Synonym set extraction from the biomedical literature by lexical pattern discovery. BMC Bioinform 9:159","journal-title":"BMC Bioinform"},{"key":"1064_CR17","doi-asserted-by":"crossref","unstructured":"Wang W, Thomas C, Sheth AP, Chan V (2010) Pattern-based synonym and antonym extraction. In: Proceedings of the 48th annual southeast regional conference, Oxford, MS, USA, April 15\u201317, p 64","DOI":"10.1145\/1900008.1900094"},{"key":"1064_CR18","unstructured":"Li W, Lu Q (2011) A hybrid extraction model for Chinese noun\/verb synonymous bi-gram collocations. In: Proceedings of the 25th Pacific Asia conference on language, information and computation, PACLIC 25, Singapore, December 16\u201318, pp 430\u2013439"},{"key":"1064_CR19","doi-asserted-by":"crossref","unstructured":"Nguyen KA, im Walde SS, Vu NT (2017) Distinguishing antonyms and synonyms in a pattern-based neural network. In: Proceedings of the 15th conference of the European chapter of the association for computational linguistics. EACL 2017, Valencia, Spain, April 3\u20137, pp 76\u201385","DOI":"10.18653\/v1\/E17-1008"},{"issue":"2\u20133","key":"1064_CR20","doi-asserted-by":"publisher","first-page":"146","DOI":"10.1080\/00437956.1954.11659520","volume":"10","author":"ZS Harris","year":"1954","unstructured":"Harris ZS (1954) Distributional structure. Word 10(2\u20133):146\u2013162","journal-title":"Word"},{"key":"1064_CR21","doi-asserted-by":"crossref","unstructured":"Qu M, Ren X, Han J (2017) Automatic synonym discovery with knowledge bases. In: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, Halifax, NS, Canada, August 13\u201317, pp 997\u20131005","DOI":"10.1145\/3097983.3098185"},{"key":"1064_CR22","doi-asserted-by":"crossref","unstructured":"Turney PD (2001) Mining the web for synonyms: PMI-IR versus LSA on TOEFL. In: Machine learning: EMCL 2001, 12th European conference on machine learning, Freiburg, Germany, September 5\u20137, vol 2167, pp 491\u2013502","DOI":"10.1007\/3-540-44795-4_42"},{"key":"1064_CR23","doi-asserted-by":"crossref","unstructured":"Chakrabarti K, Chaudhuri S, Cheng T, Xin D (2012) A framework for robust discovery of entity synonyms. In: The 18th ACM SIGKDD international conference on knowledge discovery and data mining. KDD\u201912, Beijing, China, August 12\u201316, pp 1384\u20131392","DOI":"10.1145\/2339530.2339743"},{"issue":"3","key":"1064_CR24","doi-asserted-by":"publisher","first-page":"42","DOI":"10.4018\/IJIIT.2019070103","volume":"15","author":"X Ma","year":"2019","unstructured":"Ma X, Luo X, Huang S, Guo Y (2019) Multi-distribution characteristics based Chinese entity synonym extraction from the web. Int J Intell Inf Technol 15(3):42\u201363","journal-title":"Int J Intell Inf Technol"},{"key":"1064_CR25","doi-asserted-by":"crossref","unstructured":"Zhang C, Li Y, Du N, Fan W, Yu PS (2020) Entity synonym discovery via multipiece bilateral context matching. In Proceedings of the twenty-ninth international joint conference on artificial intelligence, IJCAI 2020, pp 1431\u20131437","DOI":"10.24963\/ijcai.2020\/199"},{"key":"1064_CR26","doi-asserted-by":"crossref","unstructured":"Dorow B, Widdows D (2003) Discovering corpus-specific word senses. In: EACL 2003, 10th conference of the European chapter of the association for computational linguistics, Budapest, Hungary, April 12\u201317, pp 79\u201382","DOI":"10.3115\/1067737.1067753"},{"key":"1064_CR27","unstructured":"Duan L, Chen J, Li H, Li A (2010) A Chinese synonyms reduced algorithm based on sememe tree. In: International conference on computational aspects of social networks. CASON 2010, Taiyuan, China, September 26\u201328, pp 337\u2013340"},{"key":"1064_CR28","doi-asserted-by":"crossref","unstructured":"Ustalov D, Panchenko A, Biemann C (2017) Automatic induction of synsets from a graph of synonyms. In: Proceedings of the 55th annual meeting of the association for computational linguistics, ACL 2017, Vancouver, Canada, July 30\u2013August 4, vol 1, Long papers, pp 1579\u20131590","DOI":"10.18653\/v1\/P17-1145"},{"issue":"1","key":"1064_CR29","doi-asserted-by":"publisher","first-page":"130","DOI":"10.1016\/j.ipm.2018.10.002","volume":"56","author":"G Ercan","year":"2019","unstructured":"Ercan G, Haziyev F (2019) Synset expansion on translation graph for automatic wordnet construction. Inf Process Manag 56(1):130\u2013150","journal-title":"Inf Process Manag"},{"key":"1064_CR30","doi-asserted-by":"crossref","unstructured":"Ren X, Cheng T (2015) Synonym discovery for structured entities on heterogeneous graphs. In: Proceedings of the 24th international conference on world wide web companion. WWW 2015, Florence, Italy, May 18\u201322, pp 443\u2013453","DOI":"10.1145\/2740908.2745396"},{"key":"1064_CR31","doi-asserted-by":"crossref","unstructured":"Shen J, Wu Z, Lei D, Shang J, Ren X, Han J (2017) Setexpan: corpus-based set expansion via context feature selection and rank ensemble. In: Machine learning and knowledge discovery in databases\u2014European conference, ECML PKDD 2017, Skopje, Macedonia, September 18\u201322, vol 10534, pp 288\u2013304","DOI":"10.1007\/978-3-319-71249-9_18"},{"key":"1064_CR32","doi-asserted-by":"crossref","unstructured":"Huang S, Qin W, Zhao S, Gu S (2020c) An automatic approach for extracting Chinese entity synonyms from encyclopedias. In: Proceedings of the 2020 3rd international conference on big data technologies, ICBDT, Qingdao, China, September 18\u201320","DOI":"10.1145\/3422713.3422737"},{"key":"1064_CR33","doi-asserted-by":"crossref","unstructured":"Wang C, Yan J, Zhou A, He X (2017) Transductive non-linear learning for Chinese hypernym prediction. In: Proceedings of the 55th annual meeting of the association for computational linguistics. ACL 2017, Vancouver, Canada, July 30\u2013August 4, vol 1, Long papers, pp 1394\u20131404","DOI":"10.18653\/v1\/P17-1128"},{"key":"1064_CR34","doi-asserted-by":"crossref","unstructured":"Hearst MA (1992) Automatic acquisition of hyponyms from large text corpora. In: The 14th international conference on computational linguistics. COLING 1992, Nantes, France, August 23\u201328, pp 539\u2013545","DOI":"10.3115\/992133.992154"},{"key":"1064_CR35","doi-asserted-by":"crossref","unstructured":"Kwong OY, Tsou BK (2006) Feasibility of enriching a Chinese synonym dictionary with a synchronous Chinese corpus. In: Advances in natural language processing, 5th international conference on NLP. FINTAL 2006, Turku, Finland, August 23\u201325, vol 4139, pp 322\u2013332","DOI":"10.1007\/11816508_33"},{"key":"1064_CR36","unstructured":"Yu L-C, Chien W-N, Chen S-T (2011) A baseline system for Chinese near-synonym choice. In: Fifth international joint conference on natural language processing. IJCNLP 2011, Chiang Mai, Thailand, November 8\u201313, pp 1366\u20131370"},{"key":"1064_CR37","doi-asserted-by":"crossref","unstructured":"Gan Y (2017) A study on Chinese synonyms: from the perspective of collocations. In: Chinese lexical semantics\u201418th workshop. CLSW 2017, Leshan, China, May 18\u201320, revised selected papers, vol 10709, pp 586\u2013600","DOI":"10.1007\/978-3-319-73573-3_53"},{"key":"1064_CR38","unstructured":"Lu Y, Hou H (2008) Research on automatic acquiring of Chinese synonyms from wiki repository. In: Proceedings of the 2008 IEEE\/WIC\/ACM international conference on web intelligence and international conference on intelligent agent technology\u2014workshops. Sydney, NSW, Australia, December 9\u201312, pp 287\u2013290"},{"key":"1064_CR39","doi-asserted-by":"crossref","unstructured":"Mintz M, Bills S, Snow R, Jurafsky D (2009) Distant supervision for relation extraction without labeled data. In: ACL 2009: proceedings of the 47th annual meeting of the association for computational linguistics and the 4th international joint conference on natural language processing of the AFNLP. Singapore, August 2\u20137, pp 1003\u20131011","DOI":"10.3115\/1690219.1690287"},{"key":"1064_CR40","doi-asserted-by":"crossref","unstructured":"Vashishth S, Joshi R, Prayaga SS, Bhattacharyya C, Talukdar PP (2018) RESIDE: improving distantly-supervised neural relation extraction using side information. In: Proceedings of the 2018 conference on empirical methods in natural language processing, Brussels, Belgium, October 31\u2013November 4, pp 1257\u20131266","DOI":"10.18653\/v1\/D18-1157"},{"key":"1064_CR41","unstructured":"Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. In: 1st International conference on learning representations, ICLR 2013, Scottsdale, AZ, USA, May 2\u20134 (workshop track proceedings)"},{"key":"1064_CR42","doi-asserted-by":"crossref","unstructured":"Ji G, Liu K, He S, Zhao J (2017) Distant supervision for relation extraction with sentence-level attention and entity descriptions. In: Proceedings of the 31st AAAI conference on artificial intelligence, San Francisco, CA, USA, February 4\u20139, pp 3060\u20133066","DOI":"10.1609\/aaai.v31i1.10953"},{"key":"1064_CR43","unstructured":"Zaheer M, Kottur S, Ravanbakhsh S, Poczos B, Salakhutdinov R, Smola AJ (2017) Deep sets. In: Advances in neural information processing systems 30: annual conference on neural information processing systems 2017, Long Beach, CA, USA, December 4\u20139, pp 3391\u20133401"},{"key":"1064_CR44","doi-asserted-by":"crossref","unstructured":"Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of international conference on neural networks (ICNN\u201395), Perth, WA, Australia, November 27\u2013December 1, pp 1942\u20131948","DOI":"10.1109\/ICNN.1995.488968"},{"issue":"6","key":"1064_CR45","doi-asserted-by":"publisher","first-page":"2715","DOI":"10.1109\/TCYB.2019.2933499","volume":"50","author":"Z-J Wang","year":"2020","unstructured":"Wang Z-J, Zhan Z-H, Yu W, Lin Y, Zhang J, Gu T, Zhang J (2020) Dynamic group learning distributed particle swarm optimization for large-scale optimization and its application in cloud workflow scheduling. IEEE Trans Cybern 50(6):2715\u20132729","journal-title":"IEEE Trans Cybern"},{"key":"1064_CR46","unstructured":"Devlin J, Chang M-W, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies. NAACL-HLT 2019, Minneapolis, MN, USA, June 2\u20137, vol 1 (long and short papers), pp 4171\u20134186"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-023-01064-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-023-01064-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-023-01064-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,22]],"date-time":"2023-09-22T17:03:46Z","timestamp":1695402226000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-023-01064-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,25]]},"references-count":46,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2023,10]]}},"alternative-id":["1064"],"URL":"https:\/\/doi.org\/10.1007\/s40747-023-01064-w","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"type":"print","value":"2199-4536"},{"type":"electronic","value":"2198-6053"}],"subject":[],"published":{"date-parts":[[2023,4,25]]},"assertion":[{"value":"11 October 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 March 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 April 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"On behalf of all authors, the corresponding author states that there is no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}