{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T03:00:35Z","timestamp":1775617235395,"version":"3.50.1"},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,6,2]],"date-time":"2022-06-02T00:00:00Z","timestamp":1654128000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,6,2]],"date-time":"2022-06-02T00:00:00Z","timestamp":1654128000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Yunnan Science and Technology Major Project","award":["202002AE090010"],"award-info":[{"award-number":["202002AE090010"]}]},{"name":"Subproject 5 of Yunnan Science and Technology Major","award":["202002AD080002-5"],"award-info":[{"award-number":["202002AD080002-5"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2023,2]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Software knowledge community contains a large scale of software knowledge entity information, complex structure and rich semantic correlations. It is significant to recognize and extract software knowledge entity from software knowledge community, as it has great impact on entity-centric tasks such as software knowledge graph construction, software document generation and expert recommendation. Since the texts of the software knowledge community are unstructured by user-generated texts, it is difficult to apply the traditional entity extraction method in the domain of the software knowledge community due to the problems of entity variation, entity sparsity, entity ambiguity, out-of-vocabulary (OOV) words and the lack of annotated data sets. This paper proposes a novel software knowledge entity extraction model, named AttenSy-SNER, which integrates syntactic features and semantic augmentation information, to extract fine-grained software knowledge entities from unstructured user-generated content. The input representation layer utilizes Bidirectional Encoder Representations from Transformers (BERT) model to extract the feature representation of the input sequence. The contextual coding layer leverages the Bidirectional Long Short-Term Memory (BiLSTM) network and Graph Convolutional Network (GCN) for contextual information and syntactic dependency information, and a semantic augmentation strategy based on attention mechanism is introduced to enrich the semantic feature representation of sequences as well. The tag decoding layer leverages Conditional Random Fields (CRF) to solve the dependency between the output tags and obtain the global optimal label sequence. The results of model comparison experiments show that the proposed model has better performance than the benchmark model in software engineering domain.<\/jats:p>","DOI":"10.1007\/s40747-022-00742-5","type":"journal-article","created":{"date-parts":[[2022,6,2]],"date-time":"2022-06-02T09:02:54Z","timestamp":1654160574000},"page":"25-39","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["AttenSy-SNER: software knowledge entity extraction with syntactic features and semantic augmentation information"],"prefix":"10.1007","volume":"9","author":[{"given":"Mingjing","family":"Tang","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3257-213X","authenticated-orcid":false,"given":"Tong","family":"Li","sequence":"additional","affiliation":[]},{"given":"Wei","family":"Gao","sequence":"additional","affiliation":[]},{"given":"Yu","family":"Xia","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,6,2]]},"reference":[{"key":"742_CR1","doi-asserted-by":"publisher","unstructured":"Wang T, Yin G, Wang HM, Yang C, Zou P (2015) Automatic knowledge sharing across communities: a case study on android issue tracker and StackOverflow. In: 2015 IEEE symposium on service-oriented system engineering, San Francisco, CA, USA, pp 107\u2013116. https:\/\/doi.org\/10.1109\/SOSE.2015.34","DOI":"10.1109\/SOSE.2015.34"},{"key":"742_CR2","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2021.3070843","author":"SX Ji","year":"2020","unstructured":"Ji SX, Pan SR, Cambria E, Marttinen P, Yu PS (2020) A survey on knowledge graphs: representation, acquisition, and applications. IEEE Trans Neural Netw Learn Syst. https:\/\/doi.org\/10.1109\/TNNLS.2021.3070843","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"issue":"3","key":"742_CR3","doi-asserted-by":"publisher","first-page":"809","DOI":"10.1016\/j.ipm.2019.01.005","volume":"56","author":"X Tang","year":"2019","unstructured":"Tang X, Chen L, Cui J, Wei BG (2019) Knowledge representation learning with entity descriptions, hierarchical types, and textual relations. Inf Process & Manag 56(3):809\u2013822. https:\/\/doi.org\/10.1016\/j.ipm.2019.01.005","journal-title":"Inf Process & Manag"},{"key":"742_CR4","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2020.2981314","author":"J Li","year":"2020","unstructured":"Li J, Sun AX, Han JL, Li CL (2020) A survey on deep learning for named entity recognition. IEEE Trans Knowl Data Eng. https:\/\/doi.org\/10.1109\/TKDE.2020.2981314","journal-title":"IEEE Trans Knowl Data Eng"},{"issue":"12","key":"742_CR5","doi-asserted-by":"publisher","first-page":"163","DOI":"10.14569\/IJACSA.2017.081220","volume":"8","author":"PD Soomro","year":"2017","unstructured":"Soomro PD, Kumar S, Banbhrani, Shaikh AA, Raj H (2017) Bio-NER: biomedical named entity recognition using rule-based and statistical learners. Int J Adv Comput Sci Appl 8(12):163\u2013170. https:\/\/doi.org\/10.14569\/IJACSA.2017.081220","journal-title":"Int J Adv Comput Sci Appl"},{"key":"742_CR6","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1016\/j.procs.2016.09.123","volume":"100","author":"AP Quimbaya","year":"2016","unstructured":"Quimbaya AP, M\u00fanera AS, Rivera RAG, Rodr\u00edguez JCD, Velandia OMM, Pe\u00f1a AAG et al (2016) Named entity recognition over electronic health records through a combined Dictionary-based approach. Proc Comput Sci 100:55\u201361. https:\/\/doi.org\/10.1016\/j.procs.2016.09.123","journal-title":"Proc Comput Sci"},{"issue":"6","key":"742_CR7","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1016\/j.jbi.2004.08.005","volume":"37","author":"J Zhang","year":"2004","unstructured":"Zhang J, Shen D, Zhou GD, Su J, Tan CL (2004) Enhancing HMM-based biomedical named entity recognition by studying special phenomena. J Biomed Inform 37(6):411\u2013422. https:\/\/doi.org\/10.1016\/j.jbi.2004.08.005","journal-title":"J Biomed Inform"},{"issue":"5","key":"742_CR8","doi-asserted-by":"publisher","first-page":"52117","DOI":"10.1103\/PhysRevE.89.052117","volume":"89","author":"R Cofre","year":"2014","unstructured":"Cofre R, Cessac B (2014) Exact computation of the maximum-entropy potential of spiking neural-network models. Phys Rev E Stat Nonlin Soft Matter Phys 89(5):52117\u201352130. https:\/\/doi.org\/10.1103\/PhysRevE.89.052117","journal-title":"Phys Rev E Stat Nonlin Soft Matter Phys"},{"key":"742_CR9","doi-asserted-by":"publisher","DOI":"10.1109\/ICCSIT.2008.187","author":"A Mansouri","year":"2008","unstructured":"Mansouri A, Affendy LS, Mamat A (2008) A new fuzzy support vector machine method for named entity recognition. Int Conf Comput Sci Inf Technol. https:\/\/doi.org\/10.1109\/ICCSIT.2008.187","journal-title":"Int Conf Comput Sci Inf Technol"},{"issue":"5","key":"742_CR10","doi-asserted-by":"publisher","first-page":"625","DOI":"10.3233\/SW-170253","volume":"8","author":"GA Seker","year":"2017","unstructured":"Seker GA, Eryigit G (2017) Extending a CRF-based named entity recognition model for Turkish well formed text and user generated content. Semantic Web 8(5):625\u2013642. https:\/\/doi.org\/10.3233\/SW-170253","journal-title":"Semantic Web"},{"issue":"14","key":"742_CR11","doi-asserted-by":"publisher","first-page":"i37","DOI":"10.1093\/bioinformatics\/btx228","volume":"33","author":"M Habibi","year":"2017","unstructured":"Habibi M, Weber L, Neves M, Wiegandt DL, Leser U (2017) Deep learning with word embeddings improves biomedical named entity recognition. Bioinformatics 33(14):i37\u2013i48. https:\/\/doi.org\/10.1093\/bioinformatics\/btx228","journal-title":"Bioinformatics"},{"key":"742_CR12","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2020.2994436","author":"Z Tang","year":"2020","unstructured":"Tang Z, By W, Yang L (2020) Word-character graph convolution network for chinese named entity recognition. IEEE\/ACM Trans Audio Speech Lang Process. https:\/\/doi.org\/10.1109\/TASLP.2020.2994436","journal-title":"IEEE\/ACM Trans Audio Speech Lang Process"},{"key":"742_CR13","doi-asserted-by":"publisher","unstructured":"Ye DH, Xing ZC, Foo CY, Ang ZQ, Li J, Kapre N (2016) Software-specific named entity recognition in software engineering social content. In: IEEE 23rd international conference on software analysis, evolution, and reengineering, pp 90\u2013101. https:\/\/doi.org\/10.1109\/SANER.2016.10","DOI":"10.1109\/SANER.2016.10"},{"key":"742_CR14","doi-asserted-by":"publisher","unstructured":"Zhao XJ, Xing ZC, Kabir MS, Sawada N, Li J, Lin SW (2017) Hdskg: harvesting domain specific knowledge graph from content of webpages. In: IEEE 24th international conference on software analysis, evolution and reengineering, pp 56\u201367. https:\/\/doi.org\/10.1109\/SANER.2017.7884609","DOI":"10.1109\/SANER.2017.7884609"},{"key":"742_CR15","doi-asserted-by":"publisher","unstructured":"Guo JP, Luo H, Sun Y (2019) Research on extracting named entities in software engineering field from wiki webpage. In: IEEE international conference on consumer electronics\u2014Taiwan, pp 1\u20132. https:\/\/doi.org\/10.1109\/ICCE-TW46550.2019.8991742","DOI":"10.1109\/ICCE-TW46550.2019.8991742"},{"key":"742_CR16","doi-asserted-by":"publisher","unstructured":"Reddy MVPR, Prasad PVRD, Chikkamath M, Mandadi S (2019) NERSE: named entity recognition in software engineering as a service. In: Australian symposium on service research and innovation, pp 65\u201380. https:\/\/doi.org\/10.1007\/978-3-030-32242-7_6","DOI":"10.1007\/978-3-030-32242-7_6"},{"issue":"1","key":"742_CR17","doi-asserted-by":"publisher","first-page":"1027","DOI":"10.32604\/cmc.2020.012478","volume":"66","author":"WQ Lv","year":"2021","unstructured":"Lv WQ, Liao ZF, Liu SZ, Zhang Y (2021) MEIM: a multi-source software knowledge entity extraction integration model. Comput Mater Continua 66(1):1027\u20131042. https:\/\/doi.org\/10.32604\/cmc.2020.012478","journal-title":"Comput Mater Continua"},{"key":"742_CR18","doi-asserted-by":"publisher","unstructured":"Tabassum J, Maddela M, Xu W, Ritter A (2020) Code and named entity recognition in StackOverflow. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 4913\u20134926. https:\/\/doi.org\/10.18653\/v1\/2020.acl-main.443","DOI":"10.18653\/v1\/2020.acl-main.443"},{"key":"742_CR19","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00104","author":"JPC Chiu","year":"2016","unstructured":"Chiu JPC, Nichols E (2016) Named entity recognition with bidirectional LSTM-CNNs. Trans Assoc Comput Linguistics. https:\/\/doi.org\/10.1162\/tacl_a_00104","journal-title":"Trans Assoc Comput Linguistics"},{"key":"742_CR20","doi-asserted-by":"publisher","unstructured":"Strubell E, Verga P, Belanger D, McCallum A (2017) Fast and accurate entity recognition with iterated dilated convolutions. In: Proceedings of the 2017 conference on empirical methods in natural language processing, Copenhagen, Denmark, pp 2670\u20132680. https:\/\/doi.org\/10.18653\/v1\/D17-1283","DOI":"10.18653\/v1\/D17-1283"},{"key":"742_CR21","doi-asserted-by":"publisher","unstructured":"Xu MB, Jiang H, Watcharawittayakul S (2017) A local detection approach for named entity recognition and mention detection. In: Proceedings of the 55th annual meeting of the association for computational linguistics, Vancouver, Canada, pp 1237\u20131247. https:\/\/doi.org\/10.18653\/v1\/P17-1114","DOI":"10.18653\/v1\/P17-1114"},{"key":"742_CR22","doi-asserted-by":"publisher","unstructured":"Devlin J, Chang MW, Lee K, Toutanova K (2019) Bert: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the north American chapter of the association for computational linguistics, Minneapolis, Minnesota, pp 4171\u20134186. https:\/\/doi.org\/10.18653\/v1\/N19-1423","DOI":"10.18653\/v1\/N19-1423"},{"issue":"8","key":"742_CR23","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735\u20131780. https:\/\/doi.org\/10.1162\/neco.1997.9.8.1735","journal-title":"Neural Comput"},{"key":"742_CR24","doi-asserted-by":"crossref","unstructured":"Yao L, Mao CS, Luo Y (2019) Graph convolutional networks for text classification. In: the 33rd AAAI conference on artificial intelligence, pp 7370\u20137377","DOI":"10.1609\/aaai.v33i01.33017370"},{"key":"742_CR25","doi-asserted-by":"publisher","unstructured":"Marcheggiani D, Titov I (2017) Encoding sentences with graph convolutional networks for semantic role labeling. In: Proceedings of the 2017 conference on empirical methods in natural language processing, Copenhagen, Denmark, pp 1506\u20131515. https:\/\/doi.org\/10.18653\/v1\/D17-1159","DOI":"10.18653\/v1\/D17-1159"},{"key":"742_CR26","doi-asserted-by":"publisher","unstructured":"Guo ZJ, Zhang Y, Lu W (2019) Attention guided graph convolutional networks for relation extraction. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 241\u2013251. https:\/\/doi.org\/10.18653\/v1\/P19-1024","DOI":"10.18653\/v1\/P19-1024"},{"key":"742_CR27","doi-asserted-by":"publisher","unstructured":"Bastings J, Titov I, Aziz W, Marcheggiani D, Sima'an K (2017) Graph convolutional encoders for syntax-aware neural machine translation. In: Proceedings of the 2017 conference on empirical methods in natural language processing, pp 1957\u20131967. https:\/\/doi.org\/10.18653\/v1\/D17-1209","DOI":"10.18653\/v1\/D17-1209"},{"key":"742_CR28","doi-asserted-by":"publisher","unstructured":"Nie YY, Tian YH, Wan X, Song Y, Dai B (2020) Named entity recognition for social media texts with semantic augmentation. In: Proceedings of the 2020 conference on empirical methods in natural language processing, pp 1383\u20131391. https:\/\/doi.org\/10.18653\/v1\/2020.emnlp-main.107","DOI":"10.18653\/v1\/2020.emnlp-main.107"},{"key":"742_CR29","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2019.2940439","author":"X Chen","year":"2019","unstructured":"Chen X, Chen CY, Zhang D, Xing ZC (2019) SEthesaurus: WordNet in software engineering. IEEE Trans Softw Eng. https:\/\/doi.org\/10.1109\/TSE.2019.2940439","journal-title":"IEEE Trans Softw Eng"},{"key":"742_CR30","doi-asserted-by":"publisher","unstructured":"Tian Y, Lo D, Lawall J (2014) Automated construction of a software-specific word similarity database. In: 2014 software evolution week\u2014IEEE conference on software maintenance, reengineering, and reverse engineering, Antwerp, Belgium, pp 44\u201353. https:\/\/doi.org\/10.1109\/CSMR-WCRE.2014.6747213","DOI":"10.1109\/CSMR-WCRE.2014.6747213"},{"key":"742_CR31","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gonez AN, et al (2017) Attention is all you need. In: Proceedings of the 31th conference on neural information processing systems, pp 5998\u20136008"},{"key":"742_CR32","doi-asserted-by":"publisher","unstructured":"Margatina K, Baziotis C, Potamianos A (2019) Attention-based conditioning methods for external knowledge integration. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy, pp 3944\u20133951. https:\/\/doi.org\/10.18653\/v1\/P19-1385","DOI":"10.18653\/v1\/P19-1385"},{"issue":"1","key":"742_CR33","doi-asserted-by":"publisher","first-page":"471","DOI":"10.32604\/cmc.2021.016054","volume":"68","author":"N Alsaaran","year":"2021","unstructured":"Alsaaran N, Alrabiah M (2021) Arabic named entity recognition: a BERT-BGRU approach. Comput Mater Continua 68(1):471\u2013485. https:\/\/doi.org\/10.32604\/cmc.2021.016054","journal-title":"Comput Mater Continua"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-022-00742-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-022-00742-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-022-00742-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,22]],"date-time":"2023-02-22T18:45:02Z","timestamp":1677091502000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-022-00742-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,2]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,2]]}},"alternative-id":["742"],"URL":"https:\/\/doi.org\/10.1007\/s40747-022-00742-5","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,6,2]]},"assertion":[{"value":"12 May 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 March 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 June 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}