{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T05:30:08Z","timestamp":1775107808465,"version":"3.50.1"},"reference-count":63,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T00:00:00Z","timestamp":1775088000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T00:00:00Z","timestamp":1775088000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Cybersecurity"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Cyberspace Surveying and Mapping (CSM) involves the identification and analysis of digital assets to support network management and security, yet its domain-specific named entity recognition (NER) remains underexplored. A key challenge is the semantic gap between general-domain corpora and CSM domain texts, the suboptimal performance of existing named entity recognition (NER) models in accurately identifying entities within CSM data. To tackle obstacles, we proposed a NER model CyMapNER for the CSM domain. A clear definition of named entity categories pertinent to the CSM domain was established initially, followed by the creation of a dedicated NER dataset tailored to this domain. Subsequently, we present a domain adaptation training framework that integrates large language models. It combines with data augments, pseudo-labeling and domain-adaptive pretraining to enhance the adaptability of the NER model. The comparative experimental results demonstrate that CyMapNER models outperforms traditional NER models in CSM datasets. The results reveal that by domain adaptation training framework, the recognition accuracy of CyMapNER model reaches 97%, which achieves an improvement from 5.6% to 18.3% over the state-of-the-art NER models, and it performs well in recognizing complex and sparse entities, highlighting its effectiveness in handling the intricacies of CSM data.<\/jats:p>","DOI":"10.1186\/s42400-026-00578-3","type":"journal-article","created":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T04:01:21Z","timestamp":1775102481000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["CyMapNER: a named entity recognition model for cyberspace surveying and mapping domain"],"prefix":"10.1186","volume":"9","author":[{"given":"Jiancheng","family":"Zhang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4533-2706","authenticated-orcid":false,"given":"Fan","family":"Shi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chengxi","family":"Xu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ye","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xinyu","family":"Yin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mingyi","family":"Ge","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2026,4,2]]},"reference":[{"key":"578_CR1","doi-asserted-by":"publisher","unstructured":"Aghaei E, Niu X, Shadid W, Al-Shaer E (2023) Securebert: a domain-specific language model for cybersecurity. In: Li F, Liang K, Lin Z, Katsikas SK (eds) Security and Privacy in Communication Networks, pp 39\u201356. Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-031-25538-0_3","DOI":"10.1007\/978-3-031-25538-0_3"},{"key":"578_CR2","doi-asserted-by":"publisher","unstructured":"Aravind PC, Arikkat DR, Krishnan AS, Tesneem B, Sebastian A, Dev MJ, Aswathy KR, Rehiman KAR, Vinod P (2024) Cytie: cyber threat intelligence extraction with named entity recognition. In: Rajagopal S, Popat K, Meva D, Bajeja S (eds) Advancements in Smart Computing and Information Security, pp 163\u2013178. Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-031-59100-6_13","DOI":"10.1007\/978-3-031-59100-6_13"},{"issue":"2","key":"578_CR3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3652594","volume":"27","author":"M Bayer","year":"2024","unstructured":"Bayer M, Kuehn P, Shanehsaz R, Reuter C (2024) Cysecbert: a domain-adapted language model for the cybersecurity domain. ACM Trans Priv Secur 27(2):1\u201320. https:\/\/doi.org\/10.1145\/3652594","journal-title":"ACM Trans Priv Secur"},{"key":"578_CR4","doi-asserted-by":"publisher","unstructured":"Bogdanov S, Constantin A, Bernard T, Crabb\u00e9 B, Bernard EP (2024) NuNER: entity recognition encoder pre-training via LLM-annotated data. In: Al-Onaizan Y, Bansal M, Chen Y-N (eds) Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp 11829\u201311841. Association for Computational Linguistics, Miami, Florida, USA. https:\/\/doi.org\/10.18653\/v1\/2024.emnlp-main.660 . https:\/\/aclanthology.org\/2024.emnlp-main.660\/","DOI":"10.18653\/v1\/2024.emnlp-main.660"},{"key":"578_CR5","unstructured":"Borthwick A, Sterling J, Agichtein E, Grishman R (1998) NYU: description of the MENE named entity system as used in MUC-7. In: Proceedings of the 7th message understanding conference (MUC-7), pp 287\u2013300. Association for Computational Linguistics, Stroudsburg, PA, USA. https:\/\/aclanthology.org\/M98-1018\/"},{"key":"578_CR6","unstructured":"Censys: Censys Search Engine. https:\/\/censys.io\/ (2025)"},{"key":"578_CR7","doi-asserted-by":"publisher","unstructured":"Chen Y, Ding J, Li D, Chen Z (2021) Joint bert model based cybersecurity named entity recognition. In: Proceedings of the 2021 4th International Conference on Software Engineering and Information Management. ICSIM \u201921, pp 236\u2013242. Association for Computing Machinery, Stroudsburg, PA, USA. https:\/\/doi.org\/10.1145\/3451471.3451508","DOI":"10.1145\/3451471.3451508"},{"key":"578_CR8","doi-asserted-by":"publisher","unstructured":"Deka P, Rajapaksha S, Rani R, Almutairi A, Karafili E (2025) Attacker: towards enhancing cyber-attack attribution with a named entity recognition dataset. In: Barhamgi M, Wang H, Wang X (eds.) Web Information Systems Engineering \u2013 WISE 2024, pp 255\u2013270. Springer, Singapore. https:\/\/doi.org\/10.1007\/978-981-96-0576-7_20","DOI":"10.1007\/978-981-96-0576-7_20"},{"key":"578_CR9","doi-asserted-by":"publisher","unstructured":"Devlin J, Chang M-W, Lee K, Toutanova K (2019) Bert: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp 4171\u20134186. Association for Computational Linguistics, Stroudsburg, PA, USA. https:\/\/doi.org\/10.18653\/v1\/N19-1423","DOI":"10.18653\/v1\/N19-1423"},{"key":"578_CR10","doi-asserted-by":"publisher","unstructured":"Ding B, Qin C, Zhao R, Luo T, Li X, Chen G, Xia W, Hu J, Luu AT, Joty S (2024) Data Augmentation Using LLMs: Data Perspectives, Learning Paradigms and Challenges. In: Ku L-W, Martins A, Srikumar V (eds) Findings of the Association for Computational Linguistics: ACL 2024, pp 1679\u20131705. Association for Computational Linguistics, Bangkok, Thailand. https:\/\/doi.org\/10.18653\/v1\/2024.findings-acl.97","DOI":"10.18653\/v1\/2024.findings-acl.97"},{"issue":"8","key":"578_CR11","doi-asserted-by":"publisher","first-page":"1215","DOI":"10.1093\/comjnl\/bxaa141","volume":"64","author":"Y Fang","year":"2020","unstructured":"Fang Y, Zhang Y, Huang C (2020) Cybereyes: cybersecurity entity recognition model based on graph convolutional network. Comput J 64(8):1215\u20131225. https:\/\/doi.org\/10.1093\/comjnl\/bxaa141","journal-title":"Comput J"},{"key":"578_CR12","unstructured":"FOFA: FOFA Search Engine. https:\/\/fofa.info\/ (2025)"},{"issue":"9","key":"578_CR13","doi-asserted-by":"publisher","first-page":"1153","DOI":"10.1631\/FITEE.2000286","volume":"22","author":"C Gao","year":"2021","unstructured":"Gao C, Zhang X, Han M, Liu H (2021) A review on cyber security named entity recognition. Front Inf Technol Electronic Eng 22(9):1153\u20131168. https:\/\/doi.org\/10.1631\/FITEE.2000286","journal-title":"Front Inf Technol Electronic Eng"},{"issue":"1","key":"578_CR14","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1186\/s42400-021-00072-y","volume":"4","author":"C Gao","year":"2021","unstructured":"Gao C, Zhang X, Liu H (2021) Data and knowledge-driven named entity recognition for cyber security. Cybersecurity 4(1):9. https:\/\/doi.org\/10.1186\/s42400-021-00072-y","journal-title":"Cybersecurity"},{"key":"578_CR15","doi-asserted-by":"crossref","unstructured":"Gasmi H, Laval J, Bouras A (2018) Lstm recurrent neural networks for cybersecurity named entity recognition. In: Proceedings of the 13th International Conference on Software Engineering (ICSEA 2018), pp 1\u20136. IARIA, Wilmington. https:\/\/hal.science\/hal-04680741\/document","DOI":"10.1109\/CSET.2019.8904905"},{"key":"578_CR16","doi-asserted-by":"publisher","unstructured":"Grishman R, Sundheim B (1996) Message understanding conference- 6: a brief history. In: COLING 1996 Volume 1: The 16th International Conference on Computational Linguistics, pp 466\u2013471. Association for Computational Linguistics, Stroudsburg, PA, USA. https:\/\/doi.org\/10.3115\/992628.992709","DOI":"10.3115\/992628.992709"},{"key":"578_CR17","doi-asserted-by":"publisher","unstructured":"Gui T, Zou Y, Zhang Q, Peng M, Fu J, Wei Z, Huang X (2019) A lexicon-based graph neural network for Chinese NER. In: Inui K, Jiang J, Ng V, Wan X (eds) Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp 1040\u20131050. Association for Computational Linguistics, Hong Kong, China. https:\/\/doi.org\/10.18653\/v1\/D19-1096 . https:\/\/aclanthology.org\/D19-1096\/","DOI":"10.18653\/v1\/D19-1096"},{"key":"578_CR18","doi-asserted-by":"publisher","unstructured":"Gururangan S, Marasovi\u0107 A, Swayamdipta S, Lo K, Beltagy I, Downey D, Smith NA (2020) Don\u2019t stop pretraining: adapt language models to domains and tasks. In: Jurafsky D, Chai J, Schluter N, Tetreault J (eds) Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp 8342\u20138360. https:\/\/doi.org\/10.18653\/v1\/2020.acl-main.740","DOI":"10.18653\/v1\/2020.acl-main.740"},{"issue":"1","key":"578_CR19","doi-asserted-by":"publisher","first-page":"096","DOI":"10.1093\/bioadv\/vbaf096","volume":"5","author":"BG Happi Happi","year":"2025","unstructured":"Happi Happi BG, Pelap GF, Symeonidou D, Larmande P (2025) GRU-SCANET: unleashing the power of GRU-based sinusoidal capture network for precision-driven named entity recognition. Bioinformat Adv 5(1):096. https:\/\/doi.org\/10.1093\/bioadv\/vbaf096","journal-title":"Bioinformat Adv"},{"key":"578_CR20","doi-asserted-by":"publisher","unstructured":"He X, Lin Z, Gong Y, Jin A-L, Zhang H, Lin C, Jiao J, Yiu SM, Duan N, Chen W (2024) AnnoLLM: making large language models to be better crowdsourced annotators. In: Yang Y, Davani A, Sil A, Kumar A (eds) Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 6: Industry Track), pp. 165\u2013190. Association for Computational Linguistics, Mexico City, Mexico . https:\/\/doi.org\/10.18653\/v1\/2024.naacl-industry.15","DOI":"10.18653\/v1\/2024.naacl-industry.15"},{"key":"578_CR21","unstructured":"Jackaduma: SecBERT. https:\/\/github.com\/jackaduma\/SecBERT (2022)"},{"issue":"1","key":"578_CR22","doi-asserted-by":"publisher","first-page":"116","DOI":"10.1186\/s42400-025-00503-0","volume":"8","author":"Y Jiang","year":"2025","unstructured":"Jiang Y, Hu H, Li Y, Li F, Zhao C, Chen C, Liu Y (2025) A zero-shot self-improving NER method for cyber threat intelligence via knowledge injection. Cybersecurity 8(1):116. https:\/\/doi.org\/10.1186\/s42400-025-00503-0","journal-title":"Cybersecurity"},{"key":"578_CR23","doi-asserted-by":"crossref","unstructured":"Kim\u00a0Sang EFT, De\u00a0Meulder F (2003) Introduction to the CoNLL-2003 shared task: language-independent named entity recognition. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003, pp 142\u2013147. https:\/\/aclanthology.org\/W03-0419\/","DOI":"10.3115\/1119176.1119195"},{"key":"578_CR24","doi-asserted-by":"publisher","unstructured":"Kim J-H, Woodland P (2000) A rule-based named entity recognition system for speech input. In: Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), pp 528\u2013531. ISCA, Beijing, China. https:\/\/doi.org\/10.21437\/ICSLP.2000-131","DOI":"10.21437\/ICSLP.2000-131"},{"key":"578_CR25","unstructured":"Lee D-H (2013) Pseudo-label: the simple and efficient semi-supervised learning method for deep neural networks. In: ICML 2013 Workshop: Challenges in Representation Learning (WREPL), vol 3, p 896. https:\/\/api.semanticscholar.org\/CorpusID:18507866"},{"key":"578_CR26","unstructured":"Levow G-A (2006) The third international Chinese language processing bakeoff: word segmentation and named entity recognition. In: Ng HT, Kwong OOY (eds) Proceedings of the Fifth Sighan Workshop on Chinese Language Processing, pp 108\u2013117. Association for Computational Linguistics, Sydney, Australia. https:\/\/aclanthology.org\/W06-0115\/"},{"issue":"6","key":"578_CR27","doi-asserted-by":"publisher","first-page":"903","DOI":"10.1631\/FITEE.1800743","volume":"21","author":"Z-Z Li","year":"2020","unstructured":"Li Z-Z, Feng D-W, Li D-S, Lu X-C (2020) Learning to select pseudo labels: a semi-supervised method for named entity recognition. Front Inf Technol Electronic Eng 21(6):903\u2013916. https:\/\/doi.org\/10.1631\/FITEE.1800743","journal-title":"Front Inf Technol Electronic Eng"},{"key":"578_CR28","doi-asserted-by":"publisher","unstructured":"Li M, Shi T, Ziems C, Kan M-Y, Chen N, Liu Z, Yang D (2023) CoAnnotating: Uncertainty-guided work allocation between human and large language models for data annotation. In: Bouamor, H, Pino J, Bali K (eds) Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pp 1487\u20131505. Association for Computational Linguistics, Singapore. https:\/\/doi.org\/10.18653\/v1\/2023.emnlp-main.92","DOI":"10.18653\/v1\/2023.emnlp-main.92"},{"key":"578_CR29","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1016\/j.aiopen.2022.03.001","volume":"3","author":"B Li","year":"2022","unstructured":"Li B, Hou Y, Che W (2022) Data augmentation approaches in natural language processing: a survey. AI Open 3:71\u201390. https:\/\/doi.org\/10.1016\/j.aiopen.2022.03.001","journal-title":"AI Open"},{"key":"578_CR30","doi-asserted-by":"publisher","unstructured":"Liu X, Lin W, Ding Z (2026) Cyberner-llm: cyber threat intelligence named entity recognition with large language model. In: Han J, Xiang Y, Cheng G, Susilo W, Chen L (eds) Information and Communications Security, pp 513\u2013530. Springer, Singapore. https:\/\/doi.org\/10.1007\/978-981-95-3543-9_28","DOI":"10.1007\/978-981-95-3543-9_28"},{"key":"578_CR31","doi-asserted-by":"publisher","unstructured":"Ma X, Hovy E (2016) End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF. In: Erk K, Smith NA (eds) Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp 1064\u20131074. Association for Computational Linguistics, Berlin, Germany. https:\/\/doi.org\/10.18653\/v1\/P16-1101 . https:\/\/aclanthology.org\/P16-1101\/","DOI":"10.18653\/v1\/P16-1101"},{"key":"578_CR32","doi-asserted-by":"crossref","unstructured":"McCallum A, Li W (2003) Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003 - Vol 4, pp 188\u2013191. Association for Computational Linguistics, Stroudsburg, PA, USA. https:\/\/aclanthology.org\/W03-0430\/","DOI":"10.3115\/1119176.1119206"},{"key":"578_CR33","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2025.110992","volume":"156","author":"H Ming","year":"2025","unstructured":"Ming H, Yang J, Liu S, Jiang L, An N (2025) Harnessing high-quality pseudo-labels for robust few-shot nested named entity recognition. Eng Appl Artif Intell 156:110992. https:\/\/doi.org\/10.1016\/j.engappai.2025.110992","journal-title":"Eng Appl Artif Intell"},{"key":"578_CR34","doi-asserted-by":"publisher","unstructured":"Park Y, You W (2023) A pretrained language model for cyber threat intelligence. In: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track, pp 113\u2013122. Association for Computational Linguistics, Singapore. https:\/\/doi.org\/10.18653\/v1\/2023.emnlp-industry.12","DOI":"10.18653\/v1\/2023.emnlp-industry.12"},{"key":"578_CR35","unstructured":"Pradhan S, Moschitti A, Xue N, Ng HT, Bj\u00f6rkelund A, Uryupina O, Zhang Y, Zhong Z (2013) Towards robust linguistic analysis using OntoNotes. In: Hockenmaier J, Riedel S (eds) Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pp 143\u2013152. Association for Computational Linguistics, Sofia, Bulgaria. https:\/\/aclanthology.org\/W13-3516\/"},{"key":"578_CR36","doi-asserted-by":"publisher","unstructured":"Qiao Z, Zhang C, Du G (2023) Improving cybersecurity named entity recognition with large language models. In: 2023 6th International Conference on Software Engineering and Computer Science (CSECS), pp 01\u201306. https:\/\/doi.org\/10.1109\/CSECS60003.2023.10428218","DOI":"10.1109\/CSECS60003.2023.10428218"},{"key":"578_CR37","doi-asserted-by":"publisher","first-page":"872","DOI":"10.1631\/FITEE.1800520","volume":"20","author":"Y Qin","year":"2019","unstructured":"Qin Y, Shen G, Zhao W-B, Chen Y, Yu M, Jin X (2019) A network security entity recognition method based on feature template and CNN-BILSTM-CRT. Front Inf Technol Electronic Eng 20:872\u2013884. https:\/\/doi.org\/10.1631\/FITEE.1800520","journal-title":"Front Inf Technol Electronic Eng"},{"key":"578_CR38","doi-asserted-by":"publisher","unstructured":"Ranade P, Piplai A, Joshi A, Finin T (2021) Cybert: contextualized embeddings for the cybersecurity domain. In: 2021 IEEE International Conference on Big Data (Big Data), pp 3334\u20133342. https:\/\/doi.org\/10.1109\/BigData52589.2021.9671824","DOI":"10.1109\/BigData52589.2021.9671824"},{"key":"578_CR39","doi-asserted-by":"publisher","unstructured":"Rau LF (1991) Extracting company names from text. In: Proceedings. The Seventh IEEE Conference on Artificial Intelligence Application, pp 29\u201332. IEEE, Piscataway, NJ, USA. https:\/\/doi.org\/10.1109\/CAIA.1991.120841","DOI":"10.1109\/CAIA.1991.120841"},{"key":"578_CR40","doi-asserted-by":"publisher","unstructured":"Santoso J, Sutanto P, Cahyadi B, Setiawan E (2024) Pushing the limits of low-resource NER using LLM artificial data generation. In: Ku L-W, Martins A, Srikumar V (eds) Findings of the Association for Computational Linguistics: ACL 2024, pp 9652\u20139667. Association for Computational Linguistics, Bangkok, Thailand. https:\/\/doi.org\/10.18653\/v1\/2024.findings-acl.575 . https:\/\/aclanthology.org\/2024.findings-acl.575\/","DOI":"10.18653\/v1\/2024.findings-acl.575"},{"key":"578_CR41","unstructured":"Shodan: Shodan Search Engine. https:\/\/www.shodan.io\/ (2025)"},{"key":"578_CR42","doi-asserted-by":"publisher","unstructured":"Soltani S, Limouni E (2025) Llm based data annotation and augmentation for ner and relationship extraction models enhancement. In: Verdejo D, Mercier-Laurent E (eds) Artificial Intelligence for Global Security, pp 153\u2013160. Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-031-96522-7_12","DOI":"10.1007\/978-3-031-96522-7_12"},{"key":"578_CR43","unstructured":"Spark-Lite: Spark-Lite. https:\/\/xinghuo.xfyun.cn\/sparkapi (2025)"},{"key":"578_CR44","unstructured":"Syed Z, Padia A, Finin TW, Mathews ML, Joshi A (2016) Uco: a unified cybersecurity ontology. In: AAAI Workshop: Artificial Intelligence for Cyber Security, pp 195\u2013202. Association for the Advancement of Artificial Intelligence (AAAI), Menlo Park. https:\/\/cdn.aaai.org\/ocs\/ws\/ws0163\/12574-57427-1-PB.pdf"},{"key":"578_CR45","doi-asserted-by":"publisher","unstructured":"Tikhomirov M, Loukachevitch N, Sirotina A, Dobrov B (2020) Using bert and augmentation in named entity recognition for cybersecurity domain. In: M\u00e9tais E, Meziane F, Horacek H, Cimiano P (eds) Natural Language Processing and Information Systems, pp 16\u201324. Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-030-51310-8_2","DOI":"10.1007\/978-3-030-51310-8_2"},{"key":"578_CR46","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Proceedings of the 31st international Conference on Neural Information Processing Systems. NIPS\u201917, pp 6000\u20136010. Curran Associates Inc., Red Hook, NY, USA"},{"key":"578_CR47","doi-asserted-by":"publisher","unstructured":"Wang X, Liu X, Ao S, Li N, Jiang Z, Xu Z, Xiong Z, Xiong M, Zhang X (2020) DNRTI: a large-scale dataset for named entity recognition in threat intelligence. In: 2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom), pp 1842\u20131848. https:\/\/doi.org\/10.1109\/TrustCom50675.2020.00252","DOI":"10.1109\/TrustCom50675.2020.00252"},{"key":"578_CR48","doi-asserted-by":"publisher","unstructured":"Wang X, Liu R, Yang J, Chen R, Ling Z, Yang P, Zhang K (2022) Cyber threat intelligence entity extraction based on deep learning and field knowledge engineering. In: 2022 IEEE 25th International Conference on Computer Supported Cooperative Work in Design (CSCWD), pp. 406\u2013413. IEEE, Piscataway, NJ, USA. https:\/\/doi.org\/10.1109\/CSCWD54268.2022.9776139","DOI":"10.1109\/CSCWD54268.2022.9776139"},{"key":"578_CR49","doi-asserted-by":"publisher","unstructured":"Wang P, Zhao Z, Wen H, Wang F, Wang B, Zhang Q, Wang Y (2024) Llm-autoda: large language model-driven automatic data augmentation for long-tailed problems. In: Globerson A, Mackey L, Belgrave D, Fan A, Paquet U, Tomczak J, Zhang C (eds) Advances in Neural Information Processing Systems, Vol 37, pp 64915\u201364941. Curran Associates, Inc., Red Hook, NY. https:\/\/doi.org\/10.52202\/079017-2072","DOI":"10.52202\/079017-2072"},{"key":"578_CR50","doi-asserted-by":"publisher","unstructured":"Wang Z, Zhang J, Zhang X, Liu K, Wang P, Zhou Y (2025) Diversity-oriented data augmentation with large language models. In: Che W, Nabende J, Shutova E, Pilehvar MT (eds) Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp 22265\u201322283. Association for Computational Linguistics, Vienna, Austria. https:\/\/doi.org\/10.18653\/v1\/2025.acl-long.1084. https:\/\/aclanthology.org\/2025.acl-long.1084\/","DOI":"10.18653\/v1\/2025.acl-long.1084"},{"issue":"3","key":"578_CR51","doi-asserted-by":"publisher","first-page":"402","DOI":"10.1162\/dint_a_00105","volume":"3","author":"C Wen","year":"2021","unstructured":"Wen C, Chen T, Jia X, Zhu J (2021) Medical named entity recognition from un-labelled medical records based on pre-trained language models and domain dictionary. Data Intell 3(3):402\u2013417. https:\/\/doi.org\/10.1162\/dint_a_00105","journal-title":"Data Intell"},{"key":"578_CR52","doi-asserted-by":"publisher","unstructured":"Xu R, Zhang Z, Rao Z, Chen J, Li M, Liu F, Pan S (2019) Cyberspace surveying and mapping: hierarchical model and resource formalization. In: IEEE INFOCOM 2019 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), pp 68\u201372. IEEE, Piscataway, NJ, USA. https:\/\/doi.org\/10.1109\/INFCOMW.2019.8845226","DOI":"10.1109\/INFCOMW.2019.8845226"},{"key":"578_CR53","doi-asserted-by":"publisher","first-page":"149061","DOI":"10.1109\/ACCESS.2024.3476481","volume":"12","author":"J Yang","year":"2024","unstructured":"Yang J, Li C, Chen X, Tan K, Zhao Y, Wang H, Wang J (2024) Research on method for fusion and mapping of cyberspace assets based on knowledge graph. IEEE Access 12:149061\u2013149075. https:\/\/doi.org\/10.1109\/ACCESS.2024.3476481","journal-title":"IEEE Access"},{"issue":"1","key":"578_CR54","doi-asserted-by":"publisher","first-page":"106","DOI":"10.1186\/s42400-025-00505-y","volume":"9","author":"X Yang","year":"2026","unstructured":"Yang X, Zhong R, Chen Y, Peng G, Yao D, Chen C, Wang C, Zhang D, Zhou Y, Yang Z (2026) CTI-thinker: an LLM-driven system for CTI knowledge graph construction and attack reasoning. Cybersecurity 9(1):106. https:\/\/doi.org\/10.1186\/s42400-025-00505-y","journal-title":"Cybersecurity"},{"key":"578_CR55","doi-asserted-by":"publisher","unstructured":"Zhang N, Chen M, Bi Z, Liang X, Li L, Shang X, Yin K, Tan C, Xu J, Huang F, Si L, Ni Y, Xie G, Sui Z, Chang B, Zong H, Yuan Z, Li L, Yan J, Zan H, Zhang K, Tang B, Chen Q (2022) CBLUE: a Chinese biomedical language understanding evaluation benchmark. In: Muresan S, Nakov P, Villavicencio A (eds) Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp 7888\u20137915. Association for Computational Linguistics, Dublin, Ireland. https:\/\/doi.org\/10.18653\/v1\/2022.acl-long.544. https:\/\/aclanthology.org\/2022.acl-long.544\/","DOI":"10.18653\/v1\/2022.acl-long.544"},{"key":"578_CR56","doi-asserted-by":"publisher","unstructured":"Zhang R, Li Y, Ma Y, Zhou M, Zou L (2023) LLMaAA: making large language models as active annotators. In: Bouamor H, Pino J, Bali K (eds) Findings of the Association for Computational Linguistics: EMNLP 2023, pp 13088\u201313103. Association for Computational Linguistics, Singapore. https:\/\/doi.org\/10.18653\/v1\/2023.findings-emnlp.872","DOI":"10.18653\/v1\/2023.findings-emnlp.872"},{"key":"578_CR57","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2025.114183","volume":"328","author":"H Zhang","year":"2025","unstructured":"Zhang H, Wu T, Zhu T, Wen S, Xiang Y (2025) Cyberllama: a fine-tuned large language model for cybersecurity named entity recognition. Knowl-Based Syst 328:114183. https:\/\/doi.org\/10.1016\/j.knosys.2025.114183","journal-title":"Knowl-Based Syst"},{"key":"578_CR58","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2025.126651","volume":"271","author":"Y Zhang","year":"2025","unstructured":"Zhang Y, Liu J, Zhong X, Wu L (2025) SecLMNER: a framework for enhanced named entity recognition in multi-source cybersecurity data using large language models. Expert Syst Appl 271:126651. https:\/\/doi.org\/10.1016\/j.eswa.2025.126651","journal-title":"Expert Syst Appl"},{"key":"578_CR59","doi-asserted-by":"publisher","DOI":"10.1016\/j.cose.2024.104220","volume":"150","author":"Y Zhang","year":"2025","unstructured":"Zhang Y, Du T, Ma Y, Wang X, Xie Y, Yang G, Lu Y, Chang E-C (2025) AttacKG+: boosting attack graph construction with large language models. Comput Secur 150:104220. https:\/\/doi.org\/10.1016\/j.cose.2024.104220","journal-title":"Comput Secur"},{"key":"578_CR60","doi-asserted-by":"publisher","DOI":"10.1016\/j.cose.2023.103524","volume":"136","author":"X Zhao","year":"2024","unstructured":"Zhao X, Jiang R, Han Y, Li A, Peng Z (2024) A survey on cybersecurity knowledge graph construction. Comput Secur 136:103524. https:\/\/doi.org\/10.1016\/j.cose.2023.103524","journal-title":"Comput Secur"},{"key":"578_CR61","doi-asserted-by":"publisher","unstructured":"Zhou G, Su J (2002) Named entity recognition using an hmm-based chunk tagger. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp 473\u2013480. Association for Computational Linguistics, Stroudsburg, PA, USA. https:\/\/doi.org\/10.3115\/1073083.1073163","DOI":"10.3115\/1073083.1073163"},{"key":"578_CR62","unstructured":"ZoomEye: ZoomEye Search Engine. https:\/\/www.zoomeye.org\/ (2025)"},{"key":"578_CR63","doi-asserted-by":"publisher","unstructured":"\u017dukov-Gregori\u010d A, Bachrach Y, Coope S (2018) Named entity recognition with parallel recurrent neural networks. In: Gurevych I, Miyao Y (eds) Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp 69\u201374. Association for Computational Linguistics, Melbourne, Australia. https:\/\/doi.org\/10.18653\/v1\/P18-2012 . https:\/\/aclanthology.org\/P18-2012\/","DOI":"10.18653\/v1\/P18-2012"}],"container-title":["Cybersecurity"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s42400-026-00578-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s42400-026-00578-3","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s42400-026-00578-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T04:01:29Z","timestamp":1775102489000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1186\/s42400-026-00578-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,4,2]]},"references-count":63,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2026,12]]}},"alternative-id":["578"],"URL":"https:\/\/doi.org\/10.1186\/s42400-026-00578-3","relation":{},"ISSN":["2523-3246"],"issn-type":[{"value":"2523-3246","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,4,2]]},"assertion":[{"value":"5 January 2026","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 March 2026","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 April 2026","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"146"}}