{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,18]],"date-time":"2026-06-18T15:40:59Z","timestamp":1781797259292,"version":"3.54.5"},"reference-count":54,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2023,10,12]],"date-time":"2023-10-12T00:00:00Z","timestamp":1697068800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["U20A2091"],"award-info":[{"award-number":["U20A2091"]}]},{"name":"National Natural Science Foundation of China","award":["41771426"],"award-info":[{"award-number":["41771426"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>The rapid development of information technology has made the amount of information in massive texts far exceed human intuitive cognition, and dependency parsing can effectively deal with information overload. In the background of domain specialization, the migration and application of syntactic treebanks and the speed improvement in syntactic analysis models become the key to the efficiency of syntactic analysis. To realize domain migration of syntactic tree library and improve the speed of text parsing, this paper proposes a novel approach\u2014the Double-Array Trie and Multi-threading (DAT-MT) accelerated graph fusion dependency parsing model. It effectively combines the specialized syntactic features from small-scale professional field corpus with the generalized syntactic features from large-scale news corpus, which improves the accuracy of syntactic relation recognition. Aiming at the problem of high space and time complexity brought by the graph fusion model, the DAT-MT method is proposed. It realizes the rapid mapping of massive Chinese character features to the model\u2019s prior parameters and the parallel processing of calculation, thereby improving the parsing speed. The experimental results show that the unlabeled attachment score (UAS) and the labeled attachment score (LAS) of the model are improved by 13.34% and 14.82% compared with the model with only the professional field corpus and improved by 3.14% and 3.40% compared with the model only with news corpus; both indicators are better than DDParser and LTP 4 methods based on deep learning. Additionally, the method in this paper achieves a speedup of about 3.7 times compared to the method with a red-black tree index and a single thread. Efficient and accurate syntactic analysis methods will benefit the real-time processing of massive texts in professional fields, such as multi-dimensional semantic correlation, professional feature extraction, and domain knowledge graph construction.<\/jats:p>","DOI":"10.3390\/e25101444","type":"journal-article","created":{"date-parts":[[2023,10,12]],"date-time":"2023-10-12T07:28:44Z","timestamp":1697095724000},"page":"1444","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["DAT-MT Accelerated Graph Fusion Dependency Parsing Model for Small Samples in Professional Fields"],"prefix":"10.3390","volume":"25","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5167-2956","authenticated-orcid":false,"given":"Rui","family":"Li","sequence":"first","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Shili","family":"Shu","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6663-4960","authenticated-orcid":false,"given":"Shunli","family":"Wang","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4061-0631","authenticated-orcid":false,"given":"Yang","family":"Liu","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yanhao","family":"Li","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430072, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mingjun","family":"Peng","sequence":"additional","affiliation":[{"name":"Wuhan Geomatics Institute, Wuhan 430079, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2023,10,12]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"979","DOI":"10.1109\/TASLP.2022.3153261","article-title":"Improving Chinese Named Entity Recognition by Large-Scale Syntactic Dependency Graph","volume":"30","author":"Zhu","year":"2022","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Wang, S., Li, R., and Wu, H. (2023). Integrating machine learning with linguistic features: A universal method for extraction and normalization of temporal expressions in Chinese texts. Comput. Methods Programs Biomed., 233.","DOI":"10.1016\/j.cmpb.2023.107474"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"629","DOI":"10.1016\/j.neucom.2022.06.072","article-title":"Conversational emotion recognition studies based on graph convolutional neural networks and a dependent syntactic analysis","volume":"501","author":"Shou","year":"2022","journal-title":"Neurocomputing"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"17078","DOI":"10.1109\/ACCESS.2022.3149798","article-title":"Framework for Deep Learning-Based Language Models Using Multi-Task Learning in Natural Language Understanding: A Systematic Literature Review and Future Directions","volume":"10","author":"Samant","year":"2022","journal-title":"IEEE Access"},{"key":"ref_5","unstructured":"Ruggeri, F. (2022). Towards Unstructured Knowledge Integration in Natural Language Processing. [Ph.D. Thesis, Alma Mater Studiorum-Universit\u00e0 di Bologna]."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"132367","DOI":"10.1109\/ACCESS.2020.3002863","article-title":"BERT-Based Chinese Relation Extraction for Public Security","volume":"8","author":"Hou","year":"2020","journal-title":"IEEE Access"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Wang, X., Cao, Y., and Mao, B. (2020, January 14\u201317). Spatio-temporal Semantic Analysis of Safety Production Accidents in Grain Depot based on Natural Language Processing. Proceedings of the 2020 IEEE\/WIC\/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), Melbourne, Australia.","DOI":"10.1109\/WIIAT50758.2020.00142"},{"key":"ref_8","first-page":"473","article-title":"A Theory-based Deep-Learning Approach to Detecting Disinformation in Financial Social Media","volume":"25","author":"Chung","year":"2023","journal-title":"Inf. Syst. Front."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"10937","DOI":"10.1007\/s13369-023-07694-z","article-title":"Syntactic-Semantic Similarity Based on Dependency Tree Kernel","volume":"48","author":"Alian","year":"2023","journal-title":"Arab. J. Sci. Eng."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1898","DOI":"10.1007\/s11431-020-1666-4","article-title":"A survey of syntactic-semantic parsing based on constituent and dependency structures","volume":"63","author":"Zhang","year":"2020","journal-title":"Sci. China Technol. Sci."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"605","DOI":"10.1016\/j.tics.2014.08.001","article-title":"Trends in syntactic parsing: Anticipation, Bayesian estimation, and good-enough parsing","volume":"18","author":"Traxler","year":"2014","journal-title":"Trends Cogn. Sci."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"103427","DOI":"10.1016\/j.artint.2020.103427","article-title":"Dependency-based syntax-aware word representations","volume":"292","author":"Zhang","year":"2021","journal-title":"Artif. Intell."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Umair, A., Sarfraz, M.S., Ahmad, M., Habib, U., Ullah, M.H., and Mazzara, M. (2020). Spatiotemporal Analysis of Web News Archives for Crime Prediction. Appl. Sci., 10.","DOI":"10.3390\/app10228220"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1039","DOI":"10.1016\/j.ijmedinf.2015.06.007","article-title":"NLP based congestive heart failure case finding: A prospective analysis on statewide electronic medical records","volume":"84","author":"Wang","year":"2015","journal-title":"Int. J. Med Inform."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Zhong, J., Gao, C., and Yi, X. (2018, January 18\u201320). Categorization of Patient Disease into ICD-10 with NLP and SVM for Chinese Electronic Health Record Analysis. Proceedings of the 2018 International Conference on Artificial Intelligence and Pattern Recognition, Beijing, China.","DOI":"10.1145\/3268866.3268877"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Gorczyca, M.T., McDonald, T.M., Goodwyn, T.A., and David, P.F. (2020, January 21). A comparison of language representation models on small text corpora of scientific and technical documents. Proceedings of the Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications II, Online.","DOI":"10.1117\/12.2557891"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1016\/j.future.2020.07.043","article-title":"Measuring the short text similarity based on semantic and syntactic information","volume":"114","author":"Yang","year":"2021","journal-title":"Futur. Gener. Comput. Syst."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Ma, S., Sun, X., Zhang, Y., and Wei, B. (2018, January 26\u201330). Accelerating Graph-Based Dependency Parsing with Lock-Free Parallel Perceptron. Proceedings of the CCF International Conference on Natural Language Processing and Chinese Computing, Hohhot, China.","DOI":"10.1007\/978-3-319-99495-6_22"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"e1382","DOI":"10.1002\/widm.1382","article-title":"Automatic question generation","volume":"10","author":"Last","year":"2020","journal-title":"Wiley Interdiscip. Rev. Data Min. Knowl. Discov."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"11732","DOI":"10.1073\/pnas.1421236112","article-title":"An architecture for encoding sentence meaning in left mid-superior temporal cortex","volume":"112","author":"Frankland","year":"2015","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1162\/tacl_a_00103","article-title":"Multi-lingual Dependency Parsing Evaluation: A Large-scale Analysis of Word Order Properties using Artificial Data","volume":"4","author":"Gulordava","year":"2016","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1921","DOI":"10.1007\/s11431-020-1685-2","article-title":"Representation learning in discourse parsing: A survey","volume":"63","author":"Song","year":"2020","journal-title":"Sci. China Technol. Sci."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"397","DOI":"10.1007\/s10579-014-9290-3","article-title":"The Chinese Discourse TreeBank: A Chinese corpus annotated with discourse relations","volume":"49","author":"Zhou","year":"2015","journal-title":"Lang. Resour. Eval."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1162\/tacl_a_00113","article-title":"The Galactic Dependencies Treebanks: Getting More Data by Synthesizing New Languages","volume":"4","author":"Wang","year":"2016","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1161","DOI":"10.1007\/s10115-022-01665-w","article-title":"A survey on extraction of causal relations from natural language text","volume":"64","author":"Yang","year":"2022","journal-title":"Knowl. Inf. Syst."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Sato, M., Manabe, H., Noji, H., and Matsumoto, Y. (2017, January 3\u20134). Adversarial training for cross-domain universal dependency parsing. Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Vancouver, BC, Canada.","DOI":"10.18653\/v1\/K17-3007"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Yu, J., El-karef, M., and Bohnet, B. (2015, January 5\u20137). Domain adaptation for dependency parsing via self-training. Proceedings of the 14th International Conference on Parsing Technologies, Bilbao, Spain.","DOI":"10.18653\/v1\/W15-2201"},{"key":"ref_28","unstructured":"Li, Z., Liu, T., and Che, W. (2012, January 8\u201314). Exploiting multiple treebanks for parsing with quasi-synchronous grammars. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jeju Island, Republic of Korea."},{"key":"ref_29","first-page":"25","article-title":"Construction of Chinese Sentence-Category Dependency Treebank","volume":"1","author":"Wang","year":"2013","journal-title":"Acta Sci. Nat. Univ. Pekin."},{"key":"ref_30","first-page":"102","article-title":"Unified framework for hybrid dependency parsing","volume":"45","author":"Wu","year":"2016","journal-title":"J. Univ. Electron. Sci. Technol. China"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Liu, F., Li, J., and Zhang, L. (2023). Syntax and Domain Aware Model for Unsupervised Program Translation. arXiv.","DOI":"10.1109\/ICSE48619.2023.00072"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"130","DOI":"10.1016\/j.jvlc.2015.10.017","article-title":"A hadoop based platform for natural language processing of web pages and documents","volume":"31","author":"Nesi","year":"2015","journal-title":"J. Vis. Lang. Comput."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"102122","DOI":"10.1016\/j.ipm.2019.102122","article-title":"Towards a real-time processing framework based on improved distributed recurrent neural network variants with fastText for social big data analytics","volume":"57","author":"Hammou","year":"2020","journal-title":"Inf. Process. Manag."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"107760","DOI":"10.1016\/j.knosys.2021.107760","article-title":"Multitask Pointer Network for multi-representational parsing","volume":"236","year":"2022","journal-title":"Knowl. Based Syst."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1162\/coli_a_00425","article-title":"To augment or not to augment? A comparative study on text augmentation techniques for low-resource NLP","volume":"48","year":"2022","journal-title":"Comput. Linguist."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Li, Z., Chao, J., Zhang, M., and Yang, J. (2016, January 1\u20135). Fast coupled sequence labeling on heterogeneous annotations via context-aware pruning. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, TX, USA.","DOI":"10.18653\/v1\/D16-1072"},{"key":"ref_37","unstructured":"Goldberg, Y., and Elhadad, M. (2010, January 2\u20134). An efficient algorithm for easy-first non-directional dependency parsing. Proceedings of the Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Los Angeles, CA, USA."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Loglo, S. (2020, January 21\u201322). A Lexical Dependency Probability Model for Mongolian Based on Integration of Morphological and Syntactic Features. Proceedings of the 2nd International Conference on Computer Modeling, Simulation and Algorithm, Beijing, China.","DOI":"10.1088\/1742-6596\/1624\/2\/022030"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Anderson, M., and G\u00f3mez-Rodr\u00edguez, C. (2020). Distilling Neural Networks for Greener and Faster Dependency Parsing. arXiv.","DOI":"10.18653\/v1\/2020.iwpt-1.2"},{"key":"ref_40","unstructured":"Che, W., Li, Z., and Liu, T. (2010, January 23\u201327). Ltp: A chinese language technology platform. Proceedings of the Coling 2010: Demonstrations Volume, Beijing, China."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"071401","DOI":"10.1115\/1.4030159","article-title":"Latent Customer Needs Elicitation by Use Case Analogical Reasoning from Sentiment Analysis of Online Product Reviews","volume":"137","author":"Zhou","year":"2015","journal-title":"J. Mech. Des."},{"key":"ref_42","unstructured":"Zhao, J., and Wang, X.-L. (2002, January 4\u20135). Chinese POS tagging based on maximum entropy model. Proceedings of the International Conference on Machine Learning and Cybernetics, Beijing, China."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Zhao, H., and Kit, C. (2008, January 16\u201317). Parsing syntactic and semantic dependencies with two single-stage maximum entropy models. Proceedings of the CoNLL 2008: Twelfth Conference on Computational Natural Language Learning, Manchester, UK.","DOI":"10.3115\/1596324.1596360"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Li, S., Wang, L., Cao, Z., and Li, W. (2014, January 22\u201327). Text-level discourse dependency parsing. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Baltimore, MD, USA.","DOI":"10.3115\/v1\/P14-1003"},{"key":"ref_45","unstructured":"Nov\u00e1k, V., and \u017dabokrtsk\u00fd, Z. (2007). International Conference on Text, Speech and Dialogue, Springer."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"945","DOI":"10.4028\/www.scientific.net\/AMR.225-226.945","article-title":"Research of Chinese Segmentation Based on MMSeg and Double Array TRIE","volume":"225\u2013226","author":"Xu","year":"2011","journal-title":"Adv. Mater. Res."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1007\/s10115-015-0873-0","article-title":"A compression method of double-array structures using linear functions","volume":"48","author":"Kanda","year":"2016","journal-title":"Knowl. Inf. Syst."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"528","DOI":"10.1016\/j.future.2017.02.007","article-title":"Analysis of classic algorithms on highly-threaded many-core architectures","volume":"82","author":"Ma","year":"2018","journal-title":"Futur. Gener. Comput. Syst."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1016\/j.future.2018.02.016","article-title":"On the adequacy of lightweight thread approaches for high-level parallel programming models","volume":"84","author":"Mayo","year":"2018","journal-title":"Futur. Gener. Comput. Syst."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"695","DOI":"10.1002\/spe.4380220902","article-title":"An efficient implementation of trie structures","volume":"22","author":"Aoe","year":"1992","journal-title":"Softw. Pract. Exp."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2628913","article-title":"Analysis of Fork\/Join and Related Queueing Systems","volume":"47","author":"Thomasian","year":"2014","journal-title":"ACM Comput. Surv."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"202","DOI":"10.1016\/j.future.2013.06.020","article-title":"A memory access model for highly-threaded many-core architectures","volume":"30","author":"Ma","year":"2014","journal-title":"Futur. Gener. Comput. Syst."},{"key":"ref_53","unstructured":"Zhang, S., Wang, L., Sun, K., and Xiao, X. (2020). A practical Chinese dependency parser based on a large-scale dataset. arXiv."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Che, W., Feng, Y., Qin, L., and Liu, T. (2020). N-LTP: An Open-source Neural Language Technology Platform for Chinese. arXiv.","DOI":"10.18653\/v1\/2021.emnlp-demo.6"}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/25\/10\/1444\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:05:37Z","timestamp":1760130337000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/25\/10\/1444"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,12]]},"references-count":54,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2023,10]]}},"alternative-id":["e25101444"],"URL":"https:\/\/doi.org\/10.3390\/e25101444","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,12]]}}}