{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T17:56:52Z","timestamp":1770746212125,"version":"3.49.0"},"reference-count":58,"publisher":"Walter de Gruyter GmbH","issue":"3","license":[{"start":{"date-parts":[[2021,3,2]],"date-time":"2021-03-02T00:00:00Z","timestamp":1614643200000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Purpose<\/jats:title>\n                    <jats:p>Automatic keyphrase extraction (AKE) is an important task for grasping the main points of the text. In this paper, we aim to combine the benefits of sequence labeling formulation and pretrained language model to propose an automatic keyphrase extraction model for Chinese scientific research.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Design\/methodology\/approach<\/jats:title>\n                    <jats:p>We regard AKE from Chinese text as a character-level sequence labeling task to avoid segmentation errors of Chinese tokenizer and initialize our model with pretrained language model BERT, which was released by Google in 2018. We collect data from Chinese Science Citation Database and construct a large-scale dataset from medical domain, which contains 100,000 abstracts as training set, 6,000 abstracts as development set and 3,094 abstracts as test set. We use unsupervised keyphrase extraction methods including term frequency (TF), TF-IDF, TextRank and supervised machine learning methods including Conditional Random Field (CRF), Bidirectional Long Short Term Memory Network (BiLSTM), and BiLSTM-CRF as baselines. Experiments are designed to compare word-level and character-level sequence labeling approaches on supervised machine learning models and BERT-based models.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Findings<\/jats:title>\n                    <jats:p>Compared with character-level BiLSTM-CRF, the best baseline model with F1 score of 50.16%, our character-level sequence labeling model based on BERT obtains F1 score of 59.80%, getting 9.64% absolute improvement.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Research limitations<\/jats:title>\n                    <jats:p>We just consider automatic keyphrase extraction task rather than keyphrase generation task, so only keyphrases that are occurred in the given text can be extracted. In addition, our proposed dataset is not suitable for dealing with nested keyphrases.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Practical implications<\/jats:title>\n                    <jats:p>\n                      We make our character-level IOB format dataset of Chinese Automatic Keyphrase Extraction from scientific Chinese medical abstracts (CAKE) publicly available for the benefits of research community, which is available at:\n                      <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/possible1402\/Dataset-For-Chinese-Medical-Keyphrase-Extraction\">https:\/\/github.com\/possible1402\/Dataset-For-Chinese-Medical-Keyphrase-Extraction<\/jats:ext-link>\n                      .\n                    <\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Originality\/value<\/jats:title>\n                    <jats:p>By designing comparative experiments, our study demonstrates that character-level formulation is more suitable for Chinese automatic keyphrase extraction task under the general trend of pretrained language models. And our proposed dataset provides a unified method for model evaluation and can promote the development of Chinese automatic keyphrase extraction to some extent.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.2478\/jdis-2021-0013","type":"journal-article","created":{"date-parts":[[2021,3,11]],"date-time":"2021-03-11T04:25:38Z","timestamp":1615436738000},"page":"35-57","source":"Crossref","is-referenced-by-count":7,"title":["Automatic Keyphrase Extraction from Scientific Chinese Medical Abstracts Based on Character-Level Sequence Labeling"],"prefix":"10.2478","volume":"6","author":[{"given":"Liangping","family":"Ding","sequence":"first","affiliation":[{"name":"National Science Library, Chinese Academy of Sciences , Beijing , China"},{"name":"University of Chinese academy of sciences , Beijing , China"}]},{"given":"Zhixiong","family":"Zhang","sequence":"additional","affiliation":[{"name":"National Science Library, Chinese Academy of Sciences , Beijing , China"},{"name":"University of Chinese academy of sciences , Beijing , China"},{"name":"Wuhan Library, Chinese Academy of Sciences , Wuhan , China"}]},{"given":"Huan","family":"Liu","sequence":"additional","affiliation":[{"name":"National Science Library, Chinese Academy of Sciences , Beijing , China"},{"name":"University of Chinese academy of sciences , Beijing , China"}]},{"given":"Jie","family":"Li","sequence":"additional","affiliation":[{"name":"National Science Library, Chinese Academy of Sciences , Beijing , China"},{"name":"University of Chinese academy of sciences , Beijing , China"}]},{"given":"Gaihong","family":"Yu","sequence":"additional","affiliation":[{"name":"National Science Library, Chinese Academy of Sciences , Beijing , China"},{"name":"University of Chinese academy of sciences , Beijing , China"}]}],"member":"374","published-online":{"date-parts":[[2021,3,2]]},"reference":[{"key":"2026012113135989610_j_jdis-2021-0013_ref_001","doi-asserted-by":"crossref","unstructured":"Augenstein, I., Das, M., Riedel, S., Vikraman, L., & McCallum, A. (2017). Semeval 2017 task 10: Scienceie-extracting keyphrases and relations from scientific publications. ArXiv Preprint ArXiv:1704.02853.","DOI":"10.18653\/v1\/S17-2091"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_002","doi-asserted-by":"crossref","unstructured":"Barker, K., & Cornacchia, N. (2000). Using noun phrase heads to extract document keyphrases. Conference of the Canadian Society for Computational Studies of Intelligence, 40\u201352.","DOI":"10.1007\/3-540-45486-1_4"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_003","doi-asserted-by":"crossref","unstructured":"Bengio, Y., Lamblin, P., Popovici, D., & Larochelle, H. (2007). Greedy layer-wise training of deep networks. Advances in Neural Information Processing Systems, 153\u2013160.","DOI":"10.7551\/mitpress\/7503.003.0024"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_004","unstructured":"Berend, G. (2011). Opinion expression mining by exploiting keyphrase extraction. In Proceedings of the 5th International Joint Conference on Natural Language Processing, 1162\u20131170, Chiang Mai, Thailand."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_005","unstructured":"Bougouin, A., Boudin, F., & Daille, B. (2013). Topicrank: Graph-based topic ranking for keyphrase extraction. In Proceedings of International Joint Conference on Natural Language, 543\u2013551, Nagoya, Japan."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_006","doi-asserted-by":"crossref","unstructured":"Campos, R., Mangaravite, V., Pasquali, A., Jorge, A.M., Nunes, C., & Jatowt, A. (2018). A text feature based automatic keyword extraction method for single documents. European Conference on Information Retrieval, 684\u2013691.","DOI":"10.1007\/978-3-319-76941-7_63"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_007","doi-asserted-by":"crossref","unstructured":"Caragea, C., Bulgarov, F.A., Godea, A., & Gollapalli, S.D. (2014). Citation-enhanced keyphrase extraction from research papers: A supervised approach. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 1435\u20131446.","DOI":"10.3115\/v1\/D14-1150"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_008","doi-asserted-by":"crossref","unstructured":"Carpena, P., Bernaola-Galv\u00e1n, P., Hackenberg, M., Coronado, A., & Oliver, J. (2009). Level statistics of words: Finding keywords in literary texts and symbolic sequences. Physical Review E, 79(3), 035102.","DOI":"10.1103\/PhysRevE.79.035102"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_009","doi-asserted-by":"crossref","unstructured":"Chien, L.F. (1997). PAT-tree-based keyword extraction for Chinese information retrieval. In Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 50\u201358.","DOI":"10.1145\/258525.258534"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_010","doi-asserted-by":"crossref","unstructured":"Cohen, J.D. (1995). Highlights: Language-and domain-independent automatic indexing terms for abstracting. Journal of the American Society for Information Science, 46(3), 162\u2013174.","DOI":"10.1002\/(SICI)1097-4571(199504)46:3<162::AID-ASI2>3.0.CO;2-6"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_011","unstructured":"Dai, A.M., & Le, Q.V. (2015). Semi-supervised sequence learning. Advances in Neural Information Processing Systems, 3079\u20133087."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_012","doi-asserted-by":"crossref","unstructured":"Danesh, S., Sumner, T., & Martin, J.H. (2015). Sgrank: Combining statistical and graphical methods to improve the state of the art in unsupervised keyphrase extraction. Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics, 117\u2013126.","DOI":"10.18653\/v1\/S15-1013"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_013","unstructured":"Devlin, J., Chang, M.W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. ArXiv:1810.04805 [Cs]. http:\/\/arxiv.org\/abs\/1810.04805"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_014","unstructured":"Erhan, D., Bengio, Y., Courville, A., Manzagol, P.A., Vincent, P., & Bengio, S. (2010). Why does unsupervised pre-training help deep learning? Journal of Machine Learning Research, 11(Feb), 625\u2013660."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_015","doi-asserted-by":"crossref","unstructured":"Erkan, G., & Radev, D.R. (2004). Lexrank: Graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research, 22, 457\u2013479.","DOI":"10.1613\/jair.1523"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_016","unstructured":"Frank, E., Paynter, G., Witten, I., Gutwin, C., & Nevill-Manning, C. (1999). Domain-Specific Keyphrase Extraction. In Proceeding of 16th International Joint Conference on Artificial Intelligence, Stockholm, Sweden, 668\u2013673."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_017","doi-asserted-by":"crossref","unstructured":"Giorgi, J.M., & Bader, G.D. (2018). Transfer learning for biomedical named entity recognition with neural networks. Bioinformatics, 34(23), 4087\u20134094.","DOI":"10.1093\/bioinformatics\/bty449"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_018","doi-asserted-by":"crossref","unstructured":"Grineva, M., Grinev, M., & Lizorkin, D. (2009). Extracting key terms from noisy and multitheme documents. In Proceedings of the 18th International Conference on World Wide Web, 661\u2013670.","DOI":"10.1145\/1526709.1526798"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_019","doi-asserted-by":"crossref","unstructured":"Habibi, M., Weber, L., Neves, M., Wiegandt, D.L., & Leser, U. (2017). Deep learning with word embeddings improves biomedical named entity recognition. Bioinformatics, 33(14), i37\u2013i48.","DOI":"10.1093\/bioinformatics\/btx228"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_020","doi-asserted-by":"crossref","unstructured":"Hasan, K.S., & Ng, V. (2014). Automatic keyphrase extraction: A survey of the state of the art. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 1262\u20131273. https:\/\/doi.org\/10.3115\/v1\/P14-1119","DOI":"10.3115\/v1\/P14-1119"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_021","doi-asserted-by":"crossref","unstructured":"Hinton, G.E., Osindero, S., & Teh, Y.W. (2006). A fast learning algorithm for deep belief nets. Neural Computation, 18(7), 1527\u20131554. https:\/\/doi.org\/10.1162\/neco.2006.18.7.1527","DOI":"10.1162\/neco.2006.18.7.1527"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_022","doi-asserted-by":"crossref","unstructured":"Howard, J., & Ruder, S. (2018). Universal language model fine-tuning for text classification. ArXiv:1801.06146 [Cs, Stat]. http:\/\/arxiv.org\/abs\/1801.06146","DOI":"10.18653\/v1\/P18-1031"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_023","doi-asserted-by":"crossref","unstructured":"Hulth, A. (2003). Improved automatic keyword extraction given more linguistic knowledge. In Proceedings of the 2003 Conference on Empirical Methods in Natural Language, 216\u2013223.","DOI":"10.3115\/1119355.1119383"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_024","doi-asserted-by":"crossref","unstructured":"Hulth, A., Karlgren, J., Jonsson, A., Bostr\u00f6m, H., & Asker, L. (2001). Automatic keyword extraction using domain knowledge. International Conference on Intelligent Text Processing and Computational Linguistics, 472\u2013482.","DOI":"10.1007\/3-540-44686-9_47"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_025","doi-asserted-by":"crossref","unstructured":"Hulth, A., & Megyesi, B.B. (2006). A study on automatically extracted keywords in text categorization. In Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, 537\u2013544.","DOI":"10.3115\/1220175.1220243"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_026","doi-asserted-by":"crossref","unstructured":"Jones, S., & Staveley, M.S. (1999). Phrasier: A system for interactive document retrieval using keyphrases. In Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 160\u2013167.","DOI":"10.1145\/312624.312671"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_027","unstructured":"Kelleher, D., & Luz, S. (2005). Automatic hypertext keyphrase detection. IJCAI, 5, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, 1608\u20131609."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_028","doi-asserted-by":"crossref","unstructured":"Kim, S.N., & Kan, M.Y. (2009). Re-examining automatic keyphrase extraction approaches in scientific articles. In Proceedings of the Workshop on Multiword Expressions: Identification, Interpretation, Disambiguation and Applications, 9\u201316.","DOI":"10.3115\/1698239.1698242"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_029","unstructured":"Kim, S.N., Medelyan, O., Kan, M.Y., & Baldwin, T. (2010). Semeval-2010 task 5: Automatic keyphrase extraction from scientific articles. In Proceedings of the 5th International Workshop on Semantic Evaluation, 21\u201326."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_030","doi-asserted-by":"crossref","unstructured":"Le, T.T.N., Le Nguyen, M., & Shimazu, A. (2016). Unsupervised keyphrase extraction: Introducing new kinds of words to keyphrases. Australasian Joint Conference on Artificial Intelligence, 665\u2013671.","DOI":"10.1007\/978-3-319-50127-7_58"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_031","doi-asserted-by":"crossref","unstructured":"Lee, J., Yoon, W., Kim, S., Kim, D., Kim, S., So, C.H., & Kang, J. (2019). BioBERT: A pre-trained biomedical language representation model for biomedical text mining. Bioinformatics, btz682. https:\/\/doi.org\/10.1093\/bioinformatics\/btz682","DOI":"10.1093\/bioinformatics\/btz682"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_032","unstructured":"Liu, Z.Y., Huang, W.Y., Zheng, Y.B., & Sun, M.S. (2010). Automatic keyphrase extraction via topic decomposition. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 366\u2013376."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_033","doi-asserted-by":"crossref","unstructured":"Liu, Z.Y., Li, P., Zheng, Y.B., & Sun, M.S. (2009). Clustering to find exemplar terms for keyphrase extraction. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1, 257\u2013266.","DOI":"10.3115\/1699510.1699544"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_034","doi-asserted-by":"crossref","unstructured":"Luhn, H.P. (1957). A statistical approach to mechanized encoding and searching of literary information. IBM Journal of Research and Development, 1(4), 309\u2013317.","DOI":"10.1147\/rd.14.0309"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_035","doi-asserted-by":"crossref","unstructured":"Matsuo, Y., & Ishizuka, M. (2004). Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools, 13(01), 157\u2013169.","DOI":"10.1142\/S0218213004001466"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_036","doi-asserted-by":"crossref","unstructured":"Medelyan, O., Frank, E., & Witten, I.H. (2009). Human-competitive tagging using automatic keyphrase extraction. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3-Volume 3, 1318\u20131327.","DOI":"10.3115\/1699648.1699678"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_037","doi-asserted-by":"crossref","unstructured":"Mihalcea, R., & Tarau, P. (2004). Textrank: Bringing order into text. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 404\u2013411.","DOI":"10.3115\/1220575.1220627"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_038","unstructured":"Papagiannopoulou, E., & Tsoumakas, G. (2019). A review of keyphrase extraction. ArXiv:1905. 05044 [Cs]. http:\/\/arxiv.org\/abs\/1905.05044"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_039","doi-asserted-by":"crossref","unstructured":"Peters, M., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., & Zettlemoyer, L. (2018). Deep Contextualized Word Representations. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), 2227\u20132237. https:\/\/doi.org\/10.18653\/v1\/N18-1202","DOI":"10.18653\/v1\/N18-1202"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_040","unstructured":"Radford, A., Narasimhan, K., Salimans, T., & Sutskever, I. (2018). Improving language understanding with unsupervised learning. Technical Report, OpenAI."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_041","doi-asserted-by":"crossref","unstructured":"Sahrawat, D., Mahata, D., Kulkarni, M., Zhang, H., Gosangi, R., Stent, A., Sharma, A., Kumar, Y., Shah, R.R., & Zimmermann, R. (2019). Keyphrase Extraction from Scholarly Articles as Sequence Labeling using Contextualized Embeddings. ArXiv Preprint ArXiv:1910.08840.","DOI":"10.1007\/978-3-030-45442-5_41"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_042","doi-asserted-by":"crossref","unstructured":"Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513\u2013523.","DOI":"10.1016\/0306-4573(88)90021-0"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_043","doi-asserted-by":"crossref","unstructured":"Salton, G., Yang, C.S., & Yu, C.T. (1975). A theory of term importance in automatic text analysis. Journal of the American Society for Information Science, 26(1), 33\u201344.","DOI":"10.1002\/asi.4630260106"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_044","doi-asserted-by":"crossref","unstructured":"Tomokiyo, T., & Hurst, M. (2003). A language model approach to keyphrase extraction. In Proceedings of the ACL 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment, 33\u201340.","DOI":"10.3115\/1119282.1119287"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_045","doi-asserted-by":"crossref","unstructured":"Turney, P.D. (2000). Learning algorithms for keyphrase extraction. Information Retrieval, 2(4), 303\u2013336.","DOI":"10.1023\/A:1009976227802"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_046","unstructured":"Turney, P.D. (2002). Learning to extract keyphrases from text. ArXiv Preprint Cs\/0212013."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_047","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 5998\u20136008."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_048","unstructured":"Wan, X., & Xiao, J. (2008). Single document keyphrase extraction using neighborhood knowledge. AAAI, 8, 855\u2013860."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_049","doi-asserted-by":"crossref","unstructured":"Wang, M., Zhao, B., & Huang, Y. (2016). Ptr: Phrase-based topical ranking for automatic key-phrase extraction in scientific publications. International Conference on Neural Information Processing, 120\u2013128.","DOI":"10.1007\/978-3-319-46681-1_15"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_050","doi-asserted-by":"crossref","unstructured":"Wang, X., Zhang, Y., Ren, X., Zhang, Y.H., Zitnik, M., Shang, J.B., Langlotz, C., & Han, J.W. (2019). Cross-type biomedical named entity recognition with deep multi-task learning. Bioinformatics, 35(10), 1745\u20131752.","DOI":"10.1093\/bioinformatics\/bty869"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_051","doi-asserted-by":"crossref","unstructured":"Witten, I.H., Paynter, G.W., Frank, E., Gutwin, C., & Nevill-Manning, C.G. (2005). Kea: Practical automated keyphrase extraction. In Design and Usability of Digital Libraries: Case Studies in the Asia Pacific (pp. 129\u2013152). IGI global.","DOI":"10.4018\/978-1-59140-441-5.ch008"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_052","unstructured":"Wu, Y., Schuster, M., Chen, Z., Le, Q.V., Norouzi, M., Macherey, W., Krikun, M., Cao, Y., Gao, Q., & Macherey, K. (2016). Google\u2019s neural machine translation system: Bridging the gap between human and machine translation. ArXiv Preprint ArXiv:1609.08144."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_053","unstructured":"Zhang C.Z., Wang H.L., Liu Y., Wu D., Liao Y., & Wang B. (2008). Automatic keyword extraction from documents using conditional random fields. Journal of Computational Information Systems, 4(3), 1169\u20131180."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_054","doi-asserted-by":"crossref","unstructured":"Zhang, Q., Wang, Y., Gong, Y., & Huang, X.J. (2016). Keyphrase extraction using deep recurrent neural networks on twitter. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 836\u2013845.","DOI":"10.18653\/v1\/D16-1080"},{"key":"2026012113135989610_j_jdis-2021-0013_ref_055","unstructured":"Zhang, Y., Zincir-Heywood, N., & Milios, E. (2004). World wide web site summarization. Web Intelligence and Agent Systems: An International Journal, 2(1), 39\u201353."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_056","unstructured":"Zhao, W.X., Jiang, J., He, J., Song, Y., Achananuparp, P., Lim, E.P., & Li, X. (2011). Topical keyphrase extraction from twitter. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, 379\u2013388."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_057","unstructured":"Li, L.S., Dang, Y.Z., Zhang, J., & Li, D. (2013). Term extraction in the automotive field based on conditional random fields. Journal of Dalian University of Technology, 53(2), 267\u2013272."},{"key":"2026012113135989610_j_jdis-2021-0013_ref_058","unstructured":"Li, S.J., Wang, H.F., Yu, S.W., & Xin, C.S. (2004). Application research of maximum entropy model for keyword automatic indexing. Chinese Journal of Computers, 27(9), 1192\u20131197."}],"container-title":["Journal of Data and Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/content.sciendo.com\/view\/journals\/jdis\/ahead-of-print\/article-10.2478-jdis-2021-0013\/article-10.2478-jdis-2021-0013.xml","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.sciendo.com\/pdf\/10.2478\/jdis-2021-0013","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,21]],"date-time":"2026-01-21T13:14:35Z","timestamp":1769001275000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.sciendo.com\/article\/10.2478\/jdis-2021-0013"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,2]]},"references-count":58,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2021,3,2]]},"published-print":{"date-parts":[[2021,8,1]]}},"alternative-id":["10.2478\/jdis-2021-0013"],"URL":"https:\/\/doi.org\/10.2478\/jdis-2021-0013","relation":{},"ISSN":["2543-683X"],"issn-type":[{"value":"2543-683X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,3,2]]}}}