{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T04:34:52Z","timestamp":1777696492066,"version":"3.51.4"},"reference-count":29,"publisher":"SAGE Publications","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IDA"],"published-print":{"date-parts":[[2023,10,6]]},"abstract":"<jats:p>Short text classification has provoked a vast amount of attention and research in recent decades. However, most existing methods only focus on the short texts that contain dozens of words like Twitter and Microblog, while pay far less attention to the extreme short texts like news headline and search snippets. Meanwhile, contemporary short text classification methods that extend the features via external knowledge sources always introduce lots of useless concepts, which may be detrimental to classification performance. Moreover, unlike traditional short text classification methods, the classification results of extreme short texts are often determined by a few even one or two keywords. To address these problems, we propose a novel hybrid classification method via Keywords Screening and Attention Mechanisms in extreme short text, called KSAM. More specifically, firstly, the attention-based BiLSTM is introduced in our method to enhance the role of keywords. Secondly, we screen the keywords in the extreme short text for obtaining the true class label, and the concepts concerning the keywords are retrieved from external open knowledge sources like DBpedia. Thirdly, the attention mechanisms are introduced to acquire the weight of these retrieved concepts. Finally, conceptual information is utilized to assist the classification of the extreme short text. Extensive experiments have demonstrated the effectiveness of our method compared to other state-of-the-art methods.<\/jats:p>","DOI":"10.3233\/ida-220417","type":"journal-article","created":{"date-parts":[[2023,8,13]],"date-time":"2023-08-13T15:06:21Z","timestamp":1691939181000},"page":"1331-1345","source":"Crossref","is-referenced-by-count":0,"title":["A hybrid classification method via keywords screening and attention mechanisms in extreme short text"],"prefix":"10.1177","volume":"27","author":[{"given":"Xinke","family":"Zhou","sequence":"first","affiliation":[{"name":"School of Information Engineering, Yangzhou University, Yangzhou, Jiangsu, China"}]},{"given":"Yi","family":"Zhu","sequence":"additional","affiliation":[{"name":"School of Information Engineering, Yangzhou University, Yangzhou, Jiangsu, China"},{"name":"Key Laboratory of Knowledge Engineering with Big Data, Hefei University of Technology, Anhui, China"},{"name":"School of Computer Science and Information Engineering, Hefei University of Technology, Anhui, China"}]},{"given":"Yun","family":"Li","sequence":"additional","affiliation":[{"name":"School of Information Engineering, Yangzhou University, Yangzhou, Jiangsu, China"}]},{"given":"Jipeng","family":"Qiang","sequence":"additional","affiliation":[{"name":"School of Information Engineering, Yangzhou University, Yangzhou, Jiangsu, China"}]},{"given":"Yunhao","family":"Yuan","sequence":"additional","affiliation":[{"name":"School of Information Engineering, Yangzhou University, Yangzhou, Jiangsu, China"}]},{"given":"Xingdong","family":"Wu","sequence":"additional","affiliation":[{"name":"Key Laboratory of Knowledge Engineering with Big Data, Hefei University of Technology, Anhui, China"},{"name":"School of Computer Science and Information Engineering, Hefei University of Technology, Anhui, China"}]},{"given":"Runmei","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Electronics an Information Engineering, Anhui Jianzhu University, Anhui, China"}]}],"member":"179","reference":[{"issue":"5","key":"10.3233\/IDA-220417_ref1","doi-asserted-by":"crossref","first-page":"635","DOI":"10.4304\/jmm.9.5.635-643","article-title":"Short text classification: A survey","volume":"9","author":"Song","year":"2014","journal-title":"Journal of Multimedia"},{"key":"10.3233\/IDA-220417_ref2","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/j.ins.2018.04.071","article-title":"Filtering out the noise in short text topic modeling","volume":"456","author":"Li","year":"2018","journal-title":"Information Sciences"},{"key":"10.3233\/IDA-220417_ref3","doi-asserted-by":"crossref","first-page":"92120","DOI":"10.1109\/ACCESS.2020.2994450","article-title":"A Hybrid Classification Method via Character Embedding in Chinese Short Text With Few Words","volume":"8","author":"Zhu","year":"2020","journal-title":"IEEE Access"},{"issue":"22","key":"10.3233\/IDA-220417_ref4","doi-asserted-by":"crossref","first-page":"29799","DOI":"10.1007\/s11042-018-5772-4","article-title":"Few-shot learning for short text classification","volume":"77","author":"Yan","year":"2018","journal-title":"Multimedia Tools and Applications"},{"issue":"12","key":"10.3233\/IDA-220417_ref6","doi-asserted-by":"crossref","first-page":"2928","DOI":"10.1109\/TKDE.2014.2313872","article-title":"Btm: Topic modeling over short texts","volume":"26","author":"Cheng","year":"2014","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"10.3233\/IDA-220417_ref8","doi-asserted-by":"crossref","unstructured":"S. Lai, L. Xu, K. Liu and J. Zhao, Recurrent convolutional neural networks for text classification, in: Twenty-ninth AAAI Conference on Artificial Intelligence, 2015.","DOI":"10.1609\/aaai.v29i1.9513"},{"key":"10.3233\/IDA-220417_ref9","unstructured":"M. Chen, X. Jin and D. Shen, Short text classification improved by learning multi-granularity topics, in: Twenty-second International Joint Conference on Artificial Intelligence, 2011."},{"key":"10.3233\/IDA-220417_ref10","doi-asserted-by":"crossref","first-page":"130","DOI":"10.1016\/j.ins.2015.06.033","article-title":"Effectively classifying short texts by structured sparse representation with dictionary filtering","volume":"323","author":"Gao","year":"2015","journal-title":"Information Sciences"},{"key":"10.3233\/IDA-220417_ref11","doi-asserted-by":"crossref","unstructured":"J. Wang, Z. Wang, D. Zhang and J. Yan, Combining Knowledge with Deep Convolutional Neural Networks for Short Text Classification, in: IJCAI, Vol. 350, 2017.","DOI":"10.24963\/ijcai.2017\/406"},{"key":"10.3233\/IDA-220417_ref12","doi-asserted-by":"crossref","unstructured":"J. Chen, Y. Hu, J. Liu, Y. Xiao and H. Jiang, Deep short text classification with knowledge powered attention, in: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33, 2019, pp.\u00a06252\u20136259.","DOI":"10.1609\/aaai.v33i01.33016252"},{"key":"10.3233\/IDA-220417_ref13","doi-asserted-by":"crossref","unstructured":"W. Wu, H. Li, H. Wang and K.Q. Zhu, Probase: A probabilistic taxonomy for text understanding, in: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, 2012, pp.\u00a0481\u2013492.","DOI":"10.1145\/2213836.2213891"},{"issue":"2","key":"10.3233\/IDA-220417_ref14","doi-asserted-by":"crossref","first-page":"167","DOI":"10.3233\/SW-140134","article-title":"Dbpedia \u2013 a large-scale, multilingual knowledge base extracted from wikipedia","volume":"6","author":"Lehmann","year":"2015","journal-title":"Semantic Web"},{"key":"10.3233\/IDA-220417_ref15","doi-asserted-by":"crossref","unstructured":"H. Chen, M. Sun, C. Tu, Y. Lin and Z. Liu, Neural sentiment classification with user and product attention, in: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016, pp.\u00a01650\u20131659.","DOI":"10.18653\/v1\/D16-1171"},{"key":"10.3233\/IDA-220417_ref16","doi-asserted-by":"crossref","unstructured":"B. Sriram, D. Fuhry, E. Demir, H. Ferhatosmanoglu and M. Demirbas, Short text classification in twitter to improve information filtering, in: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010, pp.\u00a0841\u2013842.","DOI":"10.1145\/1835449.1835643"},{"issue":"9","key":"10.3233\/IDA-220417_ref17","doi-asserted-by":"crossref","first-page":"2697","DOI":"10.1109\/TCYB.2017.2748598","article-title":"Learning from short text streams with topic drifts","volume":"48","author":"Li","year":"2017","journal-title":"IEEE Transactions on Cybernetics"},{"key":"10.3233\/IDA-220417_ref18","doi-asserted-by":"crossref","first-page":"806","DOI":"10.1016\/j.neucom.2015.09.096","article-title":"Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification","volume":"174","author":"Wang","year":"2016","journal-title":"Neurocomputing"},{"key":"10.3233\/IDA-220417_ref19","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1016\/j.knosys.2015.03.025","article-title":"Term-weighting learning via genetic programming for text classification","volume":"83","author":"Escalante","year":"2015","journal-title":"Knowledge-Based Systems"},{"key":"10.3233\/IDA-220417_ref22","doi-asserted-by":"crossref","unstructured":"R. Johnson and T. Zhang, Deep pyramid convolutional neural networks for text categorization, in: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2017, pp.\u00a0562\u2013570.","DOI":"10.18653\/v1\/P17-1052"},{"key":"10.3233\/IDA-220417_ref23","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1016\/j.neucom.2019.07.052","article-title":"Improving text classification with weighted word embeddings via a multi-channel TextCNN model","volume":"363","author":"Guo","year":"2019","journal-title":"Neurocomputing"},{"key":"10.3233\/IDA-220417_ref24","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1016\/j.neucom.2019.10.033","article-title":"Text classification using capsules","volume":"376","author":"Kim","year":"2020","journal-title":"Neurocomputing"},{"key":"10.3233\/IDA-220417_ref25","doi-asserted-by":"crossref","unstructured":"J. Xu and Y. Cai, Incorporating context-relevant knowledge into convolutional neural networks for short text classification, in: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33, 2019, pp.\u00a010067\u201310068.","DOI":"10.1609\/aaai.v33i01.330110067"},{"key":"10.3233\/IDA-220417_ref26","doi-asserted-by":"crossref","unstructured":"H. Linmei, T. Yang, C. Shi, H. Ji and X. Li, Heterogeneous graph attention networks for semi-supervised short text classification, in: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019, pp.\u00a04821\u20134830.","DOI":"10.18653\/v1\/D19-1488"},{"key":"10.3233\/IDA-220417_ref27","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1016\/j.neucom.2019.08.080","article-title":"Incorporating context-relevant concepts into convolutional neural networks for short text classification","volume":"386","author":"Xu","year":"2020","journal-title":"Neurocomputing"},{"key":"10.3233\/IDA-220417_ref29","doi-asserted-by":"crossref","unstructured":"Y. Hao, Y. Zhang, K. Liu, S. He, Z. Liu, H. Wu and J. Zhao, An end-to-end model for question answering over knowledge base with cross-attention combining global knowledge, in: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2017, pp.\u00a0221\u2013231.","DOI":"10.18653\/v1\/P17-1021"},{"key":"10.3233\/IDA-220417_ref31","first-page":"1","article-title":"Automatic keyword extraction from individual documents","volume":"1","author":"Rose","year":"2010","journal-title":"Text Mining: Applications and Theory"},{"key":"10.3233\/IDA-220417_ref33","first-page":"649","article-title":"Character-level convolutional networks for text classification","volume":"28","author":"Zhang","year":"2015","journal-title":"Advances in Neural Information Processing Systems"},{"key":"10.3233\/IDA-220417_ref34","doi-asserted-by":"crossref","unstructured":"X.-H. Phan, L.-M. Nguyen and S. Horiguchi, Learning to classify short and sparse text & web with hidden topics from large-scale data collections, in: Proceedings of the 17th International Conference on World Wide Web, 2008, pp.\u00a091\u2013100.","DOI":"10.1145\/1367497.1367510"},{"key":"10.3233\/IDA-220417_ref35","doi-asserted-by":"crossref","unstructured":"A. Goel, J. Gautam and S. Kumar, Real time sentiment analysis of tweets using Naive Bayes, in: 2016 2nd International Conference on Next Generation Computing Technologies (NGCT), IEEE, 2016, pp.\u00a0257\u2013261.","DOI":"10.1109\/NGCT.2016.7877424"},{"key":"10.3233\/IDA-220417_ref36","doi-asserted-by":"crossref","unstructured":"Z. Yang, D. Yang, C. Dyer, X. He, A. Smola and E. Hovy, Hierarchical attention networks for document classification, in: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2016, pp.\u00a01480\u20131489.","DOI":"10.18653\/v1\/N16-1174"}],"container-title":["Intelligent Data Analysis"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/IDA-220417","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:20:12Z","timestamp":1777454412000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/IDA-220417"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,6]]},"references-count":29,"journal-issue":{"issue":"5"},"URL":"https:\/\/doi.org\/10.3233\/ida-220417","relation":{},"ISSN":["1088-467X","1571-4128"],"issn-type":[{"value":"1088-467X","type":"print"},{"value":"1571-4128","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,6]]}}}