{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,5]],"date-time":"2026-03-05T04:30:11Z","timestamp":1772685011997,"version":"3.50.1"},"reference-count":54,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2019,4,4]],"date-time":"2019-04-04T00:00:00Z","timestamp":1554336000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004329","name":"Javna Agencija za Raziskovalno Dejavnost RS","doi-asserted-by":"publisher","award":["P2-0103"],"award-info":[{"award-number":["P2-0103"]}],"id":[{"id":"10.13039\/501100004329","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000781","name":"European Research Council","doi-asserted-by":"publisher","award":["J7-7303"],"award-info":[{"award-number":["J7-7303"]}],"id":[{"id":"10.13039\/501100000781","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010661","name":"Horizon 2020","doi-asserted-by":"publisher","award":["825153"],"award-info":[{"award-number":["825153"]}],"id":[{"id":"10.13039\/100010661","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["MAKE"],"abstract":"<jats:p>Deep neural networks are becoming ubiquitous in text mining and natural language processing, but semantic resources, such as taxonomies and ontologies, are yet to be fully exploited in a deep learning setting. This paper presents an efficient semantic text mining approach, which converts semantic information related to a given set of documents into a set of novel features that are used for learning. The proposed Semantics-aware Recurrent deep Neural Architecture (SRNA) enables the system to learn simultaneously from the semantic vectors and from the raw text documents. 
We test the effectiveness of the approach on three text classification tasks: news topic categorization, sentiment analysis and gender profiling. The experiments show that the proposed approach outperforms the approach without semantic knowledge, with highest accuracy gain (up to 10%) achieved on short document fragments.<\/jats:p>","DOI":"10.3390\/make1020034","type":"journal-article","created":{"date-parts":[[2019,4,5]],"date-time":"2019-04-05T11:36:01Z","timestamp":1554464161000},"page":"575-589","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["Towards Robust Text Classification with Semantics-Aware Recurrent Neural Architecture"],"prefix":"10.3390","volume":"1","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9916-8756","authenticated-orcid":false,"given":"Bla\u017e","family":"\u0160krlj","sequence":"first","affiliation":[{"name":"Jo\u017eef Stefan Institute, 1000 Ljubljana, Slovenia"},{"name":"Jo\u017eef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia"}]},{"given":"Jan","family":"Kralj","sequence":"additional","affiliation":[{"name":"Jo\u017eef Stefan Institute, 1000 Ljubljana, Slovenia"}]},{"given":"Nada","family":"Lavra\u010d","sequence":"additional","affiliation":[{"name":"Jo\u017eef Stefan Institute, 1000 Ljubljana, Slovenia"},{"name":"School of Engineering and Management, University of Nova Gorica, 5000 Nova Gorica, Slovenia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4380-0863","authenticated-orcid":false,"given":"Senja","family":"Pollak","sequence":"additional","affiliation":[{"name":"Jo\u017eef Stefan Institute, 1000 Ljubljana, Slovenia"},{"name":"Usher Institute, Medical School, University of Edinburgh, Edinburgh EH16 4UX, UK"}]}],"member":"1968","published-online":{"date-parts":[[2019,4,4]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Aggarwal, C.C., and Zhai, C. (2012). A survey of text classification algorithms. 
Mining Text Data, Springer.","DOI":"10.1007\/978-1-4614-3223-4"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/505282.505283","article-title":"Machine learning in automated text categorization","volume":"34","author":"Sebastiani","year":"2002","journal-title":"ACM Comput. Surv."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Tang, D., Qin, B., and Liu, T. (2015, January 17\u201321). Document modeling with gated recurrent neural network for sentiment classification. Proceedings of the Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal.","DOI":"10.18653\/v1\/D15-1167"},{"key":"ref_4","unstructured":"Kusner, M., Sun, Y., Kolkin, N., and Weinberger, K. (2015, January 6\u201311). From word embeddings to document distances. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_5","unstructured":"\u0141awrynowicz, A. (2017). Semantic Data Mining: An Ontology-Based Approach, IOS Press."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"304","DOI":"10.1093\/comjnl\/bxs057","article-title":"Semantic subgroup discovery systems and workflows in the SDM toolkit","volume":"56","year":"2013","journal-title":"Comput. J."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/s10994-016-5550-3","article-title":"Explaining mixture models through semantic pattern mining and banded matrix visualization","volume":"105","author":"Adhikari","year":"2016","journal-title":"Mach. Learn."},{"key":"ref_8","unstructured":"Scott, S., and Matwin, S. (1998, January 16). Text classification using WordNet hypernyms. Proceedings of the Workshop on Usage of WordNet in Natural Language Processing Systems, Montreal, QC, Canada."},{"key":"ref_9","unstructured":"Mansuy, T.N., and Hilderman, R.J. (2006, January 11\u201313). Evaluating WordNet features in text classification models. 
Proceedings of the FLAIRS Conference, Melbourne Beach, FL, USA."},{"key":"ref_10","unstructured":"Rangel, F., Rosso, P., Chugur, I., Potthast, M., Trenkmann, M., Stein, B., Verhoeven, B., and Daelemans, W. (2014, January 15\u201318). Overview of the 2nd author profiling task at PAN 2014. Proceedings of the Working Notes Papers of the CLEF Conference, Sheffield, UK."},{"key":"ref_11","unstructured":"Rangel, F., Rosso, P., Verhoeven, B., Daelemans, W., Potthast, M., and Stein, B. (2016, January 5\u20138). Overview of the 4th author profiling task at PAN 2016: Cross-genre evaluations. Proceedings of the Working Notes Papers of the CLEF Conference, Evora, Portugal."},{"key":"ref_12","unstructured":"Cho, J., Lee, K., Shin, E., Choy, G., and Do, S. (2015). How much data is needed to train a medical image deep learning system to achieve necessary high accuracy?. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Landauer, T.K. (2006). Latent Semantic Analysis, Wiley Online Library.","DOI":"10.1002\/0470018860.s00561"},{"key":"ref_14","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_15","unstructured":"Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013). Efficient estimation of word representations in vector space. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Pennington, J., Socher, R., and Manning, C. (2014, January 25\u201329). Glove: Global vectors for word representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar.","DOI":"10.3115\/v1\/D14-1162"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Bojanowski, P., Grave, E., Joulin, A., and Mikolov, T. (2016). Enriching Word Vectors with Subword Information. 
arXiv.","DOI":"10.1162\/tacl_a_00051"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"635","DOI":"10.4304\/jmm.9.5.635-643","article-title":"Short text classification: A survey","volume":"9","author":"Song","year":"2014","journal-title":"J. Multimed."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Tang, D., Wei, F., Yang, N., Zhou, M., Liu, T., and Qin, B. (2014, January 23\u201325). Learning sentiment-specific word embedding for twitter sentiment classification. Proceedings of the 52nd ACL Conference, Baltimore, MD, USA.","DOI":"10.3115\/v1\/P14-1146"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.datak.2013.01.005","article-title":"Improving classification models with taxonomy information","volume":"86","author":"Cagliero","year":"2013","journal-title":"Data Knowl. Eng."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"\u0160krlj, B., Kralj, J., and Lavra\u010d, N. (2019). CBSSD: Community-based semantic subgroup discovery. J. Intell. Inf. Syst., 1\u201340.","DOI":"10.1007\/s10844-019-00545-0"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Xu, N., Wang, J., Qi, G., Huang, T.S., and Lin, W. (2018). Ontological random forests for image classification. Computer Vision: Concepts, Methodologies, Tools, and Applications, IGI Global.","DOI":"10.4018\/978-1-5225-5204-8.ch031"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1","DOI":"10.4018\/IJSI.2018010101","article-title":"A novel approach for ontology-based feature vector generation for web text document classification","volume":"6","author":"Elhadad","year":"2018","journal-title":"Int. J. Softw. Innov."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Kaur, R., and Kumar, M. (2018, January 14\u201315). Domain ontology graph approach using Markov clustering algorithm for text classification. 
Proceedings of the International Conference on Intelligent Computing and Applications, Madurai, India.","DOI":"10.1007\/978-981-10-5520-1_47"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Ristoski, P., Faralli, S., Ponzetto, S.P., and Paulheim, H. (2017, January 23\u201326). Large-scale taxonomy induction using entity and word embeddings. Proceedings of the International Conference on Web Intelligence, Leipzig, Germany.","DOI":"10.1145\/3106426.3106465"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Liu, Q., Jiang, H., Wei, S., Ling, Z.H., and Hu, Y. (2015, January 26\u201331). Learning semantic word embeddings based on ordinal knowledge constraints. Proceedings of the 53rd ACL Conference and the 7th IJCNLP Conference, Beijing, China.","DOI":"10.3115\/v1\/P15-1145"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Bian, J., Gao, B., and Liu, T.Y. (2014, January 15\u201319). Knowledge-powered deep learning for word embedding. Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Nancy, France.","DOI":"10.1007\/978-3-662-44848-9_9"},{"key":"ref_28","unstructured":"Zhang, X., Zhao, J., and LeCun, Y. (2015). Character-level convolutional networks for text classification. Advances in Neural Information Processing Systems 28 (NIPS 2015), Curran Associates, Inc."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Kim, Y. (2014). Convolutional neural networks for sentence classification. arXiv.","DOI":"10.3115\/v1\/D14-1181"},{"key":"ref_31","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_32","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012). 
Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems 25 (NIPS 2012), Curran Associates, Inc."},{"key":"ref_33","unstructured":"Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning, MIT Press."},{"key":"ref_34","unstructured":"Gal, Y., and Ghahramani, Z. (2016). A theoretically grounded application of dropout in recurrent neural networks. Advances in Neural Information Processing Systems 29 (NIPS 2016), Curran Associates, Inc."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Cheng, J., Dong, L., and Lapata, M. (2016). Long short-term memory-networks for machine reading. arXiv.","DOI":"10.18653\/v1\/D16-1053"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Graves, A., Mohamed, A.R., and Hinton, G. (2013, January 26\u201331). Speech recognition with deep recurrent neural networks. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, BC, Canada.","DOI":"10.1109\/ICASSP.2013.6638947"},{"key":"ref_37","unstructured":"Chung, J., Gulcehre, C., Cho, K., and Bengio, Y. (2015, January 6\u201311). Gated feedback recurrent neural networks. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_38","unstructured":"Kowsari, K., Heidarysafa, M., Brown, D.E., Meimandi, K.J., and Barnes, L.E. (2018, January 9\u201311). Rmdl: Random multimodel deep learning for classification. Proceedings of the 2nd International Conference on Information System and Data Mining, Lakeland, FL, USA."},{"key":"ref_39","first-page":"1929","article-title":"Dropout: A simple way to prevent neural networks from overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"J. Mach. Learn. Res."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Kowsari, K., Brown, D.E., Heidarysafa, M., Meimandi, K.J., Gerber, M.S., and Barnes, L.E. (2017, January 18\u201321). 
Hdltex: Hierarchical deep learning for text classification. Proceedings of the 16th IEEE International Conference on Machine Learning and Applications (ICMLA), Cancun, Mexico.","DOI":"10.1109\/ICMLA.2017.0-134"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Cheng, H.T., Koc, L., Harmsen, J., Shaked, T., Chandra, T., Aradhye, H., Anderson, G., Corrado, G., Chai, W., and Ispir, M. (2016, January 15). Wide & deep learning for recommender systems. Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, Boston, MA, USA.","DOI":"10.1145\/2988450.2988454"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1145\/219717.219748","article-title":"WordNet: A lexical database for English","volume":"38","author":"Miller","year":"1995","journal-title":"Commun. ACM"},{"key":"ref_43","unstructured":"Clevert, D.A., Unterthiner, T., and Hochreiter, S. (2015). Fast and accurate deep network learning by exponential linear units (elus). arXiv."},{"key":"ref_44","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_45","unstructured":"Chollet, F. (2019, March 20). Keras. Available online: https:\/\/github.com\/fchollet\/keras."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1023\/A:1025667309714","article-title":"Theoretical and empirical analysis of ReliefF and RReliefF","volume":"53","author":"Kononenko","year":"2003","journal-title":"Mach. Learn."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1145\/1961189.1961199","article-title":"LIBSVM: A library for support vector machines","volume":"2","author":"Chang","year":"2011","journal-title":"ACM Trans. Intell. Syst. Technol."},{"key":"ref_48","unstructured":"Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., and Isard, M. (2016, January 2\u20134). Tensorflow: A system for large-scale machine learning. 
Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), Savannah, GA, USA."},{"key":"ref_49","first-page":"2825","article-title":"Scikit-learn: Machine learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J. Mach. Learn. Res."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/MCSE.2011.37","article-title":"The NumPy array: A structure for efficient numerical computation","volume":"13","author":"Walt","year":"2011","journal-title":"Comput. Sci. Eng."},{"key":"ref_51","first-page":"1","article-title":"Statistical comparisons of classifiers over multiple data sets","volume":"7","year":"2006","journal-title":"J. Mach. Learn. Res."},{"key":"ref_52","first-page":"2653","article-title":"Time for a change: A tutorial for comparing multiple classifiers through Bayesian analysis","volume":"18","author":"Benavoli","year":"2017","journal-title":"J. Mach. Learn. Res."},{"key":"ref_53","unstructured":"Hong, J., and Fang, M. (2015). Sentiment Analysis with Deeply Learned Distributed Representations of Variable Length Texts, Stanford University. Technical Report."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Zhang, H., Xiao, L., Chen, W., Wang, Y., and Jin, Y. (2017). Multi-task label embedding for text classification. 
arXiv.","DOI":"10.18653\/v1\/D18-1484"}],"container-title":["Machine Learning and Knowledge Extraction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-4990\/1\/2\/34\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:43:07Z","timestamp":1760186587000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-4990\/1\/2\/34"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,4,4]]},"references-count":54,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2019,6]]}},"alternative-id":["make1020034"],"URL":"https:\/\/doi.org\/10.3390\/make1020034","relation":{},"ISSN":["2504-4990"],"issn-type":[{"value":"2504-4990","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,4,4]]}}}