{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,7]],"date-time":"2026-05-07T23:47:03Z","timestamp":1778197623627,"version":"3.51.4"},"reference-count":27,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2023,10,4]],"date-time":"2023-10-04T00:00:00Z","timestamp":1696377600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>In recent years, machine learning approaches, in particular graph learning methods, have achieved great results in the field of natural language processing, in particular text classification tasks. However, many of such models have shown limited generalization on datasets in different languages. In this research, we investigate and elaborate graph machine learning methods on non-English datasets (such as the Persian Digikala dataset), which consists of users\u2019 opinions for the task of text classification. More specifically, we investigate different combinations of (Pars) BERT with various graph neural network (GNN) architectures (such as GCN, GAT, and GIN) as well as use ensemble learning methods in order to tackle the text classification task on certain well-known non-English datasets. Our analysis and results demonstrate how applying GNN models helps in achieving good scores on the task of text classification by better capturing the topological information between textual data. Additionally, our experiments show how models employing language-specific pre-trained models (like ParsBERT, instead of BERT) capture better information about the data, resulting in better accuracies.<\/jats:p>","DOI":"10.3390\/a16100470","type":"journal-article","created":{"date-parts":[[2023,10,4]],"date-time":"2023-10-04T11:58:57Z","timestamp":1696420737000},"page":"470","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["On Enhancement of Text Classification and Analysis of Text Emotions Using Graph Machine Learning and Ensemble Learning Methods on Non-English Datasets"],"prefix":"10.3390","volume":"16","author":[{"given":"Fatemeh","family":"Gholami","sequence":"first","affiliation":[{"name":"Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), Tehran 15916-39675, Iran"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6942-3453","authenticated-orcid":false,"given":"Zahed","family":"Rahmati","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), Tehran 15916-39675, Iran"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alireza","family":"Mofidi","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), Tehran 15916-39675, Iran"},{"name":"School of Mathematics, Institute for Research in Fundamental Sciences (IPM), Tehran 15916-39675, Iran"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6954-3896","authenticated-orcid":false,"given":"Mostafa","family":"Abbaszadeh","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), Tehran 15916-39675, Iran"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,10,4]]},"reference":[{"key":"ref_1","first-page":"1","article-title":"A Survey on Text Classification: From Traditional to Deep Learning","volume":"13","author":"Li","year":"2022","journal-title":"ACM Trans. Intell. Syst. Technol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"e1253","DOI":"10.1002\/widm.1253","article-title":"Deep learning for sentiment analysis: A survey","volume":"8","author":"Zhang","year":"2018","journal-title":"Wiley Interdiscip. Rev. Data Min. Knowl. Dis."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Aggarwal, C.C., Zhai, C., Aggarwal, C.C., and Zhai, C. (2012). Mining Text Data, Springer.","DOI":"10.1007\/978-1-4614-3223-4"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1109\/TCBB.2018.2849968","article-title":"Natural language processing for ehr-based computational phenotyping","volume":"16","author":"Zeng","year":"2018","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"ref_5","unstructured":"Dai, Y., Liu, J., Ren, X., and Xu, Z. (2020, January 7\u201312). Adversarial training based multi-source unsupervised domain adaptation for sentiment analysis. Proceedings of the AAAI Conference on Artificial Intelligence 2020, New York, NY, USA."},{"key":"ref_6","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019, January 3\u20135). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the NAACL-HLT, Minneapolis, MN, USA."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"3831","DOI":"10.1007\/s11063-021-10528-4","article-title":"Parsbert: Transformer-based model for persian language understanding","volume":"53","author":"Farahani","year":"2021","journal-title":"Neural Process. Lett."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1016\/0306-4573(88)90021-0","article-title":"Term-weighting approaches in automatic text retrieval","volume":"24","author":"Salton","year":"1988","journal-title":"Inf. Process. Manag."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Medsker, L.R., and Jain, L.C. (1999). Recurrent Neural Networks: Design and Applications, CRC Press.","DOI":"10.1201\/9781420049176"},{"key":"ref_10","unstructured":"Liu, P., Qiu, X., and Huang, X. (2016, January 9\u201315). Recurrent neural network for text classification with multi-task learning. Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI), New York, NY, USA."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.jbi.2017.07.006","article-title":"Recurrent neural networks for classifying relations in clinical notes","volume":"72","author":"Luo","year":"2017","journal-title":"J. Biomed. Inform."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Lai, S., Xu, L., Liu, K., and Zhao, J. (2015, January 25\u201330). Recurrent convolutional neural networks for text classification. Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, TX, USA.","DOI":"10.1609\/aaai.v29i1.9513"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long Short-Term Memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Tai, K.S., Socher, R., and Manning, C.D. (2015, January 26\u201331). Improved semantic representations from tree-structured long short-term memory networks. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Beijing, China.","DOI":"10.3115\/v1\/P15-1150"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1186\/s40649-019-0069-y","article-title":"Graph convolutional networks: A comprehensive review","volume":"6","author":"Zhang","year":"2019","journal-title":"Comput. Soc. Netw."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Kim, Y. (2014, January 25\u201329). Convolutional Neural Networks for Sentence Classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar.","DOI":"10.3115\/v1\/D14-1181"},{"key":"ref_17","first-page":"1","article-title":"Inductive representation learning on large graphs","volume":"30","author":"Hamilton","year":"2017","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_18","first-page":"4","article-title":"Graph Attention Networks","volume":"1050","author":"Velickovic","year":"2018","journal-title":"Stat"},{"key":"ref_19","unstructured":"Xu, K., Hu, W., Leskovec, J., and Jegelka, S. How Powerful are Graph Neural Networks? In Proceedings of the International Conference on Learning Representations, New Orleans, LA, USA, 6\u20139 May 2019."},{"key":"ref_20","unstructured":"Yao, L., Mao, C., and Luo, Y. (2019, January 27\u201328). Graph convolutional networks for text classification. Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA."},{"key":"ref_21","unstructured":"Han, S.C., Yuan, Z., Wang, K., Long, S., and Poon, J. (2022). Understanding Graph Convolutional Networks for Text Classification. arXiv."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Lin, Y., Meng, Y., Sun, X., Han, Q., Kuang, K., Li, J., and Wu, F. (2021, January 1\u20136). BertGCN: Transductive Text Classification by Combining GNN and BERT. Proceedings of the Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, Online.","DOI":"10.18653\/v1\/2021.findings-acl.126"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Chiang, W.-L., Liu, X., Si, S., Li, Y., Bengio, S., and Hsieh, C. (2019, January 4\u20138). Cluster-GCN: An efficient algorithm for training deep and large graph convolutional networks. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA.","DOI":"10.1145\/3292500.3330925"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1137\/S1064827595287997","article-title":"A fast and high quality multilevel scheme for partitioning irregular graphs","volume":"20","author":"Karypis","year":"1998","journal-title":"SIAM J. Sci. Comput."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.neucom.2019.10.009","article-title":"A Hybrid Persian Sentiment Analysis Framework: Integrating Dependency Grammar Based Rules and Deep Neural Networks","volume":"380","author":"Dashtipour","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1177\/0165551520962781","article-title":"Deep Persian sentiment analysis: Cross-lingual training for low-resource languages","volume":"48","author":"Ghasemi","year":"2022","journal-title":"J. Inf. Sci."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"107659","DOI":"10.1016\/j.knosys.2021.107659","article-title":"Graph fusion network for text classification","volume":"236","author":"Dai","year":"2022","journal-title":"Knowl.-Based Syst."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/16\/10\/470\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:05:13Z","timestamp":1760130313000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/16\/10\/470"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,4]]},"references-count":27,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2023,10]]}},"alternative-id":["a16100470"],"URL":"https:\/\/doi.org\/10.3390\/a16100470","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,4]]}}}