{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T02:20:04Z","timestamp":1778034004984,"version":"3.51.4"},"reference-count":23,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2022,8,30]],"date-time":"2022-08-30T00:00:00Z","timestamp":1661817600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Ministry of Science and Higher Education of the Republic of Kazakhstan","award":["\u0410\u0420 09259556"],"award-info":[{"award-number":["\u0410\u0420 09259556"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>The problems of machine translation are constantly arising. While the most advanced translation platforms, such as Google and Yandex, allow for high-quality translations of languages with simple grammatical structures, more morphologically rich languages still suffer from the translation of complex sentences, and translation services leave many structural errors. This study focused on designing the rules for the grammatical structures of complex sentences in the Kazakh language, which has a difficult grammar with many rules. First, the types of complex sentences in the Kazakh language were thoroughly observed with the use of templates from the FuzzyWuzzy library. Then, the correction of complex sentences was completed with parallel corpora. The sentences were translated into English and Russian by existing machine translation systems. Therefore, the grammar of both Kazakh\u2013English and Kazakh\u2013Russian language pairs was considered. They both used the rules specifically designed for the post-editing steps. Finally, the performance of the developed algorithm was evaluated for an accuracy score for each pair of languages. This approach was then proposed for use in other corpora generation, post-editing, and analysis systems in future works.<\/jats:p>","DOI":"10.3390\/info13090411","type":"journal-article","created":{"date-parts":[[2022,8,31]],"date-time":"2022-08-31T00:13:56Z","timestamp":1661904836000},"page":"411","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Semantic Connections in the Complex Sentences for Post-Editing Machine Translation in the Kazakh Language"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9660-6928","authenticated-orcid":false,"given":"Aliya","family":"Turganbayeva","sequence":"first","affiliation":[{"name":"Institute of Information and Computational Technologies, Al-Farabi Kazakh National University, Almaty 050010, Kazakhstan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1427-198X","authenticated-orcid":false,"given":"Diana","family":"Rakhimova","sequence":"additional","affiliation":[{"name":"Institute of Information and Computational Technologies, Al-Farabi Kazakh National University, Almaty 050010, Kazakhstan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8768-0349","authenticated-orcid":false,"given":"Vladislav","family":"Karyukin","sequence":"additional","affiliation":[{"name":"The Department of Information Systems, Faculty of Information Technology, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan"}]},{"given":"Aidana","family":"Karibayeva","sequence":"additional","affiliation":[{"name":"The Department of Information Systems, Faculty of Information Technology, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan"}]},{"given":"Asem","family":"Turarbek","sequence":"additional","affiliation":[{"name":"The Department of Information Systems, Faculty of Information Technology, Al-Farabi Kazakh National University, Almaty 050040, Kazakhstan"}]}],"member":"1968","published-online":{"date-parts":[[2022,8,30]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Yamamoto, Y., Matsumoto, Y., and Watanabe, T. (2021, January 5\u20136). Dependency patterns of complex sentences and semantic disambiguation for abstract meaning representation parsing. Proceedings of *SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics, Association for Computational Linguistics, Bangkok, Thailand.","DOI":"10.18653\/v1\/2021.starsem-1.20"},{"key":"ref_2","first-page":"61","article-title":"To the question of the influence of linking language means on subordinate relations structuring in a complex sentence (based on the material of the Russian and German languages)","volume":"23","author":"Skrebova","year":"2020","journal-title":"Russ. Linguist. Bull."},{"key":"ref_3","first-page":"265","article-title":"Communicative and pragmatic function of punctuation","volume":"97","author":"Yeskermessova","year":"2017","journal-title":"Man India"},{"key":"ref_4","first-page":"96","article-title":"Comparison of combined sentences (on the example of Uzbek and Kazakh school textbooks)","volume":"9","author":"Qodirova","year":"2021","journal-title":"Pindus J. Cult. Lit. ELT"},{"key":"ref_5","first-page":"126","article-title":"Semantics and translation of conditional patterns from Kazakh into English and their usage in the language and speech","volume":"11311","author":"Luchkova","year":"2013","journal-title":"Mod. Sci. Res. Pract. Appl."},{"key":"ref_6","unstructured":"Amanzholov, S. (2022, May 22). Sarsen Amanzholov\u2019s Research in the Field of Syntax. Available online: https:\/\/www.referat911.ru\/Literatura\/srsen-amanzholovty-sintaksis-salasyna-atysty\/223386-2436719-place7.html."},{"key":"ref_7","first-page":"563","article-title":"The absence of Turkic complex sentences as a consequence of the internal structure of agglutinative languages [\u041e\u0442\u0441\u0443\u0442\u0441\u0442\u0432\u0438\u0435 \u0442\u044e\u0440\u043a\u0441\u043a\u0438\u0445 \u0441\u043b\u043e\u0436\u043d\u043e\u043f\u043e\u0434\u0447\u0438\u043d\u0435\u043d\u043d\u044b\u0445 \u043f\u0440\u0435\u0434\u043b\u043e\u0436\u0435\u043d\u0438\u0439 \u043a\u0430\u043a \u0441\u043b\u0435\u0434\u0441\u0442\u0432\u0438\u0435 \u0432\u043d\u0443\u0442\u0440\u0435\u043d\u043d\u0435\u0433\u043e \u0443\u0441\u0442\u0440\u043e\u0439\u0441\u0442\u0432\u0430 \u0430\u0433\u0433\u043b\u044e\u0442\u0438\u043d\u0430\u0442\u0438\u0432\u043d\u044b\u0445 \u044f\u0437\u044b\u043a\u043e\u0432]","volume":"8","author":"Dubrovina","year":"2017","journal-title":"RUDN J. Lang. Stud. Semiot. Semant."},{"key":"ref_8","unstructured":"Naber, D. (2003). A Rule-Based Style and Grammar Checker. [Bachelor\u2019s Thesis, University of Bielefeld]."},{"key":"ref_9","unstructured":"Sidorov, G., Gupta, A., Tozer, M., Catala, D., Catena, A., and Fuentes, S. (2013, January 8\u20139). Rule-based system for automatic grammar correction using syntactic n-grams for English language learning (L2). Proceedings of the Seventeenth Conference on Computational Natural Language Learning: Shared Task, Association for Computational Linguistics, Sofia, Bulgaria."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Chodorow, M., Tetreault, J.R., and Han, N. (2007, January 28). Detection of grammatical errors involving prepositions. Proceedings of the fourth ACL-SIGSEM workshop on prepositions, Association for Computational Linguistics, Prague, Czech Republic.","DOI":"10.3115\/1654629.1654635"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1162\/tacl_a_00251","article-title":"Grammar error correction in morphologically rich languages: The case of Russian","volume":"7","author":"Rozovskaya","year":"2019","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1515\/comp-2019-0019","article-title":"Augmenting neural machine translation through round-trip training approach","volume":"9","author":"Ahmadnia","year":"2019","journal-title":"Open Comput. Sci."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Ahmadnia, B., Aranovich, R., and Dorr, B. (2021, January 4\u20136). Strengthening Low-resource Neural Machine Translation through Joint Learning: The Case of Farsi-Spanish. Proceedings of the 13th International Conference on Agents and Artificial Intelligence (ICAART 2021), Vienna, Austria. Available online: https:\/\/www.scitepress.org\/Link.aspx?doi=10.5220\/0010362604750481.","DOI":"10.5220\/0010362604750481"},{"key":"ref_14","unstructured":"Kuwanto, G., Feyza Aky\u00fcrek, A., Chara Tourni, I., Li, S., Jones, A.G., and Wijaya, D. (2021). Low-Resource Machine Translation Training Curriculum Fit for Low-Resource Languages. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Bekmanova, G., Yergesh, B., Sharipbay, A., Omarbekova, A., and Zakirova, A. (2022). Linguistic Foundations of Low-Resource Languages for Speech Synthesis on the Example of the Kazakh Language. Computational Science and Its Applications\u2014ICCSA 2022 Workshops, Springer. Lecture Notes in Computer Science.","DOI":"10.1007\/978-3-031-10545-6_1"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Vania, C., Kementchedjhieva, Y., S\u00f8gaard, A., and Lopez, A. (2019, January 5\u20137). A systematic comparison of methods for low-resource dependency parsing on genuinely low-resource languages. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, Hong Kong, China.","DOI":"10.18653\/v1\/D19-1102"},{"key":"ref_17","unstructured":"Washington, J.N., Salimzyanov, I., and Tyers, F.M. (2014, January 26\u201331). Finite-state morphological transducers for three Kypchak languages. Proceedings of the 9th Conference on Language Resources and Evaluation, LREC2014, Reykjavik, Iceland."},{"key":"ref_18","unstructured":"(2017). Development of a Free\/Open System of Machine Translation from Kazakh into English and Russian (and Vice Versa) Based on the Apertium Platform: Research Report (Final)\/Leader, Tukeyev U.A.. (In Russian)."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Nguyen, N., Papadopoulos, G., J\u0119drzejowicz, P., Trawi\u0144ski, B., and Vossen, G. (2017). Combined Technology of Lexical Selection in Rule-Based Machine Translation. Computational Collective Intelligence, Springer. Lecture Notes in Computer Science.","DOI":"10.1007\/978-3-319-67077-5"},{"key":"ref_20","unstructured":"(2022, June 13). NLP-KazNU. Available online: https:\/\/github.com\/NLP-KazNU\/KazCorpus_Compound_and_Complex_Sentences."},{"key":"ref_21","unstructured":"Coughlin, D. (2003, January 23\u201327). Correlating Automated and Human Assessments of Machine Translation Quality. Proceedings of the Machine Translation Summit IX: Papers, New Orleans, LA, USA."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Koehn, P. (2010). Statistical Machine Translation, Cambridge University Press.","DOI":"10.1017\/CBO9780511815829"},{"key":"ref_23","unstructured":"Snover, M., Dorr, B., Schwartz, R., Micciulla, L., and Makhoul, J. (2006, January 8\u201312). A study of translation edit rate with targeted human annotation. Proceedings of the 7th Conference of the Association for Machine Translation of the Americas (AMTA 2006), Visions for the Future of Machine Translation, Cambridge, MA, USA."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/13\/9\/411\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:20:27Z","timestamp":1760142027000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/13\/9\/411"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,30]]},"references-count":23,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2022,9]]}},"alternative-id":["info13090411"],"URL":"https:\/\/doi.org\/10.3390\/info13090411","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,8,30]]}}}