{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T07:24:14Z","timestamp":1771485854196,"version":"3.50.1"},"reference-count":30,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2020,4,17]],"date-time":"2020-04-17T00:00:00Z","timestamp":1587081600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"The Scientific and Technological Research Council of Turkey","award":["3170959"],"award-info":[{"award-number":["3170959"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2020,7,31]]},"abstract":"<jats:p>In this article, we make use of sequence-to-sequence (seq2seq) models for spelling correction in the agglutinative Turkish language. In the baseline system, misspelled and target words are split into their letters and the letter sequences are fed into the seq2seq model. We prefer letters as the unit of the model due to the agglutinative nature of Turkish, which results in an impractical dictionary size when words are used as a dictionary unit. In order to improve the baseline performance, we incorporate right and left context of the misspelled words. All context words are represented with their first three consonants in the context-dependent model. We train the seq2seq models using a large text corpus collected automatically from the Internet. The corpus contains approximately 4 million sentences. We randomly introduce substitution, deletion, and insertion spelling errors to the words in the corpus. We test the performance of the proposed context-dependent seq2seq model using synthetic and realistic test sets. The synthetic test set is constructed similar to the training set. The realistic test set contains human-made misspellings from Twitter messages. In the experiments, we observed that the proposed context-dependent model performs significantly better than the baseline system. Its correction accuracy reaches 94% on the synthetic dataset. Additionally, the proposed method provides 2.1% absolute improvement over a state-of-the-art Turkish spelling correction system on the Twitter test set.<\/jats:p>","DOI":"10.1145\/3383200","type":"journal-article","created":{"date-parts":[[2020,5,4]],"date-time":"2020-05-04T06:53:19Z","timestamp":1588575199000},"page":"1-16","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Context-Dependent Sequence-to-Sequence Turkish Spelling Correction"],"prefix":"10.1145","volume":"19","author":[{"given":"Osman","family":"B\u00fcy\u00fck","sequence":"first","affiliation":[{"name":"Kocaeli University and Sestek Speech Enabled Software Technologies Incorporation, Kocaeli, TURKEY"}]}],"member":"320","published-online":{"date-parts":[[2020,4,17]]},"reference":[{"key":"e_1_2_1_1_1","first-page":"1","article-title":"Zemberek, an open source NLP framework for Turkic languages","volume":"10","author":"Ak\u0131n Ahmet Afsin","year":"2007","journal-title":"Structure"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7178336"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.5555\/2390940.2390943"},{"key":"e_1_2_1_4_1","unstructured":"Dzmitry Bahdanau Kyunghyun Cho and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. Arxiv Preprint Arxiv:1409.0473 (2014).  Dzmitry Bahdanau Kyunghyun Cho and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. Arxiv Preprint Arxiv:1409.0473 (2014)."},{"key":"e_1_2_1_5_1","volume-title":"AIML 2005 Conference CICC","author":"Barari Loghman","year":"2005"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2556288.2557414"},{"key":"e_1_2_1_7_1","unstructured":"Osman B\u00fcy\u00fck. 2005. Sub-world Language Modelling for Turkish Speech Recognition. Ph.D. Dissertation.  Osman B\u00fcy\u00fck. 2005. Sub-world Language Modelling for Turkish Speech Recognition. Ph.D. Dissertation."},{"key":"e_1_2_1_8_1","unstructured":"\u00c7agri \u00c7\u00f6ltekin. 2014. A set of open source tools for Turkish natural language processing. In LREC. 1079--1086.  \u00c7agri \u00c7\u00f6ltekin. 2014. A set of open source tools for Turkish natural language processing. In LREC. 1079--1086."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2005.1566516"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-3021"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-2317"},{"key":"e_1_2_1_12_1","unstructured":"Shaona Ghosh and Per Ola Kristensson. 2017. Neural networks for text correction and completion in keyboard decoding. Arxiv Preprint Arxiv:1709.06429 (2017).  Shaona Ghosh and Per Ola Kristensson. 2017. Neural networks for text correction and completion in keyboard decoding. Arxiv Preprint Arxiv:1709.06429 (2017)."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1051"},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the 3rd International Joint Conference on Natural Language Processing: Volume-II.","author":"Hassan Ahmed","year":"2008"},{"key":"e_1_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Geoffrey Hinton Li Deng Dong Yu George Dahl Abdel-rahman Mohamed Navdeep Jaitly Andrew Senior Vincent Vanhoucke Patrick Nguyen Brian Kingsbury et\u00a0al. 2012. Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Processing Magazine 29 (2012).  Geoffrey Hinton Li Deng Dong Yu George Dahl Abdel-rahman Mohamed Navdeep Jaitly Andrew Senior Vincent Vanhoucke Patrick Nguyen Brian Kingsbury et\u00a0al. 2012. Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Processing Magazine 29 (2012).","DOI":"10.1109\/MSP.2012.2205597"},{"key":"e_1_2_1_16_1","volume-title":"Two-level Morphology: A General Computational Model for Word-form Recognition and Production.","author":"Koskenniemi Kimmo","year":"1983"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1040830.1040867"},{"key":"e_1_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Minh-Thang Luong Hieu Pham and Christopher D. Manning. 2015. Effective approaches to attention-based neural machine translation. Arxiv Preprint Arxiv:1508.04025 (2015).  Minh-Thang Luong Hieu Pham and Christopher D. Manning. 2015. Effective approaches to attention-based neural machine translation. Arxiv Preprint Arxiv:1508.04025 (2015).","DOI":"10.18653\/v1\/D15-1166"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1093\/llc\/9.2.137"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.5555\/234285.234293"},{"key":"e_1_2_1_21_1","unstructured":"Kemal Oflazer Elvan G\u00f6\u00e7men and Cem Boz\u015fahin. 1994. An outline of Turkish morphology. Report to NATO Science Division SfS III (TU-LANGUAGE) Brussels (1994).  Kemal Oflazer Elvan G\u00f6\u00e7men and Cem Boz\u015fahin. 1994. An outline of Turkish morphology. Report to NATO Science Division SfS III (TU-LANGUAGE) Brussels (1994)."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2015.2420092"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the 5th Language and Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics. 51--55","author":"Rios Annette","year":"2011"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2017.7863722"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the 1st International Conference on Turkic Computational Linguistics at CICLING, Konya. 7--11","author":"Torunoglu-Selamet Dilara","year":"2016"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2702123.2702135"},{"key":"e_1_2_1_27_1","unstructured":"Oriol Vinyals and Quoc Le. 2015. A neural conversational model. Arxiv Preprint Arxiv:1506.05869 (2015).  Oriol Vinyals and Quoc Le. 2015. A neural conversational model. Arxiv Preprint Arxiv:1506.05869 (2015)."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/1699571.1699629"},{"key":"e_1_2_1_29_1","unstructured":"Yonghui Wu Mike Schuster Zhifeng Chen Quoc V. Le Mohammad Norouzi Wolfgang Macherey Maxim Krikun Yuan Cao Qin Gao Klaus Macherey et\u00a0al. 2016. Google\u2019s neural machine translation system: Bridging the gap between human and machine translation. Arxiv Preprint Arxiv:1609.08144 (2016).  Yonghui Wu Mike Schuster Zhifeng Chen Quoc V. Le Mohammad Norouzi Wolfgang Macherey Maxim Krikun Yuan Cao Qin Gao Klaus Macherey et\u00a0al. 2016. Google\u2019s neural machine translation system: Bridging the gap between human and machine translation. Arxiv Preprint Arxiv:1609.08144 (2016)."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6639215"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3383200","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3383200","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:02:00Z","timestamp":1750197720000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3383200"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,4,17]]},"references-count":30,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2020,7,31]]}},"alternative-id":["10.1145\/3383200"],"URL":"https:\/\/doi.org\/10.1145\/3383200","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,4,17]]},"assertion":[{"value":"2019-03-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-02-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-04-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}