{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,7,28]],"date-time":"2024-07-28T22:29:37Z","timestamp":1722205777010},"reference-count":35,"publisher":"MIT Press - Journals","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computational Linguistics"],"published-print":{"date-parts":[[2013,12]]},"abstract":"<jats:p> Translation models used for statistical machine translation are compiled from parallel corpora that are manually translated. The common assumption is that parallel texts are symmetrical: The direction of translation is deemed irrelevant and is consequently ignored. Much research in Translation Studies indicates that the direction of translation matters, however, as translated language (translationese) has many unique properties. It has already been shown that phrase tables constructed from parallel corpora translated in the same direction as the translation task outperform those constructed from corpora translated in the opposite direction. <\/jats:p><jats:p> We reconfirm that this is indeed the case, but emphasize the importance of also using texts translated in the \u201cwrong\u201d direction. We take advantage of information pertaining to the direction of translation in constructing phrase tables by adapting the translation model to the special properties of translationese. We explore two adaptation techniques: First, we create a mixture model by interpolating phrase tables trained on texts translated in the \u201cright\u201d and the \u201cwrong\u201d directions. The weights for the interpolation are determined by minimizing perplexity. Second, we define entropy-based measures that estimate the correspondence of target-language phrases to translationese, thereby eliminating the need to annotate the parallel corpus with information pertaining to the direction of translation. We show that incorporating these measures as features in the phrase tables of statistical machine translation systems results in consistent, statistically significant improvement in the quality of the translation. <\/jats:p>","DOI":"10.1162\/coli_a_00159","type":"journal-article","created":{"date-parts":[[2013,3,20]],"date-time":"2013-03-20T19:23:31Z","timestamp":1363807411000},"page":"999-1023","source":"Crossref","is-referenced-by-count":7,"title":["Improving Statistical Machine Translation by Adapting Translation Models to Translationese"],"prefix":"10.1162","volume":"39","author":[{"given":"Gennadi","family":"Lembersky","sequence":"first","affiliation":[{"name":"University of Haifa, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Noam","family":"Ordan","sequence":"additional","affiliation":[{"name":"University of Haifa, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shuly","family":"Wintner","sequence":"additional","affiliation":[{"name":"University of Haifa, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"281","reference":[{"key":"R1","volume-title":"Interpretation and the Language of Translation: Creativity and Conventions in Translation.","author":"Al-Shabab Omar S.","year":"1996"},{"key":"R2","first-page":"355","volume-title":"Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing","author":"Axelrod Amittai","year":"2011"},{"key":"R3","doi-asserted-by":"publisher","DOI":"10.1075\/z.64.15bak"},{"key":"R4","doi-asserted-by":"publisher","DOI":"10.1093\/llc\/fqi039"},{"key":"R5","first-page":"84","volume-title":"Routledge Encyclopedia of Translation Studies.","author":"Beeby Alison","year":"2009","edition":"2"},{"key":"R6","first-page":"17","volume-title":"Interlingual and Intercultural Communication Discourse and Cognition in Translation and Second Language Acquisition Studies","volume":"35","author":"Blum-Kulka Shoshana","year":"1986"},{"key":"R7","first-page":"119","volume-title":"Strategies in Interlanguage Communication.","author":"Blum-Kulka Shoshana","year":"1983"},{"key":"R9","first-page":"176","volume-title":"Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies","author":"Clark Jonathan H.","year":"2011"},{"key":"R10","first-page":"85","volume-title":"Proceedings of the Sixth Workshop on Statistical Machine Translation","author":"Denkowski Michael","year":"2011"},{"key":"R11","first-page":"1","volume-title":"Proceedings of The International Symposium on Using Corpora in Contrastive and Translation Studies","author":"Ferraresi Adriano","year":"2008"},{"key":"R12","first-page":"451","volume-title":"Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing","author":"Foster George","year":"2010"},{"key":"R13","doi-asserted-by":"publisher","DOI":"10.1145\/595576.595578"},{"key":"R14","first-page":"88","volume-title":"Translation Studies in Scandinavia.","author":"Gellerstam Martin","year":"1986"},{"key":"R15","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-12116-6_43"},{"key":"R16","first-page":"967","volume-title":"Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)","author":"Johnson Howard","year":"2007"},{"key":"R17","first-page":"388","volume-title":"Proceedings of EMNLP 2004","author":"Koehn Philipp","year":"2004"},{"key":"R18","first-page":"79","volume-title":"Proceedings of the Tenth Machine Translation Summit","author":"Koehn Philipp","year":"2005"},{"key":"R19","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511815829"},{"key":"R20","doi-asserted-by":"publisher","DOI":"10.3115\/1557769.1557821"},{"key":"R21","doi-asserted-by":"publisher","DOI":"10.3115\/1626355.1626388"},{"key":"R22","first-page":"1318","volume-title":"Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies","author":"Koppel Moshe","year":"2011"},{"key":"R23","first-page":"81","volume-title":"Proceedings of MT-Summit XII","author":"Kurokawa David","year":"2009"},{"key":"R24","doi-asserted-by":"publisher","DOI":"10.7202\/003425ar"},{"key":"R25","first-page":"363","volume-title":"Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing","author":"Lembersky Gennadi","year":"2011"},{"key":"R26","first-page":"255","volume-title":"Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics","author":"Lembersky Gennadi","year":"2012"},{"key":"R27","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00111"},{"key":"R28","first-page":"220","volume-title":"Proceedings of the ACL 2010 Conference, Short Papers","author":"Moore Robert C.","year":"2010"},{"key":"R29","doi-asserted-by":"publisher","DOI":"10.3115\/1075096.1075117"},{"key":"R30","doi-asserted-by":"publisher","DOI":"10.3115\/1075218.1075274"},{"key":"R31","first-page":"311","volume-title":"ACL '02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics","author":"Papineni Kishore","year":"2002"},{"key":"R32","doi-asserted-by":"publisher","DOI":"10.1075\/forum.5.2.05pav"},{"key":"R33","first-page":"539","volume-title":"Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics","author":"Sennrich Rico","year":"2012"},{"key":"R34","first-page":"223","volume-title":"Proceedings of the 7th Conference of the Association for Machine Translation of the Americas (AMTA-2006)","author":"Snover Matthew","year":"2006"},{"key":"R36","doi-asserted-by":"publisher","DOI":"10.1075\/btl.4"},{"key":"R37","doi-asserted-by":"publisher","DOI":"10.3115\/1599081.1599199"}],"container-title":["Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/COLI_a_00159","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T21:27:28Z","timestamp":1615584448000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/coli\/article\/39\/4\/999-1023\/1446"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,12]]},"references-count":35,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2013,12]]}},"alternative-id":["10.1162\/COLI_a_00159"],"URL":"https:\/\/doi.org\/10.1162\/coli_a_00159","relation":{},"ISSN":["0891-2017","1530-9312"],"issn-type":[{"value":"0891-2017","type":"print"},{"value":"1530-9312","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,12]]}}}