{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,28]],"date-time":"2025-09-28T20:43:53Z","timestamp":1759092233205},"reference-count":42,"publisher":"MIT Press","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computational Linguistics"],"published-print":{"date-parts":[[2015,6]]},"abstract":"<jats:p>In this article, we present a novel machine translation model, the Operation Sequence Model (OSM), which combines the benefits of phrase-based and N-gram-based statistical machine translation (SMT) and remedies their drawbacks. The model represents the translation process as a linear sequence of operations. The sequence includes not only translation operations but also reordering operations. As in N-gram-based SMT, the model is: (i) based on minimal translation units, (ii) takes both source and target information into account, (iii) does not make a phrasal independence assumption, and (iv) avoids the spurious phrasal segmentation problem. As in phrase-based SMT, the model (i) has the ability to memorize lexical reordering triggers, (ii) builds the search graph dynamically, and (iii) decodes with large translation units during search. The unique properties of the model are (i) its strong coupling of reordering and translation where translation and reordering decisions are conditioned on n previous translation and reordering decisions, and (ii) the ability to model local and long-range reorderings consistently. Using BLEU as a metric of translation accuracy, we found that our system performs significantly better than state-of-the-art phrase-based systems (Moses and Phrasal) and N-gram-based systems (Ncode) on standard translation tasks. We compare the reordering component of the OSM to the Moses lexical reordering model by integrating it into Moses. Our results show that OSM outperforms lexicalized reordering on all translation tasks. 
The translation quality is shown to be improved further by learning generalized representations with a POS-based OSM.<\/jats:p>","DOI":"10.1162\/coli_a_00218","type":"journal-article","created":{"date-parts":[[2015,4,30]],"date-time":"2015-04-30T12:34:32Z","timestamp":1430397272000},"page":"185-214","source":"Crossref","is-referenced-by-count":19,"title":["The Operation Sequence Model\u2014Combining N-Gram-Based and Phrase-Based Statistical Machine Translation"],"prefix":"10.1162","volume":"41","author":[{"given":"Nadir","family":"Durrani","sequence":"first","affiliation":[{"name":"QCRI Qatar"}]},{"given":"Helmut","family":"Schmid","sequence":"additional","affiliation":[{"name":"LMU Munich"}]},{"given":"Alexander","family":"Fraser","sequence":"additional","affiliation":[{"name":"LMU Munich"}]},{"given":"Philipp","family":"Koehn","sequence":"additional","affiliation":[{"name":"University of Edinburgh"}]},{"given":"Hinrich","family":"Sch\u00fctze","sequence":"additional","affiliation":[{"name":"LMU Munich"}]}],"member":"281","reference":[{"key":"R1","unstructured":"Birch, Alexandra, Nadir Durrani, and Philipp Koehn. 2013. Edinburgh SLT and MT System Description for the IWSLT 2013 Evaluation. In Proceedings of the 10th International Workshop on Spoken Language Translation, pages 40\u201348, Heidelberg."},{"key":"R2","unstructured":"Bisazza, Arianna and Marcello Federico. 2013. Efficient Solutions for Word Reordering in German-English Phrase-Based Statistical Machine Translation. In Proceedings of the Eighth Workshop on Statistical Machine Translation, pages 440\u2013451, Sofia."},{"key":"R3","unstructured":"Brown, Peter F., Stephen A. Della Pietra, Vincent J. Della Pietra, and R. L. Mercer. 1993. The Mathematics of Statistical Machine Translation: Parameter Estimation. 
Computational Linguistics, 19(2):263\u2013311."},{"key":"R4","doi-asserted-by":"publisher","DOI":"10.1162\/089120104323093294"},{"key":"R5","unstructured":"Cer, Daniel, Michel Galley, Daniel Jurafsky, and Christopher D. Manning. 2010. Phrasal: A Statistical Machine Translation Toolkit for Exploring New Model Features. In Proceedings of the North American Chapter of ACL 2010 Demonstration Session, pages 9\u201312, Los Angeles, CA."},{"key":"R6","unstructured":"Cherry, Colin. 2013. Improved Reordering for Phrase-Based Translation Using Sparse Features. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 22\u201331, Atlanta, GA."},{"key":"R7","doi-asserted-by":"publisher","DOI":"10.1162\/coli.2007.33.2.201"},{"key":"R8","doi-asserted-by":"publisher","DOI":"10.3115\/1614108.1614143"},{"key":"R10","doi-asserted-by":"publisher","DOI":"10.1007\/s10590-007-9024-z"},{"key":"R11","unstructured":"Crego, Josep M. and Jos\u00e9 B. Mari\u00f1o. 2007. Syntax-Enhanced N-gram-Based SMT. In Proceedings of the 11th Machine Translation Summit, pages 111\u2013118, Copenhagen."},{"key":"R12","unstructured":"Crego, Josep M. and Fran\u00e7ois Yvon. 2009. Gappy Translation Units under Left-to-Right SMT Decoding. In Proceedings of the Meeting of the European Association for Machine Translation, pages 66\u201373, Barcelona."},{"key":"R13","unstructured":"Crego, Josep M. and Fran\u00e7ois Yvon. 2010. Improving Reordering with Linguistically Informed Bilingual N-Grams. In COLING 2010: Posters, pages 197\u2013205, Beijing."},{"key":"R14","doi-asserted-by":"publisher","DOI":"10.2478\/v10108-011-0010-5"},{"key":"R15","unstructured":"Durrani, Nadir, Alexander Fraser, and Helmut Schmid. 2013. Model With Minimal Translation Units, But Decode With Phrases. 
In the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 1\u201311, Atlanta, GA."},{"key":"R16","unstructured":"Durrani, Nadir, Alexander Fraser, Helmut Schmid, Hieu Hoang, and Philipp Koehn. 2013a. Can Markov Models Over Minimal Translation Units Help Phrase-Based SMT? In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pages 399\u2013405, Sofia."},{"key":"R17","unstructured":"Durrani, Nadir, Barry Haddow, Kenneth Heafield, and Philipp Koehn. 2013b. Edinburgh's Machine Translation Systems for European Language Pairs. In Proceedings of the Eighth Workshop on Statistical Machine Translation, pages 114\u2013121, Sofia."},{"key":"R18","unstructured":"Durrani, Nadir, Philipp Koehn, Helmut Schmid, and Alexander Fraser. 2014. Investigating the Usefulness of Generalized Word Representations in SMT. In Proceedings of the 25th Annual Conference on Computational Linguistics (COLING), pages 421\u2013432, Dublin."},{"key":"R19","unstructured":"Durrani, Nadir, Helmut Schmid, and Alexander Fraser. 2011. A Joint Sequence Translation Model with Integrated Reordering. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 1,045\u20131,054, Portland, OR."},{"key":"R20","doi-asserted-by":"crossref","unstructured":"Galley, Michel and Christopher D. Manning. 2008. A Simple and Effective Hierarchical Phrase Reordering Model. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pages 848\u2013856, Honolulu, HI.","DOI":"10.3115\/1613715.1613824"},{"key":"R21","unstructured":"Galley, Michel and Christopher D. Manning. 2010. Accurate Non-Hierarchical Phrase-Based Translation. 
In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 966\u2013974, Los Angeles, CA."},{"key":"R22","doi-asserted-by":"crossref","unstructured":"Gispert, Adri\u00e0 and Jos\u00e9 B. Mari\u00f1o. 2006. Linguistic Tuple Segmentation in N-Gram-Based Statistical Machine Translation. In INTERSPEECH, pages 1,149\u20131,152, Pittsburgh, PA.","DOI":"10.21437\/Interspeech.2006-350"},{"key":"R23","unstructured":"Green, Spence, Michel Galley, and Christopher D. Manning. 2010. Improved Models of Distortion Cost for Statistical Machine Translation. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 867\u2013875, Los Angeles, CA."},{"key":"R25","doi-asserted-by":"publisher","DOI":"10.2478\/v10108-010-0008-4"},{"key":"R26","unstructured":"Huang, Liang and David Chiang. 2007. Forest Rescoring: Faster Decoding with Integrated Language Models. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 144\u2013151, Prague."},{"key":"R27","unstructured":"Kneser, Reinhard and Hermann Ney. 1995. Improved Backing-off for M-gram Language Modeling. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 181\u2013184."},{"key":"R28","doi-asserted-by":"crossref","unstructured":"Koehn, Philipp. 2004a. Pharaoh: A Beam Search Decoder for Phrase-Based Statistical Machine Translation Models. In Association for Machine Translation in the Americas, pages 115\u2013124, Washington, DC.","DOI":"10.1007\/978-3-540-30194-3_13"},{"key":"R29","unstructured":"Koehn, Philipp. 2004b. Statistical Significance Tests for Machine Translation Evaluation. 
In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, pages 388\u2013395, Barcelona."},{"key":"R31","unstructured":"Koehn, Philipp, Amittai Axelrod, Alexandra Birch, Chris Callison-Burch, Miles Osborne, and David Talbot. 2005. Edinburgh System Description for the 2005 IWSLT Speech Translation Evaluation. In International Workshop on Spoken Language Translation, pages 68\u201375, Pittsburgh, PA."},{"key":"R32","unstructured":"Koehn, Philipp and Hieu Hoang. 2007. Factored Translation Models. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pages 868\u2013876, Prague."},{"key":"R33","unstructured":"Koehn, Philipp, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, and Evan Herbst. 2007. Moses: Open Source Toolkit for Statistical Machine Translation. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics: Demonstrations, pages 117\u2013180, Prague."},{"key":"R34","doi-asserted-by":"crossref","unstructured":"Koehn, Philipp, Franz J. Och, and Daniel Marcu. 2003. Statistical Phrase-Based Translation. In 2003 Meeting of the North American Chapter of the Association for Computational Linguistics, pages 127\u2013133, Edmonton.","DOI":"10.3115\/1073445.1073462"},{"key":"R35","unstructured":"Kumar, Shankar and William J. Byrne. 2004. Minimum Bayes-Risk Decoding for Statistical Machine Translation. In Human Language Technologies: The 2004 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 169\u2013176, Boston, MA."},{"key":"R36","doi-asserted-by":"publisher","DOI":"10.1162\/coli.2006.32.4.527"},{"key":"R37","unstructured":"Moore, Robert and Chris Quirk. 2007. 
Faster Beam Search Decoding for Phrasal Statistical Machine Translation. In Proceedings of the 11th Machine Translation Summit, Copenhagen."},{"key":"R39","doi-asserted-by":"crossref","unstructured":"Och, Franz J. 2003. Minimum Error Rate Training in Statistical Machine Translation. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, pages 160\u2013167, Sapporo.","DOI":"10.3115\/1075096.1075117"},{"key":"R40","doi-asserted-by":"publisher","DOI":"10.1162\/089120103321337421"},{"key":"R41","doi-asserted-by":"publisher","DOI":"10.1162\/0891201042544884"},{"key":"R44","doi-asserted-by":"crossref","unstructured":"Stolcke, Andreas. 2002. SRILM - An Extensible Language Modeling Toolkit. In International Conference on Spoken Language Processing, Denver, CO.","DOI":"10.21437\/ICSLP.2002-303"},{"key":"R45","doi-asserted-by":"crossref","unstructured":"Tillmann, Christoph and Tong Zhang. 2005. A Localized Prediction Model for Statistical Machine Translation. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, pages 557\u2013564, Ann Arbor, MI.","DOI":"10.3115\/1219840.1219909"},{"key":"R46","unstructured":"Vaswani, Ashish, Haitao Mi, Liang Huang, and David Chiang. 2011. Rule Markov Models for Fast Tree-to-String Translation. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 856\u2013864, Portland, OR."},{"key":"R47","doi-asserted-by":"publisher","DOI":"10.2478\/v10108-009-0018-2"},{"key":"R48","unstructured":"Zhang, Hui, Kristina Toutanova, Chris Quirk, and Jianfeng Gao. 2013. Beyond Left-to-Right: Multiple Decomposition Structures for SMT. 
In the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 12\u201321, Atlanta, GA."}],"container-title":["Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/COLI_a_00218","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,9]],"date-time":"2023-08-09T23:07:24Z","timestamp":1691622444000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/coli\/article\/41\/2\/185-214\/1505"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,6]]},"references-count":42,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2015,6]]}},"alternative-id":["10.1162\/COLI_a_00218"],"URL":"https:\/\/doi.org\/10.1162\/coli_a_00218","relation":{},"ISSN":["0891-2017","1530-9312"],"issn-type":[{"value":"0891-2017","type":"print"},{"value":"1530-9312","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,6]]}}}