{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,2,3]],"date-time":"2024-02-03T00:29:01Z","timestamp":1706920141458},"reference-count":0,"publisher":"Interciencia","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Interciencia"],"published-print":{"date-parts":[[2024]]},"abstract":"<jats:p>The Transformer-based neural machine translation (NMT) model has been very successful in recent years and has become a new mainstream method. However, using them in lowresourced languages requires large amounts of data and efficient model configuration (hyperparameter tuning) mechanisms. The scarcity of parallel texts is a bottleneck for high quality (N) MTs, especially for under resourced languages like Amharic. As a result, this paper presents an attempt to improve English-Amharic MT by introducing three different vanilla Transformer architectures, with different hyper-parameter values. To obtain additional training material, offline token level corpus augmentation was applied to the previously collected English-Amharic parallel corpus. Compared to previous work on Amharic MT, the best of the three Transformer models have achieved state-of-the-art BLEU scores. In fact, we were able to achieve this result by employing corpus augmentation techniques and hyper-parameter tuning.<\/jats:p>","DOI":"10.59671\/mbulj","type":"journal-article","created":{"date-parts":[[2024,2,2]],"date-time":"2024-02-02T14:01:13Z","timestamp":1706882473000},"source":"Crossref","is-referenced-by-count":0,"title":["Boosting English-Amharic machine translation using corpus augmentation and Transformer"],"prefix":"10.59671","author":[{"given":"Yohannes","family":"Biadgligne","sequence":"first","affiliation":[]},{"given":"Kamel","family":"Smaili","sequence":"additional","affiliation":[]}],"member":"38994","published-online":{"date-parts":[[2024]]},"container-title":["Interciencia"],"original-title":[],"deposited":{"date-parts":[[2024,2,2]],"date-time":"2024-02-02T14:01:14Z","timestamp":1706882474000},"score":1,"resource":{"primary":{"URL":"http:\/\/informaticajournal.com\/informatica\/index.php\/landing\/index\/MBULJ"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024]]},"references-count":0,"URL":"https:\/\/doi.org\/10.59671\/mbulj","relation":{},"ISSN":["0378-1844","0378-1844"],"issn-type":[{"value":"0378-1844","type":"print"},{"value":"0378-1844","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024]]}}}