Loading…

Integration of a Multilingual Preordering Component into a Commercial SMT Platform

We present a multilingual preordering component tailored for a commercial Statistical Machine translation platform. In commercial settings, issues such as processing speed as well as the ability to adapt models to the customers’ needs play a significant role and have a big impact on the choice of ap...

Full description

Saved in:
Bibliographic Details
Published in:Prague bulletin of mathematical linguistics 2017-06, Vol.108 (1), p.61-72
Main Authors: Ramm, Anita, Superbo, Riccardo, Shterionov, Dimitar, O’Dowd, Tony, Fraser, Alexander
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We present a multilingual preordering component tailored for a commercial Statistical Machine translation platform. In commercial settings, issues such as processing speed as well as the ability to adapt models to the customers’ needs play a significant role and have a big impact on the choice of approaches that are added to the custom pipeline to deal with specific problems such as long-range reorderings. We developed a fast and customisable preordering component, also available as an open-source tool, which comes along with a generic implementation that is restricted neither to the translation platform nor to the Machine Translation paradigm. We test preordering on three language pairs: English →Japanese/German/Chinese for both Statistical Machine Translation (SMT) and Neural Machine Translation (NMT). Our experiments confirm previously reported improvements in the SMT output when the models are trained on preordered data, but they also show that preordering does not improve NMT.
ISSN:1804-0462
0032-6585
1804-0462
DOI:10.1515/pralin-2017-0009