Loading…
SITD-NMT: Synchronous Inference NMT with Turing Re-Translation Detection
Conventional Neural Machine Translation (NMT) relies on previous tokens and the hidden state of the target for the inference of the target tokens in the decoding phase, and this left-to-right decoding approach loses the context of the target sequence from right-to-left. In addition, the vanilla atte...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Conventional Neural Machine Translation (NMT) relies on previous tokens and the hidden state of the target for the inference of the target tokens in the decoding phase, and this left-to-right decoding approach loses the context of the target sequence from right-to-left. In addition, the vanilla attention mechanism lacks interactivity in the bilingual training phase, where the computation of the attention weights is independent at each step, which leads the decoder to disregard whether or not the current token has already been translated. In this paper, we proposed a novel interactive synchronized bi-directional inference method that uses past and future contexts to synchronously predict target sequences and incorporates Neural Turing Machine (NTM) ideas to detect historical attentional information, which relies on a read − write mechanism to update the source hidden state. We evaluate the proposed model in the WMT14 German-English translation task and the LDC Chinese-English translation task, and the experimental results show that our method improves 2.18 and 4.07 BLEU scores over the Transformer, respectively, which fully demonstrates the effectiveness of the proposed method. |
---|---|
ISSN: | 2161-4407 |
DOI: | 10.1109/IJCNN60899.2024.10650580 |