Robust Neural Machine Translation: Modeling Orthographic and Interpunctual Variation

Author(s):  
Toms Bergmanis ◽  
Artūrs Stafanovičs ◽  
Mārcis Pinnis

Neural machine translation systems are typically trained on curated corpora and break when faced with non-standard orthography or punctuation. Resilience to spelling mistakes and typos, however, is crucial as machine translation systems are used to translate texts of informal origins, such as chat conversations, social media posts and web pages. We propose a simple generative noise model to generate adversarial examples of ten different types. We use these to augment machine translation systems’ training data and show that, when tested on noisy data, systems trained using adversarial examples perform almost as well as when translating clean data, while baseline systems’ performance drops by 2-3 BLEU points. To measure the robustness and noise invariance of machine translation systems’ outputs, we use the average translation edit rate between the translation of the original sentence and its noised variants. Using this measure, we show that systems trained on adversarial examples on average yield 50% consistency improvements over baselines trained on clean data.
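As an illustration of this kind of data augmentation, here is a minimal sketch of a generative noise model. The paper's ten specific noise types are not reproduced, so the operations and the rate `p` below (punctuation dropping, character transposition, character duplication) are illustrative assumptions:

```python
import random

PUNCT = set(".,!?;:")

def noise_sentence(sentence: str, p: float = 0.1) -> str:
    """Apply simple character-level noise to simulate informal input.
    Each position is perturbed with probability p by one of: dropping
    punctuation, transposing adjacent characters, or duplicating a
    character (a common typo pattern)."""
    chars = list(sentence)
    out, i = [], 0
    while i < len(chars):
        c = chars[i]
        if random.random() >= p:            # most characters pass through
            out.append(c)
            i += 1
            continue
        op = random.choice(["drop_punct", "swap", "dup"])
        if op == "drop_punct" and c in PUNCT:
            i += 1                          # remove the punctuation mark
        elif op == "swap" and i + 1 < len(chars):
            out.extend([chars[i + 1], c])   # transpose two characters
            i += 2
        else:
            out.extend([c, c])              # duplicate the character
            i += 1
    return "".join(out)

# Augment parallel data: pair noised source sentences with clean targets.
print(noise_sentence("This is a clean training sentence."))
```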

2021 ◽  
pp. 248-262
Author(s):  
Jörg Tiedemann

This paper presents our on-going efforts to develop a comprehensive data set and benchmark for machine translation beyond high-resource languages. The current release includes 500GB of compressed parallel data for almost 3,000 language pairs covering over 500 languages and language variants. We present the structure of the data set and demonstrate its use for systematic studies based on baseline experiments with multilingual neural machine translation between Finno-Ugric languages and other language groups. Our initial results show the capabilities of training effective multilingual translation models with skewed training data but also stress the shortcomings with low-resource settings and the difficulties to obtain sufficient information through straightforward transfer from related languages.
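One standard way to train on such skewed multilingual data is temperature-based sampling, which upsamples low-resource language pairs; this sketch illustrates that general technique rather than the paper's exact setup, and the corpus sizes and temperature below are hypothetical:

```python
def sampling_weights(sizes: dict[str, int], temperature: float = 5.0) -> dict[str, float]:
    """Per-language-pair sampling probabilities proportional to
    (n_i / N) ** (1 / T). T = 1 follows the raw data distribution;
    larger T flattens it, upsampling low-resource pairs."""
    total = sum(sizes.values())
    scaled = {pair: (n / total) ** (1.0 / temperature) for pair, n in sizes.items()}
    norm = sum(scaled.values())
    return {pair: w / norm for pair, w in scaled.items()}

# Example: a heavily skewed corpus of three hypothetical pairs.
sizes = {"eng-fin": 5_000_000, "eng-est": 800_000, "eng-sme": 20_000}
print(sampling_weights(sizes))
```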


Author(s):  
Raj Dabre ◽  
Atsushi Fujita

In encoder-decoder based sequence-to-sequence modeling, the most common practice is to stack a number of recurrent, convolutional, or feed-forward layers in the encoder and decoder. While the addition of each new layer improves the sequence generation quality, it also leads to a significant increase in the number of parameters. In this paper, we propose to share parameters across all layers, thereby leading to a recurrently stacked sequence-to-sequence model. We report on an extensive case study on neural machine translation (NMT) using our proposed method, experimenting with a variety of datasets. We empirically show that the translation quality of a model that recurrently stacks a single layer six times, despite its significantly fewer parameters, approaches that of a model that stacks six different layers. We also show how our method can benefit from a prevalent way of improving NMT, i.e., extending training data with pseudo-parallel corpora generated by back-translation. We then analyze the effects of recurrently stacked layers by visualizing the attentions of models that use recurrently stacked layers and models that do not. Finally, we explore the limits of parameter sharing, where we share even the parameters between the encoder and decoder in addition to recurrently stacking layers.
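The core idea can be sketched in a few lines of PyTorch: one encoder layer's parameters are reused at every stacking step, so computation depth grows while the parameter count stays that of a single layer. The dimensions and layer internals below are illustrative assumptions, not the paper's exact configuration:

```python
import torch
import torch.nn as nn

class RecurrentlyStackedEncoder(nn.Module):
    """Applies a single shared Transformer encoder layer N times.
    The parameter count is that of one layer, while the effective
    depth matches an N-layer stack."""
    def __init__(self, d_model: int = 512, nhead: int = 8, num_steps: int = 6):
        super().__init__()
        self.layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.num_steps = num_steps

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for _ in range(self.num_steps):   # same weights reused at every step
            x = self.layer(x)
        return x

enc = RecurrentlyStackedEncoder()
out = enc(torch.randn(2, 10, 512))        # (batch, seq_len, d_model)
print(out.shape)
```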


2021 ◽  
Author(s):  
Jennifer Handsel ◽  
Brian Matthews ◽  
Nicola Knight ◽  
Simon Coles

We present a sequence-to-sequence machine learning model for predicting the IUPAC name of a chemical from its standard International Chemical Identifier (InChI). The model uses two stacks of transformers in an encoder-decoder architecture, a setup similar to the neural networks used in state-of-the-art machine translation. Unlike neural machine translation, which usually tokenizes input and output into words or sub-words, our model processes the InChI and predicts the IUPAC name character by character. The model was trained on a dataset of 10 million InChI/IUPAC name pairs freely downloaded from the National Library of Medicine’s online PubChem service. Training took five days on a Tesla K80 GPU, and the model achieved test-set accuracies of 95% (character-level) and 91% (whole name). The model performed particularly well on organics, with the exception of macrocycles. The predictions were less accurate for inorganic compounds, with a character-level accuracy of 71%. This can be explained by inherent limitations of InChI for representing inorganics, as well as their low coverage (1.4%) in the training data.
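A minimal sketch of the character-level processing described, with a vocabulary built from the training strings; the special tokens and helper names are assumptions for illustration:

```python
def build_char_vocab(strings: list[str]) -> dict[str, int]:
    """Map every character observed in the data to an integer id,
    reserving the first ids for padding and sequence boundaries."""
    specials = {"<pad>": 0, "<s>": 1, "</s>": 2}
    chars = sorted(set("".join(strings)))
    return {**specials, **{c: i + len(specials) for i, c in enumerate(chars)}}

def encode(s: str, vocab: dict[str, int]) -> list[int]:
    """Turn a string into ids, one per character, wrapped in boundary tokens."""
    return [vocab["<s>"]] + [vocab[c] for c in s] + [vocab["</s>"]]

inchi = "InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3"   # ethanol
vocab = build_char_vocab([inchi])
print(encode(inchi, vocab))
```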


Author(s):  
Anthony Pym ◽  
Ester Torres-Simón

Abstract: As a language-intensive profession, translation is of frontline interest in the era of language automation. In particular, the development of neural machine translation systems since 2016 has brought with it fears that soon there will be no more human translators. When considered in terms of the history of automation, however, any such direct effect is far from obvious: the translation industry is still growing, and machine translation is only one instance of automation. At the same time, data on remuneration indicate structural wage dispersion in professional translation services, with some signs that this dispersion may increase in certain market segments as automated workflows and translation technologies are adopted by large language-service providers more than by smaller companies and individual freelancers. An analysis of recent changes in discourses on and in the translation profession further indicates conceptual adjustments in the profession that may be attributed to growing automation, particularly with respect to an expanding skill set associated with translation, the tendency to combine translation with other forms of communication, and the use of interactive communication skills to authorize and humanize the results of automation.


2020 ◽  
Vol 184 ◽  
pp. 01061
Author(s):  
Anusha Anugu ◽  
Gajula Ramesh

Machine translation has developed gradually since the 1940s. It has gained more and more attention because of its effective and efficient nature, as it performs translation automatically without human effort. This paper summarizes the distinct models of machine translation, including Neural Machine Translation (NMT). Researchers have previously done a great deal of work on machine translation techniques and their evaluation. We therefore present an analysis of existing machine translation techniques, including neural machine translation, their differences, and the translation tools associated with them. Nowadays, combining two machine translation systems offers the advantage of using features from both, which attracts interest in the domain of natural language processing. Accordingly, the paper also includes a literature survey of Hybrid Machine Translation (HMT).


2016 ◽  
Vol 5 (4) ◽  
pp. 51-66 ◽  
Author(s):  
Krzysztof Wolk ◽  
Krzysztof P. Marasek

The quality of machine translation is rapidly evolving. Today one can find several machine translation systems on the web that provide reasonable translations, although the systems are not perfect. In some specific domains, the quality may decrease. A recently proposed approach in this field is neural machine translation. It aims at building a jointly-tuned single neural network that maximizes translation performance, a very different approach from traditional statistical machine translation. Recently proposed neural machine translation models often belong to the encoder-decoder family, in which a source sentence is encoded into a fixed-length vector that is, in turn, decoded to generate a translation. The present research examines the effects of different training methods on a Polish-English machine translation system used for medical data. The European Medicines Agency parallel text corpus was used as the basis for training neural and statistical translation systems. A comparison and implementation of a medical translator is the main focus of our experiments.
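For reference, a minimal sketch of the encoder-decoder family described, in which a GRU encoder compresses the source sentence into one fixed-length vector that initializes the decoder. This is an intentionally simplified, attention-free variant; vocabulary and hidden sizes are arbitrary:

```python
import torch
import torch.nn as nn

class Seq2Seq(nn.Module):
    """Encode the source into a single fixed-length vector, then decode."""
    def __init__(self, src_vocab: int, tgt_vocab: int, d: int = 256):
        super().__init__()
        self.src_emb = nn.Embedding(src_vocab, d)
        self.tgt_emb = nn.Embedding(tgt_vocab, d)
        self.encoder = nn.GRU(d, d, batch_first=True)
        self.decoder = nn.GRU(d, d, batch_first=True)
        self.out = nn.Linear(d, tgt_vocab)

    def forward(self, src: torch.Tensor, tgt: torch.Tensor) -> torch.Tensor:
        _, h = self.encoder(self.src_emb(src))    # h: fixed-length summary
        dec_out, _ = self.decoder(self.tgt_emb(tgt), h)
        return self.out(dec_out)                  # logits over target vocab

model = Seq2Seq(src_vocab=8000, tgt_vocab=8000)
logits = model(torch.randint(0, 8000, (2, 12)), torch.randint(0, 8000, (2, 10)))
print(logits.shape)  # (batch, tgt_len, tgt_vocab)
```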


2018 ◽  
Vol 6 ◽  
pp. 145-157 ◽  
Author(s):  
Zaixiang Zheng ◽  
Hao Zhou ◽  
Shujian Huang ◽  
Lili Mou ◽  
Xinyu Dai ◽  
...  

Existing neural machine translation systems do not explicitly model what has been translated and what has not during the decoding phase. To address this problem, we propose a novel mechanism that separates the source information into two parts: translated Past contents and untranslated Future contents, which are modeled by two additional recurrent layers. The Past and Future contents are fed to both the attention model and the decoder states, which provides Neural Machine Translation (NMT) systems with knowledge of translated and untranslated content. Experimental results show that the proposed approach significantly improves performance on Chinese-English, German-English, and English-German translation tasks. Specifically, the proposed model outperforms the conventional coverage model in terms of both translation quality and alignment error rate.
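A simplified sketch of the mechanism: two recurrent states track translated (Past) and untranslated (Future) source content across decoding steps. The paper's specific update rules are approximated here with plain GRU cells, so treat the details as assumptions:

```python
import torch
import torch.nn as nn

class PastFutureTracker(nn.Module):
    """Two GRU cells track translated (Past) and untranslated (Future)
    source content. At each decoding step the attention context grows
    the Past state and shrinks the Future state; both states are then
    available to the attention model and the decoder."""
    def __init__(self, d: int = 256):
        super().__init__()
        self.past_rnn = nn.GRUCell(d, d)
        self.future_rnn = nn.GRUCell(d, d)

    def init_states(self, src_summary: torch.Tensor):
        past = torch.zeros_like(src_summary)   # nothing translated yet
        future = src_summary                   # everything still to translate
        return past, future

    def step(self, context: torch.Tensor, past: torch.Tensor, future: torch.Tensor):
        past = self.past_rnn(context, past)        # accumulate covered content
        future = self.future_rnn(context, future)  # discount covered content
        return past, future

tracker = PastFutureTracker()
src_summary = torch.randn(2, 256)              # e.g. mean of encoder states
past, future = tracker.init_states(src_summary)
past, future = tracker.step(torch.randn(2, 256), past, future)
```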


2019 ◽  
Vol 9 (1) ◽  
pp. 268-278 ◽  
Author(s):  
Benyamin Ahmadnia ◽  
Bonnie J. Dorr

Abstract: The quality of Neural Machine Translation (NMT), as a data-driven approach, massively depends on the quantity, quality and relevance of the training dataset. Such approaches have achieved promising results for bilingually high-resource scenarios but are inadequate for low-resource conditions. Generally, NMT systems learn from millions of words of bilingual training data. However, the human labeling process is costly and time-consuming. In this paper, we describe a round-trip training approach to bilingual low-resource NMT that takes advantage of monolingual datasets to address the training data bottleneck, thus augmenting translation quality. We conduct detailed experiments on English-Spanish as a high-resource language pair as well as Persian-Spanish as a low-resource language pair. Experimental results show that this competitive approach outperforms the baseline systems and improves translation quality.
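A high-level sketch of round-trip training on monolingual source data. The model interface (`translate`, `train_step`) is a hypothetical stand-in for a real NMT system, and the update scheme is deliberately simplified:

```python
class TinyModel:
    """Stand-in for an NMT model; a real system would wrap a trained
    network. The translate/train_step interface is an assumption."""
    def translate(self, sentence: str) -> str:
        return sentence[::-1]          # placeholder "translation"
    def train_step(self, src: str, tgt: str) -> None:
        pass                           # update parameters on one pair

def round_trip_training(fwd: TinyModel, bwd: TinyModel,
                        monolingual_src: list[str]) -> None:
    """Translate monolingual sentences forward, then back, and train
    both models on the resulting synthetic pairs so the round trip
    reconstructs the original sentence."""
    for s in monolingual_src:
        t_hat = fwd.translate(s)       # synthetic target for s
        s_hat = bwd.translate(t_hat)   # reconstruction of s
        bwd.train_step(t_hat, s)       # back-model learns from (t_hat, s)
        fwd.train_step(s, t_hat)       # forward model reinforced likewise
        # A real system would weight these updates by round-trip quality,
        # e.g. the similarity between s and s_hat.

round_trip_training(TinyModel(), TinyModel(), ["hello world", "more text"])
```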

