Scaling up a Hybrid MT System: From low to full resources

Linguistica Antverpiensia, New Series – Themes in Translation Studies ◽

10.52034/lanstts.v8i.245 ◽

2021 ◽

Vol 8 ◽

Author(s):

Vincent Vandeghinste

Keyword(s):

Machine Translation ◽

Hybrid Approach ◽

Scaling Up ◽

Rule Based ◽

Academic Literature ◽

Translation Quality ◽

Parallel Data ◽

Hybrid Machine ◽

Hybrid Machine Translation ◽

Target Side

This article describes a hybrid approach to machine translation (MT) that is inspired by the rule-based, statistical, example-based, and other hybrid machine translation approaches currently used or described in academic literature. It describes how the approach was implemented for language pairs using only limited monolingual resources and hardly any parallel resources (the METIS-II system), and how it is currently implemented with rich resources on both the source and target side as well as rich parallel data (the PaCo-MT system). We aim to illustrate that a similar paradigm can be used, irrespectively of the resources available, but of course with an impact on translation quality.

Download Full-text

Rule Based and Expectation Maximization algorithm for Arabic-English Hybrid Machine Translation

IAES International Journal of Artificial Intelligence (IJ-AI) ◽

10.11591/ijai.v5.i2.pp72-79 ◽

2016 ◽

Vol 5 (2) ◽

pp. 72

Author(s):

Arwa Hatem Alqudsi ◽

Nazlia Omar ◽

Rabha W. Ibrahim

Keyword(s):

Em Algorithm ◽

Machine Translation ◽

Expectation Maximization ◽

Statistical Machine Translation ◽

Expectation Maximization Algorithm ◽

Algorithm Performance ◽

Rule Based ◽

Hybrid Machine ◽

Hybrid Machine Translation ◽

Rule Based Approach

<p><strong> </strong>It is practically impossible for pure machine translation approach to process all of translation problems; however, Rule Based Machine Translation and Statistical Machine translation (RBMT and SMT) use different architectures for performing translation task. Lexical analyser and syntactic analyser are solved by Rule Based and some amount of ambiguity is left to be solved by Expectation–Maximization (EM) algorithm, which is an iterative statistic algorithm for finding maximum likelihood. In this paper we have proposed an integrated Hybrid Machine Translation (HMT) system. The goal is to combine the best properties of each approach. Initially, Arabic text is keyed into RBMT; then the output will be edited by EM algorithm to generate the final translation of English text. As we have seen in previous works, the performance and enhancement of EM algorithm, the key of EM algorithm performance is the ability to accurately transform a frequency from one language to another. Results showing that, as proved by BLEU system, the proposed method can substantially outperform standard Rule Based approach and EM algorithm in terms of frequency and accuracy. The results of this study have been showed that the score of HMT system is higher than SMT system in all cases. When combining two approaches, HMT outperformed SMT in Bleu score.</p>

Download Full-text

Hybrid Machine Translation with Multi-Source Encoder-Decoder Long Short-Term Memory in English-Malay Translation

International Journal on Advanced Science Engineering and Information Technology ◽

10.18517/ijaseit.8.4-2.6816 ◽

2018 ◽

Vol 8 (4-2) ◽

pp. 1446

Author(s):

Yin-Lai Yeong ◽

Tien-Ping Tan ◽

Keng Hoon Gan ◽

Siti Khaotijah Mohammad

Keyword(s):

Machine Translation ◽

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory ◽

Hybrid Machine ◽

Hybrid Machine Translation

Download Full-text

SisHiTra : A Hybrid Machine Translation System from Spanish to Catalan

Advances in Natural Language Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-540-30228-5_31 ◽

2004 ◽

pp. 349-359 ◽

Cited By ~ 1

Author(s):

José R. Navarro ◽

Jorge González ◽

David Picó ◽

Francisco Casacuberta ◽

Joan M. de Val ◽

...

Keyword(s):

Machine Translation ◽

Translation System ◽

Machine Translation System ◽

Hybrid Machine ◽

Hybrid Machine Translation

Download Full-text

A Survey on Hybrid Machine Translation

E3S Web of Conferences ◽

10.1051/e3sconf/202018401061 ◽

2020 ◽

Vol 184 ◽

pp. 01061

Author(s):

Anusha Anugu ◽

Gajula Ramesh

Keyword(s):

Machine Translation ◽

Language Processing ◽

Literature Survey ◽

Neural Machine Translation ◽

Translation Tools ◽

Translation Techniques ◽

Hybrid Machine ◽

Hybrid Machine Translation ◽

Translation Systems ◽

Evaluation Techniques

Machine translation has gradually developed in past 1940’s.It has gained more and more attention because of effective and efficient nature. As it makes the translation automatically without the involvement of human efforts. The distinct models of machine translation along with “Neural Machine Translation (NMT)” is summarized in this paper. Researchers have previously done lots of work on Machine Translation techniques and their evaluation techniques. Thus, we want to demonstrate an analysis of the existing techniques for machine translation including Neural Machine translation, their differences and the translation tools associated with them. Now-a-days the combination of two Machine Translation systems has the full advantage of using features from both the systems which attracts in the domain of natural language processing. So, the paper also includes the literature survey of the Hybrid Machine Translation (HMT).

Download Full-text

Hybrid Machine Translation Oriented to Cross-Language Information Retrieval: English-Spanish Error Analysis

Advances in Intelligent Systems and Computing - New Knowledge in Information Systems and Technologies ◽

10.1007/978-3-030-16181-1_18 ◽

2019 ◽

pp. 185-194

Author(s):

Juncal Gutiérrez-Artacho ◽

María-Dolores Olvera-Lobo ◽

Irene Rivera-Trigueros

Keyword(s):

Information Retrieval ◽

Error Analysis ◽

Machine Translation ◽

Cross Language Information Retrieval ◽

Hybrid Machine ◽

Hybrid Machine Translation ◽

Cross Language

Download Full-text

The Use of a Hybrid Machine Translation System in China's Government Portals

2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology ◽

10.1109/wiiat.2008.142 ◽

2008 ◽

Author(s):

Huo Yuehong ◽

Wang Chunyu

Keyword(s):

Machine Translation ◽

Translation System ◽

Machine Translation System ◽

Hybrid Machine ◽

Hybrid Machine Translation

Download Full-text

Topic-Based Dissimilarity and Sensitivity Models for Translation Rule Selection

Journal of Artificial Intelligence Research ◽

10.1613/jair.4265 ◽

2014 ◽

Vol 50 ◽

pp. 1-30 ◽

Cited By ~ 3

Author(s):

M. Zhang ◽

X. Xiao ◽

D. Xiong ◽

Q. Liu

Keyword(s):

Machine Translation ◽

Topic Model ◽

Statistical Machine Translation ◽

Model Space ◽

Target Language ◽

Translation Quality ◽

Rule Selection ◽

Translation Rule ◽

Selection Experiments ◽

Target Side

Translation rule selection is a task of selecting appropriate translation rules for an ambiguous source-language segment. As translation ambiguities are pervasive in statistical machine translation, we introduce two topic-based models for translation rule selection which incorporates global topic information into translation disambiguation. We associate each synchronous translation rule with source- and target-side topic distributions.With these topic distributions, we propose a topic dissimilarity model to select desirable (less dissimilar) rules by imposing penalties for rules with a large value of dissimilarity of their topic distributions to those of given documents. In order to encourage the use of non-topic specific translation rules, we also present a topic sensitivity model to balance translation rule selection between generic rules and topic-specific rules. Furthermore, we project target-side topic distributions onto the source-side topic model space so that we can benefit from topic information of both the source and target language. We integrate the proposed topic dissimilarity and sensitivity model into hierarchical phrase-based machine translation for synchronous translation rule selection. Experiments show that our topic-based translation rule selection model can substantially improve translation quality.

Download Full-text