minimum error rate training
Recently Published Documents

TOTAL DOCUMENTS: 22 (FIVE YEARS: 0)
H-INDEX: 8 (FIVE YEARS: 0)

2016 · Vol 42 (1) · pp. 1-54
Author(s): Graham Neubig, Taro Watanabe

In statistical machine translation (SMT), the optimization of the system parameters to maximize translation accuracy is now a fundamental part of virtually all modern systems. In this article, we survey 12 years of research on optimization for SMT, from the seminal work on discriminative models (Och and Ney 2002) and minimum error rate training (Och 2003), to the most recent advances. Starting with a brief introduction to the fundamentals of SMT systems, we then cover a wide variety of optimization algorithms for use in both batch and online optimization. Specifically, we discuss losses based on direct error minimization, maximum likelihood, maximum margin, risk minimization, ranking, and more, along with the appropriate methods for minimizing these losses. We also cover recent topics, including large-scale optimization, nonlinear models, domain-dependent optimization, and the effect of MT evaluation measures or search on optimization. Finally, we discuss the current state of affairs in MT optimization, and point out some unresolved problems that will likely be the target of further research in optimization for MT.
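As a concrete illustration of the direct-error-minimization losses the survey covers, here is a minimal sketch of a MERT-style line search over n-best lists. It uses a grid approximation rather than Och's exact upper-envelope algorithm, and the data layout (feature tuples paired with per-hypothesis error counts) is an illustrative assumption, not the survey's notation.

```python
def line_search(nbest, weights, direction, gammas):
    """Pick the step size gamma minimizing total error when each
    sentence's n-best list is decoded with weights + gamma * direction.
    nbest: per-sentence lists of (feature_vector, error_count) pairs."""
    best_gamma, best_err = None, float("inf")
    for gamma in gammas:
        w = [wi + gamma * di for wi, di in zip(weights, direction)]
        err = 0
        for hyps in nbest:
            # the model score of each hypothesis is linear in its features
            _, e = max(hyps, key=lambda h: sum(wi * fi
                                               for wi, fi in zip(w, h[0])))
            err += e
        if err < best_err:
            best_gamma, best_err = gamma, err
    return best_gamma, best_err
```

Och's actual algorithm exploits the piecewise-linear structure of the per-sentence score envelopes to find the exact error-minimizing interval; the grid sweep above only approximates that, but shows the shape of the loss being optimized.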


2012 · Vol 98 (1) · pp. 109-119
Author(s): Lane Schwartz

Better Splitting Algorithms for Parallel Corpus Processing

Each iteration of minimum error rate training involves re-translating a development set. Distributing this work across computational nodes can speed up translation time, but in practice some parts may take much longer to complete than others, leading to computational slack time. To address this problem, we develop three novel algorithms for distributing translation tasks in a parallel computing environment, drawing on research in parallel machine scheduling. We present results showing a substantial speedup in overall decoding time.
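A classic greedy heuristic from the parallel machine scheduling literature the abstract draws on is longest-processing-time-first, which assigns each remaining job to the least-loaded worker. This is a hedged sketch of that general idea, not necessarily one of the paper's three algorithms; the cost estimates are assumed given.

```python
import heapq

def lpt_schedule(job_costs, n_workers):
    """Longest-processing-time-first: sort jobs by decreasing estimated
    cost, then always hand the next job to the least-loaded worker.
    Returns (load, worker_id, assigned_jobs) tuples, one per worker."""
    loads = [(0.0, i, []) for i in range(n_workers)]
    heapq.heapify(loads)
    for job, cost in sorted(enumerate(job_costs), key=lambda jc: -jc[1]):
        load, i, jobs = heapq.heappop(loads)  # least-loaded worker
        jobs.append(job)
        heapq.heappush(loads, (load + cost, i, jobs))
    return sorted(loads, key=lambda t: t[1])
```

For development-set translation, the "jobs" would be sentences (or sentence blocks) with costs estimated from, e.g., sentence length; balancing the loads reduces the slack time the abstract describes.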


2011 · Vol 96 (1) · pp. 69-78
Author(s): Eva Hasler, Barry Haddow, Philipp Koehn

Margin Infused Relaxed Algorithm for Moses

We describe an open-source implementation of the Margin Infused Relaxed Algorithm (MIRA) for statistical machine translation (SMT). The implementation is part of the Moses toolkit and can be used as an alternative to standard minimum error rate training (MERT). We describe the implementation and its usage with both core feature sets and large, sparse feature sets, and report experimental results comparing MIRA with MERT in terms of translation quality and stability.
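For readers unfamiliar with MIRA, its core is a passive-aggressive update that moves the weights just enough for an oracle (reference-like) hypothesis to outscore a rival by the size of its loss. A minimal single-constraint sketch of that update rule, illustrative only and not the Moses implementation:

```python
def mira_update(weights, feat_oracle, feat_hyp, loss, C=0.01):
    """One MIRA-style update: increase the score margin between the
    oracle and an incorrect hypothesis up to the loss, with step size
    capped by the aggressiveness parameter C."""
    delta = [fo - fh for fo, fh in zip(feat_oracle, feat_hyp)]
    margin = sum(w * d for w, d in zip(weights, delta))
    norm2 = sum(d * d for d in delta)
    if norm2 == 0:
        return list(weights)  # identical features: nothing to separate
    tau = min(C, max(0.0, (loss - margin) / norm2))
    return [w + tau * d for w, d in zip(weights, delta)]
```

Because each update touches only the features that differ between the two hypotheses, this style of training scales naturally to the large, sparse feature sets the abstract mentions.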


2011 · Vol 96 (1) · pp. 99-108
Author(s): Patrick Simianer, Katharina Wäschle, Stefan Riezler

Multi-Task Minimum Error Rate Training for SMT

We present experiments on multi-task learning for discriminative training in statistical machine translation (SMT), extending standard minimum error rate training (MERT) by techniques that take advantage of the similarity of related tasks. We apply our techniques to German-to-English translation of patents from 8 tasks according to the International Patent Classification (IPC) system. Our experiments show statistically significant gains over task-specific training from techniques that model commonalities through shared parameters. However, more fine-grained combinations of shared and task-specific parameters could not be brought to bear on models with a small number of dense features. The software used in the experiments is released as an open-source tool.
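One common way to model commonalities through shared parameters, as the abstract describes, is the decomposition w_task = w_shared + v_task: a globally shared weight vector plus a per-task offset. A hedged sketch of scoring under this decomposition (names and structure are generic multi-task conventions, not the paper's exact formulation):

```python
def task_score(shared, offsets, task, features):
    """Score a hypothesis's features under w_task = w_shared + v_task,
    the shared-plus-task-specific weight decomposition."""
    w = [s + v for s, v in zip(shared, offsets[task])]
    return sum(wi * fi for wi, fi in zip(w, features))
```

Training then updates the shared component from all tasks (here, IPC sections) and each offset from its own task only, so related tasks regularize one another.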


Author(s): Nicola Bertoldi, Barry Haddow, Jean-Baptiste Fouet
