A Systematic Comparison of Training Criteria for Statistical Machine Translation

Richard Zens

Sasa Hasan

Hermann Ney

Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), ACL, Prague, Czech Republic(2007), pp. 524-532

Download Google Scholar

Abstract

We address the problem of training the free parameters of a statistical machine translation system. We show signiﬁcant improvements over a state-of-the-art minimum error rate training baseline on a large ChineseEnglish translation task. We present novel training criteria based on maximum likelihood estimation and expected loss computation. Additionally, we compare the maximum a-posteriori decision rule and the minimum Bayes risk decision rule. We show that, not only from a theoretical point of view but also in terms of translation quality, the minimum Bayes risk decision rule is preferable.

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

A Systematic Comparison of Training Criteria for Statistical Machine Translation

Abstract

Research Areas

Meet the teams driving innovation

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

A Systematic Comparison of Training Criteria for Statistical Machine Translation

Abstract

Research Areas

Meet the teams driving innovation

AI/ML Foundations  & Capabilities