Markus Freitag

Research Areas

Authored Publications

Google Publications

Other Publications

INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback

Wenda Xu

Danqing Wang

Liangming Pan

Zhenqiao Song

Markus Freitag

William Wang

Lei Li

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Singapore, pp. 5967-5994

The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation

Patrick Fernandes

Dan Deutsch

Mara Finkelstein

Parker Riley

André Martins

Graham Neubig

Ankush Garg

Jon Clark

Markus Freitag

Orhan Firat

Conference on Machine Translation (2023)

WMT23 Metrics shared task Submission: Quality Estimation using Minimum Bayes Risk

Subhajit Naskar

Dan Deutsch

Markus Freitag

Proceedings of the Eighth Conference on Machine Translation, Association for Computational Linguistics, Singapore (2023), pp. 806-811

Prompting PaLM for Translation: Assessing Strategies and Performance

David Vilar Torres

Markus Freitag

Colin Cherry

Jiaming Luo

Viresh Ratnakar

George Foster

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics, Toronto, Canada (2023), 15406–15427

MetricX-23: The Google Submission to the WMT 2023 Metrics Shared Task

Jurik Juraska

Mara Finkelstein

Dan Deutsch

Aditya Siddhant

Mahdi Mirzazadeh

Markus Freitag

Conference on Machine Translation (2023)

Epsilon Sampling Rocks: Investigating Sampling Strategies for Minimum Bayes Risk Decoding for Machine Translation

Markus Freitag

Behrooz Ghorbani

Patrick Fernandes

Findings of the Association for Computational Linguistics: EMNLP 2023, Association for Computational Linguistics, Singapore, pp. 9198-9209

Results of WMT23 Metrics Shared Task: Metrics might be Guilty but References are not Innocent

Markus Freitag

Nitika Mathur

Chi-kiu Lo

Eleftherios Avramidis

Ricardo Rei

Brian Thompson

Tom Kocmi

Frédéric Blain

Dan Deutsch

Craig Stewart

Chrysoula Zerva

Sheila Castilho

Alon Lavie

George Foster

Proceedings of the Eighth Conference on Machine Translation, Association for Computational Linguistics, Singapore (2023), pp. 576-626

Ties Matter: Meta-Evaluating Modern Metrics with Pairwise Accuracy and Tie Calibration

Dan Deutsch

George Foster

Markus Freitag

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Singapore, pp. 12914-12929

Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph-Level

Dan Deutsch

Jurik Juraska

Mara Finkelstein

Markus Freitag

Proceedings of the Eighth Conference on Machine Translation, Association for Computational Linguistics, Singapore (2023), pp. 996-1013

There's no Data Like Better Data: Using QE Metrics for MT Data Filtering

Jan-Thorsten Peter

David Vilar Torres

Dan Deutsch

Mara Finkelstein

Jurik Juraska

Markus Freitag

Proceedings of the Eighth Conference on Machine Translation, Association for Computational Linguistics, Singapore (2023), pp. 561-577

Original or Translated? A Causal Analysis of the Impact of Translationese on Machine Translation Performance

Jingwei Ni

Zhijing Jin

Markus Freitag

Mrinmaya Sachan

Bernhard Scholkopf

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics, Seattle, United States, pp. 5303-5320

On Systematic Style Differences between Unsupervised and Supervised MT and an Application for High-Resource Machine Translation

Kelly Venning Marchisio

Markus Freitag

David Grangier

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 2214-2225

Findings of the WMT 2022 Shared Task on Automatic Post-Editing

Pushpak Bhattacharyya

Rajen Chatterjee

Markus Freitag

Diptesh Kanojia

Matteo Negri

Marco Turchi

Proceedings of the Seventh Conference on Machine Translation, Association for Computational Linguistics, Abu Dhabi (2022), pp. 109-117

A Natural Diet: Towards Improving Naturalness of Machine Translation Output

Markus Freitag

David Vilar Torres

David Grangier

Colin Cherry

George Foster

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Online (2022)

High Quality Rather than High Model Probability: Minimum Bayes Risk Decoding with Neural Metrics

Markus Freitag

David Grangier

Qijun Tan

Bowen Liang

Transactions of the Association for Computational Linguistics, vol. 10 (2022), pp. 811-825

Toward More Effective Human Evaluation for Machine Translation

Belén Saldías-Fuentes

George Foster

Markus Freitag

Qijun Tan

ACL2022 Workshop on Human Evaluation of NLP Systems

Results of WMT22 Metrics Shared Task: Stop Using BLEU - Neural Metrics Are Better and More Robust

Markus Freitag

Ricardo Rei

Nitika Mathur

Chi-kiu Lo

Craig Stewart

Eleftherios Avramidis

Tom Kocmi

George Foster

Alon Lavie

André Martins

Proceedings of the Seventh Conference on Machine Translation, Association for Computational Linguistics, Abu Dhabi (2022), pp. 46-68

Using Machine Translation to Localize Task Oriented NLG Output

Scott Roy

Cliff Brunk

Kyu-Young Kim

Justin Xu Zhao

Markus Freitag

Mihir Sanjay Kale

Gagan Bansal

Sidharth Mudgal

Chris Varano

CoRR, vol. abs/2107.04512 (2021)

Assessing Reference-Free Peer Evaluation for Machine Translation

Sweta Agrawal

George Foster

Markus Freitag

Colin Cherry

NAACL (2021)

Findings of the 2021 Conference on Machine Translation (WMT21)

Farhad Akhbardeh

Arkady Arkhangorodsky

Magdalena Biesialska

Ondrej Bojar

Rajen Chatterjee

Vishrav Chaudhary

Marta R. Costa-jussà

Cristina España-Bonet

Angela Fan

Christian Federman

Markus Freitag

Yvette Graham

Roman Grundkiewicz

Barry Haddow

Leonie Harter

Kenneth Heafield

Christopher M. Homan

Matthias Huck

Kwabena Amponsah-Kaakyire

Jungo Kasai

Daniel Khashabi

Kevin Knight

Tom Kocmi

Philipp Koehn

Nicholas Lourie

Christof Monz

Makoto Morishita

Masaaki Nagata

Ajay Nagesh

Toshiaki Nakazawa

Matteo Negri

Santanu Pal

Allahsera Tapo

Marco Turchi

Valentin Vydrin

Marcos Zampieri

Proceedings of the Sixth Conference on Machine Translation, Association for Computational Linguistics, Online (2021), pp. 1-88

Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation

Markus Freitag

George Foster

David Grangier

Viresh Ratnakar

Qijun Tan

Wolfgang Macherey

Transactions of the Association for Computational Linguistics, vol. 9, pp. 1460-1474

Results of the WMT21 Metrics Shared Task: Evaluating Metrics with Expert-based Human Evaluations on TED and News Domain

Markus Freitag

Ricardo Rei

Nitika Mathur

Chi-kiu Lo

Craig Stewart

George Foster

Alon Lavie

Ondrej Bojar

Proceedings of the Sixth Conference on Machine Translation, Association for Computational Linguistics, Online (2021), pp. 733-774

Complete Multilingual Neural Machine Translation

Markus Freitag

Orhan Firat

Proceedings of the Fifth Conference on Machine Translation (Volume 1: Research Papers) (2020)

Translationese as a Language in “Multilingual” NMT

Parker Riley

Isaac Caswell

Markus Freitag

David Grangier

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Online (2020), pp. 7737-7746

Human-Paraphrased References Improve Neural Machine Translation

Markus Freitag

George Foster

David Grangier

Colin Cherry

Proceedings of the Fifth Conference on Machine Translation (Volume 1: Research Papers) (2020)

BLEU might be Guilty but References are not Innocent

Markus Freitag

David Grangier

Isaac Caswell

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics, pp. 61-71

KoBE: Knowledge-Based Machine Translation Evaluation

Zorik Gekhman

Roee Aharoni

Genady Beryozkin

Markus Freitag

Wolfgang Macherey

Findings of EMNLP (2020)

APE at Scale and its Implications on MT Evaluation Biases

Markus Freitag

Isaac Caswell

Scott Roy

Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers), Association for Computational Linguistics, Florence, Italy (2019), pp. 34-44

Unsupervised Natural Language Generation with Denoising Autoencoders

Markus Freitag

Scott Roy

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (2018), pp. 3922-3929

No Results Found

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Markus Freitag

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Markus Freitag

Research Areas

Filter by:

Year

Team

Research Area

Join us

AI/ML Foundations  & Capabilities