An Overview on Machine Translation Evaluation
Lifeng Han

TL;DR
This paper reviews the history, classification, and recent advances in machine translation evaluation methods, highlighting human, automatic, and meta-evaluation approaches, including deep learning and pre-trained models.
Contribution
It provides a comprehensive overview of MT evaluation techniques, covering traditional and cutting-edge methods, and discusses recent progress in task-based and model-based evaluation strategies.
Findings
Historical overview of MT evaluation methods
Introduction of deep learning and pre-trained models in evaluation
Emerging trends like task-based and lightweight evaluation models
Abstract
Since the 1950s, machine translation (MT) has become one of the important tasks of AI and development, and has experienced several different periods and stages of development, including rule-based methods, statistical methods, and recently proposed neural network-based learning methods. Accompanying these staged leaps is the evaluation research and development of MT, especially the important role of evaluation methods in statistical translation and neural translation research. The evaluation task of MT is not only to evaluate the quality of machine translation, but also to give timely feedback to machine translation researchers on the problems existing in machine translation itself, how to improve and how to optimise. In some practical application fields, such as in the absence of reference translations, the quality estimation of machine translation plays an important role as an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
