Document-Level Machine Translation with Large Language Models
Longyue Wang, Chenyang Lyu, Tianbo Ji, Zhirui Zhang, Dian Yu, Shuming, Shi, Zhaopeng Tu

TL;DR
This paper evaluates large language models like GPT-3.5 and GPT-4 for document-level machine translation, showing they outperform commercial systems and possess strong discourse modeling abilities, highlighting their potential as a new translation paradigm.
Contribution
The study provides an in-depth evaluation of LLMs for document-level MT, comparing their performance with existing systems and analyzing their discourse modeling capabilities.
Findings
GPT-3.5 and GPT-4 outperform commercial MT systems in human evaluations.
GPT-4 has stronger linguistic knowledge probing abilities than GPT-3.5.
LLMs demonstrate significant potential for discourse-aware document translation.
Abstract
Large language models (LLMs) such as ChatGPT can produce coherent, cohesive, relevant, and fluent answers for various natural language processing (NLP) tasks. Taking document-level machine translation (MT) as a testbed, this paper provides an in-depth evaluation of LLMs' ability on discourse modeling. The study focuses on three aspects: 1) Effects of Context-Aware Prompts, where we investigate the impact of different prompts on document-level translation quality and discourse phenomena; 2) Comparison of Translation Models, where we compare the translation performance of ChatGPT with commercial MT systems and advanced document-level MT methods; 3) Analysis of Discourse Modelling Abilities, where we further probe discourse knowledge encoded in LLMs and shed light on impacts of training techniques on discourse modeling. By evaluating on a number of benchmarks, we surprisingly find that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
Methods15 Ways to Contact How can i speak to someone at Delta Airlines · Multi-Head Attention · Attention Is All You Need · Cosine Annealing · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Cosine Annealing · {Dispute@FaQ-s}How to file a dispute with Expedia? · Attention Dropout · GPT-3
