Towards Effective Ancient Chinese Translation: Dataset, Model, and Evaluation
Geyang Guo, Jiarong Yang, Fengyuan Lu, Jiaxin Qin, Tianyi Tang, Wayne, Xin Zhao

TL;DR
This paper introduces Erya, a new dataset, model, and evaluation benchmark for ancient Chinese translation, demonstrating superior zero-shot and fine-tuned performance over existing models.
Contribution
The paper presents a comprehensive ancient Chinese dataset, a novel training method with two tasks, and a benchmark for evaluating translation quality, advancing the field significantly.
Findings
Erya achieves +12.0 BLEU over GPT-3.5 in zero-shot scenarios.
Erya outperforms ERNIE Bot in human evaluations.
Fine-tuning Erya yields +6.2 BLEU gain.
Abstract
Interpreting ancient Chinese has been the key to comprehending vast Chinese literature, tradition, and civilization. In this paper, we propose Erya for ancient Chinese translation. From a dataset perspective, we collect, clean, and classify ancient Chinese materials from various sources, forming the most extensive ancient Chinese resource to date. From a model perspective, we devise Erya training method oriented towards ancient Chinese. We design two jointly-working tasks: disyllabic aligned substitution (DAS) and dual masked language model (DMLM). From an evaluation perspective, we build a benchmark to judge ancient Chinese translation quality in different scenarios and evaluate the ancient Chinese translation capacities of various existing models. Our model exhibits remarkable zero-shot performance across five domains, with over +12.0 BLEU against GPT-3.5 models and better human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Translation Studies and Practices · Computational and Text Analysis Methods
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Attention Dropout · Residual Connection · Cosine Annealing
