An Energy-based Model for Word-level AutoCompletion in Computer-aided   Translation

Cheng Yang; Guoping Huang; Mo Yu; Zhirui Zhang; Siheng Li; Mingming; Yang; Shuming Shi; Yujiu Yang; Lemao Liu

arXiv:2407.20083·cs.CL·July 30, 2024

An Energy-based Model for Word-level AutoCompletion in Computer-aided Translation

Cheng Yang, Guoping Huang, Mo Yu, Zhirui Zhang, Siheng Li, Mingming, Yang, Shuming Shi, Yujiu Yang, Lemao Liu

PDF

1 Repo

TL;DR

This paper introduces an energy-based model for word-level auto-completion in computer-aided translation, improving performance by better leveraging source sentence information compared to traditional classification models.

Contribution

It proposes a novel energy-based approach for WLAC that enhances context understanding and demonstrates effective strategies to address training and inference challenges.

Findings

01

Achieves about 6.07% improvement over previous state-of-the-art models.

02

Effectively leverages source sentence information for better auto-completion.

03

Strategies significantly contribute to model performance.

Abstract

Word-level AutoCompletion(WLAC) is a rewarding yet challenging task in Computer-aided Translation. Existing work addresses this task through a classification model based on a neural network that maps the hidden vector of the input context into its corresponding label (i.e., the candidate target word is treated as a label). Since the context hidden vector itself does not take the label into account and it is projected to the label through a linear classifier, the model can not sufficiently leverage valuable information from the source sentence as verified in our experiments, which eventually hinders its overall performance. To alleviate this issue, this work proposes an energy-based model for WLAC, which enables the context hidden vector to capture crucial information from the source sentence. Unfortunately, training and inference suffer from efficiency and effectiveness challenges,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yc1999/energy_wlac
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.