TL;DR
This paper introduces an energy-based model for word-level auto-completion in computer-aided translation, improving performance by better leveraging source sentence information compared to traditional classification models.
Contribution
It proposes a novel energy-based approach for WLAC that enhances context understanding and demonstrates effective strategies to address training and inference challenges.
Findings
Achieves about 6.07% improvement over previous state-of-the-art models.
Effectively leverages source sentence information for better auto-completion.
Strategies significantly contribute to model performance.
Abstract
Word-level AutoCompletion(WLAC) is a rewarding yet challenging task in Computer-aided Translation. Existing work addresses this task through a classification model based on a neural network that maps the hidden vector of the input context into its corresponding label (i.e., the candidate target word is treated as a label). Since the context hidden vector itself does not take the label into account and it is projected to the label through a linear classifier, the model can not sufficiently leverage valuable information from the source sentence as verified in our experiments, which eventually hinders its overall performance. To alleviate this issue, this work proposes an energy-based model for WLAC, which enables the context hidden vector to capture crucial information from the source sentence. Unfortunately, training and inference suffer from efficiency and effectiveness challenges,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
