Sparse Regression for Machine Translation

Ergun Bi\c{c}ici

arXiv:2406.19478·cs.CL·July 1, 2024

Sparse Regression for Machine Translation

Ergun Bi\c{c}ici

PDF

Open Access

TL;DR

This paper explores using sparse regression techniques, especially Lasso, for learning feature mappings in machine translation, demonstrating improved translation quality and effective training instance selection.

Contribution

It introduces the dice instance selection method and compares L1 and L2 regularized regression for translation mapping tasks, showing L1's superior performance.

Findings

01

L1 regularized regression outperforms L2 in translation quality.

02

Proper training instance selection improves feature coverage.

03

Replacing phrase tables with learned mappings yields promising results.

Abstract

We use transductive regression techniques to learn mappings between source and target features of given parallel corpora and use these mappings to generate machine translation outputs. We show the effectiveness of $L_{1}$ regularized regression (\textit{lasso}) to learn the mappings between sparsely observed feature sets versus $L_{2}$ regularized regression. Proper selection of training instances plays an important role to learn correct feature mappings within limited computational resources and at expected accuracy levels. We introduce \textit{dice} instance selection method for proper selection of training instances, which plays an important role to learn correct feature mappings for improving the source and target coverage of the training set. We show that $L_{1}$ regularized regression performs better than $L_{2}$ regularized regression both in regression measurements and in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text and Document Classification Technologies · Speech Recognition and Synthesis