Self-Enhancing Multi-filter Sequence-to-Sequence Model

Yunhao Yang; Zhaokun Xue; Andrew Whinston

arXiv:2109.12399·cs.CL·January 12, 2023

Self-Enhancing Multi-filter Sequence-to-Sequence Model

Yunhao Yang, Zhaokun Xue, Andrew Whinston

PDF

Open Access

TL;DR

This paper introduces a multi-filter sequence-to-sequence model with a self-enhancing mechanism that uses reinforcement learning to optimize clustering, improving performance on NLP tasks like semantic parsing and translation.

Contribution

It proposes a novel self-enhancing multi-filter encoder-decoder model that addresses heterogeneity in representations through clustering and reinforcement learning.

Findings

01

Outperforms benchmarks by at least 5% in NLP tasks.

02

Self-enhancing mechanism improves performance by over 10%.

03

Positive correlation between clustering quality and model performance.

Abstract

Representation learning is important for solving sequence-to-sequence problems in natural language processing. Representation learning transforms raw data into vector-form representations while preserving their features. However, data with significantly different features leads to heterogeneity in their representations, which may increase the difficulty of convergence. We design a multi-filter encoder-decoder model to resolve the heterogeneity problem in sequence-to-sequence tasks. The multi-filter model divides the latent space into subspaces using a clustering algorithm and trains a set of decoders (filters) in which each decoder only concentrates on the features from its corresponding subspace. As for the main contribution, we design a self-enhancing mechanism that uses a reinforcement learning algorithm to optimize the clustering algorithm without additional training data. We run…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence