KATE: K-Competitive Autoencoder for Text

Yu Chen; Mohammed J. Zaki

arXiv:1705.02033·stat.ML·June 6, 2017·19 cites

KATE: K-Competitive Autoencoder for Text

Yu Chen, Mohammed J. Zaki

PDF

Open Access 1 Repo

TL;DR

KATE introduces a k-competitive autoencoder that enhances text representation learning by encouraging neuron specialization, outperforming traditional autoencoders and other models in various downstream NLP tasks.

Contribution

The paper proposes the KATE model, a novel k-competitive autoencoder that improves text representation learning through neuron competition, leading to better performance than existing autoencoders and generative models.

Findings

01

KATE learns more meaningful text representations than traditional autoencoders.

02

KATE outperforms deep generative and probabilistic models in downstream tasks.

03

Neuron competition in KATE enhances specialization and model effectiveness.

Abstract

Autoencoders have been successful in learning meaningful representations from image datasets. However, their performance on text datasets has not been widely studied. Traditional autoencoders tend to learn possibly trivial representations of text documents due to their confounding properties such as high-dimensionality, sparsity and power-law word distributions. In this paper, we propose a novel k-competitive autoencoder, called KATE, for text documents. Due to the competition between the neurons in the hidden layer, each neuron becomes specialized in recognizing specific data patterns, and overall the model can learn meaningful representations of textual data. A comprehensive set of experiments show that KATE can learn better representations than traditional autoencoders including denoising, contractive, variational, and k-sparse autoencoders. Our model also outperforms deep generative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hugochan/KATE
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies