Short Text Clustering with Transformers

Leonid Pugachev; Mikhail Burtsev

arXiv:2102.00541·cs.CL·February 2, 2021·1 cites

Short Text Clustering with Transformers

Leonid Pugachev, Mikhail Burtsev

PDF

Open Access

TL;DR

This paper explores the use of Transformer-based sentence embeddings combined with clustering algorithms to improve short text clustering, demonstrating that iterative classification enhances initial results.

Contribution

It introduces a novel approach using Transformer sentence vectors and iterative classification to advance short text clustering performance.

Findings

01

Transformer sentence embeddings improve clustering accuracy

02

Iterative classification further enhances clustering results

03

Pre-trained Transformer models are effective for short text clustering

Abstract

Recent techniques for the task of short text clustering often rely on word embeddings as a transfer learning component. This paper shows that sentence vector representations from Transformers in conjunction with different clustering methods can be successfully applied to address the task. Furthermore, we demonstrate that the algorithm of enhancement of clustering via iterative classification can further improve initial clustering performance with different classifiers, including those based on pre-trained Transformer language models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Attention Is All You Need · Dense Connections · Residual Connection · Adam · Dropout · Label Smoothing · Multi-Head Attention