Learning Semantic Textual Similarity via Topic-informed Discrete Latent Variables
Erxin Yu, Lan Du, Yuan Jin, Zhepei Wei, Yi Chang

TL;DR
This paper introduces a topic-informed discrete latent variable model for semantic textual similarity, leveraging vector quantization and topic modeling to enhance semantic understanding and outperform existing neural baselines.
Contribution
The paper presents a novel discrete latent variable model that incorporates topic information and a semantic-driven attention mechanism to improve sentence similarity measurement.
Findings
Outperforms several strong neural baselines on English datasets
Utilizes topic modeling to enrich semantic representations
Enhances transformer models with semantic-driven attention
Abstract
Recently, discrete latent variable models have received a surge of interest in both Natural Language Processing (NLP) and Computer Vision (CV), attributed to their comparable performance to the continuous counterparts in representation learning, while being more interpretable in their predictions. In this paper, we develop a topic-informed discrete latent variable model for semantic textual similarity, which learns a shared latent space for sentence-pair representation via vector quantization. Compared with previous models limited to local semantic contexts, our model can explore richer semantic information via topic modeling. We further boost the performance of semantic similarity by injecting the quantized representation into a transformer-based language model with a well-designed semantic-driven attention mechanism. We demonstrate, through extensive experiments across various English…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
