Sentence transition matrix: An efficient approach that preserves   sentence semantics

Myeongjun Jang; Pilsung Kang

arXiv:1901.05219·cs.CL·January 17, 2019·1 cites

Sentence transition matrix: An efficient approach that preserves sentence semantics

Myeongjun Jang, Pilsung Kang

PDF

Open Access

TL;DR

This paper introduces a novel, efficient transition matrix approach that refines sentence embeddings to better capture semantic meaning, applicable across various embedding methods and effective even with limited training data.

Contribution

The study proposes a versatile transition matrix technique that enhances sentence embeddings' semantic accuracy without extensive training data or model-specific adjustments.

Findings

01

Improves semantic textual similarity performance across models

02

Requires fewer labeled training examples

03

Applicable to any sentence embedding method

Abstract

Sentence embedding is a significant research topic in the field of natural language processing (NLP). Generating sentence embedding vectors reflecting the intrinsic meaning of a sentence is a key factor to achieve an enhanced performance in various NLP tasks such as sentence classification and document summarization. Therefore, various sentence embedding models based on supervised and unsupervised learning have been proposed after the advent of researches regarding the distributed representation of words. They were evaluated through semantic textual similarity (STS) tasks, which measure the degree of semantic preservation of a sentence and neural network-based supervised embedding models generally yielded state-of-the-art performance. However, these models have a limitation in that they have multiple parameters to update, thereby requiring a tremendous amount of labeled training data.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies