Sentence transition matrix: An efficient approach that preserves sentence semantics
Myeongjun Jang, Pilsung Kang

TL;DR
This paper introduces a novel, efficient transition matrix approach that refines sentence embeddings to better capture semantic meaning, applicable across various embedding methods and effective even with limited training data.
Contribution
The study proposes a versatile transition matrix technique that enhances sentence embeddings' semantic accuracy without extensive training data or model-specific adjustments.
Findings
Improves semantic textual similarity performance across models
Requires fewer labeled training examples
Applicable to any sentence embedding method
Abstract
Sentence embedding is a significant research topic in the field of natural language processing (NLP). Generating sentence embedding vectors reflecting the intrinsic meaning of a sentence is a key factor to achieve an enhanced performance in various NLP tasks such as sentence classification and document summarization. Therefore, various sentence embedding models based on supervised and unsupervised learning have been proposed after the advent of researches regarding the distributed representation of words. They were evaluated through semantic textual similarity (STS) tasks, which measure the degree of semantic preservation of a sentence and neural network-based supervised embedding models generally yielded state-of-the-art performance. However, these models have a limitation in that they have multiple parameters to update, thereby requiring a tremendous amount of labeled training data.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
