Learning Semantic Textual Similarity from Conversations

Yinfei Yang; Steve Yuan; Daniel Cer; Sheng-yi Kong; Noah Constant,; Petr Pilar; Heming Ge; Yun-Hsuan Sung; Brian Strope; Ray Kurzweil

arXiv:1804.07754·cs.CL·April 23, 2018·31 cites

Learning Semantic Textual Similarity from Conversations

Yinfei Yang, Steve Yuan, Daniel Cer, Sheng-yi Kong, Noah Constant,, Petr Pilar, Heming Ge, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new unsupervised method for learning sentence embeddings from conversational data, improving semantic similarity tasks and outperforming many existing neural models.

Contribution

The paper proposes a novel conversational data-based approach for semantic similarity learning, combining multitask training to enhance performance on benchmark datasets.

Findings

01

Achieves top performance on the STS benchmark

02

Performs well on SemEval 2017 CQA question similarity

03

Outperforms many neural models in semantic similarity tasks

Abstract

We present a novel approach to learn representations for sentence-level semantic similarity using conversational data. Our method trains an unsupervised model to predict conversational input-response pairs. The resulting sentence embeddings perform well on the semantic textual similarity (STS) benchmark and SemEval 2017's Community Question Answering (CQA) question similarity subtask. Performance is further improved by introducing multitask training combining the conversational input-response prediction task and a natural language inference task. Extensive experiments show the proposed model achieves the best performance among all neural models on the STS benchmark and is competitive with the state-of-the-art feature engineered and mixed systems in both tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nickyeolk/info_retrieve
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems