Character-based Neural Networks for Sentence Pair Modeling

Wuwei Lan; Wei Xu

arXiv:1805.08297·cs.CL·May 23, 2018·1 cites

Character-based Neural Networks for Sentence Pair Modeling

Wuwei Lan, Wei Xu

PDF

Open Access 1 Repo

TL;DR

This paper investigates the effectiveness of character-based and subword-level representations in sentence pair modeling tasks, demonstrating that they can achieve state-of-the-art results without relying on pretrained word embeddings.

Contribution

It systematically studies subword-level representations in sentence pair tasks, showing they can outperform or match models using pretrained embeddings.

Findings

01

Subword models achieve state-of-the-art results on social media datasets.

02

Subword models perform competitively on news datasets.

03

Pretrained embeddings are not strictly necessary for high performance.

Abstract

Sentence pair modeling is critical for many NLP tasks, such as paraphrase identification, semantic textual similarity, and natural language inference. Most state-of-the-art neural models for these tasks rely on pretrained word embedding and compose sentence-level semantics in varied ways; however, few works have attempted to verify whether we really need pretrained embeddings in these tasks. In this paper, we study how effective subword-level (character and character n-gram) representations are in sentence pair modeling. Though it is well-known that subword models are effective in tasks with single sentence input, including language modeling and machine translation, they have not been systematically studied in sentence pair modeling tasks where the semantic and string similarities between texts matter. Our experiments show that subword models without any pretrained word embedding can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lanwuwei/SPM_toolkit
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification