Injecting Relational Structural Representation in Neural Networks for   Question Similarity

Antonio Uva; Daniele Bonadiman; Alessandro Moschitti

arXiv:1806.08009·cs.CL·June 22, 2018

Injecting Relational Structural Representation in Neural Networks for Question Similarity

Antonio Uva, Daniele Bonadiman, Alessandro Moschitti

PDF

1 Repo

TL;DR

This paper introduces a method to incorporate syntactic structural information into neural networks for question similarity by leveraging Tree Kernels and large-scale pre-training, improving accuracy on benchmark datasets.

Contribution

It proposes a novel approach combining Tree Kernel-based SVMs and neural network pre-training to better utilize syntactic structures in question similarity tasks.

Findings

01

Improved accuracy on Quora and SemEval datasets.

02

Effective use of Tree Kernels for structural representation.

03

Enhanced neural network performance after fine-tuning.

Abstract

Effectively using full syntactic parsing information in Neural Networks (NNs) to solve relational tasks, e.g., question similarity, is still an open problem. In this paper, we propose to inject structural representations in NNs by (i) learning an SVM model using Tree Kernels (TKs) on relatively few pairs of questions (few thousands) as gold standard (GS) training data is typically scarce, (ii) predicting labels on a very large corpus of question pairs, and (iii) pre-training NNs on such large corpus. The results on Quora and SemEval question similarity datasets show that NNs trained with our approach can learn more accurate models, especially after fine tuning on GS.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aseveryn/deep-qa
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSupport Vector Machine