TL;DR
This paper introduces HyperQA, a simple, efficient neural network for question-answer retrieval that models question-answer relationships in hyperbolic space, achieving competitive performance without complex mechanisms.
Contribution
The paper presents HyperQA, a novel hyperbolic embedding-based model that is parameter-efficient and outperforms complex models on QA benchmarks.
Findings
HyperQA outperforms parameter-intensive models on multiple benchmarks.
The hyperbolic space modeling enables automatic hierarchy discovery.
The model requires no feature engineering or attention mechanisms.
Abstract
The dominant neural architectures in question answer retrieval are based on recurrent or convolutional encoders configured with complex word matching layers. Given that recent architectural innovations are mostly new word interaction layers or attention-based matching mechanisms, it seems to be a well-established fact that these components are mandatory for good performance. Unfortunately, the memory and computation cost incurred by these complex mechanisms are undesirable for practical applications. As such, this paper tackles the question of whether it is possible to achieve competitive performance with simple neural architectures. We propose a simple but novel deep learning architecture for fast and efficient question-answer ranking and retrieval. More specifically, our proposed model, \textsc{HyperQA}, is a parameter efficient neural network that outperforms other parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
