Using Holographically Compressed Embeddings in Question Answering
Salvador E. Barbosa

TL;DR
This paper introduces a holographic compression method for embeddings that efficiently encodes tokens along with their part-of-speech and entity types, improving question answering performance while maintaining semantic relationships.
Contribution
It presents a novel holographic compression technique for embeddings that incorporates syntactic and semantic attributes in a compact form, enhancing neural question answering models.
Findings
Semantic relationships are preserved after compression.
The method yields strong question answering performance.
Embedding size is reduced without losing important information.
Abstract
Word vector representations are central to deep learning natural language processing models. Many forms of these vectors, known as embeddings, exist, including word2vec and GloVe. Embeddings are trained on large corpora and learn the word's usage in context, capturing the semantic relationship between words. However, the semantics from such training are at the level of distinct words (known as word types), and can be ambiguous when, for example, a word type can be either a noun or a verb. In question answering, parts-of-speech and named entity types are important, but encoding these attributes in neural models expands the size of the input. This research employs holographic compression of pre-trained embeddings, to represent a token, its part-of-speech, and named entity type, in the same dimension as representing only the token. The implementation, in a modified question answering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGloVe Embeddings
