t-Exponential Memory Networks for Question-Answering Machines
Kyriakos Tolias, Sotirios Chatzis

TL;DR
This paper introduces t-Exponential Memory Networks that incorporate Bayesian variational inference with t-exponential priors to better model complex temporal dependencies and outliers in language modeling tasks, outperforming current methods.
Contribution
It proposes a novel Bayesian approach using t-exponential priors and a new t-divergence measure for memory networks in language modeling, enhancing robustness to outliers.
Findings
Outperforms state-of-the-art language models on benchmark tasks.
Effectively handles outliers and complex temporal dynamics.
Demonstrates improved predictive uncertainty estimation.
Abstract
Recent advances in deep learning have brought to the fore models that can make multiple computational steps in the service of completing a task; these are capable of describ- ing long-term dependencies in sequential data. Novel recurrent attention models over possibly large external memory modules constitute the core mechanisms that enable these capabilities. Our work addresses learning subtler and more complex underlying temporal dynamics in language modeling tasks that deal with sparse sequential data. To this end, we improve upon these recent advances, by adopting concepts from the field of Bayesian statistics, namely variational inference. Our proposed approach consists in treating the network parameters as latent variables with a prior distribution imposed over them. Our statistical assumptions go beyond the standard practice of postulating Gaussian priors. Indeed, to allow for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
