Transfer Entropy Bottleneck: Learning Sequence to Sequence Information Transfer
Damjan Kalajdzievski, Ximeng Mao, Pascal Fortier-Poisson, Guillaume, Lajoie, Blake Richards

TL;DR
The paper introduces Transfer Entropy Bottleneck (TEB), a novel information bottleneck method that quantifies and leverages directed information transfer between two dependent data streams for improved sequence prediction.
Contribution
It develops a new information bottleneck approach, TEB, to model and quantify transfer of information between dependent streams, enhancing sequence-to-sequence prediction.
Findings
TEB effectively quantifies information transfer between streams.
TEB improves prediction accuracy by leveraging source-target information transfer.
The method provides insights into the directed information flow in dependent data streams.
Abstract
When presented with a data stream of two statistically dependent variables, predicting the future of one of the variables (the target stream) can benefit from information about both its history and the history of the other variable (the source stream). For example, fluctuations in temperature at a weather station can be predicted using both temperatures and barometric readings. However, a challenge when modelling such data is that it is easy for a neural network to rely on the greatest joint correlations within the target stream, which may ignore a crucial but small information transfer from the source to the target stream. As well, there are often situations where the target stream may have previously been modelled independently and it would be useful to use that model to inform a new joint model. Here, we develop an information bottleneck approach for conditional learning on two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning
