Order Matters: Sequence to sequence for sets
Oriol Vinyals, Samy Bengio, Manjunath Kudlur

TL;DR
This paper explores the importance of order in sequence-to-sequence models, proposing extensions to handle set inputs and outputs, with empirical validation on tasks like sorting and probabilistic modeling.
Contribution
It introduces a principled way to incorporate set inputs and outputs into seq2seq models, addressing the challenge of order invariance in such data.
Findings
Order significantly impacts model performance.
Proposed methods improve sorting and probabilistic modeling tasks.
Extensions enhance seq2seq applicability to sets.
Abstract
Sequences have become first class citizens in supervised learning thanks to the resurgence of recurrent neural networks. Many complex tasks that require mapping from or to a sequence of observations can now be formulated with the sequence-to-sequence (seq2seq) framework which employs the chain rule to efficiently represent the joint probability of sequences. In many cases, however, variable sized inputs and/or outputs might not be naturally expressed as sequences. For instance, it is not clear how to input a set of numbers into a model where the task is to sort them; similarly, we do not know how to organize outputs when they correspond to random variables and the task is to model their unknown joint probability. In this paper, we first show using various examples that the order in which we organize input and/or output data matters significantly when learning an underlying model. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
