Pointer Networks
Oriol Vinyals, Meire Fortunato, Navdeep Jaitly

TL;DR
Pointer Networks introduce a neural architecture that uses attention as a pointer to select input elements as outputs, enabling learning solutions to variable-sized combinatorial problems like sorting and TSP.
Contribution
This paper presents Pointer Networks, a novel neural architecture that effectively handles variable-sized output spaces using attention as a pointer, outperforming previous models on geometric problems.
Findings
Pointer Nets can learn approximate solutions to geometric problems.
Models generalize beyond training input sizes.
Outperform sequence-to-sequence models with attention.
Abstract
We introduce a new neural architecture to learn the conditional probability of an output sequence with elements that are discrete tokens corresponding to positions in an input sequence. Such problems cannot be trivially addressed by existent approaches such as sequence-to-sequence and Neural Turing Machines, because the number of target classes in each step of the output depends on the length of the input, which is variable. Problems such as sorting variable sized sequences, and various combinatorial optimization problems belong to this class. Our model solves the problem of variable size output dictionaries using a recently proposed mechanism of neural attention. It differs from the previous attention attempts in that, instead of using attention to blend hidden units of an encoder to a context vector at each decoder step, it uses attention as a pointer to select a member of the input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Natural Language Processing Techniques
MethodsSigmoid Activation · Tanh Activation · [LivE@PeRson]How do I talk to a real person at Expedia? · Softmax · Long Short-Term Memory · Pointer Network
