Generating Descriptions from Structured Data Using a Bifocal Attention   Mechanism and Gated Orthogonalization

Preksha Nema; Shreyas Shetty; Parag Jain; Anirban Laha; Karthik; Sankaranarayanan; Mitesh M. Khapra

arXiv:1804.07789·cs.CL·July 16, 2019

Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization

Preksha Nema, Shreyas Shetty, Parag Jain, Anirban Laha, Karthik, Sankaranarayanan, Mitesh M. Khapra

PDF

2 Repos

TL;DR

This paper introduces a novel neural model with bifocal attention and gated orthogonalization for generating natural language descriptions from structured data, significantly outperforming previous methods.

Contribution

The paper proposes a new model combining bifocal attention and gated orthogonalization to better capture hierarchical information in structured data for description generation.

Findings

01

21% relative improvement over state-of-the-art methods

02

10% relative improvement over basic seq2seq models

03

Effective handling of macro and micro level information

Abstract

In this work, we focus on the task of generating natural language descriptions from a structured table of facts containing fields (such as nationality, occupation, etc) and values (such as Indian, actor, director, etc). One simple choice is to treat the table as a sequence of fields and values and then use a standard seq2seq model for this task. However, such a model is too generic and does not exploit task-specific characteristics. For example, while generating descriptions from a table, a human would attend to information at two levels: (i) the fields (macro level) and (ii) the values within the field (micro level). Further, a human would continue attending to a field for a few timesteps till all the information from that field has been rendered and then never return back to this field (because there is nothing left to say about it). To capture this behavior we use (i) a fused bifocal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence