Knowing When to Stop: Evaluation and Verification of Conformity to   Output-size Specifications

Chenglong Wang; Rudy Bunel; Krishnamurthy Dvijotham; Po-Sen Huang,; Edward Grefenstette; Pushmeet Kohli

arXiv:1904.12004·cs.LG·April 30, 2019·1 cites

Knowing When to Stop: Evaluation and Verification of Conformity to Output-size Specifications

Chenglong Wang, Rudy Bunel, Krishnamurthy Dvijotham, Po-Sen Huang,, Edward Grefenstette, Pushmeet Kohli

PDF

Open Access

TL;DR

This paper introduces methods to evaluate and verify neural models' output length, addressing the risk of undesirable output sizes in applications like translation and captioning.

Contribution

It proposes a differentiable proxy for output-lengthening and a formal verification approach to ensure output size constraints are met.

Findings

01

Output-lengthening can produce outputs 50 times longer than inputs.

02

Verification can prove output length bounds within a given input domain.

03

Methods improve understanding and control of model output sizes.

Abstract

Models such as Sequence-to-Sequence and Image-to-Sequence are widely used in real world applications. While the ability of these neural architectures to produce variable-length outputs makes them extremely effective for problems like Machine Translation and Image Captioning, it also leaves them vulnerable to failures of the form where the model produces outputs of undesirable length. This behavior can have severe consequences such as usage of increased computation and induce faults in downstream modules that expect outputs of a certain length. Motivated by the need to have a better understanding of the failures of these models, this paper proposes and studies the novel output-size modulation problem and makes two key technical contributions. First, to evaluate model robustness, we develop an easy-to-compute differentiable proxy objective that can be used with gradient-based algorithms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques