Excitation Backprop for RNNs

Sarah Adel Bargal; Andrea Zunino; Donghyun Kim; Jianming Zhang,; Vittorio Murino; Stan Sclaroff

arXiv:1711.06778·cs.CV·March 9, 2018

Excitation Backprop for RNNs

Sarah Adel Bargal, Andrea Zunino, Donghyun Kim, Jianming Zhang,, Vittorio Murino, Stan Sclaroff

PDF

1 Repo

TL;DR

This paper introduces a method to visualize and localize evidence in both space and time within videos, helping to understand deep model decisions in video classification and captioning tasks.

Contribution

It proposes a novel formulation for simultaneous spatiotemporal grounding using top-down saliency in a single pass, without additional training.

Findings

01

Effective visualization of spatiotemporal cues in videos

02

Ability to localize actions and phrases without explicit training

03

Enhanced interpretability of deep video models

Abstract

Deep models are state-of-the-art for many vision tasks including video action recognition and video captioning. Models are trained to caption or classify activity in videos, but little is known about the evidence used to make such decisions. Grounding decisions made by deep networks has been studied in spatial visual content, giving more insight into model predictions for images. However, such studies are relatively lacking for models of spatiotemporal visual content - videos. In this work, we devise a formulation that simultaneously grounds evidence in space and time, in a single pass, using top-down saliency. We visualize the spatiotemporal cues that contribute to a deep model's classification/captioning output using the model's internal representation. Based on these spatiotemporal cues, we are able to localize segments within a video that correspond with a specific action, or phrase…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sbargal/Caffe-ExcitationBP-RNNs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.