Action-conditioned Benchmarking of Robotic Video Prediction Models: a Comparative Study

Manuel Serra Nunes; Atabak Dehban; Plinio Moreno; José Santos-Victor

arXiv:1910.02564·cs.CV·October 8, 2019

TL;DR

This paper introduces a benchmarking method for robotic video prediction models that measures how well the robot's actions can be inferred from the predicted frames, and shows that this measure can disagree with traditional perceptual metrics.

Contribution

The study proposes an action inference-based metric for evaluating video prediction models, providing a more task-relevant assessment for robotic planning applications.
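
As a rough illustration of what an action inference-based score could look like, the sketch below compares the actions inferred from predicted frames against the actions the robot actually executed. It is a minimal sketch under stated assumptions, not the authors' implementation: the choice of the coefficient of determination (R^2) as the score, the function name `r2_per_dimension`, and the list-of-lists action layout are all hypothetical, and the model that infers actions from frames is assumed to exist elsewhere.

```python
from typing import List, Sequence


def r2_per_dimension(
    true_actions: Sequence[Sequence[float]],
    inferred_actions: Sequence[Sequence[float]],
) -> List[float]:
    """Score inferred actions against executed actions, one R^2 per action dimension.

    Assumption: each row is one time step and each column one action dimension.
    The inferred actions would come from a separate model applied to the
    predicted frames; that model is not part of this sketch.
    """
    n_dims = len(true_actions[0])
    scores = []
    for d in range(n_dims):
        y = [row[d] for row in true_actions]
        y_hat = [row[d] for row in inferred_actions]
        mean_y = sum(y) / len(y)
        # Residual error of the inferred actions.
        ss_res = sum((t - p) ** 2 for t, p in zip(y, y_hat))
        # Spread of the executed actions around their mean.
        ss_tot = sum((t - mean_y) ** 2 for t in y)
        # R^2 is undefined when the executed actions never vary.
        scores.append(1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan"))
    return scores


executed = [[0.0, 1.0], [1.0, 3.0], [2.0, 5.0]]
perfect = r2_per_dimension(executed, executed)
mean_guess = r2_per_dimension(executed, [[1.0, 3.0]] * 3)
```

Under this hypothetical score, a prediction model whose frames let the actions be recovered exactly scores 1 in every dimension, while frames that carry no more action information than the average action score 0.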

Findings

1. High perceptual scores do not guarantee accurate action inference.
2. Many models perform poorly in action inference despite good perceptual quality.
3. The new metric better predicts a model's usefulness in robot planning.

Abstract

A defining characteristic of intelligent systems is the ability to make action decisions based on the anticipated outcomes. Video prediction systems have been demonstrated as a solution for predicting how the future will unfold visually, and thus, many models have been proposed that are capable of predicting future frames based on a history of observed frames (and sometimes robot actions). However, a comprehensive method for determining the fitness of different video prediction models at guiding the selection of actions is yet to be developed. Current metrics assess video prediction models based on human perception of frame quality. In contrast, we argue that if these systems are to be used to guide action, necessarily, the actions the robot performs should be encoded in the predicted frames. In this paper, we are proposing a new metric to compare different video prediction models based…

Code & Models

Repositories

m-serra/action-inference-for-video-prediction-benchmarking (tf, Official)
