Towards Holistic Scene Understanding: Feedback Enabled Cascaded   Classification Models

Congcong Li; Adarsh Kowdle; Ashutosh Saxena; Tsuhan Chen

arXiv:1110.5102·cs.CV·October 25, 2011·45 cites

Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models

Congcong Li, Adarsh Kowdle, Ashutosh Saxena, Tsuhan Chen

PDF

Open Access

TL;DR

This paper introduces Feedback Enabled Cascaded Classification Models (FE-CCM), a method that jointly optimizes multiple scene understanding sub-tasks using a feedback mechanism, improving accuracy without altering individual classifiers.

Contribution

The paper presents a novel feedback-enabled cascade approach that enhances multi-task scene understanding by leveraging correlated outputs without modifying existing classifiers.

Findings

01

Significant performance improvements across all sub-tasks.

02

Effective in robotic applications like object grasping and finding.

03

Operates with only black-box access to classifiers.

Abstract

Scene understanding includes many related sub-tasks, such as scene categorization, depth estimation, object detection, etc. Each of these sub-tasks is often notoriously hard, and state-of-the-art classifiers already exist for many of them. These classifiers operate on the same raw image and provide correlated outputs. It is desirable to have an algorithm that can capture such correlation without requiring any changes to the inner workings of any classifier. We propose Feedback Enabled Cascaded Classification Models (FE-CCM), that jointly optimizes all the sub-tasks, while requiring only a `black-box' interface to the original classifier for each sub-task. We use a two-layer cascade of classifiers, which are repeated instantiations of the original ones, with the output of the first layer fed into the second layer as input. Our training method involves a feedback step that allows later…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning