Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models
Congcong Li, Adarsh Kowdle, Ashutosh Saxena, Tsuhan Chen

TL;DR
This paper introduces Feedback Enabled Cascaded Classification Models (FE-CCM), a method that jointly optimizes multiple scene understanding sub-tasks using a feedback mechanism, improving accuracy without altering individual classifiers.
Contribution
The paper presents a novel feedback-enabled cascade approach that enhances multi-task scene understanding by leveraging correlated outputs without modifying existing classifiers.
Findings
Significant performance improvements across all sub-tasks.
Effective in robotic applications like object grasping and finding.
Operates with only black-box access to classifiers.
Abstract
Scene understanding includes many related sub-tasks, such as scene categorization, depth estimation, object detection, etc. Each of these sub-tasks is often notoriously hard, and state-of-the-art classifiers already exist for many of them. These classifiers operate on the same raw image and provide correlated outputs. It is desirable to have an algorithm that can capture such correlation without requiring any changes to the inner workings of any classifier. We propose Feedback Enabled Cascaded Classification Models (FE-CCM), that jointly optimizes all the sub-tasks, while requiring only a `black-box' interface to the original classifier for each sub-task. We use a two-layer cascade of classifiers, which are repeated instantiations of the original ones, with the output of the first layer fed into the second layer as input. Our training method involves a feedback step that allows later…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
