Scenarios: A New Representation for Complex Scene Understanding
Zachary A. Daniels, Dimitris N. Metaxas

TL;DR
This paper introduces scenarios as a low-dimensional, data-driven scene representation and presents ScenarioNet, a neural network that efficiently performs multiple scene understanding tasks with interpretability.
Contribution
The paper proposes a novel scenario-based scene representation and a neural network architecture, ScenarioNet, that unifies multiple scene understanding tasks efficiently and interpretably.
Findings
ScenarioNet achieves similar performance to CNNs with fewer parameters.
It can perform scene classification, recognition, and retrieval within a single framework.
ScenarioNet is more interpretable and computationally efficient.
Abstract
The ability for computational agents to reason about the high-level content of real world scene images is important for many applications. Existing attempts at addressing the problem of complex scene understanding lack representational power, efficiency, and the ability to create robust meta-knowledge about scenes. In this paper, we introduce scenarios as a new way of representing scenes. The scenario is a simple, low-dimensional, data-driven representation consisting of sets of frequently co-occurring objects and is useful for a wide range of scene understanding tasks. We learn scenarios from data using a novel matrix factorization method which we integrate into a new neural network architecture, the ScenarioNet. Using ScenarioNet, we can recover semantic information about real world scene images at three levels of granularity: 1) scene categories, 2) scenarios, and 3) objects.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
