What is (missing or wrong) in the scene? A Hybrid Deep Boltzmann Machine For Contextualized Scene Modeling

İlker Bozcan; Yağmur Oymak; İdil Zeynep Alemdar; Sinan Kalkan

arXiv:1710.05664 · cs.CV · August 21, 2018

TL;DR

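This paper introduces a hybrid deep Boltzmann Machine that integrates relations between objects into scene modeling for robots. Evaluated on a scene classification dataset, it performs better than Deep Boltzmann Machine and Restricted Boltzmann Machine baselines on several scene reasoning tasks.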

Contribution

The paper presents a hybrid Boltzmann Machine that adds tri-way edges between visible (object) nodes and shares relations across different objects, so that object relations become part of the scene model.

Findings

01

Outperforms baseline models (Deep Boltzmann Machines and Restricted Boltzmann Machines) on several scene reasoning tasks

02

Effectively models object relations for scene understanding

03

Improves reasoning about missing or incorrect scene elements

Abstract

Scene models allow robots to reason about what is in the scene, what else should be in it, and what should not be in it. In this paper, we propose a hybrid Boltzmann Machine (BM) for scene modeling in which relations between objects are integrated. To do so, we extend the BM to include tri-way edges between visible (object) nodes and make the network share the relations across different objects. We evaluate our method against several baseline models (Deep Boltzmann Machines and Restricted Boltzmann Machines) on a scene classification dataset, and show that it performs better in several scene reasoning tasks.
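
The abstract does not give the energy function, so the following is only an illustrative sketch of how a tri-way term with shared relation weights could enter a Boltzmann Machine energy. It is not the authors' formulation. The function name `energy`, the relation indicator `r[i][j][k]`, the shared weight vector `u`, and the toy objects and numbers are all assumptions made for this example.

```python
from itertools import permutations


def energy(v, h, r, b, c, W, u):
    """Energy of one joint configuration (lower = more compatible).

    v: binary object (visible) units, length n_obj
    h: binary hidden units, length n_hid
    r: r[i][j][k] = 1 if relation k holds from object i to object j (assumed encoding)
    b, c: visible and hidden biases
    W: visible-to-hidden weights, W[i][j]
    u: one weight per relation type, shared across all object pairs (assumption)
    """
    n_obj, n_hid = len(v), len(h)
    e = 0.0
    # Standard RBM-style terms: biases and pairwise visible-hidden edges.
    e -= sum(b[i] * v[i] for i in range(n_obj))
    e -= sum(c[j] * h[j] for j in range(n_hid))
    e -= sum(v[i] * W[i][j] * h[j] for i in range(n_obj) for j in range(n_hid))
    # Tri-way term: two object units and one relation unit interact,
    # with the same weight u[k] reused for every ordered object pair.
    for i, j in permutations(range(n_obj), 2):
        for k, u_k in enumerate(u):
            e -= u_k * v[i] * v[j] * r[i][j][k]
    return e


# Toy scene (hypothetical): objects 0=table, 1=cup, 2=plate; one relation, 0="on".
v = [1, 1, 0]                      # table and cup present, plate absent
h = [1, 0]
b = [0.1, 0.2, 0.3]
c = [0.0, 0.0]
W = [[0.5, 0.0], [0.5, 0.0], [0.0, 0.5]]
u = [1.0]

no_relation = [[[0] for _ in range(3)] for _ in range(3)]
cup_on_table = [[[0] for _ in range(3)] for _ in range(3)]
cup_on_table[1][0][0] = 1          # "cup on table"

e_without = energy(v, h, no_relation, b, c, W, u)
e_with = energy(v, h, cup_on_table, b, c, W, u)
print(e_without, e_with)
```

With a positive shared weight, the configuration that asserts "cup on table" gets a lower energy than the same scene without the relation. Energy differences of this kind are what a model like this can use to judge which objects or relations are plausible, missing, or out of place.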
