STDG: Semi-Teacher-Student Training Paradigram for Depth-guided   One-stage Scene Graph Generation

Xukun Zhou; Zhenbo Song; Jun He; Hongyan Liu; Zhaoxin Fan

arXiv:2309.08179·cs.CV·September 18, 2023

STDG: Semi-Teacher-Student Training Paradigram for Depth-guided One-stage Scene Graph Generation

Xukun Zhou, Zhenbo Song, Jun He, Hongyan Liu, Zhaoxin Fan

PDF

Open Access

TL;DR

STDG introduces a depth-guided, semi-teacher-student training approach for one-stage scene graph generation, effectively utilizing depth cues to improve environmental understanding without extra inference costs.

Contribution

The paper proposes a novel depth-guided architecture with semi-teacher-student training for scene graph generation, leveraging depth information more effectively than prior methods.

Findings

01

Significant performance improvements over baseline methods.

02

Effective utilization of depth cues enhances scene understanding.

03

No additional inference computational cost.

Abstract

Scene Graph Generation is a critical enabler of environmental comprehension for autonomous robotic systems. Most of existing methods, however, are often thwarted by the intricate dynamics of background complexity, which limits their ability to fully decode the inherent topological information of the environment. Additionally, the wealth of contextual information encapsulated within depth cues is often left untapped, rendering existing approaches less effective. To address these shortcomings, we present STDG, an avant-garde Depth-Guided One-Stage Scene Graph Generation methodology. The innovative architecture of STDG is a triad of custom-built modules: The Depth Guided HHA Representation Generation Module, the Depth Guided Semi-Teaching Network Learning Module, and the Depth Guided Scene Graph Generation Module. This trifecta of modules synergistically harnesses depth information,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques