UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation
Xinyao Liao, Wei Wei, Dangyang Chen, Yuanyuan Fu

TL;DR
UniQ introduces a unified decoder with task-specific queries for scene graph generation, effectively balancing coupled and decoupled features, leading to improved performance over existing methods.
Contribution
The paper proposes UniQ, a novel unified decoder architecture with task-specific queries that enhances relational feature modeling in one-stage scene graph generation.
Findings
Outperforms existing one-stage and two-stage methods on Visual Genome dataset.
Effectively models both coupled and decoupled features within relational triplets.
Reduces computational overhead compared to traditional methods.
Abstract
Scene Graph Generation(SGG) is a scene understanding task that aims at identifying object entities and reasoning their relationships within a given image. In contrast to prevailing two-stage methods based on a large object detector (e.g., Faster R-CNN), one-stage methods integrate a fixed-size set of learnable queries to jointly reason relational triplets <subject, predicate, object>. This paradigm demonstrates robust performance with significantly reduced parameters and computational overhead. However, the challenge in one-stage methods stems from the issue of weak entanglement, wherein entities involved in relationships require both coupled features shared within triplets and decoupled visual features. Previous methods either adopt a single decoder for coupled triplet feature modeling or multiple decoders for separate visual feature extraction but fail to consider both. In this paper,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Web Data Mining and Analysis · Advanced Text Analysis Techniques
MethodsADaptive gradient method with the OPTimal convergence rate · Sparse Evolutionary Training
