SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data
Ziyan Yang, Kushal Kafle, Zhe Lin, Scott Cohen, Zhihong Ding, Vicente, Ordonez

TL;DR
This paper introduces SCoRD, a subject-conditional relation detection method that predicts relations and object locations conditioned on a subject, demonstrating improved performance and generalization on a new challenging benchmark based on the Open Images dataset.
Contribution
The paper presents a novel auto-regressive model for subject-conditional relation detection and introduces the OIv6-SCoRD benchmark with distribution shift, improving upon previous scene graph prediction methods.
Findings
Achieved recall@3 of 83.8% for relation-object predictions, outperforming previous methods.
Leveraged textual captions to improve relation-object and object-box prediction generalization.
Demonstrated better handling of distribution shifts in the new benchmark.
Abstract
We propose Subject-Conditional Relation Detection SCoRD, where conditioned on an input subject, the goal is to predict all its relations to other objects in a scene along with their locations. Based on the Open Images dataset, we propose a challenging OIv6-SCoRD benchmark such that the training and testing splits have a distribution shift in terms of the occurrence statistics of subject, relation, object triplets. To solve this problem, we propose an auto-regressive model that given a subject, it predicts its relations, objects, and object locations by casting this output as a sequence of tokens. First, we show that previous scene-graph prediction methods fail to produce as exhaustive an enumeration of relation-object pairs when conditioned on a subject on this benchmark. Particularly, we obtain a recall@3 of 83.8% for our relation-object predictions compared to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning
Methodsfail
