GREC: Generalized Referring Expression Comprehension

Shuting He; Henghui Ding; Chang Liu; Xudong Jiang

arXiv:2308.16182 · cs.CV · December 27, 2023 · 2 cites

TL;DR

This paper introduces GREC, a benchmark for referring expression comprehension, together with the gRefCOCO dataset. GREC covers expressions that refer to multiple targets or to no target at all, extending classic REC toward more practical applications.

Contribution

It presents the first large-scale GREC dataset, gRefCOCO, enabling the study of expressions referring to multiple or no specific objects, and offers a compatible evaluation framework.

Findings

01. First large-scale GREC dataset, gRefCOCO, created
02. Supports expressions that refer to multiple targets or to no specific target
03. Provides code for GREC methods and evaluation

Abstract

The objective of classic Referring Expression Comprehension (REC) is to produce a bounding box corresponding to the object mentioned in a given textual description. Existing datasets and techniques in classic REC are commonly tailored to expressions that pertain to a single target, meaning each expression is linked to one specific object. Expressions that refer to multiple targets or involve no specific target have not been taken into account. This constraint hinders the practical applicability of REC. This study introduces a new benchmark termed Generalized Referring Expression Comprehension (GREC). This benchmark extends classic REC by permitting expressions to describe any number of target objects. To achieve this goal, we have built the first large-scale GREC dataset named gRefCOCO. This dataset encompasses a range of expressions: those referring to multiple targets,…
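To make the difference from classic REC concrete, here is a minimal sketch of how a per-sample check could look when an expression may have zero, one, or several target boxes. Everything in it is an illustrative assumption rather than the paper's evaluation code: the `(x1, y1, x2, y2)` box format, the 0.5 IoU threshold, the greedy matching rule, and the names `iou` and `sample_is_correct` are all hypothetical.

```python
from typing import List, Tuple

# Assumed box format: (x1, y1, x2, y2) in pixels. Not taken from the paper.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def sample_is_correct(pred: List[Box], gt: List[Box], thr: float = 0.5) -> bool:
    """Return True when predictions and targets match one-to-one.

    Assumed rule: every predicted box must pair with a distinct ground-truth
    box at IoU >= thr, and no ground-truth box may be left over.
    """
    if len(pred) != len(gt):
        # Covers the no-target case too: any predicted box is an error.
        return False
    unmatched = list(gt)
    for p in pred:
        # Greedy matching: pick the best remaining ground-truth box.
        best = max(unmatched, key=lambda g: iou(p, g))
        if iou(p, best) < thr:
            return False
        unmatched.remove(best)
    return True
```

Under these assumptions, a classic REC sample is just the special case where `gt` holds exactly one box, while a no-target expression is scored as correct only when the model returns an empty list. Greedy matching is used for brevity; an optimal assignment could give a different answer on crowded scenes.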

Code & Models

Repositories

henghuiding/grefcoco (PyTorch, official)

Models

linhuixiao/Awesome-Visual-Grounding (model, ♡ 1)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification