Universal Instance Perception as Object Discovery and Retrieval

Bin Yan; Yi Jiang; Jiannan Wu; Dong Wang; Ping Luo; Zehuan Yuan,; Huchuan Lu

arXiv:2303.06674·cs.CV·August 21, 2023·5 cites

Universal Instance Perception as Object Discovery and Retrieval

Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan,, Huchuan Lu

PDF

Open Access 1 Repo

TL;DR

UNINEXT is a unified, flexible model for diverse instance perception tasks that improves data efficiency and performance across multiple benchmarks by reformulating tasks into object discovery and retrieval.

Contribution

The paper introduces UNINEXT, a universal model that unifies various instance perception tasks into a single framework, enabling joint training and efficient multi-task handling.

Findings

01

Outperforms on 20 benchmarks across 10 tasks

02

Unified model reduces redundant computation

03

Effective in low-data scenarios for certain tasks

Abstract

All instance perception tasks aim at finding certain objects specified by some queries such as category names, language expressions, and target annotations, but this complete field has been split into multiple independent subtasks. In this work, we present a universal instance perception model of the next generation, termed UNINEXT. UNINEXT reformulates diverse instance perception tasks into a unified object discovery and retrieval paradigm and can flexibly perceive different types of objects by simply changing the input prompts. This unified formulation brings the following benefits: (1) enormous data from different tasks and label vocabularies can be exploited for jointly training general instance-level representations, which is especially beneficial for tasks lacking in training data. (2) the unified model is parameter-efficient and can save redundant computation when handling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MasterBin-IIAU/UNINEXT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning