Open-World Amodal Appearance Completion

Jiayang Ao; Yanbei Jiang; Qiuhong Ke; Krista A. Ehinger

arXiv:2411.13019·cs.CV·November 21, 2024

Open-World Amodal Appearance Completion

Jiayang Ao, Yanbei Jiang, Qiuhong Ke, Krista A. Ehinger

PDF

Open Access

TL;DR

This paper presents a training-free, open-world amodal appearance completion framework that uses flexible text queries to reconstruct occluded objects in diverse scenes, unifying segmentation, occlusion analysis, and inpainting.

Contribution

It introduces a novel reasoning-based amodal completion method that generalizes to arbitrary objects and queries without training, advancing open-world scene understanding.

Findings

01

Effective generalization to novel objects and occlusions

02

Unifies segmentation, occlusion analysis, and inpainting

03

Establishes a new benchmark for open-world amodal completion

Abstract

Understanding and reconstructing occluded objects is a challenging problem, especially in open-world scenarios where categories and contexts are diverse and unpredictable. Traditional methods, however, are typically restricted to closed sets of object categories, limiting their use in complex, open-world scenes. We introduce Open-World Amodal Appearance Completion, a training-free framework that expands amodal completion capabilities by accepting flexible text queries as input. Our approach generalizes to arbitrary objects specified by both direct terms and abstract queries. We term this capability reasoning amodal completion, where the system reconstructs the full appearance of the queried object based on the provided image and language query. Our framework unifies segmentation, occlusion analysis, and inpainting to handle complex occlusions and generates completed objects as RGBA…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Financial Crisis of the 21st Century

MethodsInpainting