OSCAR-Net: Object-centric Scene Graph Attention for Image Attribution

Eric Nguyen; Tu Bui; Vishy Swaminathan; John Collomosse

arXiv:2108.03541·cs.CV·October 22, 2021

OSCAR-Net: Object-centric Scene Graph Attention for Image Attribution

Eric Nguyen, Tu Bui, Vishy Swaminathan, John Collomosse

PDF

Open Access

TL;DR

OSCAR-Net introduces an object-centric scene graph attention model for robust image hashing, enabling accurate image attribution by capturing subtle visual details while remaining invariant to benign transformations.

Contribution

The paper presents OSCAR-Net, a novel Transformer-based model that constructs scene graph representations for robust image hashing, improving content fingerprinting for large-scale image attribution.

Findings

01

Achieves state-of-the-art accuracy in image hashing for attribution.

02

Robust to benign transformations like quality changes and resizing.

03

Scales efficiently to millions of images.

Abstract

Images tell powerful stories but cannot always be trusted. Matching images back to trusted sources (attribution) enables users to make a more informed judgment of the images they encounter online. We propose a robust image hashing algorithm to perform such matching. Our hash is sensitive to manipulation of subtle, salient visual details that can substantially change the story told by an image. Yet the hash is invariant to benign transformations (changes in quality, codecs, sizes, shapes, etc.) experienced by images during online redistribution. Our key contribution is OSCAR-Net (Object-centric Scene Graph Attention for Image Attribution Network); a robust image hashing model inspired by recent successes of Transformers in the visual domain. OSCAR-Net constructs a scene graph representation that attends to fine-grained changes of every object's visual appearance and their spatial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Multimodal Machine Learning Applications

MethodsContrastive Learning