Learning Visual Context by Comparison

Minchul Kim; Jongchan Park; Seil Na; Chang Min Park; Donggeun Yoo

arXiv:2007.07506·cs.CV·July 16, 2020

Learning Visual Context by Comparison

Minchul Kim, Jongchan Park, Seil Na, Chang Min Park, Donggeun Yoo

PDF

Open Access 2 Repos

TL;DR

This paper introduces the Attend-and-Compare Module (ACM), a novel component that explicitly models differences between related regions in images, improving performance in medical and object detection tasks.

Contribution

The paper proposes ACM, a plug-in module for deep learning models that enhances comparison between regions, addressing a key missing characteristic in current methods.

Findings

01

Consistent performance improvements across chest X-ray recognition tasks.

02

Enhanced object detection and segmentation results on COCO dataset.

03

Demonstrated versatility of ACM in different vision tasks.

Abstract

Finding diseases from an X-ray image is an important yet highly challenging task. Current methods for solving this task exploit various characteristics of the chest X-ray image, but one of the most important characteristics is still missing: the necessity of comparison between related regions in an image. In this paper, we present Attend-and-Compare Module (ACM) for capturing the difference between an object of interest and its corresponding context. We show that explicit difference modeling can be very helpful in tasks that require direct comparison between locations from afar. This module can be plugged into existing deep learning models. For evaluation, we apply our module to three chest X-ray recognition tasks and COCO object detection & segmentation tasks and observe consistent improvements across tasks. The code is available at https://github.com/mk-minchul/attend-and-compare.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 diagnosis using AI · Multimodal Machine Learning Applications · AI in cancer detection