# What Object Should I Use? - Task Driven Object Detection

**Authors:** Johann Sawatzky, Yaser Souri, Christian Grund, Juergen Gall

arXiv: 1904.03000 · 2019-04-08

## TL;DR

This paper introduces the COCO-Tasks dataset and a Gated Graph Neural Network approach to identify the most suitable objects for specific tasks in images, addressing a gap in current object detection benchmarks.

## Contribution

The paper presents a new dataset for task-driven object detection and a novel Gated Graph Neural Network method to select appropriate objects based on task context.

## Key findings

- The approach outperforms classification and ranking methods on the COCO-Tasks dataset.
- The dataset contains about 40,000 images with annotations for 14 tasks.
- The method effectively exploits object appearance and scene context for task relevance.

## Abstract

When humans have to solve everyday tasks, they simply pick the objects that are most suitable. While the question which object should one use for a specific task sounds trivial for humans, it is very difficult to answer for robots or other autonomous systems. This issue, however, is not addressed by current benchmarks for object detection that focus on detecting object categories. We therefore introduce the COCO-Tasks dataset which comprises about 40,000 images where the most suitable objects for 14 tasks have been annotated. We furthermore propose an approach that detects the most suitable objects for a given task. The approach builds on a Gated Graph Neural Network to exploit the appearance of each object as well as the global context of all present objects in the scene. In our experiments, we show that the proposed approach outperforms other approaches that are evaluated on the dataset like classification or ranking approaches.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.03000/full.md

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/1904.03000/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/1904.03000/full.md

---
Source: https://tomesphere.com/paper/1904.03000