Explaining YOLO: Leveraging Grad-CAM to Explain Object Detections

Armin Kirchknopf; Djordje Slijepcevic; Ilkay Wunderlich; Michael; Breiter; Johannes Traxler; Matthias Zeppelzauer

arXiv:2211.12108·cs.CV·November 23, 2022

Explaining YOLO: Leveraging Grad-CAM to Explain Object Detections

Armin Kirchknopf, Djordje Slijepcevic, Ilkay Wunderlich, Michael, Breiter, Johannes Traxler, Matthias Zeppelzauer

PDF

TL;DR

This paper explores how to enhance the explainability of the YOLO object detector by integrating Grad-CAM, analyzing attribution-based explanations, and emphasizing the importance of normalization for interpretation.

Contribution

It introduces a method to incorporate Grad-CAM into YOLO for better explanations of detections and highlights the impact of normalization on interpretability.

Findings

01

Normalization significantly affects explanation clarity

02

Grad-CAM can be integrated into YOLO for attribution analysis

03

Attribution explanations help understand detection decisions

Abstract

We investigate the problem of explainability for visual object detectors. Specifically, we demonstrate on the example of the YOLO object detector how to integrate Grad-CAM into the model architecture and analyze the results. We show how to compute attribution-based explanations for individual detections and find that the normalization of the results has a great impact on their interpretation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.