ReFrame: Rectification Framework for Image Explaining Architectures
Debjyoti Das Adhikary, Aritra Hazra, Partha Pratim Chakrabarti

TL;DR
ReFrame is a novel rectification framework that enhances image explanation models by reducing hallucinated objects and increasing completeness across various architectures, significantly improving explanation accuracy.
Contribution
The paper introduces an interpretable, plug-in rectification framework that improves the consistency and completeness of image explanations in multiple AI models.
Findings
Significant improvement in explanation completeness (up to 81.81%)
Reduction in inconsistency of explanations (up to 37.10%)
Outperforms current state-of-the-art methods across multiple architectures
Abstract
Image explanation has been one of the key research interests in the Deep Learning field. Throughout the years, several approaches have been adopted to explain an input image fed by the user. From detecting an object in a given image to explaining it in human understandable sentence, to having a conversation describing the image, this problem has seen an immense change throughout the years, However, the existing works have been often found to (a) hallucinate objects that do not exist in the image and/or (b) lack identifying the complete set of objects present in the image. In this paper, we propose a novel approach to mitigate these drawbacks of inconsistency and incompleteness of the objects recognized during the image explanation. To enable this, we propose an interpretable framework that can be plugged atop diverse image explaining frameworks including Image Captioning, Visual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
