Abduction of Domain Relationships from Data for VQA

Al Mehdi Saadat Chowdhury; Paulo Shakarian; Gerardo I. Simari

arXiv:2502.09219·cs.LO·February 14, 2025·ICLP

Abduction of Domain Relationships from Data for VQA

Al Mehdi Saadat Chowdhury, Paulo Shakarian, Gerardo I. Simari

PDF

TL;DR

This paper introduces a method to infer domain relationships from data to improve visual question answering (VQA) systems that operate with logic-based representations, enhancing accuracy with minimal examples.

Contribution

It presents a novel abduction-based approach to derive domain relationships in VQA, complementing existing knowledge augmentation techniques.

Findings

01

Significant accuracy improvement in VQA tasks.

02

Effective with few training examples.

03

Baseline approach demonstrates practical feasibility.

Abstract

In this paper, we study the problem of visual question answering (VQA) where the image and query are represented by ASP programs that lack domain data. We provide an approach that is orthogonal and complementary to existing knowledge augmentation techniques where we abduce domain relationships of image constructs from past examples. After framing the abduction problem, we provide a baseline approach, and an implementation that significantly improves the accuracy of query answering yet requires few examples.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.