LLaVA Needs More Knowledge: Retrieval Augmented Natural Language   Generation with Knowledge Graph for Explaining Thoracic Pathologies

Ameer Hamza; Abdullah; Yong Hyun Ahn; Sungyoung Lee; Seong Tae Kim

arXiv:2410.04749·cs.CV·December 20, 2024

LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies

Ameer Hamza, Abdullah, Yong Hyun Ahn, Sungyoung Lee, Seong Tae Kim

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a knowledge graph-augmented vision-language framework to improve natural language explanations for thoracic pathology predictions, addressing domain knowledge gaps and privacy concerns.

Contribution

It proposes a novel KG-based retrieval method integrated with multiple models, enhancing medical explanation quality while maintaining data privacy.

Findings

01

Achieved state-of-the-art results on MIMIC-NLE dataset.

02

Enhanced explanation accuracy with KG augmentation.

03

Frameworks are modular and adaptable to various architectures.

Abstract

Generating Natural Language Explanations (NLEs) for model predictions on medical images, particularly those depicting thoracic pathologies, remains a critical and challenging task. Existing methodologies often struggle due to general models' insufficient domain-specific medical knowledge and privacy concerns associated with retrieval-based augmentation techniques. To address these issues, we propose a novel Vision-Language framework augmented with a Knowledge Graph (KG)-based datastore, which enhances the model's understanding by incorporating additional domain-specific medical knowledge essential for generating accurate and informative NLEs. Our framework employs a KG-based retrieval mechanism that not only improves the precision of the generated explanations but also preserves data privacy by avoiding direct data retrieval. The KG datastore is designed as a plug-and-play module,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ailab-kyunghee/kg-llava
pytorchOfficial

Videos

LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques