Multiple Meta-model Quantifying for Medical Visual Question Answering

Tuong Do; Binh X. Nguyen; Erman Tjiputra; Minh Tran; Quang D. Tran,; Anh Nguyen

arXiv:2105.08913·cs.CV·June 29, 2021·5 cites

Multiple Meta-model Quantifying for Medical Visual Question Answering

Tuong Do, Binh X. Nguyen, Erman Tjiputra, Minh Tran, Quang D. Tran,, Anh Nguyen

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel multiple meta-model quantifying approach that enhances medical VQA by utilizing dataset meta-data, auto-annotation, and noise handling, achieving superior accuracy without external data.

Contribution

The proposed method effectively leverages dataset meta-data for medical VQA, increasing meta-data through auto-annotation and handling noisy labels, without relying on external data.

Findings

01

Achieves superior accuracy on two public medical VQA datasets.

02

Does not require external data for training meta-models.

03

Effectively handles noisy labels and increases meta-data.

Abstract

Transfer learning is an important step to extract meaningful features and overcome the data limitation in the medical Visual Question Answering (VQA) task. However, most of the existing medical VQA methods rely on external data for transfer learning, while the meta-data within the dataset is not fully utilized. In this paper, we present a new multiple meta-model quantifying method that effectively learns meta-annotation and leverages meaningful features to the medical VQA task. Our proposed method is designed to increase meta-data by auto-annotation, deal with noisy labels, and output meta-models which provide robust features for medical VQA tasks. Extensively experimental results on two public medical VQA datasets show that our approach achieves superior accuracy in comparison with other state-of-the-art methods, while does not require external data to train meta-models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques