Multiple Meta-model Quantifying for Medical Visual Question Answering
Tuong Do, Binh X. Nguyen, Erman Tjiputra, Minh Tran, Quang D. Tran,, Anh Nguyen

TL;DR
This paper introduces a novel multiple meta-model quantifying approach that enhances medical VQA by utilizing dataset meta-data, auto-annotation, and noise handling, achieving superior accuracy without external data.
Contribution
The proposed method effectively leverages dataset meta-data for medical VQA, increasing meta-data through auto-annotation and handling noisy labels, without relying on external data.
Findings
Achieves superior accuracy on two public medical VQA datasets.
Does not require external data for training meta-models.
Effectively handles noisy labels and increases meta-data.
Abstract
Transfer learning is an important step to extract meaningful features and overcome the data limitation in the medical Visual Question Answering (VQA) task. However, most of the existing medical VQA methods rely on external data for transfer learning, while the meta-data within the dataset is not fully utilized. In this paper, we present a new multiple meta-model quantifying method that effectively learns meta-annotation and leverages meaningful features to the medical VQA task. Our proposed method is designed to increase meta-data by auto-annotation, deal with noisy labels, and output meta-models which provide robust features for medical VQA tasks. Extensively experimental results on two public medical VQA datasets show that our approach achieves superior accuracy in comparison with other state-of-the-art methods, while does not require external data to train meta-models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
