ViX-Ray: A Vietnamese Chest X-Ray Dataset for Vision-Language Models
Duy Vu Minh Nguyen, Chinh Thanh Truong, Phuc Hoang Tran, Hung Tuan Le, Nguyen Van-Thanh Dat, Trung Hieu Pham, Kiet Van Nguyen

TL;DR
ViX-Ray introduces a Vietnamese chest X-ray dataset with expert annotations, highlighting linguistic features and evaluating vision-language models' performance, revealing challenges like hallucination and low precision in clinical report generation.
Contribution
The paper presents ViX-Ray, the first Vietnamese chest X-ray dataset with annotations, and benchmarks multiple models, revealing linguistic insights and challenges in Vietnamese medical VLMs.
Findings
Models often hallucinate and lack precision in report generation.
ViX-Ray reveals unique linguistic patterns in Vietnamese radiology reports.
Benchmark results highlight the need for improved Vietnamese medical VLMs.
Abstract
Vietnamese medical research has become an increasingly vital domain, particularly with the rise of intelligent technologies aimed at reducing time and resource burdens in clinical diagnosis. Recent advances in vision-language models (VLMs), such as Gemini and GPT-4V, have sparked a growing interest in applying AI to healthcare. However, most existing VLMs lack exposure to Vietnamese medical data, limiting their ability to generate accurate and contextually appropriate diagnostic outputs for Vietnamese patients. To address this challenge, we introduce ViX-Ray, a novel dataset comprising 5,400 Vietnamese chest X-ray images annotated with expert-written findings and impressions from physicians at a major Vietnamese hospital. We analyze linguistic patterns within the dataset, including the frequency of mentioned body parts and diagnoses, to identify domain-specific linguistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · COVID-19 diagnosis using AI · Machine Learning in Healthcare
