Efficient Medical VIE via Reinforcement Learning
Lijun Liu, Ruiyang Li, Zhaocheng Liu, Chenglin Zhu, Chong Li, Jiehan Cheng, Qiang Ju, Jian Xie

TL;DR
This paper introduces a reinforcement learning-based approach for medical visual information extraction that achieves state-of-the-art results with minimal annotated data, addressing domain-specific challenges and hallucination issues.
Contribution
It proposes a novel RLVR framework for medical VIE that improves accuracy and reasoning with limited annotations and domain-specific schema handling.
Findings
Achieves state-of-the-art F1, precision, and recall on medical VIE tasks.
Reduces hallucinations through balanced reward mechanisms.
Enhances reasoning capabilities with innovative sampling strategies.
Abstract
Visual Information Extraction (VIE) converts unstructured document images into structured formats like JSON, critical for medical applications such as report analysis and online consultations. Traditional methods rely on OCR and language models, while end-to-end multimodal models offer direct JSON generation. However, domain-specific schemas and high annotation costs limit their effectiveness in medical VIE. We base our approach on the Reinforcement Learning with Verifiable Rewards (RLVR) framework to address these challenges using only 100 annotated samples. Our approach ensures dataset diversity, a balanced precision-recall reward mechanism to reduce hallucinations and improve field coverage, and innovative sampling strategies to enhance reasoning capabilities. Fine-tuning Qwen2.5-VL-7B with our RLVR method, we achieve state-of-the-art performance on medical VIE tasks, significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Human-Automation Interaction and Safety · Elevator Systems and Control
MethodsBalanced Selection
