VLM-in-the-Loop: A Plug-In Quality Assurance Module for ECG Digitization Pipelines
Jiachen Li, Shihao Li, Soovadeep Bakshi, Wei Li, Dongmei Chen

TL;DR
This paper presents VLM-in-the-Loop, a plug-in module that enhances ECG digitization quality by integrating visual language models with domain-specific signal analysis, improving accuracy and robustness.
Contribution
It introduces a novel tool-grounding approach that anchors VLM assessments in quantitative signal evidence, applicable without modifying existing digitization systems.
Findings
Tool grounding increased verdict consistency from 71% to 89%.
Doubled fidelity separation ($b4$PCC 0.03 to 0.08).
Improved quality metrics across multiple backends and datasets.
Abstract
ECG digitization could unlock billions of archived clinical records, yet existing methods collapse on real-world images despite strong benchmark numbers. We introduce \textbf{VLM-in-the-Loop}, a plug-in quality assurance module that wraps any digitization backend with closed-loop VLM feedback via a standardized interface, requiring no modification to the underlying digitizer. The core mechanism is \textbf{tool grounding}: anchoring VLM assessment in quantitative evidence from domain-specific signal analysis tools. In a controlled ablation on 200 records with paired ground truth, tool grounding raises verdict consistency from 71\% to 89\% and doubles fidelity separation (PCC 0.03 0.08), with the effect replicating across three VLMs (Claude Opus~4, GPT-4o, Gemini~2.5 Pro), confirming a pattern-level rather than model-specific gain. Deployed across four backends, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
