A Hierarchical Computer Vision Pipeline for Physiological Data Extraction from Bedside Monitors
Vinh Chau, Khoa Le Dinh Van, Hon Huynh Ngoc, Binh Nguyen Thien, Hao Nguyen Thien, Vy Nguyen Quang, Phuc Vo Hong, Yen Lam Minh, Kieu Pham Tieu, Trinh Nguyen Thi Diem, Louise Thwaites, and Hai Ho Bich

TL;DR
This paper introduces a computer vision pipeline that automatically captures and digitizes vital signs from bedside monitor screens, enabling integration into electronic health records in low-resource healthcare settings without hardware upgrades.
Contribution
It presents a hierarchical detection and OCR framework that accurately extracts physiological data from monitor screens under variable conditions, addressing interoperability gaps in low-resource environments.
Findings
Achieved 99.5% monitor detection accuracy
Attained 91.5% ROI localisation accuracy
End-to-end physiological data extraction over 98.9%
Abstract
In many low-resource healthcare settings, bedside monitors remain standalone legacy devices without network connectivity, creating a persistent interoperability gap that prevents seamless integration of physiological data into electronic health record (EHR) systems. To address this challenge without requiring costly hardware replacement, we present a computer vision-based pipeline for the automated capture and digitisation of vital sign data directly from bedside monitor screens. Our method employs a hierarchical detection framework combining YOLOv11 for accurate monitor and region of interest (ROI) localisation with PaddleOCR for robust text extraction. To enhance reliability across variable camera angles and lighting conditions, a geometric rectification module standardizes the screen perspective before character recognition. We evaluated the system on a dataset of 6,498 images…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Healthcare Technology and Patient Monitoring · Multimodal Machine Learning Applications
