Interpreting COVID Lateral Flow Tests' Results with Foundation Models
Stuti Pandey, Josh Myers-Dean, Jarek Reynolds, Danna Gurari

TL;DR
This paper evaluates the capabilities of modern foundation vision language models in interpreting COVID lateral flow tests, introduces a new dataset for this task, and highlights current limitations of these models.
Contribution
The paper creates a new labeled dataset called LFT-Grounding and benchmarks eight VLMs for interpreting LFT images, revealing significant challenges in current models.
Findings
VLMs often fail to identify LFT test types correctly
Models struggle to interpret test results accurately
Recognition drops when tests are partially obscured
Abstract
Lateral flow tests (LFTs) enable rapid, low-cost testing for health conditions including Covid, pregnancy, HIV, and malaria. Automated readers of LFT results can yield many benefits including empowering blind people to independently learn about their health and accelerating data entry for large-scale monitoring (e.g., for pandemics such as Covid) by using only a single photograph per LFT test. Accordingly, we explore the abilities of modern foundation vision language models (VLMs) in interpreting such tests. To enable this analysis, we first create a new labeled dataset with hierarchical segmentations of each LFT test and its nested test result window. We call this dataset LFT-Grounding. Next, we benchmark eight modern VLMs in zero-shot settings for analyzing these images. We demonstrate that current VLMs frequently fail to correctly identify the type of LFT test, interpret the test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLandslides and related hazards · Dam Engineering and Safety
