# Traffic Light Recognition Assistant for Color Vision Deficiency Using YOLO with Multilingual Audio Feedback

**Authors:** Yinyuan Ma, Fathan Arifah, Qonita Afifah, Liko Bun, Kangfu Zhang, Minan Tang

PMC · DOI: 10.3390/s26041093 · Sensors (Basel, Switzerland) · 2026-02-08

## TL;DR

This paper introduces a traffic light recognition system that helps color vision deficient drivers by using audio feedback and position-based detection instead of color.

## Contribution

The study introduces a spatial-position inference framework using YOLOv12 for traffic light recognition tailored for color vision deficiency.

## Key findings

- The system achieved an average detection confidence of 0.73 with a maximum of 0.95.
- Processing latency was 0.214 seconds on a CPU-only setup.
- The system works in various conditions like day/night, weather, and traffic density.

## Abstract

Drivers with color vision deficiency (CVD) often face difficulty recognizing traffic light colors at intersections. Relying solely on their limited color vision can increase safety risks while driving in urban environments. In the era of technological development, Intelligent Transportation Systems (ITSs) increasingly aim to provide support for traffic users, including individuals with CVD. To address user needs from diverse backgrounds, this study aims to develop a traffic light recognition system that provides offline multilingual audio feedback in Indonesian, Mandarin, and English. The proposed approach introduces a spatial-position inference framework by applying a full-frame traffic light annotation strategy to a YOLOv12 model, enabling traffic light state recognition based on the relative position of active lights rather than relying primarily on color information. This work contributes to reducing reliance on color-based perception traffic signal recognition frameworks tailored for assistive ITS applications targeting users with color vision deficiency. System performance is evaluated to verify its feasibility using a comprehensive dataset consisting of various traffic light conditions, including daytime and nighttime scenarios, varying weather, and different traffic densities. Experimental results show an average detection confidence of approximately 0.73, with a maximum confidence of 0.95 and low processing latency of 0.214 s on a CPU-only configuration. The system has the potential to enhance driving safety for individuals with color vision deficiency by offering an additional intelligent assistive tool instead of replacing standard driving regulations.

## Full-text entities

- **Diseases:** cognitive fatigue (MESH:D005221), traffic accidents (MESH:D000081084), Disabilities (MESH:D009069), blind (MESH:D001766), injury to (MESH:D014947), visual impairment (MESH:D014786), CVD (MESH:D003117)
- **Chemicals:** YOLO (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12944492/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944492/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/PMC12944492/full.md

---
Source: https://tomesphere.com/paper/PMC12944492