# Research on Eye-Tracking Control Methods Based on an Improved YOLOv11 Model

**Authors:** Xiangyang Sun, Jiahua Wu, Wenjun Zhang, Xianwei Chen, Haixia Mei

PMC · DOI: 10.3390/s25196236 · Sensors (Basel, Switzerland) · 2025-10-08

## TL;DR

This paper improves eye-tracking accuracy using a modified YOLOv11 model to better detect eyes and control robotic arms for medical rehabilitation.

## Contribution

The study introduces an improved YOLOv11 model with EFFM and ORC modules for enhanced eye detection and movement direction accuracy.

## Key findings

- The improved model increased eye socket and iris recognition accuracy by 1.7% and 9.9%, respectively.
- Eye movement direction discrimination achieved average accuracy rates of 95.3%, 92.8%, and 94.8% for fixation, left, and right directions.
- Eye movement encoding matched control commands with robotic arms at an average of 93.4% to 96.8%.

## Abstract

Eye-tracking technology has gained traction in the field of medical rehabilitation due to its non-invasive and intuitive nature. However, current eye-tracking methods based on object detection technology suffer from insufficient accuracy in detecting the eye socket and iris, as well as inaccuracies in determining eye movement direction. To address this, this study improved the YOLOv11 model using the EFFM and ORC modules, resulting in a 1.7% and 9.9% increase in recognition accuracy for the eye socket and iris, respectively, and a 5.5% and 44% increase in recall rate, respectively. A method combining frame voting mechanisms with eye movement area discrimination was proposed for eye movement direction discrimination, achieving average accuracy rates of 95.3%, 92.8%, and 94.8% for iris fixation, left, and right directions, respectively. The discrimination results of multiple eye movement images were mapped to a binary value, and eye movement encoding was used to obtain control commands that align with the robotic arm. The average matching degree of eye movement encoding ranged from 93.4% to 96.8%. An experimental platform was established, and the average completion rates for three object-grabbing tasks controlled by eye movements were 98%, 78%, and 96%, respectively.

## Full-text entities

- **Genes:** PTPRF (protein tyrosine phosphatase receptor type F) [NCBI Gene 5792] {aka BNAH2, LAR}
- **Diseases:** iris displacement (MESH:D007499), eye fatigue (MESH:D001248), aphasia (MESH:D001037), injury to (MESH:D014947), eye tremors (MESH:D014202)
- **Chemicals:** Concat (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** YOLOv11 — Homo sapiens (Human), Transformed cell line (CVCL_C1JD), EC66 — Mus musculus (Mouse), Hepatocellular carcinoma of the mouse, Cancer cell line (CVCL_5771)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12526887/full.md

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12526887/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC12526887/full.md

---
Source: https://tomesphere.com/paper/PMC12526887