RITnet: Real-time Semantic Segmentation of the Eye for Gaze Tracking

Aayush K.Chaudhary; Rakshit Kothari; Manoj Acharya; Shusil Dangi,; Nitinraj Nair; Reynold Bailey; Christopher Kanan; Gabriel Diaz; Jeff B. Pelz

arXiv:1910.00694·cs.CV·December 22, 2020

RITnet: Real-time Semantic Segmentation of the Eye for Gaze Tracking

Aayush K.Chaudhary, Rakshit Kothari, Manoj Acharya, Shusil Dangi,, Nitinraj Nair, Reynold Bailey, Christopher Kanan, Gabriel Diaz, Jeff B. Pelz

PDF

2 Repos

TL;DR

RITnet is a lightweight deep neural network combining U-Net and DenseNet, achieving high accuracy and real-time performance in eye segmentation for gaze tracking applications.

Contribution

The paper introduces RITnet, a compact and efficient neural network that enables real-time eye segmentation with high accuracy, suitable for gaze tracking.

Findings

01

Achieves 95.3% accuracy on OpenEDS dataset

02

Runs at over 300Hz on a GTX 1080 Ti

03

Model size under 1 MB

Abstract

Accurate eye segmentation can improve eye-gaze estimation and support interactive computing based on visual attention; however, existing eye segmentation methods suffer from issues such as person-dependent accuracy, lack of robustness, and an inability to be run in real-time. Here, we present the RITnet model, which is a deep neural network that combines U-Net and DenseNet. RITnet is under 1 MB and achieves 95.3\% accuracy on the 2019 OpenEDS Semantic Segmentation challenge. Using a GeForce GTX 1080 Ti, RITnet tracks at $>$ 300Hz, enabling real-time gaze tracking applications. Pre-trained models and source code are available https://bitbucket.org/eye-ush/ritnet/.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsU-Net · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Convolution · Average Pooling · Concatenated Skip Connection · Global Average Pooling · Dense Block · Kaiming Initialization · 1x1 Convolution