GAVIN: Gaze-Assisted Voice-Based Implicit Note-taking
Anam Ahmad Khan, Joshua Newn, Ryan Kelly, Namrata Srivastava, and James Bailey, Eduardo Velloso

TL;DR
GAVIN is a gaze-assisted voice note-taking system that uses eye-tracking and machine learning to implicitly anchor voice notes to specific text passages, improving mobile annotation efficiency.
Contribution
The paper introduces GAVIN, a novel system combining gaze tracking and machine learning for implicit text anchoring of voice notes in mobile reading environments.
Findings
Successful prediction of text passages for voice notes using trained classifiers
GAVIN enables seamless, accurate voice note anchoring with minimal user effort
User study confirms system feasibility and effectiveness
Abstract
Annotation is an effective reading strategy people often undertake while interacting with digital text. It involves highlighting pieces of text and making notes about them. Annotating while reading in a desktop environment is considered trivial but, in a mobile setting where people read while hand-holding devices, the task of highlighting and typing notes on a mobile display is challenging. In this paper, we introduce GAVIN, a gaze-assisted voice note-taking application, which enables readers to seamlessly take voice notes on digital documents by implicitly anchoring them to text passages. We first conducted a contextual enquiry focusing on participants' note-taking practices on digital documents. Using these findings, we propose a method which leverages eye-tracking and machine learning techniques to annotate voice notes with reference text passages. To evaluate our approach, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Tactile and Sensory Interactions · Usability and User Interface Design
