Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based Reasoning

Dongyang Guo; Yasmeen Abdrabou; Enkeleda Thaqi; Enkelejda Kasneci

arXiv:2507.18252·cs.HC·July 25, 2025

Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based Reasoning

Dongyang Guo, Yasmeen Abdrabou, Enkeleda Thaqi, Enkelejda Kasneci

PDF

Open Access

TL;DR

This paper introduces a multimodal framework combining eye-tracking data and large language models to analyze cognitive patterns, improving interpretability and accuracy in behavioral analysis tasks.

Contribution

It presents a novel human-AI collaborative pipeline that integrates LLM reasoning with eye-tracking analysis for enhanced cognitive pattern extraction.

Findings

01

Up to 50% accuracy in difficulty prediction tasks

02

Improved consistency and interpretability of behavioral analysis

03

Effective hybrid anomaly detection combining LSTM and LLMs

Abstract

Eye-tracking data reveals valuable insights into users' cognitive states but is difficult to analyze due to its structured, non-linguistic nature. While large language models (LLMs) excel at reasoning over text, they struggle with temporal and numerical data. This paper presents a multimodal human-AI collaborative framework designed to enhance cognitive pattern extraction from eye-tracking signals. The framework includes: (1) a multi-stage pipeline using horizontal and vertical segmentation alongside LLM reasoning to uncover latent gaze patterns; (2) an Expert-Model Co-Scoring Module that integrates expert judgment with LLM output to generate trust scores for behavioral interpretations; and (3) a hybrid anomaly detection module combining LSTM-based temporal modeling with LLM-driven semantic analysis. Our results across several LLMs and prompt strategies show improvements in consistency,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Advanced Text Analysis Techniques