Transformer-Based Modeling of User Interaction Sequences for Dwell Time Prediction in Human-Computer Interfaces
Rui Liu, Runsheng Zhang, Shixiao Wang

TL;DR
This paper introduces a Transformer-based framework for predicting user dwell time in human-computer interfaces, effectively modeling complex interaction behaviors and outperforming existing methods in accuracy and robustness.
Contribution
The study presents a novel Transformer framework that integrates diverse interaction features and captures long-range dependencies for dwell time prediction, demonstrating superior performance over baseline models.
Findings
Achieves the best performance in MSE, RMSE, MAPE, and RMAE metrics.
Effectively models complex user interaction patterns.
Demonstrates robustness and adaptability across hyperparameters and environments.
Abstract
This study investigates the task of dwell time prediction and proposes a Transformer framework based on interaction behavior modeling. The method first represents user interaction sequences on the interface by integrating dwell duration, click frequency, scrolling behavior, and contextual features, which are mapped into a unified latent space through embedding and positional encoding. On this basis, a multi-head self-attention mechanism is employed to capture long-range dependencies, while a feed-forward network performs deep nonlinear transformations to model the dynamic patterns of dwell time. Multiple comparative experiments are conducted with BILSTM, DRFormer, FedFormer, and iTransformer as baselines under the same conditions. The results show that the proposed method achieves the best performance in terms of MSE, RMSE, MAPE, and RMAE, and more accurately captures the complex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersonal Information Management and User Behavior · Gaze Tracking and Assistive Technology · Interactive and Immersive Displays
