TGT: A Temporal Gating Transformer for Smartphone App Usage Prediction
Longlong Li, Cunquan Qu, Guanghui Wang

TL;DR
TGT introduces a novel temporal gating transformer that adaptively models daily usage rhythms, significantly improving smartphone app usage prediction accuracy, especially in cold-start scenarios, and providing interpretable insights into user behavior.
Contribution
The paper presents TGT, a transformer with a temporal gating module that enhances time-awareness and interpretability in app usage prediction, outperforming existing methods.
Findings
TGT achieves superior accuracy on real-world datasets.
It maintains robustness in cold-start scenarios.
Gating vectors reveal human-like daily usage patterns.
Abstract
Accurately predicting smartphone app usage is challenging due to the sparsity and irregularity of user behavior, especially under cold-start and low-activity conditions. Existing approaches mostly rely on static or attention-only architectures, which struggle to model fine-grained temporal dynamics. We propose TGT, a Transformer framework equipped with a temporal gating module that conditions hidden representations on the hour-of-day. Unlike conventional time embeddings, temporal gating adaptively rescales feature dimensions in a time-aware manner, working orthogonally to self-attention and strengthening temporal sensitivity. TGT further incorporates a context-aware encoder that integrates session sequences and user profiles into a unified representation. Experiments on two real-world datasets, Tsinghua App Usage and LSApp, demonstrate that TGT significantly outperforms 15 competitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGreen IT and Sustainability · Multimedia Communication and Technology · Mobile and Web Applications
MethodsAttention Is All You Need · Absolute Position Encodings · Linear Layer · Layer Normalization · Byte Pair Encoding · Dense Connections · Residual Connection · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer
