Expert with Clustering: Hierarchical Online Preference Learning Framework
Tianyue Zhou, Jung-Hoon Cho, Babak Rahimi Ardabili, Hamed Tabkhi, and, Cathy Wu

TL;DR
This paper introduces a hierarchical online preference learning framework called Expert with Clustering (EWC), which accelerates user preference learning in mobility recommendation systems by integrating clustering and expert advice, achieving lower regret.
Contribution
The paper presents the first regret analysis of an integrated expert algorithm with k-Means clustering, demonstrating theoretical and practical improvements in preference learning.
Findings
EWC achieves a regret bound of O(N√T log K) + NT.
EWC reduces regret by 27.57% compared to LinUCB.
The approach effectively captures hierarchical user preferences.
Abstract
Emerging mobility systems are increasingly capable of recommending options to mobility users, to guide them towards personalized yet sustainable system outcomes. Even more so than the typical recommendation system, it is crucial to minimize regret, because 1) the mobility options directly affect the lives of the users, and 2) the system sustainability relies on sufficient user participation. In this study, we consider accelerating user preference learning by exploiting a low-dimensional latent space that captures the mobility preferences of users. We introduce a hierarchical contextual bandit framework named Expert with Clustering (EWC), which integrates clustering techniques and prediction with expert advice. EWC efficiently utilizes hierarchical user information and incorporates a novel Loss-guided Distance metric. This metric is instrumental in generating more representative cluster…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing
MethodsElastic Weight Consolidation
