LOFA: Online Influence Maximization under Full-Bandit Feedback using Lazy Forward Selection
Jinyu Xu, Abhishek K. Umrawal

TL;DR
This paper introduces LOFA, an online influence maximization algorithm that effectively leverages submodularity under full-bandit feedback, demonstrating superior empirical performance on real social network data.
Contribution
The paper proposes LOFA, a novel lazy forward selection algorithm for online influence maximization under full-bandit feedback, achieving lower regret than existing methods.
Findings
LOFA outperforms existing algorithms in cumulative regret.
LOFA achieves higher instantaneous rewards.
Experimental results on real social networks validate its effectiveness.
Abstract
We study the problem of influence maximization (IM) in an online setting, where the goal is to select a subset of nodescalled the seed setat each time step over a fixed time horizon, subject to a cardinality budget constraint, to maximize the expected cumulative influence. We operate under a full-bandit feedback model, where only the influence of the chosen seed set at each time step is observed, with no additional structural information about the network or diffusion process. It is well-established that the influence function is submodular, and existing algorithms exploit this property to achieve low regret. In this work, we leverage this property further and propose the Lazy Online Forward Algorithm (LOFA), which achieves a lower empirical regret. We conduct experiments on a real-world social network to demonstrate that LOFA achieves superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Complex Network Analysis Techniques
