Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning
Xinsong Feng, Zihan Yu, Yanhai Xiong, Haipeng Chen

TL;DR
This paper introduces a hierarchical reinforcement learning framework called WS-option for sequential stochastic combinatorial optimization, effectively learning adaptive budget allocation and node selection, outperforming traditional methods in efficiency and generalizability.
Contribution
The paper proposes a novel two-layer HRL framework for SSCO, capturing interdependencies and improving efficiency, stability, and scalability over existing approaches.
Findings
WS-option outperforms traditional methods in effectiveness.
The model generalizes well to larger graphs.
It reduces computational overhead significantly.
Abstract
Reinforcement learning (RL) has emerged as a promising tool for combinatorial optimization (CO) problems due to its ability to learn fast, effective, and generalizable solutions. Nonetheless, existing works mostly focus on one-shot deterministic CO, while sequential stochastic CO (SSCO) has rarely been studied despite its broad applications such as adaptive influence maximization (IM) and infectious disease intervention. In this paper, we study the SSCO problem where we first decide the budget (e.g., number of seed nodes in adaptive IM) allocation for all time steps, and then select a set of nodes for each time step. The few existing studies on SSCO simplify the problems by assuming a uniformly distributed budget allocation over the time horizon, yielding suboptimal solutions. We propose a generic hierarchical RL (HRL) framework called wake-sleep option (WS-option), a two-layer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetaheuristic Optimization Algorithms Research
MethodsSparse Evolutionary Training · Focus
