Sequential Stochastic Combinatorial Optimization Using Hierarchal   Reinforcement Learning

Xinsong Feng; Zihan Yu; Yanhai Xiong; Haipeng Chen

arXiv:2502.05537·cs.AI·February 11, 2025

Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning

Xinsong Feng, Zihan Yu, Yanhai Xiong, Haipeng Chen

PDF

Open Access

TL;DR

This paper introduces a hierarchical reinforcement learning framework called WS-option for sequential stochastic combinatorial optimization, effectively learning adaptive budget allocation and node selection, outperforming traditional methods in efficiency and generalizability.

Contribution

The paper proposes a novel two-layer HRL framework for SSCO, capturing interdependencies and improving efficiency, stability, and scalability over existing approaches.

Findings

01

WS-option outperforms traditional methods in effectiveness.

02

The model generalizes well to larger graphs.

03

It reduces computational overhead significantly.

Abstract

Reinforcement learning (RL) has emerged as a promising tool for combinatorial optimization (CO) problems due to its ability to learn fast, effective, and generalizable solutions. Nonetheless, existing works mostly focus on one-shot deterministic CO, while sequential stochastic CO (SSCO) has rarely been studied despite its broad applications such as adaptive influence maximization (IM) and infectious disease intervention. In this paper, we study the SSCO problem where we first decide the budget (e.g., number of seed nodes in adaptive IM) allocation for all time steps, and then select a set of nodes for each time step. The few existing studies on SSCO simplify the problems by assuming a uniformly distributed budget allocation over the time horizon, yielding suboptimal solutions. We propose a generic hierarchical RL (HRL) framework called wake-sleep option (WS-option), a two-layer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMetaheuristic Optimization Algorithms Research

MethodsSparse Evolutionary Training · Focus