Pareto-NRPA: A Novel Monte-Carlo Search Algorithm for Multi-Objective Optimization
No\'e Lallouet, Tristan Cazenave, Cyrille Enderli

TL;DR
Pareto-NRPA is a pioneering Monte-Carlo algorithm that extends NRPA to multi-objective optimization, effectively balancing convergence and diversity in discrete search spaces, and outperforming existing methods on complex benchmarks.
Contribution
This work introduces the first multi-objective adaptation of NRPA, combining policy-based search with Pareto front maintenance for improved multi-objective optimization.
Findings
Achieves competitive results on MO-TSPTW and neural architecture search benchmarks.
Outperforms state-of-the-art evolutionary algorithms on constrained problems.
Demonstrates effective exploration and diversity preservation in multi-objective search.
Abstract
We introduce Pareto-NRPA, a new Monte-Carlo algorithm designed for multi-objective optimization problems over discrete search spaces. Extending the Nested Rollout Policy Adaptation (NRPA) algorithm originally formulated for single-objective problems, Pareto-NRPA generalizes the nested search and policy update mechanism to multi-objective optimization. The algorithm uses a set of policies to concurrently explore different regions of the solution space and maintains non-dominated fronts at each level of search. Policy adaptation is performed with respect to the diversity and isolation of sequences within the Pareto front. We benchmark Pareto-NRPA on two classes of problems: a novel bi-objective variant of the Traveling Salesman Problem with Time Windows problem (MO-TSPTW), and a neural architecture search task on well-known benchmarks. Results demonstrate that Pareto-NRPA achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
