Improving performance in multi-objective decision-making in Bottles environments with soft maximin approaches
Benjamin J Smith, Robert Klassert, Roland Pihlakas

TL;DR
This paper introduces SFELLA, a novel soft maximin approach for multi-objective decision-making that learns faster and is more robust than existing methods, improving AI alignment in complex environments.
Contribution
The paper proposes SFELLA, a new loss-averse soft maximin method that outperforms state-of-the-art approaches in learning speed and robustness in multi-objective AI tasks.
Findings
SFELLA learns faster than thresholded methods in most tasks.
SFELLA maintains optimal performance after learning.
SFELLA shows robustness to changes in objective scale.
Abstract
Balancing multiple competing and conflicting objectives is an essential task for any artificial intelligence tasked with satisfying human values or preferences. Conflict arises both from misalignment between individuals with competing values, but also between conflicting value systems held by a single human. Starting with principle of loss-aversion, we designed a set of soft maximin function approaches to multi-objective decision-making. Bench-marking these functions in a set of previously-developed environments, we found that one new approach in particular, 'split-function exp-log loss aversion' (SFELLA), learns faster than the state of the art thresholded alignment objective method (Vamplew et al, 2021) on three of four tasks it was tested on, and achieved the same optimal performance after learning. SFELLA also showed relative robustness improvements against changes in objective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Criteria Decision Making · Cognitive Science and Mapping
