Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons
Hazhar Rahmani, Jie Fu

TL;DR
This paper introduces a method to learn user preferences over temporal sequences by modeling them with Preference Deterministic Finite Automata, enabling the inference of complex temporal goals from pairwise comparisons.
Contribution
It formalizes preference inference over temporal goals using PDFA and provides an algorithm to learn minimal PDFA from characteristic samples, addressing computational challenges.
Findings
Proposes PDFA as a model for preferences over temporal sequences.
Develops an algorithm guaranteeing minimal PDFA learning from characteristic samples.
Demonstrates the approach with a robotic motion planning example.
Abstract
Many preference elicitation algorithms consider preference over propositional logic formulas or items with different attributes. In sequential decision making, a user's preference can be a preorder over possible outcomes, each of which is a temporal sequence of events. This paper considers a class of preference inference problems where the user's unknown preference is represented by a preorder over regular languages (sets of temporal sequences), referred to as temporal goals. Given a finite set of pairwise comparisons between finite words, the objective is to learn both the set of temporal goals and the preorder over these goals. We first show that a preference relation over temporal goals can be modeled by a Preference Deterministic Finite Automaton (PDFA), which is a deterministic finite automaton augmented with a preorder over acceptance conditions. The problem of preference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
