Synthesis of Data Word Transducers

Léo Exibard; Emmanuel Filiot; Pierre-Alain Reynier

arXiv:1905.03538·cs.FL·June 22, 2023


TL;DR

This paper explores the synthesis of data word transducers from specifications over infinite alphabets, revealing decidability results in various settings and extending classical reactive synthesis to data $\omega$-words.

Contribution

It introduces a framework for synthesizing transducers with registers from data $\omega$-word specifications, analyzing decidability across different specification types and register constraints.

Findings

01

Decidability in deterministic data $\omega$-word synthesis

02

Undecidability for nondeterministic specifications with data

03

Decidability can be recovered by restricting input tests in bounded synthesis


Equations

$\phi::=\top\mid\bot\mid r^{=}\mid r^{\neq}\mid\phi\wedge\phi\mid\phi\vee\phi\mid\neg\phi$

$\delta\subseteq\bigcup_{\alpha\in\{\mathbbm{i},\mathbbm{o}\}}(Q_{\alpha}\times\Sigma_{\alpha}\times\textnormal{{Tst}}_{R}\times\textnormal{{Asgn}}_{R}\times Q_{\overline{\alpha}})$

$S_{1}$, $S_{2}$, $T$

$(p,C)\xrightarrow{\sigma,\alpha_{E},\textnormal{{asgn}}}(q,C^{\prime})$

$(-,d_{0})(-,d_{1})\,a_{0}^{0}a_{1}^{0}\dots a_{|M|-1}^{0}\,t_{0}\,a_{0}^{1}a_{1}^{1}\dots a_{|M|-1}^{1}\,t_{1}\dots t_{n-1}\,a_{0}^{n}a_{1}^{n}\dots a_{|M|-1}^{n}$

$S=\{d_{0}\neq d_{1}\text{ and }c_{0}t_{0}c_{1}t_{1}c_{2}t_{2}\dots t_{n-1}c_{n}\text{ is the encoding of a computation of }M\}\cup\dots$

$L=\{(r,d_{1})\dots(r,d_{n})(g,d_{1}^{\prime})\dots(g,d_{m}^{\prime})(\#,d)^{\omega}\mid\forall i\neq j,\ d_{i}\neq d_{j}\wedge\forall 1\leq i\leq n,\ \exists j,\ d_{j}^{\prime}=d_{i}\}$

$\textsc{AllDiff}=\{w=(\sigma_{\mathbbm{i}}^{1},d_{\mathbbm{i}}^{1})(\sigma_{\mathbbm{o}}^{1},d_{\mathbbm{o}}^{1})\dots\in\textsf{RW}\mid\forall 0\leq i<i^{\prime},\ d_{\mathbbm{i}}^{i}\neq d_{\mathbbm{i}}^{i^{\prime}}\}$


Full text

Logical Methods in Computer Science, Volume 17, Issue 1, Paper 22. Submitted Dec. 17, 2019; published Mar. 18, 2021.

This paper is the journal version of [EFR19]. ACM Classification: Theory of computation $\rightarrow$ Logic and verification; Theory of computation $\rightarrow$ Automata over infinite objects; Theory of computation $\rightarrow$ Transducers.

Léo Exibard (a,b), Emmanuel Filiot (b) and Pierre-Alain Reynier (a)

(a) Aix Marseille Univ, Université de Toulon, CNRS, LIS, Marseille, France

[email protected]

(b) Université libre de Bruxelles, Brussels, Belgium

{leo.exibard,efiliot}@ulb.ac.be

Abstract.

In reactive synthesis, the goal is to automatically generate an implementation from a specification of the reactive and non-terminating input/output behaviours of a system. Specifications are usually modelled as logical formulae or automata over infinite sequences of signals ( $\omega$ -words), while implementations are represented as transducers. In the classical setting, the set of signals is assumed to be finite. In this paper, we consider data $\omega$ -words instead, i.e., words over an infinite alphabet. In this context, we study specifications and implementations respectively given as automata and transducers extended with a finite set of registers. We consider different instances, depending on whether the specification is nondeterministic, universal or deterministic, and depending on whether the number of registers of the implementation is given or not.

In the unbounded setting, we show undecidability for both universal and nondeterministic specifications, while decidability is recovered in the deterministic case. In the bounded setting, undecidability still holds for nondeterministic specifications, but decidability can be recovered by disallowing tests over input data. The generic technique we use to show the latter result allows us to reprove a known result, namely decidability of bounded synthesis for universal specifications.

Key words and phrases:

Register Automata, Synthesis, Data words, Transducers

L. Exibard is funded by a FRIA fellowship from the F.R.S.-FNRS. E. Filiot is a research associate of F.R.S.-FNRS. He is supported by the ARC Project Transform Fédération Wallonie-Bruxelles and the FNRS CDR J013116F and MIS F451019F projects. P.-A. Reynier is partly funded by the DeLTA project (ANR–16–CE40–0007).

Introduction

Reactive synthesis is an active research domain whose goal is to design algorithmic methods able to automatically construct a reactive system from a specification of its admissible behaviours. Such systems are notoriously difficult to design correctly, and the main appealing idea of synthesis is to automatically generate systems that are correct by construction. Reactive systems are non-terminating systems that continuously interact with the environment in which they are executed, through input and output signals. At each time step, the system receives an input signal from a set In and produces an output signal from a set Out. An execution is then modelled as an infinite sequence alternating between input and output signals, i.e., an $\omega$ -word in ${(\textsf{In}\cdot\textsf{Out})}^{\omega}$ . Classically, the sets In and Out are assumed to be finite and reactive systems are modelled as (sequential) transducers. Transducers are simple finite-state machines with transitions of type $\textsf{States}\times\textsf{In}\rightarrow\textsf{States}\times\textsf{Out}$ , which, at any state, can process any input signal and deterministically produce some output signal, while possibly moving, again deterministically, to a new state. A specification is then a language $S\subseteq{(\textsf{In}\cdot\textsf{Out})}^{\omega}$ telling which are the acceptable behaviours of the system. It is also classically represented as an automaton, or as a logical formula then converted into an automaton. Some regular specifications may not be realisable by any transducer, and the realisability problem asks, given a regular specification $S$ , whether there exists a transducer $T$ whose behaviours satisfy $S$ (i.e., are included in $S$ ). The synthesis problem asks to construct $T$ if it exists.

A typical example of reactive system is that of a server granting requests from a finite set of clients $C$ . Requests are represented as the set of input signals $\textsf{In}=\{(r,i)\mid i\in C\}\cup\{\textsf{idle}\}$ (client $i$ requests the resource) and grants by the set of output signals $\textsf{Out}=\{(g,i)\mid i\in C\}\cup\{\textsf{idle}\}$ (server grants client $i$ ’s request). A typical constraint to be imposed on such a system is that every request is eventually granted, which can be represented by the LTL formula $\bigwedge_{i\in C}G((r,i)\rightarrow F(g,i))$ . The latter specification is realisable for instance by the transducer which outputs $(g,i)$ whenever it reads $(r,i)$ and idle whenever it reads idle.

It is well-known that the realisability problem is decidable for $\omega$-regular specifications. It is ExpTime-complete when the specification is given as a parity automaton [BL69, PR89, FJLW16], and 2ExpTime-complete for LTL specifications [PR89]. Such positive results have triggered a recent and very active research interest in efficient symbolic methods and tools for reactive synthesis (see e.g. [BCJ18]). Extensions of this classical setting have been proposed to capture more realistic scenarios [BCJ18]. However, only a few works have considered infinite sets of input and output signals. In the previous example, the number of clients is assumed to be finite, and small. To the best of our knowledge, existing synthesis tools do not handle large alphabets, although it is more realistic to consider an unbounded (infinite) set of client identifiers, e.g. $C=\mathbb{N}$. The goal of this paper is to investigate how reactive synthesis can be extended to handle infinite sets of signals.

Data words are infinite sequences $x_{1}x_{2}\dots$ of labelled data, i.e., pairs $(\sigma,d)$ with $\sigma$ a label from a finite alphabet and $d$ a data from a countably infinite alphabet $\mathcal{D}$. They can naturally model executions of reactive systems over an infinite set of signals. Among other models, register automata are one of the main extensions of automata recognising languages of data words [KF94, Seg06]. They can use a finite set of registers in which to store data that are read, and compare the current data with the content of some of the registers (in this paper, we allow equality comparisons). Likewise, transducers can be extended to register transducers as a model of reactive systems over data words: a register transducer is equipped with a set of registers, and when reading an input labelled data $(\sigma,d)$, it can test $d$ for equality with the content of some of its registers, and depending on the result of this test, deterministically assign $d$ to some of its registers and output a finite label $\beta$ along with the content of one of its registers. Its executions are then data words alternating between input and output labelled data, and register automata can thus be used to represent specifications, as languages of such data words.

Contributions

We consider two classical semantics for register automata, nondeterministic and universal, both with a parity acceptance condition, which give two classes of register automata respectively denoted NRA and URA. We study the parity acceptance condition because it can express the other classical acceptance conditions; e.g., Büchi and co-Büchi can be expressed with a 2-colours parity condition. Since NRA are not closed under complement (already over finite data words), NRA and URA define incomparable classes of specifications. The request-grant specification, as defined above, can be generalised to an infinite number of clients, and it is then expressible by a URA [KMB18]: whenever a request is made by client $i$ (labelled data $(r,i)$), universally trigger a run which stores $i$ in some register and verifies that the labelled data $(g,i)$ eventually occurs in the data word. In contrast, no NRA can define it. On the other hand, consider the specification $S_{0}$: “all input data but one are copied on the output, the missing one being replaced by some data which occurred before it”, modelled as the set of data sequences $d_{1}d_{1}d_{2}d_{2}\dots d_{i}d_{j}d_{i+1}d_{i+1}\dots$ for all $i\geq 0$ and $j<i$ (finite labels are irrelevant and not represented). $S_{0}$ is not definable by any URA, as it would require guessing $j$, which can be arbitrarily smaller than $i$, but it is expressible by some NRA making this guess.

However, we show (unsurprisingly) that the realisability problem by register transducers of specifications defined by NRA is undecidable. The same negative result also holds for URA, solving an open question raised in [KMB18]. On the positive side, we show that decidability is recovered for deterministic (parity) register automata (DRA) in which the output is driven by the input (meaning that it is contained in some register). We call this class the DRA with input-driven outputs, denoted by $\textsf{DRA}_{\textsf{ido}}$. One of the difficulties of register transducer synthesis is that the number of registers needed to realise the specification is, a priori, unbounded with regard to the number of registers of the specification. We show that this is in fact not the case for $\textsf{DRA}_{\textsf{ido}}$: any specification expressed as a $\textsf{DRA}_{\textsf{ido}}$ with $r$ registers is realisable by a register transducer iff it is realisable by a transducer with $r$ registers.

A way to obtain decidability is to fix a bound $k$ and to target register transducers with at most $k$ registers. This setting is called bounded synthesis in [KMB18], which establishes that bounded synthesis is decidable in 2ExpTime for URA. We show that unfortunately, bounded synthesis is still undecidable for NRA specifications (even when targeting implementations with a single register). To recover decidability for NRA, we disallow equality tests on the input data and add a syntactic requirement which entails that on any accepted word, each output data is the content of some register which has been assigned an input data occurring before. This defines a subclass of NRA that we call (input) test-free NRA ($\textsf{NRA}_{\textsf{tf}}$). $\textsf{NRA}_{\textsf{tf}}$ can express how output data can be obtained from input data (by copying, moving or duplicating them), although they do not have the whole power of register automata on the input nor the output side. Note that the specification $S_{0}$ given before is $\textsf{NRA}_{\textsf{tf}}$-definable. To show that bounded synthesis is decidable for $\textsf{NRA}_{\textsf{tf}}$, we establish a generic transfer property characterising realisable data word specifications in terms of realisability of corresponding specifications over a finite alphabet, thus reducing to the well-known synthesis problem over a finite alphabet. This property also allows us to reprove the result of [KMB18], with a rather short proof based on standard results from the theory of register automata, indicating that it might help establish decidability for other classes of data specifications. Our results are summarised in Table 1.

Related Work

As already mentioned, bounded synthesis of register transducers is considered in [KMB18] where it is shown to be decidable for URA. We reprove this result in a shorter way. Our proof bears some similarities with that of [KMB18], but it seems that our formulation benefits more from the use of existing results. The technique is also more generic and we instantiate it to $\textsf{NRA}_{\textsf{tf}}$ . $\textsf{NRA}_{\textsf{tf}}$ correspond to the one-way, nondeterministic version of the expressive transducer model of [DH16], which however does not consider the synthesis problem.

The synthesis problem over infinite alphabets is also considered in [ESK14], in which data represent identifiers and specifications (given as particular automata close to register automata) can depend on equality between identifiers. However, the class of implementations is very expressive: it allows for unbounded memory through a queue data structure. The synthesis problem is shown to be undecidable and a sound but incomplete algorithm is given.

Finally, classical reactive synthesis has strong connections with game theory on finite graphs. Some extensions of games to infinite graphs whose vertices are valuations of variables in an infinite data domain have been considered in [FP18]. Such games are shown to be undecidable and a decidable restriction is proposed, which however does not seem to match our context.

1. Data Words and Register Automata

For a (possibly infinite) set $S$ , we denote by $S^{\omega}$ the set of infinite words over this alphabet. For $1\leq i\leq j$ , we let $u[i{:}j]=u_{i}u_{i+1}\dots u_{j}$ and $u[i]=u[i{:}i]$ the $i$ th letter of $u$ . For $u,v\in S^{\omega}$ , we define their interleaving $\langle u,v\rangle=u[1]v[1]u[2]v[2]\dots$

1.1. Data Words

Let $\Sigma$ be a finite alphabet and $\mathcal{D}$ a countably infinite set, denoting, throughout this paper, a set of elements called data. We also distinguish an (arbitrary) data value $\textsf{d}_{0}\in\mathcal{D}$. Given a set $R$, let $\tau_{0}^{R}$ be the constant function defined by $\tau_{0}^{R}(r)=\textsf{d}_{0}$ for all $r\in R$. A labelled data (or l-data for short) is a pair $x=(\sigma,d)\in\Sigma\times\mathcal{D}$, where $\sigma$ is the label and $d$ the data. We define the projections $\textsf{lab}(x)=\sigma$ and $\textsf{dt}(x)=d$. A data word over $\Sigma$ and $\mathcal{D}$ is an infinite sequence of labelled data, i.e. a word $w\in{(\Sigma\times\mathcal{D})}^{\omega}$. We extend the projections lab and dt to data words naturally, i.e. $\textsf{lab}(w)\in\Sigma^{\omega}$ and $\textsf{dt}(w)\in\mathcal{D}^{\omega}$. We denote the set of data words over $\Sigma$ and $\mathcal{D}$ by $\textsf{DW}(\Sigma,\mathcal{D})$ (DW when clear from the context). A data word language is a subset $L\subseteq\textsf{DW}(\Sigma,\mathcal{D})$. Note that in this paper, data words are infinite; when they are finite, we call them finite data words, and we denote by $\textsf{DW}_{\!f}(\Sigma,\mathcal{D})$ the set of finite data words.
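As a small illustration of these definitions, the following Python sketch models a finite prefix of a data word as a list of (label, data) pairs and implements the projections lab and dt, together with the interleaving $\langle u,v\rangle$ restricted to finite prefixes. Representing data as integers, working on finite prefixes, and all function names are assumptions made for the example only.

```python
from typing import List, Sequence, Tuple, TypeVar

# Assumption for illustration: data values are integers, and an infinite
# data word is represented by one of its finite prefixes.
LData = Tuple[str, int]  # a labelled data (label, data)
X = TypeVar("X")


def lab(x: LData) -> str:
    """Projection of an l-data onto its label."""
    return x[0]


def dt(x: LData) -> int:
    """Projection of an l-data onto its data value."""
    return x[1]


def lab_word(w: Sequence[LData]) -> List[str]:
    """Pointwise extension of lab to a (prefix of a) data word."""
    return [lab(x) for x in w]


def dt_word(w: Sequence[LData]) -> List[int]:
    """Pointwise extension of dt to a (prefix of a) data word."""
    return [dt(x) for x in w]


def interleave(u: Sequence[X], v: Sequence[X]) -> List[X]:
    """Finite-prefix analogue of <u, v> = u[1] v[1] u[2] v[2] ..."""
    out: List[X] = []
    for a, b in zip(u, v):
        out.extend([a, b])
    return out
```

For instance, `interleave` applied to an input prefix and an output prefix of equal length yields the corresponding prefix of their interleaving.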

1.2. Register Automata

Register automata are automata recognising data word languages. They were first introduced in [KF94] as finite-memory automata. Here, we define them in a spirit close to [LTV15], but over infinite words, with a parity acceptance condition. The current data can be compared for equality with the register contents via tests. Our tests are symbolic and defined via Boolean formulas of the following form. Given $R$ a set of registers, a test is a formula $\phi$ satisfying the following syntax:

$$\phi::=\top\mid\bot\mid r^{=}\mid r^{\neq}\mid\phi\wedge\phi\mid\phi\vee\phi\mid\neg\phi$$

where $r\in R$ . Given a valuation $\tau:R\rightarrow\mathcal{D}$ , a test $\phi$ and a data $d$ , we denote by $\tau,d\models\phi$ the satisfiability of $\phi$ by $d$ in valuation $\tau$ , defined as $\tau,d\models r^{=}$ if $\tau(r)=d$ and $\tau,d\models r^{\neq}$ if $\tau(r)\neq d$ . The Boolean combinators behave as usual. We denote by $\textnormal{{Tst}}_{R}$ the set of (symbolic) tests over $R$ .
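The satisfaction relation $\tau,d\models\phi$ can be sketched as a recursive evaluator. The encoding of tests as nested tuples (`("eq", r)` for $r^{=}$, `("neq", r)` for $r^{\neq}$, `("top",)` and `("bot",)` for the constants, and so on), the use of integers for data, and the function name `sat` are all assumptions made for this illustration.

```python
from typing import Dict

# Hypothetical encoding of tests as nested tuples:
#   ("top",), ("bot",), ("eq", r), ("neq", r),
#   ("and", phi1, phi2), ("or", phi1, phi2), ("not", phi)
Valuation = Dict[str, int]


def sat(tau: Valuation, d: int, phi: tuple) -> bool:
    """Return True iff tau, d |= phi."""
    op = phi[0]
    if op == "top":
        return True
    if op == "bot":
        return False
    if op == "eq":   # r^=  holds when register r contains d
        return tau[phi[1]] == d
    if op == "neq":  # r^!= holds when register r does not contain d
        return tau[phi[1]] != d
    if op == "and":
        return sat(tau, d, phi[1]) and sat(tau, d, phi[2])
    if op == "or":
        return sat(tau, d, phi[1]) or sat(tau, d, phi[2])
    if op == "not":
        return not sat(tau, d, phi[1])
    raise ValueError(f"unknown test constructor: {op!r}")
```

With `tau = {"r1": 5, "r2": 7}`, the data 5 satisfies `("and", ("eq", "r1"), ("neq", "r2"))`, while a fresh data such as 9 satisfies the negation of `("or", ("eq", "r1"), ("eq", "r2"))`.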

Definition. A register automaton (RA) is a tuple $\mathcal{A}=(\Sigma,\mathcal{D},Q,q_{0},\delta,R,c)$, where:

• $\Sigma$ is a finite alphabet of labels and $\mathcal{D}$ is an infinite alphabet of data;

• $Q$ is a finite set of states and $q_{0}\in Q$ is the initial state;

• $R$ is a finite set of registers, and we denote $\textnormal{{Asgn}}_{R}=2^{R}$;

• $c:Q\rightarrow\{1,\dots,d\}$, where $d\in\mathbb{N}$ is the number of priorities, is the colouring function, used to define the acceptance condition;

• $\delta\subseteq Q\times\Sigma\times\textnormal{{Tst}}_{R}\times\textnormal{{Asgn}}_{R}\times Q$ is a set of transitions.

A transition $(q,\sigma,\phi,\textnormal{{asgn}},q^{\prime})$ is also written $q\xrightarrow[\mathcal{A}]{\sigma,\phi,\textnormal{{asgn}}}q^{\prime}$. We may omit $\mathcal{A}$ in the latter notation. Intuitively, such a transition means that on input $(\sigma,d)$ in state $q$ the automaton:

(1) checks that $\phi$ is satisfied by the current register contents and the current data;

(2) assigns $d$ to all the registers in asgn (asgn might be empty);

(3) transitions to state $q^{\prime}$.

$\mathcal{A}$ is said to be deterministic if the tests are mutually exclusive, i.e., for any two distinct transitions of the form $q\xrightarrow{\sigma,\phi,\textnormal{{asgn}}}q^{\prime}$ and $q\xrightarrow{\sigma^{\prime},\phi^{\prime},\textnormal{{asgn}}^{\prime}}q^{\prime\prime}$, either $\sigma\neq\sigma^{\prime}$ or $\phi\wedge\phi^{\prime}$ is not satisfiable. The automaton $\mathcal{A}$ is said to be complete if for any given state $q$, any label $\sigma$, any data $d$ and any register valuation $\tau$, there exists a transition $q\xrightarrow{\sigma,\phi,\textnormal{{asgn}}}q^{\prime}\in\delta$ such that $\tau,d\models\phi$.
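The determinism condition can be checked by brute force, as in the following sketch. It relies on the observation that a test only depends on which registers contain the current data; since $\mathcal{D}$ is infinite, every subset of $R$ arises as this set for some valuation and data, so $\phi\wedge\phi^{\prime}$ is satisfiable iff it holds for some subset. The tuple encoding of tests and the names `holds`, `satisfiable` and `is_deterministic` are assumptions made for this illustration, and the enumeration is exponential in the number of registers.

```python
from itertools import combinations
from typing import FrozenSet, Iterable, List, Tuple

# Hypothetical encoding: tests are nested tuples such as ("eq", r),
# ("neq", r), ("and", a, b), ("or", a, b), ("not", a), ("top",), ("bot",).
# A transition is a tuple (q, sigma, phi, asgn, q2).
Transition = Tuple[str, str, tuple, FrozenSet[str], str]


def holds(phi: tuple, equal: FrozenSet[str]) -> bool:
    """Evaluate phi when `equal` is the set of registers containing d."""
    op = phi[0]
    if op == "top":
        return True
    if op == "bot":
        return False
    if op == "eq":
        return phi[1] in equal
    if op == "neq":
        return phi[1] not in equal
    if op == "and":
        return holds(phi[1], equal) and holds(phi[2], equal)
    if op == "or":
        return holds(phi[1], equal) or holds(phi[2], equal)
    if op == "not":
        return not holds(phi[1], equal)
    raise ValueError(f"unknown test constructor: {op!r}")


def satisfiable(phi: tuple, registers: Iterable[str]) -> bool:
    """Try every subset of registers as the set of registers equal to d."""
    regs = list(registers)
    return any(
        holds(phi, frozenset(subset))
        for k in range(len(regs) + 1)
        for subset in combinations(regs, k)
    )


def is_deterministic(transitions: List[Transition], registers: Iterable[str]) -> bool:
    """Check that tests of distinct transitions sharing (q, sigma) are exclusive."""
    regs = list(registers)
    for i, (q, sigma, phi, _, _) in enumerate(transitions):
        for (q2, sigma2, phi2, _, _) in transitions[i + 1:]:
            if q == q2 and sigma == sigma2 and satisfiable(("and", phi, phi2), regs):
                return False
    return True
```

The list of transitions is assumed to contain no duplicates, matching the fact that $\delta$ is a set.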

1.3. Configurations and Runs

A configuration is a pair $(q,\tau)\in Q\times(R\rightarrow\mathcal{D})$. Fix a transition $t=p\xrightarrow{\sigma,\phi,\textnormal{{asgn}}}p^{\prime}$. We say that $(q,\tau)$ enables $t$ on reading $(\sigma^{\prime},d)$ if $q=p$, $\sigma^{\prime}=\sigma$ and $\tau,d\models\phi$. Let $\text{next}(\tau,\textnormal{{asgn}},d)$ be the valuation $\tau^{\prime}$ defined by $\tau^{\prime}(i)=d$ if $i\in\textnormal{{asgn}}$, and $\tau^{\prime}(i)=\tau(i)$ otherwise. We extend this notation to configurations as follows: if $\gamma=(q,\tau)$ enables $t$ on input $(\sigma,d)$, the successor configuration of $(q,\tau)$ by $t$ on input $(\sigma,d)$ is $\text{next}(\gamma,t,\sigma,d)=(p^{\prime},\text{next}(\tau,\textnormal{{asgn}},d))$. The initial configuration is $(q_{0},\tau_{0}^{R})$. Then, a run over a data word $(\sigma_{1},d_{1})(\sigma_{2},d_{2})\dots$ is an infinite sequence of transitions $t_{0}t_{1}\dots$ such that there exists a sequence of configurations $\gamma_{0}\gamma_{1}\dots=(q_{0},\tau_{0})(q_{1},\tau_{1})\dots$ such that $\gamma_{0}$ is initial and for all $i\geq 0$, $\gamma_{i}$ enables $t_{i}$ on input $(\sigma_{i+1},d_{i+1})$ and $\gamma_{i+1}=\text{next}(\gamma_{i},t_{i},\sigma_{i+1},d_{i+1})$. With a run $\rho$, we associate its sequence of states $\textsf{states}(\rho)=q_{0}q_{1}\dots$
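The notions of enabled transition, successor configuration and run can be made concrete on finite prefixes of data words. In the sketch below, tests are modelled as Python predicates over (valuation, data), data are integers, and the names `next_valuation`, `enables`, `next_config` and `run_prefixes` are hypothetical; an actual run is infinite, so the code only enumerates the finite sequences of transitions that can be fired on a given prefix.

```python
from typing import Callable, Dict, FrozenSet, List, Sequence, Tuple

Valuation = Dict[str, int]
Test = Callable[[Valuation, int], bool]  # assumption: tests as predicates
Transition = Tuple[str, str, Test, FrozenSet[str], str]  # (p, sigma, phi, asgn, p')
Config = Tuple[str, Valuation]


def next_valuation(tau: Valuation, asgn, d: int) -> Valuation:
    """next(tau, asgn, d): store d in the registers of asgn, keep the others."""
    return {r: (d if r in asgn else v) for r, v in tau.items()}


def enables(config: Config, t: Transition, sigma: str, d: int) -> bool:
    """Does the configuration enable t on reading (sigma, d)?"""
    q, tau = config
    p, label, phi, _, _ = t
    return q == p and label == sigma and phi(tau, d)


def next_config(config: Config, t: Transition, sigma: str, d: int) -> Config:
    """Successor configuration by t on input (sigma, d)."""
    if not enables(config, t, sigma, d):
        raise ValueError("transition is not enabled")
    _, tau = config
    return (t[4], next_valuation(tau, t[3], d))


def run_prefixes(config: Config, transitions: Sequence[Transition],
                 word: Sequence[Tuple[str, int]]) -> List[List[Transition]]:
    """All sequences of transitions that can be fired on a finite prefix."""
    if not word:
        return [[]]
    (sigma, d), rest = word[0], word[1:]
    runs: List[List[Transition]] = []
    for t in transitions:
        if enables(config, t, sigma, d):
            succ = next_config(config, t, sigma, d)
            for tail in run_prefixes(succ, transitions, rest):
                runs.append([t] + tail)
    return runs
```

For a deterministic automaton, `run_prefixes` returns at most one sequence; for a nondeterministic or universal one, it may return several.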

1.4. Languages Defined by RA

Given a run $\rho$, we denote, by a slight abuse of notation, $c(\rho)=\max\{j\mid c(q_{l})=j\text{ for infinitely many }q_{l}\in\textsf{states}(\rho)\}$ the maximal colour that occurs infinitely often in $\rho$. Then, in the parity acceptance condition, $\rho$ is accepting whenever $c(\rho)$ is even. We consider two dual semantics for RA: nondeterministic (N) and universal (U). Given an RA $A$, depending on whether it is considered nondeterministic or universal, it recognises $L_{N}(A)=\{w\mid\text{there exists an accepting run }\rho\text{ on }w\}$ or $L_{U}(A)=\{w\mid\text{all runs }\rho\text{ on }w\text{ are accepting}\}$. Note that those semantics are dual: for an RA $A$, by letting $\overline{A}$ be a copy of $A$ with colouring function $\overline{c}:q\mapsto c(q)+1$, we have that $L_{U}(\overline{A})=\overline{L_{N}(A)}$.
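To illustrate the parity condition and the duality, consider the special case of a run whose state sequence is ultimately periodic, i.e. of the form prefix·cycle$^{\omega}$. The states occurring infinitely often are then exactly those on the cycle, so $c(\rho)$ is the maximal colour on the cycle. The sketch below makes this restriction explicitly; the function names are hypothetical.

```python
from typing import Dict, Sequence


def max_inf_colour(cycle: Sequence[str], colour: Dict[str, int]) -> int:
    """c(rho) for a run whose states are prefix . cycle^omega.

    Only the (non-empty) cycle matters: its states are exactly those
    visited infinitely often.
    """
    if not cycle:
        raise ValueError("the cycle of an infinite run cannot be empty")
    return max(colour[q] for q in cycle)


def accepting(cycle: Sequence[str], colour: Dict[str, int]) -> bool:
    """Parity acceptance: the maximal colour seen infinitely often is even."""
    return max_inf_colour(cycle, colour) % 2 == 0


def dual_colouring(colour: Dict[str, int]) -> Dict[str, int]:
    """Colouring of the dual automaton: shift every colour by one."""
    return {q: c + 1 for q, c in colour.items()}
```

Shifting all colours by one flips the parity of the maximal colour seen infinitely often, so a run is accepting for `colour` exactly when it is rejecting for `dual_colouring(colour)`, which is the mechanism behind $L_{U}(\overline{A})=\overline{L_{N}(A)}$.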

We denote by NRA (resp. URA) the class of register automata interpreted with a nondeterministic (resp. universal) parity acceptance condition, and given $A\in\textnormal{{NRA}}$ (resp. $A\in\textnormal{{URA}}$ ), we write $L(A)$ instead of $L_{N}(A)$ (resp. $L_{U}(A)$ ). We also denote by DRA the class of deterministic parity register automata.

2. Synthesis of Register Transducers

2.1. Specifications, Implementations and the Realisability Problem

Let $\Sigma_{\mathbbm{i}}$ and $\Sigma_{\mathbbm{o}}$ be two finite alphabets of labels, and $\mathcal{D}$ a countable set of data. A relational data word is an element $w\in{[(\Sigma_{\mathbbm{i}}\times\mathcal{D})\cdot(\Sigma_{\mathbbm{o}}\times\mathcal{D})]}^{\omega}$. Such a word is called relational as it defines a pair of data words in $\textsf{DW}(\Sigma_{\mathbbm{i}},\mathcal{D})\times\textsf{DW}(\Sigma_{\mathbbm{o}},\mathcal{D})$ through the following projections. If $w=x^{1}_{\mathbbm{i}}x^{1}_{\mathbbm{o}}x^{2}_{\mathbbm{i}}x^{2}_{\mathbbm{o}}\dots$, we let $\textsf{inp}(w)=x^{1}_{\mathbbm{i}}x^{2}_{\mathbbm{i}}\dots$ and $\textsf{out}(w)=x^{1}_{\mathbbm{o}}x^{2}_{\mathbbm{o}}\dots$ We denote by $\textsf{RW}(\Sigma_{\mathbbm{i}},\Sigma_{\mathbbm{o}},\mathcal{D})$ (just RW when clear from the context) the set of relational data words. A specification is simply a language $S\subseteq\textsf{RW}(\Sigma_{\mathbbm{i}},\Sigma_{\mathbbm{o}},\mathcal{D})$. An implementation is a total function $I:{(\Sigma_{\mathbbm{i}}\times\mathcal{D})}^{*}\rightarrow\Sigma_{\mathbbm{o}}\times\mathcal{D}$. From $I$, we define another function $f_{I}:\textsf{DW}(\Sigma_{\mathbbm{i}},\mathcal{D})\rightarrow\textsf{DW}(\Sigma_{\mathbbm{o}},\mathcal{D})$ which, with an input data word $w_{\mathbbm{i}}=x^{1}_{\mathbbm{i}}x^{2}_{\mathbbm{i}}\dots\in\textsf{DW}(\Sigma_{\mathbbm{i}},\mathcal{D})$, associates the output data word $f_{I}(w_{\mathbbm{i}})=x^{1}_{\mathbbm{o}}x^{2}_{\mathbbm{o}}\dots$ such that $\forall i\geq 1$, $x^{i}_{\mathbbm{o}}=I(x^{1}_{\mathbbm{i}}\dots x^{i}_{\mathbbm{i}})$, so that the $i$th output is produced after reading the $i$th input. $I$ also defines a language of relational data words $L(I)=\{\langle w_{\mathbbm{i}},f_{I}(w_{\mathbbm{i}})\rangle\mid w_{\mathbbm{i}}\in\textsf{DW}(\Sigma_{\mathbbm{i}},\mathcal{D})\}$.

We say that $I$ realises $S$ when $L(I)\subseteq S$ , and that $S$ is realisable if there exists an implementation realising it. Note that since $f_{I}$ is a total function, we have that if $S$ is realisable, then in particular its domain is total, i.e. for all $w_{\mathbbm{i}}\in\textsf{DW}(\Sigma_{\mathbbm{i}},\mathcal{D})$ , there exists $w_{\mathbbm{o}}\in\textsf{DW}(\Sigma_{\mathbbm{o}},\mathcal{D})$ such that $\langle w_{\mathbbm{i}},w_{\mathbbm{o}}\rangle\in S$ . Therefore, any specification whose domain is not total is not realisable according to this definition. For a discussion on this definition, see Section 5.
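The passage from an implementation $I$ to the function $f_{I}$ and to relational data words can be sketched on finite prefixes as follows. The sketch assumes the convention in which the $i$th output is computed from the first $i$ inputs (the one used by register transducers, which output after reading), models data as integers, and uses hypothetical names (`f_prefix`, `relational_prefix`, `echo`).

```python
from typing import Callable, List, Sequence, Tuple

LData = Tuple[str, int]  # (label, data); integer data is an assumption
Implementation = Callable[[Sequence[LData]], LData]


def f_prefix(I: Implementation, inputs: Sequence[LData]) -> List[LData]:
    """Prefix of f_I(w): the i-th output is I applied to the first i inputs."""
    return [I(inputs[:i]) for i in range(1, len(inputs) + 1)]


def relational_prefix(I: Implementation, inputs: Sequence[LData]) -> List[LData]:
    """Prefix of <w, f_I(w)>: inputs and outputs in alternation."""
    outputs = f_prefix(I, inputs)
    return [x for pair in zip(inputs, outputs) for x in pair]


def echo(history: Sequence[LData]) -> LData:
    """Hypothetical implementation: answer 'grt' with the last data read."""
    _, d = history[-1]
    return ("grt", d)
```

Checking $L(I)\subseteq S$ itself is a statement about all infinite input words, so a prefix-based sketch like this one can only be used to refute realisation on a given input, not to establish it.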

The realisability problem consists, given a (finite representation of a) specification $S$ , in checking whether $S$ is realisable. In general, we parameterise this problem by classes of specifications $\mathcal{S}$ and of implementations $\mathcal{I}$ , defining the $(\mathcal{S},\mathcal{I})$ -realisability problem, denoted $(\mathcal{S},\mathcal{I})$ . Given a specification $S\in\mathcal{S}$ , it asks whether $S$ is realisable by some implementation $I\in\mathcal{I}$ . We now introduce the classes $\mathcal{S}$ and $\mathcal{I}$ we consider.

2.2. Specification Register Automata

In this paper, we consider specifications defined by register automata (hence alternately reading input and output labelled data). We assume that the set of states is partitioned into $Q_{\mathbbm{i}}$ (called input states, reading only labels in $\Sigma_{\mathbbm{i}}$ ) and $Q_{\mathbbm{o}}$ (called output states, reading only labels in $\Sigma_{\mathbbm{o}}$ ), where $q_{0}\in Q_{\mathbbm{i}}$ , and such that the transition relation $\delta$ alternates between these two sets, i.e.

[TABLE]

where $\overline{\mathbbm{i}}=\mathbbm{o}$ (resp. $\overline{\mathbbm{o}}=\mathbbm{i}$). We denote by DRA (resp. NRA, URA) the class of specifications defined by deterministic (resp. nondeterministic, universal) parity register automata.

Example. Remember the setting described in the introduction of a server granting requests from an unbounded set of clients $C$. The input (resp. output) finite alphabets are $\Sigma_{\mathbbm{i}}=\{\textnormal{{req}},\textnormal{{idle}}\}$ and $\Sigma_{\mathbbm{o}}=\{\textnormal{{grt}},\textnormal{{idle}}\}$, while the set of data is any countably infinite set $\mathcal{D}$ containing $C$. Without loss of generality, $C\subseteq\mathbb{N}$ is a set of client ids, so we can take $\mathcal{D}=\mathbb{N}$. Then, as stated in the introduction, the specification that for all $i\in C$, every request of client $i$ is eventually granted can be expressed with the URA of Figure 1.

2.3. Register Transducers As Implementations

We consider implementations represented as transducers processing data words. A register transducer is a tuple $T=(\Sigma_{\mathbbm{i}},\Sigma_{\mathbbm{o}},Q,q_{0},\delta,R)$ where $Q$ is a finite set of states with initial state $q_{0}$ , $R$ is a finite set of registers, and $\delta:Q\times\Sigma_{\mathbbm{i}}\times\textnormal{{Tst}}_{R}\rightarrow\textnormal{{Asgn}}_{R}\times\Sigma_{\mathbbm{o}}\times R\times Q$ is the transition function (as before, $\textnormal{{Asgn}}_{R}=2^{R}$ ), assumed to be complete in the sense that, as for RA, for every state $q$ and label $\sigma_{\mathbbm{i}}$ , for every data $d$ and register valuation $\tau$ , there exists a transition $\delta(q,\sigma_{\mathbbm{i}},\phi)=(\textnormal{{asgn}},\sigma_{\mathbbm{o}},r,q^{\prime})$ such that $\tau,d\models\phi$ . When processing an l-data $(\sigma_{\mathbbm{i}},d)$ , $T$ compares $d$ with the content of some of its registers, and depending on the result, moves to another state, stores $d$ in some registers, and outputs some label in $\Sigma_{\mathbbm{o}}$ along with the content of some register $r\in R$ .

Let us formally define the semantics of a register transducer $T$, as an implementation $I_{T}$. First, for a finite input data word $w=(\sigma_{\mathbbm{i}}^{1},d_{\mathbbm{i}}^{1})\dots(\sigma_{\mathbbm{i}}^{n},d_{\mathbbm{i}}^{n})$ in ${(\Sigma_{\mathbbm{i}}\times\mathcal{D})}^{*}$, we denote by $(q_{i},\tau_{i})$ the $i$th configuration reached by $T$ on $w$, where $(q_{0},\tau_{0})$ is initial and for all $0<i\leq n$, $(q_{i},\tau_{i})$ is the unique configuration such that there exists a transition $\delta(q_{i-1},\sigma_{\mathbbm{i}}^{i},\phi)=(\textnormal{{asgn}},\sigma_{\mathbbm{o}},r,q_{i})$ such that $\tau_{i-1},d_{\mathbbm{i}}^{i}\models\phi$ and $\tau_{i}=\text{next}(\tau_{i-1},\textnormal{{asgn}},d_{\mathbbm{i}}^{i})$. We let $(\sigma_{\mathbbm{o}}^{i},d_{\mathbbm{o}}^{i})=(\sigma_{\mathbbm{o}},\tau_{i}(r))$ and $I_{T}(w)=(\sigma_{\mathbbm{o}}^{n},d_{\mathbbm{o}}^{n})$. Then, we denote $f_{T}=f_{I_{T}}$ and $L(T)=L(I_{T})$. Note that if $T$ is interpreted as a DRA with exactly one transition per output state and whose states are all accepting (i.e. have even maximal parity [math]), then $L(I_{T})$ is indeed the language of this register automaton. We denote by $\textnormal{{RT}}[k]$ the class of implementations defined by register transducers with at most $k$ registers, and by $\textnormal{{RT}}=\bigcup_{k\geq 0}\textnormal{{RT}}[k]$ the class of implementations defined by register transducers.

Example. Consider again the specification of Example 2.2. This specification is realisable for instance by the transducer which outputs $(\textnormal{{grt}},i)$ whenever it reads $(\textnormal{{req}},i)$ and $(\textnormal{{idle}},d)$ ($d$ does not matter) whenever it reads idle, which is depicted in Figure 2.
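The semantics of a register transducer on a finite input prefix can be sketched as a small interpreter. The encoding is an assumption: $\delta$ is a dictionary mapping (state, input label) to a list of entries (test, asgn, output label, output register, next state), tests are Python predicates, data are integers with $\textsf{d}_{0}=0$, and the one-state, one-register transducer `delta` below is one possible rendering of the request-grant transducer of the example, not necessarily the one of Figure 2.

```python
from typing import Callable, Dict, List, Sequence, Set, Tuple

Valuation = Dict[str, int]
Test = Callable[[Valuation, int], bool]
# (test, asgn, output label, output register, next state)
Entry = Tuple[Test, Set[str], str, str, str]
Delta = Dict[Tuple[str, str], List[Entry]]


def run_transducer(delta: Delta, q0: str, registers: Sequence[str],
                   inputs: Sequence[Tuple[str, int]], d0: int = 0) -> List[Tuple[str, int]]:
    """Outputs produced on a finite input prefix, one per input l-data."""
    q = q0
    tau: Valuation = {r: d0 for r in registers}  # initial valuation
    outputs: List[Tuple[str, int]] = []
    for sigma, d in inputs:
        for phi, asgn, sigma_o, r, q_next in delta[(q, sigma)]:
            if phi(tau, d):
                # First store d in the assigned registers, then output
                # the content of r in the updated valuation.
                tau = {x: (d if x in asgn else v) for x, v in tau.items()}
                outputs.append((sigma_o, tau[r]))
                q = q_next
                break
        else:
            raise ValueError("no enabled transition: transducer is not complete")
    return outputs


# Hypothetical request-grant transducer: store the current data in r and
# output it, labelled grt after req and idle after idle.
top: Test = lambda tau, d: True
delta: Delta = {
    ("q", "req"): [(top, {"r"}, "grt", "r", "q")],
    ("q", "idle"): [(top, {"r"}, "idle", "r", "q")],
}
```

Here `run_transducer(delta, "q", ["r"], w)[-1]` plays the role of $I_{T}(w)$ for a non-empty finite word `w`, and the full list is the corresponding prefix of $f_{T}$.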

2.4. Synthesis from Data-Free Specifications

If, in the latter definitions of the synthesis problem, one considers specifications defined by RA with no registers (i.e. parity automata), and implementations defined by RT with no registers, then the data in data words can be ignored and we are back to the classical reactive synthesis setting, for which important results are known:

Theorem ([BL69]). The realisability problem of (data-free) specifications given as (register-free) nondeterministic parity automata by (register-free) transducers is ExpTime-complete.

Proof.

The upper bound was first established in [BL69] and [PR89]. Hardness is folklore, but a proof in the particular case of finite words (easily adapted to the $\omega$ -word setting) can be found in [FJLW16, Proposition 6]. ∎

3. Unbounded Synthesis

In this section, we consider the unbounded synthesis problem $(\textnormal{{RA}},\textnormal{{RT}})$ . Thus, we do not fix a priori the number of registers of the implementation.

3.1. Undecidability Results

Let us first consider the case of NRA and URA, which are, in our setting, the most natural devices to express data word specifications. Unfortunately, the two corresponding problems happen to be undecidable:

Theorem 1.

$(\textnormal{{NRA}},\textnormal{{RT}})$ is undecidable.

Proof 3.1.

We reduce from the universality problem of NRA over finite words, which is undecidable [NSV04]. Let $A$ be a (finite data-word) NRA. Let $S$ be a specification which first reads some finite data word $w$ , then a separator $\#$ (its associated data is arbitrary and not represented), then allows for swapping the labels of the first and second l-data on any input read later on, while also allowing to behave like the identity whenever $w\in L(A)$ . $S$ is also equal to the identity over any word not containing $\#$ so that its domain is total. Formally, let $S=S_{1}\cup S_{2}\cup T$ , where:

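$$\begin{array}{rcl}S_{1}&=&\{(w\#(\sigma_{1},d_{1})(\sigma_{2},d_{2})u,\ w\#(\sigma_{2},d_{1})(\sigma_{1},d_{2})u)\mid w\in\textsf{DW}_{\!f},u\in\textsf{DW}\}\\ S_{2}&=&\{(w\#u,\ w\#u)\mid w\in L(A),u\in\textsf{DW}\}\\ T&=&\{(u,u)\mid u\notin\textsf{DW}_{\!f}\#\textsf{DW}\}\end{array}$$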

$S$ is definable by a NRA running over relational data words, because each component is, and NRA are closed under union. Recognising the swap of the first two labels $\sigma_{1}$ and $\sigma_{2}$ after the $\#$ in $S_{1}$ is easily done using nondeterminism, and the behaviour on data is the identity, so $S_{1}$ is NRA-definable. Then, emulating the identity over some NRA-definable domain is easy, so $S_{2}$ and $T$ are also NRA-definable.

Now, if $A$ is universal, i.e. $L(A)=\textsf{DW}_{\!f}$ , then the identity $\mathrm{id}_{\textsf{DW}}$ over DW realises $S$ , since then $\mathrm{id}_{\textsf{DW}}\subseteq S$ and has total domain. Conversely, if $L(A)\subsetneq\textsf{DW}_{\!f}$ , assume by contradiction that $S$ is realisable by a register transducer $I$ . Let $w\in\textsf{DW}_{\!f}\backslash L(A)$ . Then, for any $(\sigma_{1},d_{1})(\sigma_{2},d_{2})u\in\textsf{DW}$ , we must have $I(w\#(\sigma_{1},d_{1})(\sigma_{2},d_{2})u)=w\#(\sigma_{2},d_{1})(\sigma_{1},d_{2})u$ ; but this implies guessing the second label while having only read the first one, which is not doable by any transducer as long as $\sigma_{1}\neq\sigma_{2}$ .

Actually, we can observe that such undecidability proof extends to $(\textnormal{{NRA}},\textnormal{{RT}}[1])$ , and to all $(\textnormal{{NRA}},\textnormal{{RT}}[k])$ for $k\geq 1$ . Indeed, $A$ is universal iff $S$ is realisable by the identity over data words, which is implementable using a $1$ -register transducer:

Theorem 2.

For all $k\geq 1$ , $(\textnormal{{NRA}},\textnormal{{RT}}[k])$ is undecidable.

Now, we can show that the unbounded synthesis problem is also undecidable for URA, answering a question left open in [KMB18].

Theorem 3.

$(\textnormal{{URA}},\textnormal{{RT}})$ is undecidable.

Proof 3.2.

We present a reduction to our synthesis problem from the emptiness problem of URA over finite words. The latter is undecidable by a direct reduction from the universality problem of NRA, which is undecidable by [NSV04].

First, consider the relation $S_{1}=\{(u\#v,u\#w)\mid u\in\textsf{DW}_{\!f},v\in\textsf{DW},\text{ each data of }u\text{ appears infinitely often in }w\}$ . $S_{1}$ is recognised by a $1$ -register URA which, upon reading a data $d$ in $u$ , stores it in its register and checks that it appears infinitely often in $w$ by visiting a state with maximal parity $2$ every time it sees $d$ (all other states have parity $1$ ). Note that for all $k\geq 1$ , $S_{1}\cap\{(u\#v,u\#w)\mid u\in\textsf{DW}_{\!f},v,w\in\textsf{DW}\text{ and }u\text{ has at most }k\text{ distinct data}\}$ is realisable by a $k$ -register transducer: on reading $u$ , store each distinct data in one register, and after the $\#$ output them in turn in a round-robin fashion. However, $S_{1}$ is not realisable: on reading the $\#$ separator, any implementation must have all the data of $u$ in its registers, but the number of such data is not bounded ( $u$ can have pairwise distinct data and be of arbitrary length).
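The round-robin strategy described above can be illustrated as follows. This is only a sketch under stated assumptions: registers are modelled as a Python list, the output label `"a"` and the default data `0` (used when $u$ is empty) are arbitrary choices, and the function names are hypothetical.

```python
from itertools import cycle, islice


def store_distinct(u, k):
    """Store each distinct data of u in its own register (at most k)."""
    registers = []
    for _label, d in u:
        if d not in registers:
            if len(registers) == k:
                raise ValueError("u has more than k distinct data")
            registers.append(d)
    return registers


def outputs_after_separator(u, k, label="a", default=0):
    """Infinite stream of outputs produced once the separator is read."""
    registers = store_distinct(u, k) or [default]
    # Cycling through the registers makes every stored data recur forever.
    return ((label, d) for d in cycle(registers))


u = [("a", 5), ("a", 8), ("a", 5)]
first_six = list(islice(outputs_after_separator(u, k=2), 6))
```

Every data of $u$ occurs in each window of $k$ consecutive outputs, hence infinitely often. The `ValueError` branch mirrors the reason why $S_{1}$ itself is not realisable: no fixed $k$ suffices for all inputs.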

Then, let $A$ be a URA over finite data words. Consider the specification $S=S_{1}\cup S_{2}\cup T$ , where $S_{2}=\{(u\#v,u\#w\#{(a,\textsf{d}_{0})}^{\omega})\mid u\in\textsf{DW}_{\!f},v\in\textsf{DW},w\in L(A)\}$ and $T=\{(u,w)\mid u\notin\textsf{DW}_{\!f}\#\textsf{DW},w\in\textsf{DW}\}$ . $S$ has total domain, and is recognisable by a URA. Indeed, URA are closed under union, by the same product construction as for the intersection of NRA [KF94], and each part is URA-recognisable: $S_{1}$ is, as described above, $S_{2}$ is by simulating $A$ on the output to check $w\in L(A)$ then looping over $(a,\textsf{d}_{0})$ , and $T$ simply checks a regular property.

Now, if $L(A)\neq\varnothing$ , let $w\in L(A)$ and let $D_{w}=\{d_{1},\dots,d_{k}\}$ be the set of data distinct from $\textsf{d}_{0}$ that occur in $w$ . As a consequence of the closure under automorphisms of register automata [KF94, Proposition 2], we have: for any set $D\subseteq\mathcal{D}$ such that $\lvert D\rvert\geq k$ , and for any injection $\pi:D_{w}\cup\{\textsf{d}_{0}\}\rightarrow D\cup\{\textsf{d}_{0}\}$ such that $\pi(\textsf{d}_{0})=\textsf{d}_{0}$ , by extending $\pi$ to a morphism $\widehat{\pi}$ over data words in the usual way (and behaving as the identity over the finite labels), $\widehat{\pi}(w)\in L(A)$ . Indeed, as register automata can only test for equality, acceptance is determined by the equality relations between the different data of the input, so we can rename them (with the exception of $\textsf{d}_{0}$ , which is a distinguished data).

Then, $S$ is realisable by a register transducer $I$ with $k+2$ registers. While it has not read a $\#$ , $I$ reads its input $u$ and outputs it along the way, using one register to store the current data and output it immediately. Meanwhile, it also stores the first $k$ distinct data of $u$ in its registers. Its last register is used to keep $\textsf{d}_{0}$ in memory. If there is no $\#$ in the input, then $I(u)=u$ , so $(u,I(u))\in T$ . Now, if some $\#$ is read, $I$ outputs $\#$ (along with an arbitrary data), and there are two cases: if the number of data in $u$ is lower than or equal to $k$ , $I$ realises $S_{1}$ , as described above. Otherwise, let $D_{u}=\{e_{1},\dots,e_{l}\}$ be the set of data of $u$ distinct from $\textsf{d}_{0}$ , indexed by order of appearance $(l\geq k)$ . Then, let $\pi:D_{w}\cup\{\textsf{d}_{0}\}\rightarrow D_{u}\cup\{\textsf{d}_{0}\}$ be such that for all $1\leq i\leq k,\pi(d_{i})=e_{i}$ and $\pi(\textsf{d}_{0})=\textsf{d}_{0}$ : $\pi$ is injective. Now, $I$ can output $\widehat{\pi}(w)\#{(a,\textsf{d}_{0})}^{\omega}$ since it stored $\{e_{1},\dots,e_{k}\}$ in its registers, hence realising $S_{2}$ . Conversely, if $L(A)=\varnothing$ , then $S$ is not realisable. If it were, $S\cap\textsf{DW}_{\!f}\#\textsf{DW}=S_{1}$ would be too, as a regular domain restriction, but we have seen above that this is not the case. Thus, $S$ is realisable iff $L(A)\neq\varnothing$ .

3.2. A Decidable Subclass: $\textsf{DRA}_{\textsf{ido}}$

However, we show that restricting to DRA allows to recover decidability, modulo one additional assumption, namely that the output data of a transition has to be the content of some register. We formally define this class as follows:

Definition ( $\textsf{DRA}_{\textsf{ido}}$ ).

Let $\mathcal{A}=(\Sigma,\mathcal{D},Q,q_{0},\delta,R,c)$ be a DRA. We say that $\mathcal{A}$ is with input-driven outputs if for any output transition $p\xrightarrow{\sigma,\phi,\textnormal{{asgn}}}q$ , the test $\phi$ is of the form $r^{=}$ for some $r\in R$ . We denote by $\textsf{DRA}_{\textsf{ido}}$ the class of DRA with input-driven outputs.

This assumption rules out pathological and, in our opinion, uninteresting and technical cases stemming from the asymmetry between the class of specifications and implementations. E.g., consider the single-register DRA in Fig. 3(a) (finite labels are arbitrary and not depicted). It starts by reading one input data $d$ and stores it in $r$ , asks that the corresponding output data is different from the content $d$ of $r$ , then accepts any output over any input (transitions $\top$ are always takeable). It is not realisable because transducers necessarily output the content of some register (hence producing a data which already appeared). On the other hand, having tests of the form $\phi=r^{\neq}$ for instance does not imply unrealisability, as shown by the DRA of Fig. 3(b): it starts by reading one data $d_{1}$ , asks to copy it on the output, then reads another data $d_{2}$ , and requires that the output is either distinct from $d_{1}$ or equal to it, depending on whether $d_{2}\neq d_{1}$ . It happens that this specification is realisable by the identity.

We reduce the realisability of $\textsf{DRA}_{\textsf{ido}}$ -specifications to solving a finite parity game. To ease its construction, we first need to confer additional properties to the specification automaton.

A RA $A$ is said to be locally concretisable if for every finite sequence of transitions $\rho=t_{1}\dots t_{n}$ , for every finite data word $w\in\textsf{DW}_{\!f}$ such that $\rho$ is a partial run of $A$ on $w$ , we have that for all transitions $t\in\delta$ which are compatible with $\rho$ (i.e. such that the source state of $t$ is equal to the end state of $\rho$ ), there exists $d\in\mathcal{D}$ such that $\rho t$ is a partial run of $A$ on $wd$ . Note in particular that when $\rho$ is not a partial run, such condition trivially holds.

We say that a RA $A$ is in good form if

(1) it is locally concretisable;

(2) it is complete on its input states;

(3) its tests $\phi$ are maximally consistent conjunctions of atoms;

(4) any transition $t$ whose test is different from $\bigwedge_{r\in R}r^{\neq}$ does not conduct an assignment ( $\textnormal{{asgn}}=\varnothing$ ).

Lemma 4.

For all RA $A$ , there exists an equivalent RA $A^{\prime}$ in good form with exponentially many more states and transitions, and the same number of priorities and registers. Moreover if $A$ is a $\textsf{DRA}_{\textsf{ido}}$ , so is $A^{\prime}$ .

Proof 3.3.

Let $A=(\Sigma,\mathcal{D},Q,q_{0},\delta,R,c)$ be a RA. First, we can assume that $A$ is complete on its input states: add two sink states $s_{\mathbbm{i}}$ and $s_{\mathbbm{o}}$ with transitions $(s_{\mathbbm{i}},\sigma_{\mathbbm{i}},\top,\varnothing,s_{\mathbbm{o}})$ and $(s_{\mathbbm{o}},\sigma_{\mathbbm{o}},r^{=},\varnothing,s_{\mathbbm{i}})$ for all $\sigma_{\mathbbm{i}}\in\Sigma_{\mathbbm{i}},\sigma_{\mathbbm{o}}\in\Sigma_{\mathbbm{o}},r\in R$ , each with odd priority $c(s_{\mathbbm{i}})=c(s_{\mathbbm{o}})=1$ . Then, for every $q_{\mathbbm{i}}\in Q_{\mathbbm{i}}$ and every finite label $\sigma_{\mathbbm{i}}\in\Sigma_{\mathbbm{i}}$ , add a transition $q_{\mathbbm{i}}\xrightarrow{\sigma_{\mathbbm{i}},\psi,\varnothing}s_{\mathbbm{o}}$ where $\psi=\neg\bigvee_{q_{\mathbbm{i}}\xrightarrow{\sigma_{\mathbbm{i}},\phi,\textnormal{{asgn}}}q_{\mathbbm{o}}}\phi$ is a test which is satisfied by a data if and only if this data satisfies no other possible test. This affects neither determinism nor the recognised language (as each added state has odd priority), and preserves the fact of being ido.

Now, we enrich the states with information on the equalities between registers in the current register valuation. Formally, we define constraints as equivalence relations on $R$ (the notion of constraint is pervasive in the study of register automata, e.g. to recognise the projection over finite labels). In the following, we denote by $\textsf{ER}(R)$ the set of equivalence relations on $R$ . Given a valuation $\tau$ of registers in $R$ , we can associate to it an equivalence relation on $R$ in the natural way (two registers $r,r^{\prime}\in R$ are equivalent iff $\tau(r)=\tau(r^{\prime})$ ). We denote it by $[\tau]$ . We use the letter $C$ to denote an element of $\textsf{ER}(R)$ , and we call it a constraint.

We let $A^{\prime}=(\Sigma,\mathcal{D},Q^{\prime},q^{\prime}_{0},\delta^{\prime},R,c^{\prime})$ be defined as follows:

•

$Q^{\prime}=Q\times\textsf{ER}(R)$

•

$q^{\prime}_{0}=\left(q_{0},[\tau_{0}^{R}]\right)$

•

$c^{\prime}(q,C)=c(q)$ , for every $(q,C)\in Q\times\textsf{ER}(R)$

•

$\delta^{\prime}$ will be defined in the sequel.

Given a constraint $C$ , and a set $E\subseteq R$ corresponding to an equivalence class of $C$ , we define a test corresponding to a maximally consistent conjunction of equalities and inequalities: $\alpha_{E}=\bigwedge_{r\in E}r^{=}\wedge\bigwedge_{r\not\in E}r^{\neq}$ . A data value satisfies this test iff it is equal to the (common) value stored in the registers of $E$ . We also consider the test $\alpha_{\varnothing}=\bigwedge_{r\in R}r^{\neq}$ which corresponds to the case of a fresh data value, i.e. a data value distinct from all the values stored in registers.

Consider a transition $(p,\sigma,\phi,\textnormal{{asgn}},q)\in\delta$ . Given a formula $\alpha_{E}$ as defined above, one can decide whether the formula $\alpha_{E}\Rightarrow\phi$ is valid or not. If this is the case, then we add the following transition to $\delta^{\prime}$ :

$$(p,C)\xrightarrow{\sigma,\alpha_{E},\textnormal{{asgn}}}(q,C^{\prime})$$

where $C^{\prime}$ is defined as follows: two registers $r,r^{\prime}$ are in relation with respect to $C^{\prime}$ if and only if one of the following cases holds:

•

they are in relation in $C$ , and not in asgn

•

they are both in asgn

•

$r$ belongs to $E$ and $r^{\prime}$ belongs to asgn, or vice versa.

First, observe that since $A$ is complete on its input states, so is $A^{\prime}$ and property (2) holds. Moreover, by definition, $A^{\prime}$ satisfies property (3).

Now, one can show by induction on the length $n$ of the partial run that every partial run $\rho=t_{1}\dots t_{n}$ of $A^{\prime}$ over some finite data word $w\in\textsf{DW}_{\!f}$ reaching some configuration $((p,C),\tau)$ satisfies $C=[\tau]$ . Thus, for every run of $A^{\prime}$ , by denoting ${\{((q_{i},C_{i}),\tau_{i})\}}_{i\in\mathbb{N}}$ its sequence of configurations, we have $C_{i}=[\tau_{i}]$ .
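The invariant $C=[\tau]$ can be checked mechanically on small instances. The sketch below is illustrative only: constraints are represented as sets of equivalence classes, `next_constraint` transcribes the three cases defining $C^{\prime}$ , and all function names are hypothetical. The exhaustive check is a sanity test on a few registers, not a proof.

```python
from itertools import product


def constraint_of(tau):
    """[tau]: registers grouped by stored value."""
    classes = {}
    for r, value in tau.items():
        classes.setdefault(value, set()).add(r)
    return frozenset(frozenset(c) for c in classes.values())


def same_class(C, r, s):
    return any(r in c and s in c for c in C)


def next_constraint(C, E, asgn):
    """C' obtained from C after reading a data matching class E, assigning asgn."""
    def related(r, s):
        both_kept = r not in asgn and s not in asgn
        return (
            r == s
            or (both_kept and same_class(C, r, s))  # in relation in C, unassigned
            or (r in asgn and s in asgn)            # both assigned
            or (r in E and s in asgn)               # one in E, other assigned
            or (s in E and r in asgn)
        )

    classes = []
    for r in sorted(set().union(*C)):
        for c in classes:
            if related(r, next(iter(c))):
                c.add(r)
                break
        else:
            classes.append({r})
    return frozenset(frozenset(c) for c in classes)


def check_invariant(registers, values):
    """Check C' = [next(tau, d, asgn)] for all small valuations."""
    fresh = max(values) + 1
    for stored in product(values, repeat=len(registers)):
        tau = dict(zip(registers, stored))
        for d in list(values) + [fresh]:
            E = {r for r in registers if tau[r] == d}
            for bits in product((False, True), repeat=len(registers)):
                asgn = {r for r, b in zip(registers, bits) if b}
                tau_next = {r: (d if r in asgn else tau[r]) for r in registers}
                expected = constraint_of(tau_next)
                if next_constraint(constraint_of(tau), E, asgn) != expected:
                    return False
    return True
```

Here `E` is computed from the concrete valuation, which matches the choice $E_{i}=\{r\in R\mid\tau_{i}(r)=d_{i}\}$ used when lifting runs of $A$ to runs of $A^{\prime}$ .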

Additionally, for each run of $A$ , we can build a run of $A^{\prime}$ in a deterministic manner: let $\rho=t_{1}t_{2}\dots$ be a run of $A$ over some data word $w=(\sigma_{1},d_{1})(\sigma_{2},d_{2})\dots$ , where for all $i\in\mathbb{N}$ , $t_{i+1}=q_{i}\xrightarrow{\sigma_{i},\phi_{i},\textnormal{{asgn}}_{i}}q_{i+1}$ and let ${\{(q_{i},\tau_{i})\}}_{i\in\mathbb{N}}$ be its sequence of configurations. Correspondingly, let $\rho^{\prime}=t^{\prime}_{1}t^{\prime}_{2}\dots$ , where for each $i\in\mathbb{N}$ $t^{\prime}_{i+1}=(q_{i},C_{i})\xrightarrow[A^{\prime}]{\sigma_{i},\alpha_{E_{i}},\textnormal{{asgn}}_{i}}(q_{i+1},C_{i+1})$ , with $C_{i}=[\tau_{i}]$ and $E_{i}=\{r\in R\mid\tau_{i}(r)=d_{i}\}$ . Then, again by induction, we can show that $\rho^{\prime}$ is a run of $A^{\prime}$ over $w$ , whose sequence of configurations is ${\{((q_{i},C_{i}),\tau_{i})\}}_{i\in\mathbb{N}}$ . Moreover, $\rho^{\prime}$ is accepting if and only if $\rho$ is accepting, since $c^{\prime}(q_{i},C_{i})=c(q_{i})$ . Reciprocally, every run $\rho^{\prime}$ of $A^{\prime}$ can be projected to a run of $A$ by removing the $C_{i}$ , and this preserves acceptance. Overall, $L(A)=L(A^{\prime})$ .

Now, let $\rho=t_{1}\dots t_{n}$ be a partial run of $A^{\prime}$ over some finite data word $w\in\textsf{DW}_{\!f}$ ending in some configuration $((q,C),\tau)$ ; recall that $C=[\tau]$ . Let $t=q\xrightarrow{\sigma,\alpha_{E},\textnormal{{asgn}}}q^{\prime}$ be a transition compatible with $\rho$ , i.e. such that $q$ is the end state of $\rho$ . If $E=\varnothing$ , then $\alpha_{E}=\bigwedge_{r\in R}r^{\neq}$ , so any $d\in\mathcal{D}\backslash\tau(R)$ (where $\tau(R)$ denotes the image of $R$ by $\tau$ ) is such that $\tau,d\models\alpha_{E}$ . If $E\neq\varnothing$ , then by construction $E$ corresponds to an equivalence class of $C$ , so $\forall r,r^{\prime}\in E,\tau(r)=\tau(r^{\prime})$ and $\forall r\in E,\forall r^{\prime}\notin E,\tau(r)\neq\tau(r^{\prime})$ . Thus, by letting $d=\tau(r)$ for some $r\in E$ (its choice does not matter), we have that $\rho t$ is a partial run of $A^{\prime}$ over $wd$ . Overall, $A^{\prime}$ is locally concretisable, i.e. property (1) holds.

The last step concerns property (4). Intuitively, the idea is that if the data read corresponds to a data stored in some register, then the assignment can be replaced by keeping in memory a relation between registers. This idea is merely an adaptation of the conversion from register automata (“ $M$ -automata”, in their terminology) to finite-memory automata [KF94]. The states can be enriched with the right information to deal with these additional relations.

In order to solve the unbounded register synthesis problem, we resort to a synthesis problem for data-free specifications. In that framework, when specifications are described by means of parity automata, synthesis problems can be solved using reductions to parity games. We thus quickly recall the notion of parity game. For a complete presentation, we refer the reader to [AG11].

A two-player parity game is given as a finite graph, in which vertices are partitioned among the two players, together with an initial vertex. A colouring function associates with each vertex an integer. It is used to define the winning plays as follows: a play is winning iff the maximum colour appearing infinitely often is even.

In the sequel, we will use the parity game associated with a DRA $A$ , which is denoted as $G_{A}$ . It is defined as follows: its set of vertices is exactly the set of states of $A$ . Player Adam owns input vertices, and the associated input transitions, while player Eve owns output vertices/transitions. The colouring function is that of $A$ , and the initial vertex is the initial state of $A$ .
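For intuition on how such a game can be solved, here is a sketch of the classical recursive (Zielonka-style) algorithm for max-parity games. It is a standard technique swapped in for illustration, not a procedure taken from this paper, and it does not achieve the best known complexity. The encoding is an assumption: `owner[v]` is `0` for Eve and `1` for Adam, `edges[v]` lists the successors of `v`, and every vertex is assumed to have at least one successor.

```python
def attractor(vertices, edges, owner, player, target):
    """Vertices of the subgame from which `player` can force reaching `target`."""
    attr = set(target)
    changed = True
    while changed:
        changed = False
        for v in vertices - attr:
            succ = [u for u in edges[v] if u in vertices]
            if owner[v] == player:
                ok = any(u in attr for u in succ)
            else:
                ok = bool(succ) and all(u in attr for u in succ)
            if ok:
                attr.add(v)
                changed = True
    return attr


def zielonka(vertices, edges, owner, colour):
    """Return (W_eve, W_adam); Eve wins iff the max colour seen infinitely often is even."""
    if not vertices:
        return set(), set()
    top = max(colour[v] for v in vertices)
    player = top % 2  # 0 = Eve, 1 = Adam
    target = {v for v in vertices if colour[v] == top}
    a = attractor(vertices, edges, owner, player, target)
    sub = zielonka(vertices - a, edges, owner, colour)
    result = [set(), set()]
    if not sub[1 - player]:
        result[player] = set(vertices)
        return tuple(result)
    b = attractor(vertices, edges, owner, 1 - player, sub[1 - player])
    sub2 = zielonka(vertices - b, edges, owner, colour)
    result[player] = sub2[player]
    result[1 - player] = sub2[1 - player] | b
    return tuple(result)


# Hypothetical four-state game: Adam owns "a", "good", "bad"; Eve owns "e".
edges = {"a": ["e"], "e": ["good", "bad"], "good": ["a"], "bad": ["bad"]}
owner = {"a": 1, "e": 0, "good": 1, "bad": 1}
colour = {"a": 1, "e": 1, "good": 2, "bad": 1}
w_eve, w_adam = zielonka(set(edges), edges, owner, colour)
```

In this toy game Eve wins from the initial vertex `"a"` by always moving from `"e"` to `"good"`, so that colour $2$ is seen infinitely often.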

Proposition 5.

Let $A$ be a $\textsf{DRA}_{\textsf{ido}}$ in good form. Then, the following are equivalent:

(1) $L(A)$ is realisable by a register transducer with as many registers as $A$ ;

(2) $L(A)$ is realisable by an implementation $I:{(\Sigma_{\mathbbm{i}}\times\mathcal{D})}^{*}\rightarrow\Sigma_{\mathbbm{o}}\times\mathcal{D}$ (recall that implementations are defined in Subsection 2.1);

(3) Eve wins the parity game $G_{A}$ associated with $A$ .

Proof 3.4.

We start with a preliminary remark on $\textsf{DRA}_{\textsf{ido}}$ . As $A$ is a $\textsf{DRA}_{\textsf{ido}}$ , every output transition has a test with at least one equality constraint ( $r^{=}$ for some $r$ ), and thus, as $A$ is in good form (property $(4)$ ), the assignments of output transitions are all empty. Note that 1 $\Rightarrow$ 2 is immediate.

From the parity game $G_{A}$ to the realisability of $L(A)$ : 3 $\Rightarrow$ 1

Assume Eve wins the game $G_{A}$ . Parity games admit memoryless strategies, i.e. strategies whose actions only depend on the current state of the game. We can thus consider a memoryless winning strategy for Eve, which we denote by a mapping $\chi$ from output vertices to output edges of the game, i.e. from output states to output transitions of $A$ .

We now detail how we define from $\chi$ a register transducer $T_{\chi}$ with $R^{A}$ as set of registers:

•

States are those of $A$

•

The initial state is that of $A$

•

Transitions are defined as follows. Consider some input state $p$ and some transition $t_{\mathbbm{i}}$ from $p$ to $q$ . By definition of $A$ , $q$ is an output state, and we let $t_{\mathbbm{o}}=\chi(q)$ be the transition given by Eve’s strategy.

We write $t_{\mathbbm{i}}=(p,\sigma,\phi,\textnormal{{asgn}},q)$ and $t_{\mathbbm{o}}=(q,\sigma^{\prime},\phi^{\prime},\textnormal{{asgn}}^{\prime},q^{\prime})$ . Thanks to our initial comment on the form of output transitions of $\textsf{DRA}_{\textsf{ido}}$ in good form, there exists a register $r$ appearing with an equality constraint in the test $\phi^{\prime}$ of the transition $t_{\mathbbm{o}}$ , and we have $\textnormal{{asgn}}^{\prime}=\varnothing$ . Then, we add to $T_{\chi}$ the transition $p\xrightarrow{\sigma,\phi\mid\textnormal{{asgn}},\sigma^{\prime},r}q^{\prime}$ .

Observe that $T_{\chi}$ is indeed a register transducer: for each state $p$ , it only uses transitions outgoing from $p$ in $A$ , hence it is deterministic since $A$ is.
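The construction of $T_{\chi}$ amounts to gluing each input transition with the output transition selected by the strategy. A minimal sketch, assuming transitions are tuples `(source, label, test, asgn, target)` and that tests in good form are represented by the set of registers tested for equality; the function name and the toy data are hypothetical.

```python
def build_transducer(input_transitions, chi):
    """Combine each input transition with the output transition chosen by chi."""
    delta = {}
    for (p, sigma, phi, asgn, q) in input_transitions:
        (_q, sigma_out, phi_out, asgn_out, q_next) = chi[q]
        # Output transitions of an ido automaton in good form never assign.
        assert not asgn_out
        # phi_out is non-empty: pick any register tested for equality.
        r = min(phi_out)
        delta[(p, sigma, phi)] = (asgn, sigma_out, r, q_next)
    return delta


# Toy instance: store a fresh input data in r, then output the content of r.
fresh = frozenset()
input_transitions = [("p", "req", fresh, frozenset({"r"}), "q")]
chi = {"q": ("q", "grt", frozenset({"r"}), frozenset(), "p")}
delta_chi = build_transducer(input_transitions, chi)
```

Any register of the selected test can be chosen as output register, since all registers tested for equality hold the same data when the transition is taken.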

We claim that $T_{\chi}$ realises $L(A)$ . Consider some input data word, and the behaviour of $T_{\chi}$ on this data word. As $A$ is in good form, it is complete on its input states. This entails that this run is infinite. It corresponds to a play in $G_{A}$ compatible with Eve’s strategy $\chi$ . As $\chi$ is a winning strategy, this implies that the run is accepting, hence corresponds to some accepting run of $A$ , yielding the result.

From the realisability of $L(A)$ to the parity game $G_{A}$ : 2 $\Rightarrow$ 3

Assume that $L(A)$ is realisable by an implementation $I:{(\Sigma_{\mathbbm{i}}\times\mathcal{D})}^{*}\rightarrow\Sigma_{\mathbbm{o}}\times\mathcal{D}$ . We let $f_{I}:\textsf{DW}(\Sigma_{\mathbbm{i}},\mathcal{D})\rightarrow\textsf{DW}(\Sigma_{\mathbbm{o}},\mathcal{D})$ be the function it implements, and naturally extend it to finite words: for $w_{\mathbbm{i}}\in\textsf{DW}_{\!f}(\Sigma_{\mathbbm{i}},\mathcal{D}),f_{I}(w_{\mathbbm{i}})=I(w_{\mathbbm{i}}[1])I(w_{\mathbbm{i}}[1:2])\dots I(w_{\mathbbm{i}}[1:\lvert w_{\mathbbm{i}}\rvert])$ . Let us build from $I$ a winning strategy $\chi_{I}$ in $G_{A}$ , with memory ${(\Sigma_{\mathbbm{i}}\times\mathcal{D})}^{*}\times(Q_{A}\times\mathcal{D}^{R_{A}})$ .

We define $\chi_{I}$ by induction, and show that when $\chi_{I}$ is in memory state $(w_{\mathbbm{i}},(q,\tau))$ , the finite sequence of transitions constructed so far is a partial run of $A$ over $\langle w_{\mathbbm{i}},f_{I}(w_{\mathbbm{i}})\rangle$ ending in configuration $(q,\tau)$ . Initially, $\chi_{I}$ has memory $(\varepsilon,(q_{0},\tau_{0}))$ .

Now, assume $\chi_{I}$ is in state $(w_{\mathbbm{i}},(q,\tau))$ , and Adam just played $(\sigma_{\mathbbm{i}},\phi,\textnormal{{asgn}})$ . Then, Eve picks some data $d_{\mathbbm{i}}\in\mathcal{D}$ such that $\tau,d_{\mathbbm{i}}\models\phi$ . Such a data exists since $A$ is locally concretisable and the finite sequence of transitions constructed so far is a partial run over some data word. Let $(q^{\prime\prime},\tau^{\prime\prime})$ be the successor configuration of $(q,\tau)$ in $A$ on reading $d_{\mathbbm{i}}$ , i.e. $(q,\tau)\xrightarrow[A]{\sigma_{\mathbbm{i}},d_{\mathbbm{i}}}(q^{\prime\prime},\tau^{\prime\prime})$ , and let $w^{\prime}_{\mathbbm{i}}=w_{\mathbbm{i}}(\sigma_{\mathbbm{i}},d_{\mathbbm{i}})$ . Now, let $(\sigma_{\mathbbm{o}},d_{\mathbbm{o}})=I(w^{\prime}_{\mathbbm{i}})$ . Correspondingly, let $t_{\mathbbm{o}}$ be the transition taken from $(q^{\prime\prime},\tau^{\prime\prime})$ on reading $(\sigma_{\mathbbm{o}},d_{\mathbbm{o}})$ , i.e. such that $(q^{\prime\prime},\tau^{\prime\prime})\xrightarrow[t_{\mathbbm{o}}]{\sigma_{\mathbbm{o}},d_{\mathbbm{o}}}(q^{\prime},\tau^{\prime})$ . Such a transition exists: let $w\in\textsf{DW}(\Sigma_{\mathbbm{i}},\mathcal{D})$ be some infinite suffix that we append to $w^{\prime}_{\mathbbm{i}}$ . Since $I$ is an implementation, $f_{I}$ is total and we know that $\langle w^{\prime}_{\mathbbm{i}}w,f_{I}(w^{\prime}_{\mathbbm{i}}w)\rangle\in L(A)$ , which means that $\langle w^{\prime}_{\mathbbm{i}}w,f_{I}(w^{\prime}_{\mathbbm{i}}w)\rangle$ admits an accepting run in $A$ . In particular, its prefix $\langle w^{\prime}_{\mathbbm{i}},f_{I}(w^{\prime}_{\mathbbm{i}})\rangle$ admits a partial run in $A$ , and its last transition is $t_{\mathbbm{o}}$ (this partial run is unique since $A$ is deterministic).

Then, Eve plays $t_{\mathbbm{o}}$ in $G_{A}$ and updates her memory to $\left(w^{\prime}_{\mathbbm{i}},(q^{\prime},\tau^{\prime})\right)$ . The invariant indeed holds, as the play constructed so far is a partial run of $A$ over $\langle w^{\prime}_{\mathbbm{i}},f_{I}(w^{\prime}_{\mathbbm{i}})\rangle$ ending in configuration $(q^{\prime},\tau^{\prime})$ .

$\chi_{I}$ is indeed a strategy, as it is defined for any possible sequence of actions of Adam. It remains to show that it is winning. Let $\rho$ be a play consistent with $\chi_{I}$ , which is also a run of $A$ by definition of $G_{A}$ . We need to show that $\rho$ is accepting. We define $w\in{(\Sigma_{\mathbbm{i}}\times\mathcal{D})}^{\omega}$ as $w[i]=w_{\mathbbm{i}}^{i}[i]$ , where $w_{\mathbbm{i}}^{i}$ is the input word stored in memory at step $i$ of the play (i.e. such that $\chi_{I}$ is in state $(w_{\mathbbm{i}}^{i},(q_{i},\tau_{i}))$ for some $(q_{i},\tau_{i})$ after receiving $i$ actions of Adam). We then know that for all $i\in\mathbb{N}$ , $\rho[:i]$ is a partial run of $A$ over $\langle w[:i],f_{I}(w[:i])\rangle$ , so $\rho$ is a run of $A$ over $\langle w,f_{I}(w)\rangle$ . Since $I$ is an implementation, this run is accepting, i.e. satisfies the parity condition, which means that $\rho$ also satisfies the parity condition; it is thus winning. As a consequence, $\chi_{I}$ is a winning strategy in $G_{A}$ .

Theorem 6.

$(\textsf{DRA}_{\textsf{ido}},\textnormal{{RT}})$ is ExpTime-complete.

Proof 3.5.

First, we put $A$ in good form thanks to Lemma 4, resulting in some $\textsf{DRA}_{\textsf{ido}}$ $B$ exponentially bigger. Then, by Proposition 5, it suffices to solve the parity game $G_{B}$ . It is well-known to be possible in time $O(n^{d})$ where $n$ is the number of states and $d$ the number of priorities. If $n_{A}$ denotes the number of states of $A$ and $d$ its number of priorities, then $B$ has $n_{A}\cdot 2^{|R|^{2}}$ states and the same number of priorities $d$ , hence checking the realisability of $A$ can be done in time $O(n_{A}^{d}\cdot 2^{d\cdot|R|^{2}})$ , which is exponential with respect to the size of the input.
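To get a feeling for the size of $\textsf{ER}(R)$ , the following sketch compares the exact number of equivalence relations on a set of $k$ registers (the Bell number $B_{k}$ , a standard combinatorial fact not stated in the text) with the coarser bound $2^{k^{2}}$ , which counts all binary relations on $R$ . It only concerns the product $Q\times\textsf{ER}(R)$ ; the additional enrichment needed for property (4) is not accounted for, and the function names are hypothetical.

```python
def bell(n):
    """Number of equivalence relations on an n-element set (Bell triangle)."""
    row = [1]
    for _ in range(n):
        new_row = [row[-1]]
        for x in row:
            new_row.append(new_row[-1] + x)
        row = new_row
    return row[0]


def product_states(n_a, k):
    """Exact size of Q x ER(R) for n_a states and k registers."""
    return n_a * bell(k)


def bound_states(n_a, k):
    """Upper bound n_A * 2^(|R|^2) used in the complexity analysis."""
    return n_a * 2 ** (k * k)


sizes = [(k, product_states(10, k), bound_states(10, k)) for k in range(1, 5)]
```

For instance, with $10$ states and $3$ registers the product has $50$ states while the bound gives $5120$ ; both are exponential in $|R|$ in the worst case, which is what matters for the ExpTime upper bound.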

Hardness

The following proof is an adaptation of the one establishing PSpace-hardness of the nonemptiness problem for DRA presented in [DL09, Theorem 5.1]. Here, we use the input part to simulate universal transitions, and the output part to simulate nondeterministic ones, hence simulating alternation, which yields an ExpTime lower bound.

Thus, we reduce from the halting problem of alternating Turing machines over a binary alphabet with a linearly bounded tape. An alternating Turing machine is a tuple $\mathcal{M}=\langle Q,q_{i},\delta\rangle$ , where:

•

$Q$ is a finite set of states, partitioned into existential ( $Q_{\exists}$ ) and universal ( $Q_{\forall}$ ) states: $Q=Q_{\exists}\uplus Q_{\forall}$ , where $q_{i}\in Q_{\forall}$ is the initial state

•

$\delta:Q\times\{0,1\}\rightarrow 2^{Q\times\{0,1\}\times\{-1,1\}}$ is the transition function.

A configuration of $\mathcal{M}$ is then a triple $c=(q,i,w)$ , where $q\in Q$ is the machine state, $i\in\{0,\dots,\lvert\mathcal{M}\rvert-1\}$ is the head position, and $w\in{\{0,1\}}^{\lvert\mathcal{M}\rvert}$ is the tape content. It is existential if $q\in Q_{\exists}$ and universal if $q\in Q_{\forall}$ . A configuration $(q^{\prime},i^{\prime},w^{\prime})$ is a successor of $(q,i,w)$ if there exists $(p,a,m)\in\delta(q,w[i])$ such that $p=q^{\prime}$ , $i^{\prime}=i+m\in\{0,\dots,\lvert\mathcal{M}\rvert-1\}$ and $w^{\prime}$ is such that $\forall j\neq i$ , $w^{\prime}[j]=w[j]$ and $w^{\prime}[i]=a$ . $t=q\xrightarrow{w[i],a,m}p$ is called the associated transition. A run of $\mathcal{M}$ is then a tree whose nodes are configurations and whose branches can be finite or infinite, rooted in the initial configuration $(q_{i},0,0^{\lvert\mathcal{M}\rvert})$ , and whose nodes satisfy the following properties:

(1) If the node is an existential configuration $c_{\exists}$ , then it has exactly one child, which is a successor configuration of $c_{\exists}$ .

(2) If the node is a universal configuration $c_{\forall}$ , then its children are all its successor configurations.

Note that a branch is finite if and only if it ends in a universal configuration with no successor. The machine $\mathcal{M}$ halts if it admits a run which is a finite tree (i.e. whose branches all end in a universal configuration with no successors). The following problem is ExpTime-hard [CKS81]: given an alternating Turing machine $\mathcal{M}$ , decide whether $\mathcal{M}$ halts.
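The halting condition can be phrased as a least fixed point over configurations, which the following brute-force sketch computes for tiny machines. It is only meant to illustrate the definitions: the parameter `tape_length` plays the role of $\lvert\mathcal{M}\rvert$ , the dictionary encoding of $\delta$ and the function names are assumptions, and the enumeration is exponential in the tape length.

```python
from itertools import product


def successors(delta, tape_length, config):
    """Successor configurations, following the definition in the text."""
    q, i, w = config
    result = []
    for (p, a, m) in delta.get((q, w[i]), ()):
        j = i + m
        if 0 <= j < tape_length:
            # Write a at the old head position i, then move the head to j.
            result.append((p, j, w[:i] + (a,) + w[i + 1:]))
    return result


def halts(q_exists, q_forall, delta, q_init, tape_length):
    """True iff the machine admits a run which is a finite tree."""
    configs = [(q, i, w)
               for q in q_exists | q_forall
               for i in range(tape_length)
               for w in product((0, 1), repeat=tape_length)]
    halting = set()
    changed = True
    while changed:  # least fixed point computation
        changed = False
        for c in configs:
            if c in halting:
                continue
            succ = successors(delta, tape_length, c)
            if c[0] in q_forall:
                ok = all(s in halting for s in succ)  # includes "no successor"
            else:
                ok = any(s in halting for s in succ)
            if ok:
                halting.add(c)
                changed = True
    return (q_init, 0, (0,) * tape_length) in halting


# Hypothetical machines over a tape of length 2.
delta_halting = {("u", 0): [("e", 1, 1)], ("e", 0): [("h", 0, -1)]}
delta_looping = {("u", 0): [("u", 0, 1), ("u", 0, -1)]}
```

A universal configuration without successors is halting, an existential one without successors is not, and configurations lying on a cycle that cannot be escaped are never added to the fixed point.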

Finally, a computation is a finite sequence of successive configurations (i.e. a finite path in a run). Let $(q_{0},i_{0},w_{0})\dots$ $(q_{n},i_{n},w_{n})$ be a computation of $\mathcal{M}$ , and $t_{0}\dots t_{n-1}$ the sequence of associated transitions. We encode such computation by the following data word over the alphabet $Q\uplus\delta\uplus\{-\}$ :

[TABLE]

where $d_{0}\neq d_{1}\in\mathcal{D}$ are two distinct data respectively encoding letters $0$ and $1$ , and we have $\textsf{lab}(a_{l}^{k})=q_{k}$ if $l=i_{k}$ and $\textsf{lab}(a_{l}^{k})=-$ otherwise. Then, $\textsf{dt}(a_{l}^{k})=d_{0}$ if $w_{k}[l]=0$ and $\textsf{dt}(a_{l}^{k})=d_{1}$ if $w_{k}[l]=1$ . $\textsf{dt}(t_{k})$ does not matter.

Now, as in [DL09], we can construct a DRA $A_{\mathcal{M}}$ which accepts a data word iff it has a prefix that encodes a computation of $\mathcal{M}$ from the initial state to a state with no successor. Indeed, the transitions are part of the input, so they do not have to be guessed: neither nondeterministic nor universal branching is needed here (they will respectively be simulated by the output and input player). For completeness, we describe the construction: $A_{\mathcal{M}}$ has memory $Q$ , along with an $\lvert\mathcal{M}\rvert$ -bounded counter $l$ to keep track of the position of the reading head in $w_{k}$ , a variable $i$ taking its values in $\{0,\dots,\lvert\mathcal{M}\rvert-1\}$ used to store the value of $i_{k}$ and a variable $t$ taking its values in $\delta$ to memorise $t_{k}$ ; which overall yields a $O(\lvert\mathcal{M}\rvert^{4})$ memory. Its finite alphabet is $Q\uplus\delta\uplus\{-\}$ , and it has $\lvert\mathcal{M}\rvert+2$ registers: $r_{0}$ and $r_{1}$ respectively store $d_{0}$ and $d_{1}$ , and, for all $0\leq l<\lvert\mathcal{M}\rvert$ , $r^{\prime}_{l}$ successively stores the different values of $w_{k}[l]$ for $0\leq k\leq n$ . Then, a run of $A_{\mathcal{M}}$ is as follows: initially, $A_{\mathcal{M}}$ stores $d_{0}$ and $d_{1}$ , while checking that they are distinct. Then, it checks that $w_{0}=0^{\lvert\mathcal{M}\rvert}$ . To check successorship, while maintaining the invariant that at any step $k$ , $r^{\prime}_{l}$ contains $w_{k}[l]$ , the automaton, when reading $t_{k}=q\xrightarrow{c,a,m}p$ , checks that $q=q_{k}$ (it was stored as the target of $t_{k-1}$ ), $c=w_{k}[i_{k}]$ (i.e. that $r^{\prime}_{i_{k}}$ contains $d_{c}$ ), and updates the value of $i_{k}$ to $i_{k+1}=i_{k}+m$ , while checking that $i_{k+1}\in\{0,\dots,\lvert\mathcal{M}\rvert-1\}$ . Then, with the help of its registers and its counter $l$ , it checks that $w_{k+1}[l]=w_{k}[l]$ for all $l\neq i_{k}$ , and that $w_{k+1}[i_{k}]=a$ (i.e. that the corresponding data is $d_{a}$ ).

From this automaton, by adding $\#$ s to enforce the alternation between input and output, we can build a specification automaton such that the input player provides the encoding of the successive configurations, and resolves the universal branching, and the output player has to resolve nondeterminism (i.e. chooses which nondeterministic transition to take). Then, if the input player can force the computation to go on ad infinitum, he wins, otherwise (if either the provided encoding is not correct, or if the computation is finite), the output player wins. Formally:

[TABLE]

The data corresponding to the $\#$ and $t_{i}$ do not matter, and are not depicted. Note that the even (i.e. universal) transitions are picked by the input player, while the odd (i.e. nondeterministic) transitions are picked by the output player.

Now, if $\mathcal{M}$ halts, $A$ admits an implementation, which behaves as follows: it first checks that the $d_{0}$ and $d_{1}$ given as input are indeed distinct. Then, it checks on-the-fly that the given input is indeed an encoding of the initial configuration, while outputting $\#$ s. It then checks that $c_{1}$ is indeed a successor of $c_{0}$ following $t_{0}$ , again while outputting $\#$ s. Then, if it receives a $\#$ as input, it picks some $t_{1}$ which is a witness that $c_{0}$ is indeed accepting, and so on. If, at some point, the given input is not a valid encoding, then it behaves arbitrarily (e.g. by outputting only $\#$ s).

Conversely, if $\mathcal{M}$ does not halt, the input player can choose an input whose universal transitions are witnesses that $c_{0}$ is not accepting. Then either the implementation provides some non-admissible output at some point, or the computation goes on ad infinitum, which breaks the specification.

For readers familiar with game-theoretic formulations, winning strategies in the synthesis game of $A_{\mathcal{M}}$ are in one-to-one correspondence with halting runs of $\mathcal{M}$.

As a consequence of the fact that if a $\textsf{DRA}_{\textsf{ido}}$ is realisable, then it is so by a register transducer with the same number of registers, we obtain the following corollary:

Corollary 7.

Let $k\geq r$ be two integers. We denote by $\textsf{DRA}_{\textsf{ido}}[r]$ the class of $\textsf{DRA}_{\textsf{ido}}$ with $r$ registers. $(\textsf{DRA}_{\textsf{ido}}[r],\textnormal{{RT}}[k])$ is in ExpTime.

4. Bounded Synthesis: A Generic Approach

In this section, we study the setting where target implementations are register transducers in the class $\textnormal{{RT}}[k]$ , for some $k\geq 0$ that we now fix for the whole section. For the complexity analysis, we assume $k$ is given as input, in unary. Indeed, describing a $k$ -register automaton in general requires $O(k)$ bits, and not $O(\log k)$ bits. We prove the decidable cases of the first line of Table 1 (page 1), by reducing the problems to realisability problems for data-free specifications.

4.1. Abstract Actions

We let $R_{k}=\{1,\dots,k\}$ be a set of $k$ registers. Our aim is to reduce the problem to a finite alphabet problem. First, since the set of test formulas over $R_{k}$ is infinite and there are doubly exponentially many non-equivalent formulas over $R_{k}$ , we rather synthesise transducers whose tests are maximally consistent conjunctions of atoms of the form $r^{=}$ or $r^{\neq}$ . Such conjunctions can be identified as subsets of $R_{k}$ in a natural way, e.g. for $k=3$ , the test $r_{1}^{=}\wedge r_{2}^{\neq}\wedge r_{3}^{=}$ is identified with the set $\{1,3\}$ . We call them explicit tests and denote them by the capital letter $E$ . An explicit test $E\subseteq R_{k}$ is converted into the (implicit) test $\phi_{E}=\bigwedge_{r\in E}r^{=}\wedge\bigwedge_{r\not\in E}r^{\neq}$ . Explicit tests are for instance used in [Seg06].
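As a minimal sketch of the identification between explicit tests and subsets of $R_{k}$, the following Python snippet enumerates the $2^{k}$ explicit tests and evaluates $\phi_{E}$ on a register configuration. The function names and the dictionary encoding of configurations are illustrative assumptions, not notation from the paper.

```python
from itertools import combinations

def explicit_tests(k):
    """All explicit tests over R_k = {1, ..., k}, encoded as frozensets."""
    regs = range(1, k + 1)
    return [frozenset(c) for n in range(k + 1) for c in combinations(regs, n)]

def satisfies(tau, d, E):
    """tau, d |= phi_E: d equals the content of exactly the registers in E."""
    return all((tau[r] == d) == (r in E) for r in tau)

def explicit_test_of(tau, d):
    """The unique explicit test satisfied by the pair (tau, d)."""
    return frozenset(r for r in tau if tau[r] == d)

# Hypothetical configuration with k = 3 registers.
tau = {1: "a", 2: "b", 3: "a"}
print(sorted(explicit_test_of(tau, "a")))  # registers whose content equals "a"
print(len(explicit_tests(3)))              # number of explicit tests for k = 3
```

Each pair of a configuration and a data value satisfies exactly one explicit test, which is what makes the induced input alphabet finite and the tests mutually exclusive.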

We let $\textnormal{{Tst}}_{k}=\textnormal{{Asgn}}_{k}=2^{R_{k}}$ . The finite input actions are $A_{\mathbbm{i}}^{k}=\Sigma_{\mathbbm{i}}\times\textnormal{{Tst}}_{k}$ which corresponds to picking a label and a test over the $k$ registers, and the output actions are $A_{\mathbbm{o}}^{k}=\Sigma_{\mathbbm{o}}\times\textnormal{{Asgn}}_{k}\times R_{k}$ , corresponding to picking some output symbol, some assignment and some register whose content is to be output.

An alternating sequence of actions $\overline{a}=(\sigma_{\mathbbm{i}}^{1},E_{1})(\sigma_{\mathbbm{o}}^{1},\textnormal{{asgn}}_{1},r_{1})\dots\in{(A_{\mathbbm{i}}^{k}A_{\mathbbm{o}}^{k})}^{\omega}$ abstracts a set of relational data words of the form $w=(\sigma_{\mathbbm{i}}^{1},d_{\mathbbm{i}}^{1})(\sigma_{\mathbbm{o}}^{1},d_{\mathbbm{o}}^{1})\dots\in\textsf{RW}(\Sigma_{\mathbbm{i}},\Sigma_{\mathbbm{o}},\mathcal{D})$ via a compatibility relation that we now define. We say that $w$ is compatible with $\overline{a}$ if there exists a sequence of register configurations $\tau_{0}\tau_{1}\dots\in{(R_{k}\rightarrow\mathcal{D})}^{\omega}$ such that $\tau_{0}=\tau_{0}^{R_{k}}$ and for all $i\geq 1$ , $\tau_{i},d_{\mathbbm{i}}^{i}\models E_{i}$ , $d_{\mathbbm{o}}^{i}=\tau_{i}(r_{i})$ and $\tau_{i+1}=\text{next}(\tau_{i},d_{\mathbbm{i}}^{i},\textnormal{{asgn}}_{i})$ . In other words, $w$ is compatible with $\overline{a}$ if there exists some $k$ -register transducer and a run $\rho=t_{0}t_{1}\dots$ such that for all $i$ , $t_{i}$ is of the form $t_{i}=q_{i}\xrightarrow{\sigma_{\mathbbm{i}}^{i},E_{i}\mid\sigma_{\mathbbm{o}}^{i},\textnormal{{asgn}}_{i},r_{i}}q_{i+1}$ for some $q_{i},q_{i+1}\in Q_{T}$ . Note that this sequence is unique if it exists. We denote by $\textsf{Comp}(\overline{a})$ the set of relational data words compatible with $\overline{a}$ . Given a specification $S$ , we let $W_{S,k}=\{\overline{a}\mid\textsf{Comp}(\overline{a})\subseteq S\}$ . The set $W_{S,k}$ is then a specification over the finite input and output alphabets $A_{\mathbbm{i}}^{k}$ and $A_{\mathbbm{o}}^{k}$ .
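The compatibility relation can be illustrated on finite prefixes. The sketch below is written under explicit assumptions: the test is evaluated on the configuration before the assignment, and the output data is read after the assignment (which matches the convention $i\leq j$ used later for origin functions; the indexing in the definition above could be read differently). The names `next_config` and `compatible_prefix`, and the default initial data `"d0"`, are hypothetical.

```python
def next_config(tau, d, asgn):
    """Store d in every register of asgn; other registers are unchanged."""
    return {r: (d if r in asgn else v) for r, v in tau.items()}

def compatible_prefix(word, actions, k, d0="d0"):
    """Check a finite prefix of a relational data word against abstract actions.

    word:    list of ((sigma_in, d_in), (sigma_out, d_out)) pairs
    actions: list of ((sigma_in, E), (sigma_out, asgn, r)) pairs
    Assumption: test before assignment, output read after assignment.
    """
    if len(word) != len(actions):
        return False
    tau = {r: d0 for r in range(1, k + 1)}  # initial register configuration
    for ((s_in, d_in), (s_out, d_out)), ((a_in, E), (a_out, asgn, r)) in zip(word, actions):
        if s_in != a_in or s_out != a_out:
            return False                     # finite labels must agree
        if {x for x in tau if tau[x] == d_in} != set(E):
            return False                     # explicit test E must hold exactly
        tau = next_config(tau, d_in, asgn)
        if tau[r] != d_out:
            return False                     # output must be the content of r
    return True

actions = [(("a", frozenset()), ("b", frozenset({1}), 1)),
           (("a", frozenset({1})), ("b", frozenset(), 1))]
word = [(("a", 5), ("b", 5)), (("a", 5), ("b", 5))]
print(compatible_prefix(word, actions, k=1))
```

Because the configuration is updated deterministically from the input data and the assignments, the loop also reflects why the sequence of register configurations is unique when it exists.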

Theorem 8 (Transfer).

Let $S$ be a data word specification. The following are equivalent:

(1) $S$ is realisable by a transducer with $k$ registers.

(2) The (data-free) word specification $W_{S,k}$ is realisable by a (register-free) finite transducer.

Proof 4.1.

Let $T$ be a transducer with $k$ registers realising $S$. The tests of $T$ are implicit tests, so in a first step we make them explicit, possibly by adding new transitions to $T$. Formally, a transition $q\xrightarrow[T]{\sigma_{\mathbbm{i}},\phi\mid\sigma_{\mathbbm{o}},\textnormal{{asgn}},r}q^{\prime}$ is replaced by all the transitions $q\xrightarrow[T]{\sigma_{\mathbbm{i}},E\mid\sigma_{\mathbbm{o}},\textnormal{{asgn}},r}q^{\prime}$ for all $E\subseteq R_{k}$ such that $\phi_{E}\Rightarrow\phi$ is true. The resulting transducer can be seen as a finite transducer $T^{\prime}$ over input alphabet $A_{\mathbbm{i}}^{k}$ and output alphabet $A_{\mathbbm{o}}^{k}$. Moreover, since the transition function of $T$ is complete, so is that of $T^{\prime}$ (completeness is required by the definition of the transducers that define implementations).

Let us show that $W_{S,k}$ is realisable by $T^{\prime}$ , i.e. $L(T^{\prime})\subseteq W_{S,k}$ . Take a sequence $\overline{a}=a_{1}e_{1}a_{2}e_{2}\dots\in L(T^{\prime})$ . We show that $\textsf{Comp}(\overline{a})\subseteq S$ . Let $w\in\textsf{Comp}(\overline{a})$ . Then, there exists a run $q_{0}q_{1}q_{2}\dots$ of $T^{\prime}$ on $\overline{a}$ since $\overline{a}\in L(T^{\prime})$ . By definition of compatibility for $w$ , there exists a sequence of register configurations $\tau_{0}\tau_{1}\dots\in{(R_{k}\rightarrow\mathcal{D})}^{\omega}$ satisfying the conditions in the definition of compatibility. From this we can deduce that $(q_{0},\tau_{0})(q_{1},\tau_{1})\dots$ is an initial sequence of configurations of $T$ over $w$ , so $w\in L(T)$ . Finally, $T$ realises $S$ , and therefore $L(T)\subseteq S$ .

Conversely, suppose that $W_{S,k}$ is realisable by some finite transducer $T^{\prime}$ over the input (output) alphabets $A_{\mathbbm{i}}^{k}$ ( $A_{\mathbbm{o}}^{k}$ ). Again, the transducer $T^{\prime}$ can be seen as a transducer $T$ with $k$ registers over data words with explicit tests. We show that $T$ realises $S$ , i.e., $L(T)\subseteq S$ . Let $w\in L(T)$ . The run of $T$ over $w$ induces a sequence of actions $\overline{a}$ in ${(A_{\mathbbm{i}}^{k}A_{\mathbbm{o}}^{k})}^{\omega}$ which, by definition of compatibility, satisfies $w\in\textsf{Comp}(\overline{a})$ . Moreover, $\overline{a}\in L(T^{\prime})$ . Hence, since $T^{\prime}$ realises $W_{S,k}$ , we get $\textsf{Comp}(\overline{a})\subseteq S$ , so $w\in S$ , concluding the proof.
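The first step of the proof, replacing an implicit test $\phi$ by all explicit tests $E$ with $\phi_{E}\Rightarrow\phi$, can be sketched as follows. Since $\phi_{E}$ fixes the truth value of every atom, $\phi_{E}\Rightarrow\phi$ holds exactly when $\phi$ evaluates to true under the valuation described by $E$. The tuple encoding of formulas and the function names are assumptions made for this illustration only.

```python
from itertools import combinations

# Formulas as nested tuples (hypothetical encoding of the test grammar):
# ("top",), ("bot",), ("eq", r), ("neq", r), ("and", f, g), ("or", f, g), ("not", f)
def holds(phi, E):
    """Evaluate phi under the valuation where r^= is true iff r is in E."""
    op = phi[0]
    if op == "top":
        return True
    if op == "bot":
        return False
    if op == "eq":
        return phi[1] in E
    if op == "neq":
        return phi[1] not in E
    if op == "and":
        return holds(phi[1], E) and holds(phi[2], E)
    if op == "or":
        return holds(phi[1], E) or holds(phi[2], E)
    if op == "not":
        return not holds(phi[1], E)
    raise ValueError(f"unknown operator: {op}")

def expand(phi, k):
    """All explicit tests E over R_k such that phi_E implies phi."""
    regs = range(1, k + 1)
    subsets = (frozenset(c) for n in range(k + 1) for c in combinations(regs, n))
    return [E for E in subsets if holds(phi, E)]

phi = ("or", ("eq", 1), ("eq", 2))
print([sorted(E) for E in expand(phi, 2)])
```

A transition guarded by `phi` would then be duplicated once per element of `expand(phi, k)`, keeping the same output symbol, assignment and output register.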

4.2. The case of URA specifications

In this section, we show that for any $S$ a data word specification given as some URA, the language $W_{S,k}$ is effectively $\omega$ -regular, entailing the decidability of $(\textnormal{{URA}},\textnormal{{RT}}[k])$ , by Theorem 8 and the decidability of (data-free) synthesis. Let us first prove a series of intermediate lemmas.

We define an operation $\otimes$ between relational data words $w\in\textsf{RW}(\Sigma_{\mathbbm{i}},\Sigma_{\mathbbm{o}},\mathcal{D})$ and sequences of actions $\overline{a}\in{(A_{\mathbbm{i}}^{k}A_{\mathbbm{o}}^{k})}^{\omega}$ as follows: $w\otimes\overline{a}\in\textsf{RW}(A_{\mathbbm{i}}^{k},A_{\mathbbm{o}}^{k},\mathcal{D})$ is defined only if for all $i\geq 1$ , $\textsf{lab}(w[i])=\textsf{lab}(\overline{a}[i])$ where $\textsf{lab}(\overline{a}[i])$ is the first component of $\overline{a}[i]$ (a label in $\Sigma_{\mathbbm{i}}\cup\Sigma_{\mathbbm{o}}$ ), by $(w\otimes\overline{a})[i]=(\overline{a}[i],\textsf{dt}(w[i]))$ . Note that such operation is always defined when $w\in\textsf{Comp}(\overline{a})$ .

Lemma 9.

The language $L_{k}=\{w\otimes\overline{a}\mid w\in\textsf{Comp}(\overline{a})\}$ is definable by some NRA.

Proof 4.2.

We define an NRA with $k$ registers which roughly follows the actions it reads on its input. Its set of states is $\{q\}\cup\textnormal{{Asgn}}_{R}$, with initial state $q$. In state $q$, it is only allowed to read labelled data in $A_{\mathbbm{i}}^{k}\times\mathcal{D}$. On reading $(\sigma_{\mathbbm{i}},\phi,d)$, it guesses some assignment asgn, performs the test $\phi$ and the assignment asgn and goes to state asgn. In any state $\textnormal{{asgn}}\in\textnormal{{Asgn}}_{R}$, it is only allowed to read labelled data of the form $(\sigma_{\mathbbm{o}},\textnormal{{asgn}},r,d)$, for which it tests whether $d$ is equal to the content of $r$. It performs no assignment and moves back to state $q$. All states are accepting (i.e. have parity $0$). Such an NRA has size $O(2^{k^{2}})$.

Let $S$ be a specification defined by some URA $A_{S}$ with set of states $Q$ . The following subset of $L_{k}$ is definable by some NRA, where $\overline{S}$ denotes the complement of $S$ :

Lemma 10.

The language $L_{\overline{S},k}=\{w\otimes\overline{a}\mid w\in\textsf{Comp}(\overline{a})\cap\overline{S}\}$ is definable by some NRA.

Proof 4.3.

Since $S$ is definable by the URA $A_{S}$, $\overline{S}$ is NRA-definable with $\overline{A_{S}}$, a copy of $A_{S}$ with colouring function $\overline{c}:q\mapsto c(q)+1$, interpreted as an NRA. Let $B$ be some NRA defining $L_{k}$ (it exists by Lemma 9). It now suffices to take a product of $\overline{A_{S}}$ and $B$ to get an NRA defining $L_{\overline{S},k}$.

Given a data word language $L$ , we denote by $\textsf{lab}(L)=\{\textsf{lab}(w)\mid w\in L\}$ its projection on labels. The language $W_{S,k}$ is obtained as the complement of the label projection of $L_{\overline{S},k}$ :

Lemma 11.

$W_{S,k}=\overline{\textsf{lab}(L_{\overline{S},k})}$ .

Proof 4.4.

Let $\overline{a}\in{(A_{\mathbbm{i}}^{k}A_{\mathbbm{o}}^{k})}^{\omega}$ . Then, $\overline{a}\notin W_{S,k}\Leftrightarrow\textnormal{{Comp}}(\overline{a})\not\subseteq S\Leftrightarrow\exists w\in\textsf{RW},w\in\textnormal{{Comp}}(\overline{a})\cap\overline{S}\Leftrightarrow\exists w\in\textsf{RW},w\otimes\overline{a}\in L_{\overline{S},k}\Leftrightarrow\overline{a}\in\textsf{lab}(L_{\overline{S},k})$ .

We are now able to show the regularity of $W_{S,k}$ .

Lemma 12.

Let $S$ be a data word specification, $k\geq 0$ . If $S$ is definable by some URA with $n$ states and $r$ registers, then $W_{S,k}$ is effectively $\omega$ -regular, definable by some deterministic parity automaton with $O(2^{n^{2}\cdot 16^{{(r+k)}^{2}}})$ states and $O(n\cdot 4^{{(r+k)}^{2}})$ priorities.

Proof 4.5.

First, $L_{\overline{S},k}$ is definable by some NRA with $O(2^{k^{2}}n)$ states and $O(r+k)$ registers by Lemma 10, obtained as the product between the NRA $\overline{A_{S}}$ and the automaton obtained in Lemma 9, of size $O(2^{k^{2}})$. It is known that the projection on the alphabet of labels of a language of data words recognised by some NRA is effectively regular [KF94]. The same construction, which is based on extending the state space with register equality types, carries over to $\omega$-words, and one obtains a nondeterministic parity automaton with $O(n\cdot 4^{{(r+k)}^{2}})$ states and $d$ priorities recognising $\textsf{lab}(L_{\overline{S},k})$, where $d$ is the number of priorities of $\overline{A_{S}}$. It can be complemented into a deterministic parity automaton with $O(2^{n^{2}\cdot 16^{{(r+k)}^{2}}})$ states and $O(n\cdot 4^{{(r+k)}^{2}})$ priorities using standard constructions [Pit07].

We are now able to reprove the following result, known from [KMB18]:

Theorem 13.

For all $k\geq 0$ , $(\textnormal{{URA}},\textnormal{{RT}}[k])$ is in 2ExpTime.

Proof 4.6.

By Lemma 12, we construct a deterministic parity automaton $P_{S,k}$ for $W_{S,k}$. Then, according to Theorem 8, it suffices to check whether it is realisable by a (register-free) transducer. The way to decide it is to see $P_{S,k}$ as a two-player parity game and check whether the protagonist has a winning strategy. Parity games can be solved in time $O(m^{\log d})$ [CJK+17] where $m$ is the number of states of the game and $d$ the number of priorities. Overall, solving it requires doubly exponential time, more precisely in $O(2^{n^{3}\cdot 16^{{(r+k)}^{2}}})$.

4.3. The case of test-free NRA specifications

Unfortunately, by Theorem 2, the synthesis problem for specifications expressed as NRA is undecidable, even when the number of registers of the implementation is bounded. And indeed, if we mimic the reasoning of the previous section, we get that $L_{\overline{S},k}$ is definable by a URA, but Lemma 11 does not allow us to conclude because:

Proposition 14.

There exists a data word language $L$ which is URA-definable and whose string projection is not $\omega$ -regular.

Proof 4.7.

Consider

[TABLE]

which consists of a word $w\in r^{n}$ with pairwise distinct data followed by a word $w^{\prime}\in g^{m}$ which contains at least all the data of $w$, and extended with ${(\#,d)}^{\omega}$ to make it infinite (here, the choice of $d$ does not matter). Such a language can be interpreted as the request-grant specification, restricted to the case where all requests are made first, and are all made by pairwise distinct clients (plus an infinite $\#$ padding). $L$ is recognised by a URA which, on reading $(r,d_{i})$, universally triggers a run checking that

(1) Once a label $g$ is read, only $g$s are read; and after the last $g$, only $\#$s are read (this is an $\omega$-regular property).

(2) $(r,d_{i})$ does not appear again.

(3) $(g,d_{i})$ appears at least once.

Now, we have $\textsf{lab}(L)=\{r^{n}g^{m}\#^{\omega}\mid m\geq n\}$ , which is not $\omega$ -regular.
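To make the counting argument concrete, here is a small sketch that checks membership of the finite part of a word (the request block and the grant block) and builds a witness for every pair $m\geq n$. The encoding of the two blocks as Python lists of data values, and the function names, are assumptions for this illustration; the infinite $\#$ padding is left implicit.

```python
def in_L(requests, grants):
    """Finite part of a word of L: pairwise distinct request data,
    each of which is granted at least once."""
    distinct = len(set(requests)) == len(requests)
    all_granted = set(requests) <= set(grants)
    return distinct and all_granted

def witness(n, m):
    """A word of L whose label projection is r^n g^m #^omega, if one exists."""
    if m < n:
        return None  # n distinct data cannot all appear among fewer than n grants
    requests = list(range(n))
    grants = requests + [0] * (m - n)  # pad with an arbitrary data value
    return requests, grants

for n, m in [(2, 3), (3, 3), (3, 2)]:
    w = witness(n, m)
    print(n, m, w is not None and in_L(*w))
```

The `m < n` case is a pigeonhole argument: a grant block of length $m$ carries at most $m$ distinct data, so it cannot cover $n>m$ pairwise distinct requests. Comparing $n$ and $m$ is precisely the unbounded counting that a finite-state $\omega$-automaton cannot do.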

In this section, we consider the class of NRA which do not perform tests on input data, which we call test-free nondeterministic register automata ($\textsf{NRA}_{\textsf{tf}}$ for short). Such restriction is inspired from [DH16], which defines transformations of data words using MSO interpretations with an MSO origin relation. The MSO interpretation describes the transformation over the finite alphabet (called the string transduction), as in [Cou94], while the MSO origin relation describes the relation between input and output data. Such relation does not depend on (un)equalities between different input data: it uniquely maps each output position to an input position, expressing that the output data at this position is equal to the corresponding input data. They then show that such model is equivalent to two-way deterministic transducers with data variables (themselves equivalent to one-way streaming string transducers with data variables and parameters; such parameters are reminiscent of the guessing mechanism described in [KZ10]). Such data variables are used to implement the MSO origin relation: they are registers in which the transducer can store the input data values and output them, but it is not allowed to perform any test on the stored data, contrary to our model of register automata. To define $\textsf{NRA}_{\textsf{tf}}$, we apply the same restriction to NRA: they correspond to nondeterministic one-way transducers with data variables. Such machines can only rearrange input data (duplicate, erase, copy) regardless of the actual data values (as there are no tests). This way, as stated in Proposition 15, registers induce an origin relation between input and output data.

To avoid confusion between the nature of specifications and implementations, we prefer to define them as register automata, instead of transducers.

Definition (Test-free register automaton).

An NRA is test-free if:

(1) Its input transitions do not depend on equality relations between input data: for all $t\in\delta$, if $t=q\xrightarrow{\sigma,\phi,\textnormal{{asgn}}}q^{\prime}$ is an input transition, then $\phi=\top$.

(2) Its output transitions consist in outputting the content of some register: for all $t\in\delta$, if $t=q\xrightarrow{\sigma,\phi,\textnormal{{asgn}}}q^{\prime}$ is an output transition, then $\phi=r^{=}$ for some $r\in R$ and $\textnormal{{asgn}}=\varnothing$.

We now make the relation with the notion of origin precise: as shown in [DFL18], there is a tight connection between origin graphs and data words. Here, the encoding is slightly different, as we do not necessarily ask that the data labelling input position $n$ is equal to $n$ . However, as long as the input data are all pairwise distinct, such encoding carries to our setting: the output data at position $j$ is equal to $d_{\mathbbm{i}}^{i}$ , where $i$ is the (input) origin position. Thus, in the following, we let AllDiff denote the set of relational data words whose input data are pairwise distinct:

[TABLE]

where, by convention $d_{\mathbbm{i}}^{0}=\textsf{d}_{0}$ . Then, as we will show, the behaviour of an $\textsf{NRA}_{\textsf{tf}}$ over AllDiff determines its origin relation, and hence its behaviour over the entire data domain.

To a run $\rho=q_{0}\xrightarrow{\sigma^{1}_{\mathbbm{i}},\textnormal{{asgn}}^{1},r^{1},\sigma^{1}_{\mathbbm{o}}}q_{1}\xrightarrow{\sigma^{2}_{\mathbbm{i}},\textnormal{{asgn}}^{2},r^{2},\sigma^{2}_{\mathbbm{o}}}q_{2}\dots$, we associate the origin function $o_{\rho}:j\mapsto\max\{i\leq j\mid r^{j}\in\textnormal{{asgn}}^{i}\}$, with the convention $\max\varnothing=0$. In other words, $o_{\rho}(j)$ is the last input position at which the register output at position $j$ was assigned, so the corresponding input data is the one which is output (if the register has never been assigned, it contains $\textsf{d}_{0}$, which, by convention, is the data associated with input position $0$).

Now, for an origin function $o:\mathbb{N}\backslash\{0\}\rightarrow\mathbb{N}$ and for a relational data word $w\in\textsf{RW}$ , we say $w$ is compatible with the origin function $o$ , denoted $w\models o$ , whenever for all $j\geq 1$ , $\textsf{dt}(\textsf{out}(w)[j])=\textsf{dt}(\textsf{inp}(w)[o(j)])$ , with the convention $\textsf{dt}(\textsf{inp}(w)[0])=\textsf{d}_{0}$ .
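The origin function of a run and the compatibility relation $w\models o$ can be sketched on finite prefixes as follows. The encoding of a run as a list of pairs (assignment, output register), the dictionary representation of $o$, and the default initial data `"d0"` are assumptions made for this illustration.

```python
def origin_function(steps):
    """steps[j-1] = (asgn_j, r_j): registers assigned and register output
    at position j (1-based). Returns o as a dict j -> origin position,
    with 0 when r_j has never been assigned (convention max(empty) = 0)."""
    last_assigned = {}
    o = {}
    for j, (asgn, r) in enumerate(steps, start=1):
        for reg in asgn:
            last_assigned[reg] = j  # position j counts, since i <= j is allowed
        o[j] = last_assigned.get(r, 0)
    return o

def compatible_with_origin(inp_data, out_data, o, d0="d0"):
    """w |= o on a finite prefix: the j-th output data equals the input data
    at position o(j), where input position 0 carries d0 by convention."""
    padded = [d0] + list(inp_data)
    return all(out_data[j - 1] == padded[o[j]] for j in o)

steps = [({1}, 1), (set(), 1), ({2}, 1), ({1}, 2)]
o = origin_function(steps)
print(o)
print(compatible_with_origin(["a", "b", "c", "d"], ["a", "a", "a", "c"], o))
```

Note that `origin_function` never looks at data values: it depends only on the assignments and output registers along the run, which is the point exploited in the next proposition.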

The following proposition shows that actual data values in a word $w$ do not matter with respect to membership in some $\textsf{NRA}_{\textsf{tf}}$ , only the compatibility with origin functions does:

Proposition 15.

Let $w\in\textsf{RW}$ and $\rho$ a sequence of transitions of some $\textsf{NRA}_{\textsf{tf}}$. Then,

(1) If $\rho$ is a run over $w$, then $w\models o_{\rho}$.

(2) If $\rho$ is a run over $w$ and $w\in\textsc{AllDiff}$, then for all $o:\mathbb{N}\backslash\{0\}\rightarrow\mathbb{N}$, $w\models o\Leftrightarrow o=o_{\rho}$.

(3) If $w$ and $\rho$ have the same finite labels and if $w\models o_{\rho}$, then $\rho$ is a run over $w$.

Proof 4.8.

(1) and (3) follow from the semantics of $\textsf{NRA}_{\textsf{tf}}$, which do not conduct any test on the input data. The $\Leftarrow$ direction of (2) is exactly (1). Now, assume $w\in\textsc{AllDiff}$ admits $\rho$ as a run, and let $o$ be such that $w\models o$. Let $j\geq 1$. By definition of $w\models o$, we have $\textsf{dt}(\textsf{out}(w)[j])=\textsf{dt}(\textsf{inp}(w)[o(j)])$. By (1) we know that $\textsf{dt}(\textsf{out}(w)[j])=\textsf{dt}(\textsf{inp}(w)[o_{\rho}(j)])$, so $\textsf{dt}(\textsf{inp}(w)[o(j)])=\textsf{dt}(\textsf{inp}(w)[o_{\rho}(j)])$. Since $w\in\textsc{AllDiff}$, this implies $o(j)=o_{\rho}(j)$, so, overall, $o=o_{\rho}$.

It is not clear whether $W_{S,k}$ is regular for $\textsf{NRA}_{\textsf{tf}}$ specifications, but we show that it suffices to consider another set denoted $W_{S,k}^{\textsf{tf}}$ which is easier to analyse (and can be proven regular), which describes the behaviour of $S$ over input with pairwise distinct data. Indeed, as expressed by the above proposition, $\textsf{NRA}_{\textsf{tf}}$ cannot conduct tests on input data, and their behaviour only depends on the input labels. Thus, it suffices to study runs on input words whose data are all distinct; such choice ensures that two equal input data will not ease the task of the implementation. Otherwise, it could be that on reading a data word, two registers $r_{1}$ and $r_{2}$ are equal, and then the implementation can simultaneously take transitions labelled with $\textsf{out}(r_{1})$ and $\textsf{out}(r_{2})$. An interesting side-product of this approach is that it implies that we can restrict to test-free implementations. A test-free transducer is a transducer whose transitions do not depend on tests over input data, i.e., for all transitions $t=q\xrightarrow{\sigma_{\mathbbm{i}},\phi\mid\textnormal{{asgn}},\sigma_{\mathbbm{o}},r}q^{\prime}\in\delta$, we have $\phi=\top$.

Proposition 16.

Let $S$ be an $\textsf{NRA}_{\textsf{tf}}$ specification, and $A_{\mathbbm{i}}^{\varnothing}=\Sigma_{\mathbbm{i}}\times\{\varnothing\}$. The following are equivalent:

(1) $S$ is realisable.

(2) $W_{S,k}^{\textsf{tf}}=\{\overline{a}\in{(A_{\mathbbm{i}}^{\varnothing}A_{\mathbbm{o}}^{k})}^{\omega}\mid\textnormal{{Comp}}(\overline{a})\cap S\cap{\textsc{AllDiff}}\neq\varnothing\}$ is realisable by a (register-free) transducer with input alphabet $A_{\mathbbm{i}}^{\varnothing}$.

(3) $S$ is realisable by a test-free transducer.

Proof 4.9.

$(3)\Rightarrow(1)$ is trivial.

$(1)\Rightarrow(2)$: If $S$ is realisable, then, by Theorem 8, $W_{S,k}$ is realisable by some transducer $I$. Now, since transducers are closed under regular domain restriction, $W_{S,k}^{\varnothing}=W_{S,k}\cap{(A_{\mathbbm{i}}^{\varnothing}A_{\mathbbm{o}}^{k})}^{\omega}$ is realisable by $I$ restricted to the input alphabet $A_{\mathbbm{i}}^{\varnothing}$; more precisely, by the transducer $I^{\prime}$ with the same set of states as $I$ and transition function $\delta^{\prime}=\delta\cap\left(Q_{I}\times\Sigma_{\mathbbm{i}}\times\{\varnothing\}\rightarrow\textnormal{{Asgn}}_{R_{k}}\times\Sigma_{\mathbbm{o}}\times R_{k}\times Q_{I}\right)$. Moreover, $W_{S,k}^{\varnothing}\subseteq W_{S,k}^{\textsf{tf}}$. Indeed, let $\overline{a}\in W_{S,k}^{\varnothing}$. Then, $\textnormal{{Comp}}(\overline{a})\subseteq S$. It is easy to build by induction a data word $w\in\textnormal{{Comp}}(\overline{a})\cap\textsc{AllDiff}$, so $\textnormal{{Comp}}(\overline{a})\cap S\cap\textsc{AllDiff}\neq\varnothing$. Thus, $W_{S,k}^{\textsf{tf}}$ is realisable by any transducer realising $W_{S,k}^{\varnothing}$.

$(2)\Rightarrow(3)$: Now, assume $W_{S,k}^{\textsf{tf}}$ is realisable by some transducer $I$. We show that $I$, when ignoring the $\varnothing$ input tests, is actually an implementation of $S$. Thus, let $I^{\prime}$ be the same transducer as $I$ except that all input tests $\varnothing$ have been replaced with $\top$. Formally, $q\xrightarrow[I^{\prime}]{\sigma_{\mathbbm{i}},\top\mid\textnormal{{asgn}},\sigma_{\mathbbm{o}},r}q^{\prime}$ iff $q\xrightarrow[I]{\sigma_{\mathbbm{i}},\varnothing\mid\textnormal{{asgn}},\sigma_{\mathbbm{o}},r}q^{\prime}$. Note that $I^{\prime}$, interpreted as a register transducer, is test-free. Let $w\in\textsf{DW}$, and $\overline{a}_{\mathbbm{i}}=\textsf{lab}(w)\times\varnothing^{\omega}$ be the input action in $A_{\mathbbm{i}}^{\varnothing}$ with same finite labels as $w$. Let $\overline{a}=I(\overline{a}_{\mathbbm{i}})$, and let $w^{\prime}\in\textnormal{{Comp}}(\overline{a})\cap S\cap\textsc{AllDiff}$ (such $w^{\prime}$ exists because, as above, $\textnormal{{Comp}}(\overline{a})\cap\textsc{AllDiff}\neq\varnothing$). Then, since $\textsf{lab}(w)=\textsf{lab}(w^{\prime})$, they admit the same run $\rho^{I}$ in $I$, so $w,w^{\prime}\models o_{\rho^{I}}$. Now, $w^{\prime}\in S$, so it admits an accepting run $\rho^{S}$ in $S$, which implies $w^{\prime}\models o_{\rho^{S}}$. Moreover, $w^{\prime}\in\textsc{AllDiff}$ so, by Proposition 15 (2), we get $o_{\rho^{I}}=o_{\rho^{S}}$. Therefore, $w\models o_{\rho^{S}}$, so, by Proposition 15 (3), $w$ admits $\rho^{S}$ as a run, i.e. $w\in S$. Overall, $L(I^{\prime})\subseteq S$, meaning that $I^{\prime}$ is a (test-free) implementation of $S$.

Finally, $W_{S,k}^{\textnormal{$ \textsf{tf} $}}=\{\overline{a}\in{(A_{\mathbbm{i}}^{\varnothing}A_{\mathbbm{o}}^{k})}^{\omega}\mid\textnormal{{Comp}}(\overline{a})\cap S\cap\textsc{AllDiff}\neq\varnothing\}$ is regular. Indeed, $W_{S,k}^{\textnormal{$ \textsf{tf} $}}=\{\overline{a}\in{(A_{\mathbbm{i}}^{\varnothing}A_{\mathbbm{o}}^{k})}^{\omega}\mid\textnormal{{Comp}}(\overline{a})\cap S^{\varnothing}\neq\varnothing\}$ , where $S^{\varnothing}$ is the same automaton as $S$ except that all input transitions $q\xrightarrow{\sigma_{\mathbbm{i}},\top,\textnormal{{asgn}}}q^{\prime}$ have been replaced with $q\xrightarrow{\sigma_{\mathbbm{i}},\bigwedge_{r\in R_{k}}r^{\neq},\textnormal{{asgn}}}q^{\prime}$ , because, for all $\overline{a}\in{(A_{\mathbbm{i}}^{\varnothing}A_{\mathbbm{o}}^{k})}^{\omega}$ , $\textnormal{{Comp}}(\overline{a})\cap S\cap\textsc{AllDiff}\neq\varnothing\Leftrightarrow\textnormal{{Comp}}(\overline{a})\cap S^{\varnothing}\neq\varnothing$ (the $\Rightarrow$ direction is trivial, and the $\Leftarrow$ stems from the fact that an AllDiff input only takes $\phi=\varnothing$ transitions).

Then, $L_{S,k}^{\textnormal{$ \textsf{tf} $}}=\{w\otimes\overline{a}\in\textsf{RW}\otimes{(A_{\mathbbm{i}}^{\varnothing}A_{\mathbbm{o}}^{k})}^{\omega}\mid w\in\textsf{Comp}(\overline{a})\cap S^{\varnothing}\}$ is NRA-definable. Indeed, $S$ is $\textsf{NRA}_{\textsf{tf}}$ -definable, so $S^{\varnothing}$ is NRA-definable, and by Lemma 9, $L_{k}=\{w\otimes\overline{a}\mid w\in\textsf{Comp}(\overline{a})\}$ is NRA-definable, so their product recognises $L_{S,k}^{\textnormal{$ \textsf{tf} $}}$ . Finally, $W_{S,k}^{\textnormal{$ \textsf{tf} $}}=\textsf{lab}(L_{S,k}^{\textnormal{$ \textsf{tf} $}})$ , and the projection of a NRA over some finite alphabet is regular [KF94].

Overall, by Theorem 8, we finally get (the complexity analysis is the same as for URA):

Theorem 17.

For all $k\geq 0$ , $(\textnormal{$ \textsf{NRA}_{\textsf{tf}} $},\textnormal{{RT}}[k])$ is decidable and in 2ExpTime.

5. Synthesis and Uniformisation

In this section, we discuss the connection between synthesis and uniformisation of relations, which is a more general problem: as pointed out in Section 2, if $S$ is realisable by a register transducer, then, in particular, it has a total domain, i.e. $\textsf{inp}(S)=\textsf{DW}(\Sigma_{\mathbbm{i}},\mathcal{D})$, otherwise it cannot be that $L(T)\subseteq S$ for $T$ a register transducer, since by definition of transducers $\textsf{inp}(T)=\textsf{DW}(\Sigma_{\mathbbm{i}},\mathcal{D})$. However, when defining a specification, the user might be interested only in a subset of behaviours (for instance, s/he knows that all input data will be pairwise distinct). In the finite alphabet setting, since the formalisms used to express specifications are closed under complement (whether it is LTL or $\omega$-automata), it is actually not a restriction to assume that the input domain of the specification is total: it suffices to complete the specification by allowing any behaviour on the input not considered. However, since register automata are not closed under complement, such an approach is not possible here. Thus, it is relevant to generalise the realisability problem to the case where the domain of the specification is not total. This can be done by equipping register transducers with an acceptance condition. It is also necessary to adapt the notion of realisability; otherwise, any transducer accepting no words realises any specification (since it is always the case that $\varnothing\subseteq S$). A natural way is to consider synthesis as a uniformisation problem [FJLW16]. An (implementation) function $f:\textnormal{{In}}\rightarrow\textnormal{{Out}}$ is said to uniformise a (specification) relation $R\subseteq\textnormal{{In}}\times\textnormal{{Out}}$ whenever:

(1) $\mathrm{dom}(f)=\mathrm{dom}(R)$ and

(2) for all $i\in\mathrm{dom}(f),(i,f(i))\in R$

Note that constraint 1 is the main difference with the notion of realisability.
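On finite relations, the two constraints and the role of the domain condition can be checked directly. The sketch below is only an illustration: encoding $f$ as a Python dictionary and $R$ as a set of pairs is an assumption, and `graph_included` stands for a realisability-style inclusion check without the domain constraint.

```python
def uniformises(f, R):
    """f (a dict) uniformises R (a set of pairs) iff dom(f) = dom(R)
    and (i, f(i)) is in R for every i in dom(f)."""
    dom_R = {i for (i, _) in R}
    return set(f) == dom_R and all((i, f[i]) in R for i in f)

def graph_included(f, R):
    """Inclusion of the graph of f in R, without the domain constraint."""
    return all((i, f[i]) in R for i in f)

R = {(1, "a"), (1, "b"), (2, "c")}
print(uniformises({1: "b", 2: "c"}, R))  # picks one output per input of dom(R)
print(uniformises({1: "a"}, R))          # domain too small
print(graph_included({}, R), uniformises({}, R))
```

The last line shows why the domain constraint matters: the empty function is trivially included in any relation, yet it uniformises only the empty relation.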

In the context of reactive synthesis, where $f=f_{I}$ is defined from an implementation $I$ and $R$ is given as a language of relational words, it can be rephrased as

(1) $\textsf{inp}(L(I))=\textsf{inp}(R)$ and

(2) for all $w_{\mathbbm{i}}\in\textsf{inp}(L(I)),\langle w_{\mathbbm{i}},f_{I}(w_{\mathbbm{i}})\rangle\in R$

Note that such definition coincides with the one of realisability of Section 2 when the class of implementations has total domain, because then it is equivalent to asking $L(I)\subseteq R$ . In the following, we denote by $\textsc{Unif}(\mathcal{S},\mathcal{I})$ the uniformisation problem from specifications in $\mathcal{S}$ to implementations in $\mathcal{I}$ . Unfortunately, this setting is actually much harder, as shown by the next two theorems:

Theorem 18.

Given $S$ a specification represented by a DRA, checking whether $\textsf{inp}(S)=\textsf{DW}(\Sigma_{\mathbbm{i}},\mathcal{D})$ is undecidable.

Proof 5.1.

We reduce from the universality problem of NRA, which is undecidable [NSV04]. Let $A=(\Sigma,\mathcal{D},Q,q_{0},\delta,R,c)$ be an NRA. We encode $L(A)$ as the domain of some DRA specification: the input transitions are the same as the transitions of the original automaton, but when there is some nondeterminism, its resolution is postponed to the corresponding output transition, whose finite label corresponds to the chosen transition. In the vocabulary of games, the input player chooses the finite input label and the equality relation of the input data to the registers of $A$ , and the output player resolves the nondeterminism. Thus, we construct a DRA $D$ accepting $R(D)=\{((\sigma_{1},d_{1})(\sigma_{2},d_{2})\dots,(t_{1},d_{1})(t_{2},d_{2})\dots)\mid t_{1}t_{2}\dots$ is a run of $A$ over $(\sigma_{1},d_{1})(\sigma_{2},d_{2})\dots\}$ .

Thus, define $D=(\Sigma\uplus\delta,\mathcal{D},Q\uplus Q\times(\Sigma\times\textnormal{{Tst}}_{R}),q_{0},\delta^{\prime},R\uplus\{r_{0}\},c^{\prime})$, where $\delta^{\prime}$ is defined as follows: for all $q\in Q$, $\sigma\in\Sigma$ and $\phi\in\textnormal{{Tst}}_{R}$, we define the input transition $q\xrightarrow[D]{\sigma,\phi,\{r_{0}\}}(q,(\sigma,\phi))$. Then, for all $t=q\xrightarrow[A]{\sigma,\phi,\textnormal{{asgn}}}q^{\prime}\in\delta$, we define the output transition $(q,(\sigma,\phi))\xrightarrow[D]{t,\phi\wedge r_{0}^{=},\textnormal{{asgn}}}q^{\prime}$. Then, let $c^{\prime}:q\mapsto c(q)$ and $(q,\bullet)\mapsto c(q)$. Such an automaton is indeed deterministic, and it recognises the relation $R(D)=\{((\sigma_{1},d_{1})(\sigma_{2},d_{2})\dots,(t_{1},d_{1})(t_{2},d_{2})\dots)\mid t_{1}t_{2}\dots$ is a run of $A$ over $(\sigma_{1},d_{1})(\sigma_{2},d_{2})\dots\}$. Then, $\textsf{inp}(R(D))$ is universal iff $L(A)$ is universal.
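The shape of $\delta^{\prime}$ can be sketched as follows. This is only an illustration of the construction under simplifying assumptions: tests are treated as opaque, pairwise exclusive tokens drawn from a finite list `tests`, transitions of $A$ are tuples `(q, sigma, phi, asgn, q2)`, and the conjunction $\phi\wedge r_{0}^{=}$ is kept symbolic. None of these encodings come from the paper.

```python
def build_D_transitions(states, labels, tests, delta_A, r0="r0"):
    """Build the input and output transitions of D from the NRA A.

    Input transitions store the current data in the fresh register r0 and
    remember the pair (sigma, phi) in the target state. Output transitions
    are labelled by a transition t of A, so that the output player resolves
    the nondeterminism of A.
    """
    input_transitions = []
    for q in states:
        for sigma in labels:
            for phi in tests:
                target = (q, (sigma, phi))
                input_transitions.append((q, sigma, phi, frozenset({r0}), target))

    output_transitions = []
    for t in delta_A:
        q, sigma, phi, asgn, q2 = t
        source = (q, (sigma, phi))
        guard = ("and", phi, ("eq", r0))  # symbolic phi AND r0^=
        output_transitions.append((source, t, guard, asgn, q2))
    return input_transitions, output_transitions

# Hypothetical NRA with two transitions that are nondeterministic in A.
delta_A = [("p", "a", "E0", frozenset(), "p"),
           ("p", "a", "E0", frozenset({"r"}), "q")]
ins, outs = build_D_transitions(["p", "q"], ["a"], ["E0", "E1"], delta_A)
print(len(ins), len(outs))
```

The two transitions of `delta_A` leave the same state with the same label and test, but in $D$ they become output transitions with distinct finite labels (the transitions themselves), which is how nondeterminism is turned into a choice of the output player.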

Such a result extends to NRA and URA, of which DRA are a special case. Note that the unbounded realisability problem for DRA is not reducible to deciding whether the domain is total: if the specification $S$ is not realisable, it is not possible to determine whether it is because the domain of $S$ is not total or because $S$ is not realisable by a sequential machine (e.g. $S$ asks to output right away a data that will only be input in the future).

While the uniformisation setting obviously preserves the undecidability results from the synthesis setting, the above result also allows us to show that the somewhat more general uniformisation problem is undecidable. For instance, we can prove:

Theorem 19.

For all $k\geq 1$ , $\textsc{Unif}(\textnormal{{URA}},\textnormal{{RT}}[k])$ is undecidable.

Proof 5.2.

Consider an unrealisable URA specification $S_{u}$ and the specification $S$ that maps $w_{1}\#w_{2}$ to $w_{1}\#w^{\prime}_{2}$ such that $(w_{2},w^{\prime}_{2})\in S_{u}$, defined only when $w_{1}$ is a finite data word accepted by some URA $A$. Clearly, $S$ is URA-definable, and it is realisable iff its domain is empty, i.e. $L(A)=\varnothing$. However, emptiness of URA is an undecidable problem.
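
Written as a relation, the specification used in this proof can be read as follows. This is only a restatement of the sentence above in set notation, assuming that $S_{u}$ has a non-empty domain (an assumption made explicit here, so that the domain of $S$ is empty exactly when $L(A)$ is):

$$S=\{(w_{1}\#w_{2},\;w_{1}\#w^{\prime}_{2})\mid w_{1}\in L(A),\ (w_{2},w^{\prime}_{2})\in S_{u}\},\qquad \textsf{inp}(S)=\varnothing\iff L(A)=\varnothing.$$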

If the domain of the specification is DRA-recognisable, it is possible to reduce the uniformisation problem to realisability, by allowing any behaviour on the complement of the domain (which is then DRA-recognisable). However, this property is undecidable, as a direct corollary of Theorem 18.

Conclusion

In this paper, we have given a picture of the decidability landscape of the synthesis of register transducers from register automata specifications. We studied the parity acceptance condition because of its generality, but our results allow us to reduce the synthesis problem for register automata specifications to the one for finite automata while preserving the acceptance condition. We have also introduced and studied test-free NRA, which do not have the ability to test their input, but still have the power of duplicating, removing or copying the input data to form the output. We have shown that they allow us to recover decidability in the presence of nondeterminism, in the bounded synthesis case. We leave open the unbounded case, which we conjecture to be decidable. As future work, we want to study synthesis problems for register automata which are able to test additional properties over the data. In particular, allowing data to be compared with respect to an order over $\mathcal{D}$ [BLP10b, FHL16] looks promising. Note that most other natural predicates immediately yield undecidability, e.g. adding +1. Another direction is to study specifications given by logical formulae, for decidable data word logics such as two-variable fragments of FO [BMS+06, SZ12, DFL18]. This problem is however much more challenging, as there is no good correspondence between logic and automata in the realm of data words, except in very restricted settings [BLP10a].

Acknowledgments

The authors would like to thank Ayrat Khalimov for his remarks and suggestions, which helped improve the quality of the paper. They also thank the anonymous reviewers, who took the time to read the paper in detail and subsequently suggested important clarifications as well as simplifications in the proofs.

Bibliography

[AG11] Krzysztof R. Apt and Erich Grädel. Lectures in Game Theory for Computer Scientists. Cambridge University Press, New York, NY, USA, 1st edition, 2011.
[BCJ18] Roderick Bloem, Krishnendu Chatterjee, and Barbara Jobstmann. Graph Games and Reactive Synthesis, pages 921–962. Springer International Publishing, Cham, 2018.
[BL69] J.R. Büchi and L.H. Landweber. Solving sequential conditions by finite-state strategies. Transactions of the American Mathematical Society, 138:295–311, 1969.
[BLP10a] Michael Benedikt, Clemens Ley, and Gabriele Puppis. Automata vs. logics on data words. In Anuj Dawar and Helmut Veith, editors, Computer Science Logic, 24th International Workshop, CSL 2010, 19th Annual Conference of the EACSL, Brno, Czech Republic, August 23-27, 2010. Proceedings, volume 6247 of Lecture Notes in Computer Science, pages 110–124. Springer, 2010.
[BLP10b] Michael Benedikt, Clemens Ley, and Gabriele Puppis. What you must remember when processing data words. In Alberto H. F. Laender and Laks V. S. Lakshmanan, editors, Proceedings of the 4th Alberto Mendelzon International Workshop on Foundations of Data Management, Buenos Aires, Argentina, May 17-20, 2010, volume 619 of CEUR Workshop Proceedings. CEUR-WS.org, 2010.
[BMS+06] Mikołaj Bojańczyk, Anca Muscholl, Thomas Schwentick, Luc Segoufin, and Claire David. Two-Variable Logic on Words with Data. In Proceedings of the 21st IEEE Symposium on Logic in Computer Science (LICS 2006), pages 7–16. ACM, 2006.
[CJK+17] Cristian S. Calude, Sanjay Jain, Bakhadyr Khoussainov, Wei Li, and Frank Stephan. Deciding parity games in quasipolynomial time. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing (STOC 2017), pages 252–263. ACM, 2017.
[CKS81] Ashok K. Chandra, Dexter C. Kozen, and Larry J. Stockmeyer. Alternation. J. ACM, 28(1):114–133, January 1981.
