Lambda Calculus and Probabilistic Computation

Claudia Faggian; Simona Ronchi della Rocca

arXiv:1901.02853·cs.LO·May 13, 2019

Lambda Calculus and Probabilistic Computation

Claudia Faggian, Simona Ronchi della Rocca

PDF

TL;DR

This paper introduces probabilistic extensions of the lambda calculus for call-by-value and call-by-name evaluation strategies, establishing confluence and standardization properties, and unifies them through a linear logic-based calculus for better control of probabilistic choice and copying.

Contribution

It presents novel probabilistic lambda calculi with confluence and standardization proofs, and unifies them via a linear logic framework for enhanced control over probabilistic computation.

Findings

01

Both calculi enjoy confluence and standardization.

02

A unified calculus based on Linear Logic is developed.

03

The approach allows fine control of choice and copying interactions.

Abstract

We introduce two extensions of the $λ$ -calculus with a probabilistic choice operator, $Λ_{\oplus}^{c b v}$ and $Λ_{\oplus}^{c bn}$ , modeling respectively call-by-value and call-by-name probabilistic computation. We prove that both enjoys confluence and standardization, in an extended way: we revisit these two fundamental notions to take into account the asymptotic behaviour of terms. The common root of the two calculi is a further calculus based on Linear Logic, $Λ_{\oplus}^{!}$ , which allows for a fine control of the interaction between choice and copying, and which allows us to develop a unified, modular approach.

Equations56

\mu(M)=\left\{\begin{array}[]{ll}p&\mbox{ if }p=\sum_{i\in I}p_{i}\mbox{ s.t. }M_{i}=M\\ 0&\mbox{otherwise;}\end{array}\right.

\mu(M)=\left\{\begin{array}[]{ll}p&\mbox{ if }p=\sum_{i\in I}p_{i}\mbox{ s.t. }M_{i}=M\\ 0&\mbox{otherwise;}\end{array}\right.

C ((λ x . M) V) \to_{β_{v}} [C (M [V / x])] \lx@proof@logical@and (λ x . M) V \mapsto_{β_{v}} M [V / x] V \in V

C ((λ x . M) V) \to_{β_{v}} [C (M [V / x])] \lx@proof@logical@and (λ x . M) V \mapsto_{β_{v}} M [V / x] V \in V

S (M \oplus N) \to_{\oplus} [\frac{1}{2} S (M), \frac{1}{2} S (N)] M \oplus N \mapsto_{l \oplus} M M \oplus N \mapsto_{r \oplus} N

S (M \oplus N) \to_{\oplus} [\frac{1}{2} S (M), \frac{1}{2} S (N)] M \oplus N \mapsto_{l \oplus} M M \oplus N \mapsto_{r \oplus} N

[M] \Rightarrow [M] [M] \Rightarrow m M \to m [p_{i} M_{i} ∣ i \in I] \Rightarrow i \in I \sum p_{i} \cdot m_{i} ([M_{i}] \Rightarrow m_{i})_{i \in I}

[M] \Rightarrow [M] [M] \Rightarrow m M \to m [p_{i} M_{i} ∣ i \in I] \Rightarrow i \in I \sum p_{i} \cdot m_{i} ([M_{i}] \Rightarrow m_{i})_{i \in I}

\begin{array}[]{lcl|lcl}(x)_{\lambda}&=&x&(MN)_{\lambda}&=&(M)_{\lambda}(N)_{\lambda}\\ (M\oplus N)_{\lambda}&=&z(M)_{\lambda}(N)_{\lambda}&(\lambda x.M)_{\lambda}&=&\lambda x.(M)_{\lambda}\\ \end{array}

\begin{array}[]{lcl|lcl}(x)_{\lambda}&=&x&(MN)_{\lambda}&=&(M)_{\lambda}(N)_{\lambda}\\ (M\oplus N)_{\lambda}&=&z(M)_{\lambda}(N)_{\lambda}&(\lambda x.M)_{\lambda}&=&\lambda x.(M)_{\lambda}\\ \end{array}

L ::= □ ∣ L M ∣ V L

L ::= □ ∣ L M ∣ V L

n sup {U \in Obs \sum μ_{n} (U)} \leavevmode = \leavevmode U \in Obs \sum n sup {μ_{n} (U)} .

n sup {U \in Obs \sum μ_{n} (U)} \leavevmode = \leavevmode U \in Obs \sum n sup {μ_{n} (U)} .

\begin{array}[]{lcl|lcl}(x)_{!}&=&x&(\lambda x.M)_{!}&=&\lambda x.(M)_{!}\\ (M\oplus N)_{!}&=&z\leavevmode\nobreak\ !(M)_{!}\leavevmode\nobreak\ !(N)_{!}&(\lambda!x.M)_{!}&=&\lambda!x.(M)_{!}\\ (MN)_{!}&=&(M)_{!}(N)_{!}&(!M)_{!}&=&!(M)_{!}\\ \end{array}

\begin{array}[]{lcl|lcl}(x)_{!}&=&x&(\lambda x.M)_{!}&=&\lambda x.(M)_{!}\\ (M\oplus N)_{!}&=&z\leavevmode\nobreak\ !(M)_{!}\leavevmode\nobreak\ !(N)_{!}&(\lambda!x.M)_{!}&=&\lambda!x.(M)_{!}\\ (MN)_{!}&=&(M)_{!}(N)_{!}&(!M)_{!}&=&!(M)_{!}\\ \end{array}

H ::= λ x_{1} \dots λ x_{k} . □ P_{1} \dots P_{n} (head contexts)

H ::= λ x_{1} \dots λ x_{k} . □ P_{1} \dots P_{n} (head contexts)

\begin{array}[]{lcl|lcl}(x)_{{}_{\mathtt{N}}}&=&x&(\lambda x.M)_{{}_{\mathtt{N}}}&=&\lambda!x.(M)_{{}_{\mathtt{N}}}\\ (MN)_{{}_{\mathtt{N}}}&=&(M)_{{}_{\mathtt{N}}}!(N)_{{}_{\mathtt{N}}}&(M\oplus N)_{{}_{\mathtt{N}}}&=&(M)_{{}_{\mathtt{N}}}\oplus(N)_{{}_{\mathtt{N}}}\\ \end{array}

\begin{array}[]{lcl|lcl}(x)_{{}_{\mathtt{N}}}&=&x&(\lambda x.M)_{{}_{\mathtt{N}}}&=&\lambda!x.(M)_{{}_{\mathtt{N}}}\\ (MN)_{{}_{\mathtt{N}}}&=&(M)_{{}_{\mathtt{N}}}!(N)_{{}_{\mathtt{N}}}&(M\oplus N)_{{}_{\mathtt{N}}}&=&(M)_{{}_{\mathtt{N}}}\oplus(N)_{{}_{\mathtt{N}}}\\ \end{array}

\begin{array}[]{lcl}(\boldsymbol{[}p_{i}M_{i}\mid i\in I\boldsymbol{]})_{{}_{\mathtt{N}}}&=&\boldsymbol{[}p_{i}(M_{i})_{{}_{\mathtt{N}}}\mid i\in I\boldsymbol{]}\end{array}

\begin{array}[]{lcl}(\boldsymbol{[}p_{i}M_{i}\mid i\in I\boldsymbol{]})_{{}_{\mathtt{N}}}&=&\boldsymbol{[}p_{i}(M_{i})_{{}_{\mathtt{N}}}\mid i\in I\boldsymbol{]}\end{array}

(λ x . M) V \to_{s} [M [V / x]] V \in V M \oplus N \to_{s} [\frac{1}{2} M, \frac{1}{2} N]

(λ x . M) V \to_{s} [M [V / x]] V \in V M \oplus N \to_{s} [\frac{1}{2} M, \frac{1}{2} N]

M N \to_{s} m @ N M \to_{s} m M N \to_{s} N @ n N \to_{s} n

M N \to_{s} m @ N M \to_{s} m M N \to_{s} N @ n N \to_{s} n

(λ x . M) V \to_{l} [M [V / x]] V \in V M \oplus N \to_{l} [\frac{1}{2} M, \frac{1}{2} N]

(λ x . M) V \to_{l} [M [V / x]] V \in V M \oplus N \to_{l} [\frac{1}{2} M, \frac{1}{2} N]

M N \to_{l} m @ N M \to_{l} m V N \to_{l} V @ n \lx@proof@logical@and V \in V N \to_{l} n

M N \to_{l} m @ N M \to_{l} m V N \to_{l} V @ n \lx@proof@logical@and V \in V N \to_{l} n

x \leavevmode ∥ \to_{β_{v}} [x] λ x . M \leavevmode ∥ \to_{β_{v}} [λ x . N] M \leavevmode ∥ \to_{β_{v}} [N]

x \leavevmode ∥ \to_{β_{v}} [x] λ x . M \leavevmode ∥ \to_{β_{v}} [λ x . N] M \leavevmode ∥ \to_{β_{v}} [N]

M N \leavevmode ∥ \to_{β_{v}} [M^{'} N^{'}] \lx@proof@logical@and M \leavevmode ∥ \to_{β_{v}} [M^{'}] N \leavevmode ∥ \to_{β_{v}} [N^{'}]

M N \leavevmode ∥ \to_{β_{v}} [M^{'} N^{'}] \lx@proof@logical@and M \leavevmode ∥ \to_{β_{v}} [M^{'}] N \leavevmode ∥ \to_{β_{v}} [N^{'}]

(λ x . M) W \leavevmode ∥ \to_{β_{v}} [M^{'} [W^{'} / x]] \lx@proof@logical@and M \leavevmode ∥ \to_{β_{v}} [M^{'}] W \leavevmode ∥ \to_{β_{v}} [W^{'}] W value

(λ x . M) W \leavevmode ∥ \to_{β_{v}} [M^{'} [W^{'} / x]] \lx@proof@logical@and M \leavevmode ∥ \to_{β_{v}} [M^{'}] W \leavevmode ∥ \to_{β_{v}} [W^{'}] W value

M \oplus N \leavevmode ∥ \to_{β_{v}} [M^{'} \oplus N^{'}] \lx@proof@logical@and M \leavevmode ∥ \to_{β_{v}} [M^{'}] N \leavevmode ∥ \to_{β_{v}} [N^{'}]

M \oplus N \leavevmode ∥ \to_{β_{v}} [M^{'} \oplus N^{'}] \lx@proof@logical@and M \leavevmode ∥ \to_{β_{v}} [M^{'}] N \leavevmode ∥ \to_{β_{v}} [N^{'}]

x \leavevmode ∥ \to_{d}_{β_{v}} [x] λ x . M \leavevmode ∥ \to_{d}_{β_{v}} [λ x . N] M \leavevmode ∥ \to_{β_{v}} [N]

x \leavevmode ∥ \to_{d}_{β_{v}} [x] λ x . M \leavevmode ∥ \to_{d}_{β_{v}} [λ x . N] M \leavevmode ∥ \to_{β_{v}} [N]

M N \leavevmode ∥ \to_{d}_{β_{v}} [S T] \lx@proof@logical@and M \leavevmode ∥ \to_{d}_{β_{v}} [S] N \leavevmode ∥ \to_{d}_{β_{v}} [T]

M N \leavevmode ∥ \to_{d}_{β_{v}} [S T] \lx@proof@logical@and M \leavevmode ∥ \to_{d}_{β_{v}} [S] N \leavevmode ∥ \to_{d}_{β_{v}} [T]

M \oplus N \leavevmode ∥ \to_{d}_{β_{v}} [S \oplus T] \lx@proof@logical@and M \leavevmode ∥ \to_{β_{v}} [S] N \leavevmode ∥ \to_{β_{v}} [T]

M \oplus N \leavevmode ∥ \to_{d}_{β_{v}} [S \oplus T] \lx@proof@logical@and M \leavevmode ∥ \to_{β_{v}} [S] N \leavevmode ∥ \to_{β_{v}} [T]

\Rightarrow_{d}_{β_{v}} \leavevmode \leavevmode \subseteq \leavevmode \leavevmode \leavevmode ∥ \Rightarrow_{d}_{β_{v}} \leavevmode \leavevmode \subseteq \leavevmode \leavevmode \Rightarrow_{d}_{β_{v}}^{*}

\Rightarrow_{d}_{β_{v}} \leavevmode \leavevmode \subseteq \leavevmode \leavevmode \leavevmode ∥ \Rightarrow_{d}_{β_{v}} \leavevmode \leavevmode \subseteq \leavevmode \leavevmode \Rightarrow_{d}_{β_{v}}^{*}

\begin{array}[]{lcl|lcl}(x)_{\lambda}&=&x&(MN)_{\lambda}&=&(M)_{\lambda}(N)_{\lambda}\\ (M\oplus N)_{\lambda}&=&z(\lambda w.(M)_{\lambda})\lambda w.(N)_{\lambda}&(\lambda x.M)_{\lambda}&=&\lambda x.(M)_{\lambda}\\ \end{array}

\begin{array}[]{lcl|lcl}(x)_{\lambda}&=&x&(MN)_{\lambda}&=&(M)_{\lambda}(N)_{\lambda}\\ (M\oplus N)_{\lambda}&=&z(\lambda w.(M)_{\lambda})\lambda w.(N)_{\lambda}&(\lambda x.M)_{\lambda}&=&\lambda x.(M)_{\lambda}\\ \end{array}

M \to_{l}_{β_{v}}^{*} S \to_{int}_{β_{v}}^{*} N .

M \to_{l}_{β_{v}}^{*} S \to_{int}_{β_{v}}^{*} N .

M \to_{s}_{β_{v}}^{*} S \to_{d}_{β_{v}}^{*} N .

M \to_{s}_{β_{v}}^{*} S \to_{d}_{β_{v}}^{*} N .

n \to \infty lim x \in X \sum f_{n} (x) \leavevmode = \leavevmode x \in X \sum f (x)

n \to \infty lim x \in X \sum f_{n} (x) \leavevmode = \leavevmode x \in X \sum f (x)

n \to \infty lim U \in Obs \sum μ_{n} (U) \leavevmode = \leavevmode U \in Obs \sum ρ (U)

n \to \infty lim U \in Obs \sum μ_{n} (U) \leavevmode = \leavevmode U \in Obs \sum ρ (U)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Lambda Calculus and Probabilistic Computation

(Extended Version)

Claudia Faggian

Université de Paris, IRIF, CNRS, France

Simona Ronchi della Rocca

Dip. di Informatica, Università di Torino, Italy

Abstract

We introduce two extensions of the $\lambda$ -calculus with a probabilistic choice operator, $\Lambda_{\oplus}^{{\mathtt{cbv}}}$ and $\Lambda_{\oplus}^{{\mathtt{cbn}}}$ , modeling respectively call-by-value and call-by-name probabilistic computation. We prove that both enjoys confluence and standardization, in an extended way: we revisit these two fundamental notions to take into account the asymptotic behaviour of terms. The common root of the two calculi is a further calculus based on Linear Logic, $\Lambda_{\oplus}^{!}$ , which allows us to develop a unified, modular approach.

I Introduction

The pervasive role of stochastic models in a variety of domains (such as machine learning, natural language, verification) has prompted a vast body of research on probabilistic programming languages; such a language supports at least discrete distributions by providing an operator which models sampling. In particular, the functional style of probabilistic programming, pioneered by [28], attracts increasing interest because it allows for higher-order computation, and offers a level of abstraction well-suited to deal with mathematical objects. Early work [18, 24, 22, 26, 23] has evolved in a growing body of software development and theoretical research. In this context, the $\lambda$ -calculus has often been used as a core language.

In order to model higher-order probabilistic computation, it is a natural approach to take the $\lambda$ -calculus as general paradigm, and to enrich it with a probabilistic construct. The most simple and concrete way to do so ([10, 8, 13]) is to equip the untyped $\lambda$ -calculus with an operator $\oplus$ , which models flipping a fair coin. This suffices to have universality, as proved in [8], in the sense that the calculus is sound and complete with respect to computable probability distributions. The resulting calculus is however non-confluent, as it has been observed early (see [8] for an analysis). We revise the issue in Example 1. The problem with confluence is handled in the literature by fixing a deterministic reduction strategy, typically the leftmost-outermost strategy. This is not satisfactory both for theoretical and practical reasons, as we discuss later.

In this paper, we propose a more general point of view. Our goal is a foundational calculus, which plays the same role as the $\lambda$ -calculus does for deterministic computation. More precisely, taking the point of view propounded by Plotkin in [25], we discriminate between a calculus and a programming language. The former defines the reduction rules, independently from any reduction strategy, and enjoys confluence and standardization, the latter is specified by a deterministic strategy (an abstract machine). Standardization is what relates the two: the programming language implements the standard strategy associated to the calculus. Indeed, standardization implies the existence of a strategy (the standard strategy) which is guaranteed to reach the result, if it exists.

In this spirit, we consider a probabilistic calculus to be characterized by a specific calling mechanism; the reduction is otherwise only constrained by the need of discriminating between duplicating a function which samples from a distribution, and duplicating the result of sampling. Think of tossing a coin and duplicating the result, versus tossing the coin twice, which is indeed the issue at the core of confluence failure, as the following examples (adapted from [9, 8]) show.

Example 1 (Confluence).

Let us consider the untyped $\lambda$ -calculus extended with a binary operator $\oplus$ which models fair, binary probabilistic choice: $M\oplus N$ reduces to either $M$ or $N$ with equal probability $1/2$ ; we write this as $M\oplus N\rightarrow\{M^{\frac{1}{2}},N^{\frac{1}{2}}\}$ . Intuitively, the result of evaluating a probabilistic term is a distribution on its possible values.

*Consider the term $PQ$ , where $P=(\lambda z.z\mathtt{\leavevmode\nobreak\ XOR\leavevmode\nobreak\ }z)$ , and $Q=(\mathtt{T}\oplus\mathtt{F})$ ; $\mathtt{\leavevmode\nobreak\ XOR\leavevmode\nobreak\ }$ is the standard construct for exclusive $\mathtt{OR}$ , $\mathtt{T}=\lambda xy.x$ and $\mathtt{F}=\lambda xy.y$ code the boolean values. *

– If we first reduce $Q$ , we obtain $(\lambda z.z\mathtt{\leavevmode\nobreak\ XOR\leavevmode\nobreak\ }z)\mathtt{T}$ or $(\lambda z.z\mathtt{\leavevmode\nobreak\ XOR\leavevmode\nobreak\ }z)\mathtt{F}$ , with equal probability $1/2$ . This way, $PQ$ evaluates to $\{\mathtt{F}^{1}\}$ , i.e. $\mathtt{F}$ with probability $1$ .

*– If we reduce the outermost redex first, $PQ$ reduces to $(\mathtt{T}\oplus\mathtt{F})\mathtt{\leavevmode\nobreak\ XOR\leavevmode\nobreak\ }(\mathtt{T}\oplus\mathtt{F})$ , and the term evaluates to the distribution $\{\mathtt{T}^{\frac{1}{2}},\mathtt{F}^{\frac{1}{2}}\}$ . *

The two resulting distributions are not even comparable. 2. 2.

The same phenomenon appears even if we restrict ourselves to call-by-value. Consider for example the reductions of $PN$ with $P$ as in 1), and $N=(\lambda xy.x\oplus y)$ . We obtain the same two different distributions as above.

In this paper, we define two probabilistic $\lambda$ -calculi, respectively based on the call-by-value (CbV) and call-by-name (CbN) calling mechanism. Both enjoy confluence and standardization, in an extended way: indeed we revisit these two fundamental notions to take into account the asymptotic behaviour of terms. The common root of the two calculi is a further calculus based on Linear Logic, which is an extension of Simpson’s linear $\lambda$ -calculus [30], and which allows us to develop a unified, modular approach.

Content and Contributions

In Section IV, we introduce a call-by-value calculus, denoted $\Lambda_{\oplus}^{\mathtt{cbv}}$ , as a probabilistic extension of the call-by-value $\lambda$ -calculus of Plotkin (where the $\beta$ -reduction fires only in case the argument is a value, i.e. either a variable or a $\lambda$ -abstraction). We choose to study in detail call-by-value for two main reasons. First, it is the most relevant mechanism to probabilistic programming (most of the abstract languages we cited are call-by-value, but also real-world stochastic programs such as Church [16]). Second, call-by-value is a mechanism in which dealing with functions, and duplication of functions, is clean and intuitive, which allows us to address the issue at the core of confluence failure. The definition of value (in particular, a probabilistic choice is not a value) together with a suitable restriction of the evaluation context for the probabilistic choice, allow us to recover key results: confluence and a form of standardization (Section V). Let us recall that, in the classical $\lambda$ -calculus, standardization means that there is a strategy which is complete for all reduction sequences, i.e., for every reduction sequence $M\rightarrow^{*}N$ there is a standard reduction sequence from $M$ to $N$ . A standard reduction sequence with the same property exists also here. An unexpected result is that strategies which are complete in the classical case, are not so here, notably the leftmost strategy.

In Section VI we study the asymptotic behavior of terms. Our leading question is how the asymptotic behaviour of different sequences starting from the same term compare. We first analyze if and in which sense confluence implies that the result of a probabilistically terminating computation is unique. We formalize the notion of asymptotic result via limit distributions, and establish that there is a unique maximal one.

In Section VII we address the question of how to find such greatest limit distribution, a question which arises from the fact that evaluation in $\Lambda_{\oplus}^{\mathtt{cbv}}$ is non-deterministic, and different sequences may terminate with different probability. With this aim, we extend the notion of standardization to limits; this extension is non-trivial, and demands the development of new sophisticated proof methods.

We prove that the new notion of standardization supplies a family of complete reduction strategies which are guaranteed to reach the unique maximal result. Remarkably, we are able to show that, when evaluating programs, i.e., closed terms, this family does include the leftmost strategy. As we have already observed, this is the deterministic strategy which is typically adopted in the literature, in either its call-by-value ([18, 7]) or its call-by-name version ([10, 13]), but without any completeness result with respect to probabilistic computation. Our result offers an “a posteriori” justification for its use!

The study of $\Lambda_{\oplus}^{\mathtt{cbv}}$ allows us to develop a crisp approach, which we are then able to use in the study of different probabilistic calculi. Because the issue of duplication is central, it is natural to expect a benefit from the fine control over copies which is provided by Linear Logic. In Section IX we use our tools to introduce and study a probabilistic linear $\lambda$ -calculus, $\Lambda_{\oplus}^{!}$ . The linear calculus provides not only a finer control on duplication, but also a modular approach to confluence and standardization, which allow us to formalize a call-by-name version of our calculus, namely $\Lambda_{\oplus}^{{\mathtt{cbn}}}$ , in Section X. We prove that $\Lambda_{\oplus}^{{\mathtt{cbn}}}$ enjoys properties analogous to those of $\Lambda_{\oplus}^{\mathtt{cbv}}$ , in particular confluence and standardization.

In Section II we provide the reader with some background and motivational observations. Basic notions of discrete probability and rewriting are reviewed in Section III.

Related Work

The idea of extending the $\lambda$ -calculus with a probabilistic construct is not new; without any ambition to be exhaustive, let us cite [22, 26], [10, 13, 8, 5]. In all these cases, a specific reduction strategy is fixed; they are indeed languages, not calculi, according to Plotkin’s distinction.

The issue about confluence appears every time the $\lambda$ -calculus is extended with a choice effect: quantum, algebraic, non-deterministic. The ways of framing the same problem in different settings are naturally related, and we were inspired by them. Confluence for an algebric calculus is dealt with in [1] for the call-by-value, and in [31] for the call-by-name. In the quantum case we would like to cite [7, 6], which are based on Simpson’s calculus [30]. A probabilistic extension of Simpson’s calculus was first proposed in [11]. The language is similar to that of $\Lambda_{\oplus}^{!}$ ; however in [11] (as also in [7, 6]) no reduction (not even $\beta$ ) is allowed in the scope of a !-operator. The reduction there hence corresponds to surface reduction, which in Sec. IX we show to be the standard strategy for $\Lambda_{\oplus}^{!}$ .

To our knowledge, the only proposal of a probabilistic $\lambda$ -calculus in which the reduction is independent from a specific strategy is for call-by-name, namely the calculus of [19], in the line of work of differential [14] and algebric [31] $\lambda$ -calculus. The focus in [19] is essentially semantical, as the author want to study an equational theory for the $\lambda$ -calculus, based on an extension of Böhm trees. [19] develops results which in their essence are similar to those we obtain for call-by-name in Sec. X, in particular confluence and standardization, even if his calculus –which internalizes the probabilistic behavior– is quite different from ours, and so are the proof techniques.

Finally, we wish to mention that proposals of a probabilistic $\lambda$ -calculus could also be extracted from semantical models, such as the one in [3], which develops an idea earlier presented in [29], and in which the notion of graph models for $\lambda$ -calculus has been extended with a probabilistic construct.

II Background and Motivational observations

In this section, we first review -in a non-technical way- the specific features of probabilistic programs, and how they differ from classical ones. We then focus on some motivational observations which are relevant to our work. First, we give an example of features which are lost if a programming language is characterized by a strategy which is not rooted in a more general calculus. Then, we illustrate some of the issues which appear when we study a general calculus, instead of a specific reduction strategy. Addressing these issues will lead us to develop new notions and tools.

II-A Classical vs. Probabilistic Programs

A classical program defines a deterministic input-output relation; it terminates (on a given input), or does not; if it terminates, the program only runs for a finite number of steps. Instead, a probabilistic program generates a probability distribution over possible outputs; it terminates (on a given input) with a certain probability; it may have runs which take infinitely many steps even when termination has probability $1$ .

A probabilistic program is a stochastic model. The intuition is that the probabilistic program $P$ is executed, and random choices are made by sampling; this process defines a distribution over all the possible outputs of $P$ . Even if the termination probability is $1$ (almost sure termination), that degree of certitude is typically not reached in any finite number of steps, but it appears as a limit. A standard example is a term $M$ which reduces to either the normal form $\mathtt{T}$ or $M$ itself, with equal probability $1/2$ . After $n$ steps, $M$ reduces to $\mathtt{T}$ with probability $\frac{1}{2}+\frac{1}{2^{2}}+\dots+\frac{1}{2^{n}}$ . Only at the limit this computation terminates with probability $1$ .

Probabilistic vs. Quantitative

The notion of probabilistic termination is what sets apart probabilistic $\lambda$ -calculus from other quantitative calculi such as those in [1, 14, 31], and from the non-deterministic $\lambda$ -calculus [9]. For this reason, the asymptotic behaviour of terms will be the focus of this paper.

II-B Confluence of the calculus is relevant to programming

Functional languages have their foundation in the $\lambda$ -calculus and its properties, and such properties (notably, confluence and standardization) have theoretical and practical implications. A strength of classical functional languages -which is assuming growing importance- is that they are inherently parallel (we refer e.g. to [21] for discussion on deterministic parallel programming): every sub-expression can be evaluated in parallel, because of referential transparency; still, we can perform reasoning, testing and debugging on a program using a sequential model, because the result of the calculus is independent from the evaluation order. Not to force a sequential strategy impacts the implementation of the language, but also the conception of programs. As advocated by Harper, the parallelism of functional languages exposes the

“dependency structure of the computation by not introducing any dependencies that are not forced on us by the nature of the computation itself."

This feature of functional languages is rooted in the confluence of the $\lambda$ -calculus, and is an example of what is lost in the probabilistic setting, if we give-up either confluence, or the possibility of non-deterministic evaluation.

II-C The result of probabilistic computation

A ground for our approach is the distinction between calculus and language. Some of the issues which we will need to address do not appear when working with probabilistic languages, because they are based on a simplification of the $\lambda$ -calculus. Programming languages only evaluate programs, i.e., closed terms (without free variables). A striking simplification appears from another crucial restriction, weak evaluation, which does not evaluate function bodies (the scope of $\lambda$ -abstractions). In weak call-by-value (base of the ML/CAML family of probabilistic languages) values are normal forms.

What is the result of a probabilistic computation is well understood only in the case of programming languages: the result of a program is a distribution on its possible outcomes, which are normal forms w.r.t. a chosen strategy. In the literature of probabilistic $\lambda$ -calculus, two main deterministic strategies have been studied: weak left strategy in CbV [8] and head strategy in CbN [13], whose normal forms are respectively the closed values and the head normal forms.

When considering a calculus instead of a language, the identity between normal forms and results does not hold anymore, with important consequences in the definition of limit distributions. We investigate this issue in Sec. VI. The approach we develop is general and uniform to all our calculi.

III Technical Preliminaries

We review basic notions on discrete probability and rewriting which we use through the paper. We assume that the reader has some familiarity with the $\lambda$ -calculus.

III-A Basics on Discrete Probability

A discrete probability space is given by a pair $(\Omega,\mu)$ , where $\Omega$ is a countable set, and $\mu$ is a discrete probability distribution on $\Omega$ , i.e. is a function from $\Omega$ to $[0,1]\subset\mathbb{R}$ such that $\|\mu\|:=\sum_{\omega\in\Omega}\mu(\omega)=1$ . In this case, a probability measure is assigned to any subset $\mathcal{A}\subseteq\Omega$ as $\mu(\mathcal{A})=\sum_{\omega\in\mathcal{A}}\mu(\omega)$ . In the language of probability theory, a subset of $\Omega$ is called an event.

Let $(\Omega,\mu)$ be as above. Any function $F:\Omega\to\Delta$ , where $\Delta$ is another countable set, induces a probability distribution $\mu^{F}$ on $\Delta$ by composition: $\mu^{F}(d\in\Delta):=\mu(F^{-1}(d))$ i.e. $\mu\{\omega\in\Omega:F(\omega)=d\}$ . In the language of probability theory, $F$ is called a discrete random variable on $(\Omega,\mu)$ .

Example 2 (Die).

Consider tossing a die once. The space of possible outcomes is the set $\Omega=\{1,2,3,4,5,6\}$ . The probability measure $\mu$ of each outcome is $1/6$ . The event “result is odd" is the subset $\mathcal{O}=\{1,3,5\}$ , whose probability measure is $\mu(\mathcal{O})=1/2$ . 2. 2.

Let $\Delta$ be a set with two elements $\{\texttt{Even},\texttt{Odd}\}$ , and $F$ the obvious function from $\Omega$ to $\Delta$ . $F$ induces a distribution on $\Delta$ , with $\mu^{F}(\texttt{Even})=1/2$ and $\mu^{F}(\texttt{Odd})=1/2$ .

III-B Subdistributions and $\boldsymbol{\mathtt{DST}(\Omega)}$

Given a countable set $\Omega$ , a function $\mu:\Omega\to[0,1]$ is a probability subdistribution if $\|\mu\|\leq 1$ . We write $\mathtt{DST}(\Omega)$ for the set of subdistributions on $\Omega$ . With a slight abuse of language, we will use the term distribution also for subdistribution. Subdistributions allow us to deal with partial results and non-successful computations.

Order: $\mathtt{DST}(\Omega)$ is equipped with the standard order relation of functions : $\mu\leq\rho$ if $\mu(\omega)\leq\rho(\omega)$ for each $\omega\in\Omega$ .

Support: The support of $\mu$ is $\mathit{Supp}(\mu)=\{\omega:\mu(\omega)>0\}$ .

Representation: We represent a distribution by explicitly indicating the support, and (as superscript) the probability assigned to each element by $\mu$ . We write $\mu=\{a_{0}^{p_{0}},\dots,a_{n}^{p_{n}}\}$ if $\mu(a_{0})=p_{0},\dots,\mu(a_{n})=p_{n}$ and $\mu(a_{j})=0$ otherwise.

III-C Multidistributions

To syntactically represent the global evolution of a probabilistic system, we rely on the notion of multidistribution [2].

A multiset is a (finite) list of elements, modulo reordering, i.e. $\boldsymbol{[}a,b,a\boldsymbol{]}=\boldsymbol{[}a,a,b\boldsymbol{]}\not=\boldsymbol{[}a,b\boldsymbol{]}$ ; the multiset $\boldsymbol{[}a,a,b\boldsymbol{]}$ has three elements. Let $\mathcal{X}$ be a countable set and $\mathtt{m}$ a multiset of pairs of the form $pM$ , with $p\in]0,1]$ , and $M\in\mathcal{X}$ . We call $\mathtt{m}=\boldsymbol{[}p_{i}M_{i}\mid i\in I\boldsymbol{]}$ (where the index set $I$ ranges over the elements of $\mathtt{m}$ ) a multidistribution on $\mathcal{X}$ if $\sum_{i\in I}p_{i}\leq 1$ . We denote by $\mathtt{MDST}(\mathcal{X})$ the set of all multidistributions on $\mathcal{X}$ .

We write the multidistribution $\boldsymbol{[}1M\boldsymbol{]}$ simply as $\boldsymbol{[}M\boldsymbol{]}$ . The sum of multidistributions is denoted by $+$ , and it is the concatenation of lists. The product $q\cdot\mathtt{m}$ of a scalar $q$ and a multidistribution $\mathtt{m}$ is defined pointwise: $q\cdot\boldsymbol{[}p_{1}M_{1},...,p_{n}M_{n}\boldsymbol{]}=\boldsymbol{[}(qp_{1})M_{1},...,(qp_{n})M_{n}\boldsymbol{]}$ .

Intuitively, a multidistribution $\mathtt{m}\in\mathtt{MDST}(\mathcal{X})$ is a syntactical representation of a discrete probability space where at each element of the space is associated a probability and a term of $\mathcal{X}$ . To the multidistribution $\mathtt{m}=\boldsymbol{[}p_{i}M_{i}\mid i\in I\boldsymbol{]}$ we associate a probability distribution $\mu\in\mathtt{DST}(\mathcal{X})$ as follows:

[TABLE]

and we call $\mu$ the probability distribution associated to $\mathtt{m}$ .

Example 3 (Distribution vs. multidistribution).

If $\mathtt{m}=\boldsymbol{[}\frac{1}{2}a,\frac{1}{2}a\boldsymbol{]}$ , then $\mu=\{a^{1}\}$ . Please observe the difference between distribution and multidistribution: if $\mathtt{m}^{\prime}=\boldsymbol{[}1a\boldsymbol{]}$ , then $\mathtt{m}\not=\mathtt{m}^{\prime}$ , but $\mu=\mu^{\prime}$ .

III-D Binary relations (notations and basic definitions)

Let $\rightarrow_{r}$ be a binary relation on a set $\mathcal{X}$ . We denote $\rightarrow_{r}^{*}$ its reflexive and transitive closure. We denote $=_{r}$ the reflexive, symmetric and transitive closure of $\rightarrow_{r}$ . If $u\in\mathcal{X}$ , we write $u\not\rightarrow_{r}$ if there is no $t\in\mathcal{X}$ such that $u\rightarrow_{r}t$ ; in this case, $u$ is in $\rightarrow_{r}$ -normal form. Figures convention: as is standard, in the figures we depict $\rightarrow^{*}$ as $\twoheadrightarrow$ ; solid arrows are universally quantified, dashed arrows are existentially quantified.

Confluence and Commutation

Let $r,s,t,u\in{\mathcal{X}}$ . The relations $\rightarrow_{1}$ and $\rightarrow_{2}$ on $\mathcal{X}$ commute if ( $r\rightarrow^{*}_{1}s$ and $r\rightarrow^{*}_{2}t$ ) imply there is $u$ such that ( $s\rightarrow^{*}_{2}u$ and $r_{3}\rightarrow^{*}_{1}u$ ); they diamond-commute ( $\diamond$ -commute) if ( $r\rightarrow_{1}s$ and $r\rightarrow_{2}t$ ) imply there is $u$ such that ( $s\rightarrow_{2}u$ and $t\rightarrow_{1}u$ ). The relation $\rightarrow$ is confluent (resp. diamond) if it commutes (resp. $\diamond$ -commutes) with itself. It is well known that $\diamond$ -commutation implies commutation, and diamond implies confluence.

IV Call-by-Value calculus $\Lambda_{\oplus}^{\mathtt{cbv}}$

We define $\Lambda_{\oplus}^{\mathtt{cbv}}$ , a CbV probabilistic $\lambda$ -calculus.

IV-A Syntax of $\Lambda_{\oplus}^{\mathtt{cbv}}$

IV-A1 The language

Terms and values are generated respectively by the grammars:

$\begin{array}[]{lcllr}M,N,P,Q&::=&x\mid\lambda x.M\mid MM\mid M\oplus M&(\textbf{terms }\Lambda_{\oplus})\\ V,W&::=&x\mid\lambda x.M&(\textbf{values }\mathcal{V})\\ \end{array}$

where $x$ ranges over a countable set of variables (denoted by $x,y,\dots$ ). $\Lambda_{\oplus}$ and $\mathcal{V}$ denote respectively the set of terms and of values. Free variables are defined as usual. $M[N/x]$ denotes the term obtained by capture-avoiding substitution of $N$ for each free occurrence of $x$ in $M$ .

Contexts ( ${\bf C}$ ) and surface contexts ( $\bm{S}$ ) are generated by the grammars:

$\begin{array}[]{lcllr}{\bf C}&::=&\square\mid M{\bf C}\mid{\bf C}M\mid\lambda x.{\bf C}\mid{\bf C}\oplus M\mid M\oplus{\bf C}&(\textbf{contexts})\\ \bm{S}&::=&\square\mid M\bm{S}\mid\bm{S}M&(\textbf{surface contexts})\end{array}$

where $\square$ denotes the hole of the term context. Given a term context ${\bf C}$ , we denote by ${\bf C}(M)$ the term obtained from ${\bf C}$ by filling the hole with $M$ , allowing the capture of free variables. All surface contexts are contexts. Since the hole will be filled with a redex, surface contexts formalize the fact that the redex (the hole) is not in the scope of a $\lambda$ -abstraction, nor of a $\oplus$ .

$\mathtt{MDST}(\Lambda_{\oplus})$ denotes the set of multi-distributions on $\Lambda_{\oplus}$ .

IV-A2 Reductions

We first define reduction rules on terms (Fig. 1), and one-step reduction from terms to multidistributions (Fig. 2). We then lift the definition of reduction to a binary relation on $\mathtt{MDST}(\Lambda_{\oplus})$ .

Observe that, usually, a reduction step is given by the closure under context of the reduction rules. However, to define a reduction from term to term is not informative enough, because we still have to account for the probability. The meaning of $M\oplus N$ is that this term reduces to either $M$ or $N$ , with equal probability $\frac{1}{2}$ . There are various way to formalize this fact; here, we use multidistributions.

Reduction Rules and Steps

The reduction rules on the terms of $\Lambda_{\oplus}$ are defined in Fig. 1.

The (one-step) reduction relations $\rightarrow_{\beta_{v}},\rightarrow_{\oplus}\subseteq\Lambda_{\oplus}\times\mathtt{MDST}(\Lambda_{\oplus})$ are defined in Fig. 2. Observe that the probabilistic rules $\mapsto_{r\oplus,l\oplus}$ are closed only under surface contexts, while the reduction rule $\mapsto_{\beta_{v}}$ is closed under general context ${\bf C}$ (hence $\Lambda_{\oplus}^{\mathtt{cbv}}$ is a conservative extension of Plotkin’s CbV $\lambda$ -calculus, see IV-B). We denote by $\rightarrow$ the union $\rightarrow_{\beta_{v}}\cup\rightarrow_{\oplus}$ .

Lifting

We lift the reduction relation $\rightarrow\subseteq\Lambda_{\oplus}\times\mathtt{MDST}(\Lambda_{\oplus})$ to a relation $\Rightarrow\subseteq\mathtt{MDST}(\Lambda_{\oplus})\times\mathtt{MDST}(\Lambda_{\oplus})$ , as defined in Fig. 3. Observe that $\Rightarrow$ is a reflexive relation.

We define in the same way the lifting of any relation ${\rightarrow}_{r}\subseteq\Lambda_{\oplus}\times\mathtt{MDST}(\Lambda_{\oplus})$ to a binary relation ${\Rightarrow}_{r}$ on $\mathtt{MDST}(\Lambda_{\oplus})$ . In particular, we lift $\rightarrow_{\beta_{v}},\rightarrow_{\oplus}$ to $\Rightarrow_{\beta_{v}},\Rightarrow_{\oplus}$ .

Reduction sequences

A $\Rightarrow$ -sequence (reduction sequence) from $\mathtt{m}$ is a sequence $\mathtt{m}=\mathtt{m}_{0},\dots,\mathtt{m}_{i},\mathtt{m}_{i+1},\dots$ such that $\mathtt{m}_{i}\Rightarrow\mathtt{m}_{i+1}$ ( $\forall i$ ). We write $\mathtt{m}\Rightarrow^{*}\mathtt{n}$ to indicate that there is a finite sequence from $\mathtt{m}$ to $\mathtt{n}$ , and $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ for an infinite sequence.

$\beta_{v}$ equivalence

We write $=_{\beta_{v}}$ for the transitive, reflexive and symmetric closure of $\Rightarrow_{\beta_{v}}$ ; abusing the notation, we will write $M=_{\beta_{v}}N$ for $\boldsymbol{[}M\boldsymbol{]}=_{\beta_{v}}\boldsymbol{[}N\boldsymbol{]}$ .

Normal Forms

$\mathcal{N}$ denotes the set of $\rightarrow$ -normal forms. Given $\rightarrow_{r}\in\Lambda_{\oplus}\times\mathtt{MDST}(\Lambda_{\oplus})$ , a term $M$ is in $\rightarrow_{r}$ -normal form if $M\not\rightarrow_{r}$ , i.e. there is no $\mathtt{m}$ such that $M\rightarrow_{r}\mathtt{m}$ . It is easy to check that all closed $\rightarrow$ -normal forms are values, however a value is not necessarily a $\rightarrow$ -normal form.

IV-A3 Full Lifting

The definition of lifting allows us to apply a reduction step $\rightarrow$ to any number of $M_{i}$ in the multidistribution $\mathtt{m}=\boldsymbol{[}p_{i}M_{i}\mid i\in I\boldsymbol{]}$ . If no $M_{i}$ is reduced, then $\mathtt{m}\Rightarrow\mathtt{m}$ (the relation $\Rightarrow$ is reflexive). Another important case is when all $M_{i}$ for which a reduction step is possible are indeed reduced. This notion of full reduction, denoted by $\rightrightarrows$ , is defined as follows.

$\boldsymbol{[}M\boldsymbol{]}\rightrightarrows\boldsymbol{[}M\boldsymbol{]}M\not\rightarrow\quad\quad\boldsymbol{[}M\boldsymbol{]}\rightrightarrows\mathtt{m}M\rightarrow\mathtt{m}\quad\quad\boldsymbol{[}p_{i}M_{i}\mid i\in I\boldsymbol{]}\rightrightarrows\sum_{i\in I}{p_{i}\cdot\mathtt{m}_{i}}(\boldsymbol{[}M_{i}\boldsymbol{]}\rightrightarrows\mathtt{m}_{i})_{i\in I}$

Obviously, $\rightrightarrows\subset\Rightarrow$ . Similarly to lifting, also the notion of full lifting can be extended to any reduction. For any ${\rightarrow}_{r}\subseteq\Lambda_{\oplus}\times\mathtt{MDST}(\Lambda_{\oplus})$ , its full lifting is denoted by $\rightrightarrows_{r}\subseteq\mathtt{MDST}(\Lambda_{\oplus})\times\mathtt{MDST}(\Lambda_{\oplus})$ . The relation $\rightrightarrows$ plays an important role in VII.

IV-B $\Lambda_{\oplus}^{\mathtt{cbv}}$ * and the $\lambda$ -calculus*

A comparison between $\Lambda_{\oplus}^{\mathtt{cbv}}$ and the $\lambda$ -calculus is in order.

Let $\Lambda$ be the set of $\lambda$ -terms; we denote by $\Lambda^{\mathtt{cbn}}$ the CbN $\lambda$ -calculus, equipped with the reduction $\rightarrow_{\beta}$ [4], and by $\Lambda^{\mathtt{cbv}}$ the CbV $\lambda$ -calculus, equipped with the reduction $\rightarrow_{\beta_{v}}$ [25].

$\Lambda_{\oplus}^{\mathtt{cbv}}$ is a conservative extension of $\Lambda^{\mathtt{cbv}}$ . A translation $(\cdot)_{\lambda}:\Lambda_{\oplus}\rightarrow\Lambda$ can be defined as follows, where $z$ is a fresh variable which is used by no term:

[TABLE]

The translation is injective (if $(M)_{\lambda}=(N)_{\lambda}$ then $M=N$ ) and preserves values.

Proposition 4 (Simulation).

The translation is sound and complete. Let $M,N\in\Lambda_{\oplus}$ .

$M\rightarrow_{\beta_{v}}N$ * implies $(M)_{\lambda}\rightarrow_{\beta_{v}}(N)_{\lambda}$ ;* 2. 2.

$(M)_{\lambda}\rightarrow_{\beta_{v}}Q$ * implies there is a (unique) $N$ , with $Q=(N)_{\lambda}$ and $M\rightarrow_{\beta_{v}}N$ .*

IV-C Discussion (Surface Contexts)

The notion of surface context which we defined is familiar in the setting of $\lambda$ -calculus: it corresponds to weak evaluation, which we discussed in II-C. In $\Lambda_{\oplus}^{\mathtt{cbv}}$ , the $\rightarrow_{\beta_{v}}$ -reduction is unrestricted. Closing the $\oplus$ -rules under surface context $\bm{S}$ expresses the fact that the $\oplus$ -redex is not reduced under $\lambda$ -abstraction, nor in the scope of another $\oplus$ . The former is fundamental to confluence: it means that a function which samples from a distribution can be duplicated, but we cannot pre-evaluate the sampling. The latter is a technical simplification, which we adopt to avoid unessential burdens with associativity. To require no reduction in the scope of $\oplus$ is very similar to allow no reduction in the branches of an if-then-else.

V Confluence and Standardization

V-A Confluence

We prove that $\Lambda_{\oplus}^{\mathtt{cbv}}$ is confluent. We modularize the proof using the Hindley-Rosen lemma. The notions of commutation and $\diamond$ -commutation which we use are reviewed in Sec. III-D.

Lemma (Hindley-Rosen).

Let $\rightarrow_{1}$ and $\rightarrow_{2}$ be binary relations on the same set $\mathcal{R}$ . Their union $\rightarrow_{1}\cup\rightarrow_{2}$ is confluent if both $\rightarrow_{1}$ and $\rightarrow_{2}$ are confluent, and $\rightarrow_{1}$ and $\rightarrow_{2}$ commute.

The following criterion allows us to work pointwise in proving commutation and confluence of binary relations on multidistributions, namely $\Rightarrow_{\beta_{v}}$ and $\Rightarrow_{\oplus}$ .

Lemma 5 (Pointwise Criterion).

Let $\rightarrow_{o},\rightarrow_{b}\subseteq\Lambda_{\oplus}\times\mathtt{MDST}(\Lambda_{\oplus})$ and $\Rightarrow_{o},\Rightarrow_{b}$ their lifting (as defined in IV-A2). Property () below implies that $\Rightarrow_{o},\Rightarrow_{b}$ $\diamond$ -commute.*

() If $M\rightarrow_{b}\mathtt{n}$ and $M\rightarrow_{o}\mathtt{s},$ then $\exists\mathtt{r}$ s.t. $\mathtt{n}\Rightarrow_{o}\mathtt{r}$ and $\mathtt{s}\Rightarrow_{b}\mathtt{r}$ .*

Proof.

We prove that () $\mathtt{m}\Rightarrow_{b}\mathtt{n}$ and $\mathtt{m}\Rightarrow_{o}\mathtt{s}$ imply exists $\mathtt{r}$ s.t. $\mathtt{n}\Rightarrow_{o}\mathtt{r}$ and $\mathtt{s}\Rightarrow_{b}\mathtt{r}$ . Let $\mathtt{m}=\boldsymbol{[}p_{i}M_{i}\mid i\in I\boldsymbol{]}$ . By definition of lifting, for each $M_{i}$ , we have $\boldsymbol{[}M_{i}\boldsymbol{]}\Rightarrow_{b}\mathtt{n}_{i}$ and $\boldsymbol{[}M_{i}\boldsymbol{]}\Rightarrow_{o}\mathtt{s}_{i}$ , with $\mathtt{n}=\sum p_{i}\cdot\mathtt{n}_{i}$ and $\mathtt{s}=\sum p_{i}\cdot\mathtt{s}_{i}$ . It is easily checked, that for each $M_{i}$ , it exists $\mathtt{r}_{i}$ s.t. $\mathtt{n}_{i}\Rightarrow_{o}\mathtt{r}_{i}$ and $\mathtt{s}_{i}\Rightarrow_{b}\mathtt{r}_{i}$ . If either $\boldsymbol{[}M_{i}\boldsymbol{]}\Rightarrow_{b}\mathtt{n}_{i}$ or $\boldsymbol{[}M_{i}\boldsymbol{]}\Rightarrow_{o}\mathtt{s}_{i}$ uses reflexivity (rule $L1$ ), it is immediate to obtain $r_{i}$ . Otherwise, $\mathtt{r}_{i}$ is given by property (*). Hence $\mathtt{r}=\sum_{i}p_{i}\cdot\mathtt{r}_{i}$ satisfies (). ∎

We derive confluence of $\Rightarrow_{\beta_{v}}$ from the same property in the CbV $\lambda$ -calculus [25, 27], using the simulation of Prop. 4.

Lemma 6.

The reduction $\Rightarrow_{\beta_{v}}$ is confluent.

Proof.

Assume $\mathtt{m}\Rightarrow_{\beta_{v}}^{*}\mathtt{n}$ and $\mathtt{m}\Rightarrow_{\beta_{v}}^{*}\mathtt{s}$ . We first observe that if $\mathtt{m}=\boldsymbol{[}p_{i}M_{i}\mid i\in I\boldsymbol{]}$ , then $\mathtt{n}$ and $\mathtt{s}$ are respectively of the shape $\boldsymbol{[}p_{i}N_{i}\mid i\in I\boldsymbol{]}$ , $\boldsymbol{[}p_{i}S_{i}\mid i\in I\boldsymbol{]}$ , with $M_{i}\rightarrow_{\beta_{v}}^{*}N_{i}$ and $M_{i}\rightarrow_{\beta_{v}}^{*}S_{i}$ . By Prop. 4, we can project such reduction sequences on $\Lambda^{\mathtt{cbv}}$ , obtaining that for each $i\in I$ , $(M_{i})_{\lambda}\rightarrow_{\beta_{v}}^{*}(N_{i})_{\lambda}$ and $(M_{i})_{\lambda}\rightarrow_{\beta_{v}}^{*}(S_{i})_{\lambda}$ . Since $\rightarrow_{\beta_{v}}$ in CbV $\lambda$ -calculus is confluent, there are $R_{i}\in\Lambda$ such that $(N_{i})_{\lambda}\rightarrow_{\beta_{v}}^{*}R_{i}$ and $(S_{i})_{\lambda}\rightarrow_{\beta_{v}}^{*}R_{i}$ . By Prop. 4.2, for each $i\in I$ there is a unique $T_{i}\in\Lambda_{\oplus}$ such that $(T_{i})_{\lambda}=R_{i}$ , and the proof is given. ∎

We prove that the reduction $\Rightarrow_{\oplus}$ is diamond, i.e., the reduction diagram closes in one step.

Lemma 7.

The reduction $\Rightarrow_{\oplus}$ is diamond.

Proof.

We prove that if $M\rightarrow_{\oplus}\mathtt{n}$ and $M\rightarrow_{\oplus}\mathtt{s}$ , then $\exists\mathtt{r}$ such that $\mathtt{n}\Rightarrow_{\oplus}\mathtt{r}$ and $\mathtt{s}\Rightarrow_{\oplus}\mathtt{r}$ . The claim then follows by Lemma 5, by taking $\rightarrow_{o}\leavevmode\nobreak\ =\leavevmode\nobreak\ \rightarrow_{b}\leavevmode\nobreak\ =\leavevmode\nobreak\ \rightarrow_{\oplus}$ . Let $M=\bm{S}(P\oplus Q)=\bm{S}^{\prime}(P^{\prime}\oplus Q^{\prime})$ , $\mathtt{n}=\boldsymbol{[}\frac{1}{2}\bm{S}(P),\frac{1}{2}\bm{S}(Q)\boldsymbol{]}$ and $\mathtt{s}=\boldsymbol{[}\frac{1}{2}\bm{S}^{\prime}(P^{\prime}),\frac{1}{2}\bm{S}^{\prime}(Q^{\prime})\boldsymbol{]}$ . Because of definition of surface context, the two $\oplus$ -redexes do not overlap: $P^{\prime}\oplus Q^{\prime}$ is a subterm of $\bm{S}$ and $P\oplus Q$ is a subterm of $\bm{S}^{\prime}$ . Hence we can reduce those redexes in $\bm{S}$ and $\bm{S}^{\prime}$ , to obtain $\mathtt{r}$ . ∎

We prove commutation of $\Rightarrow_{\oplus}$ and $\Rightarrow_{\beta_{v}}$ by proving a stronger property: they $\diamond$ -commute.

Lemma 8.

The reductions $\Rightarrow_{\beta_{v}}$ and $\Rightarrow_{\oplus}$ $\diamond$ -commute.

Proof.

By using Lemma 5, we only need to prove that if $M\rightarrow_{\beta_{v}}\mathtt{n}$ and $M\rightarrow_{\oplus}\mathtt{s}$ , then $\exists\mathtt{r}$ such that $\mathtt{n}\Rightarrow_{\oplus}\mathtt{r}$ and $\mathtt{s}\Rightarrow_{\beta_{v}}\mathtt{r}$ . The proof is by induction on $M$ . Cases $M=x$ and $M=\lambda x.P$ are not possible given the hypothesis.

Case $M=P\oplus Q$ . $M$ is the only possible $\oplus$ -redex. Assume the $\beta_{v}$ -redex is inside $P$ (the other case is similar), and that $P\oplus Q\rightarrow_{\beta_{v}}\boldsymbol{[}P^{\prime}\oplus Q\boldsymbol{]}$ , $P\oplus Q\rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}P,\frac{1}{2}Q\boldsymbol{]}$ . It is immediate that $\mathtt{r}=\boldsymbol{[}\frac{1}{2}P^{\prime},\frac{1}{2}Q\boldsymbol{]}$ satisfies the claim. 2. 2.

Case $M=PQ$ . $M$ cannot have the form $(\lambda x.P^{\prime})V$ because neither $P$ nor $Q$ could contain a $\oplus$ -redex.

(a)

Assume that the $\beta_{v}$ -redex is inside $P$ , and the $\oplus$ -redex inside $Q$ . We have $PQ\rightarrow_{\beta_{v}}\boldsymbol{[}P^{\prime}Q\boldsymbol{]}$ (with $P\rightarrow_{\beta_{v}}P^{\prime}$ ), $PQ\rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}PQ^{\prime},\frac{1}{2}PQ^{\prime\prime}\boldsymbol{]}$ (with $Q\rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}Q^{\prime},\frac{1}{2}Q^{\prime\prime}\boldsymbol{]}$ ). It is immediate that $\mathtt{r}=\boldsymbol{[}\frac{1}{2}P^{\prime}Q^{\prime},\frac{1}{2}P^{\prime}Q^{\prime\prime}\boldsymbol{]}$ satisfies the claim. The symmetric case is similar. 2. (b)

Assume that both redexes are inside $Q$ . Let us write $M$ as $\bm{S}(Q)$ . Assume $Q\rightarrow_{\beta_{v}}\boldsymbol{[}N\boldsymbol{]}$ , $Q\rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}Q^{\prime},\frac{1}{2}Q^{\prime\prime}\boldsymbol{]}$ , therefore $\bm{S}(Q)\rightarrow_{\beta_{v}}\boldsymbol{[}\bm{S}(N)\boldsymbol{]}=\mathtt{n}$ and $\bm{S}(Q)\rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}\bm{S}(Q^{\prime}),\frac{1}{2}\bm{S}(Q^{\prime\prime})\boldsymbol{]}=\mathtt{s}$ . We use the inductive hypothesis on $Q$ to obtain $\mathtt{r}^{\prime}=\boldsymbol{[}\frac{1}{2}R^{\prime},\frac{1}{2}R^{\prime\prime}\boldsymbol{]}$ such that $\boldsymbol{[}N\boldsymbol{]}\Rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}R^{\prime},\frac{1}{2}R^{\prime\prime}\boldsymbol{]}$ , $\boldsymbol{[}Q^{\prime}\boldsymbol{]}\Rightarrow_{\beta_{v}}\boldsymbol{[}R^{\prime}\boldsymbol{]}$ , $\boldsymbol{[}Q^{\prime\prime}\boldsymbol{]}\Rightarrow_{\beta_{v}}\boldsymbol{[}R^{\prime\prime}\boldsymbol{]}$ . We conclude that for $\mathtt{r}=\boldsymbol{[}\frac{1}{2}\bm{S}(R^{\prime}),\frac{1}{2}\bm{S}(R^{\prime\prime})\boldsymbol{]}$ , it holds that $\mathtt{n}\Rightarrow_{\oplus}\mathtt{r}$ and $\mathtt{s}\Rightarrow_{\beta_{v}}\mathtt{r}$ .

∎

Theorem 9.

The reduction $\Rightarrow$ is confluent.

Proof.

By Hindley-Rosen, from Lemmas 8, 6, and 7. ∎

Let us call $\mathtt{n}$ an $\mathcal{N}$ -multidistribution if $\mathtt{n}\in\mathtt{MDST}(\mathcal{N})$ i.e. $\mathtt{n}=\boldsymbol{[}p_{i}M_{i}\boldsymbol{]}$ and all $M_{i}$ are $\rightarrow$ -normal forms. The following fact is an immediate consequence of confluence:

Fact.

The $\mathcal{N}$ -multidistribution to which $\mathtt{m}$ reduces, if any, is unique.

V-A1 Discussion

While immediate, the above fact is hardly useful, for two reasons. First, we know that probabilistic termination is not necessarily reached in a finite number of steps; the relevant notion is not that $\mathtt{m}\Rightarrow^{*}\mathtt{n}$ $\in\mathtt{MDST}(\mathcal{N})$ , but rather that of a distribution which is defined as limit by the sequence $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ . Secondly, in Plotkin’s CbV calculus the result of computation is formalized by the notion of value, and considering normal forms as values is unsound ([25], page 135). In Section VI-B we introduce a suitable notion of limit distribution, and study the implications of confluence on it.

V-B A Standardization Property

In this section, we first introduce surface and left reduction as strategies for $\Rightarrow$ . In the setting of the CbV $\lambda$ -calculus, the former corresponds to weak reduction, the latter to the standard strategy originally defined in [25]. We then establish a standardization result, namely that every finite $\Rightarrow$ -sequence can be partially ordered as a sequence in which all surface reductions are performed first. A counterexample shows that in $\Lambda_{\oplus}^{\mathtt{cbv}}$ , a standardization result using left reduction fails.

V-B1 Surface and Left Reduction

We remind the reader that in the $\lambda$ -calculus, a deterministic strategy defines a function from terms to redexes, associating to every term the next redex to be reduced. More generally, we call reduction strategy for $\rightarrow$ a reduction relation $\rightarrow_{a}$ such that $\rightarrow_{a}\subseteq\rightarrow$ . The notion of strategy can be easily formalized through the notion of context. With this in mind, let us consider surface and left contexts.

•

Surface contexts $\bm{S}$ have been defined in Sec.IV-A1.

•

Left contexts ${\bf L}$ are defined by the following grammar:

[TABLE]

Note that in particular a left contexts is a surface context.

•

We call surface reduction, denoted by $\overset{{}_{\textsf{s}}}{\rightarrow}$ (with lifting $\overset{{}_{\textsf{s}}}{\Rightarrow}$ ) and left reduction, denoted by $\overset{{}_{\textsf{l}}}{\rightarrow}$ (with lifting $\overset{{}_{\textsf{l}}}{\Rightarrow}$ ), the closure of the reduction rules in Fig. 1 under surface contexts and left contexts, respectively. It is clear that $\overset{{}_{\textsf{s}}}{\rightarrow}\leavevmode\nobreak\ =\leavevmode\nobreak\ \overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}\cup\rightarrow_{\oplus}$ . Observe that $\overset{{}_{\textsf{l}}}{\rightarrow}\subsetneq\overset{{}_{\textsf{s}}}{\rightarrow}$ .

•

A reduction step $M\rightarrow\mathtt{m}$ is deep, written $M\overset{{{}_{\textsf{d}}}}{\rightarrow}\mathtt{m}$ , if it is not a surface step. A reduction step is internal (written $M\overset{{{}_{\textsf{int}}}}{\rightarrow}\mathtt{m}$ ) if it is not a left step. Observe that $\overset{{{}_{\textsf{d}}}}{\rightarrow}\subset\overset{{{}_{\textsf{int}}}}{\rightarrow}$ .

Example 10.

•

( $\overset{{}_{\textsf{l}}}{\rightarrow}\subsetneq\overset{{}_{\textsf{s}}}{\rightarrow}$ ) Let $M=x(II)(II)$ , where $I=\lambda x.x$ . Then $M\overset{{}_{\textsf{s}}}{\rightarrow}\boldsymbol{[}xI(II)\boldsymbol{]}$ and $M\overset{{}_{\textsf{s}}}{\rightarrow}\boldsymbol{[}x(II)I\boldsymbol{]}$ ; instead, $M\overset{{}_{\textsf{l}}}{\rightarrow}\boldsymbol{[}xI(II)\boldsymbol{]}$ , $M\not\overset{{}_{\textsf{l}}}{\rightarrow}\boldsymbol{[}x(II)I\boldsymbol{]}$ .

•

( $\overset{{{}_{\textsf{d}}}}{\rightarrow}\subsetneq\overset{{{}_{\textsf{int}}}}{\rightarrow}$ ) Let $M=(\lambda x.II)(II)$ . Then $M\overset{{{}_{\textsf{int}}}}{\rightarrow}(\lambda x.I)(II)$ and $M\overset{{{}_{\textsf{int}}}}{\rightarrow}(\lambda x.II)I$ , while $M\overset{{{}_{\textsf{d}}}}{\rightarrow}(\lambda x.I)(II)$ and $M\not\overset{{{}_{\textsf{d}}}}{\rightarrow}(\lambda x.II)I$

Intuitively, left reduction chooses the leftmost of the surface redexes. More precisely, this is the case for closed terms (for example, the term $(xx)(II)$ has a $\overset{{}_{\textsf{s}}}{\rightarrow}$ -step, but no $\overset{{}_{\textsf{l}}}{\rightarrow}$ -step).

Surface Normal Forms: We denote by $\mathcal{S}^{\mathtt{cbv}}$ the set of $\overset{{}_{\textsf{s}}}{\rightarrow}$ -normal forms. We observe that all values are surface normal forms (but the converse does not hold): $\mathcal{V}\subsetneq\mathcal{S}^{\mathtt{cbv}}$ (and $\mathcal{N}\subsetneq\mathcal{S}^{\mathtt{cbv}}$ ). The situation is different if we restrict ourselves to close term, in fact the following result holds, which is easy to check.

Lemma 11.

If $M$ is a closed term, the following three are equivalent:

$M$ * is a $\overset{{}_{\textsf{s}}}{\rightarrow}$ -normal form;* 2. 2.

$M$ * is a $\overset{{}_{\textsf{l}}}{\rightarrow}$ -normal form;* 3. 3.

$M$ * is a value.*

V-B2 Finitary Surface Standardization

The next theorem proves a standardization result, in the sense that every finite reduction sequence can be (partially) ordered in a sequence of surface steps followed by a sequence of deep steps.

Theorem 12 (Finitary Surface Standardization).

In $\Lambda_{\oplus}^{\mathtt{cbv}}$ , if $\mathtt{m}\Rightarrow^{*}\mathtt{n}$ then exists $\mathtt{r}$ such that $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}$ and $\mathtt{r}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{n}$ .

Proof.

We build on an analogous result for CbV $\lambda$ -calculus, which is folklore and is proved explicitly in Appendix V-B. We then only need to check that deep steps commute with $\oplus$ -steps, which is straightforward technology (the full proof is in Appendix V-B). ∎

Finitary Left Standardization does not hold

The following statement is false for $\Lambda_{\oplus}^{\mathtt{cbv}}$ .

“If $\mathtt{m}\Rightarrow^{*}\mathtt{n}$ then there exists $\mathtt{r}$ such that $\mathtt{m}\overset{{}_{\textsf{l}}}{\Rightarrow}^{*}\mathtt{r}$ and $\mathtt{r}\overset{{}_{\textsf{int}}}{\Rightarrow}^{*}\mathtt{n}$ ."

Example 13 (Counter-example).

Let us consider the following sequence, where $I=\lambda x.x$ and $M=(II)((\lambda x.y\oplus z)I)$ . $\boldsymbol{[}M\boldsymbol{]}\overset{{}_{\textsf{int}}}{\Rightarrow}\boldsymbol{[}(II)(y\oplus z)\boldsymbol{]}$$\Rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}(II)y,\frac{1}{2}(II)z\boldsymbol{]}\Rightarrow_{\beta_{v}}\boldsymbol{[}\frac{1}{2}Iy,\frac{1}{2}(II)z\boldsymbol{]}$ . If we anticipate the reduction of $(II)$ , we have $M\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}\boldsymbol{[}I((\lambda x.y\oplus z)I)\boldsymbol{]}$ , from where we cannot reach $\boldsymbol{[}\frac{1}{2}Iy,\frac{1}{2}(II)z\boldsymbol{]}$ . Observe that the sequence is already surface-standard!

VI Asymptotic Evaluation

The specificity of probabilistic computation is to be concerned with asymptotic behavior; the focus is not what happens after a finite number $n$ of steps, but when $n$ tends to infinity. In this section, we study the asymptotic behavior of $\Rightarrow$ -sequences with respect to evaluation. The intuition is that a reduction sequence defines a distribution on the possible outcomes of the program. We first clarify what is the outcome of evaluating a probabilistic term, and then we formalize the idea of result “at the limit" with the notion of limit distribution (Def. 18). In Sec. VI-B we investigate how the asymptotic result of different sequences starting from the same $\mathtt{m}$ compare.

We recall that to each multidistribution $\mathtt{m}$ on $\Lambda_{\oplus}$ is associated a probability distribution $\mu\in\mathtt{DST}(\Lambda_{\oplus})$ (see Sec.III-C). We use the following letter convention: given a multidistribution $\mathtt{m},\mathtt{n},\mathtt{r},...$ we denote the associated distribution by the corresponding Greek letter $\mu,\nu,\rho,...$ If $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ is a $\Rightarrow$ -sequence, then $\langle\mu_{n}\rangle_{n\in\mathbb{N}}$ is the sequence of associated distributions.

VI-A Probabilistic Evaluation

We start by studying the property of being valuable (VI-A1) and by analyzing some examples (VI-A2). This motivates the more general approach we introduce in VI-A3.

VI-A1 To be valuable

In the CbV $\lambda$ -calculus, the key property of a term $M$ is *to be valuable, i.e., $M$ can reduce to a value. To be valuable is a yes/no property, whose probabilistic analogous is the probability to reduce to a value. * If $\mathtt{m}$ describes the result of a computation step, the probability that such a result is a value is simply $\mu(\mathcal{V}):=\sum_{V\in\mathcal{V}}\mu(V)$ , i.e. the probability of the event $\mathcal{V}\subset\Lambda_{\oplus}$ . Since the set of values is closed under reduction, the following property holds:

Fact 14.

If $V\in\mathcal{V}$ and $V\rightarrow\mathtt{m}$ , then $\mathtt{m}=\boldsymbol{[}W\boldsymbol{]}$ , with $W\in\mathcal{V}$ , and $V\rightarrow_{\beta_{v}}\boldsymbol{[}W\boldsymbol{]}$ .

Let $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ be a $\Rightarrow$ -sequence, and $\langle\mu_{n}\rangle_{n\in\mathbb{N}}$ the sequence of associated distributions. The sequence of reals $\langle{\mu_{n}(\mathcal{V})}\rangle_{n\in\mathbb{N}}$ is nondecreasing and bounded, because of Fact 14. Therefore the limit exists, and is the supremum: $\lim_{n\to\infty}{\mu_{n}(\mathcal{V})}=\sup_{n}\{\mu_{n}(\mathcal{V})\}.$ This fact allows us the following definition.

•

The sequence $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ evaluates with probability $p$

if $p=\sup_{n}{\mu_{n}(\mathcal{V})}$ , written $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}p$ .

•

$\mathtt{m}$ is * $p$ -valuable* if $p$ is the greatest probability to which a sequence from $\mathtt{m}$ can evaluate.

Example 15.

Let $\mathtt{T}=\lambda xy.x$ and $\mathtt{F}=\lambda xy.y$ .

Consider the term $PP$ where $P=(\lambda x.(xx\oplus\mathtt{T}))$ . Then $PP\rightarrow\boldsymbol{[}(PP)\oplus\mathtt{T}\boldsymbol{]}\Rightarrow\boldsymbol{[}\frac{1}{2}PP,\frac{1}{2}\mathtt{T}\boldsymbol{]}\Rightarrow^{2n}\boldsymbol{[}\frac{1}{2^{n}}PP,\frac{1}{2}\mathtt{T},\dots,\frac{1}{2^{n}}\mathtt{T}\boldsymbol{]}$ . Since $\lim_{n\to\infty}{\sum_{1}^{n}\frac{1}{2^{n}}}=1$ , $PP$ is $1$ -valuable. 2. 2.

Consider the term $QQ$ , where $Q=\lambda x.(xx\oplus(\mathtt{T}\oplus\mathtt{F}))$ . Then $QQ\rightarrow_{\beta_{v}}\boldsymbol{[}(QQ)\oplus(\mathtt{T}\oplus\mathtt{F})\boldsymbol{]}\Rightarrow^{*}\boldsymbol{[}\frac{1}{2}QQ,\frac{1}{4}\mathtt{T},\frac{1}{4}\mathtt{F}\boldsymbol{]}\Rightarrow^{*}\dots$ It is immediate that $QQ$ is $1$ -valuable. 3. 3.

Let $\Delta=\lambda x.xx$ , so that $\Delta\Delta$ is a divergent term, and let $N=\lambda x.(xx)\oplus(\mathtt{T}\oplus(\Delta\Delta))$ . Then $NN\rightarrow_{\beta_{v}}\boldsymbol{[}(NN)\oplus(\mathtt{T}\oplus(\Delta\Delta))\boldsymbol{]}\Rightarrow^{*}\boldsymbol{[}\frac{1}{2}NN,\frac{1}{4}\mathtt{T},\frac{1}{4}(\Delta\Delta)\boldsymbol{]}\Rightarrow^{*}\dots$ $NN$ is $\frac{1}{2}$ -valuable.

VI-A2 Result of a CbV computation

The notion of being $p$ -valuable allows for a simple definition, but it is too coarse. Consider Example 15; both 1) and 2) give examples of $1$ -valuable term. However, in 1) the probability is concentrated in the value $\mathtt{T}$ , while in 2) $\mathtt{T}$ and $\mathtt{F}$ have equal probability $\frac{1}{2}$ . Observe that $\mathtt{T}$ and $\mathtt{F}$ are different normal forms, and are not $\beta_{v}$ -equivalent. To discriminate between $\mathtt{T}$ and $\mathtt{F}$ , we need a finer notion of evaluation. Since the calculus is CbV, the result “at the limit" is intuitively a distribution on the possible values that the term can reach. Some care is needed though, as the following example shows.

Example 16.

Consider Plotkin’s CbV $\lambda$ -calculus. Let $\omega_{3}=\lambda x.xxx$ ; the term $M={(\lambda x.x)\lambda x.\omega_{3}\omega_{3}}$ has the following $\rightarrow_{\beta_{v}}$ -reduction: $M={(\lambda x.x)(\lambda x.\omega_{3}\omega_{3})}\rightarrow_{\beta_{v}}M_{1}={\lambda x.\omega_{3}\omega_{3}}\rightarrow_{\beta_{v}}M_{2}={\lambda x.\omega_{3}\omega_{3}\omega_{3}}\rightarrow_{\beta_{v}}\cdots$ . We obtain a reduction sequence where $\forall n\geq 1$ , $M_{n}={\lambda x.\omega_{3}\underbrace{\omega_{3}...\omega_{3}}_{n}}$ . Each $M_{i}$ is a value, but there is not a "final" one in which the reduction ends. Transposing this to $\Lambda_{\oplus}^{\mathtt{cbv}}$ , let $\mathtt{m}_{0}=\boldsymbol{[}M\boldsymbol{]}$ , $\mathtt{m}_{i}=\boldsymbol{[}M_{i}\boldsymbol{]}$ . The $\Rightarrow$ -sequence $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ is $1$ -valuable, but the distribution on values is different at every step. In other words, $\forall V\in\mathcal{V}$ , the sequence $\langle{\mu_{n}({V})}\rangle$ has limit [math]. Observe that however all the values $M_{i}$ are $\beta_{v}$ -equivalent.

VI-A3 Observations and Limit Distribution

Example 16 motivates the approach that we develop now: the result of probabilistic evaluation is not a distribution on values, but* a distribution on some events of interest*. In the case of $\Lambda_{\oplus}^{\mathtt{cbv}}$ , the most informative events are equivalence classes of values.

We first introduce the notion of observation, and then that of limit distribution.

Definition 17.

A set of observations for $(\Lambda_{\oplus},\Rightarrow)$ is a set $\mathtt{Obs}\subseteq\mathcal{P}(\Lambda_{\oplus})$ such that $\forall\mathbf{U},\mathbf{Z}\in\mathtt{Obs}$ , if $\mathbf{U}\not=\mathbf{Z}$ then $\mathbf{U}\cap\mathbf{Z}=\emptyset$ , and if $\mathtt{m}\Rightarrow\mathtt{m}^{\prime}$ then $\mu(\mathbf{U})\leq\mu^{\prime}(\mathbf{U})$ .

Note that, given $\mu\in\mathtt{DST}(\Lambda_{\oplus})$ , $\mathbf{U}\in\mathtt{Obs}$ has probability $\mu(\mathbf{U})$ (similarly to the event "the result is Odd" in Example 2).

It follows immediately from the definition that, given a sequence $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ , then for each $\mathbf{U}\in\mathtt{Obs}$ the sequence $\langle{\mu_{n}(\mathbf{U})}\rangle_{n\in\mathbb{N}}$ is nondecreasing and bounded, and therefore has a limit, the $\sup$ . Moreover, monotony implies the following

[TABLE]

which guarantees that the distribution $\boldsymbol{\rho}$ in Def. 18 is well defined, because $\sup_{n}\|\mu_{n}\|\leq 1$ and (1) gives $\sup_{n}\|\mu_{n}\|=\|\boldsymbol{\rho}\|$ .

Definition 18.

Let $\mathtt{Obs}$ be a set of observations. The sequence $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ defines a distribution $\boldsymbol{\rho}\in\mathtt{DST}(\mathtt{Obs})$ , where $\forall\mathbf{U}\in\mathtt{Obs}$ ,

$\boldsymbol{\rho}(\mathbf{U}):=\sup_{n}\{\mu_{n}(\mathbf{U})\}.$

•

We call such a $\boldsymbol{\rho}$ the* limit distribution* of $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ . Letter convention: greek bold letters denote limit distributions.

•

The sequence $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ converges to (or evaluates to) the limit distribution $\boldsymbol{\rho}$ , written

$\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\Downarrow_{{}_{\mathtt{Obs}}}\boldsymbol{\rho}$ .

•

If $\mathtt{m}$ has a sequence which converges to $\boldsymbol{\rho}$ , we write

$\mathtt{m}\overset{{}_{\infty}}{\Rightarrow_{\mkern-4.0mu{}_{\mathtt{Obs}}}}\boldsymbol{\rho}$ .

•

Given $\mathtt{m}$ , we denote by $\mathtt{Lim}_{{}_{\mathtt{Obs}}}(\mathtt{m})$ the set $\{\boldsymbol{\rho}\mid\mathtt{m}\overset{{}_{\infty}}{\Rightarrow_{\mkern-4.0mu{}_{\mathtt{Obs}}}}\boldsymbol{\rho}\}$ of all limit distributions of $\mathtt{m}$ . If $\mathtt{Lim}_{{}_{\mathtt{Obs}}}(\mathtt{m})$ has a greatest element, we indicate it by $\llbracket{\mathtt{m}}\rrbracket_{{}_{\mathtt{Obs}}}$ .

If $\mathtt{Obs}$ is clear from the context, we omit the index which specifies it, and simply write $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\Downarrow\boldsymbol{\rho}$ , $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\rho}$ , $\mathtt{Lim}(\mathtt{m})$ .

The notion of limit distribution formalizes what is the result of evaluating a probabilistic term, once we choose the set $\mathtt{Obs}$ of observations which interest us. In VI-B we prove that confluence implies that $\mathtt{Lim}(\mathtt{m})$ has a unique maximal element.

Sets of Observations for $\Lambda_{\oplus}^{\mathtt{cbv}}$

Let us consider two partitions of the set $\mathcal{V}\subset\Lambda_{\oplus}$ , the trivial one $\{\mathcal{V}\}$ , and the set $\mathcal{V}_{\sim}$ of values up to the equivalence $=_{\beta_{v}}$ , i.e. the collection of all events $\{W\in\mathcal{V}\mid W=_{\beta_{v}}V\}$ . For the set $\mathcal{N}$ of $\rightarrow$ -normal forms (see IV-A2), interesting partitions are $\{\mathcal{N}\}$ and the set of singletons $\mathcal{N}_{{}_{\{\}}}:=\{\{M\},M\in\mathcal{N}\}$ .

Proposition 19.

$\{\mathcal{V}\}$ , $\mathcal{V}_{\sim}$ , $\{\mathcal{N}\}$ and $\mathcal{N}_{{}_{\{\}}}$ are each a set of observations for $(\Lambda_{\oplus},\Rightarrow)$ .

Proof.

Clearly, any partition of $\mathcal{N}$ satisfies the conditions in Def. 17. For $\{\mathcal{V}\}$ and $\mathcal{V}_{\sim}$ , the result follows from Fact 14. ∎

Notice that convergence w.r.t. $\{\mathcal{V}\}$ corresponds to the notion of being $p$ -valuable. Instead $\{\mathcal{N}\}$ and $\mathcal{N}_{{}_{\{\}}}$ correspond to normalization and reaching a specific normal form, respectively; however these are events which are not significant in a CbV perspective, as we already discussed in V-A1. For this reason, in Sec. VII we will focus on the study of $\mathtt{Obs}:=\mathcal{V}_{\sim}$ (Sec. VII).

Example 20.

•

Let $\mathtt{Obs}$ be either $\mathcal{V}_{\sim}$ or $\mathcal{N}_{{}_{\{\}}}$ .

Let $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ be the sequence in Example 15.1, starting from $\boldsymbol{[}PP\boldsymbol{]}$ . Then $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\Downarrow_{{}_{\mathtt{Obs}}}\{\boldsymbol{\mathtt{T}}^{1}\}$ . 2. 2.

Let $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ be the computation in Example 15.2, starting from $\boldsymbol{[}QQ\boldsymbol{]}$ . Then $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\Downarrow_{{}_{\mathtt{Obs}}}\{\boldsymbol{\mathtt{T}}^{\frac{1}{2}},\boldsymbol{\mathtt{F}}^{\frac{1}{2}}\}$ . 3. 3.

Let $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ be the computation in Example 15.3, starting from $\boldsymbol{[}NN\boldsymbol{]}$ . Then $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\Downarrow_{{}_{\mathtt{Obs}}}\{\boldsymbol{\mathtt{T}}^{\frac{1}{2}}\}$ .

•

Let $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ be the reduction sequence in Example 16, starting with $\boldsymbol{[}(\lambda x.x)\lambda x.\omega_{3}\omega_{3}\boldsymbol{]}$ . By taking as set of observations $\mathcal{V}_{\sim}$ , the sequence converges to $\{\boldsymbol{\lambda x.\omega_{3}\omega_{3}}^{1}\}$ .

Discussion

Each observation expresses a result of interest for the evaluation of the term $M$ . To better understand this, let us examine what become our notions of observation in the case of usual (non-probabilistic) CbV $\lambda$ -calculus. Let $M\rightarrow^{*}N\in\mathbf{U}$ and $\mathbf{U}\in\mathtt{Obs}$ ; if $\mathbf{U}\in\{\mathcal{V}\}$ then $M$ is valuable, if $\mathbf{U}\in\mathcal{V}_{\sim}$ , then $M$ reduces to the value $N$ up to $\beta_{v}$ -equivalence, if $\mathbf{U}\in\{\mathcal{N}\}$ , then $M$ normalizes, finally $\mathbf{U}=\{N\}\in\mathcal{N}_{{}_{\{\}}}$ means that $M$ has normal form $N$ . We say that $\mathbf{U}\in\mathtt{Obs}$ is a result of evaluating $M$ , if $M\rightarrow^{*}N\in\mathbf{U}$ . Clearly, fixed $\mathtt{Obs}$ , confluence implies that the result of evaluating $M$ , if any, is unique.

Sets of observations for Surface Reduction

It is interesting to examine the set of observations for surface reduction $\overset{{}_{\textsf{s}}}{\Rightarrow}$ . When considering $\overset{{}_{\textsf{s}}}{\rightarrow}$ , values are $\overset{{}_{\textsf{s}}}{\rightarrow}$ -normal forms (the converse does not hold!). Therefore $\{\{V\}\mid V\in\mathcal{V}\}$ (where $\{V\}$ is a singleton) is a set of observations for $(\Lambda_{\oplus},\overset{{}_{\textsf{s}}}{\Rightarrow})$ . In other words, when restricting oneself to surface reduction, the result of a probabilistic computation (i.e. the limit distribution) is a distribution on the possible values of the term. Observe that all set of observations for $\Rightarrow$ (Prop. 19) are also set of observations for $\overset{{}_{\textsf{s}}}{\Rightarrow}$ .

VI-B Uniqueness and Adequacy of the Evaluation

In this section, we adapt similar results from [15], to which we refer for details. We assume a set $\mathtt{Obs}$ to be fixed, hence we omit the index. For concreteness, think of $\mathcal{V}_{\sim}$ , but the results only depend on the properties in Def. 17, and on confluence.

How do different reduction sequences from the same initial $\mathtt{m}$ compare? More precisely, assume $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\rho}$ and $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ , how do $\boldsymbol{\rho}$ and $\boldsymbol{\mu}$ compare? Intuitively, the limit distributions of $\mathtt{m}$ (which are the result of a probabilistically terminating sequence) play the role of normal forms in finitary termination. As confluence implies uniqueness of normal forms, a similar property holds when considering probabilistic termination and limits, in the sense that each $\mathtt{m}$ has a unique maximal limit distribution (Thm. 22). While the property is similar, the proof is not as immediate as in the finitary case. The key result is Lemma 21 which implies both that $\mathtt{Lim}(\mathtt{m})$ has a greatest element (Thm. 22), and adequacy of the evaluation (Thm. 23).

Recall that the order $\leq$ on distributions is defined pointwise (Sec. III-A).

Lemma 21 (Main Lemma).

$\Lambda_{\oplus}^{\mathtt{cbv}}$ * has the following property: $\forall\mathtt{m},\mathtt{s}$ , if $\boldsymbol{\mu}\in\mathtt{Lim}(\mathtt{m})$ , and $\mathtt{m}\Rightarrow^{*}\mathtt{s}$ , then $\mathtt{s}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\sigma}$ with $\boldsymbol{\mu}\leq\boldsymbol{\sigma}$ . Moreover, if $\boldsymbol{\mu}$ is maximal in $\mathtt{Lim}(\mathtt{m})$ then $\boldsymbol{\sigma}=\boldsymbol{\mu}$ .*

Proof.

Let $\boldsymbol{\mu}\in\mathtt{Lim}(\mathtt{m})$ , and $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ be a sequence from $\mathtt{m}=\mathtt{m}_{0}$ which converges to $\boldsymbol{\mu}$ . Assume $\mathtt{m}\Rightarrow^{*}\mathtt{s}$ . As illustrated in Fig. 5, from $\mathtt{s}$ we build a sequence $\mathtt{s}=\mathtt{s}_{\mathtt{m}_{0}}\Rightarrow^{*}\mathtt{s}_{\mathtt{m}_{1}}\Rightarrow^{*}\mathtt{s}_{\mathtt{m}_{2}}\dots$ , where each segment $\mathtt{s}_{\mathtt{m}_{i}}\Rightarrow^{*}\mathtt{s}_{\mathtt{m}_{i+1}}$ ( $i\geq 0$ ) is given by confluence from $\mathtt{m}_{i}\Rightarrow^{*}\mathtt{s}_{\mathtt{m}_{i}}$ and $\mathtt{m}_{i}\Rightarrow\mathtt{m}_{i+1}$ . Let $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}$ be the concatenation of all such segments and let $\boldsymbol{\sigma}$ be its limit distribution. Clearly, $\boldsymbol{\sigma}\in\mathtt{Lim}(\mathtt{m})$ . Since by construction $\mathtt{m}_{i}\Rightarrow^{*}\mathtt{s}_{\mathtt{m}_{i}}$ , then for each $\mathbf{V}\in\mathtt{Obs}$ , ${\mu_{i}(\mathbf{V})}\leq\boldsymbol{\sigma}(\mathbf{V})$ (because $\mu_{i}(\mathbf{V})\leq{\sigma_{\mathtt{m}_{i}}(\mathbf{V})}$ by definition of observation). Therefore $\sup_{n}\{\mu_{n}(\mathbf{V})\}=\boldsymbol{\mu}(\mathbf{V})\leq\boldsymbol{\sigma}(\mathbf{V})$ . If $\boldsymbol{\mu}$ is maximal, then $\boldsymbol{\sigma}=\boldsymbol{\mu}$ . ∎

Theorem 22 (Greatest Limit Distribution).

$\mathtt{Lim}(\mathtt{m})$ * has a greatest element, which we indicate by $\llbracket{\mathtt{m}}\rrbracket$ .*

Proof.

The proof of both existence and uniqueness of maximal elements relies on Lemma 21. Let us explicitly show uniqueness. Let $\boldsymbol{\mu}\in\mathtt{Lim}(\mathtt{m})$ be maximal. Given any $\boldsymbol{\rho}\in\mathtt{Lim}(\mathtt{m})$ , we prove that $\boldsymbol{\rho}\leq\boldsymbol{\mu}$ . Let $\langle\mathtt{r}_{n}\rangle_{n\in\mathbb{N}}$ be a sequence from $\mathtt{m}$ such that $\langle\mathtt{r}_{n}\rangle_{n\in\mathbb{N}}\Downarrow\boldsymbol{\rho}$ . By Lemma 21, $\forall\mathtt{r}_{n}$ there is a $\Rightarrow$ -sequence from $\mathtt{r}_{n}$ which has limit $\boldsymbol{\mu}$ . Therefore $\forall\mathbf{V}\in\mathcal{V}$ , $\forall n$ , $\rho_{n}(\mathbf{V})\leq\boldsymbol{\mu}(\mathbf{V})$ , hence $\boldsymbol{\rho}(\mathbf{V})\leq\boldsymbol{\mu}(\mathbf{V})$ . If $\boldsymbol{\rho}$ is maximal, $\boldsymbol{\rho}=\boldsymbol{\mu}$ . ∎

Theorem 23 (Adequacy of evaluation).

If $\mathtt{m}\Rightarrow^{*}\mathtt{s}$ , then $\llbracket{\mathtt{m}}\rrbracket=\llbracket{\mathtt{s}}\rrbracket$ .

Proof.

Observe first that $\llbracket{\mathtt{s}}\rrbracket\in\mathtt{Lim}(\mathtt{m})$ , hence $\llbracket{\mathtt{s}}\rrbracket\leq\llbracket{\mathtt{m}}\rrbracket$ . Indeed, if $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}\Downarrow\llbracket{\mathtt{s}}\rrbracket$ , by concatenanting $\mathtt{m}\Rightarrow^{*}\mathtt{s}$ with $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}$ , we have $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\llbracket{\mathtt{s}}\rrbracket$ . By Lemma 21, it holds that $\llbracket{\mathtt{m}}\rrbracket\in\mathtt{Lim}(\mathtt{s})$ , hence $\llbracket{\mathtt{m}}\rrbracket\leq\llbracket{\mathtt{s}}\rrbracket$ . Therefore $\llbracket{\mathtt{m}}\rrbracket=\llbracket{\mathtt{s}}\rrbracket$ . ∎

VII Asymptotic Standardization

In this section, we focus on $\mathcal{V}_{\sim}$ as set of observations, which is the most natural choice in a CbV setting, in particular if we want to evaluate programs, i.e., closed terms.

We proved, in Thm. 22, that each $\mathtt{m}$ has a unique maximal limit distribution $\llbracket{\mathtt{m}}\rrbracket$ . Now we address the question: is there a reduction strategy which is guaranteed to converge to $\llbracket{\mathtt{m}}\rrbracket$ ? We show that surface evaluation provides such a strategy; indeed, any limit distribution in $\mathtt{Lim}(\mathtt{m})$ can be reached by surface evaluation (Thm. 26). This result of asymptotic completeness is the main technical contribution of the section.

Following the notation introduced in VI-A3, we denote by $\mathbf{V}$ the set $\{W\in\mathcal{V}\mid W=_{\beta_{v}}V\}$ . We observe that:

Fact 24.

Let $M\overset{{{}_{\textsf{d}}}}{\rightarrow}\mathtt{m}$ , then $\mathtt{m}$ has form $\boldsymbol{[}P\boldsymbol{]}$ and $M=_{\beta_{v}}P$ ; $M$ is a value if and only if $P$ is a value.

As a consequence of the previous fact, we have

Lemma 25.

If $\mathtt{m}\overset{{}_{\textsf{d}}}{\Rightarrow}\mathtt{s}$ then $\mu(\mathcal{V})=\sigma(\mathcal{V})$ , and $\mu(\mathbf{V})=\sigma(\mathbf{V})$ , for each $\mathbf{V}\in\mathcal{V}_{\sim}$ .

We write $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \textsf{s}\leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ (resp. $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \textsf{l}\leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ ) if there is a sequence $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ such that all steps $\mathtt{m}_{i}\Rightarrow\mathtt{m}_{i+1}$ are surface (resp. left) reductions and $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\Downarrow\boldsymbol{\mu}$ . Remember that given $\mathtt{m}$ , we write $\llbracket{\mathtt{m}}\rrbracket$ for the unique maximal element of $\mathtt{Lim}(\mathtt{m})$ , and $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ if there exists a $\Rightarrow$ -sequence from $\mathtt{m}$ which converges to $\boldsymbol{\mu}$ .

We now prove asymptotic completeness for surface evaluation. We exploit finitary standardization (Thm. 12) and extend it to the limit. In the proof, it is essential the fact that $\overset{{}_{\textsf{d}}}{\Rightarrow}$ -steps preserve the distributions (Lemma 25).

Theorem 26 (Asymptotic Completeness of Surface Reduction).

$\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ * if and only if $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \textsf{s}\leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ .*

Proof.

We prove that $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ implies $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \textsf{s}\leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ (the other direction holds by definition). Assume $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\Downarrow\boldsymbol{\mu}$ , with $\mathtt{m}=\mathtt{m}_{0}$ . As illustrated in Fig. 5, we build a sequence $\langle\mathtt{s}_{\mathtt{m}_{n}}\rangle$ such that $\mathtt{m}_{0}=\mathtt{s}_{\mathtt{m}_{0}}$ and $\forall i$ ( $\mathtt{s}_{\mathtt{m}_{i}}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}_{\mathtt{m}_{i+1}}$ and $\mathtt{s}_{\mathtt{m}_{i+1}}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{m}_{i+1}$ ). If $i=0$ , by Thm. 12 it exists $\mathtt{s}_{\mathtt{m}_{1}}$ such that $\mathtt{m}_{0}=\mathtt{s}_{\mathtt{m}_{0}}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}_{\mathtt{m}_{1}}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{m}_{1}$ . We then procede by induction: for each $i>0$ , we apply Thm. 12 to the sequence $\mathtt{s}_{\mathtt{m}_{i}}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{m}_{i}\Rightarrow\mathtt{m}_{i+1}$ , and obtain the multidistribution $\mathtt{s}_{\mathtt{m}_{i+1}}$ such that $\mathtt{s}_{\mathtt{m}_{i}}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}_{\mathtt{m}_{i+1}}$ and $\mathtt{s}_{\mathtt{m}_{i+1}}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{m}_{i+1}$ , as wanted. The concatenation of all segments $\mathtt{s}_{\mathtt{m}_{0}}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}_{\mathtt{m}_{1}},...,\mathtt{s}_{\mathtt{m}_{i}}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}_{\mathtt{m}_{i+1}},...$ is a $\overset{{}_{\textsf{s}}}{\Rightarrow}$ -sequence. Let $\boldsymbol{\sigma}$ be its limit. By Lemma 25 and the fact that $\mathtt{s}_{\mathtt{m}_{i}}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{m}_{i}$ , we have $\sigma_{\mathtt{m}_{i}}(\mathbf{V})=\mu_{i}(\mathbf{V})$ , for each $\mathbf{V}\in\mathcal{V}_{\sim}$ . We conclude $\boldsymbol{\sigma}=\boldsymbol{\mu}$ because $\forall i$ :

$\sigma_{\mathtt{m}_{i}}(\mathbf{V})=\mu_{i}(\mathbf{V})\leq\boldsymbol{\mu}(\mathbf{V})$ , therefore $\boldsymbol{\sigma}(\mathbf{V})\leq\boldsymbol{\mu}(\mathbf{V})$ . 2. 2.

$\mu_{i}(\mathbf{V})=\sigma_{\mathtt{m}_{i}}(\mathbf{V})\leq\boldsymbol{\sigma}(\mathbf{V})$ , therefore $\boldsymbol{\mu}(\mathbf{V})\leq\boldsymbol{\sigma}(\mathbf{V})$ .

∎

Remark 27.

We observe that completeness of surface evaluation (Thm. 26) is specific to convergence w.r.t. $\mathcal{V}_{\sim}$ and $\{\mathcal{V}\}$ (the most natural set of observations in CbV). Surface evaluation is not necessarily complete if we evaluate w.r.t. other sets of observations, such as normal forms, where deep steps may be needed. Consider, for example, the term $\lambda z.II\overset{{{}_{\textsf{d}}}}{\rightarrow}\lambda z.I$ . To define a complete strategy w.r.t. $\mathcal{N}_{{}_{\{\}}}$ demands to refine the approach.

VII-A Surface and Left Evaluation

We are now equipped to tackle the goal of this section, namely the existence of a strategy to find the greatest limit distribution of a program.

Since our aim is to reach the greatest limit, it makes sense to reduce "whenever is possible", and use the full lifting $\rightrightarrows$ (Def. IV-A3). The reason is easy to see. Consider for example $\mathtt{m}=\boldsymbol{[}\frac{1}{2}\Delta\Delta,\frac{1}{2}II\boldsymbol{]}$ , which has greatest limit $\llbracket{\mathtt{m}}\rrbracket=\{\mathbf{I}^{\frac{1}{2}}\}$ . We observe that a $\Rightarrow$ -sequence from $\mathtt{m}$ may very well keep reducing only the diverging term $\Delta\Delta$ and never reach $\llbracket{\mathtt{m}}\rrbracket$ . The reduction $\rightrightarrows$ , instead, forces the reduction of each term which is not in normal form for $\rightarrow$ .

Lemma 28.

Let $\boldsymbol{\rho}$ be maximal among the limit distribution of all $\rightrightarrows$ -sequences from $\mathtt{m}$ . Then $\boldsymbol{\rho}=\llbracket{\mathtt{m}}\rrbracket$ .

Proof.

Obviously, $\boldsymbol{\rho}\in\mathtt{Lim}(\mathtt{m})$ . It is straightforward to check that if $\boldsymbol{\mu}$ is the limit of a $\Rightarrow$ -sequence, then there is a $\rightrightarrows$ -sequence, whose limit is greater or equal to $\boldsymbol{\mu}$ . ∎

We write $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ (resp. $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{l} $}\vss}}}$ ) for the full lifting of $\overset{{}_{\textsf{s}}}{\rightarrow}$ (resp. $\overset{{}_{\textsf{l}}}{\rightarrow}$ ). Observe that given $\mathtt{m}$ , there is only one $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{l} $}\vss}}}$ -sequence. We use the letters $\mathfrak{l}=\langle l_{n}\rangle_{n\in\mathbb{N}},\leavevmode\nobreak\ \mathfrak{s}=\langle s_{n}\rangle_{n\in\mathbb{N}},\leavevmode\nobreak\ \mathfrak{t}=\langle t_{n}\rangle_{n\in\mathbb{N}}$ to indicate (infinite) reduction sequences. We say that $\mathtt{m}$ is closed if it is a multidistribution on closed terms i.e. $\mathtt{m}=\boldsymbol{[}p_{i}M_{i}\mid i\in I\boldsymbol{]}$ with $M_{i}$ closed $\forall i\in I$ .

Proposition 29 (Left Evaluation).

Let $\mathtt{m}$ be closed.

Let $\mathfrak{s},\mathfrak{t}$ be $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ -sequences from $\mathtt{m}$ ; $\mathfrak{s}\Downarrow\boldsymbol{\mu}$ if and only if $\mathfrak{t}\Downarrow\boldsymbol{\mu}$ . 2. 2.

Let $\mathfrak{s}$ be any $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ -sequence from $\mathtt{m}$ , and $\mathfrak{l}$ the $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{l} $}\vss}}}$ -sequences from $\mathtt{m}$ . Then $\mathfrak{s}\Downarrow\boldsymbol{\mu}$ if and only if $\mathfrak{l}\Downarrow\boldsymbol{\mu}$ .

Proof.

[15], Sec.6, studies a CbV probabilistic $\lambda$ -calculus with surface reduction ( $\Lambda_{\oplus}^{\texttt{weak}}$ ) and proves using a diamond property that if $\mathtt{m}{\mathrel{\mathop{\rightrightarrows^{k}}\limits^{\vbox to-1.50694pt{\kern 0.0pt\hbox{$ \tiny\textsf{s}\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ $}\vss}}}}\mathtt{m}_{k}$ and $\mathtt{m}{\mathrel{\mathop{\rightrightarrows^{k}}\limits^{\vbox to-1.50694pt{\kern 0.0pt\hbox{$ \tiny\textsf{s}\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ $}\vss}}}}\mathtt{r}_{k}$ (both sequence have k steps) then $\forall V\in\mathcal{V}$ , $\mu_{k}(V)=\rho_{k}(V)$ . Hence claim (1.) follows.

Claim (2.) follows from (1.) and from Lemma 11, which implies that if $M\overset{{}_{\textsf{s}}}{\rightarrow}\mathtt{n}$ is closed, we can always choose a surface step which is a $\overset{{}_{\textsf{l}}}{\rightarrow}$ -step. ∎

Putting all elements together, we have proved that the limit distribution of any $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ -sequence from $\mathtt{m}$ is $\llbracket{\mathtt{m}}\rrbracket$ . In particular, $\llbracket{\mathtt{m}}\rrbracket$ is also the limit distribution of the $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{l} $}\vss}}}$ -sequence from $\mathtt{m}$ .

Theorem 30.

For $\mathtt{m}$ closed, the following hold.

Let $\mathfrak{s}$ be any $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ -sequence from $\mathtt{m}$ . Then $\mathfrak{s}\Downarrow\llbracket{\mathtt{m}}\rrbracket$ . 2. 2.

Let $\mathfrak{l}$ be the $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{l} $}\vss}}}$ -sequence from $\mathtt{m}$ . Then $\mathfrak{l}\Downarrow\llbracket{\mathtt{m}}\rrbracket$ . 3. 3.

The sets $\{\rho\mid\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\rho\},\{\rho\mid\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \textsf{s}\leavevmode\nobreak\ \infty}}{\Rightarrow}\rho\}$ , and $\{\rho\mid\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \textsf{l}\leavevmode\nobreak\ \infty}}{\Rightarrow}\rho\}$ have the same greatest element, which is $\llbracket{\mathtt{m}}\rrbracket$ .

While left reduction is not standard for finite sequences (as Example 13 shows), still is able to reach $\llbracket{\mathtt{m}}\rrbracket$ , if we only evaluate programs, i.e., closed terms. Thm. 30 justifies (a posteriori!) the use of the leftmost-outermost strategy in the literature of probabilistic $\lambda$ -calculus: left evaluation actually produces the best asymptotic result. However, it is not the only strategy to achieve this: any $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ -sequence will.

VIII Summing-up and Overview

The definition of reduction in $\Lambda_{\oplus}^{\mathtt{cbv}}$ is based on two components: the $\beta_{v}$ -rule and the $\oplus$ -rule. We stress that only the $\oplus$ -step is constrained, while $\beta_{v}$ is inherited "as is" from the $\lambda$ -calculus. The $\beta_{v}$ -rule is allowed in all contexts, while the $\oplus$ -rule is disabled in a function body. This avoids confusion between duplicating a function which performs a choice, and duplicating the choice, that is the core of confluence failure. It is then natural to expect that the fine control on duplication which is offered by linear logic could be beneficial.

In Sec. IX we apply the methods and tools which we have developed to study $\Lambda_{\oplus}^{\mathtt{cbv}}$ to define a probabilistic linear calculus $\Lambda_{\oplus}^{!}$ which extends with a probabilistic choice Simpson’s linear $\lambda$ -calculus [30]. This is a result of interest in its own, but also evidence that our approach is robust, as it transfers well to other probabilistic calculi. In Sec. X we then define a call-by-name probabilistic calculus, $\Lambda_{\oplus}^{{\mathtt{cbn}}}$ , and we show that similar results to the ones we have established for $\Lambda_{\oplus}^{\mathtt{cbv}}$ hold.

As we will see, the three calculi follow the same pattern: the $\oplus$ -reduction (and only this reduction) is restricted to surface contexts. In Sec. XI we discuss how the three calculi relate.

IX Probabilistic Linear Lambda Calculus

$\Lambda^{!}$ [30] is an untyped linear $\lambda$ -calculus which is closely based on linear logic. Abstraction is refined into linear abstraction $\lambda x.M$ and non-linear abstraction $\lambda!x.M$ , which allows duplication of the argument. The argument of $\lambda!x.M$ is required to be suspended as thunk $!N$ , that corresponds to the $!$ -box of linear logic. In this section, we define a probabilistic linear $\lambda$ -calculus $\Lambda_{\oplus}^{!}$ by extending $\Lambda^{!}$ with an operator $\oplus$ . We demand that probabilistic choice is not reduced under the scope of a $!$ operator, while the $\beta$ -reduction is unrestricted. We show that this suffices to preserve confluence; we then study the properties of the calculus.

IX-A Syntax of $\Lambda_{\oplus}^{!}$

IX-A1 The language

Raw terms $M,N,\dots$ are built up from a countable set of variables $x,y,\dots$ according to the grammar:

$\begin{array}[]{lcllr}M&::=&x\mid!M\mid\lambda x.M\mid\lambda!x.M\mid MM\mid M\oplus N&(\textbf{terms }\Lambda_{\oplus}^{!})\end{array}$

We say that $x$ is affine (resp. linear) in $M$ if $x$ occurs free at most (resp. exactly) once in $M$ , and moreover, the free occurrence of $x$ does not lie within the scope of a $!$ operator. A term $M$ is affine (resp. linear) if for every subterm $\lambda x.P$ of $M$ , $x$ is so in $P$ . Henceforth, we consider affine terms only.

It is immediate to observe that if $M$ is affine (linear) and $M\rightarrow N$ , then $N$ is affine (linear).

Contexts ( ${\bf C}$ ) and surface contexts ( $\bm{S}$ ) are generated by the grammars:

$\begin{array}[]{lclll}{\bf C}&::=&\square\mid M{\bf C}\mid{\bf C}M\mid\lambda x.{\bf C}\mid\lambda!x.{\bf C}\mid!{\bf C}\mid{\bf C}\oplus M\mid M\oplus{\bf C}&(\textbf{contexts})\\ \bm{S}&::=&\square\mid M\bm{S}\mid\bm{S}M\mid\lambda x.\bm{S}\mid\lambda!x.\bm{S}&(\textbf{surface c.})\end{array}$

where $\square$ denotes the hole of the term context. Observe that a surface context is defined in a different way than in IV-A. Here it expresses the fact that a surface redex cannot occur in the scope of a $!$ operator (nor in the scope of a $\oplus$ ).

IX-A2 Reductions

We follow the same pattern as for $\Lambda_{\oplus}^{\mathtt{cbv}}$ . The beta rules $\mapsto_{\beta}$ are given in Fig. 7. The probabilistic rules $\mapsto_{l\oplus},\mapsto_{r\oplus}$ are as in Fig. 1. The reduction steps are in Fig. 6: the $\beta$ -rule is closed under general context, while the $\oplus$ -rules are closed under surface contexts. The $\beta$ -rules also can be restricted to the closure under surface contexts, as shown in Fig. 6. A $\rightarrow$ -step is deep (written $\overset{{{}_{\textsf{d}}}}{\rightarrow}$ ) if it is not surface. The lifting of the relation $\rightarrow:\Lambda_{\oplus}^{!}\times\mathtt{MDST}(\Lambda_{\oplus}^{!})$ to a binary relation on $\Rightarrow$ $\mathtt{MDST}(\Lambda_{\oplus}^{!})$ is defined as in Fig. 3.

Remark 31.

To limit notations for reductions and contexts, we use the same as for $\Lambda_{\oplus}^{\mathtt{cbv}}$ , clearly the meaning is different.

IX-B $\Lambda_{\oplus}^{!}$ * is a conservative extension of $\Lambda^{!}$ *

As in IV-B, we denote by $\rightarrow_{\beta}$ both the reduction in $\Lambda^{!}$ and the $\beta$ reduction in $\Lambda_{\oplus}^{!}$ ; we prove that $(\Lambda_{\oplus}^{!},\Rightarrow_{\beta})$ is a conservative extension of $(\Lambda^{!},\rightarrow_{\beta})$ .

Definition 32 (Translation).

$(\cdot)_{!}:\Lambda_{\oplus}^{!}\rightarrow\Lambda^{!}$ is defined in the following way, where $z$ is a fixed fresh variable

[TABLE]

Note that the translation of terms of the form $M\oplus N$ is designed so to preserves surface reduction.

Proposition 33 (Simulation).

Let $M\in\Lambda_{\oplus}^{!}$ .

$M\rightarrow_{\beta}\boldsymbol{[}N\boldsymbol{]}$ * implies $(M)_{!}\rightarrow_{\beta}(N)_{!}$ .* 2. 2.

$(M)_{!}\rightarrow_{\beta}P$ * implies that exists (unique) $N\in\Lambda_{\oplus}^{!}$ , with $N=(P)_{!}$ and $M\rightarrow_{\beta}\boldsymbol{[}N\boldsymbol{]}$ .* 3. 3.

$M\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta}\boldsymbol{[}N\boldsymbol{]}$ * implies $(M)_{!}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta}(N)_{!}$ .* 4. 4.

$(M)_{!}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta}P$ * implies exists (unique) $N\in\Lambda_{\oplus}^{!}$ , s.t. $N=(P)_{!}$ and $M\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta}\boldsymbol{[}N\boldsymbol{]}$ .*

The translation tells us that the reduction $\boldsymbol{[}M\boldsymbol{]}\Rightarrow_{\beta}\boldsymbol{[}N\boldsymbol{]}$ on $\Lambda_{\oplus}^{!}$ behaves as the reduction $(M)_{!}\rightarrow_{\beta}(N)_{!}$ on $\Lambda^{!}$ .

IX-C Confluence and Finitary Standardization for $\Lambda_{\oplus}^{!}$

The following properties hold for $\Lambda^{!}$ [30].

Theorem (Simpson 05).

The following hold in $\Lambda^{!}$ .

Confluence. $\rightarrow_{\beta}$ is confluent. 2. 2.

Surface Standardization. If $M\rightarrow_{\beta}^{*}N$ then exists $R$ such that $M\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta}^{*}R$ and $R\overset{{{}_{\textsf{d}}}}{\rightarrow}^{*}N$ .

We show, using the methods developed for $\Lambda_{\oplus}^{\mathtt{cbv}}$ and the translation in Def. 32, that the same properties hold for $\Lambda_{\oplus}^{!}$ .

IX-C1 Confluence

We follow the same approach as in Sec. V-A. In fact, we already have most of the building blocks for the proof. Observe that Lemma 5 is general enough to apply also to binary relations on $\mathtt{MDST}(\Lambda_{\oplus}^{!})$ .

Lemma 34.

The reduction $\Rightarrow_{\oplus}$ is diamond. 2. 2.

The reduction $\Rightarrow_{\beta}$ is confluent. 3. 3.

The reductions $\Rightarrow_{\beta}$ and $\Rightarrow_{\oplus}$ commute.

Proof.

The details of the proof are in Appendix -E1. The proof of 1) and 2) is as for Lemmas 7 and 6; 3) is proved using Lemma 5, by induction on the term. ∎

By Hindley-Rosen Lemma, we obtain

Theorem 35.

The reduction $\Rightarrow$ of $\Lambda_{\oplus}^{!}$ is confluent.

IX-C2 Surface standardization

Proposition 36 (Finitary Surface Standardization).

In $\Lambda_{\oplus}^{!}$ , if $\mathtt{m}\Rightarrow^{*}\mathtt{n}$ then exists $\mathtt{r}$ such that $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}$ and $\mathtt{r}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{n}$ .

Proof.

The proof is given in Appendix -E2. ∎

IX-D Asymptotic behaviour

Normal forms are defined as in IV-A2; we denote by $\mathcal{N}^{!}$ the set of $\rightarrow$ -normal forms, and by $\mathcal{S}^{!}$ the set of the surface normal forms (i.e. the $\overset{{}_{\textsf{s}}}{\rightarrow}$ -normal forms). Clearly $\mathcal{N}^{!}\subsetneqq\mathcal{S}^{!}$ . We define $\mathcal{N}^{!}_{{}_{\{\}}}:=\{\{M\},M\in\mathcal{N}^{!}\}$ , and $\mathcal{S}^{!}_{\sim}$ as the set of all events $\textbf{R}:=\{S\in\mathcal{S}^{!}\mid S=_{\beta}R\}$ .

Observations

A set of observations for $(\Lambda_{\oplus}^{!},\Rightarrow)$ is defined in the same way as that for $(\Lambda_{\oplus},\Rightarrow)$ (Def. 17 ).

Proposition 37.

Each of the following sets $\{\mathcal{N}^{!}\}$ , $\{\mathcal{S}^{!}\}$ , $\mathcal{N}^{!}_{{{}_{\{\}}}}$ , $\mathcal{S}^{!}_{\sim}$ , is a set of observations for $(\Lambda_{\oplus}^{!},\Rightarrow)$ .

Limit distributions and evaluation

Once we fix a set of observations $\mathtt{Obs}$ for $(\Lambda_{\oplus}^{!},\Rightarrow)$ , the definition of evaluation and limit distribution, and the notations $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\Downarrow\boldsymbol{\rho}$ , $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\rho}$ and $\mathtt{Lim}(\mathtt{m})$ are as in Def. 18. We already observed that Thm. 22 and 23 only depends on confluence, and on the definition of observations; therefore both hold.

Theorem 38.

For any choice of $\mathtt{Obs}$ , $\Lambda_{\oplus}^{!}$ has the properties:

•

$\mathtt{Lim}(\mathtt{m})$ * has a greatest element, which we indicate as $\llbracket{\mathtt{m}}\rrbracket$ .*

•

If $\mathtt{m}\Rightarrow^{*}\mathtt{s}$ , then $\llbracket{\mathtt{m}}\rrbracket=\llbracket{\mathtt{s}}\rrbracket$ .

Asymptotic Standardization

For the rest of the section we focus on $\mathtt{Obs}:=\mathcal{S}^{!}_{\sim}$ . Notice that if $\boldsymbol{\rho}$ is a limit distribution, $\boldsymbol{\rho}\in\mathtt{MDST}(\mathcal{S}^{!}_{\sim})$ . We have established that for each $\mathtt{m}\in\Lambda_{\oplus}^{!}$ , $\mathtt{Lim}(\mathtt{m})$ has a unique maximal element $\llbracket{\mathtt{m}}\rrbracket$ . We now want to have a strategy to find $\llbracket{\mathtt{m}}\rrbracket$ . Surface reduction plays that role. We use the following fact, which is easy to verify.

Fact 39.

Let $M\overset{{{}_{\textsf{d}}}}{\rightarrow}\mathtt{n}$ . Then

$\mathtt{n}$ * is of the form $\boldsymbol{[}N\boldsymbol{]}$ , and $M=_{\beta}N$ ;* 2. 2.

$M\in\mathcal{S}^{!}$ * if and only if $N\in\mathcal{S}^{!}$ .*

Theorem 40 (Asymptotic Completeness).

In $\Lambda_{\oplus}^{!}$ it holds that $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ if and only if $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \textsf{s}\leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ .

Proof.

As for Thm. 26, now using Fact 39 and Prop. 36. ∎

Similarly to Sec. VII, we can establish that any (infinitary) $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ -sequences from $\mathtt{m}$ converges precisely to $\llbracket{\mathtt{m}}\rrbracket$ , where $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ indicate the full lifting of the relation $\overset{{}_{\textsf{s}}}{\rightarrow}\subseteq\Lambda_{\oplus}^{!}\times\mathtt{MDST}(\Lambda_{\oplus}^{!})$ .

Theorem 41 (Surface Evaluation).

Let $\mathfrak{s}=\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}$ be any $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ -sequences from $\mathtt{m}$ . It holds that $\mathfrak{s}\Downarrow\llbracket{\mathtt{m}}\rrbracket.$

X Call-by-Name calculus $\Lambda_{\oplus}^{{\mathtt{cbn}}}$

We show that results similar to those for $\Lambda_{\oplus}^{\mathtt{cbv}}$ hold for a CbN calculus, denoted $\Lambda_{\oplus}^{\mathtt{cbn}}$ . We could adapt all the proofs, but now we prefer to follow a different way. Once we take the point of view of linear logic, we have a roadmap to CbN via Girard’s translation of intuitionistic into linear logic. More precisely, we rely on recent work [12, 17] which expresses those translations in untyped $\lambda$ -calculus. We exploit the faithful nature of the translation to transfer both confluence and standardization from $\Lambda_{\oplus}^{!}$ to $\Lambda_{\oplus}^{\mathtt{cbn}}$ , essentially for free.

X-A Syntax of $\Lambda_{\oplus}^{\mathtt{cbn}}$

We write $\Lambda_{\oplus}^{\mathtt{cbn}}$ for the set of terms $\Lambda_{\oplus}$ equipped with the reduction relation $\Rightarrow$ defined below.

X-A1 The language

Terms and contexts $({\bf C})$ are the same as in $\Lambda_{\oplus}^{\mathtt{cbv}}$ . Surface contexts ( $\bm{S}$ ) are generated by the grammar:

$\begin{array}[]{lcllr}\bm{S}&::=&\square\mid\lambda x.\bm{S}\mid\bm{S}M&({\mathtt{cbn}}\textbf{ surface contexts})\end{array}$

X-A2 Reductions

The $\beta$ -rule $\mapsto_{\beta}$ is as in the CbN $\lambda$ -calculus (Fig. 8). The probabilistic rules $\mapsto_{l\oplus},\mapsto_{l\oplus}$ are as in Fig. 1.

Reduction steps $\rightarrow,\rightarrow_{\beta},\rightarrow_{\oplus}\subseteq\Lambda_{\oplus}\times\mathtt{MDST}(\Lambda_{\oplus})$ and surface reduction steps $\overset{{}_{\textsf{s}}}{\rightarrow},\overset{{}_{\textsf{s}}}{\rightarrow}_{\oplus},\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta}\subseteq\Lambda_{\oplus}\times\mathtt{MDST}(\Lambda_{\oplus})$ are defined in Fig. 6, following the usual pattern. By definition of surface context, a reduction step is surface if it does not occur in argument position (nor in the scope of $\oplus$ ).

The lifting of $\rightarrow\subseteq\Lambda_{\oplus}\times\mathtt{MDST}(\Lambda_{\oplus})$ to a binary relation $\Rightarrow$ on $\mathtt{MDST}(\Lambda_{\oplus}^{\mathtt{cbn}})$ is defined as in Fig. 3. The full lifting $\rightrightarrows$ is defined as in IV-A3.

X-A3 Normal Forms

We denote by $\mathcal{N}^{\mathtt{cbn}}$ the set of $\rightarrow$ -normal forms, and by $\mathcal{S}^{\mathtt{cbn}}$ the set of the surface normal forms (i.e. the $\overset{{}_{\textsf{s}}}{\rightarrow}$ -normal forms). Clearly $\mathcal{N}^{\mathtt{cbn}}\subsetneqq\mathcal{S}^{\mathtt{cbn}}$ .

Let us extend to $\Lambda_{\oplus}^{{\mathtt{cbn}}}$ the notion of head normal form. Head reduction $\overset{{}_{\textsf{h}}}{\rightarrow}$ is the closure of both the $\beta$ and the probabilistic rules under head context ${\bf H}$ , which is defined by the following grammar

${\bf H}::=\lambda x.{\bf H}\mid{\bf K}\quad\quad{\bf K}::=\square\mid{\bf K}M\quad\quad(\textbf{ head contexts })$

Remark.

A common way to write head context ${\bf H}$ is as follows:

[TABLE]

Observe that $\overset{{}_{\textsf{h}}}{\rightarrow}\leavevmode\nobreak\ \subsetneqq\leavevmode\nobreak\ \overset{{}_{\textsf{s}}}{\rightarrow}$ (for example, the reduction $(\lambda x.(\lambda y.y)P)Q\overset{{}_{\textsf{s}}}{\rightarrow}(\lambda x.P)Q$ is not a head reduction). However, the two relations have the same normal forms. Let us write $\mathcal{H}$ for the set of head normal forms . If $M$ is in surface normal form, it is also in head normal form. It is easy to verify that a head normal form has no $\overset{{}_{\textsf{s}}}{\rightarrow}$ -redex, and conclude:

$\mathcal{S}^{\mathtt{cbn}}=\mathcal{H}$

X-A4 $\Lambda_{\oplus}^{{\mathtt{cbn}}}$ to $\Lambda_{\oplus}^{!}$ .

In [17], the translation from $\Lambda^{\mathtt{cbn}}$ into a linear $\lambda$ -calculus is proved sound and complete. We follow their work to define a similar translation $(\cdot)_{{}_{\mathtt{N}}}:\Lambda_{\oplus}^{\mathtt{cbn}}\to\Lambda_{\oplus}^{!}$ :

[TABLE]

The following extend to the probabilistic setting an analogous result proved in [17]. Observe that, with a slight abuse of notation, reductions in the two calculi are denoted in the same way, the meaning being clear from the context.

Proposition 42 (Simulation).

The translation $(.)_{{}_{\mathtt{N}}}$ is sound and complete; it preserves surface reduction and surface normal forms. Let $M\in\Lambda_{\oplus}^{\mathtt{cbn}}$ ; the following hold:

if $M\rightarrow\mathtt{n}$ then $(M)_{{}_{\mathtt{N}}}\rightarrow(\mathtt{n})_{{}_{\mathtt{N}}}$ ; 2. 2.

if $M\overset{{}_{\textsf{s}}}{\rightarrow}\mathtt{n}$ then $(M)_{{}_{\mathtt{N}}}\overset{{}_{\textsf{s}}}{\rightarrow}(\mathtt{n})_{{}_{\mathtt{N}}}$ ; 3. 3.

if $(M)_{{}_{\mathtt{N}}}\rightarrow\mathtt{s}$ then $\exists!\mathtt{n}$ such that $\mathtt{s}=(\mathtt{n})_{{}_{\mathtt{N}}}$ and $M\rightarrow\mathtt{n}$ ; 4. 4.

if $(M)_{{}_{\mathtt{N}}}\overset{{}_{\textsf{s}}}{\rightarrow}\mathtt{s}$ then $\exists!\mathtt{n}$ such that $\mathtt{s}=(\mathtt{n})_{{}_{\mathtt{N}}}$ and $M\overset{{}_{\textsf{s}}}{\rightarrow}\mathtt{n}$ ; 5. 5.

$M\in\mathcal{H}$ * if and only if $(M)_{{}_{\mathtt{N}}}\in\mathcal{S}^{!}$ .*

Proof.

The proof is in Appendix X-A4. ∎

X-B Confluence and Finitary Standardization for $\Lambda_{\oplus}^{\mathtt{cbn}}$

The fact that surface reduction is preserved by $(.)_{{}_{\mathtt{N}}}$ is crucial to transfer the standardization result from $\Lambda_{\oplus}^{!}$ to $\Lambda_{\oplus}^{\mathtt{cbn}}$ . We show that via translation, $\Lambda_{\oplus}^{{\mathtt{cbn}}}$ inherits both the confluence and the surface standardization property from $\Lambda_{\oplus}^{!}$ .

Theorem 43 (Confluence).

The relation $\Rightarrow_{\mathtt{cbn}}$ is confluent.

Proof.

From Thm. 35, using back-and-forth Thm 42. ∎

Theorem 44 (Finitary Surface standardization).

If $\mathtt{m}\Rightarrow^{*}\mathtt{n}$ then exists $\mathtt{r}$ such that $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}$ and $\mathtt{r}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{n}$ .

Proof.

From Thm. 36, by using back-and-forth Thm 42, and the fact that the translation preserves surface reduction. ∎

In the classical $\lambda$ -calculus, the standardization property (Barendregt, Th. 11.4.7) says that every reduction sequence can be ordered in such a way to perform first only left $\beta$ -redexes, reading the term from left to right, and then internal ones (a redex is internal if it is not the leftmost one).

In $\Lambda_{\oplus}^{\mathtt{cbn}}$ this notion of standardization fails, as the following example (which we take from [19]) shows.

Example 45.

In each step, we underline the redex. Consider $\boldsymbol{[}(\lambda x.\underline{I(y\oplus z)})I\boldsymbol{]}\Rightarrow\boldsymbol{[}(\lambda x.\underline{y\oplus z})I\boldsymbol{]}\Rightarrow\boldsymbol{[}\underline{{\frac{1}{2}}(\lambda x.y)I},{\frac{1}{2}}(\lambda x.z)I\boldsymbol{]}\Rightarrow\boldsymbol{[}{\frac{1}{2}}y,{\frac{1}{2}}(\lambda x.z)I\boldsymbol{]}$ , where only the last step reduces a left redex. If we perform the left redex first, we have $\boldsymbol{[}(\lambda x.I(y\oplus z))I\boldsymbol{]}\Rightarrow\boldsymbol{[}I(y\oplus z)\boldsymbol{]}$ , from which $\boldsymbol{[}{\frac{1}{2}}y,{\frac{1}{2}}(\lambda x.z)I\boldsymbol{]}$ cannot be reached.

A consequence of standardization is that $M$ has a head normal form iff the $\overset{{}_{\textsf{h}}}{\rightarrow}$ -sequence from $M$ terminates. In the following section we retrieve an analogue of this result.

X-C Asymptotic behaviour

We denote by $\mathcal{H}_{\sim}$ the set of head normal forms up to the equivalence $=_{\beta}$ , and we define $\mathcal{N}^{{\mathtt{cbn}}}_{{}_{\{\}}}=\{\{M\},M\in\mathcal{N}^{{\mathtt{cbn}}}\}$ .

Observations

Observations are defined as in Def. 17.

Proposition 46.

Each of the following is a set of observations for $\Lambda_{\oplus}^{\mathtt{cbn}}$ : $\{\mathcal{H}\}$ , $\{\mathcal{N}^{\mathtt{cbn}}\}$ , $\mathcal{H}_{\sim}$ , $\mathcal{N}^{\mathtt{cbn}}_{{}_{\{\}}}$ .

Convergence and Limit distributions

Once we fix a set of observations $\mathtt{Obs}$ for $\Lambda_{\oplus}^{\mathtt{cbn}}$ , the definition of convergence and limit distribution are as in Def. 18. We observe that Theorems 22 and 23 both hold. Hence in particular

Theorem 47.

For any choice of $\mathtt{Obs}$ , the following holds in $\Lambda_{\oplus}^{\mathtt{cbn}}$ : given $\mathtt{m}$ , $\mathtt{Lim}(\mathtt{m})$ has a greatest element $\llbracket{\mathtt{m}}\rrbracket$ .

We now study the notion of convergence induced by* choosing head normal forms as outcome*, i.e. $\mathtt{Obs}:=\mathcal{H}_{\sim}$ . Therefore, if $\boldsymbol{\rho}\in\mathtt{Lim}(\mathtt{m})$ , it holds $\boldsymbol{\rho}\in\mathtt{MDST}(\mathcal{H}_{\sim})$ . The following results match the analogous results in $\Lambda_{\oplus}^{!}$ (Thm. 40 and 41).

Theorem 48.

Let $\mathtt{Obs}:=\mathcal{H}_{\sim}$ . For every multidistribution $\mathtt{m}$ :

•

$\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ * if and only if $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \textsf{s}\leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ .*

•

If $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}$ is a $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ -sequences of full surface reductions from $\mathtt{m}$ , then $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}\Downarrow\llbracket{\mathtt{m}}\rrbracket.$

Similarly to Prop. 29, it is not hard to prove that in $\Lambda_{\oplus}^{\mathtt{cbn}}$ , $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ satisfies a diamond property in the sense of [15], and hence all $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{s} $}\vss}}}$ -sequences from $\mathtt{m}$ converge to the same limit distribution. Since $\overset{{}_{\textsf{h}}}{\rightarrow}\subset\overset{{}_{\textsf{s}}}{\rightarrow}$ and since head reduction and surface reduction have the same normal forms, we can always choose a $\overset{{}_{\textsf{l}}}{\rightarrow}$ step whenever a $\overset{{}_{\textsf{s}}}{\rightarrow}$ -step is possible. This allows us to retrieve a result of completeness for head reduction: **

Let $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}$ be the $\mathrel{\mathop{\rightrightarrows}\limits^{\vbox to0.0pt{\kern 0.0pt\hbox{$ \tiny\textsf{h} $}\vss}}}$ -sequences of full head reductions from $\mathtt{m}$ . It holds that $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}\Downarrow\llbracket{\mathtt{m}}\rrbracket.$

Once again, this justifies a posteriori the choice of head reduction in probabilistic CbN (such as [13]). Observe that we follow the same reasoning as in the case of $\Lambda_{\oplus}^{\mathtt{cbv}}$ (with $\mathcal{V}_{\sim}$ as set of observations). First we proved that surface reduction is sufficient to reach the greatest limit distribution, then we observed that in particular left reduction can be chosen. There is a close parallelism between $\Lambda_{\oplus}^{\mathtt{cbv}}$ and $\Lambda_{\oplus}^{\mathtt{cbn}}$ : similar results hold if we consider as set of observations $\mathcal{V}_{\sim}$ and $\mathcal{H}_{\sim}$ respectively.

XI Conclusion and discussion

XI-A Summary

In this paper we design two probabilistic extensions of respectively the CbV and CbN $\lambda$ -calculus, $\Lambda_{\oplus}^{{\mathtt{cbv}}}$ and $\Lambda_{\oplus}^{{\mathtt{cbn}}}$ , which we propose as foundational calculi for probabilistic computation. Both calculi enjoy confluence and standardization, in an extended way. Namely, first we prove both properties for the finite sequences, exploiting classical methods, then we extend these properties to the limit, developing new sophisticated proof methods. In particular, we prove the uniqueness of the (maximal) result, parametrized by the notion of set of observations, and that the asymptotic extension of surface standardization supplies a family of complete reduction strategies which are guaranteed to reach the best result. The two calculi have a common root in the linear $\lambda$ -calculus $\Lambda_{\oplus}^{!}$ , which is both a technical tool and a calculus of interest in its own, in which a fine control of the interaction between copying and choice is possible.

In all three calculi, $\beta$ -reduction is unconstrained; hence for each calculus, its restriction to only $\beta$ -reduction exactly gives the usual corresponding (CbN, CbV, or linear) $\lambda$ -calculus; this is not the case for extensions in which a strategy is fixed.

New proof methods include the asymptotic extension of surface standardization (Thm. 26), and the use of a translation to transfer standardization properties, namely from $\Lambda_{\oplus}^{!}$ to $\Lambda_{\oplus}^{{\mathtt{cbn}}}$ . It is worth stressing a crucial element: the fact that the translation is sound, complete and preserves surface contexts is what allows us to transfer the results.

XI-B Discussion

Relating the calculi (Girard’s Translations)

The key to understand how $\Lambda_{\oplus}^{\mathtt{cbv}}$ , $\Lambda_{\oplus}^{\mathtt{cbn}}$ , and $\Lambda_{\oplus}^{!}$ relate are the two Girard’s translations which embed intuitionistic logic into linear logic, and which are well known to respectively correspond to CbN and CbV computations. Let us clarify this. Let us start from $\Lambda_{\oplus}^{!}$ : the natural constraint to avoid copying the result of a choice is "no $\oplus$ -reduction in the scope of $!$ " (i.e., inside a !-box). Using the intuition provided by Girard’s translations as a guide, the constraint above becomes respectively "no $\oplus$ -reduction in the scope of a $\lambda$ -abstraction" (in CbV) and "no $\oplus$ -reduction in argument position" (in CbN). Our three notions of surface context express these three constraints.

The intuitive reasoning above can be formalized thanks to a recent line of work [12, 17], which internalizes the insights coming from linear logic and proof nets into a $\lambda$ -syntax. The resulting calculus subsumes both CbN and CbV $\lambda$ -calculi via Girard’s translation. The idea of a system which subsumes both CbV and CbN had been already advocated and developed by Levy, via the Call-By-Push-Value paradigm [20]. And indeed, [12] can be seen as an untyped version of Levy’s calculus. We leave to the future a comprehensive approach, where a probabilistic linear calculus is the metalanguage in which all the results are developed.

On non-deterministic $\lambda$ -calculi

The finitary results we presented (namely, confluence and finitary surface standardization) also hold if the probabilistic choice is replaced by non-deterministic choice (just forget the coefficients). Asymptotic results, instead, are specific to probabilistic computation.

$\Lambda_{\oplus}^{!}$ and quantum $\lambda$ -calculi

The fine control of duplication which $\Lambda^{!}$ inherits from linear logic has made it an ideal base for quantum $\lambda$ -calculi (such as [7, 6]). In those calculi,* surface reduction* is the key ingredient to allow for the coexistence of quantum bits with duplication and erasing.

No reduction (not even $\beta$ ) is allowed in the scope of a $!$ operator. Our results show that $\beta$ -reduction can be unrestricted, only measurement (the quantum analogue of $\oplus$ ) needs to be surface.

-C Proofs of Section V-B

We prove Thm. 12, i.e. finitary Surface Standardization for $\Lambda_{\oplus}^{\mathtt{cbv}}$ . We start by establishing Surface Standardization for the (non probabilistic) call-by-value $\lambda$ -calculus, $\Lambda^{\mathtt{cbv}}$ , in -C2. This result is folklore, but we could not find it in the literature. In -C3 we extend the result to $\Lambda_{\oplus}^{\mathtt{cbv}}$ .

-C1 Preliminary definitions

Surface and left reduction

Surface and left reduction have been defined in Sec. V-B1; Fig. 9 and Fig. 10 give explicitly the inference rules for surface and left steps; we use the notation defined below:

Notation.

If $\mathtt{m}=\boldsymbol{[}p_{i}M_{i}\mid i\in I\boldsymbol{]}$ , we write $\mathtt{m}@Q$ for $\boldsymbol{[}p_{i}(M_{i}Q)\mid i\in I\boldsymbol{]}$ , and $Q@\mathtt{m}$ for $\boldsymbol{[}p_{i}(QM_{i})\mid i\in I\boldsymbol{]}$ .

We recall that a reduction step $\rightarrow$ is deep, written $\overset{{{}_{\textsf{d}}}}{\rightarrow}$ , (resp. internal, written $\overset{{{}_{\textsf{int}}}}{\rightarrow}$ ) if it is not a surface step (a left step). We have already observed that $\overset{{{}_{\textsf{d}}}}{\rightarrow}\subset\overset{{{}_{\textsf{int}}}}{\rightarrow}$ , and that since a $\oplus$ -redex is always surface, a $\overset{{{}_{\textsf{d}}}}{\rightarrow}$ step is always a $\rightarrow_{\beta_{v}}$ step.

Parallel $\beta_{v}$ -reduction

•

Parallel $\beta_{v}$ -reduction is a standard definition, and is given in Fig. 12. We define its lifting ${\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\Rightarrow}_{\beta_{v}}$ as usual (see Section IV-A).

•

Deep parallel reduction ( ${\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}$ , with lifting ${\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}$ ) indicates that $M{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\Rightarrow}\boldsymbol{[}S\boldsymbol{]}$ and $M{\overset{{}_{\textsf{d}}}{\Rightarrow}}^{*}\boldsymbol{[}S\boldsymbol{]}$ . We make the rules explicit in Fig. 12.

Fact 49.

The following holds

[TABLE]

Translation

We refine the translation given in in Sec. IV-B in order to preserves surface reduction.

Let $z,w$ be fresh variables. $(\cdot)_{\lambda}:\Lambda_{\oplus}\rightarrow\Lambda$ is defined as follows:

[TABLE]

The following is straightforward to check.

Lemma 50.

Assume $M\in\Lambda_{\oplus}$ .

$P\rightarrow_{\beta_{v}}\boldsymbol{[}Q\boldsymbol{]}$ * and $(Q)_{\lambda}=S$ (in $\Lambda_{\oplus}$ ) $\Longleftrightarrow$ $(P)_{\lambda}\rightarrow_{\beta_{v}}S$ (in $\Lambda$ ).* 2. 2.

$P\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}\boldsymbol{[}Q\boldsymbol{]}$ * and $(Q)_{\lambda}=S$ (in $\Lambda_{\oplus}$ ) $\Longleftrightarrow$ $(P)_{\lambda}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}S$ (in $\Lambda$ ).* 3. 3.

$P{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}\boldsymbol{[}Q\boldsymbol{]}$ * and $(Q)_{\lambda}=S$ (in $\Lambda_{\oplus}$ ) $\Longleftrightarrow$ $(P)_{\lambda}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}(Q)_{\lambda}$ (in $\Lambda$ ).*

-C2 $\Lambda^{\mathtt{cbv}}$ and Surface Standardization

With the standard definition of left, internal, and parallel reduction (denoted $\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}},\overset{{{}_{\textsf{int}}}}{\rightarrow}_{\beta_{v}},{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}$ , respectively) the following results are well known to hold (see [25, 27]).

(a)

If $M\rightarrow_{\beta_{v}}^{*}N$ then exists $S$ such that

[TABLE]

(b)

If $M{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}N$ then exists $S$ s.t. $M\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}^{*}S{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{int}}}}{\rightarrow}}_{\beta_{v}}N$ .

(c)

If $M{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{int}}}}{\rightarrow}}_{\beta_{v}}M^{\prime}\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}N$ , it exists $S$ s.t. $M\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}^{*}S{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{int}}}}{\rightarrow}}_{\beta_{v}}N$ .

The results of the following lemma are immediately obtained from the previous ones, by observing that a left reduction is a surface reduction, a deep reduction is always an internal reduction, and that $\overset{{{}_{\textsf{int}}}}{\rightarrow}_{\beta_{v}}$ does not modify the shape of a term (see [27]).

Lemma 51.

If $M\rightarrow_{\beta_{v}}^{*}N$ then exists $S$ such that

[TABLE] 2. 2.

If $M{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}N$ then exists $S$ s.t. $M\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}S{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}N$ . 3. 3.

If $M{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}M^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}N$ then exists $S$ s.t. $M\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}S{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}N$ .

Proof.

The first two are by induction on $N$ . We recall that $\overset{{}_{\textsf{l}}}{\rightarrow}\subseteq\overset{{}_{\textsf{s}}}{\rightarrow}$ .

By (a) $M\rightarrow_{\beta_{v}}^{*}N$ implies $M\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}^{*}S\overset{{{}_{\textsf{int}}}}{\rightarrow}_{\beta_{v}}^{*}N$ . We examine $N$ .

•

$N=x$ . Then $S=N$ and the result holds trivially.

•

$N=\lambda x.P$ . Hence $S=\lambda x.Q\overset{{{}_{\textsf{int}}}}{\rightarrow}_{\beta_{v}}^{*}\lambda x.P$ . Then $M\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}^{*}\lambda x.Q\overset{{{}_{\textsf{d}}}}{\rightarrow}_{\beta_{v}}^{*}\lambda x.P$ .

•

$N=PQ$ . Then $S=P^{\prime}Q^{\prime}$ , where $P^{\prime}\rightarrow_{\beta_{v}}^{*}P$ and $Q^{\prime}\rightarrow_{\beta_{v}}^{*}Q$ . By induction $P^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}P^{\prime\prime}\overset{{{}_{\textsf{d}}}}{\rightarrow}_{\beta_{v}}^{*}P$ and $Q^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}Q^{\prime\prime}\overset{{{}_{\textsf{d}}}}{\rightarrow}_{\beta_{v}}^{*}Q$ , and the desired sequence is $M\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}^{*}P^{\prime}Q^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}P^{\prime\prime}Q^{\prime\prime}\overset{{{}_{\textsf{d}}}}{\rightarrow}_{\beta_{v}}^{*}PQ$ .

The result follows since $\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}\subset\overset{{{}_{\textsf{d}}}}{\rightarrow}_{\beta_{v}}$ . 2. 2.

Similar to the previous one, using (b) , i.e., the fact that $M{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}N$ implies $M\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}^{*}S{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{int}}}}{\rightarrow}}_{\beta_{v}}N$ .

•

$N=\lambda x.P$ . Then $S=\lambda x.Q$ and $Q{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}P$ . By definition, $M\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}^{*}\lambda x.Q{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}\lambda x.P$ .

•

$N=PQ$ . Then $S=P^{\prime}Q^{\prime}$ , with $P^{\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}P$ and $Q^{\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}Q$ . By induction $P^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}P^{\prime\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}P$ and $Q^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}Q^{\prime\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}Q$ , and the desired sequence is $M\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}^{*}P^{\prime}Q^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}P^{\prime\prime}Q^{\prime\prime}\overset{{{}_{\textsf{d}}}}{\rightarrow}_{\beta_{v}}^{*}PQ$ . 3. 3.

By induction on $M$ .

•

$M=x$ or $M=\lambda x.P$ . Immediate.

•

$M=(\lambda x.P)V$ . Assume $(\lambda x.P)V{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}(\lambda x.P^{\prime})V^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}N$ . Since the deep step is an internal step, the surface step is a left step, we have $(\lambda x.P)V{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{int}}}}{\rightarrow}}_{\beta_{v}}(\lambda x.P^{\prime})V^{\prime}\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}N$ . From (c), it exists $S$ , $M\overset{{}_{\textsf{l}}}{\rightarrow}_{\beta_{v}}^{*}S{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{int}}}}{\rightarrow}}_{\beta_{v}}N$ . The ${\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{int}}}}{\rightarrow}}_{\beta_{v}}$ step is in particular a ${\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}$ , hence from point 2 it holds that $S\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}S^{\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}N$ , hence the claim.

•

$M=PQ$ . By hypothesis, $PQ{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}P^{\prime}Q^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}N$ ; the surface redex is inside either $P^{\prime}$ or $Q^{\prime}$ , say $Q^{\prime}$ . We have $N=P^{\prime}R$ , $Q{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}Q^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}R$ and by induction $Q\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}R^{\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}R$ . Hence $PQ\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}^{*}PR^{\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}_{\beta_{v}}P^{\prime}R$ .

∎

-C3 Surface Standardization in $\Lambda_{\oplus}^{\mathtt{cbv}}$

In order to prove Theorem 12, we need a lemma.

Lemma 52.

If $M{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}\boldsymbol{[}M^{\prime}\boldsymbol{]}$ and $M^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}\mathtt{n}$ , then it exists $\mathtt{s}$ , such that $\boldsymbol{[}M\boldsymbol{]}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}$ and $\mathtt{s}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\mathtt{n}$ .

Proof.

If $M^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta_{v}}\mathtt{n}$ , the claim holds by simulation in $\Lambda^{\mathtt{cbv}}$ and Lemma 51, point (3). If $M^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\oplus}\mathtt{n}$ , we procede by induction on $M$ .

The case $M=x$ and $M=\lambda x.P$ do not apply. 2. 2.

Let $M=P\oplus Q$ . Assume $P\oplus Q{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}\boldsymbol{[}R\oplus S\boldsymbol{]}$ , so $P{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}R$ , $Q{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\rightarrow}_{\beta_{v}}S$ , and $R\oplus S\overset{{}_{\textsf{s}}}{\rightarrow}_{\oplus}\boldsymbol{[}\frac{1}{2}R,\frac{1}{2}S\boldsymbol{]}$ . By Lemma 51, point 2 and simulation in $\Lambda^{\mathtt{cbv}}$ , it holds that $P\overset{{}_{\textsf{s}}}{\rightarrow}^{*}P^{\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}R$ and $Q\overset{{}_{\textsf{s}}}{\rightarrow}^{*}Q^{\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}S$ . Therefore $P\oplus Q\overset{{}_{\textsf{s}}}{\rightarrow}_{\oplus}\boldsymbol{[}\frac{1}{2}P,\frac{1}{2}Q\boldsymbol{]}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\boldsymbol{[}\frac{1}{2}P^{\prime},\frac{1}{2}Q^{\prime}\boldsymbol{]}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\boldsymbol{[}\frac{1}{2}R,\frac{1}{2}S\boldsymbol{]}$ 3. 3.

Let $M=PQ$ . Assume $PQ{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}\boldsymbol{[}RT\boldsymbol{]}$ (with $P{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}R$ and $Q{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}T$ ) and $RT\rightarrow_{\oplus}\mathtt{n}$ with the $\oplus$ -redex in either $R$ or $T$ , say is in $R$ . Hence $R\rightarrow_{\oplus}\mathtt{r}=\boldsymbol{[}\frac{1}{2}R_{i}\mid i\in\{1,2\}\boldsymbol{]}$ and $RT\overset{{}_{\textsf{s}}}{\rightarrow}\boldsymbol{[}\frac{1}{2}R_{i}T\mid i\in\{1,2\}\boldsymbol{]}=\mathtt{n}$ . By induction, from $P{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}\boldsymbol{[}R\boldsymbol{]}\overset{{}_{\textsf{s}}}{\Rightarrow}\mathtt{r}$ we have $\boldsymbol{[}P\boldsymbol{]}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\boldsymbol{[}\frac{1}{2}S_{i}\mid i\in\{1,2\}\boldsymbol{]}$ and $S_{i}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}R_{i}$ . Therefore $\boldsymbol{[}PQ\boldsymbol{]}\overset{{}_{\textsf{s}}}{\rightarrow}^{*}\boldsymbol{[}\frac{1}{2}S_{i}Q\mid i\in\{1,2\}\boldsymbol{]}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\boldsymbol{[}\frac{1}{2}R_{i}T\mid i\in\{1,2\}\boldsymbol{]}=\mathtt{n}$ .

∎

Corollary 53.

If $\mathtt{m}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\mathtt{n}$ and $\mathtt{n}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}$ , then exists $\mathtt{s}$ with $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}$ and $\mathtt{s}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\mathtt{r}$ .

Proof.

By induction on the length $k$ of $\mathtt{n}\overset{{}_{\textsf{s}}}{\Rightarrow}^{(k)}\mathtt{r}$ . If $k=0$ the result is trivial. Otherwise, let $\mathtt{n}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}$ be $\mathtt{n}\overset{{}_{\textsf{s}}}{\Rightarrow}\mathtt{n}_{1}\overset{{}_{\textsf{s}}}{\Rightarrow}^{(k-1)}\mathtt{r}$ . By Lemma 52, from $\mathtt{m}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}\mathtt{n}\overset{{}_{\textsf{s}}}{\Rightarrow}\mathtt{n}_{1}$ we have that $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\mathtt{n}_{1}\overset{{}_{\textsf{s}}}{\Rightarrow}^{(k-1)}\mathtt{r}$ . By inductive hypothesis, $\mathtt{s}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}r^{\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}r$ , hence $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}^{\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\mathtt{r}$ . ∎

Now we are able to prove the theorem:

Thm. 12: if $\mathtt{m}\Rightarrow^{*}\mathtt{n}$ then then exists $\mathtt{r}$ such that $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}$ and $\mathtt{r}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{n}$ .

Proof.

By induction on the length $k$ of the reduction $\mathtt{m}\Rightarrow^{*}\mathtt{n}$ , using Corollary 53.

If $k=0$ , the result is trivial ( $\mathtt{r}=\mathtt{m}$ ). Otherwise, $\mathtt{m}\Rightarrow\mathtt{m}_{1}\Rightarrow^{*}\mathtt{n}$ . By induction, we have $\mathtt{m}_{1}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{n}$ . We can separate the first step in two: $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}\mathtt{m}^{\prime}\overset{{}_{\textsf{d}}}{\Rightarrow}\mathtt{m}_{1}$ , by reducing first only the elements of $\mathtt{m}$ which have a surface reduction, and then only the elements which have a deep reduction. The step $\mathtt{m}^{\prime}\overset{{}_{\textsf{d}}}{\Rightarrow}\mathtt{m}_{1}$ can be regarded as a parallel step. By Corollary 53, from $\mathtt{m}^{\prime}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\mathtt{m}_{1}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}$ we obtain $\mathtt{m}^{\prime}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\mathtt{r}$ , hence it holds that $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}\mathtt{m}^{\prime}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{r}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{n}$ .

∎

-D Proofs of Section VI

Monotone Convergence.

We recall the following standard result.

Theorem (Monotone Convergence for Sums).

Let $\mathcal{X}$ be a countable set, $f_{n}:\mathcal{X}\to[0,\infty]$ a non-decreasing sequence of functions, such that $f(x):=\lim_{n\to\infty}f_{n}(x)=\sup_{n}f_{n}(x)$ exists for each $x\in\mathcal{X}$ . Then

[TABLE]

Hence, given $\mu_{n}:\mathtt{Obs}\to[0,1]$ and $\boldsymbol{\rho}(\mathbf{U})=\lim_{n\to\infty}\mu_{n}(\mathbf{U})$ , the following holds:

[TABLE]

Existence of maximals.

We recall the definition of norm $\|\mu\|=\sum_{x\in\mathcal{X}}\mu(x)$ .

Lemma 54 (Existence of maximals).

Confluence implies that:

$\texttt{Norms}(\mathtt{m})=\{\|\boldsymbol{\mu}\|\leavevmode\nobreak\ \mid\leavevmode\nobreak\ \boldsymbol{\mu}\in Lim(\mathtt{m})\}$ * has a greatest element;* 2. 2.

$Lim(\mathtt{m})$ * has maximal elements.*

Proof.

(1. ) Let $p=\sup\leavevmode\nobreak\ {\texttt{Norms}(\mathtt{m})}$ . We show that $p\in\texttt{Norms}(\mathtt{m})$ , by providing a rewrite sequence $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}$ from $\mathtt{m}$ such that $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\tau}$ and $\|\boldsymbol{\tau}\|=p$ .

The following facts are all easy to check:

a.

If $\alpha<\beta$ then $\|\alpha\|<\|\beta\|$ .

b.

If $p\not\in\texttt{Norms}(\mathtt{m})$ , then for each $\epsilon$ , there exists $\boldsymbol{\mu}\in Lim(\mathtt{m})$ such that $\|\boldsymbol{\mu}\|\geq p-\epsilon$ .

c.

The Main Lemma implies that, fixed $\epsilon$ , if $\mathtt{m}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ with $\|\boldsymbol{\mu}\|\geq(p-\epsilon)$ , and $\mathtt{m}\Rightarrow^{*}\mathtt{s}$ , then there exists $\mathtt{s}^{\prime}$ , such that $\mathtt{s}\Rightarrow^{*}\mathtt{s}^{\prime}$ and $\|\sigma^{\prime}\|\geq(p-2\epsilon)$ .

(Proof: Main Lemma implies that there is a rewrite sequence $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}$ from $\mathtt{s}$ which converges to $\boldsymbol{\sigma}\geq\boldsymbol{\mu}$ . Therefore $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\sigma}$ where $\|\boldsymbol{\sigma}\|\geq(p-\epsilon)$ . For the same $\epsilon$ , there is an index $N$ such that $\mathtt{s}\Rightarrow^{*}\mathtt{s}_{N}$ and $\|\sigma_{N}\|\geq(\|\boldsymbol{\sigma}\|-\epsilon)$ , hence $\|\sigma_{N}\|\geq p-2\epsilon$ . )

d.

$\forall\delta\in\mathbb{R}^{+}$ there exists $k$ such that $\frac{p}{2^{k}}\leq\delta$ .

For each $k\in\mathbb{N}$ , let $\epsilon_{k}=\frac{p}{2^{k}}$ . Let $\mathtt{s}^{(0)}=\mathtt{m}$ . From here, we build a sequence of reductions $\mathtt{m}\Rightarrow^{*}\mathtt{s}^{(1)}\Rightarrow^{*}s^{(2)}\Rightarrow^{*}\dots$ whose limit has norm $p$ , as illustrated in Fig. 13. For each $k>0$ , we observe that:

•

By (b.) there exists $\boldsymbol{\mu}^{(k)}\in Lim(\mathtt{m})$ such that $\|\boldsymbol{\mu}^{(k)}\|\geq(p-\frac{1}{2}\frac{p}{2^{k}})$ .

•

From $\mathtt{m}\Rightarrow^{*}\mathtt{s}^{(k-1)}$ , we use (c.) to establish that there exists $\mathtt{s}^{(k)}$ such that $\mathtt{s}^{(k-1)}\Rightarrow^{*}\mathtt{s}^{(k)}$ and $\|{\sigma^{(k)}}\|\geq(p-\frac{p}{2^{k}})$ . Observe that $\boldsymbol{\mu}^{(k)},\mathtt{s}^{(k-1)},\mathtt{s}^{(k)}$ resp. instantiate $\boldsymbol{\mu},\mathtt{s},\mathtt{s}^{\prime}$ of (c.).

Let $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}$ be the concatenation of all the finite sequences $\mathtt{s}^{(k-1)}\Rightarrow^{*}\mathtt{s}^{(k)}$ . By construction, $\langle\mathtt{s}_{n}\rangle_{n\in\mathbb{N}}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\tau}$ such that $\|\boldsymbol{\tau}\|=p$ . Hence $p\in\texttt{Norms}(\mathtt{m})$ .

**(1. $\Rightarrow$ 2.) ** We observe that if $\langle\mathtt{m}_{n}\rangle_{n\in\mathbb{N}}\overset{{}_{\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \infty}}{\Rightarrow}\boldsymbol{\mu}$ and $\|\boldsymbol{\mu}\|$ is maximal in $\texttt{Norms}(\mathtt{m})$ , then $\boldsymbol{\mu}$ is maximal in $Lim(\mathtt{m})$ , because of (a.).

∎

-E Proofs of Section IX

-E1 Confluence of $\Lambda_{\oplus}^{!}$

We prove Theorem 35. First, we need to prove some preliminary results.

Lemma 55.

If $M\rightarrow_{\beta}\mathtt{n}$ and $M\rightarrow_{\oplus}\mathtt{s}$ , then exists $\mathtt{r}$ such that $\mathtt{n}\Rightarrow_{\oplus}\mathtt{r}$ and $\mathtt{s}\Rightarrow_{\beta}\mathtt{r}$

Proof.

We reason by induction on $M$ . The key case is case 5. Case $M=x$ and $M=!P$ are not possible given the hypothesis.

Case $M=P\oplus Q$ . Similar to Lemma 8, case (1) 2. 2.

Case $M=\bm{S}(Q)$ , and both redexes are inside $Q$ . Similar to Lemma 8, case (2.2b). 3. 3.

Case $M=PQ$ , with the $\beta$ -redex inside $P$ , and the $\oplus$ -redex inside $Q$ . Similar to Lemma 8, case (2.2a). 4. 4.

Case $M=(\lambda!x.P)!Q$ , where $M$ is the $\beta$ -redex. The $\oplus$ -redex needs to be inside $P$ . Assume $P\rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}P_{1},\frac{1}{2}P_{2}\boldsymbol{]}$ . We have $M\rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}(\lambda!x.P_{1})!Q,\frac{1}{2}(\lambda!x.P_{2})!Q\boldsymbol{]}$ , and $M\rightarrow_{\beta}\boldsymbol{[}P[Q/x]\boldsymbol{]}$ . It is immediate that the multidistribution $\mathtt{r}=\boldsymbol{[}\frac{1}{2}P_{1}[Q/x],\frac{1}{2}P_{2}[Q/x]\boldsymbol{]}$ satisfies the claim. 5. 5.

Case $M=(\lambda x.P)Q$ , where $M$ is the $\beta$ -redex. If the $\oplus$ -redex is inside $P$ , we reason as above. Assume that the $\oplus$ -redex is inside $Q$ , and we have $Q\rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}Q_{1},\frac{1}{2}Q_{2}\boldsymbol{]}$ . The key observation is that in $P$ there is *at most one occurrence *of $x$ . Let assume there is exactly one occurrence (the case of none is easy). Let ${\bf C}$ be the context such that $P={\bf C}(x)$ (i.e., ${\bf C}$ is $P$ , with a hole in the place of $x$ ). Observe that $P[Q/x]={\bf C}(Q)$ . We have $M\rightarrow_{\beta}\boldsymbol{[}P[Q/x]={\bf C}(Q)\boldsymbol{]}$ , and $M\rightarrow_{\oplus}\boldsymbol{[}\frac{1}{2}(\lambda x.P)Q_{1},\frac{1}{2}(\lambda x.P)Q_{2}\boldsymbol{]}$ . The multidistribution $\mathtt{r}=\boldsymbol{[}\frac{1}{2}{\bf C}(Q_{1}),\frac{1}{2}{\bf C}(Q_{2})\boldsymbol{]}$ satisfies the claim.

∎

Lem. 34:

The reduction $\Rightarrow_{\oplus}$ is diamond. 2. 2.

The reduction $\Rightarrow_{\beta}$ is confluent. 3. 3.

The reductions $\Rightarrow_{\beta}$ and $\Rightarrow_{\oplus}$ commute.

Proof.

Same proof as for Lemma 7. 2. 2.

Inherited from $\Lambda^{!}$ via the translation $(.)_{!}$ and Prop. 33. 3. 3.

We prove that $\Rightarrow_{\beta}$ and $\Rightarrow_{\oplus}$ $\diamond$ -commute, by using Lemma 5 and Lemma 55.

∎

Thm. 35 The reduction $\Rightarrow$ of $\Lambda_{\oplus}^{!}$ is confluent.

Proof.

By Hindley-Rosen Lemma, from Lemma 34. ∎

-E2 Surface Standardization in $\Lambda_{\oplus}^{!}$

In order to prove Proposition 36, first we prove a lemma.

Lemma 56.

If $M{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{{}_{\textsf{d}}}}{\rightarrow}}M^{\prime}$ and $M^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}\mathtt{n}$ , then $\boldsymbol{[}M\boldsymbol{]}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}$ and $\mathtt{s}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\mathtt{n}$ .

Proof.

If $M^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\beta}\mathtt{n}$ , the claim holds by simulation in $\Lambda^{!}$ . If $M^{\prime}\overset{{}_{\textsf{s}}}{\rightarrow}_{\oplus}\mathtt{n}$ , we procede by induction on $M$ .

The case $M=x$ and $M=!P$ do not apply. 2. 2.

Let $M=P\oplus Q$ . Similar to Lemma 52, Point (2.). 3. 3.

Let $M=PQ,\leavevmode\nobreak\ \lambda!x.P$ or $\lambda x.P$ . Similar to Lemma 52, Point (3.).

∎

Corollary 57.

If $\mathtt{m}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\mathtt{n}$ and $\mathtt{n}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}$ , then exists $\mathtt{s}$ with $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{s}$ and $\mathtt{s}{\leavevmode\nobreak\ \shortparallel\mkern-3.0mu\mkern-8.0mu\overset{{}_{\textsf{d}}}{\Rightarrow}}\mathtt{r}$ .

Proof.

Same as Lemma 53, using Lemma 56. ∎

Then we can prove:

Prop. 36 If $\mathtt{m}\Rightarrow^{*}\mathtt{n}$ then there exists $\mathtt{r}$ such that $\mathtt{m}\overset{{}_{\textsf{s}}}{\Rightarrow}^{*}\mathtt{r}$ and $\mathtt{r}\overset{{}_{\textsf{d}}}{\Rightarrow}^{*}\mathtt{n}$ .

Proof.

Same as the proof of Thm. 12, by induction on the length of the reduction $\mathtt{m}\Rightarrow^{*}\mathtt{n}$ , using this time Corollary 57.

∎

-F Proofs of Section X

-F1 $\Lambda_{\oplus}^{!}$ is a conservative extension of $\Lambda_{\oplus}^{\mathtt{cbn}}$ (X-A4)

To prove Proposition 42, we first prove that $(\cdot)_{{}_{\mathtt{N}}}$ preserves both surface contexts and $\oplus$ -redexes.

Lemma 58.

Given $M\in\Lambda_{\oplus}^{\mathtt{cbn}}$ , (S1) holds $\Longleftrightarrow$ (S2) holds, where:

S1:

in $\Lambda_{\oplus}^{\mathtt{cbn}}$ , there exists $\bm{S}$ surface context and a redex $r=R_{1}\oplus R_{2}$ such that $M=\bm{S}(r)$ ; 2. S2:

in $\Lambda_{\oplus}^{!}$ there exists $\bm{T}$ surface context and a redex $u=U_{1}\oplus U_{2}$ such that $(M)_{{}_{\mathtt{N}}}=\bm{T}(u)$ ;

and moreover $(\bm{S}(R_{i}))_{{}_{\mathtt{N}}}=\bm{T}(U_{i})$ , for $i\in\{1,2\}$ .

Proof.

$\Longrightarrow$ . By induction on the form of the surface context.

•

$\square$ . Since $M=r$ , then $(M)_{{}_{\mathtt{N}}}=(R_{1})_{{}_{\mathtt{N}}}\oplus(R_{2})_{{}_{\mathtt{N}}}$ . Hence $u=(R_{1})_{{}_{\mathtt{N}}}\oplus(R_{2})_{{}_{\mathtt{N}}}$ and $\bm{T}=\square$ satisfy the claim.

•

$\bm{S}Q$ . We have that $M=\bm{S}Q(r)=\bm{S}(r)Q$ . Hence $(\bm{S}(r)Q)_{{}_{\mathtt{N}}}=(\bm{S}(r))_{{}_{\mathtt{N}}}!(Q)_{{}_{\mathtt{N}}}$ . By inductive hypothesis, there exist $\bm{T}^{\prime}$ and $u$ such that $(\bm{S}(r))_{{}_{\mathtt{N}}}=\bm{T}^{\prime}(u)$ , and $(\bm{S}(R_{i}))_{{}_{\mathtt{N}}}=\bm{T}^{\prime}(U_{i})$ . By definition of surface context in $\Lambda_{\oplus}^{!}$ , the claim hold with $\bm{T}=\bm{T}^{\prime}!(Q)_{{}_{\mathtt{N}}}$ , and the same $u$ .

•

$\lambda x.\bm{S}$ . $(\lambda x.\bm{S}(r))_{{}_{\mathtt{N}}}=\lambda!x.(\bm{S}^{\prime}(r))_{{}_{\mathtt{N}}}$ , and the claim holds by inductive hypothesis.

$\Longleftarrow$ . We examine the possible form of $\bm{T}$ , given that $(M)_{{}_{\mathtt{N}}}=\bm{T}(u)$ ; we prove that $M=\bm{S}(r)$ and that $(\bm{S}(R_{i}))_{{}_{\mathtt{N}}}=\bm{T}(U_{i})$ .

•

$\square$ . Immediate.

•

$\bm{T}^{\prime}Q$ . We have that $(\bm{T}^{\prime}Q)(u)=\bm{T}^{\prime}(u)Q$ , and $\bm{T}^{\prime}(u)Q=(LN)_{{}_{\mathtt{N}}}=(L)_{{}_{\mathtt{N}}}!(N)_{{}_{\mathtt{N}}}$ with $M=LN$ . Therefore $\bm{T}^{\prime}(u)=(L)_{{}_{\mathtt{N}}}$ , and the claim holds by inductive hypothesis and definition of surface context.

•

$\lambda!x.\bm{T}^{\prime}$ . We have that $(\lambda!x.\bm{T}^{\prime})(u)=\lambda!x.\bm{T}^{\prime}(u)$ and $\lambda!x.\bm{T}^{\prime}(u)=\lambda!x.(M^{\prime})_{{}_{\mathtt{N}}}$ , with $M=\lambda x.M^{\prime}$ . The claim holds by inductive hypothesis.

∎

Proposition.

42. [Simulation] The translation $(.)_{{}_{\mathtt{N}}}$ is sound and complete; it preserves surface reduction and surface normal forms. Let $M\in\Lambda_{\oplus}^{\mathtt{cbn}}$ ; the following hold:

if $M\rightarrow\mathtt{n}$ then $(M)_{{}_{\mathtt{N}}}\rightarrow(\mathtt{n})_{{}_{\mathtt{N}}}$ ; 2. 2.

if $M\overset{{}_{\textsf{s}}}{\rightarrow}\mathtt{n}$ then $(M)_{{}_{\mathtt{N}}}\overset{{}_{\textsf{s}}}{\rightarrow}(\mathtt{n})_{{}_{\mathtt{N}}}$ ; 3. 3.

if $(M)_{{}_{\mathtt{N}}}\rightarrow\mathtt{s}$ then $\exists!\mathtt{n}$ such that $\mathtt{s}=(\mathtt{n})_{{}_{\mathtt{N}}}$ and $M\rightarrow\mathtt{n}$ ; 4. 4.

if $(M)_{{}_{\mathtt{N}}}\overset{{}_{\textsf{s}}}{\rightarrow}\mathtt{s}$ then $\exists!\mathtt{n}$ such that $\mathtt{s}=(\mathtt{n})_{{}_{\mathtt{N}}}$ and $M\overset{{}_{\textsf{s}}}{\rightarrow}\mathtt{n}$ ; 5. 5.

$M\in\mathcal{H}$ if and only if $(M)_{{}_{\mathtt{N}}}\in\mathcal{S}^{!}$ .

Proof.

We prove (1.)-(4.); since $\rightarrow\leavevmode\nobreak\ =\leavevmode\nobreak\ \rightarrow_{\beta}\cup\rightarrow_{\oplus}$ , we deal separately with the two reductions. Point (5.) is an immediate consequence of the other points.

$\rightarrow_{\beta}$

We deal with $\rightarrow_{\beta}$ via simulation in $\Lambda^{\mathtt{cbn}}$ and $\Lambda^{!}$ , since the analogous result is proved in [17]. We have defined a translation $(-)_{!}:\Lambda_{\oplus}^{!}\to\Lambda^{!}$ which is sound and complete, and preserves surface reduction. It is straightforward to define a similar translation from $\Lambda_{\oplus}^{\mathtt{cbn}}$ into $\Lambda^{\mathtt{cbn}}$ . Therefore, if in $\Lambda_{\oplus}^{\mathtt{cbn}}$ it holds $M\rightarrow_{\beta}N$ , we translate in $\Lambda^{\mathtt{cbn}}$ , use the result in [17] and conclude (via simulation) that $(M)_{{}_{\mathtt{N}}}\rightarrow_{\beta}(N)_{{}_{\mathtt{N}}}$ in $\Lambda_{\oplus}^{!}$ . Similarly for (2.)-(3.)-(4.).

$\rightarrow_{\oplus}$

Immediate consequence of Lemma 58, which proves that that $(\cdot)_{{}_{\mathtt{N}}}$ preserves both surface contexts and $\oplus$ -redexes.

∎

Bibliography31

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] P. Arrighi and G. Dowek. Lineal: A linear-algebraic lambda-calculus. Logical Methods in Computer Science , 13(1), 2017.
2[2] M. Avanzini, U. Dal Lago, and A. Yamada. On probabilistic term rewriting. In J. P. Gallagher and M. Sulzmann, editors, Functional and Logic Programming - 14th International Symposium, FLOPS 2018, Nagoya, Japan, May 9-11, 2018, Proceedings , volume 10818 of Lecture Notes in Computer Science , pages 132–148. Springer, 2018.
3[3] G. Bacci, R. Furber, D. Kozen, R. Mardare, P. Panangaden, and D. Scott. Boolean-valued semantics for the stochastic λ 𝜆 \lambda -calculus. In A. Dawar and E. Grädel, editors, Proceedings of the 33rd Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2018, Oxford, UK, July 09-12, 2018 , pages 669–678. ACM, 2018.
4[4] H. P. Barendregt. The Lambda Calculus: Its Syntax and Semantics , volume 103. North Holland, 1984.
5[5] J. Borgström, U. Dal Lago, A. D. Gordon, and M. Szymczak. A lambda-calculus foundation for universal probabilistic programming. In Proceedings of the 21st ACM SIGPLAN International Conference on Functional Programming, ICFP 2016, Nara, Japan, September 18-22, 2016 , pages 33–46, 2016.
6[6] U. Dal Lago, C. Faggian, B. Valiron, and A. Yoshimizu. The geometry of parallelism: classical, probabilistic, and quantum effects. In G. Castagna and A. D. Gordon, editors, Proceedings of the 44th ACM SIGPLAN Symposium on Principles of Programming Languages, POPL 2017, Paris, France, January 18-20, 2017 , pages 833–845. ACM, 2017.
7[7] U. Dal Lago, A. Masini, and M. Zorzi. Confluence results for a quantum lambda calculus with measurements. Electr. Notes Theor. Comput. Sci. , 270(2):251–261, 2011.
8[8] U. Dal Lago and M. Zorzi. Probabilistic operational semantics for the lambda calculus. Co RR , abs/1104.0195, 2011.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Lambda Calculus and Probabilistic Computation

Abstract

I Introduction

Example 1** (Confluence).**

Content and Contributions

Related Work

II Background and Motivational observations

II-A Classical vs. Probabilistic Programs

Probabilistic vs. Quantitative

II-B Confluence of the calculus is relevant to programming

II-C The result of probabilistic computation

III Technical Preliminaries

III-A Basics on Discrete Probability

Example 2** (Die).**

III-B Subdistributions and DST(Ω)\boldsymbol{\mathtt{DST}(\Omega)}DST(Ω)

III-C Multidistributions

Example 3** (Distribution vs. multidistribution).**

III-D Binary relations (notations and basic definitions)

Confluence and Commutation

IV Call-by-Value calculus Λ⊕cbv\Lambda_{\oplus}^{\mathtt{cbv}}Λ⊕cbv​

IV-A Syntax of Λ⊕cbv\Lambda_{\oplus}^{\mathtt{cbv}}Λ⊕cbv​

IV-A1 The language

IV-A2 Reductions

Reduction Rules and Steps

Lifting

Reduction sequences

βv\beta_{v}βv​ equivalence

Normal Forms

IV-A3 Full Lifting

IV-B Λ⊕cbv\Lambda_{\oplus}^{\mathtt{cbv}}Λ⊕cbv​* and the λ\lambdaλ-calculus*

Proposition 4** (Simulation).**

IV-C Discussion (Surface Contexts)

V Confluence and Standardization

V-A Confluence

Lemma** (Hindley-Rosen).**

Lemma 5** (Pointwise Criterion).**

Proof.

Lemma 6**.**

Proof.

Lemma 7**.**

Proof.

Lemma 8**.**

Proof.

Theorem 9**.**

Proof.

Fact**.**

V-A1 Discussion

V-B A Standardization Property

V-B1 Surface and Left Reduction

Example 10**.**

Lemma 11**.**

V-B2 Finitary Surface Standardization

Theorem 12** (Finitary Surface Standardization).**

Proof.

Finitary Left Standardization does not hold

Example 13** (Counter-example).**

VI Asymptotic Evaluation

VI-A Probabilistic Evaluation

VI-A1 To be valuable

Fact 14**.**

Example 15**.**

VI-A2 Result of a CbV computation

Example 16**.**

VI-A3 Observations and Limit Distribution

Definition 17**.**

Definition 18**.**

Sets of Observations for Λ⊕cbv\Lambda_{\oplus}^{\mathtt{cbv}}Λ⊕cbv​

Proposition 19**.**

Proof.

Example 20**.**

Discussion

Sets of observations for Surface Reduction

VI-B Uniqueness and Adequacy of the Evaluation

Lemma 21** (Main Lemma).**

Example 1 (Confluence).

Example 2 (Die).

III-B Subdistributions and $\boldsymbol{\mathtt{DST}(\Omega)}$

Example 3 (Distribution vs. multidistribution).

IV Call-by-Value calculus $\Lambda_{\oplus}^{\mathtt{cbv}}$

IV-A Syntax of $\Lambda_{\oplus}^{\mathtt{cbv}}$

$\beta_{v}$ equivalence

IV-B $\Lambda_{\oplus}^{\mathtt{cbv}}$ * and the $\lambda$ -calculus*

Proposition 4 (Simulation).

Lemma (Hindley-Rosen).

Lemma 5 (Pointwise Criterion).

Lemma 6.

Lemma 7.

Lemma 8.

Theorem 9.

Fact.

Example 10.

Lemma 11.

Theorem 12 (Finitary Surface Standardization).

Example 13 (Counter-example).

Fact 14.

Example 15.

Example 16.

Definition 17.

Definition 18.

Sets of Observations for $\Lambda_{\oplus}^{\mathtt{cbv}}$

Proposition 19.

Example 20.

Lemma 21 (Main Lemma).

Theorem 22 (Greatest Limit Distribution).

Theorem 23 (Adequacy of evaluation).

Fact 24.

Lemma 25.

Theorem 26 (Asymptotic Completeness of Surface Reduction).

Remark 27.

Lemma 28.

Proposition 29 (Left Evaluation).

Theorem 30.

IX-A Syntax of $\Lambda_{\oplus}^{!}$

Remark 31.

IX-B $\Lambda_{\oplus}^{!}$ * is a conservative extension of $\Lambda^{!}$ *

Definition 32 (Translation).

Proposition 33 (Simulation).

IX-C Confluence and Finitary Standardization for $\Lambda_{\oplus}^{!}$

Theorem (Simpson 05).

Lemma 34.

Theorem 35.

Proposition 36 (Finitary Surface Standardization).

Proposition 37.

Theorem 38.

Fact 39.

Theorem 40 (Asymptotic Completeness).

Theorem 41 (Surface Evaluation).

X Call-by-Name calculus $\Lambda_{\oplus}^{{\mathtt{cbn}}}$

X-A Syntax of $\Lambda_{\oplus}^{\mathtt{cbn}}$

Remark.

X-A4 $\Lambda_{\oplus}^{{\mathtt{cbn}}}$ to $\Lambda_{\oplus}^{!}$ .

Proposition 42 (Simulation).

X-B Confluence and Finitary Standardization for $\Lambda_{\oplus}^{\mathtt{cbn}}$

Theorem 43 (Confluence).

Theorem 44 (Finitary Surface standardization).

Example 45.

Proposition 46.

Theorem 47.

Theorem 48.