Intersection Types and Counting

Pawe{\l} Parys

arXiv:1701.05303·cs.LO·January 20, 2017

Intersection Types and Counting

Pawe{\l} Parys

PDF

Open Access

TL;DR

This paper introduces a novel type system approach to analyze the finiteness of trees generated by nondeterministic higher-order recursion schemes, linking tree properties to derivation properties for decision-making.

Contribution

It presents a new type system that characterizes the finiteness of generated trees through derivation size, enabling decidability of the finiteness problem for HORSes.

Findings

01

Type system accurately characterizes tree finiteness.

02

Decidability of the finiteness problem for nondeterministic HORS.

03

Provides a method to relate tree properties to derivation properties.

Abstract

We present a new approach to the following meta-problem: given a quantitative property of trees, design a type system such that the desired property for the tree generated by an infinitary ground $λ$ -term corresponds to some property of a derivation of a type for this $λ$ -term, in this type system. Our approach is presented in the particular case of the language finiteness problem for nondeterministic higher-order recursion schemes (HORSes): given a nondeterministic HORS, decide whether the set of all finite trees generated by this HORS is finite. We give a type system such that the HORS can generate a tree of an arbitrarily large finite size if and only if in the type system we can obtain derivations that are arbitrarily large, in an appropriate sense; the latter condition can be easily decided.

Equations54

T^{α \to β} = P (F_{ord (α \to β)}^{α}) \times T^{β}, T^{o} = o,

T^{α \to β} = P (F_{ord (α \to β)}^{α}) \times T^{β}, T^{o} = o,

F_{k}^{α} = {(k, F, M, τ) ∣ F, M \subseteq {0, \dots, k - 1}, F \cap M = \emptyset, τ \in T^{α}}, F^{α} = k \in N ⋃ F_{k}^{α} .

\frac{\vbox Γ ⊢ P _{i} : τ ^ ▹ c i \in { 1 , 2 } \vbox}{\vbox \vbox Γ ⊢ br P _{1} P _{2} : τ ^ ▹ c}

\frac{\vbox Γ ⊢ P _{i} : τ ^ ▹ c i \in { 1 , 2 } \vbox}{\vbox \vbox Γ ⊢ br P _{1} P _{2} : τ ^ ▹ c}

\frac{\vbox Γ ^{'} [ x \mapsto T ] ⊢ P : ( m , F , M , τ ) ▹ c Split ( Γ ∣ Γ ^{'} ) Γ ^{'} ( x ) = \emptyset \vbox}{\vbox \vbox Γ ⊢ λ x . P : ( m , F , M ∖ ⋃ _{(k, F^{'}, M^{'}, σ) \in T} M ^{'} , T \to τ ) ▹ c}

\frac{\vbox Γ ^{'} [ x \mapsto T ] ⊢ P : ( m , F , M , τ ) ▹ c Split ( Γ ∣ Γ ^{'} ) Γ ^{'} ( x ) = \emptyset \vbox}{\vbox \vbox Γ ⊢ λ x . P : ( m , F , M ∖ ⋃ _{(k, F^{'}, M^{'}, σ) \in T} M ^{'} , T \to τ ) ▹ c}

\vbox \vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ x : ( 2 , \emptyset , { 0 } , o ) ▹ 0

\vbox \vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ x : ( 2 , \emptyset , { 0 } , o ) ▹ 0

\frac{\vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ a x : ( 2 , { 1 } , { 0 } , o ) ▹ 0 \vbox}{\vbox \vbox ε ⊢ λ x . a x : ( 2 , { 1 } , \emptyset , { ρ ^ _{1} } \to o ) ▹ 0}

\frac{\vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ a x : ( 2 , { 1 } , { 0 } , o ) ▹ 0 \vbox}{\vbox \vbox ε ⊢ λ x . a x : ( 2 , { 1 } , \emptyset , { ρ ^ _{1} } \to o ) ▹ 0}

F = {n \in {0, \dots, m - 1} ∣ f_{n} > 0 \land n \neq \in M},

F = {n \in {0, \dots, m - 1} ∣ f_{n} > 0 \land n \neq \in M},

f_{n} = f_{n}^{'} + i \in I \sum ∣ F_{i} \cap {n} ∣,

\frac{\vbox Γ _{i} ⊢ P _{i} : ( m , F _{i} , M _{i} , o ) ▹ c _{i} \mbox f or e a c h i \in { 1 , \dots , r } M = M ^{'} ⊎ M _{1} ⊎ \dots ⊎ M _{r} \vbox ( m = 0 ) \Rightarrow ( F ^{'} = \emptyset \land c ^{'} = 1 ) ( m > 0 ) \Rightarrow ( F ^{'} = { 0 } \land c ^{'} = 0 ) ( r > 0 ) \Rightarrow ( M ^{'} = \emptyset ) \vbox a \neq = br Split ( Γ ∣ Γ _{1} , \dots , Γ _{r} ) Comp _{m} ( M ; ( F ^{'} , c ^{'} ) , ( F _{1} , c _{1} ) , \dots , ( F _{r} , c _{r} )) = ( F , c ) \vbox}{\vbox \vbox Γ ⊢ a P _{1} \dots P _{r} : ( m , F , M , o ) ▹ c}

\frac{\vbox Γ _{i} ⊢ P _{i} : ( m , F _{i} , M _{i} , o ) ▹ c _{i} \mbox f or e a c h i \in { 1 , \dots , r } M = M ^{'} ⊎ M _{1} ⊎ \dots ⊎ M _{r} \vbox ( m = 0 ) \Rightarrow ( F ^{'} = \emptyset \land c ^{'} = 1 ) ( m > 0 ) \Rightarrow ( F ^{'} = { 0 } \land c ^{'} = 0 ) ( r > 0 ) \Rightarrow ( M ^{'} = \emptyset ) \vbox a \neq = br Split ( Γ ∣ Γ _{1} , \dots , Γ _{r} ) Comp _{m} ( M ; ( F ^{'} , c ^{'} ) , ( F _{1} , c _{1} ) , \dots , ( F _{r} , c _{r} )) = ( F , c ) \vbox}{\vbox \vbox Γ ⊢ a P _{1} \dots P _{r} : ( m , F , M , o ) ▹ c}

\frac{\vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ x : ( 2 , \emptyset , { 0 } , o ) ▹ 0 \vbox}{\vbox \vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ a x : ( 2 , { 1 } , { 0 } , o ) ▹ 0}

\frac{\vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ x : ( 2 , \emptyset , { 0 } , o ) ▹ 0 \vbox}{\vbox \vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ a x : ( 2 , { 1 } , { 0 } , o ) ▹ 0}

\frac{\vbox Γ ^{'} ⊢ P : ( m , F ^{'} , M ^{'} , {( ord ( P ) , F _{i} ↾ _{< ord (P)} , M _{i} ↾ _{< ord (P)} , τ _{i} ) ∣ i \in I } \to τ ) ▹ c ^{'} \vbox Γ _{i} ⊢ Q : ( m , F _{i} , M _{i} , τ _{i} ) ▹ c _{i} \mbox f or e a c h i \in I M = M ^{'} ⊎ ⨄ _{i \in I} M _{i} \vbox ord ( P ) \leq m Split ( Γ ∣ Γ ^{'} , ( Γ _{i} ) _{i \in I} ) Comp _{m} ( M ; ( F ^{'} , c ^{'} ) , (( F _{i} ↾ _{\geq ord (P)} , c _{i} ) ) _{i \in I} ) = ( F , c ) \vbox}{\vbox \vbox Γ ⊢ P Q : ( m , F , M , τ ) ▹ c}

\frac{\vbox Γ ^{'} ⊢ P : ( m , F ^{'} , M ^{'} , {( ord ( P ) , F _{i} ↾ _{< ord (P)} , M _{i} ↾ _{< ord (P)} , τ _{i} ) ∣ i \in I } \to τ ) ▹ c ^{'} \vbox Γ _{i} ⊢ Q : ( m , F _{i} , M _{i} , τ _{i} ) ▹ c _{i} \mbox f or e a c h i \in I M = M ^{'} ⊎ ⨄ _{i \in I} M _{i} \vbox ord ( P ) \leq m Split ( Γ ∣ Γ ^{'} , ( Γ _{i} ) _{i \in I} ) Comp _{m} ( M ; ( F ^{'} , c ^{'} ) , (( F _{i} ↾ _{\geq ord (P)} , c _{i} ) ) _{i \in I} ) = ( F , c ) \vbox}{\vbox \vbox Γ ⊢ P Q : ( m , F , M , τ ) ▹ c}

\overset{τ}{^}_{f} = (2, {1}, \emptyset, {\overset{ρ}{^}_{1}} \to o),

\overset{τ}{^}_{f} = (2, {1}, \emptyset, {\overset{ρ}{^}_{1}} \to o),

\frac{\vbox \raise 11.0 pt \vbox \vbox ε [ f \mapsto { τ ^ _{m} }] ⊢ f : τ ^ _{m} ▹ 0 (Var) \raise 11.0 pt \vbox \vbox ε ⊢ e : ( 2 , { 1 } , { 0 } , o ) ▹ 0 to0.0pt (Con) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ f e : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ 1}

\frac{\vbox \raise 11.0 pt \vbox \vbox ε [ f \mapsto { τ ^ _{m} }] ⊢ f : τ ^ _{m} ▹ 0 (Var) \raise 11.0 pt \vbox \vbox ε ⊢ e : ( 2 , { 1 } , { 0 } , o ) ▹ 0 to0.0pt (Con) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ f e : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ 1}

\frac{\vbox \raise 11.0 pt \frac{\vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ f e : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ 1 \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ br ( f e ) ( R ( λ x . f ( f x ))) : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ 1} to0.0pt (Br) \hss \vbox}{\vbox \vbox ε ⊢ R : σ ^ _{R} ▹ 1}

\frac{\vbox \raise 11.0 pt \frac{\vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ f e : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ 1 \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ br ( f e ) ( R ( λ x . f ( f x ))) : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ 1} to0.0pt (Br) \hss \vbox}{\vbox \vbox ε ⊢ R : σ ^ _{R} ▹ 1}

\frac{\vbox \raise 11.0 pt \frac{\vbox \raise 11.0 pt \vbox \vbox ε [ f \mapsto { τ ^ _{f} }] ⊢ f : τ ^ _{f} ▹ 0 \raise 11.0 pt \frac{\vbox \raise 11.0 pt \vbox \vbox ε [ f \mapsto { }] ⊢ f : ▹ 0 \raise 11.0 pt \vbox \vbox ε [ x \mapsto { }] ⊢ x : ( 2 , \emptyset , { 0 } , o ) ▹ 0 \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} } , x \mapsto { ρ ^ _{1} }] ⊢ f x : ( 2 , { 1 } , { 0 } , o ) ▹ 0} to0.0pt ( @ ) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} } , x \mapsto { ρ ^ _{1} }] ⊢ f ( f x ) : ( 2 , { 1 } , { 0 } , o ) ▹ 0} to0.0pt ( @ ) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} }] ⊢ λ x . f ( f x ) : τ ^ _{f} ▹ 0}

\frac{\vbox \raise 11.0 pt \frac{\vbox \raise 11.0 pt \vbox \vbox ε [ f \mapsto { τ ^ _{f} }] ⊢ f : τ ^ _{f} ▹ 0 \raise 11.0 pt \frac{\vbox \raise 11.0 pt \vbox \vbox ε [ f \mapsto { }] ⊢ f : ▹ 0 \raise 11.0 pt \vbox \vbox ε [ x \mapsto { }] ⊢ x : ( 2 , \emptyset , { 0 } , o ) ▹ 0 \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} } , x \mapsto { ρ ^ _{1} }] ⊢ f x : ( 2 , { 1 } , { 0 } , o ) ▹ 0} to0.0pt ( @ ) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} } , x \mapsto { ρ ^ _{1} }] ⊢ f ( f x ) : ( 2 , { 1 } , { 0 } , o ) ▹ 0} to0.0pt ( @ ) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} }] ⊢ λ x . f ( f x ) : τ ^ _{f} ▹ 0}

\frac{\vbox \raise 11.0 pt \frac{\vbox \raise 11.0 pt \vbox \vbox ε [ f \mapsto { τ ^ _{f} }] ⊢ f : τ ^ _{f} ▹ 0 \raise 11.0 pt \frac{\vbox \raise 11.0 pt \vbox \vbox ε [ f \mapsto { }] ⊢ f : ▹ 0 \raise 11.0 pt \vbox \vbox ε [ x \mapsto { }] ⊢ x : ( 2 , \emptyset , { 0 } , o ) ▹ 0 \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{m} } , x \mapsto { ρ ^ _{1} }] ⊢ f x : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ 0} to0.0pt ( @ ) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} } , x \mapsto { ρ ^ _{1} }] ⊢ f ( f x ) : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ 1} to0.0pt ( @ ) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ λ x . f ( f x ) : τ ^ _{m} ▹ 1}

\frac{\vbox \raise 11.0 pt \frac{\vbox \raise 11.0 pt \vbox \vbox ε [ f \mapsto { τ ^ _{f} }] ⊢ f : τ ^ _{f} ▹ 0 \raise 11.0 pt \frac{\vbox \raise 11.0 pt \vbox \vbox ε [ f \mapsto { }] ⊢ f : ▹ 0 \raise 11.0 pt \vbox \vbox ε [ x \mapsto { }] ⊢ x : ( 2 , \emptyset , { 0 } , o ) ▹ 0 \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{m} } , x \mapsto { ρ ^ _{1} }] ⊢ f x : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ 0} to0.0pt ( @ ) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} } , x \mapsto { ρ ^ _{1} }] ⊢ f ( f x ) : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ 1} to0.0pt ( @ ) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ λ x . f ( f x ) : τ ^ _{m} ▹ 1}

\frac{\vbox \raise 11.0 pt \frac{\vbox \raise 11.0 pt \frac{\vbox ε ⊢ R : σ ^ _{R} ▹ c ε [ f \mapsto { τ ^ _{f} }] ⊢ λ x . f ( f x ) : τ ^ _{f} ▹ 0 ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ λ x . f ( f x ) : τ ^ _{m} ▹ 1 \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ R ( λ x . f ( f x )) : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ c + 1} to0.0pt ( @ ) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ br ( f e ) ( R ( λ x . f ( f x ))) : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ c + 1} to0.0pt (Br) \hss \vbox}{\vbox \vbox ε ⊢ R : σ ^ _{R} ▹ c + 1}

\frac{\vbox \raise 11.0 pt \frac{\vbox \raise 11.0 pt \frac{\vbox ε ⊢ R : σ ^ _{R} ▹ c ε [ f \mapsto { τ ^ _{f} }] ⊢ λ x . f ( f x ) : τ ^ _{f} ▹ 0 ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ λ x . f ( f x ) : τ ^ _{m} ▹ 1 \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ R ( λ x . f ( f x )) : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ c + 1} to0.0pt ( @ ) \hss \vbox}{\vbox \vbox ε [ f \mapsto { τ ^ _{f} , τ ^ _{m} }] ⊢ br ( f e ) ( R ( λ x . f ( f x ))) : ( 2 , \emptyset , { 0 , 1 } , o ) ▹ c + 1} to0.0pt (Br) \hss \vbox}{\vbox \vbox ε ⊢ R : σ ^ _{R} ▹ c + 1}

\frac{\vbox ε ⊢ R : σ ^ _{R} ▹ c ε ⊢ λ x . a x : τ ^ _{f} ▹ 0 ε ⊢ λ x . a x : τ ^ _{m} ▹ 1 \vbox}{\vbox \vbox ε ⊢ P _{1} : ρ ^ _{2} ▹ c + 1}

\frac{\vbox ε ⊢ R : σ ^ _{R} ▹ c ε ⊢ λ x . a x : τ ^ _{f} ▹ 0 ε ⊢ λ x . a x : τ ^ _{m} ▹ 1 \vbox}{\vbox \vbox ε ⊢ P _{1} : ρ ^ _{2} ▹ c + 1}

\overset{τ}{^}_{0}^{'} = (2, {0}, \emptyset, {(1, {0}, \emptyset, o)} \to o),

\overset{τ}{^}_{0}^{'} = (2, {0}, \emptyset, {(1, {0}, \emptyset, o)} \to o),

\overset{τ}{^}_{f}^{'} = (2, {1}, \emptyset, {(1, {0}, \emptyset, o), \overset{ρ}{^}_{1}} \to o), \mbox an d

\overset{τ}{^}_{m}^{'} = (2, \emptyset, {1}, {(1, {0}, \emptyset, o), \overset{ρ}{^}_{1}} \to o) .

\frac{\vbox \raise 11.0 pt \vbox \vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ x : ( 2 , \emptyset , { 0 } , o ) ▹ 0 to0.0pt (Var) \hss \vbox}{\vbox \vbox ε ⊢ λ x . x : τ ^ _{f}^{''} ▹ 0}

\frac{\vbox \raise 11.0 pt \vbox \vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ x : ( 2 , \emptyset , { 0 } , o ) ▹ 0 to0.0pt (Var) \hss \vbox}{\vbox \vbox ε ⊢ λ x . x : τ ^ _{f}^{''} ▹ 0}

\frac{\vbox \raise 11.0 pt \frac{\vbox \raise 10.5 pt \frac{\vbox \raise 11.0 pt \frac{\vbox \raise 11.0 pt to0.0pt (Var) \hss \vbox}{\vbox \vbox ε [ x \mapsto { }] ⊢ a x : ( 2 , { 1 } , { 0 } , o ) ▹ 0} to0.0pt (Con) \hss \vbox}{\vbox \vbox ⋮} to0.0pt (Con) \hss \vbox}{\vbox \vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ a ( a ( \dots ( a x ) \dots )) : ( 2 , { 1 } , { 0 } , o ) ▹ 0} to0.0pt (Con) \hss \vbox}{\vbox \vbox ε ⊢ λ x . a ( a ( \dots ( a x ) \dots )) : τ ^ _{f} ▹ 0}

\frac{\vbox \raise 11.0 pt \frac{\vbox \raise 10.5 pt \frac{\vbox \raise 11.0 pt \frac{\vbox \raise 11.0 pt to0.0pt (Var) \hss \vbox}{\vbox \vbox ε [ x \mapsto { }] ⊢ a x : ( 2 , { 1 } , { 0 } , o ) ▹ 0} to0.0pt (Con) \hss \vbox}{\vbox \vbox ⋮} to0.0pt (Con) \hss \vbox}{\vbox \vbox ε [ x \mapsto { ρ ^ _{1} }] ⊢ a ( a ( \dots ( a x ) \dots )) : ( 2 , { 1 } , { 0 } , o ) ▹ 0} to0.0pt (Con) \hss \vbox}{\vbox \vbox ε ⊢ λ x . a ( a ( \dots ( a x ) \dots )) : τ ^ _{f} ▹ 0}

\frac{\vbox \raise 11.0 pt \frac{\vbox ε [ g \mapsto { τ ^ _{f} }] ⊢ P _{3} : ρ ^ _{2} ▹ 1 \vbox}{\vbox \vbox ε ⊢ λ g . P _{3} : ( 2 , \emptyset , { 0 , 1 } , { τ ^ _{f} } \to o ) ▹ 1} ( λ ) ε ⊢ λ x . a ( a ( \dots ( a x ) \dots )) : τ ^ _{f} ▹ 0 \vbox}{\vbox \vbox ε ⊢ P _{4} : ρ ^ _{2} ▹ 1}

\frac{\vbox \raise 11.0 pt \frac{\vbox ε [ g \mapsto { τ ^ _{f} }] ⊢ P _{3} : ρ ^ _{2} ▹ 1 \vbox}{\vbox \vbox ε ⊢ λ g . P _{3} : ( 2 , \emptyset , { 0 , 1 } , { τ ^ _{f} } \to o ) ▹ 1} ( λ ) ε ⊢ λ x . a ( a ( \dots ( a x ) \dots )) : τ ^ _{f} ▹ 0 \vbox}{\vbox \vbox ε ⊢ P _{4} : ρ ^ _{2} ▹ 1}

d

d

= f_{m, m}^{'} + 0 + ∣ {i \in I ∖ {s} ∣ c_{i} > 0} ∣ + d_{s} .

c = f_{m, m}^{'} + i \in I \sum c_{i} .

c = f_{m, m}^{'} + i \in I \sum c_{i} .

d = k - 1 + d_{s} \geq k - 1 + lo g_{2} c_{s} \geq lo g_{2} (k \cdot c_{s}) \geq lo g_{2} c .

d = k - 1 + d_{s} \geq k - 1 + lo g_{2} c_{s} \geq lo g_{2} (k \cdot c_{s}) \geq lo g_{2} c .

c^{'}

c^{'}

= f_{m - 1, m} + i \in I \sum c_{i} \geq f_{m, m}^{'} + i \in I \sum c_{i} + ∣ F \cap {m - 1} ∣ = c + ∣ F \cap {m - 1} ∣ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLogic, programming, and type systems · semigroups and automata theory · Algorithms and Data Compression

Full text

Intersection Types and Counting††thanks: Work supported by the National Science Center (decision DEC-2012/07/D/ST6/02443).

Paweł Parys University of Warsaw, Poland [email protected]

Abstract

We present a new approach to the following meta-problem: given a quantitative property of trees, design a type system such that the desired property for the tree generated by an infinitary ground $\lambda$ -term corresponds to some property of a derivation of a type for this $\lambda$ -term, in this type system.

Our approach is presented in the particular case of the language finiteness problem for nondeterministic higher-order recursion schemes (HORSes): given a nondeterministic HORS, decide whether the set of all finite trees generated by this HORS is finite. We give a type system such that the HORS can generate a tree of an arbitrarily large finite size if and only if in the type system we can obtain derivations that are arbitrarily large, in an appropriate sense; the latter condition can be easily decided.

1 Introduction

In this paper we consider $\lambda Y$ -calculus, which is an extension of the simply typed $\lambda$ -calculus by a fixed-point operator $Y$ . A term $P$ of $\lambda Y$ -calculus that is of sort111We use the word “sort” instead of the usual “type” to avoid confusion with intersection types introduced in this paper. $o$ can be used to generate an infinite tree $\mathit{BT}(P)$ , called the Böhm tree of $P$ . Trees generated by terms of $\lambda Y$ -calculus can be used to faithfully represent the control flow of programs in languages with higher-order functions. Traditionally, Higher Order Recursive Schemes (HORSes) are used for this purpose [9, 13, 18, 17]; this formalism is equivalent to $\lambda Y$ -calculus, and the translation between them is rather straightforward [22]. Collapsible Pushdown Systems [11] and Ordered Tree-Pushdown Systems [8] are other equivalent formalisms.

Intersection type systems were intensively used in the context of HORSes, for several purposes like model-checking [14, 17, 6, 21], pumping [15], transformations of HORSes [16, 7], etc. Interestingly, constructions very similar to intersection types were used also on the side of collapsible pushdown systems; they were alternating stack automata [5], and types of stacks [20, 12].

In this paper we show how intersection types can be used for deciding quantitative properties of trees generated by $\lambda Y$ -terms. We concentrate on the language finiteness problem for nondeterministic HORSes: given a nondeterministic HORS, decide whether the set of all finite trees generated by this HORS is finite.

This problem can be restated in the world of $\lambda Y$ -terms (or standard, deterministic HORSes), generating a single infinite tree. Here, instead of resolving nondeterministic choices during the generation process, we leave them in the resulting tree. Those nondeterministic choices are denoted by a distinguished $\mathsf{br}$ (“branch”) symbol, below which we put options that could be chosen. Then to obtain a finite tree generated by the original HORS we just need to recursively choose in every $\mathsf{br}$ -labeled node which of the two subtrees we want to consider. Thus, in this setting, the language finiteness problem asks whether the set of all finite trees obtained this way is finite.

The difficulty of this problem lies in the fact that sometimes the same finite tree may be found in infinitely many different places of $\mathit{BT}(P)$ (i.e., generated by a nondeterministic HORS in many ways); thus the actual property to decide is whether there is a common bound on the size of each of these trees. This makes the problem inaccessible for standard methods used for analyzing HORSes, as they usually concern only regular properties of the Böhm tree, while boundedness is a problem of different kind. The same difficulty was observed in [15], where they prove a pumping lemma for deterministic HORSes, while admitting (Remark 2.2) that their method is too weak to reason about nondeterministic HORSes.

In order to solve the language finiteness problem, we present an appropriate intersection type system, where derivations are annotated by flags and markers of multiple kinds. The key property of this type system is that the number of flags in a type derivation for a $\lambda Y$ -term $P$ approximates the size of some finite tree obtained by resolving nondeterministic choices in the infinite tree $\mathit{BT}(P)$ . In consequence, there are type derivations using arbitrarily many flags if, and only if, the answer to the language finiteness problem is “no”.

The language finiteness problem was first attacked in [2] (for safe HORSes only), but their algorithm turned out to be incorrect [3]. To our knowledge, the only known solution of this problem follows from a recent decidability result for the diagonal problem [10, 7]. This problem asks, given a nondeterministic HORS and a set of letters $\Sigma$ , whether for every $n\in\mathbb{N}$ the HORS generates a finite tree in which every letter from $\Sigma$ appears at least $n$ times. Clearly, a nondeterministic HORS generates arbitrarily large trees exactly when for some letter $a$ it generates trees having arbitrarily many $a$ letters, i.e., when the answer to the diagonal problem for $\Sigma=\{a\}$ is “yes”.

Our type system is, to some extent, motivated by the algorithm of [7] solving the diagonal problem. This algorithm works by repeating two kinds of transformations of HORSes. The first of them turns the HORS into a HORS generating trees having only a fixed number of branches, one per each letter from $\Sigma$ (i.e., one branch in our case of $|\Sigma|=1$ ). The branches are chosen nondeterministically out of some tree generated by the original HORS; for every $a\in\Sigma$ there is a choice witnessing that $a$ appeared many times in the original tree. Then such a HORS of the special form is turned into a HORS that is of order lower by one, and generates trees having the same nodes as trees generated by the original HORS, but arranged differently (in particular, the new trees may have again arbitrarily many branches). After finitely many repetitions of this procedure, a HORS of order [math] is obtained, and the diagonal problem becomes easily decidable. In some sense we want to do the same, but instead of applying all these transformations one by one, we simulate all of them simultaneously in a single type derivation. In this derivation, for each order $n$ , we allow to place arbitrarily one marker “of order $n$ ”; this corresponds to the nondeterministic choice of one branch in the $n$ -th step of the previous algorithm. We also place some flags “of order $n$ ”, in places that correspond to nodes remaining after the $n$ -th step of the previous algorithm.

The idea of using intersection types for counting is not completely new. Paper [19] presents a type system that, essentially, allows to estimate the size of the $\beta$ -normal form of a $\lambda$ -term just by looking at (the number of some flags in) a derivation of a type for this term. A similar idea, but for higher-order pushdown automata, is present in [20], where we can estimate the number of $\sharp$ symbols appearing on a particular, deterministically chosen branch of the generated tree. This previous approach also uses intersection types, where the derivations are marked with just one kind of flags, denoting “productive” places of a $\lambda$ -term (oppositely to our approach, where we have different flags for different orders, and we also have markers). The trouble with the “one-flag” approach is that it works well only in a completely deterministic setting, where looking independently at each node of the Böhm tree we know how it contributes to the result; the method stops working (or at least we do not know how to prove that it works) in our situation, where we first nondeterministically perform some guesses in the Böhm tree, and only after that we want to count something that depends on the chosen values.

Acknowledgements.

I would like to thank Szymon Toruńczyk for stimulating discussions, and anonymous reviewers for useful comments.

2 Preliminaries

Trees.

Let $\Sigma$ be a ranked alphabet, i.e., a set of symbols together with a rank function assigning a nonnegative integer to each of the symbols. We assume that $\Sigma$ contains a distinguished symbol $\mathsf{br}$ of rank $2$ , used to denote nondeterministic choices. A $\Sigma$ -labeled tree is a tree that is rooted (there is a distinguished root node), node-labeled (every node has a label from $\Sigma$ ), ranked (a node with label of rank $n$ has exactly $n$ children), and ordered (children of a node of rank $n$ are numbered from $1$ to $n$ ).

When $t$ is a $\Sigma$ -labeled tree $t$ , by $\mathcal{L}(t)$ we denote the set of all finite trees that can be obtaining by choosing in every $\mathsf{br}$ -labeled node of $t$ which of the two subtrees we want to consider. More formally, we consider the following relation $\to_{\mathsf{br}}$ : we have $t\to_{\mathsf{br}}u$ if $u$ can be obtained from $t$ by choosing in $t$ a $\mathsf{br}$ -labeled node $x$ and its child $y$ , and replacing the subtree starting in $x$ by the subtree starting in $y$ (which removes $x$ and the other subtree of $x$ ). Let $\to_{\mathsf{br}}^{*}$ be the reflexive transitive closure of $\to_{\mathsf{br}}$ . Then $\mathcal{L}(t)$ contains all trees $u$ that do not use the $\mathsf{br}$ label, are finite, and such that $t\to_{\mathsf{br}}^{*}u$ .

Infinitary $\lambda$ -calculus.

The set of sorts (a.k.a. simple types), constructed from a unique basic sort $o$ using a binary operation ${\to}$ , is defined as usual. The order of a sort is defined by: $\mathit{ord}(o)=0$ , and $\mathit{ord}(\alpha{\to}\beta)=\max(1+\mathit{ord}(\alpha),\mathit{ord}(\beta))$ .

We consider infinitary, sorted $\lambda$ -calculus. Infinitary $\lambda$ -terms (or just $\lambda$ -terms) are defined by coinduction, according to the following rules:

•

if $a\in\Sigma$ is a symbol of rank $r$ , and $P_{1}^{o},\dots,P_{r}^{o}$ are $\lambda$ -terms, then $(a\,P_{1}^{o}\,\dots\,P_{r}^{o})^{o}$ is a $\lambda$ -term,

•

for every sort $\alpha$ there are infinitely many variables $x^{\alpha},y^{\alpha},z^{\alpha},\dots$ ; each of them is a $\lambda$ -term,

•

if $P^{\alpha{\to}\beta}$ and $Q^{\alpha}$ are $\lambda$ -terms, then $(P^{\alpha{\to}\beta}\,Q^{\alpha})^{\beta}$ is a $\lambda$ -term, and

•

if $P^{\beta}$ is a $\lambda$ -term and $x^{\alpha}$ is a variable, then $(\lambda x^{\alpha}.P^{\beta})^{\alpha{\to}\beta}$ is a $\lambda$ -term.

We naturally identify $\lambda$ -terms differing only in names of bound variables. We often omit the sort annotations of $\lambda$ -terms, but we keep in mind that every $\lambda$ -term (and every variable) has a particular sort. A $\lambda$ -term $P$ is closed if it has no free variables. Notice that, for technical convenience, a symbol of positive rank is not a $\lambda$ -term itself, but always comes with arguments. This is not a restriction, since e.g. instead of a unary symbol $a$ one may use the term $\lambda x.a\,x$ .

The order of a $\lambda$ -term is just the order of its sort. The complexity of a $\lambda$ -term $P$ is the smallest number $m$ such that the order of every subterm of $P$ is at most $m$ . We restrict ourselves to $\lambda$ -terms that have finite complexity.

A $\beta$ -reduction is defined as usual. We say that a $\beta$ -reduction $P\to_{\beta}Q$ is of order $n$ if it concerns a redex $(\lambda x.R)\,S$ such that $\mathit{ord}(\lambda x.R)=n$ . In this situation the order of $x$ is at most $n-1$ , but may be smaller (when other arguments of $R$ are of order $n-1$ ).

Böhm Trees.

We consider Böhm trees only for closed $\lambda$ -terms of sort $o$ . For such a term $P$ , its Böhm tree $\mathit{BT}(P)$ is constructed by coinduction, as follows: if there is a sequence of $\beta$ -reductions from $P$ to a $\lambda$ -term of the form $a\,P_{1}\,\ldots\,P_{r}$ (where $a$ is a symbol), then the root of the tree $t$ has label $a$ and $r$ children, and the subtree starting in the $i$ -th child is $\mathit{BT}(P_{i})$ . If there is no sequence of $\beta$ -reductions from $P$ to a $\lambda$ -term of the above form, then $\mathit{BT}(P)$ is the full binary tree with all nodes labeled by $\mathsf{br}$ .222Usually one uses a special label $\bot$ of rank [math] for this purpose, but from the perspective of our problem both definitions are equivalent. By $\mathcal{L}(P)$ we denote $\mathcal{L}(\mathit{BT}(P))$ .

$\lambda Y$ -calculus.

The syntax of $\lambda Y$ -calculus is the same as that of finite $\lambda$ -calculus, extended by symbols $Y^{(\alpha{\to}\alpha){\to}\alpha}$ , for each sort $\alpha$ . A term of $\lambda Y$ -calculus is seen as a term of infinitary $\lambda$ -calculus if we replace each symbol $Y^{(\alpha{\to}\alpha){\to}\alpha}$ by the unique infinite $\lambda$ -term $Z$ such that $Z$ is syntactically the same as $\lambda x^{\alpha{\to}\alpha}.x\,(Z\,x)$ . In this way, we view $\lambda Y$ -calculus as a fragment of infinitary $\lambda$ -calculus.

It is standard to convert a nondeterministic HORS $\mathcal{G}$ into a closed $\lambda Y$ -term $P^{o}$ such that $\mathcal{L}(P)$ is exactly the set of all finite trees generated by $\mathcal{G}$ . The following theorem, which is our main result, states that the language finiteness problem is decidable.

Theorem 1.

Given a closed $\lambda Y$ -term $P$ of sort $o$ , one can decide whether $\mathcal{L}(P)$ is finite.

3 Intersection Type System

In this section we introduce a type system that allows to determine the desired property: whether in $\mathcal{L}(P)$ there is an arbitrarily large tree.

Intuitions.

The main novelty of our type system is in using flags and markers, which may label nodes of derivation trees. To every flag and marker we assign a number, called an order. While deriving a type for a $\lambda$ -term of complexity $m$ , we may place in every derivation tree at most one marker of each order $n\in\{0,\dots,m-1\}$ , and arbitrarily many flags of each order $n\in\{0,\dots,m\}$ .

Consider first a $\lambda$ -term $M_{0}$ of complexity [math]. Such a term actually equals its Böhm tree. Our aim is to describe some finite tree $t$ in $\mathcal{L}(M_{0})$ , i.e., obtained from $M_{0}$ by resolving nondeterministic choices in some way. We thus just put flags of order [math] in all those (appearances of) symbols in $M_{0}$ that contribute to this tree $t$ ; the type system ensures that indeed all symbols of some finite tree in $\mathcal{L}(M_{0})$ are labeled by a flag. Then clearly we have the desired property that there is a derivation with arbitrarily many flags if, and only if, there are arbitrarily large trees in $\mathcal{L}(M_{0})$ .

Next, consider a $\lambda$ -term $M_{1}$ that is of complexity $1$ , and reduces to $M_{0}$ . Of course every finite tree from $\mathcal{L}(M_{0})$ is composed of symbols appearing already in $M_{1}$ ; we can thus already in $M_{1}$ label (by order-[math] flags) all symbols that contribute to some tree $t\in\mathcal{L}(M_{0})$ (and an intersection type system can easily check correctness of such labeling). There is, however, one problem: a single appearance of a symbol in $M_{1}$ may result in many appearances in $M_{0}$ (since a function may use its argument many times). Due to this, the number of order-[math] flags in $M_{1}$ does not correspond to the size of $t$ . We rescue ourselves in the following way. In $t$ we choose one leaf, we label it by an order-[math] marker, and on the path leading from the root to this marker we place order- $1$ flags. On the one hand, $\mathcal{L}(M_{0})$ contains arbitrarily large trees if, and only if, it contains trees with arbitrarily long paths, i.e., trees with arbitrarily many order- $1$ flags. On the other hand, we can perform the whole labeling (and the type system can check its correctness) already in $M_{1}$ , and the number of order- $1$ flags in $M_{1}$ will be precisely the same as it would be in $M_{0}$ . Indeed, in $M_{1}$ we have only order- $1$ functions, i.e., functions that take trees and use them as subtrees of larger trees; although a tree coming as an argument may be duplicated, the order-[math] marker can be placed in at most one copy. This means that, while reducing $M_{1}$ to $M_{0}$ , every symbol of $M_{1}$ can result in at most one symbol of $M_{0}$ lying on the selected path to the order-[math] marker (beside of arbitrarily many symbols outside of this path).

This procedure can be repeated for $M_{2}$ of complexity $2$ that reduces to $M_{1}$ via $\beta$ -reductions of order $2$ (and so on for higher orders). We now place a marker of order $1$ in some leaf of $M_{1}$ ; afterwards, we place an order- $2$ flag in every node that is on the path to the marked leaf, and that has a child outside of this path whose some descendant is labeled by an order- $1$ flag. In effect, for some choice of a leaf to be marked, the number of order- $2$ flags approximates the number of order- $1$ flags, up to logarithm. Moreover, the whole labeling can be done in $M_{2}$ instead of in $M_{1}$ , without changing the number of order- $2$ flags.

In this intuitive description we have talked about labeling “nodes of a $\lambda$ -term”, but formally we label nodes of a derivation tree that derives a type for the term, in our type system. Every such node contains a type judgment for some subterm of the term.

Type Judgments.

For every sort $\alpha$ we define the set $\mathcal{T}^{\alpha}$ of types of sort $\alpha$ , and the set $\mathcal{F}^{\alpha}$ of full types of sort $\alpha$ . This is done as follows, where $\mathcal{P}$ denotes the powerset:

[TABLE]

Notice that the sets $\mathcal{T}^{\alpha}$ and $\mathcal{F}_{k}^{\alpha}$ are finite (unlike $\mathcal{F}^{\alpha}$ ). A type $(T,\tau)\in\mathcal{T}^{\alpha{\to}\beta}$ is denoted as $T{\to}\tau$ . A full type $\hat{\tau}=(k,F,M,\tau)\in\mathcal{F}_{k}^{\alpha}$ consists of its order $k$ , a set $F$ of flag orders, a set $M$ of marker orders, and a type $\tau$ ; we write $\mathit{ord}(\hat{\tau})=k$ . In order to distinguish types from full types, the latter are denoted by letters with a hat, like $\hat{\tau}$ .

A type judgment is of the form $\Gamma\vdash P:\hat{\tau}\triangleright c$ , where $\Gamma$ , called a type environment, is a function that maps every variable $x^{\alpha}$ to a subset of $\mathcal{F}^{\alpha}$ , $P$ is a $\lambda$ -term, $\hat{\tau}$ is a full type of the same sort as $P$ (i.e., $\hat{\tau}\in\mathcal{F}^{\beta}$ when $P$ is of sort $\beta$ ), and $c\in\mathbb{N}$ .

As usual for intersection types, the intuitive meaning of a type $T{\to}\tau$ is that a $\lambda$ -term having this type can return a $\lambda$ -term having type $\tau$ , while taking an argument for which we can derive all full types from $T$ . Moreover, in $\mathcal{T}^{o}$ there is just one type $o$ , which can be assigned to every $\lambda$ -term of sort $o$ . Suppose that we have derived a type judgment $\Gamma\vdash P:\hat{\tau}\triangleright c$ with $\hat{\tau}=(m,F,M,\tau)$ . Then

•

$\tau$ is the type derived for $P$ ;

•

$\Gamma$ contains full types that could be used for free variables of $P$ in the derivation;

•

$m$ bounds the order of flags and markers that could be used in the derivation: flags could be of order at most $m$ , and markers of order at most $m-1$ ;

•

$M\subseteq\{0,\dots,m-1\}$ contains the orders of markers used in the derivation, together with those provided by free variables (i.e., we imagine that some derivations, specified by the type environment, are already substituted in our derivation for free variables); we, however, do not include markers provided by arguments of the term (i.e., coming from the sets $T_{i}$ when $\tau=T_{1}{\to}\dots\to T_{k}{\to}o$ );

•

$F$ contains those numbers $n\in\{0,\dots,m-1\}$ (excluding $n=m$ ) for which a flag of order $n$ is placed in the derivation itself, or provided by a free variable, or provided by an argument; for technical convenience we, however, remove $n$ from $F$ whenever $n\in M$ (when $n\in M$ , the information about order- $n$ flags results in placing an order- $(n+1)$ flag, and need not to be further propagated);

•

$c$ , called a flag counter, counts the number of order- $m$ flags present in the derivation.

Type System.

Before giving rules of the type system, we need a few definitions. We use the symbol $\uplus$ to denote disjoint union. When $A\subseteq\mathbb{N}$ and $n\in\mathbb{N}$ , we write $A{\restriction}_{<n}$ for $\{k\in A\mid k<n\}$ , and similarly $A{\restriction}_{\geq n}$ for $\{k\in A\mid k\geq n\}$ . By $\varepsilon$ we denote the type environment mapping every variable to $\emptyset$ , and by $\Gamma[x\mapsto T]$ the type environment mapping $x$ to $T$ and every other variable $y$ to $\Gamma(y)$ .

Let us now say how a type environment $\Gamma$ from the conclusion of a rule may be split into type environments $(\Gamma_{i})_{i\in I}$ used in premisses of the rule: we say that $\mathit{Split}(\Gamma\mid(\Gamma_{i})_{i\in I})$ holds if and only if for every variable $x$ it holds $\Gamma_{i}(x)\subseteq\Gamma(x)$ for every $i\in I$ , and every full type from $\Gamma(x)$ providing some markers (i.e., $(k,F,M,\tau)$ with $M\neq\emptyset$ ) appears in some $\Gamma_{i}(x)$ . Full types with empty $M$ may be discarded and duplicated freely. This definition forbids to discard full types with nonempty $M$ , and from elsewhere it will follow that they cannot be duplicated. As a special case $\mathit{Split}(\Gamma\mid\Gamma^{\prime})$ describes how a type environment can be weakened.

All type derivations are assumed to be finite (although we derive types mostly for infinite $\lambda$ -terms, each type derivation analyzes only a finite part of a term). Rules of the type system will guarantee that the order $m$ of derived full types will be the same in the whole derivation (although in type environments there may be full types of different orders).

We are ready to give the first three rules of our type system:

[TABLE]

We see that to derive a type for the nondeterministic choice $\mathsf{br}\,P_{1}\,P_{2}$ , we need to derive it either for $P_{1}$ or for $P_{2}$ .

The (Var) rule allows to have in the resulting set $M$ some numbers that do not come from the set $M^{\prime}$ assigned to $x$ by the type environment; these are the orders of markers placed in the leaf using this rule. Notice, however, that we allow here only orders not smaller than $k$ (which is the order of the superterm $\lambda x.P$ binding this variable $x$ ). This is consistent with the intuitive description of the type system (page 3), which says that a marker of order $n$ can be put in a place that will be a leaf after performing all $\beta$ -reductions of orders greater than $n$ . Indeed, the variable $x$ remains a leaf after performing $\beta$ -reductions of orders greater than $k$ , but while performing $\beta$ -reductions of order $k$ this leaf will be replaced by a subterm substituted for $x$ . Recall also that, by definition of a type judgment, we require that $(k,F,M^{\prime},\tau)\in\mathcal{F}^{\alpha}_{k}$ and $(m,F,M,\tau)\in\mathcal{F}^{\alpha}_{m}$ , for appropriate sort $\alpha$ ; this introduces a bound on maximal numbers that may appear in the sets $F$ and $M$ .

Example 1.

Denoting $\hat{\rho}_{1}=(1,\emptyset,\{0\},o)$ we can derive:

[TABLE]

In the derivation on the right, the marker of order $1$ is placed in the conclusion of the rule.

The ( $\lambda$ ) rule allows to use (in a subderivation concerning the $\lambda$ -term $P$ ) the variable $x$ with all full types given in the set $T$ . When the sort of $\lambda x.P$ is $\alpha{\to}\beta$ , by definition of $\mathcal{T}^{\alpha{\to}\beta}$ we have that all full types in $T$ have the same order $k=\mathit{ord}(\alpha{\to}\beta)$ (since $(T{\to}\tau)\in\mathcal{T}^{\alpha{\to}\beta}$ ). Recall that we intend to store in the set $M$ the markers contained in the derivation itself and those provided by free variables, but not those provided by arguments. Because of this, in the conclusion of the rule we remove from $M$ the markers provided by $x$ . This operation makes sense only because there is at most one marker of each order, so markers provided by $x$ cannot be provided by any other free variable nor placed in the derivation itself. The set $F$ , unlike $M$ , stores also flags provided by arguments, so we do not need to remove anything from $F$ .

Example 2.

The ( $\lambda$ ) rule can be used, e.g., in the following way (where $a$ is a symbol of rank $1$ ):

[TABLE]

Notice that in the conclusion of the rule, in both examples, we remove [math] from the set of marker orders, because the order-[math] marker is provided by $x$ .

The next two rules use a predicate $\mathit{Comp}_{m}$ , saying how flags and markers from premisses contribute to the conclusion. It takes “as input” pairs $(F_{i},c_{i})$ for $i\in I$ ; each of them consists of the set of flag orders $F_{i}$ and of the flag counter $c_{i}$ from some premiss. Moreover, the predicate takes a set of marker orders $M$ from the current type judgment (it contains orders of markers used in the derivation, including those provided by free variables). The goal is to compute the set of flag orders $F$ and the flag counter $c$ that should be placed in the current type judgment. First, for each $n\in\{1,\dots,m\}$ consecutively, we decide whether a flag of order $n$ should be placed on the current type judgment. We follow here the rules mentioned in the intuitive description. Namely, we place a flag of order $n$ if we are on the path leading to the marker of order $n-1$ (i.e., if $n-1\in M$ ), and simultaneously we receive an information about a flag of order $n-1$ . By receiving this information we mean that either a flag of order $n-1$ was placed on the current type judgment, or $n-1$ belongs to some set $F_{i}$ . Actually, we place multiple flags of order $n$ : one per each flag of order $n-1$ placed on the current type judgment, and one per each set $F_{i}$ containing $n-1$ . Then, we compute $F$ and $c$ . In $c$ we store the number of flags of the maximal order $m$ : we sum all the numbers $c_{i}$ , and we add the number of order- $m$ flags placed on the current type judgment. In $F$ we keep elements of all $F_{i}$ , and we add the orders $n$ of flags that were placed on the current type judgment. We, however, remove from $F$ all elements of $M$ . This is because every flag of some order $n-1$ should result in creating at most one flag of order $n$ , in the closest ancestor that lies on the path leading to the marker of order $n-1$ . If we have created an order- $n$ flag on the current type judgment, i.e., if $n-1\in M$ , we do not want to do this again in the parent.

Below we give a formal definition, in which $f_{n}^{\prime}$ contains the number of order- $n$ flags placed on the current type judgment, while $f_{n}$ additionally counts the number of premisses for which $n\in F_{i}$ . We say that $\mathit{Comp}_{m}(M;\allowbreak((F_{i},c_{i}))_{i\in I})=(F,c)$ when

[TABLE]

We now present a rule for constants other than $\mathsf{br}$ :

[TABLE]

Here, the conditions in the second line say that in a node using the (Con) rule we always place a flag of order [math] (via $F^{\prime}$ or via $c^{\prime}$ , depending on $m$ ), and that if the node is a leaf (i.e., $r=0$ ), then we are allowed to place markers of arbitrary order (via $M^{\prime}$ ). Then to the $\mathit{Comp}_{m}$ predicate, beside of pairs $(F_{i},c_{i})$ coming from premisses, we also pass the information $(F^{\prime},c^{\prime})$ about the order-[math] flag placed in the current node; this predicate decides whether we should place also some flags of positive orders. Let us emphasize that in this rule (and similarly in the next rule) we have a disjoint union $M^{\prime}\uplus M_{1}\uplus\dots\uplus M_{r}$ , which ensures that a marker of any order may be placed only in one node of a derivation.

Example 3.

The (Con) rule may be instantiated in the following way:

[TABLE]

In the left example, flags of order [math] and $1$ are placed in the conclusion of the rule (a flag of order [math] is created because we are in a constant; since the marker of order [math] is visible, we do not put [math] into the set of flag orders, but instead we create a flag of order $1$ ). In the right example, a marker of order $1$ is visible, which causes that this time flags of order [math], $1$ , and $2$ are placed in the conclusion of the (Con) rule (again, we do not put [math] nor $1$ into the set of flag orders, because of [math] and $1$ in the set of marker orders).

The next rule describes application:

[TABLE]

In this rule, it is allowed (but in fact useless) that for two different $i\in I$ the full types $(m,F_{i},M_{i},\tau_{i})$ are equal. It is also allowed that $I=\emptyset$ , in which case no type needs to be derived for $Q$ . Observe how flags and markers coming from premisses concerning $Q$ are propagated: only flags and markers of order $n<\mathit{ord}(P)$ are visible to $P$ , while only flags of order $n\geq\mathit{ord}(P)$ are passed to the $\mathit{Comp}_{m}$ predicate. This can be justified if we recall the intuitions staying behind the type system (see page 3). Indeed, while considering flags and markers of order $n$ , we should imagine the $\lambda$ -term obtained from the current $\lambda$ -term by performing all $\beta$ -reductions of all orders greater than $n$ ; the distribution of flags and markers of order $n$ in the current $\lambda$ -term actually simulates their distribution in this imaginary $\lambda$ -term. Thus, if $n<\mathit{ord}(P)$ , then our application will disappear in this imaginary $\lambda$ -term, and $Q$ will be already substituted somewhere in $P$ ; for this reason we need to pass the information about flags and markers of order $n$ from $Q$ to $P$ . Conversely, if $n\geq\mathit{ord}(P)$ , then in the imaginary $\lambda$ -term the considered application will be still present, and in consequence the subterm corresponding to $P$ will not see flags and markers of order $n$ placed in the subterm corresponding to $Q$ .

Example 4.

Denote by $\hat{\tau}_{\mathsf{f}}$ and $\hat{\tau}_{\mathsf{m}}$ the types derived in Example 2:

[TABLE]

Then, using the (@) rule, we can derive (where $e$ is a symbol of rank [math], and $f$ a variable):

[TABLE]

Recall that $\hat{\rho}_{1}=(1,\emptyset,\{0\},o)$ . In the conclusion of the (@) rule the information about a flag of order $1$ (from the second premiss) meets the information about the marker of order $1$ (from the first premiss), and thus a flag of order $2$ is placed, which increases the flag counter. Notice that we have discarded the full type $\hat{\tau}_{\mathsf{f}}$ assigned to $f$ in the type environment; this is allowed because $\hat{\tau}_{\mathsf{f}}$ provides no markers (equally well $\hat{\tau}_{\mathsf{f}}$ could be assigned to $f$ also in one or two of the premisses, and discarded there). On the other hand, the full type $\hat{\tau}_{\mathsf{m}}$ provides markers, so it cannot be discarded nor duplicated (in particular, we could not pass it to the conclusion of the (Con) rule).

The key property of the type system is described by the following theorem.

Theorem 2.

Let $P$ be a closed $\lambda$ -term of sort $o$ and complexity $m$ . Then $\mathcal{L}(P)$ is infinite if and only if for arbitrarily large $c$ we can derive $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ , where $\hat{\rho}_{m}=(m,\emptyset,\{0,\dots,m-1\},o)$ .

The left-to-right implication of Theorem 2 (completeness of the type system) is shown in Section 4, while the opposite implication (soundness of the type system) in Section 5. In Section 6 we discuss how Theorem 1 follows from Theorem 2. Before all that, we give a few more examples of derivations, illustrating the type system and Theorem 2.

Example 5.

In this example we analyze the $\lambda$ -term $P_{1}=R\,(\lambda x.a\,x)$ , where $R$ is defined by coinduction as $R=(\lambda f.\mathsf{br}\,(f\,e)\,(R\,(\lambda x.f\,(f\,x))))$ . As previously, $a$ and $e$ are symbols of rank $1$ and [math], respectively. In $\mathcal{L}(P_{1})$ there are trees that consist of a branch of $a$ symbols ended with an $e$ symbol, but only those where the number of $a$ symbols is $2^{k}$ for some $k\in\mathbb{N}$ . Notice that the complexity of $P_{1}$ is $2$ .

Continuing Example 4, we derive the full type $\hat{\sigma}_{R}=(2,\emptyset,\{0\},\{\hat{\tau}_{\mathsf{f}},\hat{\tau}_{\mathsf{m}}\}{\to}o)$ for $R$ :

[TABLE]

Next, we derive the same full type for $R$ , but using the second argument of the $\mathsf{br}$ symbol; this results in greater values of the flag counter. We start by deriving the full type $\hat{\tau}_{\mathsf{f}}$ for the subterm $\lambda x.f\,(f\,x)$ :

[TABLE]

In the above derivation there are no flags nor markers. Next, we derive $\hat{\tau}_{\mathsf{m}}$ for the same subterm:

[TABLE]

Below the lower (@) rule the information about a flag of order $1$ meets the information about the marker of order $1$ , and thus a flag of order $2$ is placed, which increases the flag counter. We continue with the $\lambda$ -term $R$ :

[TABLE]

In this fragment of a derivation no flag nor marker is placed. In particular, there is no order- $2$ flag in conclusion of the (@) rule, although its second premiss provides a flag of order $1$ while the third premiss provides the marker of order $1$ . We recall from the definition of the (@) rule that the information about flags and markers coming from the arguments is divided into two parts. Numbers smaller than the order of the operator ( $\mathit{ord}(R)=2$ in our case) are passed to the operator, while only greater numbers ( $\geq 2$ in our case) contribute in creating new flags via the $\mathit{Comp}$ predicate.

By composing the above fragments of a derivation, we can derive $\varepsilon\vdash R:\hat{\sigma}_{R}\triangleright c$ for every $c\geq 1$ . Recall that in Examples 1-3 we have derived $\varepsilon\vdash\lambda x.a\,x:\hat{\tau}_{\mathsf{f}}\triangleright 0$ and $\varepsilon\vdash\lambda x.a\,x:\hat{\tau}_{\mathsf{m}}\triangleright 1$ . Together with the above, this allows to derive for $P_{1}$ the full type $\hat{\rho}_{2}=(2,\emptyset,\{0,1\},o)$ (appearing in Theorem 2):

[TABLE]

We can notice a correspondence between a derivation with flag counter $c+1$ and a tree in $\mathcal{L}(P)$ of size $2^{c-1}+1$ . We remark that in every of these derivations only three flags of order [math] and only three flags of order $1$ are present, in the three nodes using the (Con) rule.

Example 6.

Consider a similar $\lambda$ -term $P_{2}=R\,(\lambda x.b\,x\,x)$ , where $R$ is as previously, and $b$ is a symbol of rank $2$ . In $\mathcal{L}(P_{2})$ we have, for every $k\in\mathbb{N}$ , a full binary tree in which every branch consist of $2^{k}$ symbols $b$ and ends with an $e$ symbol.

This time for the subterm $\lambda x.b\,x\,x$ we need to derive three full types:

[TABLE]

The last one is derived with flag counter $1$ . Notice that $\hat{\tau}_{\mathsf{f}}^{\prime}$ and $\hat{\tau}_{\mathsf{m}}^{\prime}$ need now two full types for the argument $x$ ; the new one $(1,\{0\},\emptyset,o)$ describes the subtree that is not on the path to the order-[math] marker. We also have a new full type $\hat{\tau}_{0}^{\prime}$ that describes the use of $\lambda x.b\,x\,x$ outside of the path to the order-[math] marker.

Then, similarly as in the previous example, for every $c\geq 1$ we can derive $\varepsilon\vdash R:\hat{\sigma}_{R}^{\prime}\triangleright c$ , where $\hat{\sigma}_{R}^{\prime}=(2,\emptyset,\{0\},\{\hat{\tau}_{0}^{\prime},\hat{\tau}_{\mathsf{f}}^{\prime},\hat{\tau}_{\mathsf{m}}^{\prime}\}{\to}o)$ . Again, this allows to derive $\varepsilon\vdash P_{2}:\hat{\rho}_{2}\triangleright c+1$ . This time a derivation with flag counter $c+1$ corresponds to a tree in $\mathcal{L}(P)$ of size $2^{h}-1$ with $h=2^{c-1}+1$ .

Example 7.

Next, consider the $\lambda$ -term $P_{3}=R\,(\lambda x.\,x)$ . The only tree in $\mathcal{L}(P_{3})$ consists of a single $e$ node. Let us see how the derivation from Example 5 has to be modified. The full type $\hat{\tau}_{\mathsf{m}}$ can still be derived for $\lambda x.\,x$ (although with flag counter [math] now), but instead of $\hat{\tau}_{\mathsf{f}}$ we have to use $\hat{\tau}_{\mathsf{f}}^{\prime\prime}=(2,\emptyset,\emptyset,\{\hat{\rho}_{1}\}{\to}o)$ that provides no flag of order $1$ :

[TABLE]

Next, for $R$ we want to derive the full type $\hat{\sigma}_{R}^{\prime\prime}=(2,\emptyset,\{0\},\{\hat{\tau}_{\mathsf{f}}^{\prime\prime},\hat{\tau}_{\mathsf{m}}\}{\to}o)$ . We can easily adopt every of the previous derivations for $\varepsilon\vdash R:\hat{\sigma}_{R}\triangleright c$ : we basically replace every $\hat{\tau}_{\mathsf{f}}$ by $\hat{\tau}_{\mathsf{f}}^{\prime\prime}$ . The key point is that while deriving the full type $\hat{\tau}_{\mathsf{m}}$ for the subterm $\lambda x.f\,(f\,x)$ , previously in the lower (@) rule we have received information about an order- $1$ flag, and thus we have created an order- $2$ flag and increased the flag counter; this time there is no information about an order- $1$ flag, and thus we do not create an order- $2$ flag and do not increase the flag counter. In consequence, even if this part of the derivation is repeated arbitrarily many times, the value of the flag counter of the whole derivation remains $1$ .

Example 8.

Finally, consider the $\lambda$ -term $P_{4}=(\lambda g.P_{3})\,(\lambda x.a\,(a\,(\dots\,(a\,x)\dots))$ , which $\beta$ -reduces to $P_{3}$ . Notice that we can create the following derivation:

[TABLE]

Every (Con) rule used in this derivation places in its conclusion an order-[math] flag and an order- $1$ flag. This derivation can be used as a part of a derivation for $P_{4}$ :

[TABLE]

Because $\hat{\tau}_{\mathsf{f}}$ provides no markers, it can be removed from the type environment and thus for $P_{3}$ we can use the derivation from the previous example. We thus obtain a derivation for $P_{4}$ in which there are many order-[math] and order- $1$ flags (but only one flag of order $2$ ). This shows that in the flag counter we indeed need to count only the number of flags of the maximal order (not, say, the total number of flags of all orders).

4 Completeness

The proof of the left-to-right implication of Theorem 2 is divided into the following three lemmata. Recall that a $\beta$ -reduction $P\to_{\beta}Q$ is of order $n$ if it concerns a redex $(\lambda x.R)\,S$ such that $\mathit{ord}(\lambda x.R)=n$ . The number of nodes of a tree $t$ is denoted $|t|$ . As in Theorem 2, we denote $\hat{\rho}_{m}=(m,\emptyset,\{0,\dots,m-1\},o)$ .

Lemma 3.

Let $P$ be a closed $\lambda$ -term of sort $o$ and complexity $m$ , and let $t\in\mathcal{L}(P)$ . Then there exist $\lambda$ -terms $Q_{m},Q_{m-1},\dots,Q_{0}$ such that $P=Q_{m}$ , and for every $k\in\{1,\dots,m\}$ the term $Q_{k-1}$ can be reached from $Q_{k}$ using only $\beta$ -reductions of order $k$ , and we can derive $\varepsilon\vdash Q_{0}:\hat{\rho}_{0}\triangleright|t|$ .

Lemma 4.

Suppose that we can derive $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ . Then we can also derive $\varepsilon\vdash P:\hat{\rho}_{m+1}\triangleright c^{\prime}$ for some $c^{\prime}\geq\log_{2}c$ .

Lemma 5.

Suppose that $P\to_{\beta}Q$ is a $\beta$ -reduction of order $m$ , and we can derive $\Gamma\vdash Q:\hat{\tau}\triangleright c$ with $\mathit{ord}(\hat{\tau})=m$ . Then we can also derive $\Gamma\vdash P:\hat{\tau}\triangleright c$ .

Now the left-to-right implication of Theorem 2 easily follows. Indeed, take a closed $\lambda$ -term $P$ of sort $o$ and complexity $m$ such that $\mathcal{L}(P)$ is infinite, and take any $c\in\mathbb{N}$ . By $\log^{k}_{2}$ we denote the $k$ -fold application of the logarithm: $\log^{0}_{2}x=x$ and $\log^{k+1}_{2}x=\log_{2}(\log_{2}^{k}x)$ . Since $\mathcal{L}(P)$ is infinite, it contains a tree $t$ so big that $\log_{2}^{m}|t|\geq c$ . We apply Lemma 3 to this tree, obtaining $\lambda$ -terms $Q_{m},Q_{m-1},\dots,Q_{0}$ and a derivation of $\varepsilon\vdash Q_{0}:\hat{\rho}_{0}\triangleright|t|$ . Then repeatedly for every $k\in\{1,\dots,m\}$ we apply Lemma 4, obtaining a derivation of $\varepsilon\vdash Q_{k-1}:\hat{\rho}_{k}\triangleright c_{k}$ for some $c_{k}\geq\log^{k}_{2}|t|$ , and Lemma 5 for every $\beta$ -reduction (of order $k$ ) between $Q_{k}$ and $Q_{k-1}$ , obtaining a derivation of $\varepsilon\vdash Q_{k}:\hat{\rho}_{k}\triangleright c_{k}$ . We end with a derivation of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c_{m}$ , where $c_{m}\geq\log^{m}_{2}|t|\geq c$ , as needed. In the remaining part of this section we prove the three lemmata.

Proof of Lemma 3 (sketch).

Recall that $t\in\mathcal{L}(P)$ is a finite tree, thus it can be found in some finite prefix of the Böhm tree of $P$ . By definition, this prefix will be already expanded after performing some finite number of $\beta$ -reductions from $P$ . We need to observe that these $\beta$ -reductions can be rearranged, so that those of higher order are performed first.

The key point is to observe that when we perform a $\beta$ -reduction of some order $k$ , then no new $\beta$ -redexes of higher order appear in the term. Indeed, suppose that $(\lambda x.R)\,S$ is changed into $R[S/x]$ somewhere in a term, where $\mathit{ord}(\lambda x.R)=k$ . One new redex that may appear is when $R$ starts with a $\lambda$ , and to the whole $R[S/x]$ some argument is applied; this redex is of order $\mathit{ord}(R)\leq k$ . Some other redexes may appear when $S$ starts with a $\lambda$ , and is substituted for such appearance of $x$ to which some argument is applied; but this redex is of order $\mathit{ord}(S)<k$ .

We can thus find a sequence of $\beta$ -reductions in which $\beta$ -reductions are arranged according to their order, that leads from $P$ to some $Q_{0}$ such that $t$ can be found in the prefix of $Q_{0}$ that is already expanded to a tree. It is now a routine to use the rules of our type system and derive $\varepsilon\vdash Q_{0}:\hat{\rho}_{0}\triangleright|t|$ : in every $\mathsf{br}$ -labeled node we choose the subtree in which $t$ continues, and this effects in counting the number of nodes of $t$ in the flag counter. ∎

Proof of Lemma 4.

Consider some derivation of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ . In this derivation we choose a leaf in which we will put the order- $m$ marker, as follows. Starting from the root of the derivation, we repeatedly go to this premiss in which the flag counter is the greatest (arbitrarily in the case of a tie). In every node that is not on the path to the selected leaf, we replace the current type judgment $\Gamma\vdash Q:(m,F,M,\tau)\triangleright d$ by $\Gamma\vdash Q:(m+1,F^{\prime},M,\tau)\triangleright 0$ , where $F^{\prime}=F\cup\{m\}$ if $d>0$ , and $F^{\prime}=F$ otherwise. In the selected leaf and all its ancestors, we change the order from $m$ to $m+1$ , we add $m$ to the set of marker orders, and we recalculate the flag counter.

Let us see how such transformation changes the flag counter on the path to the selected leaf. We will prove (by induction) that the previous value $d$ and the new value $d^{\prime}$ of the flag counter in every node on this path satisfy $d^{\prime}\geq\log_{2}d$ . In the selected leaf itself, the flag counter (being either [math] or $1$ ) remains unchanged; we have $d^{\prime}=d\geq\log_{2}d$ . Next, consider any proper ancestor of the selected node. Let $k$ be the number of those of its children in which the flag counter was positive, plus the number of order- $m$ flags placed in the considered node itself. Let also $d_{\max}$ and $d_{\max}^{\prime}$ be the previous value and the new value of the flag counter in this child that is in the direction of the selected leaf. By construction, the flag counter in this child was maximal, which implies $k\cdot d_{\max}\geq d$ , while by the induction assumption $d^{\prime}_{\max}\geq\log_{2}d_{\max}$ . To $d^{\prime}$ we take the flag counter only from the special child, while for other children with positive flag counter we add $1$ , i.e., $d^{\prime}=k-1+d^{\prime}_{\max}$ . Altogether we obtain $d^{\prime}=k-1+d^{\prime}_{\max}\geq k-1+\log_{2}d_{\max}\geq\log_{2}(k\cdot d_{\max})\geq\log_{2}d$ , as required. ∎

Proof of Lemma 5.

We consider the base case when $P=(\lambda x.R)\,S$ and $Q=R[S/x]$ ; the general situation (redex being deeper in $P$ ) is easily reduced to this one. In the derivation of $\Gamma\vdash Q:\hat{\tau}\triangleright c$ we identify the set $I$ of places (nodes) where we derive a type for $S$ substituted for $x$ . For $i\in I$ , let $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ be the type judgment in $i$ . We change the nodes in $I$ into leaves, where we instead derive $\varepsilon[x\mapsto\{\hat{\sigma}_{i}\}]\vdash x:\hat{\sigma}_{i}\triangleright 0$ . It should be clear that we can repair the rest of the derivation, by changing type environments, replacing $S$ by $x$ in $\lambda$ -terms, and decreasing flag counters. In this way we obtain derivations of $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ for every $i\in I$ , and a derivation of $\Sigma^{\prime}\vdash R:\hat{\tau}\triangleright d$ , where $\Sigma^{\prime}=\Sigma[x\mapsto\{\hat{\sigma}_{i}\mid i\in I\}]$ with $\Sigma(x)=\emptyset$ , and $\mathit{Split}(\Gamma\mid\Sigma,(\Sigma_{i})_{i\in I})$ , and $c=d+\Sigma_{i\in I}d_{i}$ . To the latter type judgment we apply the ( $\lambda$ ) rule, and then we merge it with the type judgments for $S$ using the (@) rule, which results in a derivation for $\Gamma\vdash P:\hat{\tau}\triangleright c$ . We remark that different $i\in I$ may give identical type judgments for $S$ (as long as the set of markers in $\hat{\sigma}_{i}$ is empty); this is not a problem. The (@) rule requires that $\mathit{ord}(\hat{\sigma}_{i})=\mathit{ord}(\lambda x.R)$ ; we have that $\mathit{ord}(\hat{\sigma}_{i})=\mathit{ord}(\hat{\tau})$ , and $\mathit{ord}(\hat{\tau})=m=\mathit{ord}(\lambda x.R)$ by assumption. ∎

5 Soundness

In this section we sketch the proof of the right-to-left implication of Theorem 2. We, basically, need to reverse the proof from the previous section. The following new fact is now needed.

Lemma 6.

If we can derive $\Gamma\vdash P:(m,F,M,\tau)\triangleright c$ with $m-1\not\in M$ and $\mathit{ord}(P)\leq m-1$ , then $c=0$ .

A simple inductive proof is based on the following idea: flags of order $m$ are created only when a marker of order $m-1$ is visible; the derivation itself (together with free variables) does not provide it ( $m-1\not\in M$ ), and the arguments, i.e. sets $T_{1},\dots,T_{k}$ in $\tau=T_{1}{\to}\dots\to T_{k}{\to}o$ , may provide only markers of order at most $\mathit{ord}(P)-1\leq m-2$ (see the definition of a type), thus no flags of order $m$ can be created.

We say that a $\lambda$ -term of the form $P\,Q$ is an application of order $n$ when $\mathit{ord}(P)=n$ , and that an (@) rule is of order $n$ if it derives a type for an application of order $n$ . We can successively remove applications of the maximal order from a type derivation.

Lemma 7.

Suppose that $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ for $m>0$ is derived by a derivation $D$ in which the (@) rule of order $m$ is used $n$ times. Then there exists $Q$ such that $P\to_{\beta}Q$ and $\varepsilon\vdash Q:\hat{\rho}_{m}\triangleright c$ can be derived by a derivation $D^{\prime}$ in which the (@) rule of order $m$ is used less than $n$ times.

Recall from the definition of the type system that the (@) rule of orders higher than $m$ cannot be used while deriving a full type of order $m$ . Thus in $D$ we have type judgments only for subterms of $P$ of order at most $m$ (although $P$ may also have subterms of higher orders), and in type environments we only have variables of order at most $m-1$ . In order to prove Lemma 7 we choose in $P$ a subterm $R\,S$ with $\mathit{ord}(R)=m$ such that there is a type judgment for $R\,S$ in some nodes of $D$ (at least one), but no descendants of those nodes use the (@) rule of order $m$ . Since $R$ is of order $m$ , it cannot be an application (then we would choose it instead of $R\,S$ ) nor a variable; thus $R=\lambda x.R^{\prime}$ . We obtain $Q$ by reducing the redex $(\lambda x.R^{\prime})\,S$ ; the derivation $D^{\prime}$ is obtained by performing a surgery on $D$ similar to that in the proof of Lemma 5 (but in the opposite direction). Notice that every full type $(m,F,M,\tau)$ (derived for $S$ ) with nonempty $M$ is used for exactly one appearance of $x$ in the derivation for $R^{\prime}$ ; full types with empty $M$ may be used many times, or not used at all, but thanks to Lemma 6 duplicating or removing the corresponding derivations for $S$ does not change the flag counter. In the derivations for $R^{\prime}[S/x]$ no (@) rule of order $m$ may appear, and the application $R\,S$ disappears, so the total number of (@) rules of order $m$ decreases.

When all (@) rules of order $m$ are eliminated, we can decrease $m$ .

Lemma 8.

Suppose that $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ for $m>0$ is derived by a derivation $D$ in which the (@) rule of order $m$ is not used. Then we can also derive $\varepsilon\vdash P:\hat{\rho}_{m-1}\triangleright c^{\prime}$ for some $c^{\prime}\geq c$ .

The proof is easy; we simply decrease the order $m$ of all derived full types by $1$ , and we ignore flags of order $m$ and markers of order $m-1$ . To obtain the inequality $c^{\prime}\geq c$ we observe that when no (@) rule of order $m$ is used, the information about flags of order $m-1$ goes only from descendants to ancestors, and thus every flag of order $m$ is created because of a different flag of order $m-1$ .

By repeatedly applying the two above lemmata, out of a derivation of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ we obtain a derivation of $\varepsilon\vdash Q:\hat{\rho}_{0}\triangleright c^{\prime}$ , where $P\to_{\beta}^{*}Q$ and $c^{\prime}\geq c$ . Since $\hat{\rho}_{0}$ is of order [math], using the latter derivation it is easy to find in the already expanded part of $Q$ (and thus in $\mathcal{L}(Q)=\mathcal{L}(P)$ ) a tree $t$ such that $|t|=c^{\prime}\geq c$ .

6 Effectiveness

Finally, we show how Theorem 1 follows from Theorem 2, i.e., how given a $\lambda Y$ -term $P$ of complexity $m$ we can check whether $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ can be derived for arbitrarily large $c$ . We say that two type judgments are equivalent if they differ only in the value of the flag counter.

Let us consider a set $\mathcal{D}$ of all derivations of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ in which on each branch (i.e., each root-leaf path) there are at most three type judgments from every equivalence class, and among premisses of each (@) rule there is at most one type judgment from every equivalence class. These derivations use only type judgments $\Gamma\vdash Q:\hat{\tau}\triangleright d$ with $Q$ being a subterm of $P$ and with $\Gamma(x)\neq\emptyset$ only for variables $x$ appearing in $P$ . Since a finite $\lambda Y$ -term, even when seen as an infinitary $\lambda$ -term, has only finitely many subterms, this introduces a common bound on the height of all derivations in $\mathcal{D}$ , and on their degree (i.e., on the maximal number of premisses of a rule). It follows that there are only finitely many derivations in $\mathcal{D}$ , and thus we can compute all of them.

We claim that $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ can be derived for arbitrarily large $c$ if and only if in $\mathcal{D}$ there is a derivation in which on some branch there are two equivalent type judgments with different values of the flag counter (and the latter condition can be easily checked). Indeed, having such a derivation, we can repeat its fragment between the two equivalent type judgments, obtaining derivations of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ with arbitrarily large $c$ . We use here an additivity property of our type system: if out of $\Gamma\vdash Q:\hat{\tau}\triangleright d$ we can derive $\Gamma^{\prime}\vdash Q^{\prime}:\hat{\tau}^{\prime}\triangleright d^{\prime}$ , then out of $\Gamma\vdash Q:\hat{\tau}\triangleright d+k$ we can derive $\Gamma^{\prime}\vdash Q^{\prime}:\hat{\tau}^{\prime}\triangleright d^{\prime}+k$ , for every $k\geq-d$ . Conversely, take a derivation of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ for some large enough $c$ . Suppose that some of its (@) rules uses two equivalent premisses. These premisses concern the argument subterm, which is of smaller order than the operator subterm, and thus of order at most $m-1$ . The set of marker orders in these premisses has to be empty, as the sets of marker orders from all premisses have to be disjoint. Thus, by Lemma 6, the flag counter in our two premisses is [math].

In consequence, we can remove one of the premisses, without changing anything in the remaining part of the derivation, even the flag counters. In this way we clean the whole derivation, so that at the end among premisses of each (@) rule there is at most one type judgment from every equivalence class. The degree is now bounded, and at each node the flag counter grows only by a constant above the sum of flag counters from the children. Thus, if $c$ is large enough, we can find on some branch two equivalent type judgments with different values of the flag counter. Then, for some pairs of equivalent type judgments, we remove the part of the derivation between these type judgments (and we adopt appropriately the flag counters in the remaining part). It it not difficult to perform this cleaning so that the resulting derivation will be in $\mathcal{D}$ , and simultaneously on some branch there will remain two equivalent type judgments with different values of the flag counter.

7 Conclusions

In this paper, we have shown an approach for expressing quantitative properties of Böhm trees using an intersection type system, on the example of the finiteness problem. It is an ongoing work to apply this approach to the diagonal problem, which should give a better complexity than that of the algorithm from [7]. Another ongoing work is to obtain an algorithm for model checking Böhm trees with respect to the Weak MSO+U logic [4]. This logic extends Weak MSO by a new quantifier U, expressing that a subformula holds for arbitrarily large finite sets. Furthermore, it seems feasible that our methods may help in proving a pumping lemma for nondeterministic HORSes.

Appendix A Proof of Lemma 3

Let us write $P\approx_{n}P^{\prime}$ if the $\lambda$ -terms $P$ and $P^{\prime}$ agree up to depth $n\in\mathbb{N}$ . Formally, $\approx_{n}$ is defined by induction on $n$ as the smallest equivalence relation such that:

•

$P\approx_{0}Q$ for all $\lambda$ -terms $P,Q$ ,

•

$a\,P_{1}\,\dots\,P_{r}\approx_{n}a\,P_{1}^{\prime}\,\dots\,P_{r}^{\prime}$ if $P_{i}\approx_{n-1}P_{i}^{\prime}$ for all $i\in\{1,\dots,r\}$ ,

•

$P\,Q\approx_{n}P^{\prime}\,Q^{\prime}$ if $P\approx_{n-1}P^{\prime}$ and $Q\approx_{n-1}Q^{\prime}$ , and

•

$\lambda x.P\approx_{n}\lambda x.P^{\prime}$ if $P\approx_{n-1}P^{\prime}$ .

Observe that $P\approx_{n}P^{\prime}$ implies $P\approx_{k}P^{\prime}$ for $k<n$ .

We split the proof of Lemma 3 into several lemmata.

Lemma 9.

Let $P$ be a closed $\lambda$ -term of sort $o$ , and let $t\in\mathcal{L}(P)$ . Then there exists a number $n\in\mathbb{N}$ and a $\lambda$ -term $Q$ such that $P\to_{\beta}^{*}Q$ , and whenever $Q\approx_{n}Q^{\prime}$ and $Q^{\prime}\to_{\beta}^{*}Q^{\prime\prime}$ for some $\lambda$ -terms $Q^{\prime}$ , $Q^{\prime\prime}$ , then one can derive $\varepsilon\vdash Q^{\prime\prime}:\hat{\rho}_{0}\triangleright|t|$ .

Proof.

Let $\to_{\mathsf{br}}^{k}$ be the relation obtained by composing $\to_{\mathsf{br}}$ with itself $k$ times. By definition of $\mathcal{L}(P)$ we have that $\mathit{BT}(P)\to_{\mathsf{br}}^{*}t$ , and thus $\mathit{BT}(P)\to_{\mathsf{br}}^{k}t$ for some $k\in\mathbb{N}$ . We prove the lemma by induction on $|t|+k$ , where $k$ is the smallest number such that $\mathit{BT}(P)\to_{\mathsf{br}}^{k}t$ . Because $\mathcal{L}(P)\neq\emptyset$ , we have that $P\to_{\beta}^{*}a\,P_{1}\,\dots\,P_{r}$ (where possibly $a=\mathsf{br}$ ). Then $\mathit{BT}(P)$ has $a$ in its root, and the subtrees starting in root’s children are $\mathit{BT}(P_{1}),\dots,\mathit{BT}(P_{r})$ . We have two cases.

Suppose first that $a\neq\mathsf{br}$ (this case serves as the induction base when $r=0$ ). Then also $t$ has $a$ in its root, and the subtrees $t_{1},\dots,t_{r}$ starting in root’s children are such that $\mathit{BT}(P_{i})\to_{\mathsf{br}}^{k_{i}}t_{i}$ for all $i\in\{1,\dots,r\}$ , where $k_{1}+\dots+k_{r}=k$ . We have $|t_{i}|+k_{i}<|t|+k$ , since $|t_{i}|<|t|$ . By the induction assumption, for every $i\in\{1,\dots,r\}$ we obtain a number $n_{i}$ and a $\lambda$ -term $Q_{i}$ such that $P_{i}\to_{\beta}^{*}Q_{i}$ , and whenever $Q_{i}\approx_{n_{i}}Q_{i}^{\prime}$ and $Q_{i}^{\prime}\to_{\beta}^{*}Q_{i}^{\prime\prime}$ for some $\lambda$ -terms $Q^{\prime}_{i}$ , $Q_{i}^{\prime\prime}$ , then one can derive $\varepsilon\vdash Q^{\prime\prime}_{i}:\hat{\rho}_{0}\triangleright|t_{i}|$ . Taking $Q=a\,Q_{1}\,\dots\,Q_{r}$ we have $P\to_{\beta}^{*}a\,P_{1}\,\dots\,P_{r}\to_{\beta}^{*}a\,Q_{1}\,\dots\,Q_{r}=Q$ . As $n$ we take $1+n_{1}+\dots+n_{r}$ . Let now $Q^{\prime}$ and $Q^{\prime\prime}$ be such that $Q\approx_{n}Q^{\prime}$ and $Q^{\prime}\to_{\beta}^{*}Q^{\prime\prime}$ . Then $Q^{\prime}=a\,Q_{1}^{\prime}\,\dots\,Q_{r}^{\prime}$ and $Q^{\prime\prime}=a\,Q_{1}^{\prime\prime}\,\dots\,Q_{r}^{\prime\prime}$ , where $Q_{i}\approx_{n-1}Q_{i}^{\prime}$ (thus also $Q_{i}\approx_{n_{i}}Q_{i}^{\prime}$ ) and $Q_{i}^{\prime}\to_{\beta}^{*}Q_{i}^{\prime\prime}$ for $i\in\{1,\dots,r\}$ . From the induction assumption we obtain derivations of $\varepsilon\vdash Q_{i}^{\prime\prime}:\hat{\rho}_{0}\triangleright|t_{i}|$ . Recall that $\hat{\rho}_{0}=(0,\emptyset,\emptyset,o)$ . We apply the (Con) rule to these derivations. Since $\mathit{ord}(\hat{\rho})=0$ , the pair $(F^{\prime},c^{\prime})$ appearing in the rule’s definition equals $(\emptyset,1)$ . The $\mathit{Comp}_{0}$ predicate simply adds flag counters from its arguments, and hence the resulting flag counter is $1+|t_{1}|+\dots+|t_{r}|$ , which equals $|t|$ . Thus the resulting type judgment is $\varepsilon\vdash Q^{\prime\prime}:\hat{\rho}_{0}\triangleright|t|$ , as required.

Next, suppose that $a=\mathsf{br}$ (and hence $r=2$ ). It should be clear that in the shortest reduction sequence $\mathit{BT}(P)\to_{\mathsf{br}}^{k}t$ we can rearrange reductions (without increasing their number) so that we first eliminate the $\mathsf{br}$ symbol from the root of $\mathit{BT}(P)$ . In other words, we have $\mathit{BT}(P)\to_{\mathsf{br}}\mathit{BT}(P_{i})\to_{\mathsf{br}}^{k-1}t$ , for some $i\in\{1,2\}$ . Let us focus our attention on the case of $i=1$ ; the case of $i=2$ is completely symmetric. Since $|t|+k-1<|t|+k$ , from the induction assumption we obtain a number $n_{1}$ and a $\lambda$ -term $Q_{1}$ such that $P_{1}\to_{\beta}^{*}Q_{1}$ , and whenever $Q_{1}\approx_{n}Q_{1}^{\prime}$ and $Q_{1}^{\prime}\to_{\beta}^{*}Q_{1}^{\prime\prime}$ for some $\lambda$ -terms $Q_{1}^{\prime}$ , $Q_{1}^{\prime\prime}$ , then one can derive $\varepsilon\vdash Q^{\prime\prime}_{1}:\hat{\rho}_{0}\triangleright|t|$ . Taking $Q=\mathsf{br}\,Q_{1}\,P_{2}$ we have $P\to_{\beta}^{*}\mathsf{br}\,P_{1}\,P_{2}\to_{\beta}^{*}\mathsf{br}\,Q_{1}\,P_{2}=Q$ . As $n$ we take $1+n_{1}$ . Let now $Q^{\prime}$ , $Q^{\prime\prime}$ be such that $Q\approx_{n}Q^{\prime}$ and $Q^{\prime}\to_{\beta}^{*}Q^{\prime\prime}$ . Then $Q^{\prime}=\mathsf{br}\,Q_{1}^{\prime}\,Q_{2}^{\prime}$ and $Q^{\prime\prime}=\mathsf{br}\,Q_{1}^{\prime\prime}\,Q_{2}^{\prime\prime}$ , where $Q_{1}\approx_{n-1}Q_{1}^{\prime}$ and $Q_{1}^{\prime}\to_{\beta}^{*}Q_{1}^{\prime\prime}$ . By the induction assumption we can derive $\varepsilon\vdash Q_{1}^{\prime\prime}:\hat{\rho}_{0}\triangleright|t|$ , which after applying the (Br) rule gives $\varepsilon\vdash Q^{\prime\prime}:\hat{\rho}_{0}\triangleright|t|$ . ∎

Lemma 10.

If $P\approx_{n}P^{\prime}$ and $Q\approx_{n}Q^{\prime}$ for some $n\in\mathbb{N}$ , then also $P[Q/x]\approx_{n}P^{\prime}[Q^{\prime}/x]$ .

Proof.

Induction on $n$ . For $n=0$ the lemma is obvious: $\approx_{0}$ always holds. When $n>0$ and $P=R\,S$ , then $P^{\prime}=R^{\prime}\,S^{\prime}$ with $R\approx_{n-1}R^{\prime}$ and $S\approx_{n-1}S^{\prime}$ . By the induction assumption we have $R[Q/x]\approx_{n-1}R^{\prime}[Q^{\prime}/x]$ and $S[Q/x]\approx_{n-1}S^{\prime}[Q^{\prime}/x]$ , and thus $P[Q/x]\approx_{n}P^{\prime}[Q^{\prime}/x]$ . The cases when $P=a\,P_{1}\,\dots\,P_{r}$ or $P=\lambda y.Q$ are similar. Finally, when $P=P^{\prime}$ is a variable, the thesis follows immediately from $Q\approx_{n}Q^{\prime}$ . ∎

Lemma 11.

If $P\approx_{n+2}P^{\prime}$ and $P\to_{\beta}Q$ , then for some $Q^{\prime}$ we have $P^{\prime}\to_{\beta}^{*}Q^{\prime}$ and $Q\approx_{n}Q^{\prime}$ .

Proof.

Induction on $n$ . If $n=0$ , the thesis holds for $Q^{\prime}=P^{\prime}$ . Suppose that $n>0$ and $P=(\lambda x.R)\,S$ and $Q=R[S/x]$ . Then $P^{\prime}=(\lambda x.R^{\prime})\,S^{\prime}$ , where $R\approx_{n}R^{\prime}$ and $S\approx_{n+1}S^{\prime}$ . Taking $Q^{\prime}=R^{\prime}[S^{\prime}/x]$ we have $P^{\prime}\to_{\beta}Q^{\prime}$ , and by Lemma 10 $Q\approx_{n}Q^{\prime}$ . The remaining case is that $n>0$ and the redex involved in the $\beta$ -reduction $P\to_{\beta}Q$ is not located on the front of $P$ . Then the thesis follows from the induction assumption. Let us consider only a representative example: suppose that $P=R\,S$ , and $Q=T\,S$ , and $R\to_{\beta}T$ . In this case $P^{\prime}=R^{\prime}\,S^{\prime}$ with $R\approx_{n+1}R^{\prime}$ and $S\approx_{n+1}S^{\prime}$ . The induction assumption gives us $T^{\prime}$ such that $R^{\prime}\to_{\beta}^{*}T^{\prime}$ and $T\approx_{n-1}T^{\prime}$ . Thus for $Q^{\prime}=T^{\prime}S^{\prime}$ we have $P^{\prime}\to_{\beta}^{*}Q^{\prime}$ and $Q\approx_{n}Q^{\prime}$ . ∎

Lemma 12.

For every $n\in\mathbb{N}$ , we can represent every $\lambda$ -term $P$ as $P=P^{\prime}[S_{1}/x_{1},\dots,S_{s}/x_{s}]$ so that $P^{\prime}$ is finite and $P\approx_{n}P^{\prime}$ .

Proof.

Induction on $n$ . For $n=0$ we represent $P=x[S/x]$ , and clearly $x\approx_{0}P$ . For $n>0$ we consider the representative case of $P=Q\,R$ ; for other forms of $P$ the proof is similar. The induction assumption gives us representations $Q=Q^{\prime}[S_{1}/x_{1},\dots,S_{r}/x_{r}]$ and $R=R^{\prime}[S_{r+1}/x_{r+1},\dots,S_{s}/x_{s}]$ with $Q^{\prime},R^{\prime}$ finite and such that $Q\approx_{n-1}Q^{\prime}$ and $R\approx_{n-1}R^{\prime}$ . W.l.o.g. we can assume that the fresh variables $x_{1},\dots,x_{r}$ are not free in $R^{\prime}$ , and the fresh variables $x_{r+1},\dots,x_{s}$ are not free in $Q^{\prime}$ . Then, for $P^{\prime}=Q^{\prime}\,R^{\prime}$ we have $P\approx_{n}P^{\prime}$ and $P=P^{\prime}[S_{1}/x_{1},\dots,S_{s}/x_{s}]$ . ∎

Corollary 13.

Let $n\in\mathbb{N}$ , and let $P$ , $Q$ be $\lambda$ -terms such that $P\to_{\beta}^{*}Q$ . Then we can represent $P$ as $P=P^{\prime}[S_{1}/x_{1},\dots,S_{s}/x_{s}]$ so that $P^{\prime}$ is finite, and for some $\lambda$ -term $Q^{\prime}$ it holds $P^{\prime}\to_{\beta}^{*}Q^{\prime}$ and $Q\approx_{n}Q^{\prime}$ .

Proof.

Let us write $P=P_{0}\to_{\beta}P_{1}\to_{\beta}\dots\to_{\beta}P_{l}=Q$ . By Lemma 12 we can write $P=P^{\prime}[S_{1}/x_{1},\dots,S_{s}/x_{s}]$ so that $P^{\prime}$ is finite and $P\approx_{n+2l}P^{\prime}$ . Take $P_{0}^{\prime}=P^{\prime}$ . Consecutively for all $i\in\{1,\dots,l\}$ Lemma 11 gives us a $\lambda$ -term $P_{i}^{\prime}$ such that $P_{i-1}^{\prime}\to_{\beta}^{*}P_{i}^{\prime}$ and $P_{i}\approx_{n+2(l-i)}P_{i}^{\prime}$ . At the end, for $Q^{\prime}=P^{\prime}_{l}$ we have $P^{\prime}\to_{\beta}^{*}Q^{\prime}$ and $Q\approx_{n}Q^{\prime}$ . ∎

Below, a $\lambda$ -term $(\lambda x.P)\,Q$ is called a $\beta$ -redex of order $k$ if $k=\mathit{ord}(\lambda x.P)$ .

Lemma 14.

Let $k\in\mathbb{N}$ , and let $P$ be a $\lambda$ -term without $\beta$ -redexes of orders higher than $k$ (as subterms). If $P\to_{\beta}Q$ is a $\beta$ -reduction of order $k$ , then also in $Q$ there are no $\beta$ -redexes of order higher than $k$ .

Proof.

This lemma was already justified on page 4, but let us repeat. Let $P^{\prime}$ be obtained from $P$ by replacing by $y$ the $\beta$ -redex $(\lambda x.R)\,S$ involved in the $\beta$ -reduction $P\to_{\beta}Q$ . Then we have $P=P^{\prime}[(\lambda x.R)\,S/y]$ and $Q=P^{\prime}[R[S/x]/y]$ . Suppose that $Q$ has a $\beta$ -redex of order higher than $k$ , i.e., a subterm $(\lambda z.T)\,U$ with $\mathit{ord}(\lambda z.T)>k$ ; we want to prove that this is impossible. The subterm $(\lambda z.T)\,U$ , like every subterm of $Q$ , can be found in one of the following three places:

•

Possibly $(\lambda z.T)\,U$ is a subterm of $S$ . This is impossible, because by assumption $P$ (and thus also its subterm $S$ ) contains no $\beta$ -redexes of orders higher than $k$ .

•

Possibly $(\lambda z.T)\,U=V[S/x]$ for a subterm $V$ of $R$ , where $V\neq x$ . Then $V$ has to be an application $V=W\,X$ , with $\mathit{ord}(W)=\mathit{ord}(\lambda z.T)>k$ . We have $W\neq x$ , because $\mathit{ord}(x)<\mathit{ord}(\lambda x.R)=k$ . Thus $W$ is a $\lambda$ -abstraction, with $V$ being itself a $\beta$ -redex of order higher than $k$ , which is again impossible by assumption.

•

Otherwise $(\lambda z.T)\,U=V[R[S/x]/y]$ for a subterm $V$ of $P^{\prime}$ , where $V\neq y$ . Again, $V$ has to be an application $V=W\,X$ , with $\mathit{ord}(W)=\mathit{ord}(\lambda z.T)>k$ . We have $W\neq y$ , because $\mathit{ord}(y)=\mathit{ord}(R)\leq\mathit{ord}(\lambda x.R)=k$ . Thus $W$ is a $\lambda$ -abstraction, with $V$ being itself a $\beta$ -redex of order higher than $k$ ; this is impossible, since $V[(\lambda x.R)\,S/y]$ is a subterm of $P$ and also a $\beta$ -redex of order higher than $k$ .∎

Lemma 15.

Let $P^{\prime}$ be a finite $\lambda$ -term of complexity at most $m$ . Then there exist $\lambda$ -terms $Q_{m}^{\prime},Q_{m-1}^{\prime},\dots,Q_{0}^{\prime}$ such that $P^{\prime}=Q_{m}^{\prime}$ , and for every $k\in\{1,\dots,m\}$ the term $Q^{\prime}_{k-1}$ can be reached from $Q^{\prime}_{k}$ using only $\beta$ -reductions of order $k$ , and $Q_{0}^{\prime}$ is in $\beta$ -normal form.

Proof.

Take $Q_{m}^{\prime}=P^{\prime}$ . Then, for $k=m,\dots,1$ , consecutively, out of $Q_{k}^{\prime}$ we perform $\beta$ -reductions of order $k$ as long as possible, and as $Q_{k-1}^{\prime}$ we take the resulting $\lambda$ -term, from which no more $\beta$ -reductions of order $k$ are possible. Every such sequence of $\beta$ -reductions finally ends, because $P^{\prime}$ is finite. For every $k\in\{1,\dots,m\}$ Lemma 14 ensures that in $Q_{0}^{\prime}$ there are no $\beta$ -redexes of order $k$ , because all $\beta$ -reductions between $Q_{k-1}^{\prime}$ and $Q_{0}^{\prime}$ were of orders smaller than $k$ . Moreover, because the complexity of a $\lambda$ -term cannot grow during $\beta$ -reductions, the complexity of $Q^{\prime}_{0}$ is at most $m$ , and hence it has no $\beta$ -redexes of order higher than $m$ . Thus $Q_{0}^{\prime}$ is in $\beta$ -normal form. ∎

Proof of Lemma 3.

Recall that in this lemma we are given a closed $\lambda$ -term $P$ of sort $o$ and complexity $m$ , and some $t\in\mathcal{L}(P)$ , and our goal is to exhibit $\lambda$ -terms $Q_{m},Q_{m-1},\dots,Q_{0}$ such that $P=Q_{m}$ , and for every $k\in\{1,\dots,m\}$ the term $Q_{k-1}$ can be reached from $Q_{k}$ using only $\beta$ -reductions of order $k$ , and we can derive $\varepsilon\vdash Q_{0}:\hat{\rho}_{0}\triangleright|t|$ .

We first apply Lemma 9 to $P$ and $t$ , obtaining a number $n$ and a $\lambda$ -term $Q$ such that $P\to_{\beta}^{*}Q$ . Then, we apply to them Corollary 13, obtaining a representation $P=P^{\prime}[S_{1}/x_{1},\dots,S_{s}/x_{s}]$ for finite $P^{\prime}$ , and a $\lambda$ -term $Q^{\prime}$ . Notice that the complexity of $P^{\prime}$ cannot be higher than that of $P$ , as for every subterm $R$ of $P^{\prime}$ , the term $R[S_{1}/x_{1},\dots,S_{s}/x_{s}]$ is a subterm of $P$ , and has the same order as $R$ . We then apply Lemma 15 to $P^{\prime}$ , obtaining $\lambda$ -terms $Q_{m}^{\prime},Q_{m-1}^{\prime},\dots,Q_{0}^{\prime}$ . As $Q_{k}$ we take $Q_{k}^{\prime}[S_{1}/x_{1},\dots,S_{s}/x_{s}]$ , for $k\in\{0,\dots,m\}$ . We have $Q_{m}=P$ since $Q_{m}^{\prime}=P^{\prime}$ . For every $k\in\{1,\dots,m\}$ Lemma 15 gives us a sequence of $\beta$ -reduction of order $k$ from $Q^{\prime}_{k}$ to $Q^{\prime}_{k-1}$ . After performing the substitution $[S_{1}/x_{1},\dots,S_{s}/x_{s}]$ to every $\lambda$ -term in this sequence, all $\beta$ -reductions in this sequence remain correct $\beta$ -reductions of order $k$ , and now the sequence leads from $Q_{k}$ to $Q_{k-1}$ .

Finally, by Corollary 13 we have $P^{\prime}\to_{\beta}^{*}Q^{\prime}$ and $Q\approx_{n}Q^{\prime}$ . Since, by Lemma 15, the $\beta$ -normal form of $P^{\prime}$ is $Q_{0}^{\prime}$ , we also have $Q^{\prime}\to_{\beta}^{*}Q_{0}^{\prime}$ . Thus by Lemma 9 (where we take $Q_{0}^{\prime}$ as $Q^{\prime\prime}$ ) we obtain a derivation of $\varepsilon\vdash Q^{\prime}_{0}:\hat{\rho}_{0}\triangleright|t|$ . To every $\lambda$ -term in this derivation we apply the substitution $[S_{1}/x_{1},\dots,S_{s}/x_{s}]$ , obtaining a derivation of $\varepsilon\vdash Q_{0}:\hat{\rho}_{0}\triangleright|t|$ . The derivation remains correct: the problem could only appear in a (Var) rule used for some of the variables $x_{1},\dots,x_{s}$ , but since the type environment of the resulting type judgment is empty, the derivation uses the (Var) rule only for bound variables of $Q^{\prime}_{0}$ , not for $x_{1},\dots,x_{s}$ . ∎

Appendix B Proof of Lemma 4

Lemma 16.

If $\mathit{Comp}_{m}(M;\allowbreak((F_{i},c_{i}))_{i\in I})=(F,c)$ , where $M$ and $F_{i}$ are subsets of $\{0,\dots,m-1\}$ , then it holds that $\mathit{Comp}_{m+1}(M;\allowbreak((G_{i},0))_{i\in I})=(G,0)$ , where $G=F\cup\{m\mid c>0\}$ and $G_{i}=F_{i}\cup\{m\mid c_{i}>0\}$ .

In the above, by $\{m\mid c>0\}$ we mean the set $\{m\}$ when $c>0$ , and $\emptyset$ otherwise.

Proof.

In the definition of the $\mathit{Comp}$ predicate some numbers $f_{n}$ and $f^{\prime}_{n}$ are computed. Denote by $f_{n,m}$ and $f^{\prime}_{n,m}$ the values taken by $f_{n}$ and $f^{\prime}_{n}$ in the above instantiation of the $\mathit{Comp}_{m}$ predicate, and by $f_{n,m+1}$ and $f^{\prime}_{n,m+1}$ their values in the above instantiation of the $\mathit{Comp}_{m+1}$ predicate. Since the sets $F_{i}$ and $G_{i}$ are the same when restricted to numbers smaller than $m$ , we have $f_{n,m}=f_{n,m+1}$ for $n<m$ , and $f^{\prime}_{n,m}=f^{\prime}_{n,m+1}$ for $n\leq m$ . In particular, for $n<m$ the definition of $\mathit{Comp}$ says that $n\in F\Leftrightarrow n\in G$ . We also have $f^{\prime}_{m+1,m+1}=0$ , since $m\not\in M$ ; thus the flag counter computed by the $\mathit{Comp}_{m+1}$ predicate (as the sum of $f^{\prime}_{m+1,m+1}=0$ and of the zeroes given as arguments to $\mathit{Comp}_{m+1}$ ) is actually [math]. Finally, we recall that $c=f^{\prime}_{m,m}+\sum_{i\in I}c_{i}$ ; thus $c>0$ if and only if $f^{\prime}_{m,m}>0$ or $c_{i}>0$ for some $i\in I$ . On the other hand, we have $f_{m,m+1}=f^{\prime}_{m,m+1}+\sum_{i\in I}|G_{i}\cap\{m\}|$ ; thus $f_{m,m+1}>0$ if and only if $f^{\prime}_{m,m+1}>0$ or $m\in G_{i}$ for some $i\in I$ . Since $f^{\prime}_{m,m}=f^{\prime}_{m,m+1}$ and $c_{i}>0\Leftrightarrow m\in G_{i}$ , we obtain $c>0\Leftrightarrow f_{m,m+1}>0$ . The $\mathit{Comp}_{m+1}$ predicate wants to put $m$ to the set $G$ if only if $f_{m,m+1}>0$ (since $m\not\in M$ ), thus if and only if $c>0$ ; this agrees with our definition of $G$ . ∎

Lemma 17.

If we can derive $\Gamma\vdash R:(m,F,M,\tau)\triangleright c$ , then we can also derive $\Gamma\vdash R:(m+1,G,M,\tau)\triangleright 0$ , where $G=F\cup\{m\mid c>0\}$ .

Proof.

Denote $\hat{\tau}=(m,F,M,\tau)$ and $\hat{\tau}^{\prime}=(m+1,G,M,\tau)$ . The proof is by induction on the structure of the derivation of $\Gamma\vdash R:\hat{\tau}\triangleright c$ . We have several cases, depending on the shape of $R$ .

Suppose that $R=\mathsf{br}\,P_{1}\,P_{2}$ . Then the last rule of the derivation is (Br) with premiss $\Gamma\vdash P_{i}:\hat{\tau}\triangleright c$ for some $i\in\{1,2\}$ . The induction assumption gives us a derivation of $\Gamma\vdash P_{i}:\hat{\tau}^{\prime}\triangleright 0$ . Applying back the (Br) rule we derive $\Gamma\vdash R:\hat{\tau}^{\prime}\triangleright 0$ .

Next, suppose that $R=x$ is a variable. Then the (Var) rule requires that $c=0$ , and thus $G=F$ . We just need to change in this rule $m$ to $m+1$ (which is not problematic at all), and we obtain a derivation of $\Gamma\vdash R:\hat{\tau}^{\prime}\triangleright 0$ .

Next, suppose that $R=\lambda x.P$ . Then the derivation ends with the ( $\lambda$ ) rule, whose premiss is $\Gamma^{\prime}[x\mapsto T]\vdash P:(m,F,M_{\lambda},\tau_{\lambda})\triangleright c$ , where $\tau=T{\to}\tau_{\lambda}$ , and $M=M_{\lambda}\setminus\bigcup_{(k,F^{\prime},M^{\prime},\sigma)\in T}M^{\prime}$ , and $\mathit{Split}(\Gamma\mid\Gamma^{\prime})$ , and $\Gamma^{\prime}(x)=\emptyset$ . The induction assumption gives us a derivation of $\Gamma^{\prime}[x\mapsto T]\vdash P:(m+1,G,M_{\lambda},\tau_{\lambda})\triangleright 0$ . We can apply back the ( $\lambda$ ) rule, and derive $\Gamma\vdash R:\hat{\tau}^{\prime}\triangleright 0$ .

Next, suppose that $R=a\,P_{1}\,\dots\,P_{r}$ for $a\neq\mathsf{br}$ . We have $\tau=o$ . Let $\Gamma_{i}\vdash P_{i}:(m,F_{i},M_{i},o)\triangleright c_{i}$ for $i\in\{1,\dots,r\}$ be the premisses of the final (Con) rule. Using the induction assumption we derive $\Gamma_{i}\vdash P_{i}:(m+1,G_{i},M_{i},o)\triangleright 0$ , where $G_{i}=F_{i}\cup\{m\mid c_{i}>0\}$ . We want to apply back the (Con) rule. Some of the conditions of the (Con) rule remain unchanged: $M=M^{\prime}\uplus M_{1}\uplus\dots\uplus M_{r}$ , and $M^{\prime}=\emptyset$ if $r>0$ , and $\mathit{Split}(\Gamma\mid\Gamma_{1},\dots,\Gamma_{r})$ . We also know that $(F,c)=\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},c^{\prime}),(F_{1},c_{1}),\dots,(F_{r},c_{r}))$ , where $F^{\prime}=\emptyset$ and $c^{\prime}=1$ if $m=0$ , and $F^{\prime}=\{0\}$ and $c^{\prime}=0$ if $m>0$ . Notice that $F^{\prime}\cup\{m\mid c^{\prime}>0\}=\{0\}$ . Thus from Lemma 16 we obtain that $\mathit{Comp}_{m+1}(M;\allowbreak(\{0\},0),(G_{1},0),\dots,(G_{r},0))=(G,0)$ , as required.

Finally, suppose that $R=P\,Q$ . Let $\Gamma^{\prime}\vdash P:(m,F^{\prime},M^{\prime},T{\to}\tau)\triangleright c^{\prime}$ and $\Gamma_{i}\vdash Q:(m,F_{i},M_{i},\tau_{i})\triangleright c_{i}$ for each $i\in I$ be the premisses of the final (@) rule, where $T=\{(\mathit{ord}(P),F_{i}{\restriction}_{<\mathit{ord}(P)},M_{i}{\restriction}_{<\mathit{ord}(P)},\tau_{i})\mid i\in I\}$ . Using the induction assumption we derive $\Gamma^{\prime}\vdash P:(m+1,G^{\prime},M^{\prime},T{\to}\tau)\triangleright 0$ , where $G^{\prime}=F^{\prime}\cup\{m\mid c^{\prime}>0\}$ , and $\Gamma_{i}\vdash Q:(m+1,G_{i},M_{i},\tau_{i})\triangleright c_{i}$ , where $G_{i}=F_{i}\cup\{m\mid c_{i}>0\}$ , for each $i\in I$ . We want to apply back the (@) rule. The conditions $M=M^{\prime}\uplus\biguplus{}_{i\in I}M_{i}$ and $\mathit{Split}(\Gamma\mid\Gamma^{\prime},(\Gamma_{i})_{i\in I})$ required by the (@) rule remain unchanged, while $\mathit{ord}(P)\leq m+1$ holds because previously we had $\mathit{ord}(P)\leq m$ . Also due to $\mathit{ord}(P)\leq m$ we have $G_{i}{\restriction}_{<\mathit{ord}(P)}=F_{i}{\restriction}_{<\mathit{ord}(P)}$ , and thus the type needed now for $P$ , that is $\{(\mathit{ord}(P),G_{i}{\restriction}_{<\mathit{ord}(P)},M_{i}{\restriction}_{<\mathit{ord}(P)},\tau_{i})\mid i\in I\}{\to}o$ , is actually equal $T{\to}o$ , The condition $\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},c^{\prime}),((F_{i}{\restriction}_{\geq\mathit{ord}(P)},c_{i}))_{i\in I})=(F,c)$ implies that $\mathit{Comp}_{m+1}(M;\allowbreak(G^{\prime},0),((G_{i}{\restriction}_{\geq\mathit{ord}(P)},0))_{i\in I})=(G,0)$ by Lemma 16 (notice that $G_{i}{\restriction}_{\geq\mathit{ord}(P)}=F_{i}{\restriction}_{\geq\mathit{ord}(P)}\cup\{m\mid c_{i}>0\}$ because $m\geq\mathit{ord}(P)$ ). ∎

Lemma 18.

Suppose that $\mathit{Comp}_{m}(M;\allowbreak((F_{i},c_{i}))_{i\in I})=(F,c)$ , where $M$ and $F_{i}$ are subsets of $\{0,\dots,m-1\}$ . If $s\in I$ is such that $c_{i}\leq c_{s}$ for all $i\in I$ , and $d_{s}\geq\log_{2}c_{s}$ , and $G_{i}=F_{i}\cup\{m\mid c_{i}>0\}$ for $i\in I$ , then $\mathit{Comp}_{m+1}(M\cup\{m\};\allowbreak(F_{s},d_{s}),((G_{i},0))_{i\in I\setminus\{s\}})=(F,d)$ for some $d$ such that $d\geq\log_{2}c$ .

Proof.

As in the proof of Lemma 16, denote by $f_{n,m}$ and $f^{\prime}_{n,m}$ the values taken by the variables $f_{n}$ and $f^{\prime}_{n}$ in the above instantiation of the $\mathit{Comp}_{m}$ predicate, and by $f_{n,m+1}$ and $f^{\prime}_{n,m+1}$ their values in the above instantiation of the $\mathit{Comp}_{m+1}$ predicate. As previously we have $f_{n,m}=f_{n,m+1}$ for $n<m$ , and $f^{\prime}_{n,m}=f^{\prime}_{n,m+1}$ for $n\leq m$ , and thus $\mathit{Comp}_{m+1}$ correctly says which numbers smaller than $m$ should belong to $F$ . Moreover, it says that $m\not\in F$ , since $m\in M\cup\{m\}$ , which is also correct. It remains to check that the flag counter $d$ computed by $\mathit{Comp}_{m+1}$ actually satisfies $d\geq\log_{2}c$ . Because $m\in M\cup\{m\}$ we have $f^{\prime}_{m+1,m+1}=f_{m,m+1}$ , and thus

[TABLE]

On the other hand, we have

[TABLE]

In the degenerate case of $c_{s}=0$ we have that $c_{i}=0$ for all $i\in I$ , and by the above $d=c$ , thus even more $d\geq\log_{2}c$ .

Next, suppose that $c_{s}>0$ . Denote $k=f^{\prime}_{m,m}+|\{i\in I\mid c_{i}>0\}|$ . Then (1) gives $d=k-1+d_{s}$ , while continuing (2) we have $c\leq f^{\prime}_{m,m}\cdot c_{s}+|\{i\in I\mid c_{i}>0\}|\cdot c_{s}=k\cdot c_{s}$ . Altogether we obtain as required:

[TABLE]

Corollary 19.

Suppose that $\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},c^{\prime}))=(F,c)$ , where $M\subseteq\{0,\dots,m-1\}$ , and $F^{\prime}=\emptyset\land c^{\prime}=1$ if $m=0$ , and $F^{\prime}=\{0\}\land c^{\prime}=0$ if $m>0$ . Then $\mathit{Comp}_{m+1}(M\cup\{m\};\allowbreak(\{0\},0))=(F,d)$ for some $d$ such that $d\geq\log_{2}c$ .

Proof.

For $m>0$ this is a direct consequence of Lemma 18 (where the assumption $d_{s}\geq\log_{2}c_{s}$ is instantiated as $0\geq\log_{2}0$ ). For $m=0$ we have $M=\emptyset$ , and $\mathit{Comp}_{0}(M;\allowbreak(\emptyset,1))=(\emptyset,1)$ , while $\mathit{Comp}_{1}(M\cup\{0\};\allowbreak(\{0\},0))=(\emptyset,1)$ ; this is fine, since $d=1\geq\log_{2}1=\log_{2}c$ (for $m=0$ we could not apply Lemma 18, because the set $F^{\prime}=\emptyset$ in $\mathit{Comp}_{m}$ changes into $\{0\}$ in $\mathit{Comp}_{m+1}$ ). ∎

Corollary 20.

Suppose that $\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},c^{\prime}),((F_{i},c_{i}))_{i\in I})=(F,c)$ , where $M$ and $F_{i}$ are subsets of $\{0,\dots,m-1\}$ , and $F^{\prime}=\emptyset\land c^{\prime}=1$ if $m=0$ , and $F^{\prime}=\{0\}\land c^{\prime}=0$ if $m>0$ . If $s\in I$ is such that $c_{i}\leq c_{s}$ for all $i\in I$ , and $d_{s}\geq\log_{2}c_{s}$ , and $G_{i}=F_{i}\cup\{m\mid c_{i}>0\}$ for $i\in I$ , then $\mathit{Comp}_{m+1}(M\cup\{m\};\allowbreak(\{0\},0),(F_{s},d_{s}),((G_{i},0))_{i\in I\setminus\{s\}})=(F,d)$ for some $d$ such that $d\geq\log_{2}c$ .

Proof.

For $m>0$ this is a direct consequence of Lemma 18 (we notice that $F^{\prime}\cup\{m\mid c^{\prime}>0\}=\{0\}$ and $c^{\prime}=0\leq c_{s}$ ). If $m=0$ and $c_{s}\geq 1$ we can use Lemma 18 as well. Suppose that $m=0$ and $c_{s}=0$ . Then $M=\emptyset$ , and $F_{i}=G_{i}=\emptyset\land c_{i}=0$ for all $i\in I$ , thus $\mathit{Comp}_{0}(M;\allowbreak(\emptyset,1),((F_{i},c_{i}))_{i\in I})=(\emptyset,1)$ , while $\mathit{Comp}_{1}(M\cup\{0\};\allowbreak(\{0\},0),(F_{s},d_{s}),((G_{i},0))_{i\in I\setminus\{s\}})=(\emptyset,1+d_{s})$ ; this is fine, since $d=1+d_{s}\geq\log_{2}1=\log_{2}c$ . ∎

Lemma 21.

Suppose that we can derive $\Gamma\vdash R:(m,F,M,\tau)\triangleright c$ , where $\mathit{ord}(R)\leq m$ and that every full type assigned by $\Gamma$ to a variable is of order at most $m$ . Then we can also derive $\Gamma\vdash R:(m+1,F,M\cup\{m\},\tau)\triangleright d$ for some $d$ such that $d\geq\log_{2}c$ .

Proof.

Denote $\hat{\tau}=(m,F,M,\tau)$ and $\hat{\tau}^{\prime}=(m+1,F,M\cup\{m\},\tau)$ . The proof is by induction on the structure of the derivation of $\Gamma\vdash R:\hat{\tau}\triangleright c$ . We have several cases, depending on the shape of $R$ .

The case of $R=\mathsf{br}\,P_{1}\,P_{2}$ follows immediately from the induction assumption, as in Lemma 17.

Suppose that $R=x$ is a variable. The (Var) rule used in the derivation ensures that $c=0$ and that $\Gamma(x)$ contains a full type $(k,F,M^{\prime},\tau)$ with $M{\restriction}_{<k}=M^{\prime}$ . By assumptions of the lemma, $k\leq m$ . Then $(M\cup\{m\}){\restriction}_{<k}=M^{\prime}$ , and hence the (Var) rule can equally well derive $\Gamma\vdash R:\hat{\tau}^{\prime}\triangleright 0$ ; we have $0\geq\log_{2}0$ .

Next, suppose that $R=\lambda x.P$ . Then the derivation ends with the ( $\lambda$ ) rule, whose premiss is $\Gamma^{\prime}[x\mapsto T]\vdash P:(m,F,M_{\lambda},\tau_{\lambda})\triangleright c$ , where $\tau=T{\to}\tau_{\lambda}$ , and $M=M_{\lambda}\setminus\bigcup_{(k,F^{\prime},M^{\prime},\sigma)\in T}M^{\prime}$ , and $\mathit{Split}(\Gamma\mid\Gamma^{\prime})$ , and $\Gamma^{\prime}(x)=\emptyset$ . Because $\tau$ is a type, the definition of a type ensures that all full types in $T$ are of order $\mathit{ord}(R)\leq m$ . Additionally $\mathit{ord}(P)\leq\mathit{ord}(R)\leq m$ , so assumptions of the lemma are satisfied for the premiss; the induction assumption gives us a derivation of $\Gamma^{\prime}[x\mapsto T]\vdash P:(m+1,F,M_{\lambda}\cup\{m\},\tau_{\lambda})\triangleright d$ , where $d\geq\log_{2}c$ . Again because all full types in $T$ are of order $\mathit{ord}(R)\leq m$ (and hence $M^{\prime}\subseteq\{0,\dots,m-1\}$ for all $(k,F^{\prime},M^{\prime},\sigma)\in T$ ) we have $M\cup\{m\}=M_{\lambda}\cup\{m\}\setminus\bigcup_{(k,F^{\prime},M^{\prime},\sigma)\in T}M^{\prime}$ . Thus after applying back the ( $\lambda$ ) rule we obtain a derivation of $\Gamma\vdash R:\hat{\tau}^{\prime}\triangleright d$ .

Next, suppose that $R=a$ (where $a$ is a symbol of rank [math]). We have $\tau=o$ . The conditions of the (Con) rule are $\mathit{Split}(\Gamma\mid\varepsilon)$ and $\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},c^{\prime}))=(F,c)$ , where $F^{\prime}=\emptyset$ and $c^{\prime}=1$ if $m=0$ , and $F^{\prime}=\{0\}$ and $c^{\prime}=0$ if $m>0$ . By Corollary 19, $\mathit{Comp}_{m+1}(M\cup\{m\};\allowbreak(\{0\},0))=(F,d)$ for some $d$ such that $d\geq\log_{2}c$ . We can use the (Con) rule to derive $\Gamma\vdash R:\hat{\tau}^{\prime}\triangleright d$ .

Next, suppose that $R=a\,P_{1}\,\dots\,P_{r}$ for $a\neq\mathsf{br}$ and $r>0$ . We have $\tau=o$ . Denote $I=\{1,\dots,r\}$ . Let $\Gamma_{i}\vdash P_{i}:(m,F_{i},M_{i},o)\triangleright c_{i}$ for $i\in I$ be the premisses of the final (Con) rule, and let $s\in I$ be such that $c_{s}\geq c_{i}$ for all $i\in I$ . Using the induction assumption we derive $\Gamma_{s}\vdash P_{s}:(m+1,F_{s},M_{s}\cup\{m\},o)\triangleright d_{s}$ for some $d_{s}$ such that $d_{s}\geq\log_{2}c_{s}$ , while for $i\in I\setminus\{s\}$ we use Lemma 17 to derive $\Gamma_{i}\vdash P_{i}:(m+1,G_{i},M_{i},o)\triangleright 0$ , where $G_{i}=F_{i}\cup\{m\mid c_{i}>0\}$ . We want to apply back the (Con) rule. The condition $M=M^{\prime}\uplus M_{1}\uplus\dots\uplus M_{r}$ updates accordingly: $m$ is added to $M$ and to $M_{s}$ . The conditions $(r>0)\Rightarrow(M^{\prime}=\emptyset)$ and $\mathit{Split}(\Gamma\mid\Gamma_{1},\dots,\Gamma_{r})$ remain unchanged. We know that $\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},c^{\prime}),((F_{i},c_{i}))_{i\in I})=(F,c)$ , where $F^{\prime}=\emptyset$ and $c^{\prime}=1$ if $m=0$ , and $F^{\prime}=\{0\}$ and $c^{\prime}=0$ if $m>0$ ; by Corollary 20 this implies $\mathit{Comp}_{m+1}(M\cup\{m\};\allowbreak(\{0\},0),(F_{s},d_{s}),((G_{i},0))_{i\in I\setminus\{s\}})=(F,d)$ for some $d$ such that $d\geq\log_{2}c$ , as required.

Finally, suppose that $R=P\,Q$ . Let $\Gamma^{\prime}\vdash P:(m,F^{\prime},M^{\prime},T{\to}\tau)\triangleright c^{\prime}$ and $\Gamma_{i}\vdash Q:(m,F_{i},M_{i},\tau_{i})\triangleright c_{i}$ for each $i\in I$ be the premisses of the final (@) rule, where $T=\{(\mathit{ord}(P),F_{i}{\restriction}_{<\mathit{ord}(P)},M_{i}{\restriction}_{<\mathit{ord}(P)},\tau_{i})\mid i\in I\}$ . A condition of the (@) rule implies that $\mathit{ord}(Q)<\mathit{ord}(P)\leq m$ , so we can use the induction assumption for these premisses. We know that $\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},c^{\prime}),((F_{i}{\restriction}_{\geq\mathit{ord}(P)},c_{i}))_{i\in I})=(F,c)$ . Denote $G^{\prime}=F^{\prime}\cup\{m\mid c^{\prime}>0\}$ and $G_{i}=F_{i}\cup\{m\mid c_{i}>0\}$ for $i\in I$ . Notice that $G_{i}{\restriction}_{\geq\mathit{ord}(P)}=F_{i}{\restriction}_{\geq\mathit{ord}(P)}\cup\{m\mid c_{i}>0\}$ . We have two subcases. If $c^{\prime}\geq c_{i}$ for all $i\in I$ , then using the induction assumption we derive $\Gamma^{\prime}\vdash P:(m+1,F^{\prime},M^{\prime}\cup\{m\},T{\to}\tau)\triangleright d^{\prime}$ for some $d^{\prime}$ such that $d^{\prime}\geq\log_{2}c^{\prime}$ , and using Lemma 17 we derive $\Gamma_{i}\vdash Q:(m+1,G_{i},M_{i},\tau_{i})\triangleright 0$ for each $i\in I$ . Lemma 18 then implies that $\mathit{Comp}_{m+1}(M\cup\{m\};\allowbreak(F^{\prime},d^{\prime}),((G_{i}{\restriction}_{\geq\mathit{ord}(P)},0))_{i\in I})=(F,d)$ for some $d$ such that $d\geq\log_{2}c$ . Otherwise, we choose $s\in I$ such that $c_{s}\geq c_{i}$ for all $i\in I$ (and $c_{s}\geq c^{\prime}$ ); using the induction assumption we derive $\Gamma_{s}\vdash Q:(m+1,F_{s},M_{s}\cup\{m\},\tau_{s})\triangleright d_{s}$ for some $d_{s}$ such that $d_{s}\geq\log_{2}c_{s}$ , and using Lemma 17 we derive $\Gamma^{\prime}\vdash P:(m+1,G^{\prime},M^{\prime},T{\to}\tau)\triangleright 0$ and $\Gamma_{i}\vdash Q:(m+1,G_{i},M_{i},\tau_{i})\triangleright 0$ for each $i\in I\setminus\{s\}$ . Lemma 18 then implies that $\mathit{Comp}_{m+1}(M\cup\{m\};\allowbreak(F_{s}{\restriction}_{\geq\mathit{ord}(P)},d_{s}),(G^{\prime},0),((G_{i}{\restriction}_{\geq\mathit{ord}(P)},0))_{i\in I\setminus\{s\}})=(F,d)$ for some $d$ such that $d\geq\log_{2}c$ . In both cases we apply back the (@) rule to the obtained type judgments. The condition $M=M^{\prime}\uplus\biguplus{}_{i\in I}M_{i}$ is updated accordingly: $m$ is added to $M$ and either to $M^{\prime}$ or to $M_{s}$ . The condition $\mathit{Split}(\Gamma\mid\Gamma^{\prime},(\Gamma_{i})_{i\in I})$ remains unchanged, and $\mathit{ord}(P)\leq m+1$ holds since we even have $\mathit{ord}(P)\leq m$ . We also have that $G_{i}{\restriction}_{<\mathit{ord}(P)}=F_{i}{\restriction}_{<\mathit{ord}(P)}$ and $(M_{s}\cup\{m\}){\restriction}_{<\mathit{ord}(P)}=M_{s}$ , so the type required in a premiss for $P$ is indeed $T{\to}o$ . ∎

Lemma 4 says that if we can derive $\varepsilon\vdash R:(m,\emptyset,\{0,\dots,m-1\},o)\triangleright c$ , then we can also derive $\varepsilon\vdash R:(m+1,\emptyset,\{0,\dots,m\},o)\triangleright d$ for some $d$ such that $d\geq\log_{2}c$ . This is just a special case of Lemma 21.

Appendix C Proof of Lemma 5

Below, by $\mathsf{Mk}(\hat{\tau})$ we denote the set of marker orders of the full type $\hat{\tau}$ , i.e., $\mathsf{Mk}(\hat{\tau})=M$ if $\hat{\tau}=(m,F,M,\tau)$ . We extend this notation to sets of full types: $\mathsf{Mk}(T)=\bigcup_{\hat{\tau}\in T}\mathsf{Mk}(\hat{\tau})$ , and to type environments: $\mathsf{Mk}(\Gamma)=\bigcup_{x}\mathsf{Mk}(\Gamma(x))$ , where $x$ ranges over all variables.

Lemma 22.

Suppose that we can derive $\Gamma\vdash R:\hat{\tau}\triangleright c$ , and $x$ is not free in $R$ . Then for $\Sigma=\Gamma[x\mapsto\emptyset]$ we can also derive $\Sigma\vdash R:\hat{\tau}\triangleright c$ , and $\mathit{Split}(\Gamma\mid\Sigma)$ holds.

Proof.

Because $x$ is not free in $R$ , all full types assigned to $x$ by $\Gamma$ are discarded somewhere in the derivation of $\Gamma\vdash R:\hat{\tau}\triangleright c$ . On the one hand, this means that the set of marker orders in all these full types is empty, and thus $\mathit{Split}(\Gamma\mid\Sigma)$ holds. On the other hand, we can remove these full types from type environments in the derivation, which results in a derivation of $\Sigma\vdash R:\hat{\tau}\triangleright c$ . ∎

Lemma 23.

Suppose that we can derive $\Gamma\vdash P:\hat{\tau}\triangleright c$ . If $\mathit{Split}(\Gamma^{\prime}\mid\Gamma)$ holds, then we can also derive $\Gamma^{\prime}\vdash P:\hat{\tau}\triangleright c$ .

Proof.

Induction on the structure of a fixed derivation of $\Gamma\vdash P:\hat{\tau}\triangleright c$ . If the last rule is (Br), we change $\Gamma$ to $\Gamma^{\prime}$ in the premiss of the rule using the induction assumption, and we obtain a derivation of $\Gamma^{\prime}\vdash P:\hat{\tau}\triangleright c$ . In every other rule we can change $\Gamma$ to $\Gamma^{\prime}$ only in the conclusion. ∎

Lemma 24.

Suppose that we can derive $\Gamma\vdash R:\hat{\tau}\triangleright c$ . Then $\mathsf{Mk}(\Gamma)\subseteq\mathsf{Mk}(\hat{\tau})$ , and $\mathsf{Mk}(\Gamma(x))\cap\mathsf{Mk}(\Gamma(y))=\emptyset$ for all variables $x,y$ with $x\neq y$ .

Proof.

Fix some derivation of $\Gamma\vdash R:\hat{\tau}\triangleright c$ ; the proof is by induction on the structure of this derivation. We analyze the shape of $R$ .

Suppose first that $R=x$ . The (Var) rule says that $\mathit{Split}(\Gamma,\varepsilon[x\mapsto T])$ holds for $T$ such that $\mathsf{Mk}(T)\subseteq\mathsf{Mk}(\hat{\tau})$ . Thus $\mathsf{Mk}(\Gamma(x))\subseteq\mathsf{Mk}(\hat{\tau})$ and $\mathsf{Mk}(\Gamma(y))=\emptyset$ for all variables $y$ with $y\neq x$ .

In the case when $R=\mathsf{br}\,P_{1}\,P_{2}$ the thesis follows immediately from the induction assumption applied to the premiss of the final (Br) rule.

Next, suppose that $R=\lambda z.P$ . Let $\Gamma^{\prime}[z\mapsto T]\vdash P:\hat{\tau}^{\prime}\triangleright c$ with $\Gamma^{\prime}(z)=\emptyset$ be the premiss of the final ( $\lambda$ ) rule. By the conditions of the rule we have $\mathsf{Mk}(\hat{\tau})=\mathsf{Mk}(\hat{\tau}^{\prime})\setminus\mathsf{Mk}(T)$ and $\mathit{Split}(\Gamma\mid\Gamma^{\prime})$ . The latter condition implies that $\mathsf{Mk}(\Gamma(z))=\emptyset$ . For every variable $x$ other than $z$ the induction assumption ensures that $\mathsf{Mk}(\Gamma^{\prime}[z\mapsto T](x))\subseteq\mathsf{Mk}(\hat{\tau}^{\prime})$ and that $\mathsf{Mk}(\Gamma^{\prime}[z\mapsto T](x))\cap\mathsf{Mk}(\Gamma^{\prime}[z\mapsto T](z))=\emptyset$ . Since $\mathsf{Mk}(\Gamma(x))=\mathsf{Mk}(\Gamma^{\prime}[z\mapsto T](x))$ and $\mathsf{Mk}(\Gamma^{\prime}[z\mapsto T](z))=\mathsf{Mk}(T)$ we obtain $\mathsf{Mk}(\Gamma(x))\subseteq\mathsf{Mk}(\hat{\tau}^{\prime})\setminus\mathsf{Mk}(T)=\mathsf{Mk}(\hat{\tau})$ . For any two variables $x,y$ with $z\neq x\neq y\neq z$ we have $\mathsf{Mk}(\Gamma^{\prime}[z\mapsto T](x))\cap\mathsf{Mk}(\Gamma^{\prime}[z\mapsto T](y))=\emptyset$ by the induction assumption, and thus $\mathsf{Mk}(\Gamma(x))\cap\mathsf{Mk}(\Gamma(y))=\emptyset$ .

Next, suppose that $R=a\,P_{1}\,\dots\,P_{r}$ . Let $\Gamma_{i}\vdash P_{i}:\hat{\tau}_{i}\triangleright c_{i}$ for $i\in\{1,\dots,r\}$ be the premisses of the final (Con) rule. Consider some variable $x$ , and some $k\in\mathsf{Mk}(\Gamma(x))$ . Because of the condition $\mathit{Split}(\Gamma\mid\Gamma_{1},\dots,\Gamma_{r})$ of the (Con) rule we have $k\in\mathsf{Mk}(\Gamma_{i}(x))$ for some $i\in\{1,\dots,r\}$ ; then $k\in\mathsf{Mk}(\hat{\tau}_{i})$ by the induction assumption $\mathsf{Mk}(\Gamma_{i}(x))\subseteq\mathsf{Mk}(\hat{\tau}_{i})$ , and thus $k\in\mathsf{Mk}(\hat{\tau})$ by the condition $\mathsf{Mk}(\hat{\tau})=\mathsf{Mk}(\hat{\tau}_{1})\uplus\dots\uplus\mathsf{Mk}(\hat{\tau}_{r})$ of the (Con) rule. Suppose that we also have $k\in\mathsf{Mk}(\Gamma(y))$ for some variable $y$ other than $x$ . Then $k\in\mathsf{Mk}(\Gamma_{j}(y))\subseteq\mathsf{Mk}(\hat{\tau}_{j})$ for some $j\in\{1,\dots,r\}$ ; we cannot have $j=i$ by the induction assumption ( $\mathsf{Mk}(\Gamma_{i}(x))$ and $\mathsf{Mk}(\Gamma_{i}(y))$ are disjoint), and we cannot have $j\neq i$ because $\mathsf{Mk}(\hat{\tau}_{i})$ and $\mathsf{Mk}(\hat{\tau}_{j})$ are disjoint.

The case when $R=P\,Q$ is completely analogous to the previous one. ∎

Lemma 25.

Suppose that we can derive $\Gamma\vdash R[S/x]:\hat{\tau}\triangleright c$ . Then, for some finite set $I$ , we can derive $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ for every $i\in I$ , and $\Lambda^{x}\vdash R:\hat{\tau}\triangleright e$ , where $\Lambda^{x}=\Lambda[x\mapsto\{\hat{\sigma}_{i}\mid i\in I\}]$ with $\Lambda(x)=\emptyset$ , and $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ holds, and $c=e+\sum_{i\in I}d_{i}$ , and $\mathsf{Mk}(\hat{\sigma}_{i})\cap\mathsf{Mk}(\hat{\sigma}_{j})=\emptyset$ for $i,j\in I$ if $i\neq j$ .

Proof.

Fix some derivation of $\Gamma\vdash R[S/x]:\hat{\tau}\triangleright c$ ; the proof is by induction on the structure of this derivation.

One possibility is that $x$ is not free in $R$ . Then we can take $I=\emptyset$ , and $\Lambda^{x}=\Lambda=\Gamma[x\mapsto\emptyset]$ , and $e=c$ . Because $x$ is not free in $R=R[S/x]$ , by Lemma 22 applied to the original derivation, we know that $\Lambda^{x}\vdash R:\hat{\tau}\triangleright e$ can be derived, and that $\mathit{Split}(\Gamma\mid\Lambda)$ holds. Because $I=\emptyset$ , the condition concerning disjointness of $\mathsf{Mk}(\hat{\sigma}_{i})$ becomes trivial.

In the sequel we assume that $x$ is free in $R$ . We analyze the shape of $R$ .

Suppose first that $R=x$ . Then we take $I=\{1\}$ , and $(\Sigma_{1},\hat{\sigma}_{1},d_{1})=(\Gamma,\hat{\tau},c)$ , and $\Lambda=\varepsilon$ , and $e=0$ . Obviously $\Lambda(x)=\emptyset$ , and $\mathit{Split}(\Gamma\mid\Lambda,\Sigma_{1})$ holds, and $c=e+d_{1}$ . We can derive $\Sigma_{1}\vdash S:\hat{\sigma}_{1}\triangleright d_{1}$ by assumption, and $\Lambda^{x}\vdash R:\hat{\tau}\triangleright e$ using the (Var) rule, where $\Lambda^{x}=\Lambda[x\mapsto\{\hat{\sigma}_{1}\}]$ .

Next, suppose that $R=\mathsf{br}\,P_{1}\,P_{2}$ . Then our derivation ends with the (Br) rule, whose premiss is $\Gamma\vdash P_{k}[S/x]:\hat{\tau}\triangleright c$ , for some $k\in\{1,2\}$ . The induction assumption applied to this premiss gives us a derivation of $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ for every $i\in I$ , and of $\Lambda^{x}\vdash P_{k}:\hat{\tau}\triangleright e$ , where appropriate conditions hold. By applying back the (Br) rule to the latter type judgment, we obtain a derivation of $\Lambda^{x}\vdash R:\hat{\tau}\triangleright e$ as required.

Next, suppose that $R=\lambda y.P$ . We have $y\neq x$ , and, as always during a substitution, we assume (by performing $\alpha$ -conversion) that $y$ is not free in $S$ . The original derivation ends with the ( $\lambda$ ) rule, whose premiss is $\Gamma^{\prime}[y\mapsto T]\vdash P[S/x]:\hat{\tau}^{\prime}\triangleright c$ with $\Gamma^{\prime}(y)=\emptyset$ . We apply the induction assumption to this premiss, and we obtain a derivation of $\Sigma_{i}[y\mapsto T_{i}]\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ for every $i\in I$ , and of $\Lambda^{x}[y\mapsto T^{\prime}]\vdash P:\hat{\tau}^{\prime}\triangleright e$ , where $\Lambda^{x}=\Lambda[x\mapsto\{\hat{\sigma}_{i}\mid i\in I\}]$ with $\Lambda(x)=\Lambda(y)=\Sigma_{i}(y)=\emptyset$ , and $\mathit{Split}(\Gamma^{\prime}[y\mapsto T]\mid\Lambda[y\mapsto T^{\prime}],(\Sigma_{i}[y\mapsto T_{i}])_{i\in I})$ holds, and $c=e+\sum_{i\in I}d_{i}$ , and $\mathsf{Mk}(\hat{\sigma}_{i})\cap\mathsf{Mk}(\hat{\sigma}_{j})=\emptyset$ for $i,j\in I$ if $i\neq j$ . Together with $\mathit{Split}(\Gamma\mid\Gamma^{\prime})$ the above implies that $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ holds. Because $y$ is not free in $S$ , Lemma 22 implies that for every $i\in I$ we can derive $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ (instead of $\Sigma_{i}[y\mapsto T_{i}]\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ ) and that $\mathit{Split}(\Sigma_{i}[y\mapsto T_{i}]\mid\Sigma_{i})$ holds. Thus $\mathsf{Mk}(T_{i})=\emptyset$ for all $i\in I$ , and hence $\mathsf{Mk}(T\setminus T^{\prime})=\emptyset$ ; simultaneously $T^{\prime}\subseteq T$ , which implies that $\mathit{Split}(\Lambda^{x}[y\mapsto T]\mid\Lambda^{x}[y\mapsto T^{\prime}])$ holds. In consequence, by Lemma 23 applied to $\Lambda^{x}[y\mapsto T^{\prime}]\vdash P:\hat{\tau}^{\prime}\triangleright e$ we can derive $\Lambda^{x}[y\mapsto T]\vdash P:\hat{\tau}^{\prime}\triangleright e$ . To the latter type judgment we apply again the ( $\lambda$ ) rule, which gives $\Lambda^{x}\vdash R:\hat{\tau}\triangleright e$ .

Another possibility is that $R=a\,P_{1}\,\dots\,P_{r}$ . Then the original derivation ends with the (Con) rule, whose premisses are $\Gamma_{j}\vdash P_{j}[S/x]:\hat{\tau}_{j}\triangleright c_{j}$ for $j\in\{1,\dots,r\}$ . We apply the induction assumption to these premisses. Assuming w.l.o.g. that the resulting sets $I_{j}$ are disjoint, and taking $I=\bigcup_{j=1}^{r}I_{j}$ , we obtain a derivation of $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ for every $i\in I$ , and of $\Lambda^{x}_{j}\vdash P_{j}:\hat{\tau}_{j}\triangleright e_{j}$ for every $j\in\{1,\dots,r\}$ , where, for every $j\in\{1,\dots,r\}$ , we have $\Lambda^{x}_{j}=\Lambda_{j}[x\mapsto\{\hat{\sigma}_{i}\mid i\in I_{j}\}]$ with $\Lambda_{j}(x)=\emptyset$ , and $\mathit{Split}(\Gamma_{j}\mid\Lambda_{j},(\Sigma_{i})_{i\in I_{j}})$ holds, and $c_{j}=e_{j}+\sum_{i\in I_{j}}d_{i}$ , and $\mathsf{Mk}(\hat{\sigma}_{i})\cap\mathsf{Mk}(\hat{\sigma}_{i^{\prime}})=\emptyset$ for $i,i^{\prime}\in I_{j}$ if $i\neq i^{\prime}$ . By Lemma 24 we have $\mathsf{Mk}(\hat{\sigma}_{i})\subseteq\mathsf{Mk}(\hat{\tau}_{j})$ for $i\in I_{j}$ , $j\in J$ . Since $\mathsf{Mk}(\hat{\tau}_{j})\cap\mathsf{Mk}(\hat{\tau}_{j^{\prime}})=\emptyset$ for $j,j^{\prime}\in J$ with $j\neq j^{\prime}$ by a side condition of the (Con) rule, this implies that $\mathsf{Mk}(\hat{\sigma}_{i})\cap\mathsf{Mk}(\hat{\sigma}_{i^{\prime}})=\emptyset$ for all $i,i^{\prime}\in I$ with $i\neq i^{\prime}$ . Recalling the side condition $\mathit{Split}(\Gamma\mid(\Gamma_{j})_{j\in\{1,\dots,r\}})$ of the (Con) rule, we observe that $\mathit{Split}(\Gamma\mid(\Lambda_{j})_{j\in\{1,\dots,r\}},(\Sigma_{i})_{i\in I})$ holds. Define $\Lambda$ by taking $\Lambda(z)=\bigcup_{j=1}^{r}\Lambda_{j}(z)$ for every variable $z$ . We then have $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ and $\mathit{Split}(\Lambda\mid(\Lambda_{j})_{j\in\{1,\dots,r\}})$ , as well as $\mathit{Split}(\Lambda^{x}\mid(\Lambda^{x}_{j})_{j\in\{1,\dots,r\}})$ . Another side condition of the (Con) rule says that $\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},c^{\prime}),(F_{1},c_{1}),\dots,(F_{r},c_{r}))=(F,c)$ for appropriate arguments $M,F,F^{\prime},c^{\prime},F_{j}$ . Taking $e=c+\sum_{j=1}^{r}(e_{j}-c_{j})$ we also have that $\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},c^{\prime}),(F_{1},e_{1}),\dots,(F_{r},e_{r}))=(F,e)$ . Having all this, we can apply the (Con) rule again, deriving $\Lambda^{x}\vdash R:\hat{\tau}\triangleright e$ . Simultaneously we observe that $c=e+\sum_{i\in I}d_{i}$ .

Finally, suppose that $R=P\,Q$ . This case is very similar to the previous one. The original derivation ends with the (@) rule, whose premisses are $\Gamma_{0}\vdash P[S/x]:\hat{\tau}_{0}\triangleright c_{0}$ and $\Gamma_{j}\vdash Q[S/x]:\hat{\tau}_{j}\triangleright c_{j}$ for $j\in J$ , where we assume that $0\not\in J$ . We apply the induction assumption to all these premisses. Assuming w.l.o.g. that the resulting sets $I_{j}$ are disjoint, and taking $I=\bigcup_{j\in\{0\}\cup J}I_{j}$ , we obtain a derivation of $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ for every $i\in I$ , and of $\Lambda^{x}_{0}\vdash P:\hat{\tau}_{0}\triangleright e_{0}$ , and of $\Lambda^{x}_{j}\vdash Q:\hat{\tau}_{j}\triangleright e_{j}$ for every $j\in J$ , where, for every $j\in\{0\}\cup J$ , we have $\Lambda^{x}_{j}=\Lambda_{j}[x\mapsto\{\hat{\sigma}_{i}\mid i\in I_{j}\}]$ with $\Lambda_{j}(x)=\emptyset$ , and $\mathit{Split}(\Gamma_{j}\mid\Lambda_{j},(\Sigma_{i})_{i\in I_{j}})$ holds, and $c_{j}=e_{j}+\sum_{i\in I_{j}}d_{i}$ , and $\mathsf{Mk}(\hat{\sigma}_{i})\cap\mathsf{Mk}(\hat{\sigma}_{i^{\prime}})=\emptyset$ for $i,i^{\prime}\in I_{j}$ if $i\neq i^{\prime}$ . By Lemma 24 we have $\mathsf{Mk}(\hat{\sigma}_{i})\subseteq\mathsf{Mk}(\hat{\tau}_{j})$ for $i\in I_{j}$ , $j\in\{0\}\cup J$ . Since $\mathsf{Mk}(\hat{\tau}_{j})\cap\mathsf{Mk}(\hat{\tau}_{j^{\prime}})=\emptyset$ for $j,j^{\prime}\in\{0\}\cup J$ with $j\neq j^{\prime}$ by a side condition of the (Con) rule, this implies that $\mathsf{Mk}(\hat{\sigma}_{i})\cap\mathsf{Mk}(\hat{\sigma}_{i^{\prime}})=\emptyset$ for all $i,i^{\prime}\in I$ with $i\neq i^{\prime}$ . Recalling the side condition $\mathit{Split}(\Gamma\mid(\Gamma_{j})_{i\in\{0\}\cup J})$ of the (@) rule, we observe that $\mathit{Split}(\Gamma\mid(\Lambda_{j})_{j\in\{0\}\cup J},(\Sigma_{i})_{i\in I})$ holds. Define $\Lambda$ by taking $\Lambda(z)=\bigcup_{j\in\{0\}\cup J}\Lambda_{j}(z)$ for every variable $z$ . We then have $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ and $\mathit{Split}(\Lambda\mid(\Lambda_{j})_{j\in\{0\}\cup J})$ , as well as $\mathit{Split}(\Lambda^{x}\mid(\Lambda^{x}_{j})_{j\in\{0\}\cup J})$ . Another side condition of the (Con) rule says that $\mathit{Comp}_{m}(M;\allowbreak((G_{j},c_{j}))_{j\in\{0\}\cup J})=(F,c)$ for appropriate sets $M,F,G_{j}$ . Taking $e=c+\sum_{j\in\{0\}\cup J}(e_{j}-c_{j})$ we also have that $\mathit{Comp}_{m}(M;\allowbreak((G_{j},c_{j}))_{j\in\{0\}\cup J})=(F,e)$ . Thus we can apply the (@) rule again, deriving $\Lambda^{x}\vdash R:\hat{\tau}\triangleright e$ . Simultaneously we observe that $c=e+\sum_{i\in I}d_{i}$ . ∎

Lemma 26.

Suppose that $\mathit{Comp}_{m}(M;\allowbreak(F,e),((\emptyset,d_{i}))_{i\in I})=(G,c)$ , where $F\cap M=\emptyset$ and $F,M\subseteq\{0,\dots,\allowbreak m-1\}$ . Then $G=F$ and $c=e+\sum_{i\in I}d_{i}$ .

Proof.

Consider the numbers $f_{n}$ and $f_{n}^{\prime}$ appearing in the definition of the $\mathit{Comp}_{m}$ predicate. Looking at them consecutively for $n=0,\dots,m$ we notice that $f_{n}^{\prime}=0$ and $f_{n}=|F\cap\{n\}|$ . Indeed, $f_{n}^{\prime}=0$ implies $f_{n}=|F\cap\{n\}|$ , and if $n-1\not\in M$ , we have $f_{n}^{\prime}=0$ , while if $n-1\in M$ , we have $f_{n}^{\prime}=f_{n-1}=|F\cap\{n-1\}|=0$ , because $F\cap M=\emptyset$ . Then $G=\{n\in\{0,\dots,m-1\}\mid f_{n}>0\land n\not\in M\}=F$ (again because $F\cap M=\emptyset$ ), and $c=f^{\prime}_{m}+e+\sum_{i\in I}d_{i}=e+\sum_{i\in I}d_{i}$ . ∎

Proof of Lemma 5.

Recall that we are given a derivation of $\Gamma\vdash Q:\hat{\tau}\triangleright c$ with $\mathit{ord}(\hat{\tau})=m$ , and a $\beta$ -reduction $P\to_{\beta}Q$ that is of order $m$ , and our goal is to derive $\Gamma\vdash P:\hat{\tau}\triangleright c$ .

Suppose first that $P=(\lambda x.R)\,S$ and $Q=R[S/x]$ , where $\mathit{ord}(\lambda x.R)=m$ . From Lemma 25 we obtain a derivation of $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ for every $i\in I$ (for some set $I$ ), and a derivation of $\Lambda[x\mapsto T]\vdash R:\hat{\tau}\triangleright e$ , where $T=\{\hat{\sigma}_{i}\mid i\in I\}$ , and $\Lambda(x)=\emptyset$ , and $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ holds, and $c=e+\sum_{i\in I}d_{i}$ , and $\mathsf{Mk}(\hat{\sigma}_{i})\cap\mathsf{Mk}(\hat{\sigma}_{j})=\emptyset$ for $i,j\in I$ if $i\neq j$ . Let us write $\hat{\tau}=(m,F,M,\tau)$ , and $\hat{\sigma}_{i}=(m,F_{i},M_{i},\sigma_{i})$ . To the type judgment $\Lambda[x\mapsto T]\vdash R:\hat{\tau}\triangleright e$ we apply the ( $\lambda$ ) rule, deriving $\Lambda\vdash\lambda x.R:(m,F,M\setminus\bigcup_{i\in I}M_{i},T{\to}\tau)\triangleright e$ .

To this type judgment, and to $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}$ for $i\in I$ , we want to apply the (@) rule. By definition of a full type, the sets $F_{i}$ and $M_{i}$ may contain only numbers smaller than $m$ (since $\hat{\sigma}_{i}$ is a full type). Recalling that $\mathit{ord}(\lambda x.R)=m$ we have that the type $\{(\mathit{ord}(\lambda x.R),F_{i}{\restriction}_{<\mathit{ord}(\lambda x.R)},M_{i}{\restriction}_{<\mathit{ord}(\lambda x.R)},\sigma_{i})\mid i\in I\}{\to}\tau$ that we have to derive for $\lambda x.R$ is indeed $T{\to}\tau$ . The conditions $M=(M\setminus\bigcup_{i\in I}M_{i})\uplus\biguplus_{i\in I}M_{i}$ , and $\mathit{ord}(\lambda x.R)\leq m$ , and $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ follow from what we have. Notice that the sets $F_{i}{\restriction}_{\geq\mathit{ord}(\lambda x.R)}$ are empty, and that $F\cap M=\emptyset$ by definition of a full type ( $\hat{\tau}=(m,F,M,\tau)$ is a full type), and hence $\mathit{Comp}_{m}(M;\allowbreak(F,e),((F_{i}{\restriction}_{\geq\mathit{ord}(\lambda x.R)},d_{i}))_{i\in I})=(F,c)$ by Lemma 26. Thus the (@) rule can be applied; it derives $\Gamma\vdash P:\hat{\tau}\triangleright c$ .

It remains to consider the general situation: the redex involved in the $\beta$ -reduction $P\to_{\beta}Q$ is located somewhere deeper in $P$ . Then the proof is by a trivial induction on the depth of this redex. Formally, we have several cases depending on the shape of $P$ , but let us consider only a representative example: suppose that $P=T\,U$ and $Q=T\,V$ with $U\to_{\beta}V$ . In the derivation $\Gamma\vdash Q:\hat{\tau}\triangleright c$ we apply the induction assumption to those premisses of the final (@) rule that concern the subterm $V$ , and we obtain type judgments in which $V$ is replaced by $U$ . We can apply the (@) rule to them, and to the premiss talking about $T$ , and derive $\Gamma\vdash P:\hat{\tau}\triangleright c$ . ∎

Appendix D Proof of Lemma 6

In order to enable an inductive proof of Lemma 6, we need to strengthen slightly its statement, and consider also $\lambda$ -terms of order $m$ . We say that a full type $(m,F,M,T_{1}{\to}\dots\to T_{k}{\to}o)$ is $(m-1)$ -clear if $m-1\not\in M\cup\bigcup_{i=1}^{k}\mathsf{Mk}(T_{i})$ .

Lemma 27.

If $\hat{\tau}\in\mathcal{F}^{\alpha}_{m}$ for $\mathit{ord}(\alpha)<m$ , and $m-1\not\in\mathsf{Mk}(\hat{\tau})$ , then $\hat{\tau}$ is $(m-1)$ -clear.

Proof.

If $\alpha=\alpha_{1}{\to}\dots\to\alpha_{k}{\to}o$ , we can write $\hat{\tau}=(m,F,M,T_{1}{\to}\dots\to T_{k}{\to}o)$ . For $i\in\{1,\dots,k\}$ by definition we have $T_{i}\subseteq\mathcal{F}^{\alpha_{i}}_{m_{i}}$ for $m_{i}=\mathit{ord}(\alpha_{i}{\to}\dots\to\alpha_{k}{\to}o)$ , and hence $\mathsf{Mk}(T_{i})\subseteq\{0,\dots,m_{i}-1\}$ ; because $\mathit{ord}(\alpha_{i}{\to}\dots\to\alpha_{k}{\to}o)\leq\mathit{ord}(\alpha)<m$ , we have $m-1\not\in T_{i}$ . Thus $\hat{\tau}$ is $(m-1)$ -clear. ∎

Lemma 28.

If we can derive $\Gamma\vdash R:\hat{\tau}\triangleright c$ , where $\mathit{ord}(\hat{\tau})=m>0$ and $\hat{\tau}$ is $(m-1)$ -clear, then $c=0$ .

Proof.

Fix some derivation of $\Gamma\vdash R:\hat{\tau}\triangleright c$ . The proof is by induction on the structure of this derivation. Let us write $\hat{\tau}=(m,F,M,\tau)$ . We have several cases depending on the shape of $R$ .

If $R=x$ , the (Var) rule ensures that the flag counter $c$ is [math].

If $R=\mathsf{br}\,P_{1}\,P_{2}$ , then above the final (Br) rule we have a premiss $\Gamma\vdash P_{i}:\hat{\tau}\triangleright c$ for some $i\in\{1,2\}$ ; the induction assumption used for this premiss implies $c=0$ .

Suppose that $R=\lambda x.P$ . Then above the final ( $\lambda$ ) rule we have a premiss $\Gamma^{\prime}\vdash P:(m,F,M^{\prime},\tau^{\prime})\triangleright c$ , where $\tau=T{\to}\tau^{\prime}$ and $M=M^{\prime}\setminus\mathsf{Mk}(T)$ . Because $\hat{\tau}$ is $(m-1)$ -clear, we have $m-1\not\in M\cup\mathsf{Mk}(T)$ , hence also $m-1\not\in M^{\prime}$ ; thus $(m,F,M^{\prime},\tau^{\prime})$ is $(m-1)$ -clear. The induction assumption applied to our premiss implies $c=0$ .

Next, suppose that $R=a\,P_{1}\,\dots\,P_{r}$ . Let $\Gamma_{i}\vdash P_{i}:(m,F_{i},M_{i},o)\triangleright c_{i}$ for $i\in\{1,\dots,r\}$ be the premisses of the final (Con) rule. For all $i\in\{1,\dots,r\}$ a side condition of this rule says that $M_{i}\subseteq M$ , so the full type $(m,F_{i},M_{i},o)$ is $(m-1)$ -clear, and hence $c_{i}=0$ by the induction assumption. We know that $\mathit{Comp}_{m}(M;\allowbreak(\{0\},0),(F_{1},c_{1}),\dots,(F_{r},c_{r}))=(F,c)$ (recall that $m>0$ ). The number $f_{m}^{\prime}$ considered in the definition of $\mathit{Comp}_{m}$ is [math] in our case of $m-1\not\in M$ , and thus we have $c=f_{m}^{\prime}+\sum_{i=1}^{r}c_{i}=0$ .

Finally, suppose that $R=P\,Q$ . Let $\Gamma^{\prime}\vdash P:(m,F^{\prime},M^{\prime},T{\to}\tau)\triangleright c^{\prime}$ and $\Gamma_{i}\vdash Q:(m,F_{i},M_{i},\tau_{i})\triangleright c_{i}$ for $i\in I$ be the premisses of the final (@) rule. A side condition of the rule says that $M^{\prime}\subseteq M$ and $M_{i}\subseteq M$ for $i\in I$ , so $m-1$ does not belong to these sets. By definition the marker sets in full types in $T$ are subsets of some $M_{i}$ , so $m-1\not\in\mathsf{Mk}(T)$ , and hence $(m,F^{\prime},M^{\prime},T{\to}\tau)$ is $(m-1)$ -clear. On the other hand $\mathit{ord}(Q)<\mathit{ord}(P)\leq m$ , so $(m,F,M_{i},\tau_{i})$ for $i\in I$ are $(m-1)$ -clear by Lemma 27. Thus the induction assumption can be used for all premisses; it says that $c^{\prime}=0$ and $c_{i}=0$ for all $i\in I$ . Because $\mathit{Comp}(M;\allowbreak(F^{\prime},c^{\prime}),((F_{i},c_{i}))_{i\in I})=(F,c)$ , and $m-1\not\in M$ , we obtain $c=0$ (as in the previous case). ∎

Proof of Lemma 6.

Recall that in this lemma we are given a derivation of $\Gamma\vdash P:\hat{\tau}\triangleright c$ , where $m-1\not\in\mathsf{Mk}(\hat{\tau})$ and $\mathit{ord}(P)\leq m-1$ for $m=\mathit{ord}(\hat{\tau})$ , and we have to prove that $c=0$ . Lemma 27 implies that $\hat{\tau}$ is $(m-1)$ -clear, and thus $c=0$ by Lemma 28 (we have $m>0$ because $\mathit{ord}(P)\leq m-1$ ). ∎

Appendix E Proof of Lemma 7

Let us formalize the notion of counting (@) rules of order $m$ in a derivation. We use here extended type judgments of the form $\Gamma\vdash P:\hat{\tau}\triangleright c\succ n$ , where $\Gamma\vdash P:\hat{\tau}\triangleright c$ is a type judgment, and $n\in\mathbb{N}$ . The number $n$ is called application counter. The meaning is that $\Gamma\vdash P:\hat{\tau}\triangleright c$ can be derived by a derivation in which the (@) rule of order $\mathit{ord}(\hat{\tau})$ is used $n$ times. Formally, we lift our type system so that it can derive extended type judgments, as follows. If, in the original type system, using premisses $\Gamma_{i}\vdash P_{i}:\hat{\tau}_{i}\triangleright c_{i}$ for $i\in I$ some rule could derive $\Gamma\vdash R:\hat{\tau}\triangleright c$ , then

•

if $R=P\,Q$ with $\mathit{ord}(P)=\mathit{ord}(\hat{\tau})$ , then using premisses $\Gamma_{i}\vdash P_{i}:\hat{\tau}_{i}\triangleright c_{i}\succ n_{i}$ for $i\in I$ we can derive $\Gamma\vdash R:\hat{\tau}\triangleright c\succ 1+\sum_{i\in I}n_{i}$ ;

•

otherwise (i.e., when $R=P\,Q$ with $\mathit{ord}(P)\neq\mathit{ord}(\hat{\tau})$ , or $R$ does not start with an application) using premisses $\Gamma_{i}\vdash P_{i}:\hat{\tau}_{i}\triangleright c_{i}\succ n_{i}$ for $i\in I$ we can derive $\Gamma\vdash R:\hat{\tau}\triangleright c\succ\sum_{i\in I}n_{i}$ .

It should be clear that we can derive $\Gamma\vdash R:\hat{\tau}\triangleright c$ if and only if we can derive $\Gamma\vdash R:\hat{\tau}\triangleright c\succ n$ for some $n\in\mathbb{N}$ . We also notice that in proofs of Lemmata 22 and 23 the number of (@) rules of the maximal order (and, more generally, the shape of a derivation) remain unchanged. Thus we can restate these lemmata as follows.

Lemma 29.

Suppose that we can derive $\Gamma\vdash R:\hat{\tau}\triangleright c\succ 0$ , and $x$ is not free in $R$ . Then for $\Sigma=\Gamma[x\mapsto\emptyset]$ we can also derive $\Sigma\vdash R:\hat{\tau}\triangleright c\succ 0$ , and $\mathit{Split}(\Gamma\mid\Sigma)$ holds.

Lemma 30.

Suppose that we can derive $\Gamma\vdash P:\hat{\tau}\triangleright c\succ 0$ . If $\mathit{Split}(\Gamma^{\prime}\mid\Gamma)$ holds, then we can also derive $\Gamma^{\prime}\vdash P:\hat{\tau}\triangleright c\succ 0$ .

As in the proof of Lemma 5 we start with a lemma describing substitution.

Lemma 31.

Suppose that we can derive $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}\succ 0$ for every $i\in I$ (for some finite set $I$ ), and $\Lambda^{x}\vdash R:\hat{\tau}\triangleright e\succ 0$ , where $\Lambda^{x}=\Lambda[x\mapsto\{\hat{\sigma}_{i}\mid i\in I\}]$ with $\Lambda(x)=\emptyset$ , and $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ holds, and $\mathit{ord}(S)<\mathit{ord}(\hat{\tau})$ , and $\mathit{ord}(\hat{\sigma}_{i})=\mathit{ord}(\hat{\tau})$ for all $i\in I$ . Then we can derive $\Gamma\vdash R[S/x]:\hat{\tau}\triangleright e+\sum_{i\in I}d_{i}\succ 0$ .

Proof.

We start by observing the following property, denoted ( $\diamondsuit$ ):

If $\mathsf{Mk}(\hat{\sigma}_{i})=\emptyset$ for some $i\in I$ , then $d_{i}=0$ and $\mathsf{Mk}(\Sigma_{i})=\emptyset$ .

Indeed, $\mathsf{Mk}(\Sigma_{i})\subseteq\mathsf{Mk}(\sigma_{i})=\emptyset$ follows from Lemma 24, while $d_{i}=0$ follows from Lemma 6 if we recall that $\mathit{ord}(S)<\mathit{ord}(\hat{\tau})=\mathit{ord}(\hat{\sigma}_{i})$ .

The proof of the lemma is by induction on the structure of some fixed derivation of $\Lambda^{x}\vdash R:\hat{\tau}\triangleright e\succ 0$ .

One possibility is that $x$ is not free in $R$ . In such a situation by Lemma 29 we can derive $\Lambda\vdash R:\hat{\tau}\triangleright e\succ 0$ and $\mathit{Split}(\Lambda^{x}\mid\Lambda)$ holds, which means that $\mathsf{Mk}(\hat{\sigma}_{i})=\emptyset$ for all $i\in I$ . By ( $\diamondsuit$ ) we have $d_{i}=0$ and $\mathsf{Mk}(\Sigma_{i})=\emptyset$ for all $i\in I$ . Thus $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ implies $\mathit{Split}(\Gamma\mid\Lambda)$ , and thus by Lemma 30 applied to $\Lambda\vdash R:\hat{\tau}\triangleright e\succ 0$ we can derive $\Gamma\vdash R:\hat{\tau}\triangleright e\succ 0$ . This is the desired type judgment since $R[S/x]=R$ and $e+\sum_{i\in I}d_{i}=e$ .

In the sequel we assume that $x$ is free in $R$ . We analyze the shape of $R$ .

Suppose first that $R=x$ . Then the derivation for $R$ consists of a single rule, and thus $e=0$ and for some $s\in I$ we have $\mathit{Split}(\Lambda^{x}\mid\varepsilon[x\mapsto\{\hat{\sigma}_{s}\}])$ and $\hat{\tau}=\hat{\sigma}_{s}$ (this holds because all $\hat{\sigma}_{i}$ are of the same order as $\hat{\tau}$ ). It follows that $\mathsf{Mk}(\Lambda(y))=\emptyset$ for every variable $y$ , and that $\mathsf{Mk}(\hat{\sigma}_{i})=\emptyset$ for every $i\in I\setminus\{s\}$ . By ( $\diamondsuit$ ) we have $d_{i}=0$ and $\mathsf{Mk}(\Sigma_{i})=\emptyset$ for all $i\in I\setminus\{s\}$ . It follows that $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ implies $\mathit{Split}(\Gamma\mid\Sigma_{s})$ ; we can thus derive $\Gamma\vdash S:\hat{\sigma}_{s}\triangleright d_{s}\succ 0$ by Lemma 30. This is what we need, since $R[S/x]=S$ , and $\hat{\tau}=\hat{\sigma}_{s}$ , and $e+\sum_{i\in I}d_{i}=d_{s}$ .

Next, suppose that $R=\mathsf{br}\,P_{1}\,P_{2}$ . Then our derivation ends with the (Br) rule, whose premiss is $\Lambda^{x}\vdash P_{k}:\hat{\tau}\triangleright e\succ 0$ , for some $k\in\{1,2\}$ . The induction assumption applied to this premiss gives us a derivation of $\Gamma\vdash P_{k}[S/x]:\hat{\tau}\triangleright e+\sum_{i\in I}d_{i}\succ 0$ . By applying back the (Br) rule we derive $\Gamma\vdash R[S/x]:\hat{\tau}\triangleright e+\sum_{i\in I}d_{i}\succ 0$ , as required.

Next, suppose that $R=\lambda y.P$ . We have $y\neq x$ , and, as always during a substitution, we assume (by performing $\alpha$ -conversion) that $y$ is not free in $S$ . The derivation for $R$ ends with the ( $\lambda$ ) rule, whose premiss is $\Lambda_{\lambda}^{x}[y\mapsto T]\vdash P:\hat{\tau}^{\prime}\triangleright e\succ 0$ , where $\Lambda_{\lambda}^{x}(y)=\emptyset$ and $\mathit{Split}(\Lambda^{x}\mid\Lambda^{x}_{\lambda})$ holds. Denote $\Lambda_{\lambda}=\Lambda_{\lambda}^{x}[x\mapsto\emptyset]$ . We then have $\Lambda_{\lambda}^{x}=\Lambda_{\lambda}[x\mapsto\{\hat{\sigma}_{i}\mid i\in J\}]$ for some $J\subseteq I$ . The condition $\mathit{Split}(\Lambda^{x}\mid\Lambda^{x}_{\lambda})$ implies that $\mathit{Split}(\Lambda\mid\Lambda_{\lambda})$ holds, and that $\mathsf{Mk}(\hat{\sigma}_{i})=\emptyset$ for all $i\in I\setminus J$ ; then $d_{i}=0$ and $\mathsf{Mk}(\Sigma_{i})=\emptyset$ for $i\in I\setminus J$ by ( $\diamondsuit$ ). In the light of $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ this implies that $\mathit{Split}(\Gamma\mid\Lambda_{\lambda},(\Sigma_{i})_{i\in J})$ holds. Because $\Gamma(y)$ may be nonempty, we have to define $\Gamma_{\lambda}=\Gamma[y\mapsto\emptyset]$ and $\Sigma_{i}^{\lambda}=\Sigma_{i}[y\mapsto\emptyset]$ for $i\in J$ . By assumption $y$ is not free in $S$ , so Lemma 29 says that we can derive $\Sigma_{i}^{\lambda}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}\succ 0$ and that $\mathit{Split}(\Sigma_{i}\mid\Sigma_{i}^{\lambda})$ holds for $i\in J$ . Recall that $\Lambda_{\lambda}(y)=\emptyset$ . Due to $\mathit{Split}(\Gamma\mid\Lambda_{\lambda},(\Sigma_{i})_{i\in J})$ we also have $\mathit{Split}(\Gamma\mid\Gamma_{\lambda})$ and $\mathit{Split}(\Gamma_{\lambda}\mid\Lambda_{\lambda},(\Sigma_{i}^{\lambda})_{i\in J})$ , thus also $\mathit{Split}(\Gamma_{\lambda}[y\mapsto T]\mid\Lambda_{\lambda}[y\mapsto T],(\Sigma_{i}^{\lambda})_{i\in J})$ . We are ready to apply the induction assumption to our premiss. We obtain a derivation of $\Gamma_{\lambda}[y\mapsto T]\vdash P[S/x]:\hat{\tau}^{\prime}\triangleright e+\sum_{i\in J}d_{i}\succ 0$ . Because $\mathit{Split}(\Gamma\mid\Gamma_{\lambda})$ holds and $\Gamma_{\lambda}(y)=\emptyset$ , we can apply the ( $\lambda$ ) rule, obtaining $\Gamma\vdash R[S/x]:\hat{\tau}\triangleright e+\sum_{i\in J}d_{i}\succ 0$ , where the full type is indeed $\hat{\tau}$ , as in the original derivation. Because $d_{i}=0$ for $i\in J\setminus I$ , this is what we need.

Another possibility is that $R=a\,P_{1}\,\dots\,P_{r}$ . Then the derivation for $R$ ends with the (Con) rule, whose premisses are $\Lambda_{j}^{x}\vdash P_{j}:\hat{\tau}_{j}\triangleright e_{j}\succ 0$ for $j\in\{1,\dots,r\}$ . For $j\in\{1,\dots,r\}$ denote $\Lambda_{j}=\Lambda_{j}^{x}[x\mapsto\emptyset]$ . A side condition of the (Con) rule says that $\mathit{Split}(\Lambda^{x}\mid(\Lambda^{x}_{j})_{j\in\{1,\dots,r\}})$ holds. On the on one hand, this implies $\mathit{Split}(\Lambda\mid(\Lambda_{j})_{j\in\{1,\dots,r\}})$ . On the other hand, for $j\in\{1,\dots,r\}$ we have $\Lambda_{j}^{x}=\Lambda_{j}[x\mapsto\{\hat{\sigma}_{i}\mid i\in I_{j}\}]$ for some $I_{j}\subseteq I$ , and for $i\in I\setminus\bigcup_{j=1}^{r}I_{j}$ we have $\mathsf{Mk}(\hat{\sigma}_{i})=\emptyset$ , and thus $d_{i}=0$ and $\mathsf{Mk}(\Sigma_{i})=\emptyset$ by ( $\diamondsuit$ ). Knowing that $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ holds, we obtain $\mathit{Split}(\Gamma\mid(\Lambda_{j})_{j\in\{1,\dots,r\}},(\Sigma_{i})_{i\in I_{j},j\in\{1,\dots,r\}})$ . For $j\in\{1,\dots,r\}$ we define $\Gamma_{j}$ by taking $\Gamma_{j}(y)=\Lambda_{j}(y)\cup\bigcup_{i\in I_{j}}\Sigma_{i}(y)$ for all variables $y$ . Then we have $\mathit{Split}(\Gamma\mid(\Gamma_{j})_{j\in\{1,\dots,r\}})$ and $\mathit{Split}(\Gamma_{j}\mid\Lambda_{j},(\Sigma_{i})_{i\in I_{j}})$ for $j\in\{1,\dots,r\}$ . Using the induction assumption for every premiss, we obtain a derivation of $\Gamma_{j}\vdash P_{j}[S/x]:\hat{\tau}_{j}\triangleright e_{j}+\sum_{i\in I_{j}}d_{i}\succ 0$ for $j\in\{1,\dots,r\}$ . Because $\mathit{Split}(\Gamma\mid(\Gamma_{j})_{j\in\{1,\dots,r\}})$ holds, and the derived full types are the same as in the derivation for $R$ , and the flag counters are higher by $\sum_{i\in I_{j}}d_{i}$ , we can apply the (Con) rule and derive $\Gamma\vdash R[S/x]:\hat{\tau}\triangleright e+\sum_{j=1}^{r}\sum_{i\in I_{j}}d_{i}\succ 0$ . It remains to observe that $\sum_{j=1}^{r}\sum_{i\in I_{j}}d_{i}=\sum_{i\in I}d_{i}$ . As already said, $d_{i}=0$ when $i\in I\setminus\bigcup_{j=1}^{r}I_{j}$ . Consider some $i\in I_{j}\cap I_{j^{\prime}}$ for $j,j^{\prime}\in\{1,\dots,r\}$ with $j\neq j^{\prime}$ . We have $\mathsf{Mk}(\hat{\sigma}_{i})\subseteq\mathsf{Mk}(\Lambda_{j}^{x})$ , so Lemma 24 applied to the type judgment $\Lambda_{j}^{x}\vdash P_{j}:\hat{\tau}_{j}\triangleright e_{j}$ says that $\mathsf{Mk}(\hat{\sigma}_{i})\subseteq\mathsf{Mk}(\hat{\tau}_{j})$ , and similarly $\mathsf{Mk}(\hat{\sigma}_{i})\subseteq\mathsf{Mk}(\hat{\tau}_{j^{\prime}})$ . But, by a side condition of the (Con) rule, $\mathsf{Mk}(\hat{\tau}_{j})$ and $\mathsf{Mk}(\hat{\tau}_{j^{\prime}})$ are disjoint, so $\mathsf{Mk}(\hat{\sigma}_{i})=\emptyset$ , and hence $d_{i}=0$ by ( $\diamondsuit$ ). In consequence the two sums are indeed equal.

Finally, suppose that $R=P\,Q$ . The proof is almost the same as in the previous case. The derivation for $R$ ends with the (@) rule, whose premisses are $\Lambda_{0}^{x}\vdash P:\hat{\tau}_{0}\triangleright e_{0}\succ 0$ and $\Lambda_{j}^{x}\vdash Q:\hat{\tau}_{j}\triangleright e_{j}\succ 0$ for $j\in J$ , where we assume that $0\not\in J$ . For $j\in\{0\}\cup J$ denote $\Lambda_{j}=\Lambda_{j}^{x}[x\mapsto\emptyset]$ . A side condition of the (@) rule says that $\mathit{Split}(\Lambda^{x}\mid(\Lambda^{x}_{j})_{j\in\{0\}\cup J})$ holds. On the on one hand, this implies $\mathit{Split}(\Lambda\mid(\Lambda_{j})_{j\in\{0\}\cup J})$ . On the other hand, for $j\in\{0\}\cup J$ we have $\Lambda_{j}^{x}=\Lambda_{j}[x\mapsto\{\hat{\sigma}_{i}\mid i\in I_{j}\}]$ for some $I_{j}\subseteq I$ , and for $i\in I\setminus\bigcup_{j\in\{0\}\cup J}I_{j}$ we have $\mathsf{Mk}(\hat{\sigma}_{i})=\emptyset$ , and thus $d_{i}=0$ and $\mathsf{Mk}(\Sigma_{i})=\emptyset$ by ( $\diamondsuit$ ). Knowing that $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ holds, we obtain $\mathit{Split}(\Gamma\mid(\Lambda_{j})_{j\in\{0\}\cup J},(\Sigma_{i})_{i\in I_{j},j\in\{0\}\cup J})$ . For $j\in\{0\}\cup J$ we define $\Gamma_{j}$ by taking $\Gamma_{j}(y)=\Lambda_{j}(y)\cup\bigcup_{i\in I_{j}}\Sigma_{i}(y)$ for all variables $y$ . Then we have $\mathit{Split}(\Gamma\mid(\Gamma_{j})_{j\in\{0\}\cup J})$ and $\mathit{Split}(\Gamma_{j}\mid\Lambda_{j},(\Sigma_{i})_{i\in I_{j}})$ for $j\in\{0\}\cup J$ . Using the induction assumption for every premiss, we obtain a derivation of $\Gamma_{0}\vdash P[S/x]:\hat{\tau}_{0}\triangleright e_{0}+\sum_{i\in I_{0}}d_{i}\succ 0$ , and of $\Gamma_{j}\vdash Q[S/x]:\hat{\tau}_{j}\triangleright e_{j}+\sum_{i\in I_{j}}d_{i}\succ 0$ for all $j\in J$ . Because $\mathit{Split}(\Gamma\mid(\Gamma_{j})_{j\in\{0\}\cup J})$ holds, and the derived full types are the same as in the derivation for $R$ , and the flag counters are higher by $\sum_{i\in I_{j}}d_{i}$ , we can apply the (@) rule and derive $\Gamma\vdash R[S/x]:\hat{\tau}\triangleright e+\sum_{j\in\{0\}\cup J}\sum_{i\in I_{j}}d_{i}\succ 0$ . It remains to observe that $\sum_{j\in\{0\}\cup J}\sum_{i\in I_{j}}d_{i}=\sum_{i\in I}d_{i}$ . As already said, $d_{i}=0$ when $i\in I\setminus\bigcup_{j\in\{0\}\cup J}I_{j}$ . Consider some $i\in I_{j}\cap I_{j^{\prime}}$ for $j,j^{\prime}\in\{0\}\cup J$ with $j\neq j^{\prime}$ . We have $\mathsf{Mk}(\hat{\sigma}_{i})\subseteq\mathsf{Mk}(\Lambda_{j}^{x})$ , so Lemma 24 applied to the type judgment $\Lambda_{j}^{x}\vdash T:\hat{\tau}_{j}\triangleright e_{j}$ (where $T=P$ or $T=Q$ , depending on $j$ ) says that $\mathsf{Mk}(\hat{\sigma}_{i})\subseteq\mathsf{Mk}(\hat{\tau}_{j})$ , and similarly $\mathsf{Mk}(\hat{\sigma}_{i})\subseteq\mathsf{Mk}(\hat{\tau}_{j^{\prime}})$ . But, by a side condition of the (@) rule, $\mathsf{Mk}(\hat{\tau}_{j})$ and $\mathsf{Mk}(\hat{\tau}_{j^{\prime}})$ are disjoint, so $\mathsf{Mk}(\hat{\sigma}_{i})=\emptyset$ , and hence $d_{i}=0$ by ( $\diamondsuit$ ). In consequence the two sums are indeed equal. ∎

Lemma 32.

Suppose that we can derive $\Gamma\vdash R\,S:\hat{\tau}\triangleright c\succ n$ so that all premisses of the final (@) rule have [math] in the application counter. If $\mathit{ord}(R)=\mathit{ord}(\hat{\tau})$ , and $\Gamma(y)\neq\emptyset$ only for variables $y$ of order at most $\mathit{ord}(\hat{\tau})-1$ , then $R=\lambda x.R^{\prime}$ , and we can derive $\Gamma\vdash R^{\prime}[S/x]:\hat{\tau}\triangleright c\succ 0$ .

Proof.

Let $\Gamma_{\lambda}\vdash R:\hat{\tau}_{\lambda}\triangleright e\succ 0$ and $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}\succ 0$ for $i\in I$ be the premisses of the final (@) rule. Let us write $\hat{\tau}=(m,F,M,\tau)$ , and $\hat{\tau}_{\lambda}=(m,F^{\prime},M_{\lambda},T{\to}\tau)$ , and $\hat{\sigma}_{i}=(m,F_{i},M_{i},\sigma_{i})$ for $i\in I$ . We have $T=\{(\mathit{ord}(R),F_{i}{\restriction}_{<\mathit{ord}(R)},M_{i}{\restriction}_{<\mathit{ord}(R)},\sigma_{i})\mid i\in I\}=\{\hat{\sigma}_{i}\mid i\in I\}$ , because $\mathit{ord}(R)=m$ and sets $F_{i}$ and $M_{i}$ contain only numbers smaller than $m$ .

We start by determining the shape of $R$ , by looking at the premiss concerning it, i.e., $\Gamma_{\lambda}\vdash R:\hat{\tau}_{\lambda}\triangleright e\succ 0$ . If $R$ was a variable, then the derivation of this premiss would consist of the (Var) rule requiring that $\Gamma_{\lambda}(R)\neq\emptyset$ , hence also $\Gamma(R)\neq\emptyset$ ; this is impossible by assumption since $R$ is of order $m$ . If $R$ is an application, $R=U\,V$ , then the derivation of the premiss concerning $R$ starts with the (@) rule, requiring that $\mathit{ord}(U)\leq m$ . However $\mathit{ord}(U)\geq\mathit{ord}(R)=m$ , so actually $\mathit{ord}(U)=m$ , and hence the application counter in this premiss should be at least $1$ , violating our assumption. It follows that $R$ cannot be an application. Moreover, $R$ takes an argument, so it cannot start with a symbol. Thus $R$ starts with a $\lambda$ -abstraction, $R=\lambda x.R^{\prime}$ .

The type judgment concerning $R$ is derived using the ( $\lambda$ ) rule out of a premiss $\Lambda[x\mapsto T]\vdash R^{\prime}:\hat{\tau}^{\prime}\triangleright e\succ 0$ , where $\hat{\tau}^{\prime}=(m,F^{\prime},M^{\prime},\tau)$ and $\Lambda(x)=\emptyset$ . The two rules imply that $M_{\lambda}=M^{\prime}\setminus\mathsf{Mk}(T)$ and $M=M_{\lambda}\cup\mathsf{Mk}(T)$ . Lemma 24 applied to the type judgment $\Lambda[x\mapsto T]\vdash R^{\prime}:\hat{\tau}^{\prime}\triangleright e$ implies that $\mathsf{Mk}(T)\subseteq M^{\prime}$ , and thus we obtain $M=M^{\prime}$ . Next, we notice that the sets $F_{i}{\restriction}_{\geq\mathit{ord}(R)}$ are empty, and that $F^{\prime}\cap M=F^{\prime}\cap M^{\prime}=\emptyset$ (because $\hat{\tau}^{\prime}=(m,F^{\prime},M^{\prime},\tau)$ is a full type), and that $\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},e),((F_{i}{\restriction}_{\geq\mathit{ord}(R)},d_{i}))_{i\in I})=(F,c)$ by a side condition of the (@) rule. In such a situation Lemma 26 implies that $F=F^{\prime}$ (thus actually $\hat{\tau}^{\prime}=\hat{\tau}$ ) and $c=e+\sum_{i\in I}d_{i}$ . Finally, due to side conditions of the (@) and ( $\lambda$ ) rules we have $\mathit{Split}(\Gamma\mid\Gamma_{\lambda},(\Sigma_{i})_{i\in I})$ and $\mathit{Split}(\Gamma_{\lambda}\mid\Lambda)$ , hence also $\mathit{Split}(\Gamma\mid\Lambda,(\Sigma_{i})_{i\in I})$ . We also have $\mathit{ord}(S)<\mathit{ord}(R)=m$ . Thus we can apply Lemma 31 to type judgments $\Sigma_{i}\vdash S:\hat{\sigma}_{i}\triangleright d_{i}\succ 0$ for $i\in I$ and $\Lambda[x\mapsto T]\vdash R^{\prime}:\hat{\tau}\triangleright e\succ 0$ . We obtain a derivation of $\Gamma\vdash R^{\prime}[S/x]:\hat{\tau}\triangleright c\succ 0$ , as required. ∎

We now give a generalization of Lemma 7 suitable for induction. Notice that a subterm of a $\lambda$ -term may be involved in multiple subtrees of a derivation tree for the whole $\lambda$ -term. Because of that, we have to handle multiple derivations for the same $\lambda$ -term at once.

Lemma 33.

Suppose that for a finite set $I$ , a number $m$ , and a $\lambda$ -term $P$ we can derive $\Gamma_{i}\vdash P:\hat{\tau}_{i}\triangleright c_{i}\succ n_{i}$ for $i\in I$ , where $\mathit{ord}(P)\leq m$ , and $\sum_{i\in I}n_{i}>0$ , and for all $i\in I$ we have $\mathit{ord}(\hat{\tau}_{i})=m$ and $\Gamma_{i}(x)\neq\emptyset$ only for variables $x$ of order at most $m-1$ . Then there is a $\lambda$ -term $Q$ such that $P\to_{\beta}Q$ , and we can derive $\Gamma_{i}\vdash Q:\hat{\tau}_{i}\triangleright c_{i}\succ n_{i}^{\prime}$ for all $i\in I$ , for some numbers $n_{i}^{\prime}$ such that $\sum_{i\in I}n_{i}^{\prime}<\sum_{i\in I}n_{i}$ .

Proof.

The proof is by induction on the smallest total size of derivations needed to derive $\Gamma_{i}\vdash P:\hat{\tau}_{i}\triangleright c_{i}\succ n_{i}$ for all $i\in I$ . Because $\sum_{i\in I}n_{i}>0$ , we have $|I|\geq 1$ .

Suppose first that for every $i\in I$ all premisses of the last rule used to derive $\Gamma_{i}\vdash P:\hat{\tau}_{i}\triangleright c_{i}\succ n_{i}$ have [math] in the application counter. This is the base case, in which we perform a $\beta$ -reduction. Because $\sum_{i\in I}n_{i}>0$ , necessarily $P=R\,S$ with $\mathit{ord}(R)=m$ , because only in such a situation the application counter in a derived type judgment can be higher than the sum of application counters in premisses. Then, for every $i\in I$ separately we apply Lemma 32 to our type judgment, and we obtain that $R=\lambda x.R^{\prime}$ , and that we can derive $\Gamma_{i}\vdash R^{\prime}[S/x]:\hat{\tau}_{i}\triangleright c_{i}\succ 0$ . Since $P=(\lambda x.R^{\prime})\,S\to_{\beta}R^{\prime}[S/x]$ , this gives the thesis.

Let us now consider the opposite case, when we have a premiss with positive application counter. We have multiple cases depending on the shape of $P$ , but all of them are similar, and boil down to a use of the induction assumption. Suppose, for example, that $P=R\,S$ . Then for every $i\in I$ the type judgment $\Gamma_{i}\vdash P:\hat{\tau}_{i}\triangleright c_{i}\succ n_{i}$ is derived by the (@) rule out of premisses $\Gamma_{i}^{\prime}\vdash R:\hat{\tau}_{i}^{\prime}\triangleright c_{i}^{\prime}\succ n_{i}^{\prime}$ and $\Gamma_{i,j}\vdash S:\hat{\tau}_{i,j}\triangleright c_{i,j}\succ n_{i,j}$ for $j\in J_{i}$ , for some finite set $J_{i}$ . We have $\mathit{ord}(S)<\mathit{ord}(R)\leq m$ by a side condition of the (@) rule. For every variable $x$ of order at least $m$ we have $\Gamma_{i}^{\prime}(x)\subseteq\Gamma_{i}(x)=\emptyset$ for all $i\in I$ , and $\Gamma_{i,j}(x)\subseteq\Gamma_{i}(x)=\emptyset$ for all $i\in I$ , $j\in J_{i}$ . When $n_{i,j}>0$ for some $i\in I$ , $j\in J_{i}$ , we apply the induction assumption to $S$ and to the collection of all premisses concerning it in all our derivations. We obtain a $\lambda$ -term $S^{\prime}$ such that $S\to_{\beta}S^{\prime}$ , and derivations of $\Gamma_{i,j}\vdash S^{\prime}:\hat{\tau}_{i,j}\triangleright c_{i,j}\succ n^{\prime}_{i,j}$ for all $i\in I$ , $j\in J_{i}$ , where $\sum_{i\in I}\sum_{j\in J_{i}}n_{i,j}^{\prime}<\sum_{i\in I}\sum_{j\in J_{i}}n_{i,j}$ . By applying the (@) rule to these type judgments and to the premisses concerning $R$ , we obtain derivations of $\Gamma_{i}\vdash Q:\hat{\tau}_{i}\triangleright c_{i}\succ n_{i}^{\prime}$ for all $i\in I$ , where $Q=R\,S^{\prime}$ and $\sum_{i\in I}n_{i}^{\prime}=\sum_{i\in I}(n_{i}+\sum_{j\in J_{i}}(n_{i,j}^{\prime}-n_{i,j}))<\sum_{i\in I}n_{i}$ . When $n_{i,j}=0$ for all $i\in I$ , $j\in J_{i}$ , we necessarily have $n_{i}^{\prime}>0$ for some $i$ (as we are in the case in which some premiss has a positive application counter), and we apply the induction assumption to the premisses concerning $R$ . We proceed similarly when $P=a\,P_{1}\,\dots\,P_{r}$ for $a\neq\mathsf{br}$ , when $P=\mathsf{br}\,P_{1}\,P_{2}$ , and when $P=\lambda x.R$ (we cannot have $P=x$ , as then the application counter in all the type judgments would be [math]). In the case of $P=\lambda x.R$ we use the assumption $\mathit{ord}(P)\leq m$ to deduce that the full types assigned to $x$ in type environments of premisses are not assigned to a variable of order higher than $m-1$ : we have $\mathit{ord}(x)<\mathit{ord}(P)\leq m$ . ∎

Finally, we observe that Lemma 7 is a special case of Lemma 33, where $|I|=1$ and the type judgment is of the form $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ .

Appendix F Proof of Lemma 8

Lemma 34.

Suppose that $\mathit{Comp}_{m}(M;\allowbreak(F_{i},c_{i})_{i\in I})=(F,c)$ , where $m\geq 1$ . If $c^{\prime}_{i}\geq c_{i}+|F_{i}\cap\{m-1\}|$ for $i\in I$ , then $\mathit{Comp}_{m-1}(M{\restriction}_{<m-1};\allowbreak(F_{i}{\restriction}_{<m-1},c_{i}^{\prime})_{i\in I})=(F{\restriction}_{<m-1},c^{\prime})$ for some $c^{\prime}\geq c+|F\cap\{m-1\}|$ .

Proof.

The definition of the $\mathit{Comp}$ predicate specifies variables $f_{n}$ and $f_{n}^{\prime}$ . Let $f_{n,m}$ and $f^{\prime}_{n,m}$ be values of these variables in the above instantiation of the $\mathit{Comp}_{m}$ predicate, while $f_{n,m-1}$ and $f^{\prime}_{n,m-1}$ —in the above instantiation of the $\mathit{Comp}_{m-1}$ predicate. Notice that $f_{n,m-1}=f_{n,m}$ for $n<m-1$ , and $f^{\prime}_{n,m-1}=f^{\prime}_{n,m}$ for $n\leq m-1$ . In consequence the requirements given by $\mathit{Comp}_{m-1}$ on the set $F$ are satisfied, since they are the same as the requirements given by $\mathit{Comp}_{m}$ .

Next, let us observe that $f_{m-1,m}\geq f^{\prime}_{m,m}+|F\cap\{m-1\}|$ . Indeed, if $m-1\in M$ , we have $f^{\prime}_{m,m}=f_{m-1,m}$ and $m-1\not\in F$ . Conversely, if $m-1\not\in M$ , we have $f^{\prime}_{m,m}=0$ , and if $f_{m-1,m}=0$ then also $m-1\not\in F$ .

Finally, because $f^{\prime}_{m-1,m-1}=f^{\prime}_{m-1,m}$ , we have

[TABLE]

We now generalize Lemma 8 to arbitrary type judgments.

Lemma 35.

Suppose that we can derive $\Gamma\vdash P:(m,F,M,\tau)\triangleright c\succ 0$ , where $\mathit{ord}(P)\leq m-1$ , and for every variable $x$ and every $\hat{\eta}\in\Gamma(x)$ we have $\mathit{ord}(\hat{\eta})\leq m-1$ . Then we can also derive $\Gamma\vdash P:(m-1,F{\restriction}_{<m-1},M{\restriction}_{<m-1},\tau)\triangleright c^{\prime}$ with $c^{\prime}\geq c+|F\cap\{m-1\}|$ .

Proof.

Denote $\hat{\tau}=(m,F,M,\tau)$ and $\hat{\sigma}=(m-1,F{\restriction}_{<m-1},M{\restriction}_{<m-1},\tau)$ . The proof is by induction on the structure of some fixed derivation of $\Gamma\vdash P:\hat{\tau}\triangleright c\succ 0$ . We have $m\geq 1$ , since $\mathit{ord}(P)\leq m-1$ . We distinguish several cases depending on the shape of $R$ .

Suppose first that $P$ is a variable, $P=x$ . Then the (Var) rule used in the derivation implies that $\mathit{Split}(\Gamma\mid\varepsilon[x\mapsto(k,F,M{\restriction}_{<k},\tau)])$ holds, and $c=0$ . By assumption of the lemma we have $k\leq m-1$ , so $(M{\restriction}_{<m-1}){\restriction}_{<k}=M{\restriction}_{<k}$ and $F{\restriction}_{<m-1}=F$ (because $F\subseteq\{0,\dots,k-1\}$ ). In consequence, we can use the (Var) rule to derive $\Gamma\vdash P:\hat{\sigma}\triangleright 0$ .

Next, suppose that $P=\mathsf{br}\,P_{1}\,P_{2}$ . Then the final (Br) rule has a premiss $\Gamma\vdash P_{k}:\hat{\tau}\triangleright c\succ 0$ for some $k\in\{1,2\}$ . Surely $\mathit{ord}(P_{k})=0\leq m-1$ . The induction assumption applied to this premiss gives us a derivation of $\Gamma\vdash P_{k}:\hat{\sigma}\triangleright c^{\prime}$ with $c^{\prime}\geq c+|F\cap\{m-1\}|$ . We apply back the (Br) rule, obtaining $\Gamma\vdash P:\hat{\sigma}\triangleright c^{\prime}$ .

Next, suppose that $P=\lambda x.Q$ . Then the final ( $\lambda$ ) rule has a premiss $\Gamma^{\prime}[x\mapsto T]\vdash Q:(m,F,M^{\prime},\tau^{\prime})\triangleright c\succ 0$ , where $\tau=T{\to}\hat{\tau}^{\prime}$ , and $M=M^{\prime}\setminus\mathsf{Mk}(T)$ , and $\mathit{Split}(\Gamma\mid\Gamma^{\prime})$ holds. Clearly $\mathit{ord}(Q)\leq\mathit{ord}(P)\leq m-1$ . We also have $T\subseteq\mathcal{F}^{\alpha}_{\mathit{ord}(P)}$ , where $\alpha$ is the sort of $x$ . In consequence, for every variable $y$ and every $\hat{\eta}\in(\Gamma^{\prime}[x\mapsto T])(y)$ we have $\mathit{ord}(\hat{\eta})\leq m-1$ . Using the induction assumption for our premiss we obtain a derivation of $\Gamma^{\prime}[x\mapsto T]\vdash Q:(m-1,F{\restriction}_{<m-1},M^{\prime}{\restriction}_{<m-1},\tau^{\prime})\triangleright c^{\prime}$ with $c^{\prime}\geq c+|F\cap\{m-1\}|$ . Because $M^{\prime}{\restriction}_{<m-1}\setminus\mathsf{Mk}(T)=(M^{\prime}\setminus\mathsf{Mk}(T)){\restriction}_{<m-1}=M{\restriction}_{<m-1}$ , by applying back the ( $\lambda$ ) rule we derive $\Gamma\vdash P:\hat{\sigma}\triangleright c^{\prime}$ .

Next, suppose that $P=a\,P_{1}\,\dots\,P_{r}$ with $a\neq\mathsf{br}$ . Then $\tau=o$ , and the final (Con) rule has premisses $\Gamma_{i}\vdash P_{i}:(m,F_{i},M_{i},o)\triangleright c_{i}\succ 0$ for $i\in\{1,\dots,r\}$ . For every $i\in\{1,\dots,r\}$ we have $\mathit{ord}(P_{i})=0\leq m-1$ , and $\Gamma_{i}(x)\subseteq\Gamma(x)$ for every variable $x$ , so we can use the induction assumption for the $i$ -th premiss and obtain a derivation of $\Gamma_{i}\vdash P_{i}:(m-1,F_{i}{\restriction}_{<m-1},M_{i}{\restriction}_{<m-1},o)\triangleright c_{i}^{\prime}$ with $c^{\prime}_{i}\geq c_{i}+|F_{i}\cap\{m-1\}|$ . If $r>0$ , we have a side condition $M=\biguplus_{i=1}^{r}M_{i}$ , which implies $M{\restriction}_{<m-1}=\biguplus_{i=1}^{r}M_{i}{\restriction}_{<m-1}$ . Another side condition says that $\mathit{Comp}_{m}(M;\allowbreak(\{0\},0),(F_{i},c_{i})_{i\in\{1,\dots,r\}})=(F,c)$ , and we need to see that $\mathit{Comp}_{m-1}(M{\restriction}_{<m-1};\allowbreak(F_{0},c_{0}),(F_{i}{\restriction}_{<m-1},c_{i}^{\prime})_{i\in\{1,\dots,r\}})=(F{\restriction}_{<m-1},c^{\prime})$ for some $c^{\prime}\geq c+|F\cap\{m-1\}|$ , where $(F_{0},c_{0})=(\{0\},0)$ if $m-1>0$ , and $(F_{0},c_{0})=(\emptyset,1)$ if $m-1=0$ . This follows from Lemma 34, where we notice that $\{0\}{\restriction}_{<m-1}=F_{0}$ , and $c_{0}\geq 0+|\{0\}\cap\{m-1\}|$ . Thus we can apply back the (Con) rule deriving $\Gamma\vdash P:\hat{\sigma}\triangleright c^{\prime}$ .

Finally, suppose that $P=Q\,R$ . Then the final (@) rule has premisses $\Gamma^{\prime}\vdash Q:(m,F^{\prime},M^{\prime},T{\to}\tau)\triangleright e\succ 0$ and $\Gamma_{i}\vdash R:(m,F_{i},M_{i},\tau_{i})\triangleright d_{i}\succ 0$ for $i\in I$ , where $T=\{(\mathit{ord}(Q),F_{i}{\restriction}_{<\mathit{ord}(Q)},M_{i}{\restriction}_{<\mathit{ord}(Q)},\tau_{i})\mid i\in I\}$ . Because the application counter is [math] in the conclusion, we have $\mathit{ord}(Q)\neq m$ , so actually $\mathit{ord}(Q)\leq m-1$ by a side condition of the (@) rule. Simultaneously $\mathit{ord}(R)<\mathit{ord}(Q)\leq m-1$ , and the type environments $\Gamma^{\prime}$ and $(\Gamma_{i})_{i\in I}$ store only full types stored already in $\Gamma$ . The induction assumption applied to all premisses gives us derivations of $\Gamma^{\prime}\vdash Q:(m-1,F^{\prime}{\restriction}_{<m-1},M^{\prime}{\restriction}_{<m-1},T{\to}\tau)\triangleright e^{\prime}$ with $e^{\prime}\geq e+|F^{\prime}\cap\{m-1\}|$ , and of $\Gamma_{i}\vdash R:(m-1,F_{i}{\restriction}_{<m-1},M_{i}{\restriction}_{<m-1},\tau_{i})\triangleright d_{i}^{\prime}$ with $d_{i}^{\prime}\geq d_{i}+|F_{i}\cap\{m-1\}|$ for $i\in I$ . The side condition $M=M^{\prime}\uplus\biguplus_{i\in I}M_{i}$ implies $M{\restriction}_{<m-1}=M^{\prime}{\restriction}_{<m-1}\uplus\biguplus_{i\in I}M_{i}{\restriction}_{<m-1}$ . Another side condition says that $\mathit{Comp}_{m}(M;\allowbreak(F^{\prime},e),(F_{i},d_{i})_{i\in I})=(F,c)$ , which by Lemma 34 implies that $\mathit{Comp}_{m-1}(M{\restriction}_{<m-1};\allowbreak(F^{\prime}{\restriction}_{<m-1},e^{\prime}),(F_{i}{\restriction}_{<m-1},d_{i}^{\prime})_{i\in I})=(F{\restriction}_{<m-1},c^{\prime})$ for some $c^{\prime}\geq c+|F\cap\{m-1\}|$ . We also have that $T=\{(\mathit{ord}(Q),(F_{i}{\restriction}_{<m-1}){\restriction}_{<\mathit{ord}(Q)},(M_{i}{\restriction}_{<m-1}){\restriction}_{<\mathit{ord}(Q)},\tau_{i})\mid i\in I\}$ , because $\mathit{ord}(Q)\leq m-1$ . Having all this, we can apply back the (@) rule, and derive $\Gamma\vdash P:\hat{\sigma}\triangleright c^{\prime}$ . ∎

Lemma 8 says that if we can derive $\varepsilon\vdash P:(m,\emptyset,\{0,\dots,m-1\},o)\triangleright c\succ 0$ with $m>0$ , then we can also derive $\varepsilon\vdash P:(m-1,\emptyset,\{0,\dots,m-2\},o)\triangleright c^{\prime}$ for some $c^{\prime}\geq c$ . Here $\mathit{ord}(P)=0$ , so this is just a special case of Lemma 35.

Appendix G Remaining proofs for Section 5

In the final part of Section 5 we have implicitly used the following lemma, which we now prove.

Lemma 36.

If we can derive $\varepsilon\vdash P:\hat{\rho}_{0}\triangleright c$ , then there exists a tree $t\in\mathcal{L}(P)$ such that $|t|=c$ .

Proof.

Recall that $\hat{\rho}_{0}=(0,\emptyset,\emptyset,o)$ . The proof is by induction on the structure of some fixed derivation of $\varepsilon\vdash P:\hat{\rho}_{0}\triangleright c$ . Let us analyze the shape of $P$ . Because the type environment is empty, $P$ cannot be a variable. The sort of $\hat{\rho}_{0}$ , and hence of $P$ , is $o$ , and thus $P$ cannot start with a $\lambda$ -abstraction. Moreover, $P$ cannot be an application $Q\,R$ , because the (@) rule requires that $\mathit{ord}(Q)\leq\mathit{ord}(\hat{\rho}_{0})=0$ . Thus $P$ starts with a symbol. We have two cases.

Suppose first that $P=a\,P_{1}\,\dots\,P_{r}$ with $a\neq\mathsf{br}$ . We notice that $\hat{\rho}_{0}$ is the only full type in $\mathcal{F}^{o}_{0}$ , and that $\mathit{Split}(\varepsilon\mid\Gamma_{1},\dots,\Gamma_{r})$ implies $\Gamma_{1}=\dots=\Gamma_{r}=\varepsilon$ . Thus the premisses of the final (Con) rule are $\varepsilon\vdash P_{i}:\hat{\rho}_{0}\triangleright c_{i}$ , for $i\in\{1,\dots,r\}$ . Because $\mathit{Comp}_{0}(\emptyset;\allowbreak(\emptyset,1),(\emptyset,c_{1}),\dots,(\emptyset,c_{r}))=(\emptyset,c)$ , we have $c=1+c_{1}+\dots+c_{r}$ . The induction assumption gives us, for $i\in\{1,\dots,r\}$ , trees $t_{i}$ such that $|t_{i}|=c_{i}$ and $t_{i}\in\mathcal{L}(P_{i})$ , which means that $\mathit{BT}(P_{i})\to_{\mathsf{br}}^{*}t_{i}$ . As $t$ we take the tree having $a$ in its root, and $t_{1},\dots,t_{r}$ as subtrees starting in the root’s children. Because $\mathit{BT}(P)$ has $a$ in its root, and $\mathit{BT}(P_{1}),\dots,\mathit{BT}(P_{r})$ as subtrees starting in the root’s children, it should be clear that $\mathit{BT}(P)\to_{\mathsf{br}}^{*}t$ . Moreover, $|t|=1+|t_{1}|+\dots+|t_{r}|=1+c_{1}+\dots+c_{r}=c$ .

Another possibility is that $P=\mathsf{br}\,P_{1}\,P_{2}$ . Then the final (Br) rule has one premiss $\varepsilon\vdash P_{i}:\hat{\rho}_{0}\triangleright c$ for some $i\in\{1,2\}$ . The induction assumption gives us a tree $t$ such that $|t|=c$ and $\mathit{BT}(P_{i})\to_{\mathsf{br}}^{*}t$ . Recalling that $\mathit{BT}(P)$ has $\mathsf{br}$ in its root, and $\mathit{BT}(P_{i})$ as its subtree starting in the $i$ -th child of the root, we see that $\mathit{BT}(P)\to_{\mathsf{br}}\mathit{BT}(P_{i})$ , and thus $t\in\mathcal{L}(P)$ . ∎

Appendix H Proofs for Section 6

Let us recall two definitions from page 6. We say that two type judgments are equivalent if they differ only in the value of the flag counter. Given a $\lambda Y$ -term $P$ and a number $m$ , we have also defined a set $\mathcal{D}$ of all derivations of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ in which on each branch there are at most three type judgments from every equivalence class, and among premisses of each (@) rule there is at most one type judgment from every equivalence class.

We complete the proof of effectiveness contained in Section 6 by a formal proof of the following lemma.

Lemma 37.

Suppose that for some $\lambda Y$ -term $P$ we can derive $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ for arbitrarily large numbers $c\in\mathbb{N}$ . Then in the set $\mathcal{D}$ there is a derivation in which on some branch there are two equivalent type judgments with different values of the flag counter.

Proof.

In this lemma it is convenient to see derivations as trees: type judgments of a derivation constitute nodes of a tree; premisses of a type judgment are located in its children. We consider $P$ and $m$ to be fixed. A derivation is called narrow if among premisses of each its (@) rule there is at most one type judgment from every equivalence class. We have already justified on page 6 that if a type judgment $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ can be derived, then it has a narrow derivation. Moreover, we have justified that there are only finitely many equivalence classes of type judgments that can be used in any derivation of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ , for any $c$ ; let $E$ be their number. This gives a bound on the number of premisses of (@) rules in narrow derivations. Rules (Var), (Br), and ( $\lambda$ ) always have at most one premiss. The number of premisses of a (Con) rule is specified by the rank of the symbol involved; when this rule is used in a derivation of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ , this symbol has to appear in $P$ , which gives a bound on its rank. All this gives us a bound $D$ on the degree of nodes appearing in the considered narrow derivations of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ for arbitrarily large $c$ .

Looking at the definition of the $\mathit{Comp}_{m}$ predicate it is easy to see that if $\mathit{Comp}_{m}(M;\allowbreak(F_{i},c_{i})_{i\in I})=(F,c)$ , then $c\leq|I|\cdot m+\sum_{i\in I}c_{i}$ . In consequence, the flag counter in a conclusion of a (@) rule or a (Con) rule, having at most $D$ premisses, can be higher than the sum of flag counters in the premisses at most by $(D+1)\cdot m+1$ (these “ $+1$ ” appear here, because in the (Con) rule, beside of pairs $(F_{i},c_{i})$ coming from premisses, we pass to the $\mathit{Comp}_{m}$ predicate an additional pair $(F^{\prime},c^{\prime})$ with $c^{\prime}\leq 1$ ). The conclusion of every (Br) and ( $\lambda$ ) rule has the same flag counter as the only premiss, and the conclusion of the (Var) rule always has [math] in its flag counter. This means that there is constant $C$ such that in any node of any narrow derivation of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ the flag counter can be higher than the sum of flag counters in its premisses at most by $C$ (and simultaneously it cannot be smaller than this sum).

We define a level of a node in a derivation, by induction on the depth of the node. Leaves and all nodes with flag counter [math] have level [math]. If an internal node has its flag counter positive and equal to the flag counter in some child of this node, then the level of this node is equal to the level of this child (notice that there is at most one such child, and the flag counter in all other children is [math]). The level of every other internal node is defined as one plus the maximum of levels of its children.

Next, we prove that for every level $i$ there is a bound $C_{i}$ on the value of the flag counter among nodes of this level. Indeed, the value of the flag counter in leaves, and thus in all nodes of level [math], is bounded by $C_{0}=C$ . Take now a node of level $i>0$ having only children of levels smaller than $i$ . In each of these (at most $D$ ) children the flag counter is at most $C_{i-1}$ , so so the flag counter in our node is at most $C_{i}=C+D\cdot C_{i-1}$ . If a node of level $i$ has a child of level $i$ , then their flag counter is equal, thus is also at most $C_{i}$ (trivial induction on the depth of the node).

Let us now take a narrow derivation of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ for some $c>C_{E-1}$ . It necessarily contains a node of level greater than $E-1$ . Moreover, every node of some level $i>0$ has a child of level at least $\geq i-1$ . Thus in the considered derivation there exists a branch having nodes of at least $E+1$ different levels. Among them we can find two nodes with equivalent type judgments and being on different levels (as the type judgments come from at most $E$ equivalence classes). If the two nodes had the same flag counters, then also all nodes on the path between them would have the same flag counter, and thus the two nodes would have the same level, which is not the case. Thus we have found two nodes having equivalent type judgments, different values of the flag counter, and such that one of them is a descendant of the other.

Let $x$ and $y$ be nodes of a derivation of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ , where $y$ is a descendant of $x$ , and they contain equivalent type judgments. In such a situation, we can cut out the fragment between nodes $x$ and $y$ , in the following sense: we decrease by $c_{x}-c_{y}$ the flag counter in every ancestor of $x$ , we remove $x$ and all its descendants not being in the subtree starting in $y$ , and we attach $y$ in the place of $x$ . This results in some (correct) narrow derivation of $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c^{\prime}$ for some $c^{\prime}$ .

Take now the smallest (in the sense of the number of nodes) narrow derivation of a type judgment $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c$ , for any $c$ , in which there are two nodes $u$ , $v$ having equivalent type judgments, different values of the flag counter, and such that $v$ is a descendant of $u$ . If this derivation is in $\mathcal{D}$ , we are done. If not, we can find four nodes $x_{1},x_{2},x_{3},x_{4}$ with equivalent type judgments, such that $x_{i+1}$ is a descendant of $x_{i}$ for $i\in\{1,2,3\}$ . Let $c_{i}$ be the value of the flag counter in $x_{i}$ , for $i\in\{1,2,3,4\}$ . We have two cases. Suppose first that $c_{i}>c_{i+1}$ for some $i\in\{1,2,3\}$ . We then take some $j\in\{1,2,3\}\setminus\{i\}$ , and we cut off the fragment between $x_{j}$ and $x_{j+1}$ ; we obtain a smaller narrow derivation of a type judgment $\varepsilon\vdash P:\hat{\rho}_{m}\triangleright c^{\prime}$ , for some $c^{\prime}$ , in which there are two nodes, namely $x_{i}$ and $x_{i+1}$ , having equivalent type judgments, different values of the flag counter, and such that one of them is a descendant of the other. This contradicts minimality of our derivation. Next, suppose that $c_{1}=c_{2}=c_{3}=c_{4}$ . Then we recall that we already have two nodes $u$ , $v$ having equivalent type judgments, different values of the flag counter, and such that $v$ is a descendant of $u$ . For $i\in\{1,2,3\}$ consider the set $V_{i}$ containing $x_{i}$ and all its descendants not being in the subtree starting in $x_{i+1}$ . These sets are disjoint, so for some $j\in\{1,2,3\}$ we have $u\not\in V_{j}$ and $v\not\in V_{j}$ . We cut off the fragment between $x_{j}$ and $x_{j+1}$ . This removes exactly the nodes from $V_{j}$ , so the nodes $u$ and $v$ are still present in the derivation. Moreover, equality $c_{j}=c_{j+1}$ implies that we have not changed flag counters in any node, in particular in $u$ and $v$ , so $u$ and $v$ in the new derivation again have different values of the flag counter. Thus also in this case we have obtained a contradiction with minimality of our derivation. ∎

Bibliography22

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1]
2[2] Achim Blumensath (2008): On the Structure of Graphs in the Caucal Hierarchy . Theor. Comput. Sci. 400(1-3), pp. 19–45, 10.1016/j.tcs.2008.01.053 . · doi ↗
3[3] Achim Blumensath (2013): Erratum to "On the Structure of Graphs in the Caucal Hierarchy" [Theoret. Comput. Sci 400 (2008) 19-45] . Theor. Comput. Sci. 475, pp. 126–127, 10.1016/j.tcs.2012.12.044 . · doi ↗
4[4] Mikołaj Bojańczyk & Szymon Toruńczyk (2012): Weak MSO+U over Infinite Trees . In Christoph Dürr & Thomas Wilke, editors: 29th International Symposium on Theoretical Aspects of Computer Science, STACS 2012, February 29th - March 3rd, 2012, Paris, France , LIP Ics 14, Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, pp. 648–660, 10.4230/LIP Ics.STACS.2012.648 . · doi ↗
5[5] Christopher H. Broadbent, Arnaud Carayol, Matthew Hague & Olivier Serre (2012): A Saturation Method for Collapsible Pushdown Systems . In Artur Czumaj, Kurt Mehlhorn, Andrew M. Pitts & Roger Wattenhofer, editors: Automata, Languages, and Programming - 39th International Colloquium, ICALP 2012, Warwick, UK, July 9-13, 2012, Proceedings, Part II , Lecture Notes in Computer Science 7392, Springer, pp. 165–176, 10.1007/978-3-642-31585-5_18 . · doi ↗
6[6] Christopher H. Broadbent & Naoki Kobayashi (2013): Saturation-Based Model Checking of Higher-Order Recursion Schemes . In Simona Ronchi Della Rocca, editor: Computer Science Logic 2013 (CSL 2013), CSL 2013, September 2-5, 2013, Torino, Italy , LIP Ics 23, Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, pp. 129–148, 10.4230/LIP Ics.CSL.2013.129 . · doi ↗
7[7] Lorenzo Clemente, Pawel Parys, Sylvain Salvati & Igor Walukiewicz (2016): The Diagonal Problem for Higher-Order Recursion Schemes is Decidable . In Martin Grohe, Eric Koskinen & Natarajan Shankar, editors: Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science, LICS ’16, New York, NY, USA, July 5-8, 2016 , ACM, pp. 96–105, 10.1145/2933575.2934527 . · doi ↗
8[8] Lorenzo Clemente, Paweł Parys, Sylvain Salvati & Igor Walukiewicz (2015): Ordered Tree-Pushdown Systems . In Prahladh Harsha & G. Ramalingam, editors: 35th IARCS Annual Conference on Foundation of Software Technology and Theoretical Computer Science, FSTTCS 2015, December 16-18, 2015, Bangalore, India , LIP Ics 45, Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, pp. 163–177, 10.4230/LIP Ics.FSTTCS.2015.163 . · doi ↗

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Intersection Types and Counting††thanks: Work supported by the National Science Center (decision DEC-2012/07/D/ST6/02443).

Abstract

1 Introduction

Acknowledgements.

2 Preliminaries

Trees.

Infinitary λ\lambdaλ-calculus.

Böhm Trees.

λY\lambda YλY-calculus.

Theorem 1**.**

3 Intersection Type System

Intuitions.

Type Judgments.

Type System.

Example 1**.**

Example 2**.**

Example 3**.**

Example 4**.**

Theorem 2**.**

Example 5**.**

Example 6**.**

Example 7**.**

Example 8**.**

4 Completeness

Lemma 3**.**

Lemma 4**.**

Lemma 5**.**

Proof of Lemma 3 (sketch).

Proof of Lemma 4.

Proof of Lemma 5.

5 Soundness

Lemma 6**.**

Lemma 7**.**

Lemma 8**.**

6 Effectiveness

7 Conclusions

Appendix A Proof of Lemma 3

Lemma 9**.**

Proof.

Lemma 10**.**

Proof.

Lemma 11**.**

Proof.

Lemma 12**.**

Proof.

Corollary 13**.**

Proof.

Lemma 14**.**

Proof.

Lemma 15**.**

Proof.

Proof of Lemma 3.

Appendix B Proof of Lemma 4

Lemma 16**.**

Proof.

Lemma 17**.**

Proof.

Lemma 18**.**

Proof.

Corollary 19**.**

Proof.

Corollary 20**.**

Proof.

Lemma 21**.**

Proof.

Appendix C Proof of Lemma 5

Lemma 22**.**

Proof.

Lemma 23**.**

Proof.

Lemma 24**.**

Proof.

Lemma 25**.**

Infinitary $\lambda$ -calculus.

$\lambda Y$ -calculus.

Theorem 1.

Example 1.

Example 2.

Example 3.

Example 4.

Theorem 2.

Example 5.

Example 6.

Example 7.

Example 8.

Lemma 3.

Lemma 4.

Lemma 5.

Lemma 6.

Lemma 7.

Lemma 8.

Lemma 9.

Lemma 10.

Lemma 11.

Lemma 12.

Corollary 13.

Lemma 14.

Lemma 15.

Lemma 16.

Lemma 17.

Lemma 18.

Corollary 19.

Corollary 20.

Lemma 21.

Lemma 22.

Lemma 23.

Lemma 24.

Lemma 25.

Lemma 26.

Lemma 27.

Lemma 28.

Lemma 29.

Lemma 30.

Lemma 31.

Lemma 32.

Lemma 33.

Lemma 34.

Lemma 35.

Lemma 36.

Lemma 37.