Revisiting Occurrence Typing

Giuseppe Castagna; Victor Lanvin; Mickaël Laurent; Kim Nguyen

arXiv:1907.05590·cs.PL·February 25, 2022


TL;DR

This paper revisits occurrence typing, enhancing it with set-theoretic types to create a unified framework that improves type inference, supports intersection types, and optimizes compilation in gradually typed languages.

Contribution

It introduces a general set-theoretic type framework for occurrence typing, unifying and extending existing approaches and enabling new applications.

Findings

01

Develops a comprehensive set-theoretic occurrence typing framework

02

Enables reconstruction of intersection types for unannotated functions

03

Improves compilation efficiency for gradually typed languages

Abstract

We revisit occurrence typing, a technique to refine the type of variables occurring in type-cases and, thus, capture some programming patterns used in untyped languages. Although occurrence typing was tied from its inception to set-theoretic types (union types, in particular), it never fully exploited the capabilities of these types. Here we show how, by using set-theoretic types, it is possible to develop a general typing framework that encompasses and generalizes several aspects of current occurrence typing proposals and that can be applied to tackle other problems such as the reconstruction of intersection types for unannotated or partially annotated functions and the optimization of the compilation of gradually typed languages.

Tables

Table 1. Types inferred by the implementation

	Code	Inferred type
1	let basic_inf = fun (y : Int | Bool) -> if y is Int then incr y else lnot y	$(Int \to Int) \land (Bool \to Bool)$
2	let any_inf = fun (x : Any) -> if x is Int then incr x else if x is Bool then lnot x else x	$(Int \to Int) \land (\neg Int \to \neg Int) \land$ $(Bool \to Bool) \land (\neg (Int \lor Bool) \to \neg (Int \lor Bool))$
3	let is_int = fun (x : Any) -> if x is Int then true else false let is_bool = fun (x : Any) -> if x is Bool then true else false let is_char = fun (x : Any) -> if x is Char then true else false	$(Int \to True) \land (\neg Int \to False)$ $(Bool \to True) \land (\neg Bool \to False)$ $(Char \to True) \land (\neg Char \to False)$
4	let not_ = fun (x : Any) -> if x is True then false else true	$(True \to False) \land (\neg True \to True)$
5	let or_ = fun (x : Any) -> fun (y: Any) -> if x is True then true else if y is True then true else false	$(True \to Any \to True) \land (\neg True \to True \to True) \land$ $(\neg True \to \neg True \to False)$
6	let and_ = fun (x : Any) -> fun (y : Any) -> if not_ (or_ (not_ x) (not_ y)) is True then true else false	$(True \to ((\neg True \to False) \land (True \to True))$ $\land (\neg True \to Any \to False)$
7	let f = fun (x : Any) -> fun (y : Any) -> if and_ (is_int x) (is_bool y) is True then 1 else if or_ (is_char x) (is_int y) is True then 2 else 3	$(Int \to (Int \to 2) \land (\neg Int \to 1 \lor 3) \land (Bool \to 1) \land$ $(\neg (Bool \lor Int) \to 3) \land (\neg Bool \to 2 \lor 3))$ $\land$ $(Char \to (Int \to 2) \land (\neg Int \to 2) \land (Bool \to 2) \land$ $(\neg (Bool \lor Int) \to 2) \land (\neg Bool \to 2))$ $\land$ $(\neg (Int \lor Char) \to (Int \to 2) \land (\neg Int \to 3) \land$ $(Bool \to 3) \land (\neg (Bool \lor Int) \to 3) \land (\neg Bool \to 2 \lor 3))$ $\land \dots$ (two other redundant cases omitted)
	let test_1 = f 3 true let test_2 = f (42,42) 42 let test_3 = f nil nil	1 2 3
8	atom nil type Document = { nodeType=9 ..} and Element = { nodeType=1, childNodes=NodeList ..} and Text = { nodeType=3, isElementContentWhiteSpace=Bool ..} and Node = Document | Element | Text and NodeList = Nil | (Node, NodeList) let is_empty_node = fun (x : Node) -> if x.nodeType is 9 then false else if x is { nodeType=3 ..} then x.isElementContentWhiteSpace else if x.childNodes is Nil then true else false	$(Document \to False) \land$ $(\{nodeType = 1, childNodes = Nil ..\} \to True) \land$ $(\{nodeType = 1, childNodes = (Node, NodeList) ..\} \to False) \land$ $(Text \to Bool) \land \dots$ (omitted redundant arrows)
9	let xor_ = fun (x : Any) -> fun (y : Any) -> if and_ (or_ x y) (not_ (and_ x y)) is True then true else false	$True \to ((True \to False) \land (\neg True \to True)) \land$ $(\neg True \to ((True \to True) \land (\neg True \to False))$
10	(* f, g have type: (Int->Int) & (Any->Bool) *) let example10 = fun (x : Any) -> if (f x, g x) is (Int, Bool) then 1 else 2	$(Int \to Empty) \land (\neg Int \to 2)$ Warning: line 4, 39-40: unreachable expression
11	let typeof = fun (x:Any) -> if x is Int then "number" else if x is Char then "string" else if x is Bool then "boolean" else "object" let test = fun (x:Any) -> if typeof x is "number" then incr x else if typeof x is "string" then charcode x else if typeof x is "boolean" then int_of_bool x else 0	$(Int \to \textsf{"number"}) \land$ $(Char \to \textsf{"string"}) \land$ $(Bool \to \textsf{"boolean"}) \land$ $(\neg (Bool \lor Int \lor Char) \to \textsf{"object"}) \land \dots$ (two other redundant cases omitted) $(Int \to Int) \land (Char \to Int) \land (Bool \to Int) \land$ $(\neg (Bool \lor Int \lor Char) \to 0) \land \dots$ (two other redundant cases omitted)
12	atom null type Object = Null | { prototype = Object ..} type ObjectWithPropertyL = { l = Any ..} | { prototype = ObjectWithPropertyL ..} let has_property_l = fun (o:Object) -> if o is ObjectWithPropertyL then true else false let has_own_property_l = fun (o:Object) -> if o is { l=Any ..} then true else false let get_property_l = fun (self:Object->Any) o -> if has_own_property_l o is True then o.l else if o is Null then null else self (o.prototype)	$(ObjectWithPropertyL \to True)$ $\land$ $(X1 \to False)$ where $X1 = (Nil | \{l = ? Empty, prototype = X1 ..\})$ $(\{l = Any, prototype = Object ..\} \to True)$ $\land$ $((Nil | \{l = ? Empty, prototype = Object ..\}) \to False)$ $Object \to Any$

Equations

(x_{1} x_{2} \in t)? e_{1} : e_{2}

\texttt{let } x_{1} \texttt{ = foo in } (x_{1} x_{2} \in \text{Int})? ((x_{1} x_{2}) + x_{2}) : 42

(x_{1} x_{2} \in \text{Int})? ((x_{1} x_{2}) + x_{2}) : ((x_{1} x_{2}) \mathbin{@} x_{2})

(x_{1} x_{2} \in Int)? (x_{1} (x_{1} x_{2}) + 42) : not (x_{1} (x_{1} x_{2}))

t_{1} \circ s \leq \neg t

((Int \lor String \to Int) \land \neg (String \to \neg Int)) \lor ((Bool \lor String \to Bool) \land \neg (String \to \neg Int))

(x_{1} x_{2} \in Int)? ... x_{1} x_{2} ... : ... x_{1} x_{2} ...

(e (42) \in Bool)? e : ...

((f x, g x) \in Int \times Bool)? ... : ...

((x, y) \in ((Int \lor Bool) \times Int))? e : ...

\textbf{Types} \quad t ::= b \mid t \to t \mid t \times t \mid t \vee t \mid \neg t \mid \mathbb{0}

\llbracket \mathbb{0} \rrbracket

\llbracket b \rrbracket

\llbracket t_{1} \to t_{2} \rrbracket

(c : b)

((d_{1}, d_{2}) : t_{1} \times t_{2})

(\{(d_{1}, \partial_{1}), \ldots, (d_{n}, \partial_{n})\} : t_{1} \to t_{2})

(d : t_{1} \lor t_{2})

(d : \neg t)

(\partial : t)

\begin{array}{lrcl}\textbf{Expr} & e & ::= & c \mid x \mid e\,e \mid \lambda^{\wedge_{i\in I} s_{i}\to t_{i}} x.e \mid \pi_{j} e \mid (e,e) \mid (e \in t)\,\texttt{?}\,e\,\texttt{:}\,e \\ \textbf{Values} & v & ::= & c \mid \lambda^{\wedge_{i\in I} s_{i}\to t_{i}} x.e \mid (v,v) \end{array}

\begin{array}{rcll}(\lambda^{\wedge_{i\in I} s_{i}\to t_{i}} x.e)\,v & \leadsto & e\{x \mapsto v\} & \\ \pi_{i}(v_{1},v_{2}) & \leadsto & v_{i} & i=1,2 \\ (v \in t)\,\texttt{?}\,e_{1}\,\texttt{:}\,e_{2} & \leadsto & e_{1} & v \in \llbracket t \rrbracket_{\mathcal{V}} \\ (v \in t)\,\texttt{?}\,e_{1}\,\texttt{:}\,e_{2} & \leadsto & e_{2} & v \not\in \llbracket t \rrbracket_{\mathcal{V}} \end{array}

C[\,] ::= [\,] \mid C\,e \mid v\,C \mid (C, e) \mid (v, C) \mid \pi_{i} C \mid (C \in t)\,\texttt{?}\,e\,\texttt{:}\,e

\Gamma \vdash c : b_{c}

\frac{\Gamma \vdash e : t_{1} \times t_{2}}{\Gamma \vdash \pi_{i} e : t_{i}}

\Gamma \vdash e : \Gamma(e)

\frac{\Gamma \vdash \lambda^{\wedge_{i \in I} s_{i} \to t_{i}} x.e : t}{\Gamma \vdash \lambda^{\wedge_{i \in I} s_{i} \to t_{i}} x.e : \neg (t_{1} \to t_{2})}

\Gamma, (e : \mathbb{0}) \vdash e' : t

\frac{\Gamma \vdash e : t_{0} \quad \Gamma \vdash^{\text{Env}}_{e, t} \Gamma_{1} \quad \Gamma_{1} \vdash e_{1} : t' \quad \Gamma \vdash^{\text{Env}}_{e, \neg t} \Gamma_{2} \quad \Gamma_{2} \vdash e_{2} : t'}{\Gamma \vdash (e \in t)\,\texttt{?}\,e_{1}\,\texttt{:}\,e_{2} : t'}

\begin{array}{lll} e{\downarrow}\epsilon = e & (e_{1},e_{2}){\downarrow}l.\varpi = e_{1}{\downarrow}\varpi & \pi_{1}e{\downarrow}f.\varpi = e{\downarrow}\varpi \\ e_{0}\,e_{1}{\downarrow}i.\varpi = e_{i}{\downarrow}\varpi & (e_{1},e_{2}){\downarrow}r.\varpi = e_{2}{\downarrow}\varpi & \pi_{2}e{\downarrow}s.\varpi = e{\downarrow}\varpi \end{array}

\Gamma \vdash^{\text{Env}}_{e, t} \Gamma

Full text

Giuseppe Castagna

Institut de Recherche en Informatique Fondamentale (IRIF), CNRS - Université de Paris, France

Victor Lanvin

Mickaël Laurent

Kim Nguyen

Laboratoire de Méthodes Formelles (LMF), CNRS - Université Paris-Saclay, France

Abstract

We revisit occurrence typing, a technique to refine the type of variables occurring in type-cases and, thus, capture some programming patterns used in untyped languages. Although occurrence typing was tied from its inception to set-theoretic types—union types, in particular—it never fully exploited the capabilities of these types. Here we show how, by using set-theoretic types, it is possible to develop a general typing framework that encompasses and generalizes several aspects of current occurrence typing proposals and that can be applied to tackle other problems such as the reconstruction of intersection types for unannotated or partially annotated functions and the optimization of the compilation of gradually typed languages.

Keywords: occurrence typing, type inference, union types, intersection types, TypeScript, Flow language, dynamic languages, type case, gradual typing.

1 Introduction

TypeScript and Flow are extensions of JavaScript that allow the programmer to specify in the code type annotations used to statically type-check the program. For instance, the following function definition is valid in both languages

function foo(x : number | string) {
  return (typeof(x) === "number")? x+1 : x.trim();        (1)
}

Apart from the type annotation number | string of the function parameter, the above is standard JavaScript code defining a function that checks whether its argument is an integer; if it is so, then it returns the argument’s successor (x+1), otherwise it calls the method trim() of the argument. The annotation specifies that the parameter is either a number or a string (the vertical bar denotes a union type). If this annotation is respected and the function is applied to either an integer or a string, then the application cannot fail because of a type error (trim() is a string method of the ECMAScript 5 standard that trims white-spaces from the beginning and end of the string) and both the type-checker of TypeScript and the one of Flow rightly accept this function. This is possible because both type-checkers implement a specific type discipline called occurrence typing or flow typing (TypeScript calls it “type guard recognition” while Flow uses the terminology “type refinements”): as a matter of fact, standard type disciplines would reject this function. The reason is that standard type disciplines would try to type every part of the body of the function under the assumption that x has type number | string and they would fail, since the successor is not defined for strings and the method trim() is not defined for numbers. This is so because standard disciplines do not take into account the type test performed on x. Occurrence typing is the typing technique that uses the information provided by the test to specialize (precisely, to refine) the type of the occurrences of x in the branches of the conditional: since the program tested that x is of type number, we can safely assume that x is of type number in the “then” branch, and that it is not of type number (and thus deduce from the type annotation that it must be of type string) in the “else” branch.

Occurrence typing was first defined and formally studied by THF08 to statically type-check untyped Scheme programs (according to Sam Tobin-Hochstadt, the terminology occurrence typing was first used in a simplistic form by Komon05, although he and Felleisen were not aware of it at the time of writing [THF08]), and later extended by THF10, yielding the development of Typed Racket. From its inception, occurrence typing was intimately tied to type systems with set-theoretic types: unions, intersections, and negation of types. Union was the first type connective to appear, since it was already used by THF08 where its presence was needed to characterize the different control flows of a type test, as our foo example shows: one flow for integer arguments and another for strings. Intersection types appear (in limited forms) combined with occurrence typing both in TypeScript and in Flow and serve to give, among other things, more precise types to functions such as foo. For instance, since x + 1 evaluates to an integer and x.trim() to a string, our function foo has type (number|string) $\to$ (number|string). But it is clear that a more precise type would be one that states that foo returns a number when it is applied to a number and returns a string when it is applied to a string, so that the type deduced for, say, foo(42) would be number rather than number|string. This is exactly what the intersection type

(number $\to$ number) & (string $\to$ string)        (2)

states (intuitively, an expression has an intersection of types, noted &, if and only if it has all the types of the intersection) and corresponds in Flow to declaring foo as follows:

var foo : (number => number) & (string => string) = x => {
  return (typeof(x) === "number")? x+1 : x.trim();        (3)
}
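For comparison, the same intersection can be approximated in TypeScript with overload signatures. This is only a sketch of how a programmer might declare it by hand; the use of overloads is an assumption of this example, not something the text above prescribes.

```typescript
// Overload signatures play the role of the intersection type
// (number => number) & (string => string).
function foo(x: number): number;
function foo(x: string): string;
function foo(x: number | string): number | string {
  // The typeof test narrows x to number in the first branch and to
  // string in the second, so both x + 1 and x.trim() type-check.
  return typeof x === "number" ? x + 1 : x.trim();
}

const n: number = foo(42);      // resolved with the first signature
const s: string = foo("  hi "); // resolved with the second signature
console.log(n, s);
```

With only the annotation of (1), the result of foo(42) would have the union type number | string and the first assignment would be rejected.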

As for negation types, they are pervasive in the occurrence typing approach, even though they are used only at the meta-theoretic level (at the moment of writing there is a pending pull request to add negation types to the syntax of TypeScript, but that is all), in particular to determine the type environment when the type case fails. We already saw negation types at work when we informally typed the “else” branch in foo, for which we assumed that $x$ did not have type number (i.e., it had the negation type $\neg$ number) and deduced from it that $x$ then had type string (i.e., (number|string)& $\neg$ number, which is equivalent to the set-theoretic difference (number|string)\ number and, thus, to string).
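The set-theoretic reading of this computation can be sketched with a toy model in which a type is the finite set of base-type names it contains. The names Ty, UNIVERSE, inter, diff and neg are hypothetical helpers introduced for this illustration, and treating number and string as indivisible atoms is a simplifying assumption.

```typescript
// A type is modeled as a list of atom names without duplicates (toy model).
type Ty = string[];

// Hypothetical universe of atoms; it stands for the type of all values.
const UNIVERSE: Ty = ["boolean", "number", "string"];

const inter = (a: Ty, b: Ty): Ty => a.filter(x => b.indexOf(x) >= 0);
const diff = (a: Ty, b: Ty): Ty => a.filter(x => b.indexOf(x) < 0);
const neg = (a: Ty): Ty => diff(UNIVERSE, a);

const staticType: Ty = ["number", "string"]; // number | string
const tested: Ty = ["number"];               // the type checked by the test

const thenType = inter(staticType, tested);      // (number|string) & number
const elseType = inter(staticType, neg(tested)); // (number|string) & ¬number
const asDifference = diff(staticType, tested);   // (number|string) \ number

console.log(thenType, elseType, asDifference);
```

In this model the intersection with the negation and the set difference compute the same set, which is the equivalence used in the paragraph above.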

The approaches cited above essentially focus on refining the type of variables that occur in an expression whose type is being tested. They do it when the variable occurs at top-level in the test (i.e., the variable is the expression being tested) or under some specific positions such as in nested pairs or at the end of a path of selectors. In this work we aim at removing this limitation on the contexts and develop a general theory to refine the type of variables that occur in tested expressions under generic contexts, such as variables occurring in the left or the right expressions of an application. In other words, we aim at establishing a formal framework to extract as much static information as possible from a type test. Our analysis relies on the full-fledged set-theoretic type connectives provided by the theory of semantic subtyping. Our analysis will also yield two important byproducts. First, to refine the type of the variables we have to refine the type of the expressions they occur in and we can use this information to improve our analysis. Therefore our occurrence typing approach will refine not only the types of variables but also the types of generic expressions (i.e., any expression, whatever form it has), bypassing usual type inference. Second, and most importantly, the result of our analysis can be used to infer intersection types for functions, even in the absence of precise type annotations such as the one in the definition of foo in (3): to put it simply, we are able to infer the type (2) for the unannotated pure JavaScript code of foo (i.e., no type annotation at all), while in TypeScript and Flow (and any other formalism we are aware of) this requires an explicit and full type annotation as the one given in (3).

Finally, the natural target for occurrence typing are languages with dynamic type tests, in particular, dynamic languages. To type such languages occurrence typing is often combined not only, as discussed above, with set-theoretic types, but also with extensible record types (to type objects) and gradual type systems (to combine static and dynamic typing), two features that we study in Section 3 as two extensions of our core formalism. Of particular interest is the latter. Gre19 singles out occurrence typing and gradual typing as the two “lineages” that partition the research on combining static and dynamic typing: he identifies the former as the “pragmatic, implementation-oriented dynamic-first” lineage and the latter as the “formal, type-theoretic, static-first” lineage. Here we demonstrate that these two “lineages” are not orthogonal or mutually independent, and we combine occurrence and gradual typing showing, in particular, how the former can be used to optimize the compilation of the latter.

1.1 Motivating examples

We focus our study on conditionals that test types and consider the following syntax: $\texttt{(}e\in t\texttt{)?}e\texttt{:}e$ (e.g., in this syntax the body of foo in (1) and (3) is rendered as $\texttt{(}x\in\text{{Int}}\texttt{)?}x+1\texttt{:}(\textsf{trim }x)$ ). In particular, in this introduction we concentrate on applications, since they constitute the most difficult case and many other cases can be reduced to them. A typical example is the expression

$\texttt{(}x_{1}x_{2}\in t\texttt{)?}e_{1}\texttt{:}e_{2}$        (4)

where the $x_{i}$’s denote variables, $t$ is some type, and the $e_{i}$’s are generic expressions. Depending on the actual $t$ and on the static types of $x_{1}$ and $x_{2}$ , we can make type assumptions for $x_{1}$ , for $x_{2}$ , and for the application $x_{1}x_{2}$ when typing $e_{1}$ that are different from those we can make when typing $e_{2}$ . For instance, suppose $x_{1}$ is bound to the function foo defined in (1). Thus $x_{1}$ has type $(\text{{Int}}\to\text{{Int}})\wedge(\text{{String}}\to\text{{String}})$ (we used the syntax of the types of Section 2 where unions and intersections are denoted by $\vee$ and $\wedge$ and have priority over $\to$ and $\times$ , but not over $\neg$ ). (This and most of the following expressions are just given for the sake of example. Determining the type in each branch of expressions other than variables is interesting for constructors but less so for destructors such as applications, projections, and selections: any reasonable programmer would not repeat the same application twice but would store its result in a variable. This becomes meaningful with constructors such as pairs, as we do for instance in the expression in (12).) Then, it is not hard to see that if $x_{2}:\text{{Int}}{\vee}\text{{String}}$ , then the expression

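$\texttt{let }x_{1}\texttt{ = foo in }\texttt{(}x_{1}x_{2}\in\text{{Int}}\texttt{)?}((x_{1}x_{2})+x_{2})\texttt{:}\texttt{42}$        (5)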

is well typed with type Int: when typing the branch “then” we know that the test $x_{1}x_{2}\in\text{{Int}}$ succeeded and that, therefore, not only $x_{1}x_{2}$ is of type Int, but also that $x_{2}$ is of type Int: the other possibility, $x_{2}:\text{{String}}$ , would have made the test fail. For (5) we reasoned only on the type of the variables in the “then” branch but we can do the same on the “else” branch as shown by the following expression, where @ denotes string concatenation

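$\texttt{(}x_{1}x_{2}\in\text{{Int}}\texttt{)?}((x_{1}x_{2})+x_{2})\texttt{:}((x_{1}x_{2})\texttt{ @ }x_{2})$        (6)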

If the static type of $x_{1}$ is $(\text{{Int}}\to\text{{Int}})\wedge(\text{{String}}\to\text{{String}})$ then $x_{1}x_{2}$ is well typed only if the static type of $x_{2}$ is (a subtype of) $\text{{Int}}\vee\text{{String}}$ and from that it is not hard to deduce that (6) has type $\text{{Int}}\vee\text{{String}}$ . Let us see this in detail. The expression in (6) is typed in the following type environment: $x_{1}:(\text{{Int}}\to\text{{Int}})\wedge(\text{{String}}\to\text{{String}}),x_{2}:\text{{Int}}\vee\text{{String}}$ . All we can deduce, then, is that the application $x_{1}x_{2}$ has type $\text{{Int}}\vee\text{{String}}$ , which is not enough to type either the “then” branch or the “else” branch. In order to type the “then” branch $(x_{1}x_{2})+x_{2}$ we must be able to deduce that both $x_{1}x_{2}$ and $x_{2}$ are of type Int. Since we are in the “then” branch, then we know that the type test succeeded and that, therefore, $x_{1}x_{2}$ has type Int. Thus we can assume in typing this branch that $x_{1}x_{2}$ has both its static type and type Int and, thus, their intersection: $(\text{{Int}}\vee\text{{String}})\wedge\text{{Int}}$ , that is Int. For what concerns $x_{2}$ we use the static type of $x_{1}$ , that is $(\text{{Int}}\to\text{{Int}})\wedge(\text{{String}}\to\text{{String}})$ , and notice that this function returns an Int only if its argument is of type Int. Reasoning as above we thus deduce that in the “then” branch the type of $x_{2}$ is the intersection of its static type with Int: $(\text{{Int}}\vee\text{{String}})\wedge\text{{Int}}$ that is Int. To type the “else” branch we reason exactly in the same way, with the only difference that, since the type test has failed, then we know that the type of the tested expression is not Int. That is, the expression $x_{1}x_{2}$ can produce any possible value barring an Int. 
If we denote by $\mathbb{1}$ the type of all values (i.e., the type any of TypeScript and Flow) and by $\setminus$ the set difference, then this means that in the “else” branch we know that $x_{1}x_{2}$ has type $\mathbb{1}{\setminus}\text{{Int}}$ (written $\neg\text{{Int}}$ ), that is, it can return values of any type barring Int. Reasoning as for the “then” branch we then assume that $x_{1}x_{2}$ has type $(\text{{Int}}\vee\text{{String}})\wedge\neg\text{{Int}}$ (i.e., $(\text{{Int}}\vee\text{{String}})\setminus\text{{Int}}$ , that is, String), that $x_{2}$ must be of type String for the application to have type $\neg\text{{Int}}$ and therefore we assume that $x_{2}$ has type $(\text{{Int}}\vee\text{{String}})\wedge\text{{String}}$ (i.e., again String).

We have seen that we can specialize in both branches the type of the whole expression $x_{1}x_{2}$ , the type of the argument $x_{2}$ , but what about the type of the function $x_{1}$ ? Well, this depends on the type of $x_{1}$ itself. In particular, if instead of an intersection type $x_{1}$ is typed by a union type (e.g., when the function bound to $x_{1}$ is the result of a branching expression), then the test may give us information about the type of the function in the various branches. So for instance if in the expression in (4) $x_{1}$ is of type, say, $(s_{1}\to t)\vee(s_{2}\to\neg t)$ , then we can assume for the expression (4) that $x_{1}$ has type $(s_{1}\to t)$ in the branch “then” and $(s_{2}\to\neg t)$ in the branch “else”. As a more concrete example, if $x_{1}:(\text{{Int}}{\vee}\text{{String}}\to\text{{Int}})\vee(\text{{Bool}}{\vee}\text{{String}}\to\text{{Bool}})$ and $x_{1}x_{2}$ is well-typed, then we can deduce for

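$\texttt{(}x_{1}x_{2}\in\text{{Int}}\texttt{)?}(x_{1}(x_{1}x_{2})+42)\texttt{:}\textsf{not}(x_{1}(x_{1}x_{2}))$        (7)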

the type $\text{{Int}}\vee\text{{Bool}}$ : in the “then” branch $x_{1}$ has type $\text{{Int}}{\vee}\text{{String}}\to\text{{Int}}$ and $x_{1}x_{2}$ is of type Int; in the “else” branch $x_{1}$ has type $\text{{Bool}}{\vee}\text{{String}}\to\text{{Bool}}$ and $x_{1}x_{2}$ is of type Bool.

Let us recap. If $e$ is an expression of type $t_{0}$ and we are trying to type $\texttt{(}e\in t\texttt{)?}e_{1}\texttt{:}e_{2}$ , then we can assume that $e$ has type $t_{0}\wedge t$ when typing $e_{1}$ and type $t_{0}\setminus t$ when typing $e_{2}$ . If furthermore $e$ is of the form $e^{\prime}e^{\prime\prime}$ , then we may also be able to specialize the types for $e^{\prime}$ (in particular if its static type is a union of arrows) and for $e^{\prime\prime}$ (in particular if the static type of $e^{\prime}$ is an intersection of arrows). Additionally, we can repeat the reasoning for all subterms of $e^{\prime}$ and $e^{\prime\prime}$ as long as they are applications, and deduce distinct types for all subexpressions of $e$ that form applications. How to do it precisely—not only for applications, but also for other terms such as pairs, projections, records etc—is explained in the rest of the paper but the key ideas are pretty simple and are presented next.

1.2 Key ideas

First of all, in a strict language we can consider a type as denoting the set of values of that type and subtyping as set-containment of the denoted values. Imagine we are testing whether the result of an application $e_{1}e_{2}$ is of type $t$ or not, and suppose we know that the static types of $e_{1}$ and $e_{2}$ are $t_{1}$ and $t_{2}$ respectively. If the application $e_{1}e_{2}$ is well typed, then there is a lot of useful information that we can deduce from it: first, that $t_{1}$ is a functional type (i.e., it denotes a set of well-typed $\lambda$ -abstractions, the values of functional type) whose domain, denoted by $\textsf{dom}(t_{1})$ , is a type denoting the set of all values that are accepted by any function in $t_{1}$ ; second, that $t_{2}$ must be a subtype of the domain of $t_{1}$ ; third, we also know the type of the application, that is, the type that denotes all the values that may result from the application of a function in $t_{1}$ to an argument in $t_{2}$ , a type that we denote by $t_{1}\circ t_{2}$ . For instance, if $t_{1}=\text{{Int}}\to\text{{Bool}}$ and $t_{2}=\text{{Int}}$ , then $\textsf{dom}(t_{1})=\text{{Int}}$ and $t_{1}\circ t_{2}=\text{{Bool}}$ . Notice that introducing operations such as $\textsf{dom}()$ and $\circ$ is redundant when working with simple types, but becomes necessary in the presence of set-theoretic types. If for instance $t_{1}$ is the type of (1), that is, $t_{1}=(\text{{Int}}{\to}\text{{Int}})\wedge(\text{{String}}{\to}\text{{String}})$ , then $\textsf{dom}(t_{1})=\text{{Int}}\vee\text{{String}}$ , that is the union of all the possible input types, while the precise return type of such a function depends on the type of the argument the function is applied to: either an integer, or a string, or both (i.e., the union type $\text{{Int}}\vee\text{{String}}$ ). 
So we have $t_{1}\circ\text{{Int}}=\text{{Int}}$ , $t_{1}\circ\text{{String}}=\text{{String}}$ , and $t_{1}\circ(\text{{Int}}\vee\text{{String}})=\text{{Int}}\vee\text{{String}}$ (see Section 2.6.1 for the formal definition of $\circ$ ).
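The two operators can be sketched on a toy model in which a type is a finite set of atoms and a function type is an intersection of arrows. The names Ty, Arrow, Fun, domain and applyType are hypothetical, and the sketch assumes that the arrows have pairwise disjoint domains, which holds for the examples above but not in general; the formal definition is the one referenced in Section 2.6.1.

```typescript
type Ty = string[];                   // a type = list of atom names (toy model)
interface Arrow { dom: Ty; cod: Ty }  // one arrow s -> t
type Fun = Arrow[];                   // an intersection of arrows

const union = (a: Ty, b: Ty): Ty =>
  a.concat(b.filter(x => a.indexOf(x) < 0)).sort();
const overlaps = (a: Ty, b: Ty): boolean => a.some(x => b.indexOf(x) >= 0);
const subtype = (a: Ty, b: Ty): boolean => a.every(x => b.indexOf(x) >= 0);

// dom(t1): the union of the domains of all the arrows.
function domain(f: Fun): Ty {
  return f.reduce<Ty>((acc, a) => union(acc, a.dom), []);
}

// t1 ∘ s, assuming pairwise disjoint arrow domains and s <= dom(t1):
// the union of the codomains of the arrows whose domain meets s.
function applyType(f: Fun, s: Ty): Ty {
  if (!subtype(s, domain(f))) throw new Error("argument type outside the domain");
  return f
    .filter(a => overlaps(a.dom, s))
    .reduce<Ty>((acc, a) => union(acc, a.cod), []);
}

// (Int -> Int) ∧ (String -> String), the type of foo
const fooType: Fun = [
  { dom: ["Int"], cod: ["Int"] },
  { dom: ["String"], cod: ["String"] },
];

console.log(domain(fooType));                       // dom(t1)
console.log(applyType(fooType, ["Int"]));           // t1 ∘ Int
console.log(applyType(fooType, ["String"]));        // t1 ∘ String
console.log(applyType(fooType, ["Int", "String"])); // t1 ∘ (Int ∨ String)
```

On this model the three applications evaluate to Int, String and Int ∨ String, matching the equalities stated above.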

What we want to do is to refine the types of $e_{1}$ and $e_{2}$ (i.e., $t_{1}$ and $t_{2}$ ) for the cases where the test that $e_{1}e_{2}$ has type $t$ succeeds or fails. Let us start with refining the type $t_{2}$ of $e_{2}$ for the case in which the test succeeds. Intuitively, we want to remove from $t_{2}$ all the values for which the application will surely return a result not in $t$ , thus making the test fail. Consider $t_{1}$ and let $s$ be the largest subtype of $\textsf{dom}(t_{1})$ such that

$t_{1}\circ s\leq\neg t$        (8)

In other terms, $s$ contains all the legal arguments that make any function in $t_{1}$ return a result not in $t$ . Then we can safely remove from $t_{2}$ all the values in $s$ or, equivalently, keep in $t_{2}$ all the values of $\textsf{dom}(t_{1})$ that are not in $s$ . Let us implement the second viewpoint: the set of all elements of $\textsf{dom}(t_{1})$ for which an application does not surely give a result in $\neg t$ is denoted $t_{1}\mathop{\,\sqdot\,}t$ (read, “ $t_{1}$ worra $t$ ”) and defined as $\min\{u\leq\textsf{dom}(t_{1})~|~t_{1}\circ(\textsf{dom}(t_{1})\setminus u)\leq\neg t\}$ : it is easy to see that according to this definition $\textsf{dom}(t_{1})\setminus(t_{1}\mathop{\,\sqdot\,}t)$ is the largest subset of $\textsf{dom}(t_{1})$ satisfying (8). Then we can refine the type of $e_{2}$ for when the test is successful by using the type $t_{2}\wedge(t_{1}\mathop{\,\sqdot\,}t)$ : we intersect all the possible results of $e_{2}$ , that is $t_{2}$ , with the elements of the domain that may yield a result in $t$ , that is $t_{1}\mathop{\,\sqdot\,}t$ . When the test fails, the type of $e_{2}$ can be refined in a similar way just by replacing $t$ by $\neg t$ : we get the refined type $t_{2}\wedge(t_{1}\mathop{\,\sqdot\,}\neg t)$ . To sum up, to refine the type of an argument in the test of an application, all we need is to define $t_{1}\mathop{\,\sqdot\,}t$ , the set of arguments that when applied to a function of type $t_{1}$ may return a result in $t$ ; then we can refine the type of $e_{2}$ as $t_{2}^{+}\stackrel{\text{def}}{=}t_{2}\wedge(t_{1}\mathop{\,\sqdot\,}t)$ in the “then” branch (we call it the positive branch) and as $t_{2}^{-}\stackrel{\text{def}}{=}t_{2}\wedge(t_{1}\mathop{\,\sqdot\,}\neg t)$ in the “else” branch (we call it the negative branch). 
As a side remark, note that the set $t_{1}\mathop{\,\sqdot\,}t$ is different from the set of elements that return a result in $t$ (though it is a supertype of it). To see that, consider for $t$ the type String and for $t_{1}$ the type $(\text{{Bool}}\to\text{{Bool}})\wedge(\text{{Int}}\to(\text{{String}}\vee\text{{Int}}))$, that is, the type of functions that when applied to a Boolean return a Boolean and when applied to an integer return either an integer or a string; then we have that $\textsf{dom}(t_{1})=\text{{Int}}\vee\text{{Bool}}$ and $t_{1}\mathop{\,\sqdot\,}\text{{String}}=\text{{Int}}$, but there is no (non-empty) type that ensures that an application of a function in $t_{1}$ will surely yield a String result.
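The side remark can be checked mechanically on a toy model. The sketch below is only an illustration under strong assumptions that are not part of the formal development: a type is a finite set of indivisible base-type names, a function type is an intersection of arrows given as a list of (domain, codomain) pairs, and the helper names `dom`, `apply_to_atom`, `worra` and `surely` are hypothetical.

```python
# Toy model (an assumption for illustration): a type is a frozenset of
# indivisible base-type names; a function type is a list of arrows
# (domain, codomain) read as their intersection.
UNIVERSE = frozenset({"Int", "Bool", "String"})
INT, BOOL, STRING = (frozenset({n}) for n in ("Int", "Bool", "String"))

def dom(arrows):
    """Domain of an intersection of arrows: the union of the arrow domains."""
    return frozenset().union(*(s for s, _ in arrows))

def apply_to_atom(arrows, atom):
    """Result type of applying the function type to one base type: the
    intersection of the codomains of every arrow that accepts it."""
    result = UNIVERSE
    for s, t in arrows:
        if atom in s:
            result &= t
    return result

def worra(arrows, t):
    """Arguments in the domain whose application MAY return a result in t."""
    return frozenset(a for a in dom(arrows) if apply_to_atom(arrows, a) & t)

def surely(arrows, t):
    """Arguments in the domain whose application can ONLY return a result in t."""
    return frozenset(a for a in dom(arrows) if apply_to_atom(arrows, a) <= t)

# t1 = (Bool -> Bool) /\ (Int -> (String \/ Int)), tested against t = String
t1 = [(BOOL, BOOL), (INT, STRING | INT)]
t2 = INT | BOOL  # hypothetical static type of the argument

print(sorted(dom(t1)))                            # ['Bool', 'Int']
print(sorted(worra(t1, STRING)))                  # ['Int']
print(sorted(surely(t1, STRING)))                 # []
print(sorted(t2 & worra(t1, STRING)))             # ['Int']  (test succeeds)
print(sorted(t2 & worra(t1, UNIVERSE - STRING)))  # ['Bool', 'Int']  (test fails)
```

In this model the argument is refined to Int when the test succeeds, while nothing can be removed when it fails, since an integer argument may still produce an integer.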

Once we have determined $t_{2}^{+}$ , it is then not very difficult to refine the type $t_{1}$ for the positive branch, too. If the test succeeded, then we know two facts: first, that the function was applied to a value in $t_{2}^{+}$ and, second, that the application did not diverge and returned a result in $t$ . Therefore, we can exclude from $t_{1}$ all the functions that, when applied to an argument in $t_{2}^{+}$ , yield a result not in $t$ . It can be obtained simply by removing from $t_{1}$ the functions in $t_{2}^{+}\to\neg t$ , that is, we refine the type of $e_{1}$ in the “then” branch as $t_{1}^{+}=t_{1}\setminus(t_{2}^{+}\to\neg t)$ . Note that this also removes functions diverging on $t_{2}^{+}$ arguments. In particular, the interpretation of a type $t\to s$ is the set of all functions that when applied to an argument of type $t$ either diverge or return a value in $s$ . As such the interpretation of $t\to s$ contains all the functions that diverge (at least) on $t$ . Therefore removing $t\to s$ from a type $u$ removes from $u$ not only all the functions that when applied to a $t$ argument return a result in $s$ , but also all the functions that diverge on $t$ . Ergo $t_{1}\setminus(t_{2}^{+}\to\neg t)$ removes, among others, all functions in $t_{1}$ that diverge on $t_{2}^{+}$ . Let us see all this on our example (7), in particular, by showing how this technique deduces that the type of $x_{1}$ in the positive branch is (a subtype of) $\text{{Int}}{\vee}\text{{String}}\to\text{{Int}}$ . Take the static type of $x_{1}$ , that is $(\text{{Int}}{\vee}\text{{String}}\to\text{{Int}})\vee(\text{{Bool}}{\vee}\text{{String}}\to\text{{Bool}})$ and intersect it with $\lnot(t_{2}^{+}\to\neg t)$ , that is, $\neg(\text{{String}}\to\neg\text{{Int}})$ . Since intersection distributes over unions we obtain

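$\big((\text{{Int}}{\vee}\text{{String}}\to\text{{Int}})\wedge\neg(\text{{String}}\to\neg\text{{Int}})\big)\;\vee\;\big((\text{{Bool}}{\vee}\text{{String}}\to\text{{Bool}})\wedge\neg(\text{{String}}\to\neg\text{{Int}})\big)$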

and since $(\text{{Bool}}{\vee}\text{{String}}{\to}\text{{Bool}})\wedge\neg(\text{{String}}{\to}\neg\text{{Int}})$ is empty (because $\text{{String}}\to\neg\text{{Int}}$ contains $\text{{Bool}}{\vee}\text{{String}}\to\text{{Bool}}$ ), then what we obtain is the left summand, a strict subtype of $(\text{{Int}}{\vee}\text{{String}})\to\text{{Int}}$ , namely the functions of type $\text{{Int}}{\vee}\text{{String}}{\to}\text{{Int}}$ minus those that diverge on all String arguments.

This is essentially what we formalize in Section 2, in the type system by the rule [PAppL] and in the typing algorithm with the case (20) of the definition of the function Constr.
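The emptiness argument used for the right summand can be replayed on a toy model. The sketch below assumes that types are finite sets of indivisible base-type names and uses the standard characterization of subtyping between two single arrows (contravariant domains, covariant codomains, with every function belonging to an arrow whose domain is empty); the name `arrow_leq` is hypothetical.

```python
# Toy model (an assumption for illustration): a type is a frozenset of
# indivisible base-type names, an arrow is a (domain, codomain) pair.
UNIVERSE = frozenset({"Int", "Bool", "String"})
INT, BOOL, STRING = (frozenset({n}) for n in ("Int", "Bool", "String"))

def neg(t):
    return UNIVERSE - t

def arrow_leq(a, b):
    """Decide (s1 -> t1) <= (s2 -> t2) for two single arrows."""
    (s1, t1), (s2, t2) = a, b
    return not s2 or (s2 <= s1 and t1 <= t2)

left = (INT | STRING, INT)      # Int \/ String -> Int
right = (BOOL | STRING, BOOL)   # Bool \/ String -> Bool
removed = (STRING, neg(INT))    # t2+ -> not t, with t2+ = String and t = Int

# A summand intersected with the negation of `removed` is empty
# exactly when the summand is a subtype of `removed`.
print(arrow_leq(right, removed))  # True: the right summand disappears
print(arrow_leq(left, removed))   # False: the left summand survives
```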

1.3 Technical challenges

In the previous section we outlined the main ideas of our approach to occurrence typing. However, the devil is in the details. So the formalization we give in Section 2 is not as smooth as we just outlined: we must introduce several auxiliary definitions to handle some corner cases. This section presents, through small examples, the main technical difficulties we had to overcome and the definitions we introduced to handle them. As such it provides a kind of road-map for the technicalities of Section 2.

Typing occurrences

As should be clear by now, not only variables but also generic expressions are given different types in the “then” and “else” branches of type tests. For instance, in (6) the expression $x_{1}x_{2}$ has type Int in the positive branch and type Bool in the negative one. In this specific case it is possible to deduce these typings from the refined types of the variables (in particular, thanks to the fact that $x_{2}$ has type Int in the positive branch and Bool in the negative one), but this is not possible in general. For instance, consider $x_{1}:\text{{Int}}\to(\text{{Int}}\vee\text{{Bool}})$, $x_{2}:\text{{Int}}$, and the expression

[TABLE]

It is not possible to specialize the type of the variables in the branches. Nevertheless, we want to be able to deduce that $x_{1}x_{2}$ has type Int in the positive branch and type Bool in the negative one. In order to do so in Section 2 we will use special type environments that map not only variables but also generic expressions to types. So to type, say, the positive branch of (9) we extend the current type environment with the hypothesis that the expression $x_{1}x_{2}$ has type Int.

When we test the type of an expression we try to deduce the type of some subexpressions occurring in it. Therefore we must cope with subexpressions occurring multiple times. A simple example is given by using product types and pairs as in $\texttt{(}(x,x)\in t_{1}\times t_{2}\texttt{)?}e_{1}\texttt{:}e_{2}$. It is easy to see that the positive branch $e_{1}$ is selected only if $x$ has both type $t_{1}$ and type $t_{2}$, and to deduce from this that $x$ must be typed in $e_{1}$ by their intersection, $t_{1}\wedge t_{2}$. To deal with multiple occurrences of the same subexpression the type inference system of Section 2 will use the classic rule for introducing intersections [Inter], while the algorithmic counterpart will use the operator $\textsf{{Refine}}()$ that intersects the static type of an expression with all the types deduced for the multiple occurrences of it.
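The treatment of multiple occurrences can be pictured with a few lines of code. This is only a sketch under the assumption that types are finite sets of base-type names; the function `refine` is a hypothetical stand-in for the idea of intersecting the static type with every type deduced for an occurrence, not the operator defined in Section 2.

```python
from functools import reduce

# Toy model (an assumption for illustration): a type is a frozenset of
# indivisible base-type names.
UNIVERSE = frozenset({"Int", "Bool", "String"})

def refine(static_type, deduced_types):
    """Intersect the static type with each type deduced for an occurrence."""
    return reduce(frozenset.intersection, deduced_types, static_type)

# Test ((x, x) in t1 x t2): the first occurrence of x yields t1, the second t2.
t1 = frozenset({"Int", "Bool"})
t2 = frozenset({"Bool", "String"})
print(sorted(refine(UNIVERSE, [t1, t2])))  # ['Bool']
```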

Type preservation

We want our type system to be sound in the sense of Wright1994, that is, that it satisfies progress and type preservation. The latter property is challenging because, as explained just above, our type assumptions are not only about variables but also about expressions. Two corner cases are particularly difficult. The first is shown by the following example

[TABLE]

If $e$ is an expression of type $\text{{Int}}\to t$ , then, as discussed before, the positive branch will have type $(\text{{Int}}\to t)\setminus(\text{{Int}}\to\neg\text{{Bool}})$ . If furthermore the negative branch is of the same type (or of a subtype), then this will also be the type of the whole expression in (10). Now imagine that the application $e(42)$ reduces to a Boolean value, then the whole expression in (10) reduces to $e$ ; but this has type $\text{{Int}}\to t$ which, in general, is not a subtype of $(\text{{Int}}\to t)\setminus(\text{{Int}}\to\neg\text{{Bool}})$ , and therefore type is not preserved by the reduction. To cope with this problem, the proof of type preservation (see LABEL:app:subject-reduction) resorts to type schemes, a technique introduced by Frisch2008 to type expressions by sets of types, so that the expression in (10) will have both the types at issue.

The second corner case is a modification of the example above where the positive branch is $e(42)$, e.g., $\texttt{(}e(42)\in\text{{Bool}}\texttt{)?}e(42)\texttt{:}\textsf{true}$. In this case the type deduced for the whole expression is Bool, while after reduction we would obtain the expression $e(42)$ which is not of type Bool but of type $t$ (even though it will eventually reduce to a Bool). This problem will be handled in the proof of type preservation by considering parallel reductions (e.g., if $e(42)$ reduces in a step to, say, false, then $\texttt{(}e(42)\in\text{{Bool}}\texttt{)?}e(42)\texttt{:}\textsf{true}$ reduces in one step to $\texttt{(}\textsf{false}\in\text{{Bool}}\texttt{)?}\textsf{false}\texttt{:}\textsf{true}$): see LABEL:app:parallel.

Interdependence of checks

The last class of technical problems arises from the mutual dependence of different type checks. In particular, there are two cases that pose a problem. The first can be shown by two functions $f$ and $g$ both of type $(\text{{Int}}\to\text{{Int}})\wedge(\MyMathBb{1}\to\text{{Bool}})$, $x$ of type $\MyMathBb{1}$ and the test:

[TABLE]

If we independently check $f\,x$ against Int and $g\,x$ against Bool we deduce Int for the first occurrence of $x$ and $\MyMathBb{1}$ for the second. Thus we would type the positive branch of (11) under the hypothesis that $x$ is of type Int. But if we use the hypothesis generated by the test of $f\,x$ , that is, that $x$ is of type Int, to check $g\,x$ against Bool, then the type deduced for $x$ is $\MyMathBb{0}$ —i.e., the branch is never selected. In other words, we want to produce type environments for occurrence typing by taking into account all the available hypotheses, even when these hypotheses are formulated later in the flow of control. This will be done in the type systems of Section 2 by the rule [Path] and will require at algorithmic level to look for a fix-point solution of a function, or an approximation thereof.

Finally, a nested check may help refining the type assumptions on some outer expressions. For instance, when typing the positive branch $e$ of

[TABLE]

we can assume that the expression $(x,y)$ is of type $(\text{{Int}}\vee\text{{Bool}})\times\text{{Int}}$ and put it in the type environment. But if in $e$ there is a test like $\texttt{(}x\in\text{{Int}}\texttt{)?}{\color[rgb]{0.8,0.2,0.2}(x,y)}\texttt{:}(...)$ then we do not want to use the assumption in the type environment to type the expression $(x,y)$ occurring in the inner test (in red). Instead we want to give to that occurrence of the expression $(x,y)$ the type $\text{{Int}}\times\text{{Int}}$. This will be done by temporarily removing the type assumption about $(x,y)$ from the type environment and by retyping the expression without that assumption (see rule [EnvA] in Section 2.6.3).

Outline

In Section 2 we formalize the ideas we just presented: we define the types and expressions of our system, their dynamic semantics and a type system that implements occurrence typing together with the algorithms that decide whether an expression is well typed or not. Section 3 extends our formalism to record types and presents two applications of our analysis: the inference of arrow types for functions and a static analysis to reduce the number of casts inserted by a compiler of a gradually-typed language. Practical aspects are discussed in Section 4 where we give several paradigmatic examples of code typed by our prototype implementation, that can be interactively tested at https://occtyping.github.io/. Section LABEL:sec:related presents related work. A discussion of future work concludes this presentation. To ease the presentation all the proofs are omitted from the main text and can be found in the appendix.

Contributions

The main contributions of our work can be summarized as follows:

• We provide a theoretical framework to refine the type of expressions occurring in type tests, thus removing the limitations of current occurrence typing approaches which require both the tests and the refinement to take place on variables.

• We define a type-theoretic approach alternative to the current flow-based approaches. As such it provides different results and it can thus be profitably combined with flow-based techniques.

• We use our analysis for defining a formal framework that reconstructs intersection types for unannotated or partially-annotated functions, something that, to our knowledge, no other current system can do.

• We prove the soundness of our system. We define algorithms to infer the types, prove them sound, and show different completeness results which in practice yield the completeness of any reasonable implementation.

• We show how to extend our approach to records with field addition, update, and deletion operations.

• We show how occurrence typing can be extended to and combined with gradual typing and apply our results to optimize the compilation of the latter.

We end this introduction by stressing the practical implications of our work: a perfunctory inspection may give the wrong impression that the only interest of the heavy formalization that follows is to have generic expressions, rather than just variables, in type cases: this would be a bad trade-off. The important point is, instead, that our formalization is what makes analyses such as those presented in Section 3 possible (e.g., the reconstruction of the type (2) for the unannotated pure JavaScript code of foo), which is where the actual added practical value and potential of our work resides.

2 Language

In this section we formalize the ideas we outlined in the introduction. We start with the definition of types, followed by the language and its reduction semantics. The static semantics is the core of our work: we first present a declarative type system that deduces (possibly many) types for well-typed expressions and then the algorithms to decide whether an expression is well typed or not.

2.1 Types

Definition 2.1 (Types).

The set of types Types is formed by the terms $t$ coinductively produced by the grammar:

[TABLE]

and that satisfy the following conditions

• (regularity) every term has a finite number of different sub-terms;

• (contractivity) every infinite branch of a term contains an infinite number of occurrences of the arrow or product type constructors.

We use the following abbreviations: $t_{1}\land t_{2}\stackrel{\text{def}}{=}\neg(\neg t_{1}\vee\neg t_{2})$, $t_{1}\setminus t_{2}\stackrel{\text{def}}{=}t_{1}\wedge\neg t_{2}$, $\MyMathBb{1}\stackrel{\text{def}}{=}\neg\MyMathBb{0}$. $b$ ranges over basic types (e.g., Int, Bool), $\MyMathBb{0}$ and $\MyMathBb{1}$ respectively denote the empty type (that types no value) and the top type (that types all values). Coinduction accounts for recursive types and the condition on infinite branches rules out ill-formed types such as $t=t\lor t$ (which does not carry any information about the set denoted by the type) or $t=\neg t$ (which cannot represent any set). It also ensures that the binary relation $\vartriangleright\,\subseteq\!\textbf{{Types}}{\times}\textbf{{Types}}$ defined by $t_{1}\lor t_{2}\vartriangleright t_{i}$, $t_{1}\land t_{2}\vartriangleright t_{i}$, $\neg t\vartriangleright t$ is Noetherian. This gives an induction principle on Types that we will use without any further explicit reference to the relation. (In a nutshell, we can do proofs by induction on the structure of unions and negations, and thus intersections, but arrows, products, and basic types are the base cases for the induction.) We refer to $b$, $\times$, and $\to$ as type constructors and to $\lor$, $\land$, $\lnot$, and $\setminus$ as type connectives.
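The abbreviations can be illustrated by reading types as plain sets. The sketch below assumes a small finite universe of "values" (an assumption made only to keep the example executable) and derives intersection, difference, and the top type from union and negation exactly as in the abbreviations; all function names are hypothetical.

```python
# Toy model (an assumption for illustration): a type is a set of values
# drawn from a small finite universe.
UNIVERSE = frozenset(range(6))
EMPTY = frozenset()                      # the empty type 0

def neg(t):
    return UNIVERSE - t

def union(t1, t2):
    return t1 | t2

def inter(t1, t2):
    # t1 ∧ t2 is defined as ¬(¬t1 ∨ ¬t2)
    return neg(union(neg(t1), neg(t2)))

def diff(t1, t2):
    # t1 \ t2 is defined as t1 ∧ ¬t2
    return inter(t1, neg(t2))

ANY = neg(EMPTY)                         # the top type 1 = ¬0

def leq(t1, t2):
    # subtyping read as set containment
    return t1 <= t2

evens = frozenset({0, 2, 4})
small = frozenset({0, 1, 2})
print(sorted(inter(evens, small)))                        # [0, 2]
print(sorted(diff(evens, small)))                         # [4]
print(leq(inter(evens, small), evens), ANY == UNIVERSE)   # True True
```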

The subtyping relation for these types, noted $\leq$, is the one defined by Frisch2008, and a detailed description of the algorithm to decide this relation can be found in [Cas15]. For the reader’s convenience we succinctly recall the definition of the subtyping relation in the next subsection, but it is possible to skip this subsection at first reading and jump directly to Subsection 2.3, since to understand the rest of the paper it suffices to consider that types are interpreted as sets of values (i.e., either constants, $\lambda$-abstractions, or pairs of values: see Section 2.3 right below) that have that type, and that subtyping is set containment (i.e., a type $s$ is a subtype of a type $t$ if and only if $t$ contains all the values of type $s$). In particular, $s\to t$ contains all $\lambda$-abstractions that, when applied to a value of type $s$, return a result of type $t$ if their computation terminates (e.g., $\MyMathBb{0}\to\MyMathBb{1}$ is the set of all functions and $\MyMathBb{1}\to\MyMathBb{0}$ is the set of functions that diverge on every argument; actually, for every type $t$, all types of the form $\MyMathBb{0}{\to}t$ are equivalent and each of them denotes the set of all functions). Type connectives (i.e., union, intersection, negation) are interpreted as the corresponding set-theoretic operators (e.g., $s\vee t$ is the union of the values of the two types). We use $\simeq$ to denote the symmetric closure of $\leq$: thus $s\simeq t$ (read, $s$ is equivalent to $t$) means that $s$ and $t$ denote the same set of values and, as such, they are semantically the same type. All the above is formalized as follows.

2.2 Subtyping

Subtyping is defined by giving a set-theoretic interpretation of the types of Definition 2.1 into a suitable domain $\mathcal{D}$ :

Definition 2.2 (Interpretation domain [Frisch2008]).

The interpretation domain $\mathcal{D}$ is the set of finite terms $d$ produced inductively by the following grammar

[TABLE]

where $c$ ranges over the set $\mathcal{C}$ of constants and where $\Omega$ is such that $\Omega\notin\mathcal{D}$ .

The elements of $\mathcal{D}$ correspond, intuitively, to (denotations of) the results of the evaluation of expressions. In particular, in a higher-order language, the results of computations can be functions which, in this model, are represented by sets of finite relations of the form $\{(d_{1},\partial_{1}),\dots,(d_{n},\partial_{n})\}$ , where $\Omega$ (which is not in $\mathcal{D}$ ) can appear in second components to signify that the function fails (i.e., evaluation is stuck) on the corresponding input. This is implemented by using in the second projection the meta-variable $\partial$ which ranges over $\mathcal{D}_{\Omega}=\mathcal{D}\cup\{\Omega\}$ (we reserve $d$ to range over $\mathcal{D}$ , thus excluding $\Omega$ ). This constant $\Omega$ is used to ensure that $\MyMathBb{1}\to\MyMathBb{1}$ is not a supertype of all function types: if we used $d$ instead of $\partial$ , then every well-typed function could be subsumed to $\MyMathBb{1}\to\MyMathBb{1}$ and, therefore, every application could be given the type $\MyMathBb{1}$ , independently from its argument as long as this argument is typable (see Section 4.2 of [Frisch2008] for details). The restriction to finite relations corresponds to the intuition that the denotational semantics of a function is given by the set of its finite approximations, where finiteness is a restriction necessary (for cardinality reasons) to give the semantics to higher-order functions.

We define the interpretation $\llbracket t\rrbracket$ of a type $t$ so that it satisfies the following equalities, where $\mathcal{P}_{\!\textup{fin}}$ denotes the restriction of the powerset to finite subsets and $\mathbb{B}{}$ denotes the function that assigns to each basic type the set of constants of that type, so that for every constant $c$ we have $c\in\mathbb{B}(\text{b}_{c})$ (we use $\text{b}_{c}$ to denote the basic type of the constant $c$ ):

[TABLE]

We cannot take the equations above directly as an inductive definition of $\llbracket\rrbracket$ because types are not defined inductively but coinductively. However, recall that the contractivity condition of Definition 2.1 ensures that the binary relation $\vartriangleright\,\subseteq\!\textbf{{Types}}{\times}\textbf{{Types}}$ defined by $t_{1}\lor t_{2}\vartriangleright t_{i}$ , $t_{1}\land t_{2}\vartriangleright t_{i}$ , $\neg t\vartriangleright t$ is Noetherian which gives an induction principle on Types that we use combined with structural induction on $\mathcal{D}$ to give the following definition which validates these equalities.

Definition 2.3 (Set-theoretic interpretation of types [Frisch2008]).

We define a binary predicate $(d:t)$ (“the element $d$ belongs to the type $t$ ”), where $d\in\mathcal{D}$ and $t\in\textbf{{Types}}$ , by induction on the pair $(d,t)$ ordered lexicographically. The predicate is defined as follows:

[TABLE]

We define the set-theoretic interpretation $\llbracket\rrbracket:\textbf{{Types}}\to\mathcal{P}(\mathcal{D})$ as $\llbracket t\rrbracket=\{d\in\mathcal{D}\mid(d:t)\}$ .
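To make the role of $\Omega$ concrete, the membership predicate can be sketched as a small recursive function. The clauses below are our reading of the standard definition from the cited work and of the prose of this section, not a transcription of the definition above. The encoding is an assumption: constants are Python integers and Booleans, pairs are 2-tuples, finite relations are frozensets of (input, output) pairs, and the names `member`, `OMEGA` and the type classes are hypothetical. Only finite, non-recursive types are handled.

```python
from dataclasses import dataclass

OMEGA = object()  # stands for the failure result, which is not in the domain

@dataclass(frozen=True)
class Empty:
    pass

@dataclass(frozen=True)
class Basic:
    name: str

@dataclass(frozen=True)
class Arrow:
    dom: object
    cod: object

@dataclass(frozen=True)
class Prod:
    left: object
    right: object

@dataclass(frozen=True)
class Or:
    left: object
    right: object

@dataclass(frozen=True)
class Not:
    arg: object

BASIC = {
    "Int": lambda c: isinstance(c, int) and not isinstance(c, bool),
    "Bool": lambda c: isinstance(c, bool),
}

def member(d, t):
    """Toy version of the predicate (d : t) on finite domain elements."""
    if isinstance(t, Empty):
        return False
    if isinstance(t, Basic):
        return BASIC[t.name](d)
    if isinstance(t, Prod):
        return (isinstance(d, tuple) and len(d) == 2
                and member(d[0], t.left) and member(d[1], t.right))
    if isinstance(t, Arrow):
        # Each input that lies in t.dom needs an output that is not the
        # failure result and that lies in t.cod.
        return isinstance(d, frozenset) and all(
            not member(x, t.dom) or (y is not OMEGA and member(y, t.cod))
            for x, y in d)
    if isinstance(t, Or):
        return member(d, t.left) or member(d, t.right)
    if isinstance(t, Not):
        return not member(d, t.arg)
    raise TypeError(f"unknown type: {t!r}")

ANY = Not(Empty())
INT = Basic("Int")
stuck_on_1 = frozenset({(1, OMEGA)})      # a relation that fails on input 1
succ_graph = frozenset({(1, 2), (2, 3)})  # a finite approximation of successor

print(member(stuck_on_1, Arrow(Empty(), ANY)))      # True
print(member(stuck_on_1, Arrow(ANY, ANY)))          # False
print(member(succ_graph, Arrow(INT, INT)))          # True
print(member((1, True), Prod(INT, Basic("Bool"))))  # True
```

The first two lines of output mirror the discussion of $\Omega$: the relation that fails on its input belongs to the type of all functions but not to the arrow from the top type to the top type.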

Finally, we define the subtyping preorder and its associated equivalence relation as follows.

Definition 2.4 (Subtyping relation [Frisch2008]).

We define the subtyping relation $\leq$ and the subtyping equivalence relation $\simeq$ as $t_{1}\leq t_{2}\stackrel{\text{def}}{\iff}\llbracket t_{1}\rrbracket\subseteq\llbracket t_{2}\rrbracket$ and $t_{1}\simeq t_{2}\stackrel{\text{def}}{\iff}(t_{1}\leq t_{2})\mathrel{\mathsf{and}}(t_{2}\leq t_{1})$.

2.3 Syntax

The expressions $e$ and values $v$ of our language are inductively generated by the following grammars:

[TABLE]

for $j=1,2$ . In (13), $c$ ranges over constants (e.g., true, false, 1, 2, …) which are values of basic types; $x$ ranges over variables; $(e,e)$ denotes pairs and $\pi_{i}e$ their projections; $(e{\in}t)\,\texttt{{?}}\,e_{1}\,\texttt{{:}}\,e_{2}$ denotes the type-case expression that evaluates either $e_{1}$ or $e_{2}$ according to whether the value returned by $e$ (if any) has the type $t$ or not; $\lambda^{\wedge_{i\in I}s_{i}\to t_{i}}x.e$ denotes the function of parameter $x$ and body $e$ annotated with the type $\wedge_{i\in I}s_{i}\to t_{i}$ . An expression has an intersection type if and only if it has all the types that compose the intersection. Therefore, intuitively, $\lambda^{\wedge_{i\in I}s_{i}\to t_{i}}x.e$ is a well-typed expression if for all $i{\in}I$ the hypothesis that $x$ is of type $s_{i}$ implies that the body $e$ has type $t_{i}$ , that is to say, it is well typed if $\lambda^{\wedge_{i\in I}s_{i}\to t_{i}}x.e$ has type $s_{i}\to t_{i}$ for all $i\in I$ .

2.4 Dynamic semantics

The dynamic semantics is defined as a classic left-to-right call-by-value weak reduction for a $\lambda$ -calculus with pairs, enriched with specific rules for type-cases. We have the following notions of reduction:

[TABLE]

where ${\llbracket t\rrbracket}_{\mathcal{V}}$ denotes, intuitively, the set of values that have type $t$. Formally, ${\llbracket t\rrbracket}_{\mathcal{V}}=\{v\mid\exists t^{\prime}\in\textsf{{typeof}}_{\mathcal{V}}(v).\ t^{\prime}\leq t\}$ where $\textsf{{typeof}}_{\mathcal{V}}(v)$ is inductively defined as: $\textsf{{typeof}}_{\mathcal{V}}(c)\stackrel{\text{def}}{=}\{\text{b}_{c}\}$, $\textsf{{typeof}}_{\mathcal{V}}(\lambda^{\wedge_{i\in I}s_{i}\to t_{i}}x.e)\stackrel{\text{def}}{=}\{t\mid t\simeq(\wedge_{i\in I}s_{i}\to t_{i})\wedge(\wedge_{j\in J}\neg(s_{j}^{\prime}\to t_{j}^{\prime})),\ t\not\leq\MyMathBb{0}\}$, $\textsf{{typeof}}_{\mathcal{V}}((v_{1},v_{2}))\stackrel{\text{def}}{=}\textsf{{typeof}}_{\mathcal{V}}(v_{1})\times\textsf{{typeof}}_{\mathcal{V}}(v_{2})$. (This definition may look complicated but it is necessary to handle some corner cases for negated arrow types; cf. rule [Abs-] in Section 2.5. For instance, it states that $\lambda^{\text{{Int}}{\to}\text{{Int}}}x.x\in{\llbracket(\text{{Int}}{\to}\text{{Int}})\wedge\neg(\text{{Bool}}{\to}\text{{Int}})\rrbracket}_{\mathcal{V}}$.)

Contextual reductions are defined by the following evaluation contexts:

[TABLE]

As usual we denote by $\mathcal{C}[e]$ the term obtained by replacing $e$ for the hole in the context $\mathcal{C}$ and we have that $e\leadsto e^{\prime}$ implies $\mathcal{C}[e]\leadsto\mathcal{C}[e^{\prime}]$ .

2.5 Static semantics

While the syntax and reduction semantics are, on the whole, pretty standard, for what concerns the type system we will have to introduce several unconventional features that we anticipated in Section 1.3 and are at the core of our work. Let us start with the standard part, that is the typing of the functional core and the use of subtyping, given by the following typing rules:

[TABLE]

These rules are quite standard and do not need any particular explanation besides those already given in Section 2.3. Just notice that subtyping is embedded in the system by the classic [Subs] subsumption rule. Next we focus on the unconventional aspects of our system, from the simplest to the hardest.

The first unconventional aspect is that, as explained in Section 1.3, our type assumptions are about expressions. Therefore, in our rules the type environments, ranged over by $\Gamma$ , map expressions—rather than just variables—into types. This explains why the classic typing rule for variables is replaced by a more general [Env] rule defined below:

[TABLE]

The [Env] rule is coupled with the standard intersection introduction rule [Inter] which allows us to deduce for a complex expression the intersection of the types recorded by the occurrence typing analysis in the environment $\Gamma$ with the static type deduced for the same expression by using the other typing rules. This same intersection rule is also used to infer the second unconventional aspect of our system, that is, the fact that $\lambda$ -abstractions can have negated arrow types, as long as these negated types do not make the type deduced for the function empty:

[TABLE]

In Section 1.3 we explained that in order for our system to satisfy the property of type preservation, the type system must be able to deduce negated arrow types for functions—e.g. the type $(\text{{Int}}\to\text{{Int}})\wedge\neg(\text{{Bool}}\to\text{{Bool}})$ for $\lambda^{\text{{Int}}\to\text{{Int}}}x.x$ . We demonstrated this with the expression in equation (10), for which type preservation holds only if we are able to deduce for this expression the type $(\text{{Int}}\to t)\setminus(\text{{Int}}\to\neg\text{{Bool}})$ , that is, $(\text{{Int}}\to t)\wedge\neg(\text{{Int}}\to\neg\text{{Bool}})$ . But the sole rule [Abs+] above does not allow us to deduce negations of arrows for $\lambda$ -abstractions: the rule [Abs-] makes this possible. This rule ensures that given a function $\lambda^{t}x.e$ (where $t$ is an intersection type), for every type $t_{1}\to t_{2}$ , either $t_{1}\to t_{2}$ can be obtained by subsumption from $t$ or $\neg(t_{1}\to t_{2})$ can be added to the intersection $t$ . In turn this ensures that, for any function and any type $t$ either the function has type $t$ or it has type $\neg t$ (see Pet19phd for a thorough discussion on this rule). As an aside, note that this kind of deduction is already present in the system by Frisch2008 though in that system this presence was motivated by the semantics of types rather than, as in our case, by the soundness of the type system.

Rules [Abs+] and [Abs-] are not enough to deduce for $\lambda$ -abstractions all the types we wish. In particular, these rules alone are not enough to type general overloaded functions. For instance, consider this simple example of a function that applied to an integer returns its successor and applied to anything else returns true:

$\lambda^{(\text{{Int}}\to\text{{Int}})\wedge(\neg\text{{Int}}\to\text{{Bool}})}x\,.\,(x{\in}\text{{Int}})\,\texttt{{?}}\,x+1\,\texttt{{:}}\,\textsf{true}$

Clearly, the expression above is well typed, but the rule [Abs+] alone is not enough to type it. In particular, according to [Abs+] we have to prove that under the hypothesis that $x$ is of type Int the expression $((x{\in}\text{{Int}})\,\texttt{{?}}\,x+1\,\texttt{{:}}\,\textsf{true})$ is of type Int, too. That is, that under the hypothesis that $x$ has type $\text{{Int}}\wedge\text{{Int}}$ (we apply occurrence typing) the expression $x+1$ is of type Int (which holds) and that under the hypothesis that $x$ has type $\text{{Int}}\setminus\text{{Int}}$, that is $\MyMathBb{0}$ (we apply occurrence typing once more), true is of type Int (which does not hold). The problem is that we are trying to type the second case of a type-case even if we know that there is no chance that, when $x$ is bound to an integer, that case will ever be selected. The fact that it is never selected is witnessed by the presence of a type hypothesis with $\MyMathBb{0}$ type. To avoid this problem (and type the term above) we add the rule [Efq] (ex falso quodlibet) that allows the system to deduce any type for an expression that will never be selected, that is, for an expression whose type environment contains an empty assumption:

[TABLE]

Once more, this kind of deduction was already present in the system by Frisch2008 to type full fledged overloaded functions, though it was embedded in the typing rule for the type-case. Here we need the rule [Efq], which is more general, to ensure the property of subject reduction.
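The effect of [Efq] on the overloaded function above can be simulated on a toy model. The sketch below is a deliberate simplification and not the type system itself: types are finite sets of base-type names, the two branches are assumed to have fixed types (Int for $x+1$, Bool for true), and the hypothetical function `check_typecase_body` only checks each arrow of the annotation against the branches whose refined hypothesis on $x$ is non-empty.

```python
# Toy model (an assumption for illustration): a type is a frozenset of
# indivisible base-type names, an arrow is a (domain, codomain) pair.
UNIVERSE = frozenset({"Int", "Bool", "String"})
INT = frozenset({"Int"})
BOOL = frozenset({"Bool"})

def check_typecase_body(annotation, tested, then_type, else_type, efq=True):
    """Check the body (x in tested) ? e1 : e2 against every arrow s -> t of
    the annotation, where e1 has type then_type and e2 has type else_type.
    With efq=True a branch whose hypothesis on x is empty is accepted."""
    for s, t in annotation:
        branches = ((s & tested, then_type),   # positive branch: x in s /\ tested
                    (s - tested, else_type))   # negative branch: x in s \ tested
        for hypothesis, branch_type in branches:
            if efq and not hypothesis:
                continue                        # never selected: any type is fine
            if not branch_type <= t:
                return False
    return True

# (Int -> Int) /\ (not Int -> Bool), with body (x in Int) ? x + 1 : true
annotation = [(INT, INT), (UNIVERSE - INT, BOOL)]
print(check_typecase_body(annotation, INT, INT, BOOL))             # True
print(check_typecase_body(annotation, INT, INT, BOOL, efq=False))  # False
```

Without the vacuous acceptance of unreachable branches, the first arrow fails because the negative branch would have to produce an integer.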

Finally, there remains one last rule in our type system, the one that implements occurrence typing, that is, the rule for the type-case:

[TABLE]

The rule [Case] checks whether the expression $e$, whose type is being tested, is well-typed and then performs the occurrence typing analysis that produces the environments $\Gamma_{i}$ under whose hypotheses the expressions $e_{i}$ are typed. The production of these environments is represented by the judgments $\Gamma\vdash^{\texttt{Env}}_{e,(\neg)t}\Gamma_{i}$. The intuition is that when $\Gamma\vdash^{\texttt{Env}}_{e,t}\Gamma_{1}$ is provable then $\Gamma_{1}$ is a version of $\Gamma$ extended with type hypotheses for all expressions occurring in $e$, type hypotheses that can be deduced assuming that the test $e\in t$ succeeds. Likewise, $\Gamma\vdash^{\texttt{Env}}_{e,\neg t}\Gamma_{2}$ (notice the negation on $t$) extends $\Gamma$ with the hypotheses deduced assuming that $e\in\neg t$, that is, for when the test $e\in t$ fails.

All that remains to do is to show how to deduce judgments of the form $\Gamma\vdash^{\texttt{Env}}_{e,t}\Gamma^{\prime}$. For that we first define how to denote occurrences of an expression. These are identified by paths in the syntax tree of the expressions, that is, by possibly empty strings of characters denoting directions starting from the root of the tree (we use $\epsilon$ for the empty string/path, which corresponds to the root of the tree).

Let $e$ be an expression and $\varpi\in\{0,1,l,r,f,s\}^{*}$ a path; we denote $e{\downarrow}\varpi$ the occurrence of $e$ reached by the path $\varpi$ , that is (for $i=0,1$ , and undefined otherwise)

[TABLE]
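Path lookup can be sketched in a few lines. The encoding below is an assumption made for illustration: expressions are nested tuples tagged with "var", "app", "pair", "fst" and "snd", a path is a string read from the root, and the directions are interpreted as 0 and 1 for the function and argument of an application, l and r for the components of a pair, and f and s for the argument of a first or second projection. The name `occurrence` is hypothetical.

```python
def occurrence(e, path):
    """Return the sub-expression of e reached by path, or None when the
    path is undefined for e. The empty path returns e itself."""
    for step in path:
        kind = e[0]
        if kind == "app" and step in ("0", "1"):
            e = e[1 + int(step)]           # function (0) or argument (1)
        elif kind == "pair" and step in ("l", "r"):
            e = e[1] if step == "l" else e[2]
        elif (kind, step) in (("fst", "f"), ("snd", "s")):
            e = e[1]                       # argument of the projection
        else:
            return None                    # direction does not match the term
    return e

# The expression x1 (pi_1 (x2, y))
x1, x2, y = ("var", "x1"), ("var", "x2"), ("var", "y")
e = ("app", x1, ("fst", ("pair", x2, y)))

print(occurrence(e, ""))      # the whole expression
print(occurrence(e, "0"))     # ('var', 'x1')
print(occurrence(e, "1fl"))   # ('var', 'x2')
print(occurrence(e, "l"))     # None
```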

To ease our analysis we used different directions for each kind of term. So we have $0$ and $1$ for the function and argument of an application, $l$ and $r$ for the $l$eft and $r$ight expressions forming a pair, and $f$ and $s$ for the argument of a $f$irst or of a $s$econd projection. Note also that we do not consider occurrences under $\lambda$’s (since their type is frozen in their annotations) and type-cases (since they reset the analysis). The judgments $\Gamma\vdash^{\texttt{Env}}_{e,t}\Gamma^{\prime}$ are then deduced by the following two rules:

[TABLE]

These rules describe how to produce by occurrence typing the type environments while checking that an expression $e$ has type $t$. They state that $(i)$ we can deduce from $\Gamma$ all the hypotheses already in $\Gamma$ (rule [Base]) and that $(ii)$ if we can deduce a given type $t^{\prime}$ for a particular occurrence $\varpi$ of the expression $e$ being checked, then we can add this hypothesis to the produced type environment (rule [Path]). The rule [Path] uses a (last) auxiliary judgement $\vdash^{\texttt{Path}}_{\Gamma,e,t}\varpi:t^{\prime}$ to deduce the type $t^{\prime}$ of the occurrence $e{\downarrow}\varpi$ when checking $e$ against $t$ under the hypotheses $\Gamma$. This rule [Path] is subtler than it may appear at first sight, insofar as the deduction of the type for $\varpi$ may already use some hypothesis on $e{\downarrow}\varpi$ (in $\Gamma^{\prime}$) and, from an algorithmic viewpoint, this will imply the computation of a fix-point (see Section 2.6.2). The last ingredient for our type system is the deduction of the judgements of the form $\vdash^{\texttt{Path}}_{\Gamma,e,t}\varpi:t^{\prime}$ where $\varpi$ is a path to an expression occurring in $e$. This is given by the following set of rules.

[TABLE]

These rules implement the analysis described in Section 1.2 for functions and extend it to products. Let us comment on each rule in detail. [PSubs] is just subsumption for the deduction $\vdash^{\texttt{Path}}$. The rule [PInter] combined with [PTypeof] allows the system to deduce for an occurrence $\varpi$ the intersection of the static type of $e{\downarrow}\varpi$ (deduced by [PTypeof]) with the type deduced for $\varpi$ by the other $\vdash^{\texttt{Path}}$ rules. The rule [PEps] is the starting point of the analysis: if we are assuming that the test $e\in t$ succeeds, then we can assume that $e$ (i.e., $e{\downarrow}\epsilon$) has type $t$ (recall that assuming that the test $e\in t$ fails corresponds to having $\neg t$ at the index of the turnstile). The rule [PAppR] implements occurrence typing for the arguments of applications, since it states that if a function maps arguments of type $t_{1}$ to results of type $t_{2}$ and an application of this function yields results (in $t^{\prime}_{2}$) that cannot be in $t_{2}$ (since $t_{2}\land t_{2}^{\prime}\simeq\MyMathBb{0}$), then the argument of this application cannot be of type $t_{1}$. [PAppL] performs the occurrence typing analysis for the function part of an application, since it states that if an application has type $t_{2}$ and the argument of this application has type $t_{1}$, then the function in this application cannot have type $t_{1}\to\neg t_{2}$. Rules [PPair_] are straightforward since they state that the $i$-th projection of a pair that is of type $t_{1}\times t_{2}$ must be of type $t_{i}$. So are the last two rules, which essentially state that if $\pi_{1}e$ (respectively, $\pi_{2}e$) is of type $t^{\prime}$, then the type of $e$ must be of the form $t^{\prime}\times\MyMathBb{1}$ (respectively, $\MyMathBb{1}\times t^{\prime}$).

This concludes the presentation of all the rules of our type system (they are summarized for the reader’s convenience in the appendix), which satisfies the property of safety, deduced, as customary, from the properties of progress and subject reduction (see the appendix on soundness).

Theorem 2.5 (type safety).

For every expression $e$ such that $\varnothing\vdash e:t$ either $e$ diverges or there exists a value $v$ of type $t$ such that $e\leadsto^{*}v$ .

2.6 Algorithmic system

The type system we defined in the previous section implements the ideas we illustrated in the introduction and it is safe. Now the problem is to decide whether an expression is well typed or not, that is, to find an algorithm that given a type environment $\Gamma$ and an expression $e$ decides whether there exists a type $t$ such that $\Gamma\vdash e:t$ is provable. For that we need to solve essentially two problems: $(i)$ how to handle the fact that it is possible to deduce several types for the same well-typed expression and $(ii)$ how to compute the auxiliary deduction system $\vdash^{\texttt{Path}}_{\Gamma,e,t}$ for paths.

$(i)$. Multiple types have two distinct origins, each requiring a distinct technical solution. The first origin is the presence of structural rules (see footnote 8) such as [Subs] and [Inter]. We handle this presence in the classic way: we define an algorithmic system that tracks the minimum type of an expression; this system is obtained from the original system by removing the two structural rules and by distributing suitable checks of the subtyping relation in the remaining rules. To do that in the presence of set-theoretic types we need to define some operators on types, which are given in Section 2.6.1. The second origin is the rule [Abs-] by which it is possible to deduce for every well-typed lambda abstraction infinitely many types, that is, the annotation of the function intersected with as (finitely) many negations of arrow types as possible without making the type empty. We do not handle this multiplicity directly in the algorithmic system but only in the proof of its soundness by using and adapting the technique of type schemes defined by [Frisch2008]. Type schemes are canonical representations of the infinite sets of types of $\lambda$-abstractions which can be used to define an algorithmic system that can be easily proved to be sound. The simpler algorithm that we propose in this section implies (i.e., it is less precise than) the one with type schemes (cf. Lemma B.20) and it is thus sound, too. The algorithm of this section is not only simpler but, as we discuss in Section 2.6.4, is also the one that should be used in practice. This is why we preferred to present it here and relegate the presentation of the system with type schemes to B.2.1.

Footnote 8: In logic, logical rules refer to a particular connective (here, a type constructor, that is, either $\to$, or $\times$, or $b$), while identity rules (e.g., axioms and cuts) and structural rules (e.g., weakening and contraction) do not.

$(ii)$ . For what concerns the use of the auxiliary derivation for the $\Gamma\vdash^{\texttt{Env}}_{e,t}\Gamma^{\prime}$ and $\vdash^{\texttt{Path}}_{\Gamma,e,t}\varpi:t^{\prime}$ judgments, we present in Section 2.6.2 an algorithm that is sound and satisfies a limited form of completeness. All these notions are then used in the algorithmic typing system given in Section 2.6.3.

2.6.1 Operators for type constructors

In order to define the algorithmic typing of expressions like applications and projections we need to define the operators on types we used in Section 1.2. Consider the classic rule [App] for applications. It essentially does three things: $(i)$ it checks that the expression in the function position has a functional type; $(ii)$ it checks that the argument is in the domain of the function, and $(iii)$ it returns the type of the application. In systems without set-theoretic types these operations are quite straightforward: $(i)$ corresponds to checking that the expression has an arrow type, $(ii)$ corresponds to checking that the argument is in the domain of the arrow deduced for the function, and $(iii)$ corresponds to returning the codomain of that same arrow. With set-theoretic types things get more difficult, since a function can be typed by, say, a union of intersections of arrows and negations of types. Checking that the function has a functional type is easy since it corresponds to checking that its type is a subtype of $\MyMathBb{0}{\to}\MyMathBb{1}$. Determining its domain and the type of the application is more complicated and needs the operators $\textsf{dom}()$ and $\circ$ we informally described in Section 1.2, where we also introduced the operator $\mathop{\,\sqdot\,}$. These three operators are used by our algorithm and formally defined as:

[TABLE]

In short, $\textsf{dom}(t)$ is the largest domain of any single arrow that subsumes $t$ , $t\circ s$ is the smallest codomain of an arrow type that subsumes $t$ and has domain $s$ and $t\mathop{\,\sqdot\,}s$ was explained before.
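Read literally, the two prose descriptions above correspond to the following formulas. This is a transcription of the informal statement, given as a reading aid rather than as the authoritative definitions; $\MyMathBb{1}$ is the top type as elsewhere in the text, and the operator $\mathop{\,\sqdot\,}$ is deliberately not restated here. The last line is a small worked instance for the type $(\text{Int}\to\text{Int})\wedge(\text{Bool}\to\text{Bool})$.

```latex
\begin{array}{rcl}
\textsf{dom}(t) &=& \max\{\,u \mid t \leq u\to\MyMathBb{1}\,\}\\[2pt]
t\circ s &=& \min\{\,u \mid t \leq s\to u\,\}\\[6pt]
\multicolumn{3}{l}{\text{For } t=(\text{Int}\to\text{Int})\wedge(\text{Bool}\to\text{Bool}):\quad
\textsf{dom}(t)=\text{Int}\vee\text{Bool},\quad t\circ\text{Int}=\text{Int},\quad t\circ(\text{Int}\vee\text{Bool})=\text{Int}\vee\text{Bool}}
\end{array}
```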

We need similar operators for projections since the type $t$ of $e$ in $\pi_{i}e$ may not be a single product type but, say, a union of products: all we know is that $t$ must be a subtype of $\MyMathBb{1}\times\MyMathBb{1}$ . So let $t$ be a type such that $t\leq\MyMathBb{1}\times\MyMathBb{1}$ , then we define:

[TABLE]

All the operators above but $\mathop{\,\sqdot\,}$ are already present in the theory of semantic subtyping: the reader can find how to compute them in [Frisch2008, Section 6.11] (see also [Cas15, §4.4] for a detailed description). Below we just show our new formula that computes $t\mathop{\,\sqdot\,}s$ for a $t$ subtype of $\MyMathBb{0}\to\MyMathBb{1}$. For that, we use a result of semantic subtyping that states that every type $t$ is equivalent to a type in disjunctive normal form and that if furthermore $t\leq\MyMathBb{0}\to\MyMathBb{1}$, then $t\simeq\bigvee_{i\in I}\left(\bigwedge_{p\in P_{i}}(s_{p}\to t_{p})\bigwedge_{n\in N_{i}}\neg(s_{n}^{\prime}\to t_{n}^{\prime})\right)$ with $\bigwedge_{p\in P_{i}}(s_{p}\to t_{p})\bigwedge_{n\in N_{i}}\neg(s_{n}^{\prime}\to t_{n}^{\prime})\not\simeq\MyMathBb{0}$ for all $i$ in $I$. For such a $t$ and any type $s$ we then have:

[TABLE]

The formula considers only the positive arrows of each summand that forms $t$ and states that, for each summand, whenever you take a subset $P$ of its positive arrows that cannot yield results in $s$ (since $s$ does not overlap the intersection of the codomains of these arrows), then the success of the test cannot depend on these arrows and therefore the intersection of the domains of these arrows (i.e., the values that would precisely select that set of arrows) can be removed from $\textsf{dom}(t)$. The proof that this type satisfies (16) is given in the appendix.
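The subset enumeration described above can be sketched on a toy model. The model is an assumption of this sketch, not the semantic-subtyping implementation: a "type" is a finite `frozenset` of atomic tags, an arrow is a pair `(domain, codomain)`, and a function type in normal form is a list of summands, each summand being the list of its positive arrows. The names `dom`, `worra`, and `universe` are hypothetical.

```python
from itertools import combinations


def dom(summands):
    """Toy dom(t): intersection, over the summands, of the union of arrow domains."""
    unions = [frozenset().union(*(s for s, _ in arrows)) for arrows in summands]
    return frozenset.intersection(*unions)


def worra(summands, s, universe):
    """Toy reading of the formula: the part of dom(t) for which a result in `s` is possible.

    For each summand, every subset P of its arrows whose codomains, intersected
    together, do not overlap `s` has the intersection of its domains removed
    from dom(t). The results for the summands are then unioned.
    """
    d = dom(summands)
    result = frozenset()
    for arrows in summands:
        removed = frozenset()
        for k in range(len(arrows) + 1):
            for subset in combinations(arrows, k):
                # Intersection of the codomains (the whole universe when P is empty).
                codomains = universe.intersection(*(t for _, t in subset))
                if not (s & codomains):
                    # These arrows can never produce a result in s.
                    removed |= universe.intersection(*(a for a, _ in subset))
        result |= d - removed
    return result
```

With `t` standing for $(\text{Int}\to\text{Int})\wedge(\text{Bool}\to\text{Bool})$ and `s` for Int, the only subset that cannot yield a result in `s` (apart from the full set, whose domains do not overlap) is the arrow on Bool, so Bool is removed from the domain and the argument is refined to Int.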

2.6.2 Type environments for occurrence typing

The second ingredient necessary to the definition of our algorithmic systems is the algorithm for the deduction of $\Gamma\vdash^{\texttt{Env}}_{e,t}\Gamma^{\prime}$ , that is an algorithm that takes as input $\Gamma$ , $e$ , and $t$ , and returns an environment that extends $\Gamma$ with hypotheses on the occurrences of $e$ that are the most general that can be deduced by assuming that $e\,{\in}\,t$ succeeds. For that we need the notation $\textsf{{typeof}}_{\Gamma}(e)$ which denotes the type deduced for $e$ under the type environment $\Gamma$ in the algorithmic type system of Section 2.6.3. That is, $\textsf{{typeof}}_{\Gamma}(e)=t$ if and only if $\Gamma\vdash_{\!\scriptscriptstyle\mathcal{A}}e:t$ is provable.

We start by defining the algorithm for each single occurrence, that is for the deduction of $\vdash^{\texttt{Path}}_{\Gamma,e,t}\varpi:t^{\prime}$ . This is obtained by defining two mutually recursive functions Constr and Intertype:

[TABLE]

All the functions above are defined if and only if the initial path $\varpi$ is valid for $e$ (i.e., $e{\downarrow}\varpi$ is defined) and $e$ is well-typed (which implies that all $\textsf{{typeof}}_{\Gamma}(e{\downarrow}\varpi)$ in the definition are defined) (see footnote 9).

Footnote 9: Note that the definition is well-founded. This can be seen by analyzing the rule [CaseA] of Section 2.6.3: the definitions of $\textsf{{Refine}}_{e,t}(\Gamma)$ and $\textsf{{Refine}}_{e,\neg t}(\Gamma)$ use $\textsf{{typeof}}_{\Gamma}(e{\downarrow}\varpi)$, and this is defined for all $\varpi$ since the first premise of [CaseA] states that $\Gamma\vdash e:t_{0}$ (and this is possible only if we were able to deduce under the hypothesis $\Gamma$ the type of every occurrence of $e$).

Each case of the definition of the Constr function corresponds to the application of a logical rule (cf. definition in Footnote 8) in the deduction system for $\vdash^{\texttt{Path}}$: case (19) corresponds to the application of [PEps]; case (20) implements [PAppL] straightforwardly; the implementation of rule [PAppR] is subtler: instead of finding the best $t_{1}$ to subtract (by intersection) from the static type of the argument, (21) finds directly the best type for the argument by applying the $\mathop{\,\sqdot\,}$ operator to the static type of the function and the refined type of the application. The remaining cases (22–25) are the straightforward implementations of the rules [PPairL], [PPairR], [PFst], and [PSnd], respectively.

The other recursive function, Intertype, implements the two structural rules [PInter] and [PTypeof] by intersecting the type obtained for $\varpi$ by the logical rules with the static type deduced by the type system for the expression occurring at $\varpi$. The remaining structural rule, [PSubs], is accounted for by the use of the operators $\mathop{\,\sqdot\,}$ and $\bm{\pi}_{i}$ in the definition of Constr.
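For the reader's convenience, the case-by-case description above can be transcribed as follows. This is a reconstruction from the prose explanation of cases (19) to (25) and of the rules [PEps], [PAppL], [PAppR], [PPairL], [PPairR], [PFst], and [PSnd]; it should be read as a sketch of the two mutually recursive functions and not as a verbatim copy of the original display.

```latex
\begin{array}{rcll}
\textsf{Constr}_{\Gamma,e,t}(\epsilon) &=& t & (19)\\
\textsf{Constr}_{\Gamma,e,t}(\varpi.0) &=& \neg\bigl(\textsf{Intertype}_{\Gamma,e,t}(\varpi.1)\to\neg\,\textsf{Intertype}_{\Gamma,e,t}(\varpi)\bigr) & (20)\\
\textsf{Constr}_{\Gamma,e,t}(\varpi.1) &=& \textsf{typeof}_{\Gamma}(e{\downarrow}\varpi.0)\mathop{\,\sqdot\,}\textsf{Intertype}_{\Gamma,e,t}(\varpi) & (21)\\
\textsf{Constr}_{\Gamma,e,t}(\varpi.l) &=& \bm{\pi}_{1}(\textsf{Intertype}_{\Gamma,e,t}(\varpi)) & (22)\\
\textsf{Constr}_{\Gamma,e,t}(\varpi.r) &=& \bm{\pi}_{2}(\textsf{Intertype}_{\Gamma,e,t}(\varpi)) & (23)\\
\textsf{Constr}_{\Gamma,e,t}(\varpi.f) &=& \textsf{Intertype}_{\Gamma,e,t}(\varpi)\times\MyMathBb{1} & (24)\\
\textsf{Constr}_{\Gamma,e,t}(\varpi.s) &=& \MyMathBb{1}\times\textsf{Intertype}_{\Gamma,e,t}(\varpi) & (25)\\[4pt]
\textsf{Intertype}_{\Gamma,e,t}(\varpi) &=& \textsf{Constr}_{\Gamma,e,t}(\varpi)\wedge\textsf{typeof}_{\Gamma}(e{\downarrow}\varpi) &
\end{array}
```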

It remains to explain how to compute the environment $\Gamma^{\prime}$ produced from $\Gamma$ by the deduction system for $\Gamma\vdash^{\texttt{Env}}_{e,t}\Gamma^{\prime}$ . Alas, this is the most delicate part of our algorithm. In a nutshell, what we want to do is to define a function $\textsf{{Refine}}_{\_,\_}(\_)$ that takes a type environment $\Gamma$ , an expression $e$ and a type $t$ and returns the best type environment $\Gamma^{\prime}$ such that $\Gamma\vdash^{\texttt{Env}}_{e,t}\Gamma^{\prime}$ holds. By the best environment we mean the one in which the occurrences of $e$ are associated to the largest possible types (type environments are hypotheses so they are contravariant: the larger the type the better the hypothesis). Recall that in Section 1.3 we said that we want our analysis to be able to capture all the information available from nested checks. If we gave up such a kind of precision then the definition of Refine would be pretty easy: it must map each subexpression of $e$ to the intersection of the types deduced by $\vdash^{\texttt{Path}}$ (i.e., by Intertype) for each of its occurrences. That is, for each expression $e^{\prime}$ occurring in $e$ , $\textsf{{Refine}}_{e,t}(\Gamma)$ would be the type environment that maps $e^{\prime}$ into $\bigwedge_{\{\varpi~{}|~{}e{\downarrow}\varpi\equiv e^{\prime}\}}\textsf{{Intertype}}_{\Gamma,e,t}(\varpi)$ . As we explained in Section 1.3 the intersection is needed to apply occurrence typing to expressions such as $((x,x){\in}t_{1}\times t_{2})\,\texttt{{?}}\,e_{1}\,\texttt{{:}}\,e_{2}$ where some expressions—here $x$ —occur multiple times.
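The simple, non-nested version of Refine described in this paragraph can be sketched as follows. The sketch assumes a toy model in which a type is a `frozenset` of atomic tags (so intersection of types is set intersection) and in which the types computed by Intertype for each occurrence are given as input; the names `naive_refine`, `occurrence_types`, and `universe` are hypothetical.

```python
def naive_refine(env, occurrence_types, universe):
    """Map each sub-expression to the intersection of the types of its occurrences.

    `occurrence_types` is an iterable of (sub_expression, type) pairs, one pair
    per occurrence; an expression occurring several times appears several times.
    Expressions without any occurrence keep the hypothesis they had in `env`.
    """
    refined = {}
    for expr, ty in occurrence_types:
        refined[expr] = refined.get(expr, universe) & ty
    return {**env, **refined}
```

For a test such as $((x,x){\in}t_{1}\times t_{2})$, the variable $x$ occurs twice, once with type $t_{1}$ and once with type $t_{2}$, and the sketch associates it with their intersection.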

In order to capture most of the type information from nested queries the rule [Path] allows the deduction of the type of some occurrence $\varpi$ to use a type environment $\Gamma^{\prime}$ that may contain information about some suboccurrences of $\varpi$. In the algorithm this would correspond to applying the Refine defined above to an environment that already is the result of Refine, and so on. Therefore, ideally our algorithm should compute the type environment as a fixpoint of the function $X\mapsto\textsf{{Refine}}_{e,t}(X)$. Unfortunately, an iteration of Refine may not converge. As an example, consider the (dumb) expression $(xx{\in}\MyMathBb{1})\,\texttt{{?}}\,e_{1}\,\texttt{{:}}\,e_{2}$. If $x:\MyMathBb{1}\to\MyMathBb{1}$, then when refining the “then” branch, every iteration of Refine yields for $x$ a type strictly more precise than the type deduced in the previous iteration (because of the $\varpi.0$ case).

The solution we adopt in practice is to bound the number of iterations to some number $n_{o}$. This is obtained by the following definition of Refine:

[TABLE]

Note in particular that $\textsf{{Refine}}_{e,t}(\Gamma)$ extends $\Gamma$ with hypotheses on the expressions occurring in $e$ , since $\textsf{dom}(\textsf{{Refine}}_{e,t}(\Gamma))$ $=$ $\textsf{dom}(\textsf{{RefineStep}}_{e,t}(\Gamma))=\textsf{dom}(\Gamma)\cup\{e^{\prime}~{}|~{}\exists\varpi.\ e{\downarrow}\varpi\equiv e^{\prime}\}$ .

In other terms, we try to find a fixpoint of $\textsf{{RefineStep}}_{e,t}$ but we bound our search to $n_{o}$ iterations. Since $\textsf{{RefineStep}}_{e,t}$ is monotone (w.r.t. the subtyping pre-order extended to type environments pointwise), every iteration yields a better solution. While this is unsatisfactory from a formal point of view, in practice the problem is a very mild one. Divergence may happen only when refining the type of a function in an application: not only is such a refinement meaningful only when the function is typed by a union type, but we also had to build the expression that causes the divergence in quite an ad hoc way, which makes divergence even more unlikely. Setting $n_{o}$ to twice the depth of the syntax tree of the outermost type-case should be more than enough to capture all realistic cases. For instance, all examples given in Section 4 can be checked (or found to be ill-typed) with $n_{o}=1$.
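The bounded search for a fixpoint can be sketched generically. The function below is an illustration under stated assumptions: `refine_step` is any function from environments to environments standing in for $\textsf{{RefineStep}}_{e,t}$, environments are compared with `==`, and stopping early once nothing changes is a shortcut that returns the same result as applying the step exactly $n_{o}$ times. The names `refine` and `shrinking_step` are hypothetical.

```python
def refine(refine_step, env, n_o):
    """Apply `refine_step` at most `n_o` times, stopping early at a fixpoint."""
    for _ in range(n_o):
        nxt = refine_step(env)
        if nxt == env:  # fixpoint reached: further iterations change nothing
            break
        env = nxt
    return env


def shrinking_step(env):
    """Toy step: each call makes the hypothesis on "x" strictly more precise
    (drops one element) until a single element is left."""
    xs = env["x"]
    if len(xs) <= 1:
        return env
    return {**env, "x": xs - {max(xs)}}
```

With this toy step, a small bound cuts the refinement short, while a large enough bound reaches the fixpoint and stops there.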

2.6.3 Algorithmic typing rules

We now have all the definitions we need for our typing algorithm, which is defined by the following rules.

[TABLE]

The side conditions of the rules ensure that the system is syntax directed, that is, that at most one rule applies when typing a term: priority is given to [EqfA] over all the other rules and to [EnvA] over all remaining logical rules. The subsumption rule is no longer in the system; it is replaced by: $(i)$ using a union type in [CaseA], $(ii)$ checking in [AbsA] that the body of the function is typed by a subtype of the type declared in the annotation, and $(iii)$ using type operators and checking subtyping in the elimination rules [AppA,ProjA]. In particular, for [AppA] notice that it checks that the type of the function is a functional type, that the type of the argument is a subtype of the domain of the function, and then returns the result type of the application of the two types. The intersection rule is (partially) replaced by the rule [EnvA] which intersects the type deduced for an expression $e$ by occurrence typing and stored in $\Gamma$ with the type deduced for $e$ by the logical rules: this is simply obtained by removing any hypothesis about $e$ from $\Gamma$, so that the deduction of the type $t$ for $e$ cannot but end by a logical rule. Of course, this does not apply when the expression $e$ is a variable, since a hypothesis in $\Gamma$ is the only way to deduce the type of a variable, which is why the algorithm reintroduces the classic rule for variables. Finally, notice that there is no counterpart for the rule [Abs-] and that therefore it is not possible to deduce negated arrow types for functions. This means that the algorithmic system is not complete, as we discuss in detail in the next section.
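The three checks performed by [AppA] can be illustrated on a toy model. The assumptions of this sketch are: a type is a `frozenset` of atomic tags, a function type is a Python list of positive arrows `(domain, codomain)` read as their intersection, and the result of an application is computed pointwise (each atom of the argument must satisfy every arrow whose domain contains it). The names `dom`, `apply`, and `type_app` are hypothetical and the rule itself is only paraphrased.

```python
def dom(arrows):
    """Toy dom(t) for an intersection of arrows: the union of their domains."""
    return frozenset().union(*(s for s, _ in arrows))


def apply(arrows, arg, universe):
    """Toy result type of applying an intersection of arrows to `arg`."""
    result = frozenset()
    for v in arg:
        # The result for atom v lies in every codomain whose domain contains v.
        result |= universe.intersection(*(t for s, t in arrows if v in s))
    return result


def type_app(fun_type, arg_type, universe):
    """Paraphrase of [AppA]: check functional type, check domain, return result."""
    if not isinstance(fun_type, list):  # stands in for "subtype of 0 -> 1"
        raise TypeError("expression in function position is not a function")
    if not arg_type <= dom(fun_type):
        raise TypeError("argument type is not a subtype of the domain")
    return apply(fun_type, arg_type, universe)
```

For the toy function type encoding $(\text{Int}\to\text{Int})\wedge(\text{Bool}\to\text{Bool})$, an Int argument yields Int, an argument in Int $\vee$ Bool yields Int $\vee$ Bool, and an argument outside the domain is rejected.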

2.6.4 Properties of the algorithmic system

In what follows we will use $\Gamma\vdash_{\!\scriptscriptstyle\mathcal{A}}^{n_{o}}e:t$ to stress the fact that the judgment $\Gamma\vdash_{\!\scriptscriptstyle\mathcal{A}}e:t$ is provable in the algorithmic system where $\textsf{{Refine}}_{e,t}$ is defined as $(\textsf{{RefineStep}}_{e,t})^{n_{o}}$; we will omit the index $n_{o}$ (thus keeping it implicit) whenever it does not matter in the context.

The algorithmic system above is sound with respect to the deductive one of Section 2.5:

Theorem 2.6 (Soundness).

For every $\Gamma$ , $e$ , $t$ , $n_{o}$ , if $\Gamma\vdash_{\!\scriptscriptstyle\mathcal{A}}^{n_{o}}e:t$ , then $\Gamma\vdash e:t$ .

The proof of this theorem (see B.5) is obtained by defining an algorithmic system $\vdash_{\!\scriptscriptstyle\mathcal{A}_{\text{ts}}}$ that uses type schemes, that is, which associates each typable term $e$ with a possibly infinite set of types $\mathbbm{t}$ (in particular a $\lambda$ -expression $\lambda^{\wedge_{i\in I}s_{i}\to t_{i}}x.e$ will be associated to a set of types of the form $\{s~{}|~{}\exists s_{0}=\bigwedge_{i=1..n}t_{i}\to s_{i}\land\bigwedge_{j=1..m}\neg(t_{j}^{\prime}\to s_{j}^{\prime}).\ \MyMathBb{0}\not\simeq s_{0}\leq s\}$ ) and proving that, if $\Gamma\vdash_{\!\scriptscriptstyle\mathcal{A}}e:t$ then $\Gamma\vdash_{\!\scriptscriptstyle\mathcal{A}_{\text{ts}}}e:\mathbbm{t}$ with $t\in\mathbbm{t}$ : the soundness of $\vdash_{\!\scriptscriptstyle\mathcal{A}}$ follows from the soundness of $\vdash_{\!\scriptscriptstyle\mathcal{A}_{\text{ts}}}$ .

Completeness needs a more detailed explanation. The algorithmic system $\vdash_{\!\scriptscriptstyle\mathcal{A}}$ is not complete w.r.t. the language presented in Section 2.3 because it cannot deduce negated arrow types for functions. However, no practical programming language with structural subtyping would implement the full language of Section 2.3, but would rather restrict all expressions of the form $(e{\in}t)\,\texttt{{?}}\,e_{1}\,\texttt{{:}}\,e_{2}$ so that the type $t$ tested in them is either non functional (e.g., products, integer, a record type, etc.) or it is $\MyMathBb{0}\to\MyMathBb{1}$ (i.e., the expression can just test whether $e$ returns a function or not) (see footnote 10).

Footnote 10: Of course, there exist languages in which it is possible to check whether some value has a type that has functional subcomponents (e.g., to test whether an object is of some class that possesses some given methods), but that is a case of nominal rather than structural subtyping, which in our framework corresponds to testing whether a value has some basic type.

There are multiple reasons to impose such a restriction; the most important ones can be summarized as follows:

1. For explicitly-typed languages it may yield counterintuitive results, since for instance $\lambda^{\text{{Int}}\to\text{{Int}}}x.x\in\text{{Bool}}\to\text{{Bool}}$ should fail despite the fact that the identity function maps Booleans to Booleans.

2. For implicitly-typed languages it yields a semantics that depends on the inference algorithm, since $(\lambda y.(\lambda x.y))3\in 3{\to}3$ may either fail or not according to whether the type deduced for the result of the expression is either $\text{{Int}}{\to}\text{{Int}}$ or $3{\to}3$ (which are both valid but incomparable).

3. For gradually-typed languages it would yield a problematic system as we explain in Section 3.3.

Now, if we apply this restriction to the language of Section 2.3, then the algorithmic system of Section 2.6.3 is complete. Let us say that an expression $e$ is positive if it never tests a functional type more precise than $\MyMathBb{0}\to\MyMathBb{1}$ (see B.5 for the formal definition). Then we have:

Theorem 2.7 (Completeness for Positive Expressions).

For every type environment $\Gamma$ and positive expression $e$ , if $\Gamma\vdash e:t$ , then there exist $n_{o}$ and $t^{\prime}$ such that $\Gamma\vdash_{\!\scriptscriptstyle\mathcal{A}}^{n_{o}}e:t^{\prime}$ .

We can use the algorithmic system $\vdash_{\!\scriptscriptstyle\mathcal{A}_{\text{ts}}}$ defined for the proof of Theorem 2.6 to give a far more precise characterization than the above of the terms for which our algorithm is complete: positivity is a practical but rough approximation. The system $\vdash_{\!\scriptscriptstyle\mathcal{A}_{\text{ts}}}$ copes with negated arrow types, but it still is not complete essentially for two reasons: $(i)$ the recursive nature of rule [Path] and $(ii)$ the use of nested [PAppL] that yields a precision that the algorithm loses by using type schemes in the definition of Constr (case (20) is the critical one). Completeness is recovered by $(i)$ limiting the depth of the derivations and $(ii)$ forbidding nested negated arrows on the left-hand side of negated arrows.

Definition 2.8 (Rank-0 negation).

A derivation of $\Gamma\vdash e:t$ is rank-0 negated if [Abs-] never occurs in the derivation of a left premise of a [PAppL] rule.

This terminology is borrowed from the ranking of higher-order types since, intuitively, it corresponds to typing a language in which, in the types used in dynamic tests, a negated arrow never occurs on the left-hand side of another negated arrow.

Theorem 2.9 (Rank-0 Completeness).

For every $\Gamma$ , $e$ , $t$ , if $\Gamma\vdash e:t$ is derivable by a rank-0 negated derivation, then there exists $n_{o}$ such that $\Gamma\vdash_{\!\scriptscriptstyle\mathcal{A}_{\text{ts}}}^{n_{o}}e:t^{\prime}$ and $t^{\prime}\leq t$ .

This last result is only of theoretical interest since, in practice, we expect to have only languages with positive expressions. This is why for our implementation we use the library of CDuce [BCF03] in which type schemes are absent and functions are typed only by intersections of positive arrows. We present the implementation in Section 4, but before we study some extensions.

3 Extensions

As we recalled in the introduction, the main application of occurrence typing is to type dynamic languages. In this section we explore how to extend our work to encompass three features that are necessary to type these languages.

First, we consider record types and record expressions which, in dynamic languages, are used to implement objects. In particular, we extend our system to cope with typical usage patterns of objects employed in these languages such as adding, modifying, or deleting a field, or dynamically testing its presence to specify different behaviors.

Second, in order to precisely type applications in dynamic languages it is crucial to refine the type of some functions to account for their different behaviors with specific input types. But current approaches are bad at it: they require the programmer to explicitly specify a precise intersection type for these functions and, even with such specifications, some common cases fail to type (in that case the only solution is to hard-code the function and its typing discipline into the language). We show how we can use the work developed in the previous sections to infer precise intersection types for functions. In our system, these functions do not require any type annotation or just an annotation for the function parameters, whereas some of them fail to type in current alternative approaches even when they are given the full intersection type specification.

Finally, to type dynamic languages it is often necessary to make statically-typed parts of a program coexist with dynamically-typed ones. This is the aim of gradually typed systems that we explore in the third extension of this section.

3.1 Record types

The previous analysis already covers a large gamut of realistic cases. For instance, the analysis already handles list data structures, since products and recursive types can encode them as right-associative nested pairs, as is done in the language CDuce (e.g., $X=\textsf{Nil}\vee(\text{{Int}}\times X)$ is the type of the lists of integers): see Code 8 in Table 4.2 of Section 4 for a concrete example. Even more, thanks to the presence of union types it is possible to type heterogeneous lists whose content is described by regular expressions on types as proposed by [hosoya00regular]. However, this is not enough to cover records and, in particular, the specific usage patterns in dynamic languages of records, whose fields are dynamically tested, deleted, added, and modified. This is why we extend here our work to records, building on the record types as they are defined in CDuce.

The extension we present in this section is not trivial. Although we use the record types as they are defined in CDuce, we cannot do the same for CDuce record expressions. The reasons why we cannot use the record expressions of CDuce and have to define and study new ones are twofold. On the one hand we want to capture the typing of record field extension and field deletion, two operations widely used in dynamic languages; on the other hand we need to have very simple expressions formed by elementary sub-expressions, in order to limit the combinatorics of occurrence typing. For this reason we build our records one field at a time, starting from the empty record and adding, updating, or deleting single fields.

Formally, CDuce record types can be embedded in our types by adding the following two type constructors:

$\textbf{Types}\quad t~{}::=~{}\{\ell_{1}=t\ldots\ell_{n}=t,\ \_=t\}~{}|~{}\text{{Undef}}$

where $\ell$ ranges over an infinite set of labels Labels and Undef is a special singleton type whose only value is a constant undef which is not in $\mathcal{D}$ (in this respect it is a constant akin to $\Omega$): as a consequence Undef and $\MyMathBb{1}$ are distinct types, the interpretation of the former being the constant undef and the interpretation of the latter being the set of all the other values. The type $\{\ell_{1}=t_{1}\ldots\ell_{n}=t_{n},\ \_=t\}$ is a quasi-constant function that maps every $\ell_{i}$ to the type $t_{i}$ and every other $\ell\in\text{{Labels}}$ to the type $t$ (all the $\ell_{i}$’s must be distinct). Quasi-constant functions are the internal representation of record types in CDuce. These are not visible to the programmer, who can use only two specific forms of quasi-constant functions, open record types and closed record types (as for OCaml object types), provided by the following syntactic sugar (see footnote 11):

• $\boldsymbol{\texttt{\{}}\ell_{1}=t_{1},\ldots,\ell_{n}=t_{n}\boldsymbol{\texttt{\}}}$ for $\{\ell_{1}=t_{1}\ldots\ell_{n}=t_{n},\ \_=\text{{Undef}}\}$ (closed records).

• $\boldsymbol{\texttt{\{}}\ell_{1}=t_{1},\ldots,\ell_{n}=t_{n}\ {\large\textbf{..}}\boldsymbol{\texttt{\}}}$ for $\{\ell_{1}=t_{1}\ldots\ell_{n}=t_{n},\ \_=\MyMathBb{1}\vee\text{{Undef}}\}$ (open records).

plus the notation $\mathtt{\ell\operatorname{\texttt{=?}}}t$ to denote optional fields, which corresponds to using in the quasi-constant function notation the field $\ell=t\vee\text{{Undef}}$.

Footnote 11: Note that in the definitions “$\ldots{}$” is meta-syntax to denote the presence of other fields while in the open records “..” is the syntax that distinguishes them from closed ones.

For what concerns expressions, we cannot use CDuce record expressions as they are, but instead we must adapt them to our analysis. So as anticipated, we consider records that are built starting from the empty record expression {} by adding, updating, or removing fields:

[TABLE]

in particular $e\mathtt{\setminus}\ell$ deletes the field $\ell$ from $e$, $\texttt{\{}e\texttt{ with }\ell=e^{\prime}\texttt{\}}$ adds the field $\ell=e^{\prime}$ to the record $e$ (deleting any existing $\ell$ field), while $e.\ell$ is field selection with the reduction: $\texttt{\{}...,\ell=e,...\texttt{\}}.\ell\ \leadsto\ e$.

To define record type subtyping and record expression type inference we need three operators on record types: $t.\ell$ which returns the type of the field $\ell$ in the record type $t$, $t_{1}+t_{2}$ which returns the record type formed by all the fields in $t_{2}$ and those in $t_{1}$ that are not in $t_{2}$, and $t\mathtt{\setminus}\ell$ which returns the type $t$ in which the field $\ell$ is undefined. They are formally defined as follows (see [alainthesis] for more details):

[TABLE]

Then two record types $t_{1}$ and $t_{2}$ are in subtyping relation, $t_{1}\leq t_{2}$, if and only if for all $\ell\in\text{{Labels}}$ we have $t_{1}.\ell\leq t_{2}.\ell$. In particular $\boldsymbol{\texttt{\{}}{\large\textbf{..}}\boldsymbol{\texttt{\}}}$ is the largest record type.
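The quasi-constant-function view of record types, field selection, field deletion, and the field-wise subtyping check can be sketched on a toy model. The assumptions of this sketch are: a field type is a `frozenset` of atomic tags, with the extra tag `"undef"` playing the role of Undef; a record type is a pair of explicit fields and a default field type; only single record types with non-empty fields are considered (no unions). All names (`closed`, `open_`, `field`, `delete`, `is_subrecord`) are hypothetical, and the operator $+$ is left out.

```python
UNDEF = "undef"
ANY = frozenset({"int", "bool", "str"})  # toy stand-in for the top type 1
ABSENT = frozenset({UNDEF})              # toy stand-in for the type Undef


def closed(**fields):
    """Closed record: every label not listed is undefined."""
    return (fields, ABSENT)


def open_(**fields):
    """Open record: every label not listed may hold anything or be undefined."""
    return (fields, ANY | ABSENT)


def field(rec, label):
    """t.l: the type of field `label`, falling back to the default."""
    fields, default = rec
    return fields.get(label, default)


def delete(rec, label):
    """t \\ l: the same record type with field `label` undefined."""
    fields, default = rec
    return ({**fields, label: ABSENT}, default)


def is_subrecord(r1, r2):
    """Field-wise subtyping: t1.l <= t2.l for every label l.

    Labels not listed in either record all behave like the defaults, so it is
    enough to compare the defaults and the explicitly listed labels.
    """
    labels = set(r1[0]) | set(r2[0])
    return r1[1] <= r2[1] and all(field(r1, l) <= field(r2, l) for l in labels)
```

In this model every record type is a subtype of `open_()`, which mirrors the remark that the open record with no fields is the largest record type.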

Expressions are then typed by the following rules (already in algorithmic form).

[TABLE]

To extend occurrence typing to records we add the following values to paths: $\varpi\in\{\ldots,a_{\ell},u_{\ell}^{1},u_{\ell}^{2},r_{\ell}\}^{*}$ , with $e.\ell\downarrow a_{\ell}.\varpi=e{\downarrow}\varpi$ , $e\mathtt{\setminus}\ell\downarrow r_{\ell}.\varpi=e{\downarrow}\varpi$ , and $\texttt{\{}e_{1}\texttt{ with }\ell=e_{2}\texttt{\}}\downarrow u_{\ell}^{i}.\varpi=e_{i}{\downarrow}\varpi$ and add the following rules for the new paths:

[TABLE]

Deriving the algorithm from these rules is then straightforward:

$\begin{array}{ll}\textsf{{Constr}}_{\Gamma,e,t}(\varpi.a_{\ell})=\boldsymbol{\texttt{\{}}\ell=\textsf{{Intertype}}_{\Gamma,e,t}(\varpi)\ {\large\textbf{..}}\boldsymbol{\texttt{\}}}&\textsf{{Constr}}_{\Gamma,e,t}(\varpi.r_{\ell})=(\textsf{{Intertype}}_{\Gamma,e,t}(\varpi))\mathtt{\setminus}\ell+\boldsymbol{\texttt{\{}}\ell\operatorname{\texttt{=?}}\MyMathBb{1}\boldsymbol{\texttt{\}}}\\ \textsf{{Constr}}_{\Gamma,e,t}(\varpi.u_{\ell}^{2})=(\textsf{{Intertype}}_{\Gamma,e,t}(\varpi)).\ell&\textsf{{Constr}}_{\Gamma,e,t}(\varpi.u_{\ell}^{1})=(\textsf{{Intertype}}_{\Gamma,e,t}(\varpi))\mathtt{\setminus}\ell+\boldsymbol{\texttt{\{}}\ell\operatorname{\texttt{=?}}\MyMathBb{1}\boldsymbol{\texttt{\}}}\end{array}$

Notice that the effect of doing $t\mathtt{\setminus}\ell+\boldsymbol{\texttt{\{}}\ell\operatorname{\texttt{=?}}\MyMathBb{1}\boldsymbol{\texttt{\}}}$ corresponds to setting the field $\ell$ of the (record) type $t$ to the type $\MyMathBb{1}\vee\text{{Undef}}$ , that is, to the type of all undefined fields in an open record. So [PDel] and [PUpd1] mean that if we remove, add, or redefine a field $\ell$ in an expression $e$ then all we can deduce for $e$ is that its field $\ell$ is undefined: since the original field was destroyed we do not have any information on it apart from the static one. For instance, consider the test:

$\texttt{(}\texttt{\{}x\texttt{ with }a=0\texttt{\}}\in\boldsymbol{\texttt{\{}}a=\text{{Int}},b=\text{{Bool}}\ {\large\textbf{..}}\boldsymbol{\texttt{\}}}\vee\boldsymbol{\texttt{\{}}a=\text{{Bool}},b=\text{{Int}}\ {\large\textbf{..}}\boldsymbol{\texttt{\}}}\texttt{)?}x.b\texttt{:}\text{{False}}$

By $\textsf{{Constr}}_{\Gamma,e,t}(\varpi.u_{\ell}^{1})$ —i.e., by [Ext1], [PTypeof], and [PInter]—the type for $x$ in the positive branch is $((\boldsymbol{\texttt{\{}}a=\text{{Int}},b=\text{{Bool}}\ {\large\textbf{..}}\boldsymbol{\texttt{\}}}\vee\boldsymbol{\texttt{\{}}a=\text{{Bool}},b=\text{{Int}}\ {\large\textbf{..}}\boldsymbol{\texttt{\}}})\land\boldsymbol{\texttt{\{}}a=\text{{Int}}\ {\large\textbf{..}}\boldsymbol{\texttt{\}}})+\boldsymbol{\texttt{\{}}a\operatorname{\texttt{=?}}\MyMathBb{1}\boldsymbol{\texttt{\}}}$ . It is equivalent to the type $\boldsymbol{\texttt{\{}}b=\text{{Bool}}\ {\large\textbf{..}}\boldsymbol{\texttt{\}}}$ , and thus we can deduce that $x.b$ has the type Bool.
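As an informal illustration, the runtime behavior of this test can be mimicked in plain TypeScript. The sketch below is not part of the formal system: the names `Rec`, `inUnion`, and `probe` are hypothetical, Int is approximated by `number`, and the open record types are checked dynamically. It only shows that, whatever the original field `a` of `x` was, the positive branch always returns a Boolean.

```typescript
// Hypothetical helper names; Int is approximated by `number` in this sketch.
type Rec = Record<string, unknown>;

// Dynamic test for the open record type {a=Int, b=Bool ..} ∨ {a=Bool, b=Int ..}.
function inUnion(r: Rec): boolean {
  const first = typeof r["a"] === "number" && typeof r["b"] === "boolean";
  const second = typeof r["a"] === "boolean" && typeof r["b"] === "number";
  return first || second;
}

// ({x with a = 0} ∈ ...) ? x.b : false
function probe(x: Rec): unknown {
  // The update overwrites field a, so the test tells us nothing about x.a ...
  const updated: Rec = { ...x, a: 0 };
  // ... but it still constrains x.b: with a = 0 only the first summand can match.
  return inUnion(updated) ? x["b"] : false;
}

const r1 = probe({ a: "anything", b: true }); // positive branch, returns x.b
const r2 = probe({ a: true, b: 7 });          // negative branch
const r3 = probe({ b: false });               // a was undefined in x
```

In all three calls the result is a Boolean, which matches the type Bool deduced for `x.b` in the positive branch.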

3.2 Refining function types

As we explained in the introduction, both TypeScript and Flow deduce for the first definition of the function foo in (1) the type (number $\vee$ string) $\to$ (number $\vee$ string), while the more precise type

[TABLE]

can be deduced by these languages only if they are instructed to do so: the programmer has to explicitly annotate foo with the type (36). We did it in (1) using Flow; the TypeScript annotation for it is much heavier. But this seems like overkill, since a simple analysis of the body of foo in (1) shows that its execution may have two possible behaviors according to whether the parameter x has type number or not (i.e., (number $\vee$ string) $\setminus$ number, that is, string), and this should be enough for the system to deduce the type (36) even in the absence of the annotation given in (1). In this section we show how to do it by using the theory of occurrence typing we developed in the first part of the paper. In particular, we collect the different types that are assigned to the parameter of a function in its body, and use this information to partition the domain of the function and to re-type its body. Consider a more involved example in a pseudo TypeScript that uses our syntax for type-cases:

function (x : $\tau$ ) { return (x $\in$ Real) ? ((x $\in$ Int) ? x+1 : sqrt(x)) : !x; } (37)

where we assume that Int is a subtype of Real. When $\tau$ is Real $\vee$ Bool we want to deduce for this function the type $(\text{{Int}}\to\text{{Int}})\wedge(\text{{Real}}\backslash\text{{Int}}\to\text{{Real}})\wedge(\text{{Bool}}\to\text{{Bool}})$ . When $\tau$ is $\MyMathBb{1}$ , the function must be rejected (since it tries to type !x under the assumption that x has type $\neg\text{{Real}}$ ). Notice that typing the function under the hypothesis that $\tau$ is $\MyMathBb{1}$ allows us to capture user-defined discrimination as defined by THF10 since, for instance,

let is_int x = (x $\in$ Int)? true : false in if is_int z then z+1 else 42

is well typed since the function is_int is given type $(\text{{Int}}\to\text{{True}})\wedge(\neg\text{{Int}}\to\text{{False}})$ . We propose a more general approach than the one by THF10 since we allow the programmer to hint a particular type for the argument and let the system deduce, if possible, an intersection type for the function.
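For comparison, the following sketch shows how the two types discussed above have to be written by hand in today's TypeScript. It is only an approximation under stated assumptions: TypeScript has no Int type, so the arrows on Int and on Real \ Int collapse into a single overload on `number`, and the names `f`, `isInt`, and `g` are hypothetical. The overload signatures and the type predicate are exactly the annotations that the system described in this section aims to infer.

```typescript
// Overload signatures play the role of the intersection type
// (Int -> Int) ∧ (Real\Int -> Real) ∧ (Bool -> Bool).
// Assumption: both numeric arrows are merged into number -> number.
function f(x: number): number;
function f(x: boolean): boolean;
function f(x: number | boolean): number | boolean {
  if (typeof x === "number") {
    // Math.floor(x) === x is used as a stand-in for the test (x ∈ Int).
    return Math.floor(x) === x ? x + 1 : Math.sqrt(x);
  }
  return !x;
}

// User-defined discrimination: the type predicate corresponds to
// (Int -> True) ∧ (¬Int -> False), again with Int approximated by number.
function isInt(x: unknown): x is number {
  return typeof x === "number" && Math.floor(x) === x;
}

// In the positive branch z is narrowed to number, so z + 1 type-checks.
function g(z: unknown): number {
  return isInt(z) ? z + 1 : 42;
}
```

Without the overloads, `f(true)` would be given the type `number | boolean`; without the type predicate, `z + 1` would be rejected.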

We start by considering the system where $\lambda$ -abstractions are typed by a single arrow and later generalize it to the case of intersections of arrows. First, we define the auxiliary judgement $\Gamma\vdash e\triangleright\psi$ where $\Gamma$ is a typing environment, $e$ an expression and $\psi$ a mapping from variables to sets of types. Intuitively $\psi(x)$ denotes the set that contains the types of all the occurrences of $x$ in $e$ . This judgement can be deduced by the following deduction system that collects type information on the variables that are $\lambda$ -abstracted (i.e., those in the domain of $\Gamma$ , since lambdas are our only binders):

[TABLE]

where $\psi\setminus\{x\}$ is the function defined as $\psi$ but undefined on $x$ and $\psi_{1}\cup\psi_{2}$ denotes component-wise union, that is:

[TABLE]

All that remains to do is to replace the rule [Abs+] with the following rule

[TABLE]

Note the invariant that the domain of $\psi$ is always contained in the domain of $\Gamma$ restricted to variables. Simply put, this rule first collects all possible types that are deduced for a variable $x$ during the typing of the body of the $\lambda$ and then uses them to re-type the body under this new refined hypothesis for the type of $x$ . The re-typing ensures that the type safety property carries over to this new rule.

This system is enough to type our case study (37) for the case $\tau$ defined as Real $\vee$ Bool. Indeed, the analysis of the body yields $\psi(x)=\{\text{{Int}},\text{{Real}}\setminus\text{{Int}}\}$ for the branch (x $\in$ Int) ? x+1 : sqrt(x) and, since $(\text{{Bool}}\vee\text{{Real}})\setminus\text{{Real}}=\text{{Bool}}$ , yields $\psi(x)=\{\text{{Bool}}\}$ for the branch !x. So the function will be checked for the input types Int, $\text{{Real}}\setminus\text{{Int}}$ , and Bool, yielding the expected result.
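The collect-then-retype idea can be sketched on a toy model. The code below is a minimal illustration, not the deduction system of the paper: types are bitmasks over four disjoint atoms, the only expressions are the parameter, type-cases on the parameter, and applications of three hypothetical built-ins (`succ`, `sqrt`, `not`), and $\psi(x)$ is computed by recording the refined type of the parameter at each of its occurrences, following the intuition given above.

```typescript
// Types are bitmasks over four disjoint atoms (an assumption of this sketch).
type Ty = number;
const Int: Ty = 1, Frac: Ty = 2, Bool: Ty = 4, Str: Ty = 8; // Frac stands for Real \ Int
const Real: Ty = Int | Frac;
const Any: Ty = Int | Frac | Bool | Str;

// Expressions over a single parameter x.
type Expr =
  | { kind: "var" }
  | { kind: "case"; test: Ty; thenE: Expr; elseE: Expr } // (x ∈ test) ? thenE : elseE
  | { kind: "prim"; name: string; arg: Expr };

// Hypothetical built-ins, each typed by one arrow [domain, codomain].
const prims: Record<string, [Ty, Ty]> = {
  succ: [Int, Int],
  sqrt: [Real, Real],
  not: [Bool, Bool],
};

// Type of e under the hypothesis x : xTy, or null if e is ill-typed.
function typeOf(e: Expr, xTy: Ty): Ty | null {
  if (e.kind === "var") return xTy;
  if (e.kind === "case") {
    const pos = xTy & e.test, neg = xTy & ~e.test;
    // A branch whose hypothesis is empty cannot be taken and is skipped.
    const t1 = pos === 0 ? 0 : typeOf(e.thenE, pos);
    const t2 = neg === 0 ? 0 : typeOf(e.elseE, neg);
    return t1 === null || t2 === null ? null : t1 | t2;
  }
  const arrow = prims[e.name];
  const a = typeOf(e.arg, xTy);
  if (arrow === undefined || a === null) return null;
  // The argument type must be a subtype of the domain.
  return (a & ~arrow[0]) === 0 ? arrow[1] : null;
}

// psi(x): the refined types of x at its occurrences in e, under x : xTy.
function collect(e: Expr, xTy: Ty, psi: Ty[] = []): Ty[] {
  if (e.kind === "var") {
    if (psi.indexOf(xTy) < 0) psi.push(xTy);
  } else if (e.kind === "prim") {
    collect(e.arg, xTy, psi);
  } else {
    const pos = xTy & e.test, neg = xTy & ~e.test;
    if (pos !== 0) collect(e.thenE, pos, psi);
    if (neg !== 0) collect(e.elseE, neg, psi);
  }
  return psi;
}

// Re-type the body once per type in psi(x); null means the function is rejected.
function infer(body: Expr, dom: Ty): Array<[Ty, Ty]> | null {
  if (typeOf(body, dom) === null) return null;
  const arrows: Array<[Ty, Ty]> = [];
  collect(body, dom).forEach((s) => {
    const t = typeOf(body, s);
    if (t !== null) arrows.push([s, t]);
  });
  return arrows;
}

// The body of (37): (x ∈ Real) ? ((x ∈ Int) ? succ x : sqrt x) : not x
const xVar: Expr = { kind: "var" };
const body: Expr = {
  kind: "case", test: Real,
  thenE: {
    kind: "case", test: Int,
    thenE: { kind: "prim", name: "succ", arg: xVar },
    elseE: { kind: "prim", name: "sqrt", arg: xVar },
  },
  elseE: { kind: "prim", name: "not", arg: xVar },
};
const arrows = infer(body, Real | Bool); // arrows Int->Int, Frac->Real, Bool->Bool
const rejected = infer(body, Any);       // null: not x is ill-typed when x : ¬Real
```

On this toy model the two outcomes agree with the discussion of (37): three arrows when the parameter is annotated with Real $\vee$ Bool, and rejection when it is annotated with $\MyMathBb{1}$.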

It is not too difficult to generalize this rule when the lambda is typed by an intersection type:

[TABLE]

For each arrow declared in the interface of the function, we first typecheck the body of the function as usual (to check that the arrow is valid) and collect the refined types for the parameter $x$ . Then we deduce all possible output types for this refined set of input types and add the resulting arrows to the type deduced for the whole function (see Section 4 for an even more precise rule).

In summary, in order to type a function we use the type-cases on its parameter to partition the domain of the function and we type-check the function on each single partition rather than on the union thereof. Of course, we could use a much finer partition: the finest (but impossible) one is to check the function against the singleton types of all its inputs. But any finer partition would, in many cases, not return much better information, since most partitions would collapse on the same return type: type-cases on the parameter are the tipping points that are likely to make a difference, by returning different types for different partitions, thus yielding more precise typing.

Even though type cases in the body of a function are tipping points that may change the type of the result of the function, they are not the only ones: applications of overloaded functions play exactly the same role. We therefore add to our deduction system one last rule:

[OverApp] $\displaystyle\frac{\Gamma\vdash e:\textstyle\bigvee\bigwedge_{i\in I}t_{i}\to s_{i}\qquad\Gamma\vdash x:t\qquad\Gamma\vdash e\triangleright\psi_{1}\qquad\Gamma\vdash x\triangleright\psi_{2}}{\Gamma\vdash e\,x\triangleright\psi_{1}\cup\psi_{2}\cup\bigcup_{i\in I}{x\mapsto t\wedge t_{i}}}$ $(t\wedge t_{i}\not\simeq\MyMathBb{0})$

Whenever a function parameter is the argument of an overloaded function, we record as possible types for this parameter all the domains $t_{i}$ of the arrows that type the overloaded function, restricted (via intersection) by the static type $t$ of the parameter and provided that the type is not empty ( $t\wedge t_{i}\not\simeq\MyMathBb{0}$ ). We show the remarkable power of this rule on some practical examples in Section 4.
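The contribution of [OverApp] to $\psi$ can be illustrated in a few lines. This is a sketch under the assumption that types are bitmasks over disjoint atoms; the function name `overAppPsi` and the example arrows are hypothetical.

```typescript
// Types are bitmasks over disjoint atoms (an assumption of this sketch).
type Ty = number;
const Int: Ty = 1, Bool: Ty = 2, Str: Ty = 4;

// Types recorded for a parameter x : t that is passed to a function whose
// type is the intersection of the given arrows [t_i, s_i].
function overAppPsi(t: Ty, arrows: Array<[Ty, Ty]>): Ty[] {
  const psi: Ty[] = [];
  arrows.forEach(([dom]) => {
    const refined = t & dom; // t ∧ t_i
    // Side condition of the rule: keep only non-empty intersections.
    if (refined !== 0 && psi.indexOf(refined) < 0) psi.push(refined);
  });
  return psi;
}

// An overloaded function of type (Int -> Int) ∧ (Bool -> Bool) ∧ (Str -> Str)
const overloaded: Array<[Ty, Ty]> = [[Int, Int], [Bool, Bool], [Str, Str]];
// A parameter of static type Int ∨ Bool is split into Int and Bool.
const psiX = overAppPsi(Int | Bool, overloaded);
```

The domain Str is discarded because its intersection with the static type of the parameter is empty.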

3.3 Integrating gradual typing

Gradual typing is an approach proposed by siek2006gradual to combine the safety guarantees of static typing with the programming flexibility of dynamic typing. The idea is to introduce an unknown (or dynamic) type, denoted $\mathbbm{\qm}$ , used to inform the compiler that some static type-checking can be omitted, at the cost of some additional runtime checks. The use of both static typing and dynamic typing in the same program creates a boundary between the two, where the compiler automatically adds dynamic type-checks, which are often costly [takikawa2016sound], to ensure that a value crossing the barrier is correctly typed.

Occurrence typing and gradual typing are two complementary disciplines that have a lot to gain from being integrated, although we are not aware of any study in this direction. We explore this integration for the formalism of Section 2, for which the integration of gradual typing was first defined by CL17 and subsequently considerably improved by castagna2019gradual (see Lanvin21phd for a comprehensive presentation).

In a sense, occurrence typing is a discipline designed to push forward the frontiers beyond which gradual typing is needed, thus reducing the amount of runtime checks needed. For instance, the JavaScript code of (1) and (1) in the introduction can also be typed by using gradual typing:

function foo(x : $\mathbbm{\qm}$ ) { return (typeof(x) === "number")? x+1 : x.trim(); } (38)

“Standard” or “safe” gradual typing inserts two dynamic checks since it compiles the code above into:

function foo(x) { return (typeof(x) === "number")? (x $\langle$ number $\rangle$ )+1 : (x $\langle$ string $\rangle$ ).trim(); }

where $e\langle t\rangle$ is a type-cast that dynamically checks whether the value returned by $e$ has type $t$ . [Footnote 12: Intuitively, $e\langle t\rangle$ is syntactic sugar for (typeof( $e$ )===" $t$ ") ? $e$ : (throw "Type error"). Not exactly though, since to implement compilation à la sound gradual typing it is necessary to use casts on function types that need special handling.] We already saw that thanks to occurrence typing we can annotate the parameter x by number|string instead of $\mathbbm{\qm}$ and avoid the insertion of any cast. But occurrence typing can also be used on the gradually typed code above in order to statically detect the insertion of useless casts. Using occurrence typing to type the gradually-typed version of foo in (38) allows the system to avoid inserting the first cast x $\langle$ number $\rangle$ since, thanks to occurrence typing, the occurrence of x at issue is given type number (the second cast is still necessary, though). But removing only this cast is far from being satisfactory, since when this function is applied to an integer there are some casts that still need to be inserted outside the function. The reason is that the compiled version of the function has type $\mathbbm{\qm}\to$ number, that is, it expects an argument of type $\mathbbm{\qm}$ , and thus we have to apply a cast (either to the argument or to the function) whenever this is not the case. In particular, the application foo(42) will be compiled as foo(42 $\langle\mathbbm{\qm}\rangle$ ). Now, the main problem with such a cast is not that it produces some unnecessary overhead by performing useless checks (a cast to $\mathbbm{\qm}$ can easily be detected and safely ignored at runtime). The main problem is that the combination of such a cast with type-cases will lead to unintuitive results under the standard operational semantics of type-cases and casts.
Indeed, consider the standard semantics of the type-case (typeof( $e$ )===" $t$ ") which consists in reducing $e$ to a value and checking whether the type of the value is a subtype of $t$ . In standard gradual semantics, 42 $\langle\mathbbm{\qm}\rangle$ is a value. And this value is of type $\mathbbm{\qm}$ , which is not a subtype of number. Therefore the check in foo would fail for 42 $\langle\mathbbm{\qm}\rangle$ , and so would the whole function call. Although this behavior is type safe, it violates the gradual guarantee [siek2015refined] since giving a more precise type to the parameter x (such as number) would make the function succeed, as the cast to $\mathbbm{\qm}$ would not be inserted. A solution is to modify the semantics of type-cases, and in particular of typeof, to strip off all the casts in values, even nested ones. While this adds a new overhead at runtime, this is preferable to losing the gradual guarantee, and the overhead can be mitigated by having a proper representation of cast values that allows stripping all casts at once.
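The difference between the two semantics of type-cases can be made concrete with a small runtime model. The sketch below is an assumption-laden illustration and not the semantics of any actual gradual language: cast values are represented as wrapper objects, and the names `Value`, `naiveIsNumber`, `strip`, and `strippedIsNumber` are hypothetical.

```typescript
// A value is either a primitive or a value wrapped in a cast
// (assumed representation; "?" stands for the dynamic type).
type Prim = number | string | boolean;
type Value = Prim | { cast: string; inner: Value };

// Standard type-case: it inspects the cast value itself, so 42<?> is not a number.
function naiveIsNumber(v: Value): boolean {
  return typeof v === "number";
}

// Modified type-case: strip all casts, even nested ones, before testing.
function strip(v: Value): Prim {
  return typeof v === "object" ? strip(v.inner) : v;
}
function strippedIsNumber(v: Value): boolean {
  return typeof strip(v) === "number";
}

// 42 wrapped in two casts to the dynamic type.
const boxed: Value = { cast: "?", inner: { cast: "?", inner: 42 } };
const naive = naiveIsNumber(boxed);       // the test fails on the cast value
const stripped = strippedIsNumber(boxed); // the test succeeds once casts are removed
```

A representation that stores the underlying value next to the list of casts, rather than nesting wrappers, would let `strip` run in constant time, which is the kind of mitigation mentioned above.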

However, this problem gets much more complex when considering functional values. In fact, as we hinted in Section 2.6, there is no way to modify the semantics of type cases to preserve both the gradual guarantee and the soundness of the system in the presence of arbitrary type cases. For example, consider the function $f=\lambda^{(\text{{Int}}\to\text{{Int}})\to\text{{Int}}}g.(g{\in}(\text{{Int}}\to\text{{Int}}))\,\texttt{{?}}\,g\ 1\,\texttt{{:}}\,\texttt{\color[rgb]{0,0.2,0.4}true}$ . This function is well-typed since the type of the parameter guarantees that only the first branch can be taken, and thus that only an integer can be returned. However, if we apply this function to $h=(\lambda^{\mathbbm{\qm}\to\mathbbm{\qm}}x.\ x)\langle\text{{Int}}\to\text{{Int}}\rangle$ , the type case strips off the cast around $h$ (to preserve the gradual guarantee), then checks if $\lambda^{\mathbbm{\qm}\to\mathbbm{\qm}}x.\ x$ has type $\text{{Int}}\to\text{{Int}}$ . Since $\mathbbm{\qm}\to\mathbbm{\qm}$ is not a subtype of $\text{{Int}}\to\text{{Int}}$ , the check fails and the application returns true, which is unsound. Therefore, to preserve soundness in the presence of gradual types, type cases should not test functional types other than $\MyMathBb{0}\to\MyMathBb{1}$ , which is the same restriction as the one presented by siek2016recursive.

While this solves the problem of the gradual guarantee, it is clear that it would be much better if the application foo(42) were compiled as is, without introducing the cast 42 $\langle\mathbbm{\qm}\rangle$ , thus getting rid of the overhead associated with removing this cast in the type case. This is where the previous section about refining function types comes in handy. To get rid of all superfluous casts, we have to fully exploit the information provided to us by occurrence typing and deduce for the function in (38) the type (number $\to$ number) $\wedge$ (( $\mathbbm{\qm}\setminus$ number) $\to$ string), so that no cast is inserted when the function is applied to a number. To achieve this, we simply modify the typing rule for functions that we defined in the previous section to accommodate gradual typing. Let $\sigma$ and $\tau$ range over gradual types, that is, the types produced by the grammar in Definition 2.1 to which we add $\mathbbm{\qm}$ as a basic type (see castagna2019gradual for the definition of the subtyping relation on these types). For every gradual type $\tau$ , define $\tau^{\Uparrow}$ as the (non gradual) type obtained from $\tau$ by replacing all covariant occurrences of $\mathbbm{\qm}$ by $\MyMathBb{1}$ and all contravariant ones by $\MyMathBb{0}$ . The type $\tau^{\Uparrow}$ can be seen as the maximal interpretation of $\tau$ , that is, every expression that can safely be cast to $\tau$ is of type $\tau^{\Uparrow}$ . In other words, if a function expects an argument of type $\tau$ but can be typed under the hypothesis that the argument has type $\tau^{\Uparrow}$ , then no casts are needed, since every value whose cast to $\tau$ succeeds already has type $\tau^{\Uparrow}$ . Taking advantage of this property, we modify the rule for functions as:

[TABLE]

The main idea behind this rule is the same as before: we first collect all the information we can into $\psi$ by analyzing the body of the function. We then retype the function using the new hypothesis $x:\sigma$ for every $\sigma\in\psi(x)$ . Furthermore, we also retype the function using the hypothesis $x:\sigma^{\Uparrow}$ : as explained before the rule, whenever this typing succeeds it eliminates unnecessary gradual types and, thus, unnecessary casts. Let us see how this works on the function foo in (38). First, we deduce the refined hypothesis $\psi(\texttt{x})=\{\,\texttt{number}{\land}\mathbbm{\qm}\;,\;\mathbbm{\qm}\setminus\texttt{number}\,\}$ . Typing the function using this new hypothesis but without considering the maximal interpretation would yield $(\mathbbm{\qm}\to\texttt{number}\vee\texttt{string})\land((\texttt{number}\land\mathbbm{\qm})\to\texttt{number})\land((\mathbbm{\qm}\setminus\texttt{number})\to\texttt{string})$ . However, as we stated before, this would introduce an unnecessary cast if the function were to be applied to an integer. [Footnote 13: Notice that considering $\texttt{number}\land\mathbbm{\qm}\simeq\texttt{number}$ is not an option, since it would force us to choose between having the gradual guarantee or having, say, $\texttt{number}\land\texttt{string}$ be more precise than $\texttt{number}\land\mathbbm{\qm}$ .]

Hence the need for the second part of Rule [AbsInf+]: the maximal interpretation of $\texttt{number}\land\mathbbm{\qm}$ is number, and it is clear that, if x is given type number, the function type-checks, thanks to occurrence typing. Thus, after some routine simplifications, we can actually deduce the desired type $(\texttt{number}\to\texttt{number})\land((\mathbbm{\qm}\setminus\texttt{number})\to\texttt{string})$ .
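The maximal interpretation is a simple polarity-directed traversal, which the following sketch makes explicit. It is an illustration on a hypothetical AST (`GTy`) covering only basic types, the dynamic type, $\MyMathBb{1}$, $\MyMathBb{0}$, arrows, unions and intersections; negation is left out to keep the polarity bookkeeping minimal, and `simplify` implements a single routine simplification ( $t\wedge\MyMathBb{1}\simeq t$ ).

```typescript
// Hypothetical AST for a fragment of gradual types (no negation in this sketch).
type GTy =
  | { k: "base"; name: string }
  | { k: "dyn" }   // the unknown type ?
  | { k: "any" }   // the top type 1
  | { k: "empty" } // the bottom type 0
  | { k: "arrow"; dom: GTy; cod: GTy }
  | { k: "and"; l: GTy; r: GTy }
  | { k: "or"; l: GTy; r: GTy };

// Maximal interpretation: covariant ? becomes 1, contravariant ? becomes 0.
function up(t: GTy, covariant: boolean = true): GTy {
  if (t.k === "dyn") return covariant ? { k: "any" } : { k: "empty" };
  if (t.k === "arrow") {
    // The domain of an arrow flips the polarity, the codomain preserves it.
    return { k: "arrow", dom: up(t.dom, !covariant), cod: up(t.cod, covariant) };
  }
  if (t.k === "and") return { k: "and", l: up(t.l, covariant), r: up(t.r, covariant) };
  if (t.k === "or") return { k: "or", l: up(t.l, covariant), r: up(t.r, covariant) };
  return t;
}

// One routine simplification: t ∧ 1 ≃ t.
function simplify(t: GTy): GTy {
  if (t.k === "and") {
    const l = simplify(t.l), r = simplify(t.r);
    if (l.k === "any") return r;
    if (r.k === "any") return l;
    return { k: "and", l, r };
  }
  return t;
}

const num: GTy = { k: "base", name: "number" };
const dyn: GTy = { k: "dyn" };
// (number ∧ ?)⇑ simplifies to number
const maximal = simplify(up({ k: "and", l: num, r: dyn }));
// (? -> ?)⇑ is 0 -> 1
const idArrow = up({ k: "arrow", dom: dyn, cod: dyn });
```

The first result is the hypothesis number used above to retype foo; the second shows why the dynamic identity function is interpreted as a function of type $\MyMathBb{0}\to\MyMathBb{1}$.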

4 Implementation

We present in this section preliminary results obtained by our implementation. After giving some technical highlights, we focus on demonstrating the behavior of our typing algorithm on meaningful examples. We also provide an in-depth comparison with the fourteen examples of [THF10].

4.1 Implementation details

We have implemented the algorithmic system $\vdash_{\!\scriptscriptstyle\mathcal{A}}$ we presented in Section 2.6.3. Besides the type-checking algorithm defined on the base language, our implementation supports the record types and expressions of Section 3.1 and the refinement of function types described in Section 3.2. Furthermore, our implementation uses for the inference of arrow types the following improved rule:

[TABLE]

instead of the simpler [AbsInf+] given in Section 3.2. The difference of this new rule with respect to [AbsInf+] is that the typing of the body is made under the hypothesis $x:s\setminus\bigvee_{s^{\prime}\in\psi(x)}s^{\prime}$ , that is, the domain of the function minus all the input types determined by the $\psi$ -analysis. This yields an even better refinement of the function type that makes a difference for instance with the inference for the function xor_ (see Code 3 in Table 4.2): the old rule would have returned a less precise type. The rule above is defined for functions annotated by a single arrow type: the extension to annotations with intersections of multiple arrows is similar to the one we did in the simpler setting of Section 3.2.
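The residual hypothesis used by the improved rule is easy to compute once types are modelled as sets. The sketch below assumes, for illustration only, that types are bitmasks over disjoint atoms; the function name `residual` is hypothetical.

```typescript
// Types are bitmasks over disjoint atoms (an assumption of this sketch).
type Ty = number;
const Int: Ty = 1, Frac: Ty = 2, Bool: Ty = 4; // Frac stands for Real \ Int

// s minus the union of all types in psi(x): the part of the declared domain
// that is not covered by any type collected by the psi-analysis.
function residual(s: Ty, psi: Ty[]): Ty {
  const covered = psi.reduce((acc, t) => acc | t, 0);
  return s & ~covered;
}

// If only Int was collected for a parameter of type Real ∨ Bool, the body is
// additionally typed under the hypothesis x : (Real \ Int) ∨ Bool.
const rest = residual(Int | Frac | Bool, [Int]);
// If psi(x) already covers the whole domain, the residual hypothesis is empty.
const none = residual(Int | Frac | Bool, [Int, Frac, Bool]);
```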

The implementation is rather crude and consists of 2000 lines of OCaml code, including parsing, type-checking of programs, and pretty printing of types. CDuce is used as a library to provide set-theoretic types and semantic subtyping. The implementation faithfully transcribes in OCaml the algorithmic system $\vdash_{\!\scriptscriptstyle\mathcal{A}}$ as well as all the type operations defined in this work. One optimization that our implementation features (with respect to the formal presentation) is the use of a memoization environment in the code of the $\textsf{{Refine}}_{e,t}(\Gamma)$ function, which allows the inference to avoid unnecessary traversals of $e$ . Lastly, while our prototype allows the user to specify a particular value for the $n_{o}$ parameter we introduced in Section 2.6.2, a value of $1$ for $n_{o}$ is sufficient to check all examples we present in the rest of the section.

4.2 Experiments

We demonstrate the output of our type-checking implementation in Table 4.2 and Table LABEL:tab:implem2. Table 4.2 lists some examples, none of which can be typed by current systems. Even though some systems such as Flow and TypeScript can type some of these examples by adding explicit type annotations, the code 6, 7, 9, and 10 in Table 4.2 and, even more, the and_ and xor_ functions given in (4.2) and (4.2) later in this section are out of reach of current systems, even when using the right explicit annotations.

It should be noted that for all the examples we present, the time for the type inference process is less than 5ms, hence we do not report precise timings in the table. These and other examples can be tested in the online toplevel available at https://occtyping.github.io/
