Verifying that a compiler preserves concurrent value-dependent information-flow security

Robert Sison (Data61, CSIRO, UNSW Sydney), Toby Murray (University of Melbourne)

arXiv:1907.00713·cs.LO·October 23, 2020


TL;DR

This paper introduces a decomposition principle for verifying that compilers preserve value-dependent information-flow security in concurrent programs, demonstrated through formal proof and application to a real compiler in Isabelle/HOL.

Contribution

It provides a new decomposition method that simplifies proving security preservation in compiler verification for concurrent, value-dependent security properties.

Findings

01

Decomposition principle reduces proof complexity by nearly half.

02

Successfully verified security preservation in a compiler from a While language to RISC assembly.

03

Applied verification to a real-world concurrent program model, demonstrating practical impact.

Abstract

It is common to prove by reasoning over source code that programs do not leak sensitive data. But doing so leaves a gap between reasoning and reality that can only be filled by accounting for the behaviour of the compiler. This task is complicated when programs enforce value-dependent information-flow security properties (in which classification of locations can vary depending on values in other locations) and complicated further when programs exploit shared-variable concurrency. Prior work has formally defined a notion of concurrency-aware refinement for preserving value-dependent security properties. However, that notion is considerably more complex than standard refinement definitions typically applied in the verification of semantics preservation by compilers. To date it remains unclear whether it can be applied to a realistic compiler, because there exist no general decomposition…

Equations

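\begin{array}{l}
\mathsf{coupling{\text{-}}inv{\text{-}}pres}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}\ \equiv\\
\quad\forall\mathit{lc}_{1A}\ \mathit{lc}_{1C}.\ (\mathit{lc}_{1A},\mathit{lc}_{1C})\in\mathcal{R}\longrightarrow\\
\quad(\forall\mathit{lc}_{1C}'.\ \mathit{lc}_{1C}\rightsquigarrow_{C}\mathit{lc}_{1C}'\longrightarrow\\
\quad(\exists n\ \mathit{lc}_{1A}'.\ \mathit{lc}_{1A}\rightsquigarrow_{A}^{n}\mathit{lc}_{1A}'\land(\mathit{lc}_{1A}',\mathit{lc}_{1C}')\in\mathcal{R}\land\\
\quad(\forall\mathit{lc}_{2A}\ \mathit{lc}_{2C}\ \mathit{lc}_{2A}'.\ (\mathit{lc}_{1A},\mathit{lc}_{2A})\in\mathcal{B}\land\mathit{lc}_{1A}=_{\mathsf{mds}}\mathit{lc}_{2A}\land\\
\qquad(\mathit{lc}_{2A},\mathit{lc}_{2C})\in\mathcal{R}\land(\mathit{lc}_{1C},\mathit{lc}_{2C})\in\mathcal{I}\land\\
\qquad\mathit{lc}_{1C}=_{\mathsf{mds}}\mathit{lc}_{2C}\land\mathit{lc}_{2A}\rightsquigarrow_{A}^{n}\mathit{lc}_{2A}'\land\mathit{lc}_{1A}'=_{\mathsf{mds}}\mathit{lc}_{2A}'\\
\qquad\longrightarrow(\exists\mathit{lc}_{2C}'.\ \mathit{lc}_{2C}\rightsquigarrow_{C}\mathit{lc}_{2C}'\land\mathit{lc}_{1C}'=_{\mathsf{mds}}\mathit{lc}_{2C}'\land\\
\qquad\quad(\mathit{lc}_{2A}',\mathit{lc}_{2C}')\in\mathcal{R}\land(\mathit{lc}_{1C}',\mathit{lc}_{2C}')\in\mathcal{I}))))
\end{array}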

\mathit{mem}_{1}=_{\mathit{mds}}^{\mathsf{Low}}\mathit{mem}_{2}\ \equiv\ \forall x.\ x\in\mathcal{C}\lor\mathcal{L}\ \mathit{mem}_{1}\ x=\mathsf{Low}\land\mathsf{readable}\ \mathit{mds}\ x\longrightarrow\mathit{mem}_{1}\ x=\mathit{mem}_{2}\ x

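\begin{array}{l}
\mathsf{com{\text{-}}secure}\ (\mathit{tps},\mathit{mds})\ \equiv\ \forall\mathit{mem}_{1}\ \mathit{mem}_{2}.\ \mathit{mem}_{1}=_{\mathit{mds}}^{\mathsf{Low}}\mathit{mem}_{2}\longrightarrow\\
\quad(\exists\mathcal{B}.\ \mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}\ \mathcal{B}\land(\langle\mathit{tps},\mathit{mds},\mathit{mem}_{1}\rangle,\langle\mathit{tps},\mathit{mds},\mathit{mem}_{2}\rangle)\in\mathcal{B})
\end{array}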

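\begin{array}{l}
\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}\ \mathcal{B}\ \equiv\ \mathsf{cg{\text{-}}consistent}\ \mathcal{B}\land\mathsf{sym}\ \mathcal{B}\land\\
\quad(\forall\mathit{lc}_{1}\ \mathit{lc}_{2}.\ (\mathit{lc}_{1},\mathit{lc}_{2})\in\mathcal{B}\land\mathit{lc}_{1}=_{\mathsf{mds}}\mathit{lc}_{2}\longrightarrow\mathit{lc}_{1}=_{\mathsf{mds}}^{\mathsf{Low}}\mathit{lc}_{2}\land\\
\quad(\forall\mathit{lc}_{1}'.\ \mathit{lc}_{1}\rightsquigarrow\mathit{lc}_{1}'\longrightarrow(\exists\mathit{lc}_{2}'.\ \mathit{lc}_{2}\rightsquigarrow\mathit{lc}_{2}'\land\mathit{lc}_{1}'=_{\mathsf{mds}}\mathit{lc}_{2}'\land(\mathit{lc}_{1}',\mathit{lc}_{2}')\in\mathcal{B})))
\end{array}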

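\begin{array}{l}
\mathsf{cg{\text{-}}consistent}\ \mathcal{B}\ \equiv\ \forall\mathit{tps}_{1}\ \mathit{mem}_{1}\ \mathit{tps}_{2}\ \mathit{mem}_{2}\ \mathit{mds}.\\
\quad(\langle\mathit{tps}_{1},\mathit{mds},\mathit{mem}_{1}\rangle,\langle\mathit{tps}_{2},\mathit{mds},\mathit{mem}_{2}\rangle)\in\mathcal{B}\longrightarrow\\
\quad(\forall\mathit{mem}_{1}'\ \mathit{mem}_{2}'.\ (\forall x.\ (\mathit{mem}_{1}\ x\neq\mathit{mem}_{1}'\ x\lor\mathit{mem}_{2}\ x\neq\mathit{mem}_{2}'\ x\lor\\
\qquad\mathcal{L}\ \mathit{mem}_{1}\ x\neq\mathcal{L}\ \mathit{mem}_{1}'\ x)\longrightarrow\mathsf{writable}\ \mathit{mds}\ x)\land\mathit{mem}_{1}'=_{\mathit{mds}}^{\mathsf{Low}}\mathit{mem}_{2}'\longrightarrow\\
\quad(\langle\mathit{tps}_{1},\mathit{mds},\mathit{mem}_{1}'\rangle,\langle\mathit{tps}_{2},\mathit{mds},\mathit{mem}_{2}'\rangle)\in\mathcal{B})
\end{array}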

\mathsf{preserves{\text{-}}modes{\text{-}}mem}\ \mathcal{R}\ \equiv\ \forall\mathit{lc}_{A}\ \mathit{lc}_{C}.\ (\mathit{lc}_{A},\mathit{lc}_{C})\in\mathcal{R}\longrightarrow\mathit{lc}_{A}=_{\mathsf{mds}}^{\mathsf{mem}}\mathit{lc}_{C}

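\begin{array}{l}
\mathsf{closed{\text{-}}others}\ \mathcal{R}\ \equiv\ \forall\mathit{tps}_{A}\ \mathit{tps}_{C}\ \mathit{mds}\ \mathit{mem}\ \mathit{mem}'.\\
\quad(\langle\mathit{tps}_{A},\mathit{mds},\mathit{mem}\rangle_{A},\langle\mathit{tps}_{C},\mathit{mds},\mathit{mem}\rangle_{C})\in\mathcal{R}\land\\
\quad(\forall x.\ (\mathit{mem}\ x\neq\mathit{mem}'\ x\lor\mathcal{L}\ \mathit{mem}\ x\neq\mathcal{L}\ \mathit{mem}'\ x)\longrightarrow\mathsf{writable}\ \mathit{mds}\ x)\longrightarrow\\
\quad(\langle\mathit{tps}_{A},\mathit{mds},\mathit{mem}'\rangle_{A},\langle\mathit{tps}_{C},\mathit{mds},\mathit{mem}'\rangle_{C})\in\mathcal{R}
\end{array}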

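\begin{array}{l}
\mathsf{secure{\text{-}}refinement}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}\ \equiv\ \mathsf{preserves{\text{-}}modes{\text{-}}mem}\ \mathcal{R}\land\mathsf{closed{\text{-}}others}\ \mathcal{R}\land\\
\quad\mathsf{cg{\text{-}}consistent}\ \mathcal{I}\land\mathsf{sym}\ \mathcal{I}\land\mathsf{coupling{\text{-}}inv{\text{-}}pres}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}
\end{array}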

\begin{array}{l}
\mathcal{B}_{\mathsf{C}}\mathsf{of}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}\ \equiv\ \{(\mathit{lc}_{1C},\mathit{lc}_{2C})\ |\ \exists\mathit{lc}_{1A}\ \mathit{lc}_{2A}.\ (\mathit{lc}_{1A},\mathit{lc}_{1C})\in\mathcal{R}\land(\mathit{lc}_{2A},\mathit{lc}_{2C})\in\mathcal{R}\land\\
\quad(\mathit{lc}_{1A},\mathit{lc}_{2A})\in\mathcal{B}\land\mathit{lc}_{1C}=_{\mathsf{mds}}^{\mathsf{Low}}\mathit{lc}_{2C}\land(\mathit{lc}_{1C},\mathit{lc}_{2C})\in\mathcal{I}\}
\end{array}

\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}\ \mathcal{B}\land\mathsf{secure{\text{-}}refinement}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}\Longrightarrow\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}\ (\mathcal{B}_{\mathsf{C}}\mathsf{of}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I})

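\begin{array}{l}
\mathsf{secure{\text{-}}refinement{\text{-}}decomp}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}\ \mathsf{abs{\text{-}}steps}\ \equiv\\
\quad\mathsf{preserves{\text{-}}modes{\text{-}}mem}\ \mathcal{R}\land\mathsf{closed{\text{-}}others}\ \mathcal{R}\land\mathsf{cg{\text{-}}consistent}\ \mathcal{I}\land\mathsf{sym}\ \mathcal{I}\land\\
\quad\mathsf{decomp{\text{-}}refinement{\text{-}}safe}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}\ \mathsf{abs{\text{-}}steps}\land(\forall\mathit{lc}_{A}\ \mathit{lc}_{C}.\ (\mathit{lc}_{A},\mathit{lc}_{C})\in\mathcal{R}\longrightarrow\\
\quad(\forall\mathit{lc}_{C}'.\ \mathit{lc}_{C}\rightsquigarrow_{C}\mathit{lc}_{C}'\longrightarrow(\exists\mathit{lc}_{A}'.\ \mathit{lc}_{A}\rightsquigarrow_{A}^{(\mathsf{abs{\text{-}}steps}\ \mathit{lc}_{A}\ \mathit{lc}_{C})}\mathit{lc}_{A}'\land(\mathit{lc}_{A}',\mathit{lc}_{C}')\in\mathcal{R})))
\end{array}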

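\begin{array}{l}
\mathsf{decomp{\text{-}}refinement{\text{-}}safe}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}\ \mathsf{abs{\text{-}}steps}\ \equiv\ \forall\mathit{lc}_{1A}\ \mathit{lc}_{2A}\ \mathit{lc}_{1C}\ \mathit{lc}_{2C}.\ (\mathit{lc}_{1A},\mathit{lc}_{2A})\in\mathcal{B}\land\\
\quad\mathit{lc}_{1A}=_{\mathsf{mds}}\mathit{lc}_{2A}\land(\mathit{lc}_{1A},\mathit{lc}_{1C})\in\mathcal{R}\land(\mathit{lc}_{2A},\mathit{lc}_{2C})\in\mathcal{R}\land(\mathit{lc}_{1C},\mathit{lc}_{2C})\in\mathcal{I}\land\mathit{lc}_{1C}=_{\mathsf{mds}}\mathit{lc}_{2C}\\
\quad\longrightarrow\mathsf{stops}\ \mathit{lc}_{1C}=\mathsf{stops}\ \mathit{lc}_{2C}\land\mathsf{abs{\text{-}}steps}\ \mathit{lc}_{1A}\ \mathit{lc}_{1C}=\mathsf{abs{\text{-}}steps}\ \mathit{lc}_{2A}\ \mathit{lc}_{2C}\land\\
\quad(\forall\mathit{lc}_{1C}'\ \mathit{lc}_{2C}'.\ \mathit{lc}_{1C}\rightsquigarrow_{C}\mathit{lc}_{1C}'\land\mathit{lc}_{2C}\rightsquigarrow_{C}\mathit{lc}_{2C}'\longrightarrow(\mathit{lc}_{1C}',\mathit{lc}_{2C}')\in\mathcal{I}\land\mathit{lc}_{1C}'=_{\mathsf{mds}}\mathit{lc}_{2C}')
\end{array}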

\mathsf{secure{\text{-}}refinement{\text{-}}decomp}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}\ \mathsf{abs{\text{-}}steps}\Longrightarrow\mathsf{secure{\text{-}}refinement}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}

\begin{array}[]{r@{\ }l}\mathit{exp}::=&n\ |\ v\ |\ \mathit{exp}\ \oplus\mathit{exp}\\ \mathit{cmd}::=&\textbf{skip}\ |\ \mathit{cmd}{}\mathbin{;}{}\mathit{cmd}\ |\ \textbf{if}\ exp\ \textbf{then}\ \mathit{cmd}\ \textbf{else}\ \mathit{cmd}\ \textbf{fi}\ |\\ &\textbf{while}\ exp\ \textbf{do}\ \mathit{cmd}\ \textbf{od}\ |\ v{}\mathbin{:=}{}exp\ |\\ &\textbf{lock}(k)\ |\ \textbf{unlock}(k)\end{array}


\begin{array}[]{r@{\ }l}I::=&[l:]B\\ B::=&\textbf{Load}\ r\ v\ |\ \textbf{Store}\ v\ r\ |\ \textbf{Jmp}\ l\ |\ \textbf{Jz}\ l\ r\ |\ \textbf{Nop}\\ &\textbf{MoveK}\ r\ n\ |\ \textbf{MoveR}\ r\ r\ |\ \textbf{Op}\ \oplus\ r\ r\\ &\textbf{LockAcq}\ k\ |\ \textbf{LockRel}\ k\end{array}


\mathsf{compile{\text{-}}cmd}::\ (\mathit{I}\times\mathit{CompRec})\ \mathit{list}\times\mathit{Lab}\ \mathit{option}\times\mathit{Lab}\times\mathit{CompRec}\times\mathit{bool}


Full text

Robert Sison: Data61, CSIRO, Australia and UNSW Sydney, Australia (ORCID: https://orcid.org/0000-0003-0313-9764). Funding: Australian Government RTP Scholarship & Data61 Research Project Award.

Toby Murray: University of Melbourne.

Copyright: Robert Sison and Toby Murray.

CCS concepts: Security and privacy → Logic and verification; Security and privacy → Information flow control; Software and its engineering → Compilers.

Supplement material: The Isabelle/HOL theories are available at https://covern.org/itp19.html.

Acknowledgements.

We would like to thank our anonymous reviewers, as well as Carroll Morgan, Kai Engelhardt, Gerwin Klein, Christine Rizkallah, Matthew Brecknell, Johannes Åman Pohjola, and Qian Ge, for their very helpful feedback on earlier versions of this paper.

Published in: 10th International Conference on Interactive Theorem Proving (ITP 2019), September 9–12, 2019, Portland, OR, USA. Editors: John Harrison, John O’Leary, and Andrew Tolmach. Series volume 141, article no. 5.


Abstract

It is common to prove by reasoning over source code that programs do not leak sensitive data. But doing so leaves a gap between reasoning and reality that can only be filled by accounting for the behaviour of the compiler. This task is complicated when programs enforce value-dependent information-flow security properties—in which classification of locations can vary depending on values in other locations—and complicated further when programs exploit shared-variable concurrency.

Prior work has formally defined a notion of concurrency-aware refinement for preserving value-dependent security properties. However, that notion is considerably more complex than standard refinement definitions typically applied in the verification of semantics preservation by compilers. To date it remains unclear whether it can be applied to a realistic compiler, because there exist no general decomposition principles for separating it into smaller, more familiar, proof obligations.

In this work, we provide such a decomposition principle, which we show can almost halve the complexity of proving secure refinement. Further, we demonstrate its applicability to secure compilation, by proving in Isabelle/HOL the preservation of value-dependent security by a proof-of-concept compiler from an imperative While language to a generic RISC-style assembly language, for programs with shared-memory concurrency mediated by locking primitives. Finally, we execute our compiler in Isabelle on a While language model of the Cross Domain Desktop Compositor, demonstrating to our knowledge the first use of a compiler verification result to carry an information-flow security property down to the assembly-level model of a non-trivial concurrent program.

Keywords: Secure compilation, Information flow security, Concurrency, Verification

1 Introduction

It is well known that program translations of the kind carried out by compilers can in principle break security properties like confidentiality [12, 2]. Yet source level reasoning about confidentiality remains common [20, 19, 18]. Existing verified compilers like CompCert [15] and CakeML [14] preserve semantics, but semantics preservation alone may be insufficient to preserve confidentiality, especially for shared memory concurrent programs whose threads must guard against timing leaks in order to prevent them manifesting as storage leaks [22].

Supporting secure compilation of programs that must enforce value-dependent security policies poses an additional challenge, because in such policies the sensitivity of a memory location can depend on the values held in other memory locations. Thus, unlike prior work on secure compilation [4], preserving security under refinement requires a refinement relation that is strong enough to preserve those memory contents on which the policy depends.

In prior work [22], we presented a definition for a notion of value-dependent security-preserving refinement that is compositional for concurrent programs: by applying it to each thread individually, one can derive a secure refinement of the concurrent composition.

The essence of this notion of security-preserving refinement (presented fully in Section 2.2) is in its refinement preservation obligation ( $\mathsf{coupling{\text{-}}inv{\text{-}}pres}$ in Figure 1). Here, the usual square-shaped commuting diagram that is commonly used to depict (semantics-preserving) refinement (Figure 4(a)) has been replaced by a cube (Figure 1). The additional dimension of this cube reflects that it preserves a 2-safety hyperproperty [6] that compares two executions rather than examining a single one. As such, it is significantly more complicated to prove than standard notions of semantics-preserving refinement typical in verified compilation [15, 14].

To date there exist no verified compilers for shared-variable concurrent programs proved to preserve value-dependent information-flow security. We argue that without a decomposition principle the cube-shaped refinement notion is too cumbersome to prove for realistic compilers.

In this paper, we tackle the central problem of making our notion of secure refinement applicable to verified secure compilation. Firstly, we present a decomposition principle that makes the cube-shaped notion more tractable. Secondly, we demonstrate its tractability with our major contribution: a machine-checked formal proof of concurrent value-dependent security preservation, for a proof-of-concept compiler.

In Section 3 we present our decomposition principle, which decomposes the cube (Figure 1) into three separate obligations (Figure 4). The first of these is akin to semantics-preserving refinement, while the second and third essentially ensure together that the refinement has not introduced any termination- and timing-leaks.

In Section 4 we show how the decomposition principle can almost halve the effort to prove secure refinement – in this case, of a program that is especially prone to introduced timing leaks because it branches on secrets (a feature not yet allowed by our compiler). There, we present a side-by-side comparison of the proof effort, both with and without the decomposition principle. We find that using it reduces the proof’s complexity by 44%.

In Section 5, we present our compiler and its formal verification, as an application of the decomposition principle. This compiler translates concurrent programs written in an imperative While language, with locking primitives for mediating access to shared memory, into a RISC-style assembly language. It does so by compiling each thread individually, and in doing so preserves a formal security property that remains compositional between threads. Furthermore, our compiler demonstrates a way of formalising and proving when it is safe for a compiler to perform optimisations in the presence of concurrency. To ensure that the contents of shared memory locations are preserved under compilation despite potential interference from other threads, our compiler tracks which shared memory locations are stable (free from any such interference). It then uses this tracking to safely omit redundant loads from stable shared variables, loads that it would otherwise be unsafe to omit.

All results are mechanised in Isabelle/HOL (the wr-compiler totals $\sim$7k lines, and verification + compilation of the 2-thread CDDC model totals $\sim$1.6k lines of Isabelle proof script, excluding whitespace and comments; see “Supplement Material”), and in Section 6 we explain how, in order to validate our theory, we instantiated it so that we could execute our compiler in Isabelle. This enabled us to execute it over a While language model of the Cross Domain Desktop Compositor [5] (CDDC), a concurrent program that enforces information flow control over value-dependently classified input. To our knowledge this is the first proof of information flow security for an assembly-level model of a non-trivial concurrent program, demonstrating the power of verified secure compilation for deriving security properties of compiled code.

2 Background and example

We begin by using an illustrative example (Figure 2) to introduce the challenges of verifying value-dependent information-flow security in the presence of shared-variable concurrency.

Consider the task of verifying a multithreaded system that manages the user interface (UI) for a dual-personality smartphone, a phone that provides clearly distinguished user contexts (personalities), typically for work versus leisure. Specifically, our task is to verify that it does not leak sensitive information intended only for one of those personalities, which we classify $\mathsf{High}$ (Figure 2(c)), to locations belonging to the other, which we classify $\mathsf{Low}$ (Figure 2(c)).

Here and generally, our attacker model is an entity that can read from the system’s untrusted sinks: some subset of permanently $\mathsf{Low}$ -classified locations not subject to synchronisation. In our example, this may include WLAN device registers in a hostile environment.

The smartphone’s UI system consists of a number of threads running concurrently with a shared address space, and we aim to verify that as a whole it satisfies the security requirement. But to avoid a state space explosion that is exponential in the number of threads, we must do this compositionally: one thread at a time, then combining the results of these analyses.

We focus on a particular worker thread (Figure 2(a)), the one responsible for sending touchscreen input from the $\mathit{source}$ variable to its intended destination.

The first challenge is that the destination depends on which personality the phone is currently providing, which is indicated by the value of $\mathit{domain}$ . This is reflected by the classification of $\mathit{source}$ being dependent on the value of $\mathit{domain}$ : $\mathit{source}$ is classified $\mathsf{Low}$ exactly when $\mathit{domain}=\textsf{LOW}$ (where LOW is a designated constant), and is classified $\mathsf{High}$ otherwise. Due to this dependency, $\mathit{domain}$ is known as a control variable of $\mathit{source}$ .

The second challenge is that the worker thread runs in a shared address space that might be accessed or modified by other threads, for various purposes. One of these threads may be responsible for maintaining that $\mathit{domain}=\textsf{LOW}$ exactly when the phone indicates it is providing the $\mathsf{Low}$ personality (Figure 2(c)), so the user knows not to type in anything sensitive. Another thread may be responsible for assigning $\mathit{suspended}{}\mathbin{:=}{}\textsf{TRUE}$ when the user turns the phone’s screen off, to make the worker stop processing touchscreen input. We may then wish for $\mathit{workspace}$ to be usable by some other thread (e.g. one processing input from a fingerprint scanner) in such a way that it can assume $\mathit{workspace}$ no longer contains any sensitive values.

When we analyse one thread like this worker in terms of our compositional security property (Section 2.1), all of the other threads in the system are trusted to do two things:

1. They follow a synchronisation scheme: here, if read- or write-access to a certain variable is governed by a lock, they must hold it in order to access the variable in that manner.

2. They themselves do not leak values from $\mathsf{High}$ -classified locations (we refer to such values themselves as $\mathsf{High}$ ) to $\mathsf{Low}$ -classified locations that are read-accessible to other threads. Note we are proving that the thread we are analysing can be trusted in the same way.

Even under these assumptions, the concurrency gives rise to some tricky considerations.

Firstly, it is important that no thread in the system (including the thread under analysis) modifies any control variables carelessly. For example, writing $\mathit{domain}{}\mathbin{:=}{}\textsf{LOW}$ immediately after the worker reads a $\mathsf{High}$ value from $\mathit{source}$ will cause that value to leak to $\mathit{low\_sink}$ . To prevent this, the worker uses $\mathit{source\_lock}$ , granting it exclusive write-access to $\mathit{source}$ and $\mathit{domain}$ .
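The leak described above can be replayed in a few lines of Python. This is only a sketch: the worker's code (Figure 2(a)) is not shown in this text, so the read-then-branch shape of `worker_round`, the non-LOW value `"PRIVATE"` for `domain`, and the sample input are all assumptions made for illustration.

```python
def classify(mem, x):
    """Classification of x: source is Low exactly when domain == "LOW"."""
    if x == "high_sink":
        return "High"
    if x == "source":
        return "Low" if mem["domain"] == "LOW" else "High"
    return "Low"


def worker_round(env_flips_domain):
    """One hypothetical worker iteration, optionally interleaved with another thread."""
    # "PRIVATE" stands for any value of domain other than the constant LOW.
    mem = {"domain": "PRIVATE", "source": "secret keystroke",
           "low_sink": None, "high_sink": None}
    level_at_read = classify(mem, "source")
    temp = mem["source"]              # worker reads source (High at this point)
    if env_flips_domain:
        mem["domain"] = "LOW"         # unsynchronised write by another thread
    if mem["domain"] == "LOW":        # worker chooses the destination
        mem["low_sink"] = temp
    else:
        mem["high_sink"] = temp
    return level_at_read, mem


level, leaked = worker_round(env_flips_domain=True)
_, safe = worker_round(env_flips_domain=False)
```

With the interleaved write, a value that was classified High when it was read ends up in `low_sink`; without it, the value goes to `high_sink`. Holding a lock that forbids other threads from writing `domain` rules out the first run.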

Furthermore as noted above, we may want to ensure that a non-attacker-observable location is nevertheless cleared of any sensitive values before being used by another thread. In our example, we classify $\mathit{workspace}$ $\mathsf{Low}$ for the analysis to enforce this when the worker is suspended, but as the worker sometimes uses it to process $\mathsf{High}$ values, it is important to know $\mathit{workspace}$ is accessible only to the worker during that time. To ensure this, the worker uses $\mathit{workspace\_lock}$ , granting it exclusive read- and write-access to $\mathit{workspace}$ . It is then responsible for clearing it of any $\mathsf{High}$ values by the time it releases exclusive read-access.

2.1 Concurrent value-dependent noninterference (CVDNI)

Having illustrated the challenges with an example, we now focus on the formalisation of our information-flow security property CVDNI, which we target with our per-thread analysis, and which our compiler preserves. It is defined in terms of two main elements:

1. a binary strong low-bisimulation (modulo modes) relation $\mathcal{B}$ between program configurations, that establishes the required information-flow security property. Like Goguen & Meseguer-style noninterference [10], any states it relates must agree on their “low” portions, and it demands that lock-step execution preserve that correspondence. This section will explain how it is specialised further for shared-variable concurrency.

2. a classification function $\mathcal{L}$ that determines the “low” portion of a program configuration, thus affecting $\mathcal{B}$ ’s requirements. Unlike [10] however, $\mathcal{L}$ here can depend on values in the program configuration itself, thus expressing dynamic and not just static classifications.

We now present definitions from Section III-2b of our previous work [22] simplified as noted. The theory is parameterised over the type of values $\mathit{Val}$ , a finite set of shared variables $\mathit{Var}$ , and a deterministic evaluation step semantics $\rightsquigarrow$ between local configurations (of a thread in a concurrent program) each denoted by a triple $\langle\mathit{tps},\mathit{mds},\mathit{mem}\rangle$ :

• $\mathit{tps}$ is the thread-private state, which is permanently inaccessible to the attacker and the other threads. Note that due to this inaccessibility, we allow the user of the theory to parameterise the type of $\mathit{tps}$ , and do not impose any particular structure.

• $\mathit{mds}::\mathit{Mode}\Rightarrow\mathit{Var}\ \mathit{set}$ is the (access) mode state, which is ghost state associating each $\mathit{Mode}=\{\mathbf{AsmNoW},\mathbf{AsmNoRW},\mathbf{GuarNoW},\mathbf{GuarNoRW}\}$ with a set of shared variables. Intuitively, it identifies the set of variables for which the thread currently possesses (or respects) a kind of exclusivity of access granted (or obligated) by a synchronisation scheme. This facilitates compositional, assume-guarantee [11] style reasoning. For example, when our worker thread holds $\mathit{source\_lock}$ , it assumes no other threads write to $\mathit{source}$ or its control variable ( $\{\mathit{source},\mathit{domain}\}\subseteq\mathit{mds}\ \mathbf{AsmNoW}$ ), otherwise it guarantees it does not write to them ( $\mathbf{GuarNoW}$ ). Similarly, holding $\mathit{workspace\_lock}$ it assumes no other threads read or write to $\mathit{workspace}$ ( $\mathit{workspace}\in\mathit{mds}\ \mathbf{AsmNoRW}$ ), and at all other times it makes the corresponding guarantee ( $\mathbf{GuarNoRW}$ ).

• $\mathit{mem}::\mathit{Mem}$ is shared memory considered potentially accessible to the attacker and other threads. In order to make what is accessible amenable to analysis, we impose the structure $\mathit{Mem}=\mathit{Var}\Rightarrow\mathit{Val}$ , a total map from shared variable names to their values.
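As a rough illustration of the mode state for the worker example, the following Python sketch encodes $\mathit{mds}$ as a dictionary from mode names to sets of variables. The dictionary encoding, the `LOCKS` table, and the `acquire`/`release` helpers are assumptions of this sketch, not part of the Isabelle/HOL formalisation.

```python
MODES = ("AsmNoW", "AsmNoRW", "GuarNoW", "GuarNoRW")

# Hypothetical lock table for the worker example: each lock maps to the
# (assumption mode, guarantee mode, governed variables) described in the text.
LOCKS = {
    "source_lock": ("AsmNoW", "GuarNoW", {"source", "domain"}),
    "workspace_lock": ("AsmNoRW", "GuarNoRW", {"workspace"}),
}


def initial_mds():
    """Mode state of a thread holding no locks: it only makes guarantees."""
    mds = {mode: set() for mode in MODES}
    for _asm, guar, variables in LOCKS.values():
        mds[guar] |= variables
    return mds


def acquire(mds, lock):
    """Mode state after acquiring lock: the guarantee is traded for an assumption."""
    asm, guar, variables = LOCKS[lock]
    new = {mode: set(vs) for mode, vs in mds.items()}
    new[guar] -= variables
    new[asm] |= variables
    return new


def release(mds, lock):
    """Mode state after releasing lock: the assumption is traded back for a guarantee."""
    asm, guar, variables = LOCKS[lock]
    new = {mode: set(vs) for mode, vs in mds.items()}
    new[asm] -= variables
    new[guar] |= variables
    return new


mds = acquire(initial_mds(), "source_lock")
```

After acquiring `source_lock`, `mds["AsmNoW"]` contains `source` and `domain`, matching $\{\mathit{source},\mathit{domain}\}\subseteq\mathit{mds}\ \mathbf{AsmNoW}$, while `workspace` remains under the $\mathbf{GuarNoRW}$ guarantee.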

The theory is then further parameterised by the value-dependent classification function $\mathcal{L}::\mathit{Mem}\Rightarrow\mathit{Var}\Rightarrow\{\mathsf{High},\mathsf{Low}\}$ , and a function $\mathcal{C}\mathsf{vars}::\mathit{Var}\Rightarrow\mathit{Var}\ \mathit{set}$ that returns all the control variables of a given variable. In our worker thread example, $\mathcal{L}\ {\mathit{mem}}\ x$ gives:

• $\mathsf{High}$ when $x$ is $\mathit{high\_sink}$ , meaning $\mathit{high\_sink}$ is classified $\mathsf{High}$ at all times.

• when $x$ is $\mathit{source}$ : $\mathsf{Low}$ if $\mathit{mem}\ \mathit{domain}=\textsf{LOW}$ , and $\mathsf{High}$ otherwise.

• $\mathsf{Low}$ for all other variables $x$ , meaning they are classified $\mathsf{Low}$ at all times.

The set $\mathcal{C}=\{y\ |\ \exists x.\ y\in\mathcal{C}\mathsf{vars}\ x\}$ is then defined to contain all control variables in the system. Thus in our worker thread example, $\mathcal{C}\mathsf{vars}\ \mathit{source}=\{\mathit{domain}\}$ and $\mathcal{C}=\{\mathit{domain}\}$ .
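The classification function and control variables for the worker example can be written out directly. In this Python sketch, memories are dictionaries, the variable list is collected from the names mentioned in the text, and the identifiers `classify`, `cvars`, and `CONTROL_VARS` are hypothetical names for $\mathcal{L}$, $\mathcal{C}\mathsf{vars}$, and $\mathcal{C}$.

```python
HIGH, LOW = "High", "Low"
DOMAIN_LOW = "LOW"  # the designated constant LOW that domain may hold

# Shared variables named in the worker example (assumed to be the full set).
VARS = ["source", "domain", "workspace", "suspended", "low_sink", "high_sink"]


def classify(mem, x):
    """L mem x: the classification of variable x in memory mem."""
    if x == "high_sink":
        return HIGH
    if x == "source":
        return LOW if mem["domain"] == DOMAIN_LOW else HIGH
    return LOW


def cvars(x):
    """Cvars x: the control variables of x."""
    return {"domain"} if x == "source" else set()


# C: every variable that is a control variable of some variable.
CONTROL_VARS = set().union(*(cvars(x) for x in VARS))

mem = {x: 0 for x in VARS}
mem["domain"] = DOMAIN_LOW
```

Here `classify(mem, "source")` is `Low` only because `mem["domain"]` holds the constant `LOW`; changing `domain` changes the classification of `source` without touching `source` itself.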

To support compositionality for concurrent programs, the “low” portion demanded to be equal by the analysis is tightened up to be modulo modes – it includes non-control variables only if they are assumed to be readable by other threads according to the mode state: $\mathsf{readable}\ \mathit{mds}\ x\equiv x\notin\mathit{mds}\ \mathbf{AsmNoRW}$ . Thus intuitively, the user of the theory should model permanent untrusted output sinks of the whole concurrent program, as variables for which $\mathcal{L}$ always returns $\mathsf{Low}$ , ungoverned by any synchronisation scheme that the attacker cannot be trusted to follow. (In our example, $\mathit{low\_sink}$ is untrusted permanently in this way, but $\mathit{workspace}$ is untrusted only when unlocked.) The notion of observational indistinguishability used for the noninterference property is then defined over memories as follows.

Definition 2.1 (Low-equivalent memories modulo modes).

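$$\mathit{mem}_{1}=_{\mathit{mds}}^{\mathsf{Low}}\mathit{mem}_{2}\ \equiv\ \forall x.\ x\in\mathcal{C}\lor\mathcal{L}\ \mathit{mem}_{1}\ x=\mathsf{Low}\land\mathsf{readable}\ \mathit{mds}\ x\longrightarrow\mathit{mem}_{1}\ x=\mathit{mem}_{2}\ x$$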
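Low-equivalence modulo modes can be checked mechanically for finite memories. The Python sketch below assumes memories are dictionaries over the same variables and takes the classification function and the control-variable set as parameters; the sample memories, the non-LOW value `"PRIVATE"`, and the function names are illustrative assumptions.

```python
def readable(mds, x):
    """readable mds x: x is not assumed unreadable by other threads."""
    return x not in mds["AsmNoRW"]


def low_eq_mod_modes(mem1, mem2, mds, classify, control_vars):
    """mem1 and mem2 agree on control variables and on readable Low variables."""
    return all(
        mem1[x] == mem2[x]
        for x in mem1
        if x in control_vars or (classify(mem1, x) == "Low" and readable(mds, x))
    )


def classify(mem, x):
    """Classification for a cut-down worker example."""
    if x == "source":
        return "Low" if mem["domain"] == "LOW" else "High"
    return "Low"


# The thread holds workspace_lock, so workspace is assumed unreadable by others.
mds = {"AsmNoW": set(), "AsmNoRW": {"workspace"}, "GuarNoW": set(), "GuarNoRW": set()}
mem1 = {"domain": "PRIVATE", "source": 1, "workspace": 7, "low_sink": 0}
mem2 = {"domain": "PRIVATE", "source": 2, "workspace": 9, "low_sink": 0}

equivalent = low_eq_mod_modes(mem1, mem2, mds, classify, {"domain"})
```

The two memories differ on `source` (currently High) and on `workspace` (Low but locked), yet they count as low-equivalent modulo modes. Releasing the lock, or setting `domain` to `LOW` in both, makes the differences observable.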

For this paper, we will use notation $\mathit{lc}_{1}=_{\mathsf{mds}}^{\mathsf{Low}}\mathit{lc}_{2}$ to lift $=_{\mathit{mds}}^{\mathsf{Low}}$ to local program configurations, asserting also that $\mathit{lc}_{1}$ and $\mathit{lc}_{2}$ are modes-equal (have the same mode state). Additionally, we will use notation $\mathit{lc}_{1}=_{\mathsf{mds}}\mathit{lc}_{2}$ to denote (alone) that $\mathit{lc}_{1}$ and $\mathit{lc}_{2}$ are modes-equal.

The per-thread compositional security property $\mathsf{com{\text{-}}secure}$ asserts the existence of a witness relation $\mathcal{B}$ for every possible observationally equivalent pair of starting configurations:

Definition 2.2 (Per-thread compositional CVDNI property).

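$$\begin{array}{l}
\mathsf{com{\text{-}}secure}\ (\mathit{tps},\mathit{mds})\ \equiv\ \forall\mathit{mem}_{1}\ \mathit{mem}_{2}.\ \mathit{mem}_{1}=_{\mathit{mds}}^{\mathsf{Low}}\mathit{mem}_{2}\longrightarrow\\
\quad(\exists\mathcal{B}.\ \mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}\ \mathcal{B}\land(\langle\mathit{tps},\mathit{mds},\mathit{mem}_{1}\rangle,\langle\mathit{tps},\mathit{mds},\mathit{mem}_{2}\rangle)\in\mathcal{B})
\end{array}$$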

where all such witness relations $\mathcal{B}$ must be a strong low-bisimulation (modulo modes):

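$$\begin{array}{l}
\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}\ \mathcal{B}\ \equiv\ \mathsf{cg{\text{-}}consistent}\ \mathcal{B}\land\mathsf{sym}\ \mathcal{B}\land\\
\quad(\forall\mathit{lc}_{1}\ \mathit{lc}_{2}.\ (\mathit{lc}_{1},\mathit{lc}_{2})\in\mathcal{B}\land\mathit{lc}_{1}=_{\mathsf{mds}}\mathit{lc}_{2}\longrightarrow\mathit{lc}_{1}=_{\mathsf{mds}}^{\mathsf{Low}}\mathit{lc}_{2}\land\\
\quad(\forall\mathit{lc}_{1}'.\ \mathit{lc}_{1}\rightsquigarrow\mathit{lc}_{1}'\longrightarrow(\exists\mathit{lc}_{2}'.\ \mathit{lc}_{2}\rightsquigarrow\mathit{lc}_{2}'\land\mathit{lc}_{1}'=_{\mathsf{mds}}\mathit{lc}_{2}'\land(\mathit{lc}_{1}',\mathit{lc}_{2}')\in\mathcal{B})))
\end{array}$$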

That is, $\mathcal{B}$ must maintain observational indistinguishability by requiring that all configuration pairs it relates that have the same mode state, are low-equivalent modulo modes.

Furthermore, it must be a bisimulation by being symmetric and progressing to itself: any step taken by one of the configurations must be able to be matched by a step taken by the configuration related to it, such that the destinations remain related by $\mathcal{B}$ (and modes-equal).
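For a finite relation and a deterministic step function, the symmetry and progress clauses can be checked by enumeration. The Python sketch below does only that: it omits the low-equivalence clause and closedness under changes by other threads, and the toy countdown program, the configuration encoding `(tps, mds, mem)`, and all function names are assumptions made for illustration.

```python
def is_symmetric(B):
    """sym B: every related pair is also related in the other direction."""
    return all((b, a) in B for (a, b) in B)


def progresses_to_itself(B, step, modes_equal):
    """Each step from the left of a modes-equal pair is matched on the right."""
    for lc1, lc2 in B:
        if not modes_equal(lc1, lc2):
            continue
        lc1_next = step(lc1)
        if lc1_next is None:          # lc1 has terminated: nothing to match
            continue
        lc2_next = step(lc2)          # deterministic, so the only candidate
        if (lc2_next is None
                or not modes_equal(lc1_next, lc2_next)
                or (lc1_next, lc2_next) not in B):
            return False
    return True


# Toy configurations (tps, mds, mem): tps is a countdown counter, mds is an
# opaque tag, and mem is a single secret bit.
def step(lc):
    tps, mds, mem = lc
    return (tps - 1, mds, mem) if tps > 0 else None


def leaky_step(lc):
    """Variant that stops immediately when the secret bit is 1."""
    return None if lc[2] == 1 else step(lc)


def modes_equal(a, b):
    return a[1] == b[1]


configs = [(tps, "m", secret) for tps in range(3) for secret in (0, 1)]
B = {(a, b) for a in configs for b in configs if a[0] == b[0]}
```

Under `step`, the relation `B` (equal counters, any secrets) is symmetric and progresses to itself. Under `leaky_step`, termination depends on the secret, so a step from a configuration with secret 0 cannot be matched by its partner with secret 1 and the check fails.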

Finally—and the most crucial element ensuring the property’s compositionality for concurrent programs—is the condition that $\mathcal{B}$ must be $\mathsf{cg{\text{-}}consistent}$ : closed under globally consistent changes made to memory by other threads, which is to say, changes that preserve low-equivalence and are permitted by the current mode state $\mathit{mds}$ . Specifically, the environment (of other threads) is permitted to change either of variable $x$ ’s value or its classification only when $x$ is writable: $\mathsf{writable}\ \mathit{mds}\ x\equiv x\notin\mathit{mds}\ \mathbf{AsmNoW}\ \land\ x\notin\mathit{mds}\ \mathbf{AsmNoRW}$ .

Definition 2.3 (Closedness under globally consistent changes).

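$$\begin{array}{l}
\mathsf{cg{\text{-}}consistent}\ \mathcal{B}\ \equiv\ \forall\mathit{tps}_{1}\ \mathit{mem}_{1}\ \mathit{tps}_{2}\ \mathit{mem}_{2}\ \mathit{mds}.\\
\quad(\langle\mathit{tps}_{1},\mathit{mds},\mathit{mem}_{1}\rangle,\langle\mathit{tps}_{2},\mathit{mds},\mathit{mem}_{2}\rangle)\in\mathcal{B}\longrightarrow\\
\quad(\forall\mathit{mem}_{1}'\ \mathit{mem}_{2}'.\ (\forall x.\ (\mathit{mem}_{1}\ x\neq\mathit{mem}_{1}'\ x\lor\mathit{mem}_{2}\ x\neq\mathit{mem}_{2}'\ x\lor\\
\qquad\mathcal{L}\ \mathit{mem}_{1}\ x\neq\mathcal{L}\ \mathit{mem}_{1}'\ x)\longrightarrow\mathsf{writable}\ \mathit{mds}\ x)\land\mathit{mem}_{1}'=_{\mathit{mds}}^{\mathsf{Low}}\mathit{mem}_{2}'\longrightarrow\\
\quad(\langle\mathit{tps}_{1},\mathit{mds},\mathit{mem}_{1}'\rangle,\langle\mathit{tps}_{2},\mathit{mds},\mathit{mem}_{2}'\rangle)\in\mathcal{B})
\end{array}$$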
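The restriction on what the environment may change can be sketched as a predicate over a single memory. This Python fragment covers only the part of the premise that says changed values or classifications must belong to writable variables; the full definition also constrains the second memory and requires the new memories to be low-equivalent. The dictionary encoding, the value `"PRIVATE"`, and the function names are assumptions.

```python
def writable(mds, x):
    """writable mds x: no assumption forbids other threads from writing x."""
    return x not in mds["AsmNoW"] and x not in mds["AsmNoRW"]


def env_change_permitted(mem, mem_new, mds, classify):
    """Every variable whose value or classification changes must be writable."""
    return all(
        writable(mds, x)
        for x in mem
        if mem[x] != mem_new[x] or classify(mem, x) != classify(mem_new, x)
    )


def classify(mem, x):
    """Classification for a cut-down worker example."""
    if x == "source":
        return "Low" if mem["domain"] == "LOW" else "High"
    return "Low"


# The thread holds source_lock: others may not write source or domain.
locked = {"AsmNoW": {"source", "domain"}, "AsmNoRW": set(),
          "GuarNoW": set(), "GuarNoRW": set()}
mem = {"domain": "PRIVATE", "source": 1, "low_sink": 0}

sink_write_ok = env_change_permitted(mem, {**mem, "low_sink": 5}, locked, classify)
domain_write_ok = env_change_permitted(mem, {**mem, "domain": "LOW"}, locked, classify)
```

While the lock is held, a write to `low_sink` by another thread is an admissible change, but a write to `domain` is not: it changes both the value of `domain` and the classification of `source`, neither of which is writable.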

Theorem 3.1 of our prior work [22] then gives us that the parallel composition of $\mathsf{com{\text{-}}secure}$ programs is itself a program that enforces a system-wide value-dependent noninterference property ( $\mathsf{sys{\text{-}}secure}$ , for whose details we refer the reader to Section III-2(a) of [22]).

2.2 CVDNI-preserving refinement

Having described the formal security property that we wish to be preserved under refinement (and compilation), we now define formally a suitable notion of secure refinement that preserves it. The proof of CVDNI-preserving refinement for a thread of a concurrent program relies on two binary relations (illustrated by Figure 3) to be nominated by the user of the theory:

1. a refinement relation $\mathcal{R}$ relating local configurations of the abstract program to local configurations of the concrete program: abstract must simulate concrete, in a sense typical of much other work on program refinement, including compiler verification efforts.

2. a concrete coupling invariant $\mathcal{I}$ that allows us to use $\mathcal{B}$ and $\mathcal{R}$ to build a new strong low-bisimulation (modulo modes) for the concrete program, by discarding unreachable pairs of local configurations after the refinement. It thereby witnesses that any changes a refinement (or compiler) makes to execution time do not introduce any timing channels.

The essence of the proof technique is to require that a number of conditions—analogous to those for $\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}$ —be imposed on the nominated $\mathcal{R}$ and $\mathcal{I}$ in relation to a given witness relation $\mathcal{B}$ establishing CVDNI for the abstract program. The definitions to follow are adapted from Murray et al. [22] Section V. For better readability, we present a simplified version in which no new shared variables are added by the refinement. Consequently we introduce the notation $=_{\mathsf{mds}}^{\mathsf{mem}}$ to denote that two local configurations have equal mode state and memory, regardless of whether relating configurations of the same or differing languages.

Regarding the maintenance of modes- and observational-equivalence across the relation, the restrictions on refinement are tighter than those that applied to $\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}$ . The refinement relation $\mathcal{R}$ is required to preserve the shared memory in its entirety:

Definition 2.4 (Preservation of modes and memory).

[TABLE]

Regarding the closedness under changes by other threads that ensures compositionality for concurrency, on $\mathcal{I}$ we again impose $\mathsf{cg{\text{-}}consistent}$ (2.3) from Section 2.1. However in the case of $\mathcal{R}$ , we instead impose $\mathsf{closed{\text{-}}others}$ , a simplification of $\mathsf{cg{\text{-}}consistent}$ considering only environmental actions that affect the memories on both sides of the relation identically. Furthermore it ensures equality of all shared variables, not just those judged observable:

Definition 2.5 (Closedness of refinements under changes by others).

[TABLE]

The final major requirement for CVDNI-preservation is then to prove $\mathcal{R}$ and $\mathcal{I}$ closed simultaneously under the pairwise executions of the concrete and abstract programs, using the aforementioned cube-shaped diagram ( $\mathsf{coupling{\text{-}}inv{\text{-}}pres}$ , Figure 1) whose edges are pairs in $\mathcal{B}$ , $\mathcal{R}$ , and $\mathcal{I}$ . All that then remains is for the nominated concrete coupling invariant $\mathcal{I}$ to be symmetric, and the predicate $\mathsf{secure{\text{-}}refinement}$ puts together all the requirements:

Definition 2.6 (Requirements for secure refinement of the per-thread CVDNI property).

[TABLE]

Theorem 5.1 of our prior work [22] gives us that under the aforementioned conditions,

[TABLE]

is a witness $\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}$ for the concrete program:

[TABLE]

3 Decomposition principle for CVDNI-preserving refinement

Having presented the formalisation in our previous work [22] of our security property CVDNI and its preservation by refinement, we now present our first contribution: an alternative way of proving $\mathsf{secure{\text{-}}refinement}$ (2.6) that does away with the use of the cube-shaped, two-sided refinement obligation $\mathsf{coupling{\text{-}}inv{\text{-}}pres}\ \mathcal{B}\ \mathcal{R}\ \mathcal{I}$ (depicted by Figure 1), by decomposing its concerns into (1) proving $\mathcal{R}$ closed under the pairwise executions of the concrete and abstract programs alone using a square-shaped diagram (depicted by Figure 4(a), which is akin to ordinary semantics-preserving refinement), and (2) a number of smaller and more separable obligations gathered together under the side-condition predicate $\mathsf{decomp{\text{-}}refinement{\text{-}}safe}$ .

Definition 3.1 (Decomposed requirements for CVDNI-preserving secure refinement).

[TABLE]

The decomposition requires the provision of a new refinement parameter that we will call $\mathit{abs{\text{-}}steps}$ or the pacing function, whose role is to dictate the pace of the refinement by returning the number of abstract steps that ought to be taken for a single concrete step, for a given abstract-concrete local configuration pair related by $\mathcal{R}$ . The side-conditions on all of the refinement parameters (depicted by Figures 4(b), 4(c)) are then defined as follows:

Definition 3.2 (Side-conditions for CVDNI-preserving refinement decomposition).

[TABLE]

On the intuitive meaning of the side-conditions in 3.2:

•

$\mathsf{stops}\ \mathit{lc}_{1C}=\mathsf{stops}\ \mathit{lc}_{2C}$ ensures that the refinement has not introduced any termination leaks, by asserting consistent stopping behaviour for $\mathcal{I}$ -related concrete program configurations, which we know to be observationally indistinguishable.

•

$\mathit{abs{\text{-}}steps}\ \mathit{lc}_{1A}\ \mathit{lc}_{1C}=\mathit{abs{\text{-}}steps}\ \mathit{lc}_{2A}\ \mathit{lc}_{2C}$ ensures that the refinement has not introduced any timing leaks, by asserting consistency of the pace of the refinement for $\mathcal{R}$ -related program configurations, which we again know to be observationally indistinguishable.

•

The final $\forall$ -quantified clause asserts $\mathcal{I}$ ’s suitability as a coupling invariant, in that it must remain closed under lockstep evaluation of the concrete program configurations it relates. Furthermore it must maintain mode state equality with each lockstep evaluation, which ensures that the refinement has not introduced any inconsistencies in the memory access assumptions and guarantees needed for the concurrent compositionality of the property.

Note that the $\mathcal{B}$ - and $\mathcal{R}$ -edges in Figure 4(c) may capture useful facts about a particular program verification technique and compiler, so their availability as assumptions is intended to greatly reduce the effort needed to specify a coupling invariant $\mathcal{I}$ and prove that it satisfies the condition.
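To make the three side-conditions concrete, the following is a minimal sketch of how they might be checked over finite relations in Python. It is not part of the Isabelle/HOL formalisation: the names `decomp_refinement_safe`, `step`, `stops` and `mds`, the toy two-instruction system, and the assumption that the conditions are required exactly when the four configurations are related by $\mathcal{B}$ , $\mathcal{R}$ (twice) and $\mathcal{I}$ are all ours, for illustration only.

```python
from itertools import product


def decomp_refinement_safe(B, R, I, abs_steps, stops, step, mds):
    """Check the three side-conditions on finite relations.

    B: set of (abstract, abstract) pairs; R: set of (abstract, concrete)
    pairs; I: set of (concrete, concrete) pairs. `step` maps a concrete
    configuration to the set of its successors (an assumed encoding).
    """
    for a1, a2 in B:
        for (x1, c1), (x2, c2) in product(R, R):
            if (x1, x2) != (a1, a2) or (c1, c2) not in I:
                continue
            if stops(c1) != stops(c2):                # no termination leak
                return False
            if abs_steps(a1, c1) != abs_steps(a2, c2):  # no timing leak
                return False
            for n1, n2 in product(step(c1), step(c2)):
                # I is closed under lockstep steps, with equal mode state
                if (n1, n2) not in I or mds(n1) != mds(n2):
                    return False
    return True


# Toy system (hypothetical): concrete configurations are (pc, secret)
# pairs of a two-instruction program; the abstract program is "start"
# until the concrete program finishes, then "done".
def step(c):
    pc, secret = c
    return {(pc + 1, secret)} if pc < 2 else set()


def stops(c):
    return not step(c)


def mds(c):
    return frozenset()  # no mode assumptions in this toy


def abs_steps(a, c):
    return 1 if c[0] == 1 else 0


def abs_of(c):
    return "done" if c[0] == 2 else "start"


concs = [(pc, s) for pc in range(3) for s in (0, 1)]
R = {(abs_of(c), c) for c in concs}
B = {("start", "start"), ("done", "done")}
I_same_pc = {(c1, c2) for c1 in concs for c2 in concs if c1[0] == c2[0]}
I_all = {(c1, c2) for c1 in concs for c2 in concs}

print(decomp_refinement_safe(B, R, I_same_pc, abs_steps, stops, step, mds))  # True
print(decomp_refinement_safe(B, R, I_all, abs_steps, stops, step, mds))      # False
```

In this toy, a coupling invariant that relates only configurations at the same program counter passes, whereas one that relates all configurations fails the pacing-consistency condition.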

Assuming the fulfilment of all of the decomposed requirements, we obtain that they are a sound method for establishing secure refinement of the per-thread CVDNI property:

Theorem 3.3 (Soundness of $\mathsf{secure{\text{-}}refinement{\text{-}}decomp}$ ).

[TABLE]

In the interests of brevity we relegate proof sketches for all results to Appendices C and D, and for fuller details we refer the reader to our Isabelle/HOL formalisation.

We now devote our attention to two instantiations of this new decomposition principle: (Section 4) for a proof of CVDNI-preservation for the refinement of a program that branches on a secret, and (Section 5.5) for the proof of CVDNI-preservation by a compiler.

4 Proof effort comparison

To demonstrate how the decomposition principle reduces proof complexity and effort, we returned to the example refinement discussed in Section V-E of our previous work [22], an excerpt of which is shown in Figure 3. The abstract program (9 imperative commands) branches on a sensitive value, and executes a single atomic expression assignment in each branch. Its refinement (to 16 commands) models expansion of the expressions into multiple steps, resolving a timing disparity between the two branches by padding with skip.

We use proof size as a proxy for proof effort, since the former is known to be strongly linearly correlated with the latter [28]. For the original proof, formalised in Isabelle/HOL as EgHighBranchRevC.thy [21], the line count stood at about 4.6K lines of definitions and proofs, of which approx. 3.6K lines were proofs. When the proof is adapted to use the decomposition principle $\mathsf{secure{\text{-}}refinement{\text{-}}decomp}$ (3.1) instead, the proof line count drops from 3.6K to approx. 2K, a 44% reduction. Regarding definition changes, the new proof makes fewer than 10 lines of adaptations to a coupling invariant and pacing function used by the old proof, and adds about 30 lines of new helper definitions for use with the decomposition principle. The rest of the theory and its external dependencies remain common to the two versions.
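As a quick check of the quoted reduction, using only the approximate proof line counts given above (so the result is itself approximate):

$\begin{aligned}\frac{3.6\mathrm{K}-2\mathrm{K}}{3.6\mathrm{K}}=\frac{1.6}{3.6}\approx 0.44\end{aligned}$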

As would be expected, the bulk of the deletions are from the full cube-shaped refinement diagram proof (Figure 1) of $\mathsf{secure{\text{-}}refinement}$ (2.6) for the refinement relation. The surviving parts of that proof just become the square-shaped refinement diagram proof (Figure 4(a)) of $\mathsf{secure{\text{-}}refinement{\text{-}}decomp}$ without much modification. The deletions are replaced by newly added proofs of the three sub-obligations of $\mathsf{decomp{\text{-}}refinement{\text{-}}safe}$ (3.2).

5 The Covern wr-compiler

Having presented our new decomposition principle for CVDNI-preserving refinement, we now turn to our compiler, whose most notable features for formal proof of secure refinement are:

1. Its implementation tracks variable stability (Section 5.4) responsive to use of locking primitives, to know when accesses to shared variables are safe to optimise, and when register contents can still be considered consistent with shared variable contents.

2. Its verification uses a pacing function (Section 5.5.2) and coupling invariant (Section 5.5.3) as the decomposition demands, to ensure it does not introduce timing leaks.

First, we describe its source and target languages, and parameters to the compilation.

5.1 Source language

The Covern wr-compiler—short for While-to-RISC compiler—takes the simple imperative language with while-looping and lock-based synchronisation targeted by the Covern program logic [20], which we will refer to as While, consisting of the commands $cmd$ :

[TABLE]

The language is parameterised over a type of values $\mathit{Val}$ , and binary operators $\oplus::\mathit{Val}\Rightarrow\mathit{Val}\Rightarrow\mathit{Val}$ . Constants are written $n::\mathit{Val}$ , while $v::\mathit{Var}$ and $k::\mathit{Lock}$ are (resp.) shared program variables and lock variables. The semantics of the locking primitives $\textbf{lock}(k)$ and $\textbf{unlock}(k)$ is informed by a locking discipline provided by the user of the theory as a parameter (see Section 5.3). We leave for future work adding support for pointers and arrays, which we believe will be straightforward because our assume-guarantee framework already provides the means to encode the memory footprint of a command in a way that depends on values in memory.

We assume that the underlying concurrent execution model (e.g. operating system, scheduler) for the While language prevents threads from seeing each other's current program location, and thus (as in previous work [22, 19]) we model the While program command $c::\mathit{cmd}$ being executed as thread-private state: $\langle c,\mathit{mds},\mathit{mem}\rangle_{\mathsf{w}}$ . In contrast, all program variables $v::\mathit{Var}$ and lock variables $k::\mathit{Lock}$ reside in the shared memory $\mathit{mem}$ .

5.2 Target language

The wr-compiler’s target is a generic RISC-style assembly language like that of Tedesco et al. [29] but with lock-based synchronisation primitives added, which we will refer to as RISC:

[TABLE]

The language is parameterised over the same value type $\mathit{Val}$ and binary operators $\oplus$ , shared program variables $v::\mathit{Var}$ and shared lock variables $k::\mathit{Lock}$ as the While language. Presently, direct-addressing Load and Store instructions (referring to registers $r::\mathit{Reg}$ ) are adequate for RISC to implement all existing While features, and we expect adding indirect addressing to RISC to be as straightforward as adding pointer and array support to While.

RISC program texts $P$ are just lists of binary instructions $I$ , each optionally associated with a label $l::\mathit{Lab}$ . We assume that the underlying concurrency model for the RISC language (e.g. OS, scheduler) prevents one thread from reading the program code (instructions) of another,² as well as another’s registers (including the program counter). Thus, we model the distinguished program counter register’s value $\mathit{pc}::\mathbb{N}$ , program text $P$ , and register bank $\mathit{regs}::\mathit{Reg}\Rightarrow\mathit{Val}$ as thread-private state: $\langle((\mathit{pc},P),\mathit{regs}),\mathit{mds},\mathit{mem}\rangle_{\mathsf{r}}$ . Apart from this adaptation to our triple format, evaluation semantics follows that of the RISC target of [29].

² As is usual for program analyses, we omit any explicit modelling of the microarchitectural state used by superscalar processors (like CPU caches, and state relied on by speculative and out-of-order execution, on whose behaviour attacks like Spectre [13] and Meltdown [16] relied). We argue however that our present assumptions are reasonable under two circumstances: when there is no such state (e.g. on microcontrollers like AVR [7]), or when such state is correctly partitioned by the underlying hardware [30] or the OS [8] – if the hardware allows it [9]! In the latter case, our analysis assumes that microarchitectural state footprints are partitioned according to thread (for memory containing program text) and according to classification by $\mathcal{L}$ (for shared memory), and furthermore that each value-dependently classified region is given a distinct partition that is flushed on reclassification.

Finally, like Tedesco et al. [29] we generalise over the (user-supplied) register allocation scheme, and assume there are enough registers to service the maximum depth of expressions in the source program. (More details are available in Appendix D.1.) We leave for future work the modelling and analysis of a compiler phase that spills register contents to memory, in order to make this assumption unnecessary.

5.3 Locking discipline

Like the Covern logic [20], we assume that the While language program being compiled follows a certain locking discipline, about which the compiler has knowledge, so as to ensure that the RISC program it produces follows the same discipline.

The user of the theory provides the details of the locking discipline in the form of a lock interpretation parameter: $\mathit{lock{\text{-}}interp}::\mathit{Lock}\Rightarrow(\mathit{Var}\ \mathit{set}\times\mathit{Var}\ \mathit{set})$ , which for each lock gives the two non-overlapping sets of program variables over which acquiring the lock grants exclusive permission (resp.) to write, and to both read and write. These permissions are then reflected in the way the semantics of the While and RISC locking primitives act on the mode state.
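As an illustration, a lock interpretation and its effect on the mode state might be sketched in Python as follows. This is a simplified model under our own assumptions: the mode state is reduced to its two assumption sets (AsmNoW, AsmNoRW), guarantees are ignored, and the lock and variable names (`k_buf`, `len`, `buf`) are hypothetical.

```python
from typing import Dict, FrozenSet, Tuple

Var = str
Lock = str
LockInterp = Dict[Lock, Tuple[FrozenSet[Var], FrozenSet[Var]]]

# Hypothetical discipline: holding "k_buf" grants exclusive write access
# to "len" and exclusive read/write access to "buf".
lock_interp: LockInterp = {
    "k_buf": (frozenset({"len"}), frozenset({"buf"})),
}


def well_formed(interp: LockInterp) -> bool:
    """Each lock's two variable sets must not overlap."""
    return all(no_w.isdisjoint(no_rw) for no_w, no_rw in interp.values())


def acquire(mds, interp: LockInterp, k: Lock):
    """Sketch of lock(k): add the assumptions that the lock grants."""
    no_w, no_rw = interp[k]
    return {"AsmNoW": mds["AsmNoW"] | no_w, "AsmNoRW": mds["AsmNoRW"] | no_rw}


def release(mds, interp: LockInterp, k: Lock):
    """Sketch of unlock(k): drop the assumptions that the lock granted."""
    no_w, no_rw = interp[k]
    return {"AsmNoW": mds["AsmNoW"] - no_w, "AsmNoRW": mds["AsmNoRW"] - no_rw}


mds0 = {"AsmNoW": frozenset(), "AsmNoRW": frozenset()}
held = acquire(mds0, lock_interp, "k_buf")
print(sorted(held["AsmNoW"]), sorted(held["AsmNoRW"]))  # ['len'] ['buf']
```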

Regarding lock interpretations and the way they interact with the user-provided value-dependent classification function $\mathcal{L}$ (see Section 2.1), we inherit a few cleanliness conditions from that earlier work [20], chief of which are that lock variables $k$ cannot be control variables, a lock variable $k$ governing access to a program variable $v$ must govern the same kind of access to all of $v$ ’s control variables, and $\mathcal{L}$ must classify all lock variables as $\mathsf{Low}$ .

5.4 Compiler implementation and tracking of shared variable stability

We chose as a starting point the compilation scheme of [29], on the basis of their preserving a noninterference property that like ours exhibits resilience to changes made by an environment—in their case, intended for fault-resilience. Aiming to repurpose that for shared-variable concurrency, we adapted it to Isabelle, implementing it as a primitive recursive function:

[TABLE]

where we choose $\mathit{Lab}=\mathbb{N}$ for RISC instruction labels, and the compilation record type $CompRec$ is bookkeeping maintained by the compiler that we will describe further below.

A typical invocation to compile a While program $c::cmd$ takes the form:

[TABLE]

Here, $\mathsf{compile{\text{-}}cmd}$ takes an initial compilation record $C$ , an optional entry label $l$ , and the next available label $\mathit{nl}$ , and for the benefit of the next invocation returns an optional exit label $l^{\prime}$ if one is used by the program just compiled, the new next available label $\mathit{nl}^{\prime}$ , and a final compilation record $C^{\prime}$ . We leave details of label allocation and its impact on achieving sequential composability for compiled RISC programs to Appendix D.2.

In addition to the output RISC program $P::I\ list$ itself, a call to $\mathsf{compile{\text{-}}cmd}$ also outputs every $CompRec$ associated with the state of the program just before executing every instruction in $P$ . These are returned zipped up together with $P$ as the $CompRec$ -annotated RISC program $\mathit{PCs}::(I\times CompRec)\ list$ . ( $P$ can trivially be recovered as $\mathsf{map}\ \mathsf{fst}\ \mathit{PCs}$ .) Finally, $\mathsf{compile{\text{-}}cmd}$ may return $\mathsf{True}$ for $\mathit{failed}$ to reject the input program, such as when it detects a data race (see below), or if expression depth exceeds the assumed limit (Section 5.2).

In the style of the compilation scheme on which it was based [29], the wr-compiler maintains a register record $\Phi::reg\rightharpoonup exp$ , i.e. a partial map of registers to expressions on shared variables. In addition to using it to compile away any unnecessary loads from variables in shared memory, we also use it to ensure that an expression calculated by RISC in registers is equal to the value of the expression as if it had all been calculated by While in one step. This is especially important when writing the result of an expression back to shared memory, because the refinement is required to maintain all shared memory values.
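The following toy expression compiler sketches how a register record can both avoid redundant loads and record which expression each register holds. It is not the wr-compiler's expression compiler: the three-address `Op` form, the three-register bank `REGS`, the tuple encoding of expressions and all function names are assumptions made for illustration.

```python
import operator

OPS = {"+": operator.add, "*": operator.mul}
REGS = ("r0", "r1", "r2")  # hypothetical register bank


def alloc(in_use):
    """First register not in use, or None if the bank is exhausted."""
    return next((r for r in REGS if r not in in_use), None)


def compile_expr(regrec, in_use, e):
    """Return (instrs, r, regrec', failed) for expression e."""
    for r, held in regrec.items():
        if held == e:  # value already in a register: emit nothing
            return [], r, regrec, False
    rec, instrs = regrec, []
    if e[0] == "op":
        _, op, e1, e2 = e
        i1, r1, rec, failed = compile_expr(rec, in_use, e1)
        if failed:
            return [], None, regrec, True
        # Keep the left operand's register marked in use while the
        # right operand is compiled, so that it is not reallocated.
        i2, r2, rec, failed = compile_expr(rec, in_use | {r1}, e2)
        if failed:
            return [], None, regrec, True
        instrs = i1 + i2
    rd = alloc(in_use)
    if rd is None:
        return [], None, regrec, True
    if e[0] == "op":
        instrs = instrs + [("Op", op, rd, r1, r2)]
    elif e[0] == "var":
        instrs = [("Load", rd, e[1])]
    else:
        instrs = [("MoveK", rd, e[1])]
    return instrs, rd, {**rec, rd: e}, False


def run(instrs, mem):
    """Execute the toy instructions and return the register bank."""
    regs = {}
    for ins in instrs:
        if ins[0] == "Load":
            regs[ins[1]] = mem[ins[2]]
        elif ins[0] == "MoveK":
            regs[ins[1]] = ins[2]
        else:
            _, op, rd, r1, r2 = ins
            regs[rd] = OPS[op](regs[r1], regs[r2])
    return regs


# v + (v + 1): the second use of v reuses the register loaded for the first.
E = ("op", "+", ("var", "v"), ("op", "+", ("var", "v"), ("const", 1)))
instrs, r, rec, failed = compile_expr({}, frozenset(), E)
print(run(instrs, {"v": 5})[r])  # 11
```

In this sketch the returned register is always recorded as holding the compiled expression, and `v` is loaded only once.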

New to the wr-compiler is the responsibility of maintaining an assumption record, which it uses primarily to detect and reject programs with data races on shared memory, and to rule out the introduction of any new ones. Each assumption record $\mathcal{S}::(\mathit{Var}\ \mathit{set}\times\mathit{Var}\ \mathit{set})$ is a pair tracking the set of variables on which (resp.) AsmNoW, AsmNoRW assumptions are currently active at a given point in the program being compiled. As a secondary concern we also use it to assert that the two sides of any if-conditional branches act consistently on the mode state, and that while-loops restore the original mode state on termination.

A compilation record $C=(\Phi,\mathcal{S})::CompRec$ is then just a register/assumption record pair. For readability, we use $\mathsf{regrec}$ , $\mathsf{asmrec}$ to denote (resp.) a $CompRec$ ’s $\mathsf{fst}$ , $\mathsf{snd}$ projections.

To explain how the compilation record is used to rule out data races, and to ensure consistency of expression evaluation between source and target program, firstly we must introduce the concept of stability of a variable $v$ according to an assumption record $\mathcal{S}$ :

[TABLE]

In short, this means that the variable and all its control variables ( $\mathcal{C}\mathsf{vars}\ v$ ) are recorded as having either of AsmNoW or AsmNoRW active on them.

For register record entries to be of any help in ensuring consistency of While and RISC expression evaluation, we exclude expression evaluation on data race-prone variables by lifting the concept of stability to register records. The following predicate asserts internal consistency of the compilation record $C$ created by $\mathsf{compile{\text{-}}cmd}$ , in the sense that the register record may only map to expressions that mention variables that are recorded as stable by the assumption record accompanying it. (Here, $\mathsf{ran}$ denotes the range of a map.)

[TABLE]
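Read operationally, variable stability and its lifting to register records might be sketched as below. The tuple encoding of expressions, the `cvars` callback standing in for $\mathcal{C}\mathsf{vars}$ , and the example variable names are assumptions of this sketch rather than the Isabelle definitions.

```python
def expr_vars(e):
    """Shared variables mentioned by an expression (tuple encoding)."""
    tag = e[0]
    if tag == "var":
        return {e[1]}
    if tag == "const":
        return set()
    if tag == "op":  # ("op", name, left, right)
        return expr_vars(e[2]) | expr_vars(e[3])
    raise ValueError(f"unknown expression: {e!r}")


def var_stable(asmrec, cvars, v):
    """v and all of its control variables have AsmNoW or AsmNoRW recorded."""
    no_w, no_rw = asmrec
    protected = no_w | no_rw
    return all(x in protected for x in {v} | cvars(v))


def regrec_stable(comp_rec, cvars):
    """Every expression in the register record mentions only stable variables."""
    regrec, asmrec = comp_rec
    return all(
        var_stable(asmrec, cvars, v)
        for e in regrec.values()
        for v in expr_vars(e)
    )


# Hypothetical example: "buf" is classified depending on "buf_ctrl".
CVARS = {"buf": {"buf_ctrl"}}


def cvars_of(v):
    return CVARS.get(v, set())


regrec = {"r0": ("op", "+", ("var", "buf"), ("const", 1))}
print(regrec_stable((regrec, ({"buf_ctrl"}, {"buf"})), cvars_of))  # True
print(regrec_stable((regrec, (set(), {"buf"})), cvars_of))         # False
```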

To ensure that an input While program maintains register record stability, we define the predicate $\mathsf{no{\text{-}}unstable{\text{-}}exprs}\ c\ C$ to capture the requirement that a program $c$ , if started with a configuration consistent with compilation record $C$ , will never access a lock-protected variable without holding the relevant lock. (It also checks the secondary, mode-state consistency concerns of the assumption record mentioned earlier.) We implement it as a simple static check carried out by a primitive recursive function on the structure of While programs.

Together, $\mathsf{regrec{\text{-}}stable}$ and $\mathsf{no{\text{-}}unstable{\text{-}}exprs}$ make up the main two requirements of a predicate $\mathsf{compile{\text{-}}cmd{\text{-}}input{\text{-}}reqs}\ C\ l\ \mathit{nl}\ c$ imposed on the input arguments to $\mathsf{compile{\text{-}}cmd}$ , which gives us enough information to prove a lemma that $\mathsf{compile{\text{-}}cmd}$ only ever outputs stable register records. Full details of these we leave to Appendix D.3.

5.5 Proof of CVDNI-preserving compilation

Having covered the most significant aspects of the Covern wr-compiler’s parameters and machinery, we can now present the refinement relation $\mathcal{R}_{\mathsf{wr}}$ (Section 5.5.1), pacing function $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ (Section 5.5.2), and coupling invariant $\mathcal{I}_{\mathsf{wr}}$ (Section 5.5.3) that we use with our new decomposition principle (of Section 3) to prove that it preserves CVDNI (Section 5.5.4).

5.5.1 Refinement relation $\mathcal{R}_{\mathsf{wr}}$ and its invariants

Just like our example $\mathcal{R}$ of Figure 3, $\mathcal{R}_{\mathsf{wr}}$ pairs abstract with concrete configurations.

Here, we will focus on $\mathcal{R}_{\mathsf{wr}}$ ’s most notable characteristics for understanding why it is suitable to describe a CVDNI-preserving compilation.³ We focus on the case $\mathtt{if\_expr}$ of $\mathcal{R}_{\mathsf{wr}}$ , which relates the expression evaluation part of the While program $\textbf{if}\ e\ \textbf{then}\ c_{1}\ \textbf{else}\ c_{2}\ \textbf{fi}$ , with the corresponding part (including the conditional jump Jz after expression evaluation) of the RISC program obtained by running $\mathsf{compile{\text{-}}cmd}$ on it. (Variables ignored are in gray.)

³ We provide an informal description of all of the cases, their purpose, and the invariants they maintain, along with a code listing from $\mathsf{compile{\text{-}}cmd}$ relevant to the part that will be presented, in Appendices A and B (respectively). For full details, we refer the reader to the Isabelle formalisation.

Example 5.1 (Introduction rule for case if_expr of $\mathcal{R}_{\mathsf{wr}}$ ).

[TABLE]

This is a fairly typical case of $\mathcal{R}_{\mathsf{wr}}$ in a number of respects:

Firstly, there is a direct reference to the call to $\mathsf{compile{\text{-}}cmd}$ for the given While program. Secondly, various guards ( $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ introduced below, and $\mathsf{regrec{\text{-}}stable}$ defined in Section 5.4) are asserted in order to restrict the scope of $\mathcal{R}_{\mathsf{wr}}$ only to consider well-formed local program configurations that line up with the conditions captured by the compilation record. Thirdly, the inductive references to $\mathcal{R}_{\mathsf{wr}}$ for $P_{1}$ and $P_{2}$ , the branches of the conditional that have not been reached yet, are quantified over all configurations that obey the guards $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ and $\mathsf{regrec{\text{-}}stable}$ relative to $C_{1}$ , the initial compilation record for each of the sub-calls to $\mathsf{compile{\text{-}}cmd}$ for those sub-programs.

The guard $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ mentioned above asserts that the compilation record $C$ is consistent with the registers $\mathit{regs}$ , memory $\mathit{mem}$ and mode state $\mathit{mds}$ .

[TABLE]

Firstly, for all entries in the register record mapping some register $r$ to some expression $e$ , the value held in $r$ of the register bank $\mathit{regs}$ must match the value of $e$ if evaluated under memory $\mathit{mem}$ . Secondly, the assumption record must consist exactly of the program variables that the mode state $\mathit{mds}$ says have AsmNoW, AsmNoRW on them respectively.
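The two conditions just described can be sketched as a Python predicate. The encoding is assumed for illustration: expressions are nested tuples, the mode state is a dictionary with keys `"AsmNoW"` and `"AsmNoRW"` containing only program variables, and `config_consistent` is our own name for the check.

```python
import operator

OPS = {"+": operator.add, "-": operator.sub}


def eval_expr(mem, e):
    """Evaluate a tuple-encoded expression under memory `mem`."""
    tag = e[0]
    if tag == "var":
        return mem[e[1]]
    if tag == "const":
        return e[1]
    _, op, e1, e2 = e  # ("op", name, left, right)
    return OPS[op](eval_expr(mem, e1), eval_expr(mem, e2))


def config_consistent(comp_rec, regs, mds, mem):
    """Compilation record agrees with registers, memory and mode state."""
    regrec, (no_w, no_rw) = comp_rec
    # 1. Each recorded register holds the value of its recorded expression.
    regs_match = all(regs[r] == eval_expr(mem, e) for r, e in regrec.items())
    # 2. The assumption record is exactly what the mode state says.
    asms_match = (set(no_w) == set(mds["AsmNoW"])
                  and set(no_rw) == set(mds["AsmNoRW"]))
    return regs_match and asms_match


comp_rec = ({"r0": ("op", "+", ("var", "x"), ("const", 1))}, ({"x"}, set()))
mds = {"AsmNoW": {"x"}, "AsmNoRW": set()}
print(config_consistent(comp_rec, {"r0": 6}, mds, {"x": 5}))  # True
print(config_consistent(comp_rec, {"r0": 6}, mds, {"x": 7}))  # False
```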

As we will see in Theorem 5.8, $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ also serves as initial configuration requirements for compiled programs: only configurations obeying them may be used to initialise a RISC program compiled by the wr-compiler with initial compilation record $C$ .

With $\mathcal{R}_{\mathsf{wr}}$ specified, we then prove the two requirements for $\mathsf{secure{\text{-}}refinement{\text{-}}decomp}$ that pertain to $\mathcal{R}_{\mathsf{wr}}$ alone: $\mathsf{preserves{\text{-}}modes{\text{-}}mem}$ (2.4) and $\mathsf{closed{\text{-}}others}$ (2.5).

Lemma 5.2 ( $\mathcal{R}_{\mathsf{wr}}$ preserves modes and memory).

$\mathsf{preserves{\text{-}}modes{\text{-}}mem}\ \mathcal{R}_{\mathsf{wr}}$

Lemma 5.3 ( $\mathcal{R}_{\mathsf{wr}}$ is closed under changes by others).

$\mathsf{closed{\text{-}}others}\ \mathcal{R}_{\mathsf{wr}}$

5.5.2 Refinement pacing function $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$

We now nominate an $\mathit{abs{\text{-}}steps}$ function, determining the pace at which While programs progress in comparison to the RISC programs that they are compiled to by the wr-compiler.

To assist here and elsewhere, we define a primitive recursive helper $\mathsf{leftmost{\text{-}}cmd}$ that, given a sequence of ;-separated While commands, strips all but the first: given $c_{1}{}\mathbin{;}{}c_{2}$ it returns $\mathsf{leftmost{\text{-}}cmd}\ c_{1}$ , and given any other While program $c$ it returns $c$ .

Our pacing function $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ primarily looks at the form of the RISC program instruction about to be executed. The RISC instructions are divided into three categories:

•

Instructions output by $\mathsf{compile{\text{-}}expr}$ : Load, Op, and MoveK. For these, $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ returns 1 if the $\mathsf{leftmost{\text{-}}cmd}$ of the While program is $\textbf{while}\ e\ \textbf{do}\ c\ \textbf{od}$ , to allow it to step to $\textbf{if}\ e\ \textbf{then}\ (c{}\mathbin{;}{}\textbf{while}\ e\ \textbf{do}\ c\ \textbf{od})\ \textbf{else}\ \textbf{stop}\ \textbf{fi}$ concurrently with the first RISC step of the compiled expression itself. Otherwise, $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ returns 0 to indicate the While program standing still while the RISC program takes new steps to evaluate the expression.

•

“Epilogue” steps: Jmp and Nop when used for control flow at the end of a smaller compiled program in the context of a larger one. For these, $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ returns 0.

•

All other RISC instructions are assumed to proceed at a lockstep pace with the While command they were compiled from, and for these $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ returns 1.
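The helper and the three categories above can be sketched in Python as follows. This is a simplification under stated assumptions: While commands are encoded as tuples, the pacing function receives only the While command and the name of the next RISC instruction rather than whole local configurations, and whether a Jmp or Nop is an epilogue step is passed in as a hypothetical flag `is_epilogue`, since that depends on context not modelled here.

```python
def leftmost_cmd(c):
    """Strip all but the first of a ;-separated sequence of commands."""
    return leftmost_cmd(c[1]) if c[0] == "seq" else c


EXPR_INSTRS = {"Load", "Op", "MoveK"}   # emitted when compiling expressions
EPILOGUE_INSTRS = {"Jmp", "Nop"}        # epilogue only in some contexts


def abs_steps_wr(while_cmd, instr, is_epilogue=False):
    """Number of While steps to take for one step of RISC instruction `instr`."""
    if instr in EXPR_INSTRS:
        # A while-loop unrolls to an if during the first expression step.
        return 1 if leftmost_cmd(while_cmd)[0] == "while" else 0
    if instr in EPILOGUE_INSTRS and is_epilogue:
        return 0
    return 1  # everything else proceeds in lockstep


loop = ("while", "e", ("skip",))
unrolled = ("if", "e", ("seq", ("skip",), loop), ("stop",))
print(abs_steps_wr(("seq", loop, ("skip",)), "Load"))      # 1
print(abs_steps_wr(("seq", unrolled, ("skip",)), "Op"))    # 0
print(abs_steps_wr(unrolled, "Jz"))                        # 1
```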

Having nominated $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ and $\mathcal{R}_{\mathsf{wr}}$ , we now have the parameters over which we are obliged to prove refinement preservation (Figure 4(a)) as demanded by $\mathsf{secure{\text{-}}refinement{\text{-}}decomp}$ (3.1). To this end, we prove firstly (elided to Appendix D.3) that every step of execution of a RISC program produced by the wr-compiler from a While program maintains the consistency demanded by $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ between configurations and compilation records. Also, we must prove a correctness lemma for the expression compiler:

Lemma 5.4.

$(\mathit{PCs},r,C^{\prime},\mathsf{False})=\mathsf{compile{\text{-}}expr}\ C\ A\ l\ e\ \implies(\mathsf{regrec}\ C^{\prime})\ r=\mathsf{Some}\ e$

Armed with these facts, we can now prove the main refinement preservation result:

Lemma 5.5 ( $\mathcal{R}_{\mathsf{wr}}$ is a refinement paced by $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ ).

[TABLE]

5.5.3 Concrete coupling invariant $\mathcal{I}_{\mathsf{wr}}$

The next element needed is the concrete coupling invariant $\mathcal{I}_{\mathsf{wr}}$ , which we define as follows:

$\mathcal{I}_{\mathsf{wr}}\equiv\{(\langle((\mathit{pc},P),\mathit{regs}),\mathit{mds},\mathit{mem}\rangle_{\mathsf{r}},\langle((\mathit{pc}^{\prime},P^{\prime}),\mathit{regs}^{\prime}),\mathit{mds}^{\prime},\mathit{mem}^{\prime}\rangle_{\mathsf{r}})\ |\ (\mathit{pc},P)=(\mathit{pc}^{\prime},P^{\prime})\}$

In other words, $\mathcal{I}_{\mathsf{wr}}$ asserts that we only need compare local configurations that are at the same location $\mathit{pc}=\mathit{pc}^{\prime}$ of the same RISC program $P=P^{\prime}$ . When used in concert with a $\mathsf{no{\text{-}}high{\text{-}}branching}\ \mathcal{B}$ (see Section 5.5.4), the effect of $\mathcal{I}_{\mathsf{wr}}$ is to ensure that the wr-compiler has not introduced any new branching on sensitive values.
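Because $\mathcal{I}_{\mathsf{wr}}$ inspects only the program counter and program text, it can be sketched as a simple predicate on pairs of RISC local configurations. The tuple encoding below mirrors the triple format $\langle((\mathit{pc},P),\mathit{regs}),\mathit{mds},\mathit{mem}\rangle_{\mathsf{r}}$ ; the name `in_I_wr` and the example program are hypothetical.

```python
def in_I_wr(lc1, lc2):
    """Two RISC local configurations are related iff they are at the
    same location of the same program; registers, mode state and
    memory are deliberately ignored."""
    ((pc1, prog1), _regs1), _mds1, _mem1 = lc1
    ((pc2, prog2), _regs2), _mds2, _mem2 = lc2
    return (pc1, prog1) == (pc2, prog2)


P = (("Load", "r0", "x"), ("Jz", "L1", "r0"), ("Nop",))  # hypothetical program
lc_a = ((1, P), {"r0": 0}), frozenset(), {"x": 0, "h": 1}
lc_b = ((1, P), {"r0": 0}), frozenset(), {"x": 0, "h": 2}
lc_c = ((2, P), {"r0": 0}), frozenset(), {"x": 0, "h": 1}

print(in_I_wr(lc_a, lc_b))  # True: same pc and program, memories may differ
print(in_I_wr(lc_a, lc_c))  # False: different program locations
```

The predicate is symmetric by construction, which matches the requirement that a concrete coupling invariant be symmetric.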

5.5.4 Successful compilations are CVDNI-preserving refinements

We are ready to prove preservation. First we qualify that we allow only $\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}\ \mathcal{B}$ that describe only While-programs with no branching on $\mathsf{High}$ -classified values, as follows:

[TABLE]

That is, it refuses to relate configurations at different program locations. Furthermore if it is at a conditional branching point, the expression $e$ determining which branch will be taken evaluates to the same boolean value for both configurations’ memories. When imposed on a relation that already ensures $\mathsf{Low}$ -equivalent memory modulo modes, this effectively disallows any present or past branching on sensitive values. Then, for such programs:

Lemma 5.6.

$\begin{aligned} \dfrac{\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}\ \mathcal{B}\qquad\mathsf{no{\text{-}}high{\text{-}}branching}\ \mathcal{B}}{\mathsf{secure{\text{-}}refinement{\text{-}}decomp}\ \mathcal{B}\ \mathcal{R}_{\mathsf{wr}}\ \mathcal{I}_{\mathsf{wr}}\ \mathsf{abs{\text{-}}steps}_{\mathsf{wr}}}\end{aligned}$

From this it follows immediately via Theorem 3.3 that $\mathcal{R}_{\mathsf{wr}}$ with the help of $\mathcal{I}_{\mathsf{wr}}$ describes a CVDNI-preserving refinement for non- $\mathsf{High}$ -branching While programs:

Corollary 5.7 ( $\mathcal{R}_{\mathsf{wr}}$ is a CVDNI-preserving refinement for non-High-branching programs).

[TABLE]

Finally, we prove that successful compilation produces a RISC program related by $\mathcal{R}_{\mathsf{wr}}$ to its input While program, when started with corresponding and reasonable initial configurations:

Theorem 5.8 (Successful compilations are refinements in $\mathcal{R}_{\mathsf{wr}}$ ).

[TABLE]

6 Case study: the wr-compiler in action

To test the theory, we instantiated it and applied the wr-compiler to a While-language model of the Cross Domain Desktop Compositor [5] (CDDC), a non-trivial concurrent program that facilitates a trusted user’s interaction with multiple desktop machines of differing clearance.

The CDDC model to which we applied the compiler is a 2-thread program that was a precursor to the 3-thread model that was verified using the Covern program logic [20].⁴ We proved that each of the threads of the CDDC program (together about 150 lines of While) satisfies the compositional security property $\mathsf{com{\text{-}}secure}$ (2.2), using a precursor to the Covern logic that yields CVDNI-witness bisimulations that are non- $\mathsf{High}$ -branching.

⁴ We leave for future work an adaptation of the refinement theory and wr-compiler in order to support the shared data invariants added by the Covern logic, required to verify the 3-thread CDDC model.

The resulting compiler is executable in Isabelle, meaning that $\mathsf{compile{\text{-}}cmd}$ can be executed on the While program text for each of the two threads to obtain their compilations (together totalling about 250 RISC instructions) using the Isabelle tactic eval. The secure compilation theorems (Section 5.5.4), together with $\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}$ preservation and compositionality for $\mathsf{com{\text{-}}secure}$ (Theorems 5.1, 3.1 of [22], mentioned in Section 2) then allow us to derive that the compiled program is secure when its threads are run concurrently.

To our knowledge this is the first proof of source-level information-flow security being carried by a verified compiler to an assembly-level model of a non-trivial concurrent program.

7 Related work

The following three works, like ours, focus on compilation preserving a form of noninterference.

Tedesco et al. [29] present a type-directed compilation scheme that preserves a fault-resilient noninterference property. The compilation scheme of our wr-compiler was inspired by theirs. Like the $\mathsf{com{\text{-}}secure}$ CVDNI security property that our wr-compiler preserves, Tedesco et al.’s security property is also strong bisimulation-based [27]. But where our property accounts (via mode states) for controlled interference by other threads, theirs instead quantifies over all possible interference by the environment with the memory contents. While this simplifies their task of proving that their security property is preserved under compilation, as it need not require the compiler to preserve the contents of memory, it means their security property cannot capture value-dependent noninterference. In contrast, our wr-compiler must obey our $\mathsf{secure{\text{-}}refinement}$ notion’s requirement that memory contents are preserved.⁵

⁵ Consequently, we found and fixed a bug in their expression compiler (acknowledged privately) whereby registers in use were incorrectly reallocated. Expressions like $v+(v+1)$ were thus compiled incorrectly to programs yielding $(v+1)+(v+1)$ instead, causing a violation of memory contents preservation.

Barthe et al. [2] consider the problem of preserving cryptographic constant-time policies, a class of noninterference properties similar to CVDNI in its explicit consideration for capturing timing-sensitivity. Barthe et al. consider a wider scope of common categories of compile-time optimisations (than those performed by our wr-compiler), and mechanise proofs in Coq that such optimisations preserve various constant-time security properties. The sharing of variables in our setting severely limits the scope of our optimisations to those that the compiler can perform knowing that a shared variable is stable because it has been locked. At present, our wr-compiler avoids redundant loads during expression compilation, but we are yet to implement other optimisations such as loop hoisting and constant folding. Their preservation proof technique, constant-time simulation, was developed independently of our original cube-shaped secure refinement definition [22]. Like ours, theirs is also a cube-shaped obligation and makes use of a pacing function analogous to our $\mathit{abs{\text{-}}steps}$. Unlike our work here, Barthe et al. do not give a general method for decomposing their cube-shaped simulation diagrams.

Neither of the above considers per-thread compositional compilation of concurrent, shared-memory programs, nor value-dependent noninterference policies, which are the focus of our theory and compiler. Barthe et al. [4] however did aim to preserve noninterference of multithreaded programs by compilation, extending a prior (security) type-preserving compilation approach [3]. Their noninterference property however was termination- and timing-insensitive, so preventing internal timing leaks relied on the scheduler disallowing certain interleavings between threads. Also, their type-preservation argument was derived from a big-step semantics preservation property for their compiler. Here we instead rely on preservation of a small-step semantics (specifically memory contents), which is necessary for us to preserve value-dependent security under compilation, as well as to avoid imposing non-standard requirements on the scheduler.

Other recent works have improved on fully abstract compilation (surveyed in [23]) by mapping out the spectrum [1] or developing specific forms [25] of robust property preservation, concerned with robustness of source program (hyper)properties to concrete adversarial contexts. Like Tedesco et al. [29], these works differ from ours in quantifying over a wider range of hostile interference. They also focus prominently on changes to data types, which we do not support. Thus, since CVDNI is a 2-safety hyperproperty quantifying over a lesser range of interference, we expect CVDNI-preservation to be implied by R2HSP (robust 2-hypersafety preservation), but do not expect it to imply any other secure compilation criterion on Abate et al.’s [1] spectrum.

While recently Patrignani and Garg [25] instantiated their robustly safe compilation for shared-memory fork-join concurrent programs, it only preserves (1-)safety properties. Previously however, Patrignani et al. [24] proved their trace-preserving compilation preserves $k$ -safety hyperproperties [6], including noninterference properties. However, it disallows the removal or addition of trace entries, which would be necessary to change the passage of time as seen in the observable trace events. Thus it excludes optimisations carried out by our compiler (when it permits changes to pacing regulated by $\mathit{abs{\text{-}}steps}$ ) and studied by the two other works [29, 2] on timing-sensitive security-preserving compilation mentioned above.

Finally, there has been much work on large-scale verified compilation [15, 14], some of which has also treated compilation of shared-memory concurrent programs [17], including taking weak-memory consistency into account [26]. Our work here does not consider the effects of weak-memory models. However, it differs from prior work on verified concurrent compilation in that it formalises and proves a compiler’s ability to use information about the application’s locking protocol, to exclude unsafe access to shared variables, and conversely to know when it is safe to allow optimisations that would typically be excluded (see Section 5.4).

8 Conclusion

To our knowledge, we have presented the first mechanised verification that a compiler preserves concurrent, value-dependent noninterference. To this end, we provided a general decomposition principle for compositional, secure refinement. Although our compiler is a proof-of-concept targeting simple source and target languages, we nevertheless applied it to produce a verified assembly-level model of the CDDC [5], a non-trivial concurrent program.

This work serves to demonstrate that verified security-preserving compilation for concurrent programs is now within reach, by augmenting traditional proof obligations for verified compilation (e.g. square-shaped semantics preservation) with those specific to security (e.g. absence of termination- and timing-leaks) as depicted in Figure 4. We hope that this work paves the way for future large-scale verified security-preserving compilation efforts.

Appendix A Informal descriptions of the cases of refinement relation $\mathcal{R}_{\mathsf{wr}}$

A.1 Base cases

• stop: This case relates a terminated While program with a terminated RISC program (i.e. one where the program counter is at the length of the program text).

• skip_nop: This case relates the While program skip with the configuration where the program counter is at the start of the RISC program $[\textbf{Nop}]$.

• assign_expr: This case relates the expression evaluation part (for the expression $e$) of the While program $v{}\mathbin{:=}{}e$ with the corresponding part of the RISC program obtained by compiling it with the wr-compiler.

• assign_store: As for assign_expr, but for the very last Store instruction that commits the result of the expression evaluation back to shared memory variable $v$.

It asserts additionally that $v$ must be stable if lock-governed, and non-lock-governed otherwise. This prevents threads from violating the locking discipline (see Section 5.3).

• lock_acq: This case relates $\textbf{lock}(k)$ with $\textbf{LockAcq}\ k$.

• lock_rel: This case relates $\textbf{unlock}(k)$ with $\textbf{LockRel}\ k$.

A.2 Inductive cases

• seq: This case relates the While program $c_{1}{}\mathbin{;}{}c_{2}$ with the concatenation $P_{1}@P_{2}$ of the RISC programs $P_{1}$ and $P_{2}$ that are respectively the outputs of successful consecutive compilation of $c_{1}$ and $c_{2}$ by the wr-compiler. It is intended for cases where the While (resp. RISC) program is currently in $c_{1}$ (resp. $P_{1}$).

It is an inductive case of $\mathcal{R}_{\mathsf{wr}}$ , in that:

– $c_{1}$ is required to be related by $\mathcal{R}_{\mathsf{wr}}$ to the present location in $P_{1}$.

– For all local configurations that obey the $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ requirements, $c_{2}$ is required to be related by $\mathcal{R}_{\mathsf{wr}}$ to the first instruction of $P_{2}$. This quantification ensures that $\mathcal{R}_{\mathsf{wr}}$ remains closed when execution progresses from the first program to the second program.

It asserts that $P_{1}$ and $P_{2}$ are $\mathsf{joinable}$ (Section D.2), particularly relevant here to ensure that $P_{1}$ can only jump to locations within or at the end of itself (i.e. the start of $P_{2}$ ).

• join: This case relates a While program $c$ with an offset $\mathit{pc}\geq\mathsf{length}\ P_{1}$ into a RISC program $P_{1}@P_{2}$, assuming the inductive hypothesis that $c$ is related by $\mathcal{R}_{\mathsf{wr}}$ with the offset $\mathit{pc}-\mathsf{length}\ P_{1}$ into the RISC program $P_{2}$ alone.

It is intended primarily for cases where the While (resp. RISC) program is currently in the $c_{2}$ (resp. $P_{2}$ ) of some consecutively compiled $c_{1}{}\mathbin{;}{}c_{2}$ (resp. $P_{1}$ concatenated with $P_{2}$ ) but applies more broadly to allow any prepend of dead, unreachable instructions onto the front of a RISC program without breaking $\mathcal{R}_{\mathsf{wr}}$ .

It also asserts that $P_{1}$ and $P_{2}$ are $\mathsf{joinable}$ , which is important here to ensure that $P_{2}$ cannot jump back into $P_{1}$ .

• if_expr: This case relates the expression evaluation part (for the expression $e$) of the While program $\textbf{if}\ e\ \textbf{then}\ c_{1}\ \textbf{else}\ c_{2}\ \textbf{fi}$ with the corresponding part (including the conditional jump Jz at the end of expression evaluation) of the RISC program obtained by compiling it with the wr-compiler.

It relies on both $c_{1}$ and $c_{2}$ being related by $\mathcal{R}_{\mathsf{wr}}$ to their compiled RISC counterparts when started with initialisation states judged valid by $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$.

• if_c1: This case relates some While program $c_{1}^{\prime}$ reachable from $c_{1}$ with the corresponding part within the $c_{1}$ part of the RISC program obtained by compiling $\textbf{if}\ e\ \textbf{then}\ c_{1}\ \textbf{else}\ c_{2}\ \textbf{fi}$ with the wr-compiler.

It relies on $c_{1}$ being related by $\mathcal{R}_{\mathsf{wr}}$ to its compiled RISC counterpart at the appropriate program counter offset.

• if_c2: As for if_c1, but for $c_{2}$.

• epilogue_step: This case relates a terminated While program to the silent control flow steps navigating to the end of a RISC program from the end of the “then” and “else” branches of a compiled if-conditional.

It works only for epilogue step forms (see Section 5.5.2).

It is inductive in that it asserts closedness of $\mathcal{R}_{\mathsf{wr}}$ over pairwise reachability from the pair currently under consideration – the only case to do so directly.

• while_expr: This case relates the While program ($\textbf{while}\ e\ \textbf{do}\ c\ \textbf{od}$)’s initial intermediate step to $\textbf{if}\ e\ \textbf{then}\ (c{}\mathbin{;}{}\ \textbf{while}\ e\ \textbf{do}\ c\ \textbf{od})\ \textbf{else}\ \textbf{stop}\ \textbf{fi}$, and its expression evaluation part, with the expression evaluation and conditional jump of the RISC program that $\textbf{while}\ e\ \textbf{do}\ c\ \textbf{od}$ was compiled to by $\mathsf{compile{\text{-}}cmd}$.

It relies on $c$ being related by $\mathcal{R}_{\mathsf{wr}}$ to its compiled RISC counterpart when started with initialisation states judged valid by $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ .

• while_inner: This case relates some program $c_{I}{}\mathbin{;}{}\textbf{while}\ e\ \textbf{do}\ c\ \textbf{od}$ reachable from $c{}\mathbin{;}{}\textbf{while}\ e\ \textbf{do}\ c\ \textbf{od}$ to the loop body part of the RISC program compiled from $\textbf{while}\ e\ \textbf{do}\ c\ \textbf{od}$.

It relies on $c_{I}$ being related by $\mathcal{R}_{\mathsf{wr}}$ to its compiled RISC counterpart at the appropriate program counter offset.

It also carries around the same reliance on $c$ being related by $\mathcal{R}_{\mathsf{wr}}$ to its compiled RISC counterpart for all initialisation states judged valid by $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ .

• while_loop: This case handles epilogue steps for the inner loop body program, and the final jump back to the beginning of the While-loop.

It requires $\mathcal{R}_{\mathsf{wr}}$ to relate the terminated While program to the end of the compiled loop body, and furthermore also carries around the same reliance on $c$ being related by $\mathcal{R}_{\mathsf{wr}}$ to its compiled RISC counterpart for all initialisation states judged valid by $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ .

Appendix B Code listing for the case of $\mathsf{compile{\text{-}}cmd}$ for if-conditionals

This code listing has been adapted slightly to improve the clarity of the presentation. $\Phi\sqcap_{R}\Phi^{\prime}$ denotes the subset of mappings on which $\Phi$ and $\Phi^{\prime}$ agree.

    compile_cmd C l nl (If e c1 c2) =
    (let (Pe, r, C1, faile) = (compile_expr C {} l e);
       (br, nl') = (nl, Suc nl); (ex, nl'') = (nl', Suc nl');
       (P1, l1, nl1, C2, fail1) = (compile_cmd C1 None nl'' c1);
       (P2, l2, nl2, C3, fail2) = (compile_cmd C1 (Some br) nl1 c2);
       (* Pre-compilation check ensures asmrec C2 = asmrec C3 *)
       C' = (regrec C2 $\sqcap_{R}$ regrec C3, asmrec C2)
    in (Pe @ [((if Pe = [] then l else None, Jz br r), C1)] @
        P1 @ [((l1, Jmp ex), C2)] @ P2 @ [((l2, Nop'), C3)],
        Some ex, nl2, C', faile $\lor$ fail1 $\lor$ fail2))
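
As an illustrative aside, the agreement operator $\Phi\sqcap_{R}\Phi^{\prime}$ used in this listing to merge the register records of the two branches can be pictured with a small Python sketch. It assumes a register record is modelled as a dictionary from register names to the expression each register is recorded as holding. That representation, the name `regrec_meet`, and the sample data are assumptions made for illustration and are not taken from the Isabelle development.

```python
from typing import Dict, Hashable


def regrec_meet(phi1: Dict[str, Hashable], phi2: Dict[str, Hashable]) -> Dict[str, Hashable]:
    """Keep only the mappings on which both register records agree."""
    return {reg: expr for reg, expr in phi1.items()
            if reg in phi2 and phi2[reg] == expr}


# Hypothetical register records at the end of the "then" and "else" branches.
after_then = {"r0": "v", "r1": "w"}
after_else = {"r0": "v", "r1": "x", "r2": "y"}
merged = regrec_meet(after_then, after_else)
print(merged)  # {'r0': 'v'}
```

Only `r0` survives, since it is the only register that both branches record as holding the same contents. A record that keeps fewer entries claims less, which is why taking the agreement is a conservative choice at a control-flow join.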

Appendix C Proof sketch for decomposition principle soundness result

Theorem C.1 (Soundness of $\mathsf{secure{\text{-}}refinement{\text{-}}decomp}$ ).

[TABLE]

Proof C.2.

The only obligation for $\mathsf{secure{\text{-}}refinement}$ (Definition 2.6) not obtained immediately from $\mathsf{secure{\text{-}}refinement{\text{-}}decomp}$ (Definition 3.1) is the cube-shaped $\mathsf{coupling{\text{-}}inv{\text{-}}pres}$ (Figure 1).

The front face of the cube is just ordinary square-shaped refinement preservation (depicted in Figure 4(a)), given to us by $\mathsf{secure{\text{-}}refinement{\text{-}}decomp}$. This gives us that a single concrete step from $\mathit{lc}_{1C}$ is simulated by $n$ abstract steps from $\mathit{lc}_{1A}$, where $n$ is given by $\mathit{abs{\text{-}}steps}$.

We are then obliged to prove a simulation in the other direction (the back face of the cube), that $n$ abstract steps from all configurations $\mathit{lc}_{2A}$ related by $\mathcal{B}$ to $\mathit{lc}_{1A}$ are simulated by some concrete step from $\mathit{lc}_{2C}$ related by $\mathcal{R}$ to $\mathit{lc}_{2A}$ and by $\mathcal{I}$ to $\mathit{lc}_{1C}$.

Here, we lean on the determinism of the abstract program’s evaluation semantics (required by the theory) to flip the direction of simulation, knowing that $n$ abstract steps from $\mathit{lc}_{2A}$, simulating a single concrete step from $\mathit{lc}_{2C}$, could only be the very same $n$ abstract steps from $\mathit{lc}_{2A}$ that we were required to consider. This allows us to use once again the square-shaped refinement preservation (Figure 4(a)) given to us by $\mathsf{secure{\text{-}}refinement{\text{-}}decomp}$.

Consistency of refinement pacing and stopping behaviour (depicted in Figure 4(b)) given by $\mathsf{decomp{\text{-}}refinement{\text{-}}safe}$ (Definition 3.2) then respectively ensure that $n$ (via $\mathit{abs{\text{-}}steps}$) is the correct number of abstract steps to consider, and that there will indeed be a concrete step from $\mathit{lc}_{2C}$ to drive the matching simulation step.

Finally, the remainder of $\mathsf{decomp{\text{-}}refinement{\text{-}}safe}$ (depicted in Figure 4(c)) discharges the requirement of closedness and modes-equality maintenance of $\mathcal{I}$ under lockstep execution, demanded by the bottom face of the cube.

Appendix D More details on the Covern wr-compiler

D.1 Register allocation scheme model

We model the (user-supplied) register allocation scheme by two functions $reg\_alloc$ and $reg\_alloc\_cached$ on the register record $\Phi$ (see Section 5.4) and the set $A$ of registers whose contents are needed to evaluate the current expression. In order to avoid loading from memory unnecessarily, the compiler may first call $reg\_alloc\_cached\ \Phi\ A\ v$ to identify a register that $\Phi$ records as already containing the variable $v$ . When the compiler needs a fresh register, it will call $reg\_alloc\ \Phi\ A$ . Neither function is allowed to allocate a register in $A$ , so the allocator is permitted to fail if it cannot find any suitable register. As mentioned in Section 5.2 we assume there are enough registers to service the expressions in the source program. Also, registers typically become available again as expression evaluation is resolved.
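
The interface described above can be sketched in Python as follows. This is a minimal model under stated assumptions, not the user-supplied scheme itself: the four-register file `REGISTERS`, the dictionary encoding of $\Phi$, and the policy of preferring registers with no recorded contents are all hypothetical choices made for illustration. Only the two function names and the rule that no register in $A$ may be allocated come from the text.

```python
from typing import Dict, Optional, Set

REGISTERS = ["r0", "r1", "r2", "r3"]  # hypothetical finite register file


def reg_alloc_cached(phi: Dict[str, str], in_use: Set[str], var: str) -> Optional[str]:
    """Return a register outside in_use that phi records as holding var, if any."""
    for reg in REGISTERS:
        if reg not in in_use and phi.get(reg) == var:
            return reg
    return None


def reg_alloc(phi: Dict[str, str], in_use: Set[str]) -> Optional[str]:
    """Return a register outside in_use, or None when the allocator fails."""
    candidates = [reg for reg in REGISTERS if reg not in in_use]
    # Assumed policy: prefer registers with no recorded contents, so that
    # cached values survive for later reg_alloc_cached calls.
    for reg in candidates:
        if reg not in phi:
            return reg
    return candidates[0] if candidates else None


phi = {"r0": "v"}                            # r0 is recorded as holding variable v
print(reg_alloc_cached(phi, set(), "v"))     # r0
print(reg_alloc(phi, {"r1"}))                # r2
print(reg_alloc(phi, set(REGISTERS)))        # None
```

The last call shows the permitted failure: when every register is in $A$, no allocation is possible, which is why the compiler's result carries a failure flag.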

D.2 Label allocation and sequential composability

For allocating natural numbers to use as labels for RISC instructions the wr-compiler ensures freshness merely by using the highest number reached so far on a “next label” counter ( $\mathit{nl}$ in the invocation example (1)), incrementing the counter before passing it along to subsequent calls, and outputting the next available unused label on return (as $\mathit{nl}^{\prime}$ in the example).
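
The counter-threading scheme can be illustrated with a short Python sketch. It mirrors the pattern `(br, nl') = (nl, Suc nl); (ex, nl'') = (nl', Suc nl')` from the Appendix B listing; the function names `fresh_label` and `labels_for_if` are hypothetical and exist only for this example.

```python
from typing import Tuple


def fresh_label(nl: int) -> Tuple[int, int]:
    """Return (label to use now, next unused label)."""
    return nl, nl + 1


def labels_for_if(nl: int) -> Tuple[int, int, int]:
    """Allocate the branch and exit labels of an if-conditional."""
    br, nl1 = fresh_label(nl)    # label of the "else" branch target
    ex, nl2 = fresh_label(nl1)   # label of the exit point
    return br, ex, nl2           # nl2 is passed on to subsequent compilation


print(labels_for_if(7))  # (7, 8, 9)
```

Because every call consumes the counter it was given and hands back a strictly larger one, two labels allocated along the same chain of calls can never coincide.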

We define two RISC programs $P_{1},P_{2}$ to be $\mathsf{joinable}$ if they are both:

• $\mathsf{joinable{\text{-}}forward}$: $P_{1}$ only ever jumps to labels that are either

– labelling an instruction in $P_{1}$ itself, or

– the label of the very first instruction in $P_{2}$.

• $\mathsf{joinable{\text{-}}backward}$: $P_{2}$ does not jump to any of the labels of instructions in $P_{1}$.

We prove a lemma that says that two RISC programs that were compiled by the wr-compiler consecutively—in the sense that the relevant outputs from the first call are fed directly into the second call—are $\mathsf{joinable}$ .
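
The two conditions making up $\mathsf{joinable}$ can be checked mechanically on a toy program representation, as in the following Python sketch. The encoding is an assumption: a program is a list of pairs of an optional label and an instruction tuple, and only `Jmp` and `Jz` are treated as jumps, with the target label in position 1 (following the argument order `Jz br r` and `Jmp ex` of the Appendix B listing).

```python
from typing import List, Optional, Set, Tuple

Instr = Tuple  # e.g. ("Jmp", 3), ("Jz", 3, "r0"), ("Nop",)
Program = List[Tuple[Optional[int], Instr]]  # (optional label, instruction)


def labels(prog: Program) -> Set[int]:
    """All labels attached to instructions of prog."""
    return {label for label, _ in prog if label is not None}


def jump_targets(prog: Program) -> Set[int]:
    """All labels that prog may jump to."""
    return {instr[1] for _, instr in prog if instr[0] in ("Jmp", "Jz")}


def joinable_forward(p1: Program, p2: Program) -> bool:
    """p1 jumps only into itself or to the first instruction of p2."""
    allowed = labels(p1)
    if p2 and p2[0][0] is not None:
        allowed = allowed | {p2[0][0]}
    return jump_targets(p1) <= allowed


def joinable_backward(p1: Program, p2: Program) -> bool:
    """p2 never jumps to a label of p1."""
    return jump_targets(p2).isdisjoint(labels(p1))


def joinable(p1: Program, p2: Program) -> bool:
    return joinable_forward(p1, p2) and joinable_backward(p1, p2)


p1 = [(None, ("Jz", 1, "r0")), (None, ("Nop",))]
p2 = [(1, ("Nop",)), (2, ("Jmp", 2))]
print(joinable(p1, p2))  # True
```

Here `p1` jumps only to label 1, which is the first instruction of `p2`, and `p2` jumps only within itself, so both conditions hold.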

D.3 More detail on $\mathsf{compile{\text{-}}cmd{\text{-}}input{\text{-}}reqs}$ and the wr-compiler proofs

The first two requirements of the predicate $\mathsf{compile{\text{-}}cmd{\text{-}}input{\text{-}}reqs}\ C\ l\ \mathit{nl}\ c$ were given in Section 5.4. Its other two requirements reflect that the terminated While program stop has no valid compilation, and that the initial label (if provided) must be valid (see Section D.2 for more information on label allocation).

Definition D.1 (Requirements on inputs to $\mathsf{compile{\text{-}}cmd}$ ).

[TABLE]

These input conditions give us enough information to prove that every instruction of a $CompRec$ -annotated RISC program output by a successful run of $\mathsf{compile{\text{-}}cmd}$ is annotated by a stable register record, and that the output $CompRec$ ’s register record is also stable:

Lemma D.2 (Successful compilations output only stable register records).

[TABLE]

Proof D.3.

By induction on the structure of the While language program $c$, making reference to the implementation of $\mathsf{compile{\text{-}}cmd}$. For cases that must compile expressions, we furthermore prove and make use of a lemma by induction on the structure of expressions, making reference to the implementation of the expression compiler function $\mathsf{compile{\text{-}}expr}$ called by $\mathsf{compile{\text{-}}cmd}$. In essence, we prove that (sub)expressions that appear in register records must be stable, for two reasons. Firstly, they are always only ever subexpressions over variables that must have been stable in the input program when their contents were first loaded into registers. Furthermore, when compiling an $\textbf{unlock}()$, the wr-compiler will always flush all register records that make reference to any variables that the $\textbf{unlock}()$ makes unstable.

Before proceeding, we name the parts of $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ more explicitly:

Definition D.4 (Configuration consistency requirements for compiled commands).

[TABLE]

Definition D.5 (Consistency between a register record, register bank, and shared memory).

[TABLE]

Definition D.6 (Consistency between an assumption record and a mode state).

[TABLE]

Lemma D.7 ( $\mathcal{R}_{\mathsf{wr}}$ preserves modes and memory).

$\mathsf{preserves{\text{-}}modes{\text{-}}mem}\ \mathcal{R}_{\mathsf{wr}}$

Proof D.8.

By induction on the structure of $\mathcal{R}_{\mathsf{wr}}$ . For all cases of $(\mathit{lc}_{w},\mathit{lc}_{r})\in\mathcal{R}_{\mathsf{wr}}$ , $\mathit{lc}_{w}=_{\mathsf{mds}}^{\mathsf{mem}}\mathit{lc}_{r}$ is either asserted directly by the guards or obtainable from the inductive hypothesis.

Lemma D.9 ( $\mathcal{R}_{\mathsf{wr}}$ is closed under changes by others).

$\mathsf{closed{\text{-}}others}\ \mathcal{R}_{\mathsf{wr}}$

Proof D.10.

By induction on the structure of $\mathcal{R}_{\mathsf{wr}}$. Changes by others (Definition 2.5) only modify $\mathsf{writable}$ variables the same way for both configurations, so preservation of $=_{\mathsf{mds}}^{\mathsf{mem}}$ is immediate. Also, $\mathsf{regrec{\text{-}}mem{\text{-}}consistent}$ is unaffected because $\mathsf{compile{\text{-}}cmd}$ only creates $\mathsf{regrec{\text{-}}stable}$ records (referring to no $\mathsf{writable}$ variables). No other $\mathcal{R}_{\mathsf{wr}}$ guards mention shared memory.

Lemma D.11 (Successfully compiled programs maintain config consistency requirements).

[TABLE]

Proof D.12.

We in fact prove it separately for $\mathsf{regrec{\text{-}}mem{\text{-}}consistent}$ and $\mathsf{asmrec{\text{-}}mds{\text{-}}consistent}$ , in both cases by induction on the structure of the While program $c$ . In each case, we use the simplifiers for the $\mathsf{compile{\text{-}}cmd}$ implementation to yield the corresponding RISC program fragment in question, and then prove the lemma for each of the possible locations of $\mathit{pc}$ in the compiled program. For both proofs, there is some trickiness in accounting for (and ruling out) which destination $\mathit{pc}^{\prime}$ must be considered for each of these cases of $\mathit{pc}$ , particularly for those While programs that compile to RISC programs that may have jumps in them.

Control flow trickiness aside, the intuition for $\mathsf{regrec{\text{-}}mem{\text{-}}consistent}$ is that it tests the correctness of the compilation of expressions, and so for this we must prove a sub-lemma for maintenance of $\mathsf{compiled{\text{-}}cmd{\text{-}}config{\text{-}}consistent}$ by induction on the structure of expressions $e$ that are encountered in the While programs $\textbf{if}\ e\ \textbf{then}\ c_{1}\ \textbf{else}\ c_{2}\ \textbf{fi},\ \textbf{while}\ e\ \textbf{do}\ c^{\prime}\ \textbf{od},\ v{}\mathbin{:=}{}e$ . Additionally, $\textbf{unlock}()$ flushes register record entries mentioning variables that are to become unstable, and $\textbf{while}\ e\ \textbf{do}\ c^{\prime}\ \textbf{od}$ conservatively flushes entries to force evaluation of the loop condition expression. This is safe trivially because flushing entries can never make a consistent register record inconsistent. The rest of the cases for $c$ are straightforward because they do not touch the register record.

Then for $\mathsf{asmrec{\text{-}}mds{\text{-}}consistent}$, the substantial part of the proof tests that the compiler’s bookkeeping of assumptions is consistent with the semantics of $\textbf{lock}()$ and $\textbf{unlock}()$. The other cases for $c$ do not touch the mode state.
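
The flushing argument in Proof D.12 (removing entries can never make a consistent register record inconsistent) can be illustrated with a small Python model. Everything in it is a simplifying assumption: expressions are variable names, integers, or `("+", lhs, rhs)` tuples, and `regrec_mem_consistent` here is a toy stand-in for the predicate of the same name, requiring only that each recorded register holds the value of its recorded expression in memory.

```python
from typing import Dict, Set, Tuple, Union

Expr = Union[str, int, Tuple[str, "Expr", "Expr"]]


def vars_of(e: Expr) -> Set[str]:
    """Variables mentioned by an expression."""
    if isinstance(e, str):
        return {e}
    if isinstance(e, int):
        return set()
    _, lhs, rhs = e
    return vars_of(lhs) | vars_of(rhs)


def evaluate(e: Expr, mem: Dict[str, int]) -> int:
    """Evaluate an expression against shared memory (addition only)."""
    if isinstance(e, str):
        return mem[e]
    if isinstance(e, int):
        return e
    _, lhs, rhs = e
    return evaluate(lhs, mem) + evaluate(rhs, mem)


def regrec_mem_consistent(phi: Dict[str, Expr], regs: Dict[str, int], mem: Dict[str, int]) -> bool:
    """Toy consistency check: every recorded register holds its expression's value."""
    return all(regs[r] == evaluate(e, mem) for r, e in phi.items())


def flush(phi: Dict[str, Expr], unstable: Set[str]) -> Dict[str, Expr]:
    """Drop every entry that mentions a variable about to become unstable."""
    return {r: e for r, e in phi.items() if vars_of(e).isdisjoint(unstable)}


mem = {"x": 2, "y": 5}
regs = {"r0": 2, "r1": 7}
phi = {"r0": "x", "r1": ("+", "x", "y")}
flushed = flush(phi, {"y"})   # models unlocking the lock that governs y
mem["y"] = 9                  # another thread may now write y
print(regrec_mem_consistent(flushed, regs, mem))  # True
print(regrec_mem_consistent(phi, regs, mem))      # False
```

The unflushed record becomes stale as soon as `y` changes, whereas the flushed record makes no claim about `y` and so stays consistent.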

Lemma D.13 (Correctness of the expression compiler).

[TABLE]

Proof D.14.

By induction on the structure of expressions $e$ , using the simplification rules for the implementation of $\mathsf{compile{\text{-}}expr}$ , and also relying on assumptions of correctness of the register allocation scheme supplied by the instantiator of the theory.

Lemma D.15 ( $\mathcal{R}_{\mathsf{wr}}$ is a refinement paced by $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ ).

[TABLE]

Proof D.16.

By induction on the structure of $\mathcal{R}_{\mathsf{wr}}$ .

The base case stop is immediate, because it pertains to a terminated While and RISC program. The base cases that proceed in one step to a terminating program configuration (skip_nop, assign_store, lock_acq, lock_rel) are fairly straightforward because after dealing with the single step, the resulting obligation can then be handled by the stop case. This leaves the last remaining base case assign_expr, which proceeds in one step either to itself, or to assign_store. In all of these cases, we use Lemma D.11 to obtain the preservation of the guards demanded by the $\mathcal{R}_{\mathsf{wr}}$ introduction rule for the destination configuration of the step. Particularly, the assign_store case must make use of $\mathsf{regrec{\text{-}}mem{\text{-}}consistent}$ and the correctness of $\mathsf{compile{\text{-}}expr}$ (Lemma D.13) in order to ensure that once the expression evaluation result is written back to shared memory, $\mathit{lc}_{w}^{\prime}=_{\mathsf{mds}}^{\mathsf{mem}}\mathit{lc}_{r}^{\prime}$ holds as demanded by the stop case.

The inductive cases that concern expression evaluation (if_expr, while_expr) are much like assign_expr in that they have the possibility of progressing in one step to themselves. Unlike assign_expr however, their other possibility is a conditional jump based on the result of that expression. Again we use Lemma D.13 to obtain that the result is an accurate calculation of the expression, and this time we prove by the two different cases whether if_expr ends up in if_c1 or if_c2, or if while_expr ends up in while_inner or at stop (having jumped to the exit label). In these cases, the guards over which the inductive references to $\mathcal{R}_{\mathsf{wr}}$ have been quantified are versatile enough to discharge themselves (when _expr steps to itself), or to discharge any reachable initial starting state for the nested compiled RISC program, given that Lemma D.11 ensures the invariance of these guards.

This just leaves the inductive cases that pertain to configurations inside a nested compiled RISC program (if_c1, if_c2, while_inner), or at the end of one (epilogue_step, while_loop). In these cases, the inductive hypotheses obtained from the inductive reference to $\mathcal{R}_{\mathsf{wr}}$ are always enough to satisfy the guards demanded by the possible destination cases. As in the proof of Lemma D.11, the trickiness mostly comes from accounting for all the possible cases of control flow (ruling out spurious destinations) that need to be considered.

Lemma D.17.

$\begin{aligned} \inferrule{\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}\ \mathcal{B}\quad\mathsf{no{\text{-}}high{\text{-}}branching}\ \mathcal{B}}{\mathsf{decomp{\text{-}}refinement{\text{-}}safe}\ \mathcal{B}\ \mathcal{R}_{\mathsf{wr}}\ \mathcal{I}_{\mathsf{wr}}\ \mathsf{abs{\text{-}}steps}_{\mathsf{wr}}}\end{aligned}$

Proof D.18.

Definition 3.2 gives us the following obligations.

For consistent stopping behaviour, we prove a lemma that RISC programs stop if and only if their $\mathit{pc}$ is outside the program text $P$, i.e. $\mathit{pc}\geq\mathsf{length}\ P$. Because $\mathcal{I}_{\mathsf{wr}}$ equates $\mathit{pc}$ and $P$ for the two configurations, both clearly have identical stopping behaviour.
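
The stopping argument can be made concrete with a tiny Python model. It assumes zero-based program counters, so that a counter equal to the program length is already outside the text (matching the stop case of Appendix A, where a terminated RISC program has its program counter at the length of the program text). The class `RiscConfig` and the functions `stopped` and `coupled` are hypothetical names, and `coupled` models only the part of $\mathcal{I}_{\mathsf{wr}}$ that this paragraph relies on.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class RiscConfig:
    pc: int                 # zero-based program counter (assumed convention)
    prog: Tuple[str, ...]   # program text, instructions abbreviated to names


def stopped(cfg: RiscConfig) -> bool:
    """A configuration has stopped once its pc is outside the program text."""
    return cfg.pc >= len(cfg.prog)


def coupled(c1: RiscConfig, c2: RiscConfig) -> bool:
    """Toy coupling invariant: equal program text and equal location."""
    return c1.pc == c2.pc and c1.prog == c2.prog


prog = ("Load", "Add", "Store")
print(stopped(RiscConfig(2, prog)))  # False
print(stopped(RiscConfig(3, prog)))  # True
```

Since `stopped` reads nothing but the two fields that `coupled` equates, any two coupled configurations necessarily agree on whether they have stopped.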

For consistency of change in timing behaviour, $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ depends only on While and RISC program locations, and $\mathsf{no{\text{-}}high{\text{-}}branching}$ and $\mathcal{I}_{\mathsf{wr}}$ force them (resp.) to be equal for the local configurations under consideration.

For closedness of $\mathcal{I}_{\mathsf{wr}}$ under lockstep execution, the only non-straightforward cases to consider are conditional branching, and the locking primitives. For conditional branching, we use $\mathsf{no{\text{-}}high{\text{-}}branching}$ for $\mathcal{B}$ with memory preservation via $\mathcal{R}_{\mathsf{wr}}$ (Lemma 5.2) to ensure that the conditional branching outcome is the same on both sides.

Finally, as the only operations that touch mode state, the locking primitives are the only non-straightforward cases for mode state equality maintenance under lockstep execution. As all lock memory is classified $\mathsf{Low}$ (see Section 5.3), we use $\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}$ for $\mathcal{B}$ with memory preservation via $\mathcal{R}_{\mathsf{wr}}$ to ensure the RISC configurations behave consistently.

Lemma D.19.

$\begin{aligned} \inferrule{\mathsf{strong{\text{-}}low{\text{-}}bisim{\text{-}}mm}\ \mathcal{B}\quad\mathsf{no{\text{-}}high{\text{-}}branching}\ \mathcal{B}}{\mathsf{secure{\text{-}}refinement{\text{-}}decomp}\ \mathcal{B}\ \mathcal{R}_{\mathsf{wr}}\ \mathcal{I}_{\mathsf{wr}}\ \mathsf{abs{\text{-}}steps}_{\mathsf{wr}}}\end{aligned}$

Proof D.20.

Referring to Definition 3.1, the obligations pertaining only to $\mathcal{R}_{\mathsf{wr}}$ and $\mathsf{abs{\text{-}}steps}_{\mathsf{wr}}$ are discharged by Lemmas 5.5, 5.3, and 5.2. Pertaining to $\mathcal{I}_{\mathsf{wr}}$: clearly $\mathcal{I}_{\mathsf{wr}}$ is symmetric, and furthermore it is $\mathsf{cg{\text{-}}consistent}$ (Definition 2.3) because the actions over which $\mathcal{I}_{\mathsf{wr}}$ must be closed modify only the shared memory, and $\mathcal{I}_{\mathsf{wr}}$ places restrictions only on the program text and current location. The final obligation is discharged by Lemma D.17.

Theorem D.21 (Successful compilations are refinements in $\mathcal{R}_{\mathsf{wr}}$ ).

[TABLE]

Proof D.22.

By induction on the structure of While. The compiler input and initial configuration conditions we impose allow us to have each of skip, $\mathit{cmd}{}\mathbin{;}{}\mathit{cmd}$ , $\textbf{if}\ exp\ \textbf{then}\ \mathit{cmd}\ \textbf{else}\ \mathit{cmd}\ \textbf{fi}$ , $\textbf{while}\ exp\ \textbf{do}\ \mathit{cmd}\ \textbf{od}$ , $v{}\mathbin{:=}{}exp$ , $\textbf{lock}(k)$ , and $\textbf{unlock}(k)$ and their compiled output meet the guards of the introduction rules for the cases skip, seq, if_expr, while_expr, assign_expr, lock_acq, and lock_rel of $\mathcal{R}_{\mathsf{wr}}$ that were designed for them respectively.

Bibliography

[1] Carmine Abate, Roberto Blanco, Deepak Garg, Catalin Hritcu, Marco Patrignani, and Jérémy Thibault. Exploring robust property preservation for secure compilation. CoRR, abs/1807.04603, 2018. URL: http://arxiv.org/abs/1807.04603.
[2] G. Barthe, B. Grégoire, and V. Laporte. Secure compilation of side-channel countermeasures: The case of cryptographic “constant-time”. In 2018 IEEE 31st Computer Security Foundations Symposium (CSF), pages 328–343, July 2018.
[3] Gilles Barthe, Tamara Rezk, and Amitabh Basu. Security types preserving compilation. Comput. Lang. Syst. Struct., 33(2):35–59, July 2007. URL: http://dx.doi.org/10.1016/j.cl.2005.05.002.
[4] Gilles Barthe, Tamara Rezk, Alejandro Russo, and Andrei Sabelfeld. Security of multithreaded programs by compilation. ACM Trans. Inf. Syst. Secur., 13(3):21:1–21:32, July 2010. URL: http://doi.acm.org/10.1145/1805974.1805977.
[5] Mark Beaumont, Jim McCarthy, and Toby Murray. The cross domain desktop compositor: Using hardware-based video compositing for a multi-level secure user interface. In Annual Computer Security Applications Conference (ACSAC), pages 533–545, 2016.
[6] Michael R. Clarkson and Fred B. Schneider. Hyperproperties. J. Comput. Secur., 18(6):1157–1210, September 2010. URL: http://dl.acm.org/citation.cfm?id=1891823.1891830.
[7] Florian Dewald, Heiko Mantel, and Alexandra Weber. AVR processors as a platform for language-based security. In Computer Security - ESORICS 2017 - 22nd European Symposium on Research in Computer Security, Oslo, Norway, September 11-15, 2017, Proceedings, Part I, pages 427–445, 2017. URL: https://doi.org/10.1007/978-3-319-66402-6_25.
[8] Qian Ge, Yuval Yarom, Tom Chothia, and Gernot Heiser. Time protection: the missing OS abstraction. In EuroSys 19, Dresden, Germany, March 2019. ACM.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Acknowledgements.
