Information Structure Design in Team Decision Problems

Tyler Summers; Changyuan Li; Maryam Kamgarpour

arXiv:1706.05572·math.OC·June 20, 2017

TL;DR

This paper introduces scalable greedy algorithms for designing information structures in team decision problems, aiming to optimize performance and resilience against adversarial agents, despite the lack of supermodularity.

Contribution

It proposes simple greedy algorithms for information structure design in team problems and demonstrates their practical effectiveness through numerical experiments.

Findings

1. Greedy algorithms can effectively improve team performance.
2. The set function for information links is not supermodular.
3. Numerical results show near-optimal performance of the proposed methods.

Abstract

We consider a problem of information structure design in team decision problems and team games. We propose simple, scalable greedy algorithms for adding a set of extra information links to optimize team performance and resilience to non-cooperative and adversarial agents. We show via a simple counterexample that the set function mapping additional information links to team performance is in general not supermodular. Although this implies that the greedy algorithm is not accompanied by worst-case performance guarantees, we illustrate through numerical experiments that it can produce effective and often optimal or near optimal information structure modifications.

Equations

$z_{i}=H_{i}x+w_{i},\quad i=1,\dots,N$

$S_{0}=\{(H_{1},R_{1}),(H_{2},R_{2}),\dots,(H_{N},R_{N})\}$

$\bar{J}(u)=u^{T}Qx+\frac{1}{2}u^{T}Pu$

$Q=\begin{bmatrix}Q_{1}\\ Q_{2}\\ \vdots\\ Q_{N}\end{bmatrix},\quad P=\begin{bmatrix}P_{11}&P_{12}&\cdots&P_{1N}\\ P_{12}^{T}&P_{22}&\cdots&P_{2N}\\ \vdots&\vdots&\ddots&\vdots\\ P_{1N}^{T}&P_{2N}^{T}&\cdots&P_{NN}\end{bmatrix}$

$J(\gamma,S_{0})=\mathbf{E}_{x}\left(u^{T}Qx+u^{T}Pu\right),\quad u_{i}=\gamma_{i}(z_{i}(x))$

$J^{*}(S_{0})=\min_{\gamma}J(\gamma,S_{0}),\quad \gamma^{*}=\arg\min_{\gamma}J(\gamma,S_{0})$

$\hat{x}_{i}=\bar{x}+XH_{i}^{T}(H_{i}XH_{i}^{T}+R_{i})^{-1}(z_{i}-H_{i}\bar{x})$

$u_{i}=\gamma_{i}(z_{i})=A_{i}\bar{x}+B_{i}(\hat{x}_{i}-\bar{x})$

$P_{ii}A_{i}+\sum_{j\neq i}P_{ij}A_{j}$

$P_{ii}B_{i}+\sum_{j\neq i}P_{ij}B_{j}XH_{j}^{T}(H_{j}XH_{j}^{T}+R_{j})^{-1}H_{j}$

$V=\{(h_{11},r_{11}),(h_{12},r_{12}),\dots,(h_{1q_{1}},r_{1q_{1}}),\ (h_{21},r_{21}),(h_{22},r_{22}),\dots,(h_{2q_{2}},r_{2q_{2}}),\ \dots,\ (h_{N1},r_{N1}),(h_{N2},r_{N2}),\dots,(h_{Nq_{N}},r_{Nq_{N}})\}$

$S=\{(h_{13},r_{13}),(h_{32},r_{32}),(h_{43},r_{43}),(h_{45},r_{45})\}\subset V$

$z_{1}$

$z_{4}$

$\min_{S\subset V,\ |S|=k}J^{*}(S)$

$z_{i}=H_{i}x+w_{i},\quad i=1,\dots,N$

$y_{j}=G_{j}x+t_{j},\quad j=1,\dots,M$

$S_{0}=\{(H_{1},R_{1}),(H_{2},R_{2}),\dots,(H_{N},R_{N})\}$

$T_{0}=\{(G_{1},T_{1}),(G_{2},T_{2}),\dots,(G_{M},T_{M})\}$

$\bar{J}^{1}(u,v)=u^{T}Q^{1}x+\frac{1}{2}\left(u^{T}P^{1}u+v^{T}R^{1}u\right)$

$\bar{J}^{2}(u,v)=v^{T}Q^{2}x+\frac{1}{2}\left(v^{T}P^{2}v+v^{T}R^{2}u\right)$

$J^{1}(S_{0},T_{0},\gamma,\lambda)=\mathbf{E}_{x}\left(u^{T}Q^{1}x+u^{T}P^{1}u+2v^{T}R^{1}u\right)$

$J^{2}(S_{0},T_{0},\gamma,\lambda)=\mathbf{E}_{x}\left(v^{T}Q^{2}x+v^{T}P^{2}v+2v^{T}R^{2}u\right)$

with $u_{i}=\gamma_{i}(z_{i}(x))$, $v_{j}=\lambda_{j}(y_{j}(x))$

$\gamma^{*}\in\arg\min_{\gamma}J^{1}(S_{0},T_{0},\gamma,\lambda^{*})$

$\lambda^{*}\in\arg\min_{\lambda}J^{2}(S_{0},T_{0},\gamma^{*},\lambda)$

$J^{1*}(S_{0},T_{0})=J^{1}(S_{0},T_{0},\gamma^{*},\lambda^{*})$

$J^{2*}(S_{0},T_{0})=J^{2}(S_{0},T_{0},\gamma^{*},\lambda^{*})$

$\hat{x}_{i}^{1}=\bar{x}+XH_{i}^{T}(H_{i}XH_{i}^{T}+R_{i})^{-1}(z_{i}-H_{i}\bar{x})$

$\hat{x}_{j}^{2}=\bar{x}+XG_{j}^{T}(G_{j}XG_{j}^{T}+T_{j})^{-1}(y_{j}-G_{j}\bar{x})$

Full text

Information Structure Design in

Team Decision Problems

Tyler Summers*†*

Changyuan Li*†*

Maryam Kamgarpour*‡*

*†*University of Texas at Dallas *‡*ETH Zürich

Abstract

We consider a problem of information structure design in team decision problems and team games. We propose simple, scalable greedy algorithms for adding a set of extra information links to optimize team performance and resilience to non-cooperative and adversarial agents. We show via a simple counterexample that the set function mapping additional information links to team performance is in general not supermodular. Although this implies that the greedy algorithm is not accompanied by worst-case performance guarantees, we illustrate through numerical experiments that it can produce effective and often optimal or near optimal information structure modifications.

keywords:

Team decision theory, team games, information structure design, decentralized control

The work of T. Summers is supported by the National Science Foundation under Grant CNS-1566127. M. Kamgarpour is supported by the European Union ERC Starting Grant, CONENE.

1 Introduction

Future critical infrastructures, including electric power, transportation, water, etc., are emerging as cyber-physical networks that will feature cooperative autonomous decision making agents equipped with embedded sensing, computation, communication, and actuation capabilities. A key challenge in the analysis and design of these networks is decentralization of information: each decision making agent must act based on partial information measured or received locally to optimize network operation. Moreover, with growing concerns over cyber-physical security Cardenas et al. (2008); Zhu et al. (2011a, b); Pasqualetti et al. (2013); Sandberg et al. (2015), each agent must not only coordinate its actions with team members, but must also counter against teams of malicious agents to mitigate attack impacts and provide resiliency. The information structure – who knows what and when – is a basic component in formal analyses of these issues and plays a crucial role in determining optimal strategies and computational tractability Radner (1962); Witsenhausen (1971); Ho and Chu (1972); Basar (1978); Ho (1980); Rotkowitz and Lall (2006); Nayyar et al. (2013); Yüksel and Başar (2013); Başar (2014); Lessard and Lall (2015).

While the importance of information structure is widely recognized in team decision theory, decentralized control, and game theory, the vast majority of the literature focuses on designing decision and control strategies for a given information structure. The design of information structures – who should know what and when – has been recognized as an important problem since the earliest work on team decision theory Marschak (1955); Radner (1962), but has received very little formal attention. Radner notes the emphasis on analysis of strategies for given information structures in a seminal paper on team decision theory:

An important organizational problem is the determination of what statistical information shall be made available to the various decision makers in the organization. Implicit in the solution of such a problem is the determination of the best use that can be made of any given structure of information, i.e., the best decision functions. The results to be presented here are concerned with this latter problem.

This emphasis on the analysis of a given, fixed information structure has carried over into much of the related work, and has presented rich challenges for many decades.

We believe it is important to shift some focus to information structure design, where one jointly optimizes the information structure together with decision strategies, especially in the context of emerging cyber-physical networks. We consider problems of information structure design in team decision problems and team games. We focus here on static problems since no work to our knowledge has been done even in this setting. We also focus on linear quadratic problems since they are analytically tractable, admitting closed-form equilibrium solutions that provide insight into essential properties. In the non-cooperative game setting, we focus on a specific class of games involving two teams with decentralized information structure Colombino et al. (2015), maintaining a sharp distinction between cooperative and adversarial features.

The main contributions are as follows. We formulate several information structure design problems as (finite, combinatorial) set function optimization problems. These can be solved in principle by brute force enumeration, but this approach is not feasible even for moderately sized networks, and is certainly ineffective for the large networks of critical infrastructure that motivate this work. We therefore propose simple greedy algorithms that provide an effective and scalable heuristic. We show via a simple counterexample that the set function mapping additional information links to team performance is in general not supermodular. Although this implies that the greedy algorithm is not accompanied by theoretical worst-case performance guarantees, we illustrate its effectiveness and scalability through numerical experiments, showing that it often produces optimal or near optimal information structure modifications.

Our focus here is on a general mathematical framework, but many emerging applications in cyber-physical networks feature distributed estimation and control problems that can be formulated as team decision problems and games. For example, information structure design in electrical power networks includes optimal sensor placement (e.g., phasor measurement units and other advanced metering) and optimal communication design for wide area monitoring and control. Furthermore, many large interconnected power grids are operated by a set of independent transmission system operators with objective functions that are not necessarily aligned, and are susceptible to influence by distributed attacking teams with adversarial objectives. Similar distributed estimation and control problems can be formulated in other critical infrastructure, such as transportation networks.

The rest of the paper is laid out as follows. In Section 2 we formulate information structure design problems for single team and multiple team decision problems. In Section 3 we present a greedy algorithm for information structure design. Section 4 presents numerical experiments. Section 5 gives concluding remarks.

2 Problem formulation

We formulate two separate information structure design problems. The first is a (cooperative) single team problem, and the second is a two team problem, which has both cooperative and non-cooperative/adversarial features.

2.1 Team Decision Problems

Fixed information structure.

A team decision problem involves coordinating the decisions of a team of $N$ decision making agents in a stochastic environment. The state of the environment is assumed to be a normal random vector $x\in\mathbf{R}^{n}$ with mean $\bar{x}=\mathbf{E}x$ and covariance matrix $X=\mathbf{E}xx^{T}\succ 0$ , i.e., $x\sim\mathcal{N}(\bar{x},X)$ . It is assumed that every agent knows the environment state statistics $\bar{x}$ and $X$ . In addition, each agent of the team receives its own noisy local information about the environment state $x$ , which we assume to be linear:

$$z_{i}=H_{i}x+w_{i},\quad i=1,\dots,N,$$

where $H_{i}\in\mathbf{R}^{p_{i}\times n}$ and $w_{i}\sim\mathcal{N}(0,R_{i})$ with $R_{i}\succeq 0$ . Each row of $H_{i}$ can represent information obtained from a sensor or a communication link with another device or agent in the team. For example, in a power network each agent may be a local network monitoring station, and $H_{i}$ could include local measurements from phasor measurement units or communicated information from other parts of the network. We define the information structure as a collection of the parameters specifying the information for each agent

$$S_{0}=\{(H_{1},R_{1}),(H_{2},R_{2}),\dots,(H_{N},R_{N})\}.$$

Each agent must select a decision function $\gamma_{i}:\mathbf{R}^{p_{i}}\rightarrow\mathbf{R}^{m_{i}}$ from a set of Borel measurable functions that specifies its decision $u_{i}=\gamma_{i}(z_{i})$ based on realizations of the random variable $z_{i}$ . We define the team decision function $\gamma=(\gamma_{1},...,\gamma_{N})$ and the associated team decision vector $u=[u_{1}^{T},...,u_{N}^{T}]^{T}\in\mathbf{R}^{\Sigma_{i}m_{i}}$ , which may represent a distributed parameter estimate or control action. The quadratic team cost function is

$$\bar{J}(u)=u^{T}Qx+\frac{1}{2}u^{T}Pu,$$

where

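$$Q=\begin{bmatrix}Q_{1}\\ Q_{2}\\ \vdots\\ Q_{N}\end{bmatrix},\quad P=\begin{bmatrix}P_{11}&P_{12}&\cdots&P_{1N}\\ P_{12}^{T}&P_{22}&\cdots&P_{2N}\\ \vdots&\vdots&\ddots&\vdots\\ P_{1N}^{T}&P_{2N}^{T}&\cdots&P_{NN}\end{bmatrix}$$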

with $Q$ and $P$ partitioned according to agent decision dimensions, i.e., $Q_{i}\in\mathbf{R}^{m_{i}\times n}$ , $P_{ii}\in\mathbf{R}^{m_{i}\times m_{i}}$ , and $P_{ij}\in\mathbf{R}^{m_{i}\times m_{j}}$ . The cost function is assumed to be strictly convex in $u$ , i.e., $P\succ 0$ . For any given team decision function $\gamma$ we define the expected cost

$$J(\gamma,S_{0})=\mathbf{E}_{x}\left(u^{T}Qx+u^{T}Pu\right),\quad u_{i}=\gamma_{i}(z_{i}(x)).$$

The optimal value of the objective function under the optimal team decision function is denoted by

$$J^{*}(S_{0})=\min_{\gamma}J(\gamma,S_{0}),\quad \gamma^{*}=\arg\min_{\gamma}J(\gamma,S_{0}).$$

Under the stated assumptions, the optimal decision functions $\gamma_{i}^{*}$ are affine and can be computed by solving a set of linear equations derived from stationarity conditions; see Radner (1962), or for a more general multi-objective game formulation Basar (1978). In particular, the optimal solution consists of each agent forming the conditional state estimate

$$\hat{x}_{i}=\bar{x}+XH_{i}^{T}(H_{i}XH_{i}^{T}+R_{i})^{-1}(z_{i}-H_{i}\bar{x})$$

and using the affine decision function

$$u_{i}=\gamma_{i}(z_{i})=A_{i}\bar{x}+B_{i}(\hat{x}_{i}-\bar{x}),$$

where $A_{i}$ and $B_{i}$ are the unique solutions to the linear equations

[TABLE]
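As a rough numerical sketch of how $J^{*}$ could be evaluated for a fixed information structure, the Python function below solves the stationarity conditions in an equivalent parametrization $u_{i}=a_{i}+L_{i}(z_{i}-H_{i}\bar{x})$, where $L_{i}$ plays the role of $B_{i}XH_{i}^{T}(H_{i}XH_{i}^{T}+R_{i})^{-1}$ and $a_{i}$ that of $A_{i}\bar{x}$. Several things here are assumptions rather than statements from the paper: the cost convention $\mathbf{E}(u^{T}Qx+\frac{1}{2}u^{T}Pu)$ taken from the definition of $\bar{J}$, mutually independent measurement noises, nonsingular $H_{i}XH_{i}^{T}+R_{i}$, and the function name and argument layout, which are purely illustrative.

```python
import numpy as np

def team_optimal_cost(x_bar, X, Q_blocks, P, H_list, R_list):
    """Optimal value of E[u'Qx + 0.5 u'Pu] over affine decisions
    u_i = a_i + L_i (z_i - H_i x_bar).  Illustrative sketch only."""
    m = [Qi.shape[0] for Qi in Q_blocks]          # decision dimensions m_i
    p = [Hi.shape[0] for Hi in H_list]            # measurement dimensions p_i
    N = len(m)
    u_off = np.concatenate(([0], np.cumsum(m)))   # offsets of u_i inside u
    l_off = np.concatenate(([0], np.cumsum([mi * pi for mi, pi in zip(m, p)])))

    # Mean part: a = -P^{-1} Q x_bar contributes -0.5 (Q x_bar)' P^{-1} (Q x_bar).
    q = np.vstack(Q_blocks) @ x_bar
    mean_cost = -0.5 * q @ np.linalg.solve(P, q)

    # Zero-mean part: minimize c'l + 0.5 l'Ml with l = stacked row-major vec(L_i).
    M = np.zeros((l_off[-1], l_off[-1]))
    c = np.zeros(l_off[-1])
    for i in range(N):
        # Linear term: Q_i Cov(x, z_i) = Q_i X H_i'
        c[l_off[i]:l_off[i + 1]] = (Q_blocks[i] @ X @ H_list[i].T).ravel()
        for j in range(N):
            P_ij = P[u_off[i]:u_off[i + 1], u_off[j]:u_off[j + 1]]
            S_ij = H_list[i] @ X @ H_list[j].T    # Cov(z_i, z_j)
            if i == j:
                S_ij = S_ij + R_list[i]           # own noise enters only the diagonal block
            # Row-major identity: vec(P_ij L_j S_ji) = kron(P_ij, S_ij) vec(L_j)
            M[l_off[i]:l_off[i + 1], l_off[j]:l_off[j + 1]] = np.kron(P_ij, S_ij)

    vec_L = np.linalg.solve(M, -c)                # stationarity: M l = -c
    return mean_cost + 0.5 * c @ vec_L            # optimal value of the quadratic

# Scalar check: x ~ N(0, 1), z = x + w with w ~ N(0, 1), Q = P = 1.
one = np.eye(1)
print(team_optimal_cost(np.zeros(1), one, [one], one, [one], [one]))  # -0.25
```

In the scalar check the agent uses $u=-\hat{x}=-z/2$, which gives $-\frac{1}{2}+\frac{1}{4}=-0.25$ by hand, so the sketch agrees with the conditional-estimate form of the solution in that case.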

Information Structure Design.

We now suppose that for each agent there is a finite set of possible measurements or communicated information about the environmental state that could be added to its information; we let $q_{i}$ denote the number of possible additional measurements or communication links that could be added to agent $i$ . We collect the parameters defining these possibilities for the whole team into the finite set

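$$\begin{aligned}V=\{&(h_{11},r_{11}),(h_{12},r_{12}),\dots,(h_{1q_{1}},r_{1q_{1}}),\\ &(h_{21},r_{21}),(h_{22},r_{22}),\dots,(h_{2q_{2}},r_{2q_{2}}),\ \dots,\\ &(h_{N1},r_{N1}),(h_{N2},r_{N2}),\dots,(h_{Nq_{N}},r_{Nq_{N}})\}\end{aligned}\tag{8}$$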

where $h_{ij}\in\mathbf{R}^{n}$ represents the $j$ th possible additional measurement or communicated information about the environmental state that could be added to the information of agent $i$ , and $r_{ij}\geq 0$ represents the associated variance. We assume that each additional observation has an associated measurement noise that is independent of other measurement noise variables. (It is straightforward to allow the noise of an additional observation to be statistically dependent on other noise variables, but we assume independence to simplify notation.) In a power network, $V$ may represent, e.g., a set of additional phasor measurement units or wide area communication links that could augment the information set of each agent.

For any subset $S\subseteq V$ , we associate a modified information structure by including the selected information links in the appropriate agents’ information model. For example, the information structure modification

$$S=\{(h_{13},r_{13}),(h_{32},r_{32}),(h_{43},r_{43}),(h_{45},r_{45})\}\subset V$$

means that we add the third possible additional link to agent 1, the second possible additional link to agent 3, and the third and fifth possible additional links to agent 4, so that

[TABLE]

where $w_{13}\sim\mathcal{N}(0,r_{13})$ , $w_{32}\sim\mathcal{N}(0,r_{32})$ , $w_{43}\sim\mathcal{N}(0,r_{43})$ , and $w_{45}\sim\mathcal{N}(0,r_{45})$ are independent of all other measurement noise variables.
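As a sketch of what this modification looks like, assume that each selected link is appended as an extra row of the corresponding agent's measurement matrix (this stacking convention is an assumption made here for illustration). The modified measurements of agents 1, 3, and 4 can then be written as follows, with all other agents' measurements unchanged:

```latex
z_{1}=\begin{bmatrix}H_{1}\\ h_{13}^{T}\end{bmatrix}x+\begin{bmatrix}w_{1}\\ w_{13}\end{bmatrix},\qquad
z_{3}=\begin{bmatrix}H_{3}\\ h_{32}^{T}\end{bmatrix}x+\begin{bmatrix}w_{3}\\ w_{32}\end{bmatrix},\qquad
z_{4}=\begin{bmatrix}H_{4}\\ h_{43}^{T}\\ h_{45}^{T}\end{bmatrix}x+\begin{bmatrix}w_{4}\\ w_{43}\\ w_{45}\end{bmatrix}
```

Under the independence assumption, the noise covariance of each augmented measurement is block diagonal, e.g. $\mathrm{diag}(R_{4},r_{43},r_{45})$ for agent 4.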

Let $J^{*}(S)$ denote the optimal value of the team cost function associated with the information structure modification $S$ . Our first problem of interest is to select an information structure modification of size $k$ to minimize the optimal value of the team decision problem using the associated optimal decision functions for the modified information structure. (Our algorithms can be easily adapted to a setting where each information structure modification has its own fixed cost, and we search for an optimal modification that satisfies a total budget constraint.) We can pose this as a cardinality constrained set function optimization problem

$$\min_{S\subset V,\ |S|=k}J^{*}(S).\tag{10}$$

2.2 Two Team Games

We now formulate an analogous problem for a two team stochastic game. In this setting, there are two teams, which we call blue and red, each of which consists of a set of decision making agents interacting in a stochastic environment. We assume again that the environment state is a normal random vector $x\in\mathbf{R}^{n}$ with mean $\bar{x}=\mathbf{E}x$ and covariance matrix $X=\mathbf{E}xx^{T}\succ 0$ and that every agent knows the environment state statistics $\bar{x}$ and $X$ . The blue team has $N$ decision making agents, and the red team has $M$ decision making agents. The blue team may represent agents associated with a network operator, while the red team may represent a set of non-cooperative agents or malicious attackers. The difference here is that each team has its own objective function, introducing a non-cooperative or adversarial element to the problem in addition to the cooperation required amongst team members.

Fixed information structure.

The blue team receives information

$$z_{i}=H_{i}x+w_{i},\quad i=1,\dots,N,$$

where $H_{i}\in\mathbf{R}^{p_{i}\times n}$ and $w_{i}\sim\mathcal{N}(0,R_{i})$ , and the red team receives information

$$y_{j}=G_{j}x+t_{j},\quad j=1,\dots,M,$$

where $G_{j}\in\mathbf{R}^{l_{j}\times n}$ and $t_{j}\sim\mathcal{N}(0,T_{j})$ . The information structure for the blue team is

$$S_{0}=\{(H_{1},R_{1}),(H_{2},R_{2}),\dots,(H_{N},R_{N})\},$$

and the information structure for the red team is

$$T_{0}=\{(G_{1},T_{1}),(G_{2},T_{2}),\dots,(G_{M},T_{M})\}.$$

Each agent on the blue team must select a decision function $\gamma_{i}:\mathbf{R}^{p_{i}}\rightarrow\mathbf{R}^{m_{i}}$ from a set of measurable functions that specifies its decision $u_{i}=\gamma_{i}(z_{i})$ , and each agent on the red team must select a decision function $\lambda_{j}:\mathbf{R}^{l_{j}}\rightarrow\mathbf{R}^{k_{j}}$ from a set of measurable functions that specifies its decision $v_{j}=\lambda_{j}(y_{j})$ .

In a non-cooperative two team game, each team has a separate objective function that is neither directly aligned nor misaligned with that of the opposing team. We define the team decision functions $\gamma=(\gamma_{1},...,\gamma_{N})$ and $\lambda=(\lambda_{1},...,\lambda_{M})$ and the associated team decision vectors $u=[u_{1}^{T},...,u_{N}^{T}]^{T}\in\mathbf{R}^{\Sigma_{i}m_{i}}$ and $v=[v_{1}^{T},...,v_{M}^{T}]^{T}\in\mathbf{R}^{\Sigma_{j}k_{j}}$ . The blue team cost function is

$$\bar{J}^{1}(u,v)=u^{T}Q^{1}x+\frac{1}{2}\left(u^{T}P^{1}u+v^{T}R^{1}u\right),$$

and the red team seeks to optimize a cost function

$$\bar{J}^{2}(u,v)=v^{T}Q^{2}x+\frac{1}{2}\left(v^{T}P^{2}v+v^{T}R^{2}u\right).$$

It is assumed that $P^{i}\succ 0,\ i=1,2$ , so that $\bar{J}^{1}(u,v)$ is strictly convex in $u$ and $\bar{J}^{2}(u,v)$ is strictly convex in $v$ .

For any given team decision functions $\gamma$ and $\lambda$ we define the expected costs

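$$\begin{aligned}J^{1}(S_{0},T_{0},\gamma,\lambda)&=\mathbf{E}_{x}\left(u^{T}Q^{1}x+u^{T}P^{1}u+2v^{T}R^{1}u\right),\\ J^{2}(S_{0},T_{0},\gamma,\lambda)&=\mathbf{E}_{x}\left(v^{T}Q^{2}x+v^{T}P^{2}v+2v^{T}R^{2}u\right),\end{aligned}$$

with $u_{i}=\gamma_{i}(z_{i}(x))$ and $v_{j}=\lambda_{j}(y_{j}(x))$ .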

A pair of team decision strategies $(\gamma^{*},\lambda^{*})$ are called Nash equilibrium strategies if

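$$\begin{aligned}\gamma^{*}&\in\arg\min_{\gamma}J^{1}(S_{0},T_{0},\gamma,\lambda^{*}),\\ \lambda^{*}&\in\arg\min_{\lambda}J^{2}(S_{0},T_{0},\gamma^{*},\lambda),\end{aligned}$$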

and the corresponding Nash equilibrium values are denoted by

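$$\begin{aligned}J^{1*}(S_{0},T_{0})&=J^{1}(S_{0},T_{0},\gamma^{*},\lambda^{*}),\\ J^{2*}(S_{0},T_{0})&=J^{2}(S_{0},T_{0},\gamma^{*},\lambda^{*}).\end{aligned}$$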

Under the stated assumptions, the Nash equilibrium decision strategies $\gamma^{*}_{i}$ and $\lambda^{*}_{j}$ are unique and affine, and can be computed by solving a set of linear equations derived from stationarity conditions; see Basar (1978). In particular, the Nash equilibrium solution also consists of each agent on each team forming the conditional state estimates

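$$\begin{aligned}\hat{x}_{i}^{1}&=\bar{x}+XH_{i}^{T}(H_{i}XH_{i}^{T}+R_{i})^{-1}(z_{i}-H_{i}\bar{x}),\\ \hat{x}_{j}^{2}&=\bar{x}+XG_{j}^{T}(G_{j}XG_{j}^{T}+T_{j})^{-1}(y_{j}-G_{j}\bar{x})\end{aligned}$$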

and using the affine decision functions

[TABLE]

where $A_{i}$ and $B_{i}$ are the unique solutions to the linear equations

[TABLE]

and $C_{i}$ and $D_{i}$ are the unique solutions to the linear equations

[TABLE]

with $P^{1}$ , $Q^{1}$ , $R^{1}$ , $P^{2}$ , $Q^{2}$ , $R^{2}$ partitioned according to the dimensions of $u_{i}$ and $v_{j}$ .

Information Structure Design.

We now pose an information structure design problem for the blue team, with the information structure of the red team held fixed; an analogous problem can be posed for the red team. As above, we form the finite set $V$ in (8) consisting of all possible measurements or communicated information about the environmental state that could be added to the information structure of the blue team. For any subset $S\subseteq V$ , we associate a modified information structure by including the selected information links in the appropriate agents’ information model.

Let $J^{1*}(S)$ denote the Nash equilibrium value of the blue team associated with the information structure modification $S$ . Our second problem of interest is to select an information structure modification of size $k$ to minimize the Nash equilibrium value of the blue team under the associated Nash equilibrium strategies. Again, we can pose this as a cardinality constrained set function optimization problem

$$\min_{S\subset V,\ |S|=k}J^{1*}(S).\tag{25}$$

Remark 1

In adversarial settings, resilient information structure design problems can be formulated for zero-sum games as a special case of the above by setting $J^{1}=-J^{2}$ , i.e., the blue team seeks to minimize $J^{1}$ while the red team seeks to maximize it.

Remark 2

One can also formulate variations where the blue or red team is allowed to modify the information structure of the other team (perhaps by adding links when the objectives are relatively aligned, or to sabotage by removing links or increasing noise when the objectives are relatively unaligned), or a meta-game where both teams are simultaneously allowed to modify their information structures.

3 Information Structure Design and Lack of Supermodularity

In this section we propose a simple greedy algorithm for the set function optimization problems defined above to formalize information structure design in team decision and game problems. We show that the set functions are not in general supermodular. This implies that the information structure modifications produced by the greedy algorithm do not in general come with worst-case theoretical suboptimality guarantees. Nevertheless, the greedy algorithm can scale to far larger networks than exhaustive search, and we will demonstrate empirically that it often produces near optimal designs.

3.1 Set functions and submodularity

The information structure problems described above are formulated as cardinality constrained set function optimization problems. These problems are combinatorial and finite, and so can be solved simply by brute force enumeration and exhaustive search. However, this approach quickly becomes intractable even for moderately sized problems. The motivating context of large cyber-physical networks requires a different approach.

Greedy algorithms are a simple alternative to exhaustive search. When a set function minimization problem has a certain property called supermodularity, a greedy algorithm achieves results that are provably within a constant factor of the optimal value. Supermodularity (and the closely related submodularity) plays a role in combinatorial optimization similar to that of convexity and concavity in continuous optimization Lovász (1983); Krause and Golovin (2012).

Definition 1

A set function $f:2^{V}\rightarrow\mathbf{R}$ is called supermodular if for all subsets $A\subseteq B\subseteq V$ and all elements $s\notin B$ , it holds that

$$f(A\cup\{s\})-f(A)\leq f(B\cup\{s\})-f(B),\tag{26}$$

or equivalently, if for all subsets $A,B\subseteq V$ , it holds that

$$f(A\cap B)+f(A\cup B)\geq f(A)+f(B).\tag{27}$$

A set function is called submodular if the reversed inequalities in (26) and (27) hold and is called modular if (26) and (27) hold with equality.

Intuitively, supermodularity is a diminishing returns property where adding an element to a smaller set gives a larger benefit than adding it to a larger set. Minimization of supermodular functions (equivalently, maximization of submodular functions) is NP-hard, but a simple greedy heuristic can be used to obtain a solution that is provably close to the optimal solution Nemhauser et al. (1978). The greedy algorithm for set function minimization is shown in Algorithm 1. Several problems in systems and control that feature greedy algorithms and sub- or supermodularity have been recently explored Bushnell et al. (2014); Clark et al. (2014); Summers et al. (2016); Summers and Lygeros (2014); Shames and Summers (2015); Tzoumas et al. (2015). However, other important set function optimization problems in systems and control fail to be sub- or supermodular Summers (2016).

3.2 A greedy algorithm and lack of supermodularity

The simple greedy algorithm described in Algorithm 1 can be directly applied to the information structure design problems that we formulated as cardinality constrained set function optimization problems in (10) and (25). At each iteration, one simply adds the information link that reduces the optimal cost the most by evaluating the optimal cost associated with each possible additional link. The algorithm terminates after $k$ links have been added.
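The greedy rule just described can be sketched in a few lines of Python. This is a minimal illustration, not the paper's listing: `greedy_minimize`, the generic `cost` callback (standing in for $J^{*}(S)$ or $J^{1*}(S)$), and the toy cost with its weights are all hypothetical names and values chosen for the example.

```python
def greedy_minimize(cost, ground_set, k):
    """Greedily build a set of at most k elements that (heuristically) minimizes cost.

    cost: callable taking a frozenset of selected elements and returning a number.
    """
    selected = []
    remaining = list(ground_set)
    for _ in range(min(k, len(remaining))):
        # Evaluate every candidate link and keep the one giving the lowest cost.
        best = min(remaining, key=lambda e: cost(frozenset(selected + [e])))
        selected.append(best)
        remaining.remove(best)
    return selected, cost(frozenset(selected))

# Toy cost (hypothetical): links "a" and "b" are only really useful together.
WEIGHTS = {"a": 1, "b": 1, "c": 2}

def toy_cost(S):
    bonus = 5 if {"a", "b"} <= S else 0
    return 10 - sum(WEIGHTS[e] for e in S) - bonus

print(greedy_minimize(toy_cost, ["a", "b", "c"], 2))  # (['c', 'a'], 7)
```

In this toy instance the greedy choice starts with "c" and ends at cost 7, whereas the pair {"a", "b"} would reach cost 3. That gap is the kind of outcome that becomes possible once the cost is not supermodular.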

For the single team information structure design problem, each iteration requires a set of $2n\sum_{i}m_{i}$ linear equations to be solved to compute the $2n\sum_{i}m_{i}$ optimal strategy coefficients $A_{i}$ and $B_{i}$ in (7), so the total computational complexity is of order $k|V|(n\sum_{i}m_{i})^{3}$ . Within each iteration, the function evaluations for computing the cost of each possible additional link are trivially parallelizable, so distributed computing platforms could be used to scale computations to large networks. Further, it may be possible to exploit the sparsity often found in many cyber-physical networks that motivate these problems.

Unfortunately, it turns out that the set functions defined in (10) and (25) that map information structure modifications to associated optimal team cost values or Nash equilibrium values are not in general supermodular. Consider a single team (cooperative) problem with 2 players, each of whose information could be modified by a single additional link. Suppose

[TABLE]

Let $V=\{(h_{11},r_{11}),(h_{21},r_{21})\}$ , which has four subsets: $A=\{(h_{11},r_{11})\}$ , $B=\{(h_{21},r_{21})\}$ , $A\cap B=\emptyset$ , and $A\cup B=V$ . Evaluating the cost of all of these information structure modifications, we have:

[TABLE]

so that

[TABLE]

which violates the supermodularity inequality in Definition 1. Effectively, the cost benefit provided by each additional link individually is less than the benefit of adding both of them together, so there is no diminishing returns property. It is also easy to construct examples where the submodularity inequality does not hold, so that the set function is in general neither sub- nor supermodular. Since the single team is a special case of the two team problem, the set function for the Nash equilibrium value is neither sub- nor supermodular.
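For small ground sets, the supermodularity inequality can be checked by brute force. The sketch below assumes the standard form of the inequality, $f(A\cup\{s\})-f(A)\leq f(B\cup\{s\})-f(B)$ for all $A\subseteq B\subseteq V$ and $s\notin B$; the helper names and the two test functions are illustrative and do not reproduce the counterexample data above.

```python
from itertools import combinations

def subsets(items):
    """Yield every subset of items as a frozenset."""
    items = list(items)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)

def is_supermodular(f, V, tol=1e-12):
    """Check f(A + s) - f(A) <= f(B + s) - f(B) for all A <= B <= V, s not in B."""
    V = frozenset(V)
    for B in subsets(V):
        for A in subsets(B):
            for s in V - B:
                if f(A | {s}) - f(A) > f(B | {s}) - f(B) + tol:
                    return False
    return True

# |S|^2 is supermodular; -|S|^2 rewards adding links together, so the check fails.
print(is_supermodular(lambda S: len(S) ** 2, range(3)))     # True
print(is_supermodular(lambda S: -(len(S) ** 2), range(3)))  # False
```

The second function behaves like the situation described above: each element alone lowers the value by 1, while two together lower it by 4, so the marginal benefit grows instead of diminishing.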

This implies that the greedy algorithm does not in general produce information structure modifications that are within a constant factor of the globally optimal information structure modifications of a given cardinality. However, the greedy algorithm can be an effective and scalable heuristic, which we demonstrate empirically next.

4 Numerical experiments

To illustrate the effectiveness of our proposed greedy algorithms for information structure design, we considered problems with randomly generated data that were small enough to solve globally by exhaustive search. The data was generated in the following way. We consider a single team (cooperative) problem with 10 states and 4 players. Each player has 3 decision variables and 2 measurements of the state, and the set of information structure modifications consists of 2 possible additional measurements for each player. The goal is to find the 4 best new measurements (out of the 8 possible) to minimize the team cost function. We let $P=\tilde{P}^{T}\tilde{P},X=\tilde{X}^{T}\tilde{X}$ to ensure that $P$ and $X$ are symmetric and positive definite, while each element of $Q,\tilde{P},H_{i},R_{i},h_{ij},r_{ij},\tilde{X}$ is generated independently from a standard normal distribution $\mathcal{N}(0,1)$ .
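A data generator along these lines might look as follows in Python. This is a sketch under stated assumptions, not the authors' script: the function name, the dictionary layout, and the seed are hypothetical, and two choices deviate from a literal reading of the text so that the noise parameters are valid, namely $R_{i}$ is formed as a Gram matrix and $r_{ij}$ as a squared normal draw.

```python
import numpy as np

def random_team_instance(n=10, N=4, m=3, p=2, q=2, seed=0):
    """Random single-team instance: n states, N players, m decisions and
    p measurements per player, q candidate extra links per player."""
    rng = np.random.default_rng(seed)
    P_tilde = rng.standard_normal((N * m, N * m))
    X_tilde = rng.standard_normal((n, n))
    R_tilde = [rng.standard_normal((p, p)) for _ in range(N)]
    return {
        "P": P_tilde.T @ P_tilde,                  # symmetric positive definite (a.s.)
        "X": X_tilde.T @ X_tilde,                  # symmetric positive definite (a.s.)
        "Q": rng.standard_normal((N * m, n)),
        "H": [rng.standard_normal((p, n)) for _ in range(N)],
        # Assumption: Gram matrices keep each R_i a valid covariance.
        "R": [G.T @ G for G in R_tilde],
        # Candidate links (h_ij, r_ij), indexed by player i and option j.
        "h": [[rng.standard_normal(n) for _ in range(q)] for _ in range(N)],
        # Assumption: squared draws keep each variance r_ij nonnegative.
        "r": [[rng.standard_normal() ** 2 for _ in range(q)] for _ in range(N)],
    }

inst = random_team_instance()
print(inst["P"].shape, inst["X"].shape, len(inst["h"]) * len(inst["h"][0]))
```

With the default arguments this yields a 12 by 12 matrix $P$, a 10 by 10 covariance $X$, and 8 candidate links, matching the problem size described above.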

We compare the information structure obtained by the greedy algorithm with the globally optimal information structure found by exhaustive search. For this problem size, the greedy algorithm is about 60 times faster than exhaustive search. We observe that the greedy algorithm often finds a structure with the same value as or very near the globally optimal value. In several hundred problem instances, the greedy algorithm achieves the globally optimal value around 80% of the time, and in the worst instance it is only 25% worse than the globally optimal value. Although there are no guarantees, our experiment shows that the greedy algorithm can produce very good results. Moreover, it scales to problem sizes far larger than what can be handled by exhaustive search, making it much more suitable for scaling to problems involving distributed estimation and control in large cyber-physical networks.
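The exhaustive baseline used for such a comparison can be sketched as below. The names `exhaustive_minimize` and `relative_gap`, the toy cost, and the definition of the gap as a fraction of the optimal value's magnitude are assumptions made for illustration; the text does not specify how "25% worse" was computed.

```python
from itertools import combinations
from math import comb

def exhaustive_minimize(cost, ground_set, k):
    """Evaluate every size-k subset and return the best one with its cost."""
    best_set, best_val = None, float("inf")
    for combo in combinations(list(ground_set), k):
        val = cost(frozenset(combo))
        if val < best_val:
            best_set, best_val = combo, val
    return list(best_set), best_val

def relative_gap(value, optimum):
    """Gap of a heuristic value relative to the optimum (assumed definition)."""
    return (value - optimum) / abs(optimum)

# Toy cost (hypothetical): links "a" and "b" are only really useful together.
WEIGHTS = {"a": 1, "b": 1, "c": 2}

def toy_cost(S):
    bonus = 5 if {"a", "b"} <= S else 0
    return 10 - sum(WEIGHTS[e] for e in S) - bonus

print(exhaustive_minimize(toy_cost, ["a", "b", "c"], 2))  # (['a', 'b'], 3)
print(comb(8, 4))  # 70 subsets for 8 candidate links and k = 4
```

Exhaustive search over 8 candidates with $k=4$ requires 70 cost evaluations, and this count grows combinatorially with $|V|$, which is why it serves only as a baseline on small instances.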

5 Conclusions and Outlook

We have formulated information structure design problems for team decision problems and team games, in which the objective is to jointly design information structure modifications together with optimal strategies. We posed these as set function optimization problems and proposed a greedy algorithm as a heuristic for designing good information structures. We showed via a simple counterexample that the associated set functions are in general not supermodular, so that the greedy algorithms do not in general come with worst-case performance guarantees. However, we observed empirically that the greedy algorithm often produces effective information structure modifications.

Our immediate future work will consider team decision problems and games with dynamics, focusing on tractable information structures in that setting, such as partially nested and quadratically invariant structures. We will also explore alternative convex relaxation approaches and other techniques for scaling the computations to large networks. Finally, we plan to apply the results to application areas, including power systems and transportation networks.

Bibliography

Basar, T. (1978). Decentralized multicriteria optimization of linear stochastic systems. IEEE Transactions on Automatic Control, 23(2), 233–243.
Başar, T. (2014). Stochastic differential games and intricacy of information structures. In Dynamic Games in Economics, 23–49. Springer.
Bushnell, L., Clark, A., and Poovendran, R. (2014). A supermodular optimization framework for leader selection under link noise in linear multi-agent systems. IEEE Transactions on Automatic Control, 59(2), 283–296.
Cardenas, A., Amin, S., and Sastry, S. (2008). Secure control: Towards survivable cyber-physical systems. In 28th International Conference on Distributed Computing Systems, 495–500. IEEE.
Clark, A., Alomair, B., Bushnell, L., and Poovendran, R. (2014). Minimizing convergence error in multi-agent systems via leader selection: A supermodular optimization approach. 59(6), 1480–1494.
Colombino, M., Summers, T., and Smith, R. (2015). Quadratic two-team games. In IEEE Conference on Decision and Control, Osaka, Japan, 3784–3789.
Ho, Y.C. (1980). Team decision theory and information structures. Proceedings of the IEEE, 68(6), 644–654.
Ho, Y.C. and Chu, K.C. (1972). Team decision theory and information structures in optimal control problems–part I. IEEE Trans. on Automatic Control, 17(1), 15–22.