Determinant Codes with Helper-Independent Repair for Single and Multiple   Failures

Mehran Elyasi; Soheil Mohajer

arXiv:1812.01142·cs.IT·January 18, 2021

Determinant Codes with Helper-Independent Repair for Single and Multiple Failures

Mehran Elyasi, Soheil Mohajer

PDF

TL;DR

This paper introduces a new helper-independent repair mechanism for determinant codes in distributed storage, enabling efficient repair of single and multiple failures while maintaining the code's properties.

Contribution

It proposes a helper-independent repair method for determinant codes and demonstrates their capability to repair multiple failures with sub-linear repair bandwidth.

Findings

01

Helper-independent repair mechanism achieved for determinant codes.

02

Determinant codes can repair multiple failures with sub-linear bandwidth.

03

Preserves all original properties of determinant codes.

Abstract

Determinant codes are a class of exact-repair regenerating codes for distributed storage systems with parameters (n, k = d, d). These codes cover the entire trade-off between per-node storage and repair-bandwidth. In an earlier work of the authors, the repair data of the determinant code sent by a helper node to repair a failed node depends on the identity of the other helper nodes participating in the process, which is practically undesired. In this work, a new repair mechanism is proposed for determinant codes, which relaxes this dependency, while preserving all other properties of the code. Moreover, it is shown that the determinant codes are capable of repairing multiple failures, with a per-node repair-bandwidth which scales sub-linearly with the number of failures.

Equations143

(α^{(m)}, β^{(m)}, F^{(m)}) = ((m d), (m - 1 d - 1), m (m + 1 d + 1))

(α^{(m)}, β^{(m)}, F^{(m)}) = ((m d), (m - 1 d - 1), m (m + 1 d + 1))

(\frac{α ^{(m)}}{F ^{(m)}}, \frac{β ^{(m)}}{F ^{(m)}})

(\frac{α ^{(m)}}{F ^{(m)}}, \frac{β ^{(m)}}{F ^{(m)}})

F \leq \frac{d + 1}{ℓ + 2} (ℓ α + \frac{d}{ℓ + 1} β),

F \leq \frac{d + 1}{ℓ + 2} (ℓ α + \frac{d}{ℓ + 1} β),

(\frac{α}{F}, \frac{β}{F}) = (\frac{ℓ + 1}{ℓ ( d + 1 )}, \frac{ℓ + 1}{d ( d + 1 )}) .

(\frac{α}{F}, \frac{β}{F}) = (\frac{ℓ + 1}{ℓ ( d + 1 )}, \frac{ℓ + 1}{d ( d + 1 )}) .

β_{e}^{(m)} = (m d) - (m d - e)

β_{e}^{(m)} = (m d) - (m d - e)

β_{1}^{(m)} = (m d) - (m d - 1) = (m - 1 d - 1) = β^{(m)} .

β_{1}^{(m)} = (m d) - (m d - 1) = (m - 1 d - 1) = β^{(m)} .

\overset{ˉ}{β}_{e}^{(m)} = \frac{1}{d} [m (m + 1 d + 1) - m (m + 1 d - e + 1)]

\overset{ˉ}{β}_{e}^{(m)} = \frac{1}{d} [m (m + 1 d + 1) - m (m + 1 d - e + 1)]

\frac{β ˉ _{e}^{(m)}}{α ^{(m)}} = \frac{m ( d + 1 )}{d ( m + 1 )},

\frac{β ˉ _{e}^{(m)}}{α ^{(m)}} = \frac{m ( d + 1 )}{d ( m + 1 )},

V

V

W

y \in I \sum (- 1)^{ind_{I} (y)} w_{y, I} = 0,

y \in I \sum (- 1)^{ind_{I} (y)} w_{y, I} = 0,

\displaystyle\mathbf{D}_{x,\mathcal{I}}=\left\{\begin{array}[]{l l}v_{x,\mathcal{I}}&\textrm{if $x\in\mathcal{I}$},\\ w_{x,\mathcal{I}\cup\left\{x\right\}}&\textrm{if $x\notin\mathcal{I}$}.\end{array}\right.

\displaystyle\mathbf{D}_{x,\mathcal{I}}=\left\{\begin{array}[]{l l}v_{x,\mathcal{I}}&\textrm{if $x\in\mathcal{I}$},\\ w_{x,\mathcal{I}\cup\left\{x\right\}}&\textrm{if $x\notin\mathcal{I}$}.\end{array}\right.

\displaystyle\mathbf{\Xi}^{f,(m)}_{\mathcal{I},\mathcal{J}}=\left\{\begin{array}[]{l l}(-1)^{\mathsf{ind}_{\mathcal{I}}(x)}\psi_{f,x}&\textrm{if $\mathcal{I}\cup\left\{x\right\}=\mathcal{J}$},\\ 0&\textrm{otherwise},\end{array}\right.

\displaystyle\mathbf{\Xi}^{f,(m)}_{\mathcal{I},\mathcal{J}}=\left\{\begin{array}[]{l l}(-1)^{\mathsf{ind}_{\mathcal{I}}(x)}\psi_{f,x}&\textrm{if $\mathcal{I}\cup\left\{x\right\}=\mathcal{J}$},\\ 0&\textrm{otherwise},\end{array}\right.

R^{f, (m)} = D \cdot Ξ^{f, (m)} .

R^{f, (m)} = D \cdot Ξ^{f, (m)} .

[Ψ_{f} \cdot D]_{I} = x \in I \sum (- 1)^{ind_{I} (x)} [R^{f, (m)}]_{x, I ∖ {x}} .

[Ψ_{f} \cdot D]_{I} = x \in I \sum (- 1)^{ind_{I} (x)} [R^{f, (m)}]_{x, I ∖ {x}} .

\displaystyle\mathbf{\Xi}^{\mathcal{E},(m)}=\left[\begin{array}[]{c|c|c|c}\mathbf{\Xi}^{f_{1},(m)}&\mathbf{\Xi}^{f_{2},(m)}&\cdots&\mathbf{\Xi}^{f_{e},(m)}\end{array}\right].

\displaystyle\mathbf{\Xi}^{\mathcal{E},(m)}=\left[\begin{array}[]{c|c|c|c}\mathbf{\Xi}^{f_{1},(m)}&\mathbf{\Xi}^{f_{2},(m)}&\cdots&\mathbf{\Xi}^{f_{e},(m)}\end{array}\right].

(α^{(2)}, β^{(2)}, F^{(2)}) = ((2 4), (2 - 1 4 - 1), 2 (2 + 1 4 + 1)) = (6, 3, 20) .

(α^{(2)}, β^{(2)}, F^{(2)}) = ((2 4), (2 - 1 4 - 1), 2 (2 + 1 4 + 1)) = (6, 3, 20) .

V

V

W

\displaystyle\begin{split}\left\{\begin{array}[]{l}\mathcal{I}=\left\{1,2,3\right\}:w_{3,\left\{1,2,3\right\}}=w_{2,\left\{1,2,3\right\}}-w_{1,\left\{1,2,3\right\}},\\ \mathcal{I}=\left\{1,2,4\right\}:w_{4,\left\{1,2,4\right\}}=w_{2,\left\{1,2,4\right\}}-w_{1,\left\{1,2,4\right\}},\\ \mathcal{I}=\left\{1,3,4\right\}:w_{4,\left\{1,3,4\right\}}=w_{3,\left\{1,3,4\right\}}-w_{1,\left\{1,3,4\right\}},\\ \mathcal{I}=\left\{2,3,4\right\}:w_{4,\left\{2,3,4\right\}}=w_{3,\left\{2,3,4\right\}}-w_{2,\left\{2,3,4\right\}}.\end{array}\right.\end{split}

\displaystyle\begin{split}\left\{\begin{array}[]{l}\mathcal{I}=\left\{1,2,3\right\}:w_{3,\left\{1,2,3\right\}}=w_{2,\left\{1,2,3\right\}}-w_{1,\left\{1,2,3\right\}},\\ \mathcal{I}=\left\{1,2,4\right\}:w_{4,\left\{1,2,4\right\}}=w_{2,\left\{1,2,4\right\}}-w_{1,\left\{1,2,4\right\}},\\ \mathcal{I}=\left\{1,3,4\right\}:w_{4,\left\{1,3,4\right\}}=w_{3,\left\{1,3,4\right\}}-w_{1,\left\{1,3,4\right\}},\\ \mathcal{I}=\left\{2,3,4\right\}:w_{4,\left\{2,3,4\right\}}=w_{3,\left\{2,3,4\right\}}-w_{2,\left\{2,3,4\right\}}.\end{array}\right.\end{split}

Ψ_{8 \times 4}

Ψ_{8 \times 4}

\displaystyle\begin{split}\left\{\begin{array}[]{lllll}{\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,2\right\}:}&\psi_{f,1}v_{1,\{1,2\}}&+\psi_{f,2}v_{2,\{1,2\}}&+\psi_{f,3}w_{3,\{1,2,3\}}&+\psi_{f,4}w_{4,\{1,2,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,3\right\}:}&\psi_{f,1}v_{1,\{1,3\}}&+\psi_{f,2}w_{2,\{1,2,3\}}&+\psi_{f,3}v_{3,\{1,3\}}&+\psi_{f,4}w_{4,\{1,3,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,4\right\}:}&\psi_{f,1}v_{1,\{1,4\}}&+\psi_{f,2}w_{2,\{1,2,4\}}&+\psi_{f,3}w_{3,\{1,3,4\}}&+\psi_{f,4}v_{4,\{1,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{2,3\right\}:}&\psi_{f,1}w_{1,\{1,2,3\}}&+\psi_{f,2}v_{2,\{2,3\}}&+\psi_{f,3}v_{3,\{2,3\}}&+\psi_{f,4}w_{4,\{2,3,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{2,4\right\}:}&\psi_{f,1}w_{1,\{1,2,4\}}&+\psi_{f,2}v_{2,\{2,4\}}&+\psi_{f,3}w_{3,\{2,3,4\}}&+\psi_{f,4}v_{4,\{2,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{3,4\right\}:}&\psi_{f,1}w_{1,\{1,3,4\}}&+\psi_{f,2}w_{2,\{2,3,4\}}&+\psi_{f,3}v_{3,\{3,4\}}&+\psi_{f,4}v_{4,\{3,4\}}.\end{array}\right.\end{split}

\displaystyle\begin{split}\left\{\begin{array}[]{lllll}{\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,2\right\}:}&\psi_{f,1}v_{1,\{1,2\}}&+\psi_{f,2}v_{2,\{1,2\}}&+\psi_{f,3}w_{3,\{1,2,3\}}&+\psi_{f,4}w_{4,\{1,2,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,3\right\}:}&\psi_{f,1}v_{1,\{1,3\}}&+\psi_{f,2}w_{2,\{1,2,3\}}&+\psi_{f,3}v_{3,\{1,3\}}&+\psi_{f,4}w_{4,\{1,3,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,4\right\}:}&\psi_{f,1}v_{1,\{1,4\}}&+\psi_{f,2}w_{2,\{1,2,4\}}&+\psi_{f,3}w_{3,\{1,3,4\}}&+\psi_{f,4}v_{4,\{1,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{2,3\right\}:}&\psi_{f,1}w_{1,\{1,2,3\}}&+\psi_{f,2}v_{2,\{2,3\}}&+\psi_{f,3}v_{3,\{2,3\}}&+\psi_{f,4}w_{4,\{2,3,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{2,4\right\}:}&\psi_{f,1}w_{1,\{1,2,4\}}&+\psi_{f,2}v_{2,\{2,4\}}&+\psi_{f,3}w_{3,\{2,3,4\}}&+\psi_{f,4}v_{4,\{2,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{3,4\right\}:}&\psi_{f,1}w_{1,\{1,3,4\}}&+\psi_{f,2}w_{2,\{2,3,4\}}&+\psi_{f,3}v_{3,\{3,4\}}&+\psi_{f,4}v_{4,\{3,4\}}.\end{array}\right.\end{split}

w_{3, {1, 2, 3}}

w_{3, {1, 2, 3}}

w_{4, {1, 2, 4}}

A + B

A + B

= ψ_{f, 1} v_{1, {1, 2}} + ψ_{f, 2} v_{2, {1, 2}} + ψ_{f, 3} (w_{2, {1, 2, 3}} - w_{1, {1, 2, 3}}) + ψ_{f, 4} (w_{2, {1, 2, 4}} - w_{1, {1, 2, 4}})

= ψ_{f, 1} v_{1, {1, 2}} + ψ_{f, 2} v_{2, {1, 2}} + ψ_{f, 3} w_{3, {1, 2, 3}} + ψ_{f, 4} w_{4, {1, 2, 4}} .

\displaystyle\mathsf{repair\ symbols\ sent\ by\ node\;}1:\begin{split}\left\{\begin{array}[]{l}{\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,2\right\}:}\psi_{f,1}v_{1,\{1,2\}}-\psi_{f,3}w_{1,\{1,2,3\}}-\psi_{f,4}w_{1,\{1,2,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,3\right\}:}\psi_{f,1}v_{1,\{1,3\}}+\psi_{f,2}w_{1,\{1,2,3\}}-\psi_{f,4}w_{1,\{1,3,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,4\right\}:}\psi_{f,1}v_{1,\{1,4\}}+\psi_{f,2}w_{1,\{1,2,4\}}+\psi_{f,3}w_{1,\{1,3,4\}}.\end{array}\right.\end{split}

\displaystyle\mathsf{repair\ symbols\ sent\ by\ node\;}1:\begin{split}\left\{\begin{array}[]{l}{\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,2\right\}:}\psi_{f,1}v_{1,\{1,2\}}-\psi_{f,3}w_{1,\{1,2,3\}}-\psi_{f,4}w_{1,\{1,2,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,3\right\}:}\psi_{f,1}v_{1,\{1,3\}}+\psi_{f,2}w_{1,\{1,2,3\}}-\psi_{f,4}w_{1,\{1,3,4\}},\\ {\color[rgb]{0,0,1}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,1}\mathcal{I}=\left\{1,4\right\}:}\psi_{f,1}v_{1,\{1,4\}}+\psi_{f,2}w_{1,\{1,2,4\}}+\psi_{f,3}w_{1,\{1,3,4\}}.\end{array}\right.\end{split}

repair symbols sent by node 2 :

repair symbols sent by node 2 :

repair symbols sent by node 3 :

repair symbols sent by node 4 :

Ξ^{f, (2)} \cdot [ψ_{f, 1} ψ_{f, 2} ψ_{f, 3} ψ_{f, 4}]^{⊺} = 0 .

Ξ^{f, (2)} \cdot [ψ_{f, 1} ψ_{f, 2} ψ_{f, 3} ψ_{f, 4}]^{⊺} = 0 .

[Ψ_{h} \cdot D \cdot Ξ^{f, (2)}]_{4} = ψ_{f, 4}^{- 1} \cdot i = 1 \sum 3 ψ_{f, i} \cdot [Ψ_{h} \cdot D \cdot Ξ^{f, (2)}]_{i} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Determinant Codes with Helper-Independent Repair for Single and Multiple Failures

Mehran Elyasi, and Soheil Mohajer M. Elyasi and S. Mohajer are with the Department of Electrical and Computer Engineering, University of Minnesota, Twin Cities, MN 55455, USA, (email: {melyasi, soheil}@umn.edu).

Abstract

Determinant codes are a class of exact-repair regenerating codes for distributed storage systems with parameters $(n,k=d,d)$ . These codes cover the entire trade-off between per-node storage and repair-bandwidth. In an earlier work of the authors, the repair data of the determinant code sent by a helper node to repair a failed node depends on the identity of the other helper nodes participating in the process, which is practically undesired. In this work, a new repair mechanism is proposed for determinant codes, which relaxes this dependency, while preserving all other properties of the code. Moreover, it is shown that the determinant codes are capable of repairing multiple failures, with a per-node repair-bandwidth which scales sub-linearly with the number of failures.

I Introduction

While individual storage units in distributed storage systems (DSS) are subject to temporal or permanent failure, the entire system should be designed to avoid losing the stored data. Coding and storing redundant data is a standard approach to guarantee durability in such systems. Moreover, these systems are equipped with a repair mechanism that allows for a replacement of a failed node. Such replacement can be performed in the functional or exact sense. In functional repair, a failed node will be replaced by another one, so that the consequent family of nodes maintains the data recovery and node-repair properties. In an exact repair process, the content of a failed node will be exactly replicated by the helpers.

Regeneration codes are introduced to manage data recovery and node repair mechanism in DSS. Formally, an $(n,k,d)$ regeneration code with parameters $(\alpha,\beta,F)$ encodes a file comprised of $F$ symbols (from a finite field $\mathbb{F}$ ) into $n$ segments (nodes) $W_{1},W_{2},\dots,W_{n}$ each of size $\alpha$ , such that two important properties are fulfilled: (1) the entire file can be recovered from every subset of $k$ nodes, and (2) whenever a node fails (become inaccessible), it can be repaired by accessing $d$ remaining nodes and downloading $\beta$ symbols from each.

It turns out that there is a fundamental trade-off between the minimum required per-node storage $\alpha$ and the repair-bandwidth $\beta$ , to store a given amount of data $F$ in a DSS. This tradeoff is fully characterized for functional repair in the seminal work of Dimakis et. al [1], where it is shown to be achievable by random network coding. However, for the exact-repair problem, which is notably important from the practical perspective, characterization of the trade-off and design of optimum codes are widely open, except for some special cases. Construction of exact-repair regenerating codes for a system with arbitrary parameters $(n,k,d)$ is a complex task due to several combinatorial constraints to be satisfied. The number of such constraints dramatically increases with $n$ , the total number of nodes in the system.

There are known code constructions for the two extreme points on the tradeoff, namely, minimum bandwidth regeneration (MBR) [2] and minimum storage regeneration (MSR) [2, 3, 4, 5, 6, 7] points.

A lower bound for the trade-off of the exact-repair regenerating codes with parameters $(n,k=d,d)$ is presented independently by [8, 9, 10], under the assumption that the underlying code is linear. This lower bound could be achieved for the special case of $n=k+1$ (i.e., the DSS can only tolerate one failure) by code constructions proposed in [11] and [12]. While the lower bound does not depend on $n$ (the total number of nodes in the system), the performance (storage capacity) of both code constructions in [11] and [12] degrades as $n$ exceeds $k+1$ (see Fig. 1).

I-A Determinant Codes and Helper-Independent Repair

Determinant codes are a family of exact repair regenerating codes, which are introduced in [14, 13] for a DSS with parameters $(n,k=d,d)$ . The main property of these codes is to maintain a constant trade-off between $\alpha/F$ and $\beta/F$ , regardless of the number of the nodes. In particular, these codes can achieve the lower bound [8, 9, 10], and hence they are optimum. The determinant codes have a linear structure and can be obtained from the inner product between an encoder matrix and the message matrix. Especially, product-matrix codes introduced in [2] for MBR and MSR points can be subsumed from the general construction of the determinant codes.

The repair mechanism proposed for the determinant codes in the original paper [13] requires a rather heavy computation at the helper nodes in order to prepare their repair symbols to send to the failed node. More importantly, each helper node $h\in\mathcal{H}$ needs to know the identity of all the other helper nodes participating in the repair process. The assumption of knowing the set of helpers in advance is a limitation of the determinant codes, and it is undesired in real-world systems. In practice, it is preferable that once a request for a repair of a failed node is made, each node can independently decide to whether or not to participate in the repair process and generate the repair data from its content, regardless of the other helper nodes.

On the other hand, besides the repair bandwidth, one of the crucial bottlenecks in the performance of the storage systems is the I/O load, which refers to the amount of data to be read by a helper node to encode for a repair process. While the native constructions for exact repair generating code require a heavy I/O read, the repair-by-transfer (RBT) codes [15] offer an optimum I/O. In [16] an elegant modification is proposed to improve the I/O cost of product-matrix MSR codes, by pre-processing the content of the nodes and storing the repair data on non-systematic nodes instead of the original node content. This results in a semi-RBT code: whenever such modified nodes contribute in a repair process, they merely transfer some of their stored symbols without any computation. Such modification could not be applied on the original determinant codes since the repair symbols from a helper node $h$ to a failed node $f$ could be computed only when the set of other helper nodes $\mathcal{H}$ is identified.

In this paper, we propose a novel repair mechanism for the determinant codes introduced in [13]. In the new repair procedure, data repair symbols from helper node $h$ to a failed node $f$ solely depend on the content of the helper node and the identity of the failed node $f$ . The failed node collects a total of $d\beta$ repair symbols from the helper nodes and can reconstruct all of its missing symbols by simple addition and subtraction of some of the received symbols. This simple repair scheme further allows for modifications proposed in [16], to further improve the I/O overhead of the code.

I-B Multiple Failures Repair

The second contribution of this work is the simultaneous repair for multiple failures. Although single failures are the dominant type of failures in distributed storage systems [17], multiple simultaneous failures occur rather frequently and need to be handled in order to maintain the system’s reliability and fault-tolerance. The naive approach to deal with such failures is to repair each failed node individually and independently from the others. This requires a repair bandwidth from each helper node that scales linearly with the number of failures. There are two types of repair for multiple failures studied in the literature [18]: (i) centralized regenerating codes and (ii) cooperative regenerating codes. In centralized regenerating codes, a single data center is responsible for the repair of all failed nodes. More precisely, once a set of $e$ nodes in the system fail, an arbitrary set of $d\leq n-e$ nodes are chosen, and $\beta_{e}$ repair symbols will be downloaded from each helper node. This leads to a total of $d\cdot\beta_{e}$ symbols which will be used to repair the content of all the failed nodes. The storage-bandwidth trade-off of these codes are studied for two extreme points, namely the minimum storage multi-node repair (MSMR) and the minimum bandwidth multi-node repair (MBMR) points. In particular, in [4] a class of MSMR code is introduced, that are capable of repairing any number of failed nodes $e\leq n-k$ from any number of helper nodes $k\leq d\leq n-e$ , using an optimal repair bandwidth. In cooperative regenerating codes upon failure of a node, the replacement node downloads repair data from a subset of $d$ helper nodes. In the case of multiple failures, the replacement nodes not only download repair data from the helper nodes, but also exchange information among themselves before regenerating the lost data, and this exchanged data between them is included in the repair bandwidth. Similar to the centralized case, the trade-off for these codes for the two extreme points, namely the minimum bandwidth cooperative regeneration (MBCR) codes [19] and the minimum storage cooperative regenerating (MSCR) codes [20, 21, 22] are studied. In particular, in [22] authors introduced explicit constructions of MDS codes with optimal cooperative repair for all possible parameters. Also, they have shown that any MDS code with optimal repair bandwidth under the cooperative model also has optimal bandwidth under the centralized model.

In this work we show that the repair bandwidth required for multiple failures repair in determinant codes can be reduced by exploiting two facts: (i) the overlap between the repair space (linear dependency between the repair symbols) that each helper node sends to the set of failed nodes, and (ii) in the centralized repair, the data center (responsible for the repair process) can perform the repair of the nodes in a sequential manner, and utilize already repaired nodes as helpers for the repair of the remaining failed nodes. Interestingly, using these properties we can limit the maximum (normalized) repair-bandwidth of the helper nodes to a certain fraction of $\alpha$ , regardless of the number of failures. The structure of the code allows us to analyze this overlap, and obtain a closed-form expression for the repair bandwidth. Our codes are not restricted only to the extreme points of the trade-off and can operate at any intermediate point on the optimum trade-off. A similar problem is studied in [18], where a class of codes is introduced to operate at the intermediate points of the trade-off, with an improved repair bandwidth for multiple failures. However, this improvement is obtained at the price of degradation of the system’s storage capacity as $n$ (the total number of nodes) increases. Consequently, the resulting codes designed for two or more simultaneous failures are sub-optimum, and cannot achieve the optimum trade-off between the per-node capacity, repair bandwidth, and the overall storage capacity. One of the main advantages of our proposed code and repair mechanism is to offer a universal code, which provides a significant reduction in the repair bandwidth for multiple failures, without compromising the system performance.

The rest of this paper is organized as follows: For the sake of completeness, we first review the achievable trade-off and the construction of the determinant codes [13] in Sections II and Section III. The new encoding and decoding for the node repair are presented in Section III-B. An illustrative example is provided in Section IV, in which the core idea of data recovery and node repair are demonstrated. The formal proofs of the properties of the proposed code are presented in Section V. Finally, in Section VI we discuss the improved repair-bandwidth for multiple failures in a centralized repair setting.

II Main Result

We start by introducing a few symbols and notations, which are frequently used in this paper.

Notation: We use $\left[k+1:d\right]$ to denote the set of integer numbers $\{k+1,\dots,d\}$ , and $\left[k\right]=\left[1:k\right]$ to represent the set $\left\{1,2,...,k\right\}$ . For a set $\mathcal{X}$ and a member $x\in\mathcal{X}$ , we define $\mathsf{ind}_{\mathcal{X}}(x)=\left|\left\{y\in\mathcal{X}:y\leq x\right\}\right|$ . We use boldface symbols to refer to matrices, and for a matrix $\mathbf{X}$ , we denote its $i$ -th row by $\mathbf{X}_{i}$ . We also use the notation $\mathbf{X}_{:,j}$ to refer to the $j$ -th column of $\mathbf{X}$ . Moreover, we use $\mathbf{X}[\mathcal{A},\mathcal{B}]$ to denote a sub-matrix of $\mathbf{X}$ obtained by rows $i\in\mathcal{A}$ and columns $j\in\mathcal{B}$ . Accordingly, $\mathbf{X}[\mathcal{A},:]$ denotes the sub-matrix of $\mathbf{X}$ by stacking rows $i\in\mathcal{A}$ . Moreover, we may use sets to label rows and/or columns of a matrix, and hence $\mathbf{X}_{\mathcal{I},\mathcal{J}}$ refers to an entry of matrix $\mathbf{X}$ at the row indexed by $\mathcal{I}$ and the column labeled by $\mathcal{J}$ . Finally, for a set $\mathcal{I}$ , we denote the maximum entry of $\mathcal{I}$ by $\max\mathcal{I}$ .

The optimum storage repair-bandwidth of the exact-repair regenerating codes for an $(n,k=d,d)$ system is a piece-wise linear function [13, 14], which is fully characterized by its corner (intersection) points [8, 10, 9]. The determinant codes provide a universal construction for all corner points on the optimum trade-off curve. We assign a mode (denoted by $m$ ) to each corner point, which is an integer in $\{1,2,\dots,d\}$ (from $1$ for MBR to $d$ for MSR point). The main distinction between the result of this work and that of [13, 14] is the fact that the repair data sent by one helper node does not depend on the identity of all the other helper nodes participating the repair process. The following definition formalizes this distinction.

Definition 1.

Consider the repair process of a failed node $f$ using a set of helper nodes $\mathcal{H}$ . The repair process is called helper-independent if the repair data sent by each helper node $h\in\mathcal{H}$ to the failed node $f$ only depends on $f$ and the content of node $h$ (but not the other helpers participating in the repair process).

The following theorem formally states the trade-off achievable by determinant codes.

Theorem 1.

For an $(n,k=d,d)$ distributed storage system and any mode $m=1,2,\dots,d$ , the triple $(\alpha,\beta,F)$ with

[TABLE]

can be achieved under helper-independent exact repair by the code construction proposed in this paper.

It is worth mentioning that this theorem and the achievable points on the trade-off curve are identical to those of [13]. However, the novel repair process presented here has the advantage that the repair data sent by a helper node does not depend on the identity of other helpers participating in the repair process. Moreover, we present a repair mechanism for multiple simultaneous failures. The proposed scheme exploits the overlap between the repair data sent for different failed nodes and offers a reduced repair-bandwidth compared to naively repairing the failed nodes independent of each other.

The code construction is reviewed in Section III for completeness. In order to prove Theorem 1, it suffices to show that the proposed code satisfies the two fundamental properties, namely data recovery and exact node repair. The proof data recovery property is similar to that of [13, Proposition 1], and hence omitted here. The exact-repair property is formally stated in Proposition 2, and proved in Section V. Moreover, Proposition 1 shows that the repair bandwidth of the proposed code does not exceed $\beta^{(m)}$ . This is also proved in Section V.

In Fig. 2 the linear trade-off for a system $d=4$ together with achievable corner points of this paper are depicted.

Remark 1.

Theorem 1 offers an achievable trade-off for the normalized parameters $(\alpha/F,\beta/F)$ given by

[TABLE]

It is shown in [8, 9, 10] that for any linear exact-repair regenerating code with parameters $(n,k=d,d)$ that is capable of storing $F$ symbols, $(\alpha,\beta)$ should satisfy

[TABLE]

where $\ell=\lfloor d\beta/\alpha\rfloor$ takes values in $\{0,1,\dots,d\}$ . This establishes a piece-wise linear lower bound curve, with $d$ (normalized) corner points obtained at integer values of $\ell=d\beta/\alpha$ . For these corner points, the (normalized) operating points $(\alpha/F,\beta/F)$ are given

[TABLE]

These operating points are matching with the achievable (normalized) pairs given in (2). Therefore, determinant codes are optimal, and together with the lower bound of [8, 9, 10] fully characterize the optimum trade-off for exact-repair regenerating codes with parameters $(n,k=d,d)$ .

The next result of this paper provides an achievable bandwidth for multiple repairs.

Theorem 2.

In an $(n,k=d,d)$ determinant codes operating at mode $m$ , the content of any set of $e$ simultaneously failed nodes can be exactly repaired by accessing an arbitrary set of $d$ nodes and downloading

[TABLE]

repair symbols from each helper node.

The repair mechanism for multiple failures is similar to that of single failure presented in Proposition 2. In order to prove Theorem 2, it suffices to show that the repair bandwidth required for multiple failures does not exceed $\beta_{e}^{(m)}$ . This is formally stated in Proposition 3 and proved in Section V.

Remark 2.

Note that the repair bandwidth proposed for multiple repairs in Theorem 2 subsumes the one in Theorem 1 for single failure for setting $e=1$ :

[TABLE]

Remark 3.

It is worth mentioning that the repair-bandwidth proposed in Theorem 2 is universally and simultaneously achievable. That is, the same determinant code can simultaneously achieve $\beta_{e}^{(m)}$ for every $e\in\{1,2,\dots,n-d\}$ .

The next theorem shows that the repair bandwidth for multiple failures can be further reduced in the centralized repair setting [4, 18], by a sequential repair mechanism, and exploiting the repair symbols contributed by already repaired failed nodes which can act as helpers.

Theorem 3.

In an $(n,k=d,d)$ determinant code with (up to a scalar factor) parameters $\alpha^{(m)}=\binom{d}{m}$ and $F^{(m)}=m\binom{d+1}{m+1}$ , any set of $e$ simultaneously failed nodes can be centrally repaired by accessing an arbitrary set of $d$ helper nodes and downloading a total of

[TABLE]

repair symbols from each helper node.

Remark 4.

It is worth noting that for $e>d-m$ we have $\bar{\beta}_{e}^{(m)}=\frac{m}{d}\binom{d+1}{m+1}=F^{(m)}/d$ (independent of $e$ ), and

[TABLE]

which is strictly less than $1$ as shown in Fig. 3 (for all corner points except the MSR point, $m=d$ ). The fact that $\bar{\beta}_{e}^{(m)}=F^{(m)}/d$ implies that the helper nodes contribute just enough number of repair symbols to be able to recover the entire file, without sending any redundant data. It is clear that this repair-bandwidth is optimum for $e\geq d$ , since such a set of $e$ failed nodes should be able to recover the entire file after being repaired.

This theorem is built on the result of Theorem 2, by exploiting the repair data can be exchanged among the failed nodes. Note that in the centralized repair setting, the information exchanged among the failed nodes at the repair center are not counted against the repair bandwidth. We prove this theorem in Section VI.

III Construction of $(n,k=d,d)$ determinant codes

The code construction described in this section is identical to that of [13], except the repair process which is different and simpler. However, for the sake of completeness, we start with the details of code construction.

III-A Code Construction

For a distributed storage system with parameters $(n,k=d,d)$ and corresponding to a mode $m\in\left\{1,2,\dots,d\right\}$ , our construction provides an exact-repair regenerating code with per-node storage capacity $\alpha^{(m)}=\binom{d}{m}$ and per-node repair-bandwidth $\beta^{(m)}=\binom{d-1}{m-1}$ . This code can store up to $F^{(m)}=m\binom{d+1}{m+1}$ symbols.

We represent the coded symbols in a matrix $\mathcal{C}_{n\times\alpha}$ , in which the $i$ -th row corresponds to the encoded data to be stored in $i$ -th node of DSS. The proposed code is linear, i.e., the encoded matrix $\mathcal{C}$ is obtained by multiplying an encoder matrix $\mathbf{\Psi}_{n\times d}$ and a message matrix111The number of entries in this matrix is more than $F$ , the size of the file to be coded. Indeed, there are some redundancies among the entries of this matrix as will be explained later. $\mathbf{D}_{d\times\alpha}$ , whose construction will be explained later. All entries of the encoder matrix and the message matrix222In general elements of the message matrix can be chosen from a Galois field $\mathbb{F}=\mathsf{GF}(q^{s})$ for some prime number $q$ and an integer $s$ . are assumed to be from a finite field $\mathbb{F}$ , which has at least $n$ distinct elements. Moreover, all the arithmetic operations are performed with respect to the underlying finite field. The structures of the encoder and message matrices are given below.

Encoder Matrix: The matrix $\mathbf{\Psi}_{n\times d}$ is a fixed matrix which is shared among all the nodes in the system. The main property required for matrix $\mathbf{\Psi}$ is being Maximum-Distance-Separable (MDS), that is, any $d\times d$ sub-matrix of $\mathbf{\Psi}_{n\times d}$ is full-rank. Examples of MDS matrices include Vandermonde or Cauchy matrices. We can always convert an MDS matrix to a systematic MDS matrix, by multiplying it by the inverse of its top $d\times d$ sub-matrix (see the example in Section IV). We refer to the first $k$ nodes by systematic nodes if a systematic MDS matrix is used for encoding.

Message Matrix: The message matrix $\mathbf{D}$ is filled with raw (source) symbols and parity symbols. Recall that $\mathbf{D}$ is a $d\times\alpha$ matrix, that has $d\alpha=d\binom{d}{m}$ entries, while we wish to store only $F=m\binom{d+1}{m+1}$ source symbols. Hence, there are $d\alpha-F=\binom{d}{m+1}$ redundant entries in $\mathbf{D}$ , which are filled with parity symbols. More precisely, we divide the set of $F$ data symbols into two groups, namely, $\mathcal{V}$ and $\mathcal{W}$ , whose elements are indexed by sets as follows

[TABLE]

Note that each element of $\mathcal{V}$ is indexed by a set ${\mathcal{I}}\subseteq\left[d\right]$ of length $|{\mathcal{I}}|=m$ and an integer number $x\in{\mathcal{I}}$ . Hence, $\left|\mathcal{V}\right|=m\binom{d}{m}$ . Similarly, symbols in $\mathcal{W}$ are indexed by a pair $(x,\mathcal{I})$ , where $\mathcal{I}$ is a subset of $\left[d\right]$ with $m+1$ entries, and $x$ can take any value in $\mathcal{I}$ except the largest one. So, there are $\left|\mathcal{W}\right|=m\binom{d}{m+1}$ symbols in set $\mathcal{W}$ . Note that $F=|\mathcal{V}|+|\mathcal{W}|$ .

For the sake of completeness, we define parity symbols indexed by pairs $(x,\mathcal{I})$ , where $\left|\mathcal{I}\right|=m+1$ and $\mathsf{ind}_{\mathcal{I}}(x)=m+1$ . Such symbols are constructed such that parity equations

[TABLE]

hold for any $\mathcal{I}\subseteq\left[d\right]$ with $|\mathcal{I}|=m+1$ . In other words, such missing symbols are given by333Note that for an underlying Galois field $\mathsf{GF}(2^{s})$ with characteristic $2$ , the parity equation reduces to $\sum_{y\in\mathcal{I}}w_{y,\mathcal{I}}=0$ . $(-1)^{m+1}w_{\max\mathcal{I},\mathcal{I}}=-\sum_{y\in\mathcal{I}\setminus\left\{\max\mathcal{I}\right\}}(-1)^{\mathsf{ind}_{\mathcal{I}}(y)}w_{y,\mathcal{I}}$ .

The rows of matrix $\mathbf{D}$ are labeled by numbers $1,2,\dots,d$ , and the columns are labeled by subsets $\mathcal{I}$ of $\left[d\right]$ of size $|\mathcal{I}|=m$ . The entries of matrix $\mathbf{D}$ are given by

[TABLE]

It is shown in [13, Proposition 1] that the entire data encoded by this code can be recovered from the content of any $k=d$ nodes. Next, we show the exact-repair properties for single and multiple failures.

III-B Single Failure Exact Repair

The second important property of the proposed code is its ability to exactly repair the content of a failed node using the repair data sent by the helper nodes. Let node $f\in\left[n\right]$ fails, and a set of helper nodes $\mathcal{H}\subseteq\{1,2,\dots,n\}\setminus\{f\}$ with $|\mathcal{H}|=d$ wishes to repair node $f$ . We first determine the repair data sent from each helper node in order to repair node $f$ .

Repair Encoder Matrix at the Helper Nodes: For a determinant code operating in mode $m$ and a failed node $f$ , the repair-encoder matrix $\mathbf{\Xi}^{f,(m)}$ is defined as a $\binom{d}{m}\times\binom{d}{m-1}$ matrix, whose rows are labeled by $m$ -element subsets of $\left[d\right]$ and columns are labeled by $(m-1)$ -element subsets of $\left[d\right]$ . The entry in row $\mathcal{I}$ and column $\mathcal{J}$ is given by

[TABLE]

where $\psi_{f,x}$ is the entry of the encoder matrix $\mathbf{\Psi}$ at position $(f,x)$ . An example of the $\Xi$ matrix is given in (24) in Section IV.

In order to repair node $f$ , each helper node $h\in\mathcal{H}$ multiplies its content $\mathbf{\Psi}_{h}\cdot\mathbf{D}$ by the repair-encoder matrix of node $f$ to obtain $\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(m)}$ , and sends it to node $f$ . Note that matrix $\mathbf{\Xi}^{f,(m)}$ has $\binom{d}{m-1}$ columns, and hence the length of the repair data $\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(m)}$ is $\binom{d}{m-1}$ , which is greater than $\beta=\binom{d-1}{m-1}$ . However, the following proposition states that out of $\binom{d}{m-1}$ columns of matrix $\mathbf{\Xi}^{f,(m)}$ at most $\beta^{(m)}=\binom{d-1}{m-1}$ are linearly independent. Thus, the entire vector $\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(m)}$ can be sent by communicating at most $\beta$ symbols (corresponding to the linearly independent columns of $\mathbf{\Xi}^{f,(m)}$ ) to the failed node, and other symbols can be reconstructed using the linear dependencies among the repair symbols. This is formally stated in the following proposition, which is proved in Section V.

Proposition 1.

The rank of matrix $\mathbf{\Xi}^{f,(m)}$ is at most $\beta^{(m)}=\binom{d-1}{m-1}$ .

Decoding at the Failed Node: Upon receiving $d$ repair-data vectors $\left\{\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(m)}:h\in\mathcal{H}\right\}$ , the failed node stacks them to form a matrix $\mathbf{\Psi}[\mathcal{H},:]\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(m)}$ , where $\mathbf{\Psi}[\mathcal{H},:]$ in the sub-matrix of $\mathbf{\Psi}$ obtained from nodes $h\in\mathcal{H}$ . This matrix is full-rank by the definition of the $\Psi$ matrix. Multiplying by $\mathbf{\Psi}[\mathcal{H},:]^{-1}$ , the failed node retrieves

[TABLE]

This is a $d\times\binom{d}{m-1}$ matrix. These $d\binom{d}{m-1}$ linear combinations of the data symbols span a linear subspace, which we refer to by repair space of node $f$ . The following proposition shows that all of the missing symbols of node $f$ can be recovered from its repair space.

Proposition 2.

In the $(n,k=d,d)$ proposed codes with parameters $(\alpha^{(m)},\beta^{(m)},F^{(m)})=\left(\binom{d}{m},\binom{d-1}{m-1},m\binom{d+1}{m+1}\right)$ , for every failed node $f\in\left[n\right]$ and set of helpers $\mathcal{H}\subseteq\left[n\right]\setminus\left\{f\right\}$ with $\left|H\right|=d$ , the content of node $f$ can be exactly regenerated by downloading $\beta$ symbols from each of nodes in $\mathcal{H}$ . More precisely, the $\mathcal{I}$ -th entry of the node $f$ can be recovered using

[TABLE]

The proof of this proposition is presented in Section V.

Remark 1.

Note that for a code defined on the Galois field $\mathsf{GF}(2^{s})$ with characteristic $2$ , we have $-1=+1$ , and hence, all the positive and negative signs disappear. In particular, the parity equation in (5) will simply reduce to $\sum_{y\in\mathcal{I}}w_{y,\mathcal{I}}=0$ , the non-zero entries of the repair encoder matrix in (11) will be $\psi_{f,x}$ , and the repair equation in (13) will be replaced by $\left[\mathbf{\Psi}_{f}\cdot\mathbf{D}\right]_{\mathcal{I}}=\sum_{x\in\mathcal{I}}\left[\mathbf{R}^{f,(m)}\right]_{x,\mathcal{I}\setminus\left\{x\right\}}$ .

III-C Multiple Failure Exact Repair

The repair mechanism proposed for multiple failure scenario is similar to that of the single failure case. We consider a set of failed nodes $\mathcal{E}$ with $e=|\mathcal{E}|$ failures. Each helper node $h\in\mathcal{H}$ sends its repair data to all failed nodes simultaneously. Each failed node $f\in\mathcal{E}$ can recover the repair data $\left\{\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(m)}:h\in\mathcal{H}\right\}$ , and the repair mechanism is similar to that explained in Proposition 2.

A naive approach is to simply concatenate all the required repair data $\left\{\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(m)}:f\in\mathcal{E}\right\}$ at the helper node $h\in\mathcal{H}$ and send it to the failed nodes. More precisely, for a set of failed nodes $\mathcal{E}=\{f_{1},f_{2},\dots,f_{e}\}$ and a helper node $h\in\mathcal{H}$ , we define its repair data as $\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{\mathcal{E},(m)}$ , where

[TABLE]

This is simply a concatenation of the repair data for individual repair of $f\in\mathcal{E}$ , and the content of each failed node can be exactly reconstructed according to Proposition 2. The repair bandwidth required for naive concatenation scheme is $e\times\beta^{(m)}_{1}=e\binom{d-1}{m-1}$ . Instead, we show that the bandwidth can be opportunistically utilized by exploiting the intersection between the repair space of the different failed nodes. The following proposition shows that the repair data $\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{\mathcal{E},(m)}$ can be delivered to the failed nodes by communicating only $\beta_{e}^{(m)}$ repair symbols.

Proposition 3.

Assume that a family of $e$ nodes $\mathcal{E}=\left\{f_{1},f_{2},\cdots,f_{e}\right\}$ are failed. Then the rank of matrix $\mathbf{\Xi}^{\mathcal{E},(m)}$ defined in (15) is at most $\binom{d}{m}-\binom{d-e}{m}$ .

IV An Illustrative Example for $(n=8,k=4,d=4)$ codes

Before presenting the formal proof of the main properties of the proposed code, we show the code construction and the repair mechanism through an example in this section. This example is similar to that of [13], and will be helpful to understand the notation and the details of the code construction, as well as to provide an intuitive justification for its underlying properties.

Let’s consider a distributed storage system with parameters $(n,k,d)=(8,4,4)$ and an operating mode $m=2$ . The parameters of the proposed regeneration code for this point of the trade-off are given by

[TABLE]

We first label and partition the information symbols into two groups, $\mathcal{V}$ and $\mathcal{W}$ , with $|\mathcal{V}|=m\binom{d}{m}=2\binom{4}{2}=12$ and $|\mathcal{W}|=m\binom{d}{m+1}=2\binom{4}{3}=8$ . Note that $|\mathcal{V}|+|\mathcal{W}|=20=F$ .

[TABLE]

Moreover, for each subset $\mathcal{I}\subseteq\left[4\right]$ with $|\mathcal{I}|=m+1=3$ , we define parity symbols as

[TABLE]

Next, the message matrix $\mathbf{D}$ will be formed by placing $v$ and $w$ symbols as specified in (8). The resulting message matrix is given by

The next step for encoding the data is multiplying $\mathbf{D}$ by an encoder matrix $\mathbf{\Psi}$ . To this end, we choose a finite field $\mathbb{F}_{13}$ (with at least $n=8$ distinct non-zero entries), and pick an $8\times 4$ Vandermonde matrix generated by $i=1,2,3,4,5,6,7,8$ . We convert this matrix to a systematic MDS matrix by multiplying it from the right by the inverse of its top $4\times 4$ matrix. That is,

[TABLE]

Note that every $k=4$ rows of matrix $\mathbf{\Psi}$ are linearly independent, and form an invertible matrix. Then the content of node $i$ is formed by row $i$ in the matrix product $\mathbf{\Psi}\cdot\mathbf{D}$ , which we denote by $\mathbf{\Psi}_{i}\cdot\mathbf{D}$ .

Data recovery from the content of any $k=4$ node is immediately implied by the MDS property of the encoder matrix. For further details, we refer to Section IV in [13]. Next, we describe the repair process for single and multiple failures.

IV-A Single Failure Repair

First, suppose that a non-systematic node $f$ fails, and we wish to repair it by the help of the systematic nodes $\mathcal{H}=\{1,2,3,4\}$ , by downloading $\beta=3$ from each. The content of node $f$ is given by $\mathbf{\Psi}_{f}\cdot\mathbf{D}$ , which includes $\alpha=6$ symbols. Note that the content of this node is a row vector whose elements has the same labeling as the columns of $\mathbf{D}$ , i.e all $m=2$ elements subsets of $\left[d\right]=\left\{1,2,3,4\right\}$ . The symbols of this node are given by:

[TABLE]

In the repair procedure using the systematic nodes as helpers, every symbol will be repaired by $m$ nodes. Recall that $d$ helper nodes contribute in the repair process by sending $\beta=\binom{d-1}{m-1}$ symbols each, in order to repair $\alpha=\binom{d}{m}$ missing symbols. Hence, the number of repair equations per missing symbol is $d\beta/\alpha=m$ , which matches with the proposed repair mechanism.

The $m=2$ helpers for each missing encoded symbol are those who have a copy of the corresponding $v$ -symbols, e.g., for the symbol indexed by $\mathcal{I}=\left\{1,2\right\}$ which has $v_{1,\{1,2\}}$ and $v_{2,\{1,2\}}$ , the contributing helpers are nodes $1$ (who has a copy of $v_{1,\{1,2\}}$ ) and node $2$ (who stores a copy of $v_{2,\{1,2\}}$ ). To this end, node $1$ can send $\psi_{f,1}v_{1,\{1,2\}}$ which node $2$ sends $\psi_{f,2}v_{2,\{1,2\}}$ to perform the repair.

It can be seen that the $\left\{1,2\right\}$ -th missing symbols has also two other terms depending on $w_{3,\{1,2,3\}}$ and $w_{4,\{1,2,4\}}$ , which are stored at nodes $3$ and $4$ , respectively. A naive repair mechanism requires these two nodes also to contribute in this repair procedure, which yields in a full data recovery in order to repair a failed node. Alternatively, we can reconstruct these $w$ -symbols using the parity equations, and the content of the the first two helper nodes. Recall from (5) that

[TABLE]

where $w_{1,\{1,2,3\}}$ and $w_{1,\{1,2,4\}}$ are stored in node $1$ and $w_{2,\{1,2,3\}}$ and $w_{2,\{1,2,4\}}$ are stored in node $2$ . Hence, the content of nodes $1$ and $2$ are sufficient to reconstruct the $\left\{1,2\right\}$ -th symbol at the failed node $f$ . To this end, node $1$ computes $A=\psi_{f,1}v_{1,\{1,2\}}-\psi_{f,3}w_{1,\{1,2,3\}}-\psi_{f,4}w_{1,\{1,2,4\}}$ (a linear combination of its first, forth, and fifth entries), and sends $A$ to $f$ . Similarly, node $2$ sends $B=\psi_{f,2}v_{2,\{1,2\}}+\psi_{f,3}w_{2,\{1,2,3\}}+\psi_{f,4}w_{2,\{1,2,4\}}$ (a linear combination of its first, second, and third coded symbols). Upon receiving these symbols, the $\left\{1,2\right\}$ -th missing symbols of node $f$ can be recovered from

[TABLE]

In general, $v$ symbols are repaired directly by communicating an identical copy of them, while $w$ symbols are repaired indirectly, using their parity equations. This is the general rule that we use for repair of all other missing symbols of node $f$ . It can be seen that each helper node participates in the repair of $\beta=3$ missing symbols, by sending one repair symbol for each. For instance, node $1$ contributes in the repair of symbols indexed by $\left\{1,2\right\}$ , $\left\{1,3\right\}$ , and $\left\{1,4\right\}$ . The repair equation sent by node $1$ for each these repair scenarios are listed below:

[TABLE]

Similarly, the repair symbols sent from helper nodes $2$ , $3$ , and $4$ are given by

[TABLE]

The repair symbols of helper node $h\in\{1,2,3,4\}$ in (20)-(23) could be obtain from $\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)}$ , which is the content of the helper nodes (i.e., $\mathbf{\Psi}_{h}\cdot\mathbf{D}$ ) times the repair encoder matrix for $m=2$ (i.e., $\mathbf{\Xi}^{f,(2)}$ ) defined in (11):

[TABLE]

Note that, even though this matrix has $\binom{4}{2-1}=4$ columns, and hence, $\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)}$ is a vector of length $4$ , it suffices to communicate only444Indeed the entire repair process can be expressed in terms of the $\beta=3$ repair symbols sent by the helper nodes. However, the recovery equations for the missing symbols are mathematically more symmetric and compact if we allow the fourth symbol to appear in the repair equations. $\beta=3$ symbols from the helper node to the failed node and the fourth symbol can be reconstructed from the other $3$ symbols at the failed node. This is due to the fact that the rank of matrix $\mathbf{\Xi}^{f,(2)}$ equals to $\beta=3$ . More precisely, a non-zero linear combination of the columns of $\mathbf{\Xi}^{f,(2)}$ is zero, that is,

[TABLE]

Therefore, (if $\psi_{f,4}\neq 0$ ) the helper node $h$ only sends the first $\beta=3$ symbols of the vector $\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)}$ , namely, $[\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)}]_{1}$ , $[\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)}]_{2}$ , and $[\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)}]_{3}$ , and the forth symbol $[\mathbf{\Psi}_{h}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)}]_{4}$ can be appended to it at node $f$ from

[TABLE]

Upon receiving the repair data from $d=4$ helper nodes $\{1,2,3,4\}$ , namely $\{\mathbf{\Psi}_{1}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)},\mathbf{\Psi}_{2}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)},\mathbf{\Psi}_{3}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)},\mathbf{\Psi}_{4}\cdot\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)}\}$ , the failed can stack them to obtain a matrix

[TABLE]

where the last identity is due to the fact that $\mathbf{\Psi}[\{1,2,3,4\},:]=\mathbf{I}$ is the identity matrix. We refer to this matrix by the repair space matrix of node $f$ , and denote it by $\mathbf{R}^{f,(2)}=\mathbf{D}\cdot\mathbf{\Xi}^{f,(2)}$ , as presented at the top of the next page.

Bibliography22

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. G. Dimakis, P. Godfrey, Y. Wu, M. J. Wainwright, and K. Ramchandran, “Network coding for distributed storage systems,” Information Theory, IEEE Transactions on , vol. 56, no. 9, pp. 4539–4551, 2010.
2[2] K. V. Rashmi, N. B. Shah, and P. V. Kumar, “Optimal exact-regenerating codes for distributed storage at the msr and mbr points via a product-matrix construction,” Information Theory, IEEE Transactions on , vol. 57, no. 8, pp. 5227–5239, 2011.
3[3] S.-J. Lin, W.-H. Chung, Y. S. Han, and T. Y. Al-Naffouri, “A unified form of exact-msr codes via product-matrix frameworks,” Information Theory, IEEE Transactions on , vol. 61, no. 2, pp. 873–886, 2015.
4[4] M. Ye and A. Barg, “Explicit constructions of high-rate mds array codes with optimal repair bandwidth,” IEEE Transactions on Information Theory , vol. 63, no. 4, pp. 2001–2014, 2017.
5[5] B. Sasidharan, M. Vajha, and P. V. Kumar, “An explicit, coupled-layer construction of a high-rate msr code with low sub-packetization level, small field size and all-node repair,” ar Xiv preprint ar Xiv:1607.07335 , 2016.
6[6] C. Tian, J. Li, and X. Tang, “A generic transformation for optimal repair bandwidth and rebuilding access in mds codes,” in Information Theory (ISIT), 2017 IEEE International Symposium on , 2017, pp. 1623–1627.
7[7] M. Ye and A. Barg, “Explicit constructions of optimal-access mds codes with nearly optimal sub-packetization,” IEEE Transactions on Information Theory , vol. 63, no. 10, pp. 6307–6317, 2017.
8[8] M. Elyasi, S. Mohajer, and R. Tandon, “Linear exact repair rate region of ( k + 1 , k , k ) 𝑘 1 𝑘 𝑘 (k+1,k,k) distributed storage systems: A new approach,” in Information Theory Proceedings (ISIT), 2015 IEEE International Symposium on . IEEE, 2015, pp. 2061–2065.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Determinant Codes with Helper-Independent Repair for Single and Multiple Failures

Abstract

I Introduction

I-A Determinant Codes and Helper-Independent Repair

I-B Multiple Failures Repair

II Main Result

Definition 1**.**

Theorem 1**.**

Remark 1**.**

Theorem 2**.**

Remark 2**.**

Remark 3**.**

Theorem 3**.**

Remark 4**.**

III Construction of (n,k=d,d)(n,k=d,d)(n,k=d,d) determinant codes

III-A Code Construction

III-B Single Failure Exact Repair

Proposition 1**.**

Proposition 2**.**

Remark 1**.**

III-C Multiple Failure Exact Repair

Proposition 3**.**

IV An Illustrative Example for (n=8,k=4,d=4)(n=8,k=4,d=4)(n=8,k=4,d=4) codes

IV-A Single Failure Repair

Definition 1.

Theorem 1.

Remark 1.

Theorem 2.

Remark 2.

Remark 3.

Theorem 3.

Remark 4.

III Construction of $(n,k=d,d)$ determinant codes

Proposition 1.

Proposition 2.

Remark 1.

Proposition 3.

IV An Illustrative Example for $(n=8,k=4,d=4)$ codes