A Decoding Approach to Reed-Solomon Codes from Their Definition

Maria Bras-Amor\'os

arXiv:1706.03504·cs.IT·June 13, 2017

A Decoding Approach to Reed-Solomon Codes from Their Definition

Maria Bras-Amor\'os

PDF

TL;DR

This paper presents a new, more intuitive decoding approach for Reed-Solomon codes, making error correction concepts more accessible for beginners by deriving the algorithm from fundamental definitions.

Contribution

A self-contained decoding method for Reed-Solomon codes based on polynomial interpolation degree, simplifying understanding for nonexperts.

Findings

01

Decoding algorithm derived from interpolation polynomial degree

02

Algorithm is more natural and easier to understand than classical methods

03

Related to Peterson-Gorenstein-Zierler algorithm

Abstract

Because of their importance in applications and their quite simple definition, Reed-Solomon codes can be explained in any introductory course on coding theory. However, decoding algorithms for Reed-Solomon codes are far from being simple and it is difficult to fit them in introductory courses for undergraduates. We introduce a new decoding approach, in a self-contained presentation, which we think may be appropriate for introducing error correction of Reed-Solomon codes to nonexperts. In particular, we interpret Reed-Solomon codes by means of the degree of the interpolation polynomial of the code words and from this derive a decoding algorithm. Compared to the classical algorithms, our algorithm appears to arise more naturally from definitions and to be easier to understand. It is related to the Peterson-Gorenstein-Zierler algorithm.

Equations110

G_{ex}=\left(\begin{array}[]{cccccc}1&1&1&1&1&1\\ 1&5&4&6&2&3\\ \end{array}\right).

G_{ex}=\left(\begin{array}[]{cccccc}1&1&1&1&1&1\\ 1&5&4&6&2&3\\ \end{array}\right).

\left(\begin{array}[]{cc}1&1\end{array}\right)G_{ex}=\left(\begin{array}[]{ccccccc}2&6&5&0&3&4\\ \end{array}\right),

\left(\begin{array}[]{cc}1&1\end{array}\right)G_{ex}=\left(\begin{array}[]{ccccccc}2&6&5&0&3&4\\ \end{array}\right),

\left(\begin{array}[]{cc}0&2\end{array}\right)G_{ex}=\left(\begin{array}[]{ccccccc}2&3&1&5&4&6\\ \end{array}\right),

\left(\begin{array}[]{cc}0&2\end{array}\right)G_{ex}=\left(\begin{array}[]{ccccccc}2&3&1&5&4&6\\ \end{array}\right),

\left(\begin{array}[]{cc}5&6\end{array}\right)G_{ex}=\left(\begin{array}[]{ccccccc}4&0&1&6&3&2\\ \end{array}\right).

\left(\begin{array}[]{cc}5&6\end{array}\right)G_{ex}=\left(\begin{array}[]{ccccccc}4&0&1&6&3&2\\ \end{array}\right).

H_{ex}=\left(\begin{array}[]{cccccc}1&5&4&6&2&3\\ 1&4&2&1&4&2\\ 1&6&1&6&1&6\\ 1&2&4&1&2&4\\ \end{array}\right)

H_{ex}=\left(\begin{array}[]{cccccc}1&5&4&6&2&3\\ 1&4&2&1&4&2\\ 1&6&1&6&1&6\\ 1&2&4&1&2&4\\ \end{array}\right)

V_{r}(\alpha_{1},\alpha_{2},\dots,\alpha_{n})=\left(\begin{array}[]{cccc}1&1&\dots&1\\ \alpha_{1}&\alpha_{2}&\dots&\alpha_{n}\\ \alpha_{1}^{2}&\alpha_{2}^{2}&\dots&\alpha_{n}^{2}\\ \vdots&\vdots&\ddots&\vdots\\ \alpha_{1}^{r-1}&\alpha_{2}^{r-1}&\dots&\alpha_{n}^{r-1}\\ \end{array}\right).

V_{r}(\alpha_{1},\alpha_{2},\dots,\alpha_{n})=\left(\begin{array}[]{cccc}1&1&\dots&1\\ \alpha_{1}&\alpha_{2}&\dots&\alpha_{n}\\ \alpha_{1}^{2}&\alpha_{2}^{2}&\dots&\alpha_{n}^{2}\\ \vdots&\vdots&\ddots&\vdots\\ \alpha_{1}^{r-1}&\alpha_{2}^{r-1}&\dots&\alpha_{n}^{r-1}\\ \end{array}\right).

G=\left(\begin{array}[]{ccccc}1&1&1&\dots&1\\ 1&\alpha&\alpha^{2}&\dots&\alpha^{n-1}\\ 1&\alpha^{2}&\alpha^{4}&\dots&\alpha^{2(n-1)}\\ \vdots&\vdots&\vdots&\ddots&\vdots\\ 1&\alpha^{k-1}&\alpha^{(k-1)2}&\dots&\alpha^{(k-1)(n-1)}\end{array}\right).

G=\left(\begin{array}[]{ccccc}1&1&1&\dots&1\\ 1&\alpha&\alpha^{2}&\dots&\alpha^{n-1}\\ 1&\alpha^{2}&\alpha^{4}&\dots&\alpha^{2(n-1)}\\ \vdots&\vdots&\vdots&\ddots&\vdots\\ 1&\alpha^{k-1}&\alpha^{(k-1)2}&\dots&\alpha^{(k-1)(n-1)}\end{array}\right).

H=\left(\begin{array}[]{ccccc}1&\alpha&\alpha^{2}&\dots&\alpha^{n-1}\\ 1&\alpha^{2}&\alpha^{4}&\dots&\alpha^{2(n-1)}\\ \vdots&\vdots&\vdots&\ddots&\vdots\\ 1&\alpha^{n-k}&\alpha^{(n-k)2}&\dots&\alpha^{(n-k)(n-1)}\end{array}\right).

H=\left(\begin{array}[]{ccccc}1&\alpha&\alpha^{2}&\dots&\alpha^{n-1}\\ 1&\alpha^{2}&\alpha^{4}&\dots&\alpha^{2(n-1)}\\ \vdots&\vdots&\vdots&\ddots&\vdots\\ 1&\alpha^{n-k}&\alpha^{(n-k)2}&\dots&\alpha^{(n-k)(n-1)}\end{array}\right).

\left|\begin{array}[]{cccc}\alpha^{j_{1}}&\alpha^{j_{2}}&\dots&\alpha^{j_{n-k}}\\ \alpha^{2j_{1}}&\alpha^{2j_{2}}&\dots&\alpha^{2j_{n-k}}\\ \alpha^{3j_{1}}&\alpha^{3j_{2}}&\dots&\alpha^{3j_{n-k}}\\ \vdots&\vdots&\ddots&\vdots\\ \alpha^{(n-k)j_{1}}&\alpha^{(n-k)j_{2}}&\dots&\alpha^{(n-k)j_{n-k}}\\ \end{array}\right|=\alpha^{j_{1}}\dots\alpha^{j_{n-k}}\cdot\left|V_{n}(\alpha^{j_{1}},\dots,\alpha^{j_{n-k}})\right|,

\left|\begin{array}[]{cccc}\alpha^{j_{1}}&\alpha^{j_{2}}&\dots&\alpha^{j_{n-k}}\\ \alpha^{2j_{1}}&\alpha^{2j_{2}}&\dots&\alpha^{2j_{n-k}}\\ \alpha^{3j_{1}}&\alpha^{3j_{2}}&\dots&\alpha^{3j_{n-k}}\\ \vdots&\vdots&\ddots&\vdots\\ \alpha^{(n-k)j_{1}}&\alpha^{(n-k)j_{2}}&\dots&\alpha^{(n-k)j_{n-k}}\\ \end{array}\right|=\alpha^{j_{1}}\dots\alpha^{j_{n-k}}\cdot\left|V_{n}(\alpha^{j_{1}},\dots,\alpha^{j_{n-k}})\right|,

\left(\begin{array}[]{cccccc}1&1&1&1&\dots&1\\ 1&\alpha&\alpha^{2}&\alpha^{3}&\dots&\alpha^{(n-1)}\\ 1&\alpha^{2}&\alpha^{4}&\alpha^{6}&\dots&\alpha^{2(n-1)}\\ 1&\alpha^{3}&\alpha^{6}&\alpha^{9}&\dots&\alpha^{3(n-1)}\\ \vdots&\vdots&\vdots&\vdots&\ddots&\vdots\\ 1&\alpha^{n-1}&\alpha^{2(n-1)}&\alpha^{3(n-1)}&\dots&\alpha^{(n-1)(n-1)}\\ \end{array}\right)\left(\begin{array}[]{c}a_{0}\\ a_{1}\\ \vdots\\ a_{n-1}\end{array}\right)=\left(\begin{array}[]{c}u_{0}\\ u_{1}\\ \vdots\\ u_{n-1}\end{array}\right).

\left(\begin{array}[]{cccccc}1&1&1&1&\dots&1\\ 1&\alpha&\alpha^{2}&\alpha^{3}&\dots&\alpha^{(n-1)}\\ 1&\alpha^{2}&\alpha^{4}&\alpha^{6}&\dots&\alpha^{2(n-1)}\\ 1&\alpha^{3}&\alpha^{6}&\alpha^{9}&\dots&\alpha^{3(n-1)}\\ \vdots&\vdots&\vdots&\vdots&\ddots&\vdots\\ 1&\alpha^{n-1}&\alpha^{2(n-1)}&\alpha^{3(n-1)}&\dots&\alpha^{(n-1)(n-1)}\\ \end{array}\right)\left(\begin{array}[]{c}a_{0}\\ a_{1}\\ \vdots\\ a_{n-1}\end{array}\right)=\left(\begin{array}[]{c}u_{0}\\ u_{1}\\ \vdots\\ u_{n-1}\end{array}\right).

\begin{array}[]{rcl}f_{0}&=&6x^{5}+6x^{4}+6x^{3}+6x^{2}+6x+6,\\ f_{1}&=&2x^{5}+3x^{4}+x^{3}+5x^{2}+4x+6,\\ f_{2}&=&3x^{5}+5x^{4}+6x^{3}+3x^{2}+5x+6,\\ f_{3}&=&x^{5}+6x^{4}+x^{3}+6x^{2}+x+6,\\ f_{4}&=&5x^{5}+3x^{4}+6x^{3}+5x^{2}+3x+6,\\ f_{5}&=&4x^{5}+5x^{4}+x^{3}+3x^{2}+2x+6.\\ \end{array}

\begin{array}[]{rcl}f_{0}&=&6x^{5}+6x^{4}+6x^{3}+6x^{2}+6x+6,\\ f_{1}&=&2x^{5}+3x^{4}+x^{3}+5x^{2}+4x+6,\\ f_{2}&=&3x^{5}+5x^{4}+6x^{3}+3x^{2}+5x+6,\\ f_{3}&=&x^{5}+6x^{4}+x^{3}+6x^{2}+x+6,\\ f_{4}&=&5x^{5}+3x^{4}+6x^{3}+5x^{2}+3x+6,\\ f_{5}&=&4x^{5}+5x^{4}+x^{3}+3x^{2}+2x+6.\\ \end{array}

\left(\begin{array}[]{cccccc}6&6&6&6&6&6\\ 6&4&5&1&3&2\\ 6&5&3&6&5&3\\ 6&1&6&1&6&1\\ 6&3&5&6&3&5\\ 6&2&3&1&5&4\\ \end{array}\right).

\left(\begin{array}[]{cccccc}6&6&6&6&6&6\\ 6&4&5&1&3&2\\ 6&5&3&6&5&3\\ 6&1&6&1&6&1\\ 6&3&5&6&3&5\\ 6&2&3&1&5&4\\ \end{array}\right).

\begin{array}[]{rcl}u(5^{1})=u(5)&=3+6+1+1+3&=0,\\ u(5^{2})=u(4)&=3+2+4+6+6&=0,\\ u(5^{3})=u(6)&=3+3+2+1+5&=0,\\ u(5^{4})=u(2)&=3+1+1+6+3&=0.\end{array}

\begin{array}[]{rcl}u(5^{1})=u(5)&=3+6+1+1+3&=0,\\ u(5^{2})=u(4)&=3+2+4+6+6&=0,\\ u(5^{3})=u(6)&=3+3+2+1+5&=0,\\ u(5^{4})=u(2)&=3+1+1+6+3&=0.\end{array}

\begin{array}[]{ccc}{\mathbb{F}}^{n}&\rightarrow&{\mathbb{F}}^{n}\\ (v_{0},\dots,v_{n-1})&\mapsto&(v(\alpha^{0}),v(\alpha),v(\alpha^{2}),\dots,v(\alpha^{n-1}))\end{array}

\begin{array}[]{ccc}{\mathbb{F}}^{n}&\rightarrow&{\mathbb{F}}^{n}\\ (v_{0},\dots,v_{n-1})&\mapsto&(v(\alpha^{0}),v(\alpha),v(\alpha^{2}),\dots,v(\alpha^{n-1}))\end{array}

\begin{array}[]{ccc}(-u(\alpha^{n}),-u(\alpha^{n-1}),-u(\alpha^{n-2}),\dots,-u(\alpha))&\mathrel{\reflectbox{$\mapsto$}}&(u_{0},\dots,u_{n-1}),\end{array}

\begin{array}[]{ccc}(-u(\alpha^{n}),-u(\alpha^{n-1}),-u(\alpha^{n-2}),\dots,-u(\alpha))&\mathrel{\reflectbox{$\mapsto$}}&(u_{0},\dots,u_{n-1}),\end{array}

f_{u}

f_{u}

\begin{array}[]{rcl}u(5^{1})=u(5)&=5+6+6+4&=0,\\ u(5^{2})=u(4)&=5+2+1+1&=2,\\ u(5^{3})=u(6)&=5+3+6+2&=2,\\ u(5^{4})=u(2)&=5+1+1+4&=4,\\ u(5^{5})=u(3)&=5+5+6+1&=3,\\ u(5^{6})=u(1)&=5+4+1+2&=5.\end{array}

\begin{array}[]{rcl}u(5^{1})=u(5)&=5+6+6+4&=0,\\ u(5^{2})=u(4)&=5+2+1+1&=2,\\ u(5^{3})=u(6)&=5+3+6+2&=2,\\ u(5^{4})=u(2)&=5+1+1+4&=4,\\ u(5^{5})=u(3)&=5+5+6+1&=3,\\ u(5^{6})=u(1)&=5+4+1+2&=5.\end{array}

\begin{array}[]{rcl}v(5^{0})=v(1)&=2+4+3+5+5&=5,\\ v(5^{1})=v(5)&=2+6+5+2+3&=4,\\ v(5^{2})=v(4)&=2+2+6+5+6&=0,\\ v(5^{3})=v(6)&=2+3+3+2+5&=1,\\ v(5^{4})=v(2)&=2+1+5+5+3&=2,\\ v(5^{5})=v(3)&=2+5+6+2+6&=0,\\ \end{array}

\begin{array}[]{rcl}v(5^{0})=v(1)&=2+4+3+5+5&=5,\\ v(5^{1})=v(5)&=2+6+5+2+3&=4,\\ v(5^{2})=v(4)&=2+2+6+5+6&=0,\\ v(5^{3})=v(6)&=2+3+3+2+5&=1,\\ v(5^{4})=v(2)&=2+1+5+5+3&=2,\\ v(5^{5})=v(3)&=2+5+6+2+6&=0,\\ \end{array}

g_{c} = g_{u} - g_{h_{u}} .

g_{c} = g_{u} - g_{h_{u}} .

Λ = {λ \in F_{q} [x] \mbox s u c h t ha t λ (h_{u} + g) \mbox v ani s h es a t a l l F_{q} ∖ {0}, \mbox f or so m e g \in F_{q} [x]^{< k}} .

Λ = {λ \in F_{q} [x] \mbox s u c h t ha t λ (h_{u} + g) \mbox v ani s h es a t a l l F_{q} ∖ {0}, \mbox f or so m e g \in F_{q} [x]^{< k}} .

Λ = {λ \in F_{q} [x] \mbox s u c h t ha t (x^{n} - 1) \mbox d i v i d es λ (h_{u} + g) \mbox f or so m e g \in F_{q} [x]^{< k}} .

Λ = {λ \in F_{q} [x] \mbox s u c h t ha t (x^{n} - 1) \mbox d i v i d es λ (h_{u} + g) \mbox f or so m e g \in F_{q} [x]^{< k}} .

Λ_{g} = {λ \in F_{q} [x] \mbox s u c h t ha t (x^{n} - 1) \mbox d i v i d es λ (h_{u} + g)}

Λ_{g} = {λ \in F_{q} [x] \mbox s u c h t ha t (x^{n} - 1) \mbox d i v i d es λ (h_{u} + g)}

γ \in F_{q} ∖ {0} (h_{u} + g) (γ) \neq = 0 \prod (x - γ) .

γ \in F_{q} ∖ {0} (h_{u} + g) (γ) \neq = 0 \prod (x - γ) .

\begin{array}[]{ccccccccccc}\xi_{k+t+i}&=&a_{k+t+i}l_{0}&+&a_{k+t+i-1}l_{1}&+&\cdots&+&a_{k+i+1}l_{t-1}&+&a_{k+i},\\ \end{array}

\begin{array}[]{ccccccccccc}\xi_{k+t+i}&=&a_{k+t+i}l_{0}&+&a_{k+t+i-1}l_{1}&+&\cdots&+&a_{k+i+1}l_{t-1}&+&a_{k+i},\\ \end{array}

ξ_{k + t + i} = - u (α^{n - k - t - i}) l_{0} - u (α^{n - k - t - i + 1}) l_{1} - \dots - u (α^{n - k - i}) .

ξ_{k + t + i} = - u (α^{n - k - t - i}) l_{0} - u (α^{n - k - t - i + 1}) l_{1} - \dots - u (α^{n - k - i}) .

\left(\begin{array}[]{cccc}u(\alpha)&u(\alpha^{2})&\dots&u(\alpha^{t})\\ u(\alpha^{2})&u(\alpha^{3})&\dots&u(\alpha^{t+1})\\ \vdots&\vdots&\ddots&\vdots\\ u(\alpha^{n-k-t})&u(\alpha^{n-k-t+1})&\dots&u(\alpha^{n-k-1})\\ \end{array}\right)\left(\begin{array}[]{c}l_{0}\\ l_{1}\\ \vdots\\ l_{t-1}\end{array}\right)=\left(\begin{array}[]{c}-u(\alpha^{t+1})\\ -u(\alpha^{t+2})\\ \vdots\\ -u(\alpha^{n-k})\end{array}\right).

\left(\begin{array}[]{cccc}u(\alpha)&u(\alpha^{2})&\dots&u(\alpha^{t})\\ u(\alpha^{2})&u(\alpha^{3})&\dots&u(\alpha^{t+1})\\ \vdots&\vdots&\ddots&\vdots\\ u(\alpha^{n-k-t})&u(\alpha^{n-k-t+1})&\dots&u(\alpha^{n-k-1})\\ \end{array}\right)\left(\begin{array}[]{c}l_{0}\\ l_{1}\\ \vdots\\ l_{t-1}\end{array}\right)=\left(\begin{array}[]{c}-u(\alpha^{t+1})\\ -u(\alpha^{t+2})\\ \vdots\\ -u(\alpha^{n-k})\end{array}\right).

\left(\begin{array}[]{cccc}u(\alpha)&u(\alpha^{2})&\dots&u(\alpha^{t^{\prime}})\\ u(\alpha^{2})&u(\alpha^{3})&\dots&u(\alpha^{t^{\prime}+1})\\ \vdots&\vdots&\ddots&\vdots\\ u(\alpha^{n-k-t^{\prime}})&u(\alpha^{n-k-t^{\prime}+1})&\dots&u(\alpha^{n-k-1})\\ \end{array}\right)\left(\begin{array}[]{c}l_{0}\\ l_{1}\\ \vdots\\ l_{t^{\prime}-1}\end{array}\right)=\left(\begin{array}[]{c}-u(\alpha^{t^{\prime}+1})\\ -u(\alpha^{t^{\prime}+2})\\ \vdots\\ -u(\alpha^{n-k})\end{array}\right).

\left(\begin{array}[]{cccc}u(\alpha)&u(\alpha^{2})&\dots&u(\alpha^{t^{\prime}})\\ u(\alpha^{2})&u(\alpha^{3})&\dots&u(\alpha^{t^{\prime}+1})\\ \vdots&\vdots&\ddots&\vdots\\ u(\alpha^{n-k-t^{\prime}})&u(\alpha^{n-k-t^{\prime}+1})&\dots&u(\alpha^{n-k-1})\\ \end{array}\right)\left(\begin{array}[]{c}l_{0}\\ l_{1}\\ \vdots\\ l_{t^{\prime}-1}\end{array}\right)=\left(\begin{array}[]{c}-u(\alpha^{t^{\prime}+1})\\ -u(\alpha^{t^{\prime}+2})\\ \vdots\\ -u(\alpha^{n-k})\end{array}\right).

\left(\begin{array}[]{cccc}u(\alpha)&u(\alpha^{2})&\dots&u(\alpha^{t})\\ u(\alpha^{2})&u(\alpha^{3})&\dots&u(\alpha^{t+1})\\ \vdots&\vdots&\ddots&\vdots\\ u(\alpha^{t})&u(\alpha^{t+1})&\dots&u(\alpha^{2t-1})\\ \end{array}\right)\left(\begin{array}[]{c}l_{0}\\ l_{1}\\ \vdots\\ l_{t-1}\end{array}\right)=\left(\begin{array}[]{c}-u(\alpha^{t+1})\\ -u(\alpha^{t+2})\\ \vdots\\ -u(\alpha^{2t})\end{array}\right).

\left(\begin{array}[]{cccc}u(\alpha)&u(\alpha^{2})&\dots&u(\alpha^{t})\\ u(\alpha^{2})&u(\alpha^{3})&\dots&u(\alpha^{t+1})\\ \vdots&\vdots&\ddots&\vdots\\ u(\alpha^{t})&u(\alpha^{t+1})&\dots&u(\alpha^{2t-1})\\ \end{array}\right)\left(\begin{array}[]{c}l_{0}\\ l_{1}\\ \vdots\\ l_{t-1}\end{array}\right)=\left(\begin{array}[]{c}-u(\alpha^{t+1})\\ -u(\alpha^{t+2})\\ \vdots\\ -u(\alpha^{2t})\end{array}\right).

\left(\begin{array}[]{cccc}u(\alpha)&u(\alpha^{2})&\dots&u(\alpha^{t})\\ u(\alpha^{2})&u(\alpha^{3})&\dots&u(\alpha^{t+1})\\ \vdots&\vdots&\ddots&\vdots\\ u(\alpha^{t})&u(\alpha^{t+1})&\dots&u(\alpha^{2t-1})\\ \end{array}\right)

\left(\begin{array}[]{cccc}u(\alpha)&u(\alpha^{2})&\dots&u(\alpha^{t})\\ u(\alpha^{2})&u(\alpha^{3})&\dots&u(\alpha^{t+1})\\ \vdots&\vdots&\ddots&\vdots\\ u(\alpha^{t})&u(\alpha^{t+1})&\dots&u(\alpha^{2t-1})\\ \end{array}\right)

\left(\begin{array}[]{ccc}u(\alpha)&\dots&u(\alpha^{t})\\ \vdots&\ddots&\vdots\\ u(\alpha^{t})&\dots&u(\alpha^{2t-1})\end{array}\right)=\left(\begin{array}[]{ccc}e(\alpha)&\dots&e(\alpha^{t})\\ \vdots&\ddots&\vdots\\ e(\alpha^{t})&\dots&e(\alpha^{2t-1})\end{array}\right).

\left(\begin{array}[]{ccc}u(\alpha)&\dots&u(\alpha^{t})\\ \vdots&\ddots&\vdots\\ u(\alpha^{t})&\dots&u(\alpha^{2t-1})\end{array}\right)=\left(\begin{array}[]{ccc}e(\alpha)&\dots&e(\alpha^{t})\\ \vdots&\ddots&\vdots\\ e(\alpha^{t})&\dots&e(\alpha^{2t-1})\end{array}\right).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A Decoding Approach to Reed–Solomon Codes

from Their Definition

Maria Bras-Amorós M. Bras-Amorós is with Universitat Rovira i Virgili, Tarragona, Catalonia (e-mail: [email protected])

Abstract

Because of their importance in applications and their quite simple definition, Reed–Solomon codes can be explained in any introductory course on coding theory. However, decoding algorithms for Reed–Solomon codes are far from being simple and it is difficult to fit them in introductory courses for undergraduates. We introduce a new decoding approach, in a self-contained presentation, which we think may be appropriate for introducing error correction of Reed–Solomon codes to nonexperts. In particular, we interpret Reed–Solomon codes by means of the degree of the interpolation polynomial of the code words and from this derive a decoding algorithm. Compared to the classical algorithms, our algorithm appears to arise more naturally from definitions and to be easier to understand. It is related to the Peterson–Gorenstein–Zierler algorithm (see [10] and [20]).

1 Introduction.

Error control codes are used to detect and correct errors that may occur in data transmission or storage through eventually defective channels or storage devices that can distort the sent or stored information. For example, the atmosphere introduces errors in the transmission of images from the Meteosat satellite to Earth, different interferences in communications by mobile phone may cause transmission errors, or reading devices need correcting algorithms for handling CDs, DVDs, or USB memories. Error control codes are also used in distributed data storage in the cloud to recover lost or damaged chunks of information. In the words of Elwyn R. Berlekamp, one of the fathers of coding theory,

Communication links transmit information from here to there. Computer memories transmit information from now to then. In either case, noise causes the received data to differ slightly from the original data. As Shannon [24] showed in 1948, the noise need not cause any degradation in reliability. The noise does impose some limiting capacity on the throughput rate, although that limit is typically well above the throughput rate at which real systems operate. Error-correcting codes enable a system to achieve a high degree of reliability despite the presence of noise [2].

The modus operandi of those codes is to send along with the original information a small amount of redundancy, so that from all the received information one can deduce what is actually transmitted. The simplest example is adding for every transmitted bit (a [math] or a $1$ ) two identical copies. If the original bit or one of its copies is received with an error, we can still correct it from the other two, which we expect to coincide. Note that by adding redundancy, on one side we improve the quality of the received information. But, on the other side, we augment the transmission cost. In the example of repeating bits, the transmission cost is multiplied by three.

Coding theory aims at designing and implementing codes with good correcting capacity, while maintaining a low transmission cost, as well as designing detection and correction algorithms that allow the receiver to recover the original information.

Berlekamp’s reference [2] gives a detailed historical review (up to 1980) of coding theory since Shannon’s cornerstone contribution [24]. At that time, the so-called Reed–Solomon codes [21] and the most relevant algorithms for decoding Reed–Solomon codes had already appeared. They are the most universal error control codes and are currently being used directly or indirectly in most transmission devices and storage systems. Reed–Solomon codes admit different definitions as will be explained in this article, and they are all based in polynomials of bounded degrees over a finite field. One way to explain how they work is as follows. Fix a finite field of cardinality $q$ . From the data one wants to transmit (say $k$ elements of ${\mathbb{F}}_{q}$ ), one interpolates a polynomial of degree less than $k$ that takes these values when evaluated at $k$ given nonzero elements of the finite field. Then one adds to the original $k$ information values the redundancy which consists of the evaluation of the polynomial at the remaining $q-1-k$ nonzero values of the finite field. Basic polynomial theory shows how any small part of the whole $(q-1)$ -length of the received information can be restored from the rest.

Because of their importance in applications and their quite simple definition, Reed–Solomon codes can be explained in any introductory course. However, decoding algorithms for Reed–Solomon codes are far from being so simple and it is difficult to explain them in introductory courses for undergraduates. This is why we introduce our new decoding approach, in a self-contained presentation, which we think may be appropriate to introduce error correction of Reed–Solomon codes to nonexperts. Although a direct implementation of the algorithm presented in this article may not be as efficient as the most efficient known algorithms, we think that it is performable by any undergraduate student using basic software tools. However, we do not rule out the possibility that technical improvements to the algorithm may make it much more efficient, especially if one can deemphasize matrices in favor of polynomials.

The most celebrated algorithms for decoding Reed–Solomon codes have been the Peterson–Gorenstein–Zierler algorithm [20, 10] for its simplicity, and the algorithms designed to solve Berlekamp’s key equation [1]. The two primary decoding algorithms that solve Berlekamp’s key equation are the Berlekamp-Massey algorithm [15] and the Sugiyama et al. adaptation of the Euclidean algorithm [26]. The alternative so-called Welch–Berlekamp equations are solved in the Welch–Berlekamp algorithm [27]. Bit-serialized multiplication and bit-serial encoders are more efficient for hardware implementation of shift registers [3]. This is used in the algorithm in [4]. The Welch–Berlekamp equations were also solved by Chambers’ algorithm [7] and by Fedorenko’s algorithm [9]. Another general perspective is that of decoding pairs [18, 19]. Guruswami and Sudan presented their breakout algorithm [25, 11] decoding beyond half the minimum distance by means of list decoding. All these algorithms and their relationships are analyzed in several papers such as [8, 12, 16, 6, 17].

In Section 3 we revisit the definition of Reed–Solomon codes, giving four different, but equivalent, versions. Reinterpreting a definition related to the degree of the interpolation polynomial, we derive a decoding algorithm. The key result for the new formulation is Theorem 20 in Section 4. Now, for correcting a received word, its interpolation polynomial is split into two parts, one with low order terms (lower than the code dimension) and the other one with the remaining terms. The latter part is fixed while the first part is modified in order to maximize the number of nonzero roots. This gives the code word at minimum distance from the received word.

Our decoding algorithm is related to the Peterson–Gorenstein–Zierler algorithm. We compare both algorithms in Sections 6 and 7 and see how our algorithm is well suited for the optimistic view of best case decoding [4]. This is the case when error correction codes of high correction capability are used, but with a low expectation of errors.

2 Some background on coding theory.

Let us start with some background definitions and results on coding theory. Standard references are [14, 22, 13, 5, 23].

The alphabet ${\mathbb{F}}_{q}$ .

The symbols that contain the information that needs to be sent as well as the symbols corresponding to the transformed and transmitted data are the elements of a finite field, which is often called the transmission alphabet. One can consider the case in which ${\mathbb{F}}_{q}$ is a prime field, that is, $q$ is a prime number and ${\mathbb{F}}_{q}$ can be identified by the set $\{0,1,\dots,q-1\}$ , equipped with the usual addition and product modulo $q$ . There will always exist an element $\alpha$ in ${\mathbb{F}}_{q}$ such that all the powers of $\alpha$ with exponent smaller than $q-1$ are different. Then, ${\mathbb{F}}_{q}=\{0,1,\alpha,\alpha^{2},\dots,\alpha^{q-2}\}$ . In this case, $\alpha$ is called a primitive element.

Example 1.

Consider ${\mathbb{F}}_{7}$ . It is the set $\{0,1,2,3,4,5,6\}$ equipped with the addition and multiplication operations, always modulo $7$ . For instance, in ${\mathbb{F}}_{7}$ , $4+5=2$ , $1-2=6$ , $3\cdot 5=5$ . It is easy to verify that $\alpha=5$ is a primitive element of ${\mathbb{F}}_{7}$ .

Linear codes.

A linear code $C$ of length $n$ over a finite field ${\mathbb{F}}_{q}$ is a vector subspace of ${\mathbb{F}}_{q}^{n}$ . Its vectors are called code words. The dimension $k$ of the code is the dimension of the subspace. In particular, the number of code words of $C$ is $q^{k}$ .

Generator matrices.

We say that a matrix $G$ of $k$ rows and $n$ columns is a generator matrix of $C$ if its rows are a set of vectors generating the code. The generator matrix is not unique, for example we can permute the rows. To encode a word of $k$ symbols of ${\mathbb{F}}_{q}$ , we multiply it by the generator matrix.

Example 2.

The following matrix is the generator matrix of a code $C$ of length $6$ and dimension $2$ over ${\mathbb{F}}_{7}$ .

[TABLE]

To encode the information $110256$ we split it into blocks of $k=2$ symbols and multiply each block by $G_{ex}$ .

[TABLE]

The encoded information will then be $265034231546401632$ .

Dual code and parity-check matrices.

Consider the scalar product of two vectors $(u_{0},u_{1},\dots,u_{n-1})$ and $(v_{0},v_{1},\dots,v_{n-1})$ of ${\mathbb{F}}_{q}^{n}$ , defined as $u_{0}v_{0}+u_{1}v_{1}+\dots+u_{n-1}v_{n-1}\in{\mathbb{F}}_{q}$ . The dual code of $C$ is $C^{\perp}=\{v\in{\mathbb{F}}_{q}^{n}:v\cdot c=0\mbox{ for all }c\in C\}$ . It is a linear code with the same length as $C$ and dimension $n-k$ . We can define it from a system of linear equations with coefficient matrix $G$ . A matrix $H$ generating $C^{\perp}$ is called a parity-check-matrix of $C$ . Equivalently, a parity-check matrix of $C$ is a matrix such that the code $C$ can be redefined as $C=\{c\in{\mathbb{F}}_{q}^{n}:c\cdot h=0\mbox{ for every row }h\mbox{ of }H\}$ .

Example 3.

The following matrix is a parity-check matrix of the code $C$ of Example 2.

[TABLE]

Hamming distance, correction capability, and Singleton bound.

The Hamming distance between two words of the same length is the number of positions in which their symbols differ. The purpose of decoding algorithms is, given an input vector $u$ of the same length as the code, output a code word $c\in C$ minimizing the Hamming distance between $u$ and $c$ . The weight of a word is the number of nonzero symbols or, equivalently, its Hamming distance from the zero vector. The minimum distance $d$ of a linear code $C$ can be equivalently defined as (i) the minimum Hamming distance between two words of $C$ ; (ii) the minimum weight of nonzero words of $C$ ; (iii) the minimum number of linearly dependent columns of $H$ . The minimum distance of a code is an important parameter quantifying the error correction capability of the code. Indeed, if at most $\lfloor\frac{d-1}{2}\rfloor$ errors are added to a code word $c\in C$ , corrupting it into a word $u$ , then $c$ is the unique code word of $C$ at Hamming distance at most $\lfloor\frac{d-1}{2}\rfloor$ from $u$ , and in this sense we say that $\lfloor\frac{d-1}{2}\rfloor$ errors can be corrected.

The Singleton bound states that for a linear code of length $n$ and minimum distance $d$ , the dimension $k$ satisfies $k\leq n-d+1.$ The codes attaining this bound are called maximum distance separable codes (MDS).

Example 4.

The Hamming distance between the code words $265034$ and $231546$ of the code $C$ in Example 2 is $5$ . The Hamming distance between the code word $111111$ corresponding to the first row of $G_{ex}$ and $265034$ is $6$ .

Notice that the elements of the first row of the generator matrix of $C$ are all equal while the elements of the second row are all different. Any code word of $C$ will be a multiple of the second row plus a multiple of the first row. The components of any multiple of the second row, except for the zero vector, will all be different by field properties. Similarly, if we add to a vector whose components are all different a constant vector, then the components of the vector so obtained will also all be different by field properties. So, for any vector in $C$ , either it is constant or all its components are different. This makes the Hamming distance between any two words in $C$ either equal to $6$ or to $6-1=5$ . Consequently, the minimum distance of $C$ is $5$ .

Vandermonde matrices.

Although Vandermonde matrices can be defined over any field, for our purposes we concentrate on finite fields. Given $\alpha_{1},\alpha_{2},\dots,\alpha_{n}\in{\mathbb{F}}_{q}$ , the Vandermonde matrix of $\alpha_{1},\dots,\alpha_{n}$ of order $r$ is defined as

[TABLE]

It can be proved that the determinant of $V_{n}(\alpha_{1},\alpha_{2},\dots,\alpha_{n})$ satisfies $\left|V(\alpha_{1},\alpha_{2},\dots,\alpha_{n})\right|=\prod_{1\leq i<j\leq n}(\alpha_{j}-\alpha_{i}).$ Consequently, $V_{n}(\alpha_{1},\alpha_{2},\dots,\alpha_{n})$ has an inverse matrix if and only if $\alpha_{i}\neq\alpha_{j}$ for all $1\leq i<j\leq n$ .

3 Four definitions of Reed–Solomon codes.

Let us introduce Reed–Solomon codes from four different, but complementary, points of view.

3.1 Reed–Solomon codes from generator matrices.

Let ${\mathbb{F}}_{q}$ be the field with $q$ elements ( $q$ a prime power) and let $\alpha$ be a primitive element of ${\mathbb{F}}_{q}$ . Then ${\mathbb{F}}_{q}=\{0,1,\alpha,\alpha^{2},\dots,\alpha^{q-2}\}$ . Let $n=q-1$ .

Definition 5.

The Reed–Solomon code over ${\mathbb{F}}_{q}$ and of dimension $k$ , $RS_{q,\alpha}(k)$ , is the linear code of ${\mathbb{F}}_{q}^{n}$ with generator matrix

[TABLE]

Example 6.

Consider the finite field ${\mathbb{F}}_{7}$ . As noted above, the element $5\in{\mathbb{F}}_{7}$ is primitive. Indeed, $5^{0}=1$ , $5^{1}=5$ , $5^{2}=4$ , $5^{3}=6$ , $5^{4}=2$ , $5^{5}=3$ and $5^{6}$ is again $1$ . The code $RS_{7,5}(2)$ is exactly the code $C$ of Example 2.

3.2 Reed–Solomon codes from parity-check matrices.

Consider the matrix

[TABLE]

It is a matrix of maximum rank (namely $n-k$ ) because of the Vandermonde structure. Furthermore, the product of matrix $G$ and the transpose of matrix $H$ is the zero matrix. Indeed, the product of the $i$ th row of matrix $G$ ( $1\leq i\leq k$ ) times the $j$ th row of matrix $H$ ( $1\leq j\leq n-k$ ) is $\sum_{r=1}^{n}\alpha^{(i-1)(r-1)}\alpha^{j(r-1)}=\sum_{r=1}^{n}\alpha^{(i+j-1)(r-1)}.$ Now, because of the limits of $i$ and $j$ , we have that $i+j-1<q-1$ and so $\alpha^{i+j-1}\neq 1$ . Finally, the sum equals $\frac{(\alpha^{i+j-1})^{n}-1}{\alpha^{i+j-1}-1}=0$ .

This enables us to give the following equivalent definition.

Definition 7.

The Reed–Solomon code over ${\mathbb{F}}_{q}$ and of dimension $k$ , $RS_{q,\alpha}(k)$ , is the linear code of ${\mathbb{F}}_{q}^{n}$ with parity-check matrix equal to $H$ .

Example 8.

One can check that matrix $H_{ex}$ in Example 3 is of the form of the matrix $H$ in (1).

Lemma 9 uses Definition 7 to deduce that Reed–Solomon codes attain the Singleton bound, and so they are maximum distance separable (MDS) codes.

Lemma 9.

The minimum distance of $RS_{q,\alpha}(k)$ is exactly $n-k+1$ . Hence, it is an MDS code.

Proof.

The submatrix given by any subset of $n-k$ columns (with column indices $0\leq j_{1},\dots,j_{n-k}\leq{n-1}$ ) has determinant

[TABLE]

which is not zero. So, any set of $n-k$ columns of the parity-check matrix are independent, and so the minimum distance must be at least $n-k+1$ . By the Singleton bound the minimum distance must be exactly equal to $n-k+1$ . ∎

Example 10.

The minimum distance of $RS_{7,5}(2)$ is $5$ as justified in Example 4. This equals $n-k+1=6-2+1$ .

3.3 Reed–Solomon codes and interpolation polynomials.

Consider the set ${\mathbb{F}}_{q}[x]^{<k}$ of all polynomials with coefficients in ${\mathbb{F}}_{q}$ and of degree strictly less than $k$ . A general element $a\in{\mathbb{F}}_{q}[x]^{<k}$ is of the form $a=a_{0}+a_{1}x+\dots+a_{k-1}x^{k-1}$ with $a_{i}\in{\mathbb{F}_{q}}$ . Observe that evaluating $a$ at $\alpha^{i-1}$ gives $a(\alpha^{i-1})=a_{0}+a_{1}\alpha^{i-1}+\dots+a_{k-1}\alpha^{(i-1)(k-1)},$ which is exactly the result of the product of the vector $(a_{0},\dots,a_{k-1})$ by the $i$ th column of matrix $G$ . So, the product of the vector $(a_{0},\dots,a_{k-1})$ by matrix $G$ is exactly the vector $(a(1),a(\alpha),a(\alpha^{2}),\dots,a(\alpha^{n-1})).$

Definition 11.

The Reed–Solomon code over ${\mathbb{F}}_{q}$ and of dimension $k$ , $RS_{q,\alpha}(k)$ , is the set $\{(a(1),a(\alpha),a(\alpha^{2}),\dots,a(\alpha^{n-1})):a\in{\mathbb{F}}_{q}[x]^{<k}\}.$

Example 12.

The three code words computed in Example 2, which are $265034$ , $231546$ , and $401632$ are, respectively, the evaluation of the polynomials $x+1$ , $2x$ , and $6x+5$ at $5^{0},5^{1},5^{2},5^{3},5^{4},5^{5}$ . The degree of the three polynomials is less than $2$ which is the dimension of the code.

Now, for each vector $u=(u_{0},\dots,u_{n-1})$ in ${\mathbb{F}}_{q}^{n}$ , there exists a unique polynomial $f_{u}$ of degree at most $n-1$ such that $f_{u}(\alpha^{i})=u_{i}$ for all $i$ in $\{0,\dots,n-1\}$ . It can be computed using the formula $f_{u}=\sum_{i=0}^{n-1}u_{i}f_{i}$ , where $f_{i}$ is the interpolation polynomial of the $i$ th standard basis vector, that is, $f_{i}=\prod_{\begin{subarray}{c}j=0\\ j\neq i\end{subarray}}^{n-1}\frac{x-\alpha^{j}}{\alpha^{i}-\alpha^{j}}.$ The uniqueness of $f_{u}$ is a consequence of the fact that if $f_{u}=a_{0}+a_{1}x+\dots+a_{n-1}x^{n-1}$ , then the coefficients $a_{0},\dots,a_{n-1}$ are a solution of the linear system of equations

[TABLE]

The matrix of this system is a square Vandermonde matrix which is known to be invertible. So, any $u$ in ${\mathbb{F}}_{q}^{n}$ is of the form $(f(1),f(\alpha),f(\alpha^{2}),\dots,f(\alpha^{n-1}))$ for some unique $f\in{\mathbb{F}}_{q}[x]$ of degree less than $n$ .

Example 13.

In ${\mathbb{F}}_{7}$ , taking $\alpha=5$ as primitive element, we have

[TABLE]

Then, for a general vector $u\in{\mathbb{F}}_{7}^{6}$ , the coefficients of $f_{u}$ (in increasing order) can be computed as the product of $u$ by the matrix

[TABLE]

For instance, the coefficients of the polynomial interpolating $u=(4,2,1,6,3,2)$ are $(3,0,3,2,6,4)$ , and the coefficients of the polynomial interpolating $w=(0,2,5,6,0,6)$ are $(2,2,2,2,6,0)$ .

Code word checking.

From Definition 11, a vector $u=(u_{0},\dots,u_{n-1})$ in ${\mathbb{F}}_{q}^{n}$ is a code word if and only if its interpolation polynomial $f_{u}$ satisfies $\deg(f_{u})<k$ .

Example 14.

The words $265034$ , $231546$ , and $401632$ are code words of $RS_{7,5}(2)$ because, as seen in Example 12, their interpolation polynomials are, respectively, $x+1$ , $2x$ , and $6x+5$ , whose degrees are less than $k=2$ . The words $421632$ and $025606$ are not code words of $RS_{7,5}(2)$ because, as seen in Example 13, their interpolation polynomials are, respectively, $6x^{5}+6x^{3}+5x^{2}+2x$ and $6x^{4}+2x^{3}+2x^{2}+2x+2$ , whose degrees are larger than $k=2$ .

3.4 Reed–Solomon codes and polynomial evaluation.

Consider now the set ${\mathbb{F}}_{q}[x]^{<n}$ of all polynomials with coefficients in ${\mathbb{F}}_{q}$ and degree strictly less than $n$ . A general element $u\in{\mathbb{F}}_{q}[x]^{<n}$ is of the form $u=u_{0}+u_{1}x+\dots+u_{n-1}x^{n-1}$ with $u_{i}\in{\mathbb{F}_{q}}$ . Observe that evaluating $u$ at $\alpha^{i}$ gives $u(\alpha^{i})=u_{0}+u_{1}\alpha^{i}+\dots+u_{n-1}\alpha^{i(n-1)},$ which, if $i\leq n-k$ , is exactly the result of the product of the $i$ th row of matrix $H$ and vector $(u_{0},\dots,u_{n-1})^{T}$ . The value $u(\alpha^{i})$ , if $i\leq n-k$ , is called the $i$ th syndrome of $u$ with respect to $C$ . Now, the product of matrix $H$ and vector $(u_{0},\dots,u_{n-1})^{T}$ is exactly the vector $(u(\alpha),u(\alpha^{2}),\dots,u(\alpha^{n-k})),$ which is called the syndrome vector of $u$ with respect to $C$ . On the other hand, by definition of parity-check matrix, $(u_{0},\dots,u_{n-1})$ is a code word if and only if the product of matrix $H$ and $(u_{0},\dots,u_{n-1})^{T}$ is zero.

Definition 15.

The Reed–Solomon code over ${\mathbb{F}}_{q}$ and of dimension $k$ , $RS_{q,\alpha}(k)$ , is the set of vectors $u=(u_{0},\dots,u_{n-1})$ in ${\mathbb{F}}_{q}^{n}$ such that the polynomial $u_{0}+u_{1}x+\dots+u_{n-1}x^{n-1}$ vanishes at $\alpha^{j}$ for all $j$ with $1\leq j\leq n-k$ .

Code word checking.

Now, given a vector $u=(u_{0},\dots,u_{n-1})$ in ${\mathbb{F}}_{q}^{n}$ , $u$ is a code word if and only if $u(\alpha^{i})=0$ for all $i$ with $1\leq i\leq n-k$ .

Example 16.

Suppose we want to check whether the word $342650$ belongs to $RS_{7,5}(2)$ . We consider the polynomial $u(x)=3+4x+2x^{2}+6x^{3}+5x^{4}$ and evaluate it at $5$ , $5^{2}$ , $5^{3}$ and $5^{4}$ . We obtain

[TABLE]

Since $u(5)=u(5^{2})=u(5^{3})=u(5^{4})=0$ , the word $342650$ belongs to $RS_{7,5}(2)$ .

3.5 Connection of the coefficients of an interpolation polynomial and its evaluation at all points.

Next we will see that the coefficients of an interpolation polynomial over a finite field are intimately related to the values obtained when evaluating the polynomial at all the nonzero elements of the finite field.

Lemma 17.

Suppose that $\alpha$ is a primitive element of a finite field of $q$ elements and let $n=q-1$ . The polynomials $f_{i}=\prod_{\begin{subarray}{c}j=0\\ j\neq i\end{subarray}}^{n-1}\frac{x-\alpha^{j}}{\alpha^{i}-\alpha^{j}}$ satisfy $f_{i}=-(\alpha^{i}x^{n-1}+\alpha^{2i}x^{n-2}+\alpha^{3i}x^{n-3}+\dots+\alpha^{(n-1)i}x+\alpha^{ni}).$

Proof.

Suppose $\beta\in{\mathbb{F}}_{q}\setminus\{0\}$ . From the equality $(x-\beta)(x^{n-1}+\beta x^{n-2}+\beta^{2}x^{n-3}+\dots+\beta^{n-2}x+\beta^{n-1})=x^{n}-1,$ it follows that $x^{n-1}+\beta x^{n-2}+\beta^{2}x^{n-3}+\dots+\beta^{n-2}x+\beta^{n-1}=\frac{x^{n}-1}{x-\beta}.$ This, together with the fact $x^{n}-1=\prod_{\gamma\in{\mathbb{F}}_{q}\setminus\{0\}}(x-\gamma)$ , implies that $x^{n-1}+\beta x^{n-2}+\beta^{2}x^{n-3}+\dots+\beta^{n-2}x+\beta^{n-1}$ vanishes at all the elements of ${\mathbb{F}}_{q}\setminus\{0\}$ except at $\beta$ , where it evaluates to $\beta^{n-1}+\beta\beta^{n-2}+\beta^{2}\beta^{n-3}+\dots+\beta^{n-2}\beta+\beta^{n-1}=n\beta^{n-1}=\frac{-1}{\beta}$ . Hence, $-\beta(x^{n-1}+\beta x^{n-2}+\beta^{2}x^{n-3}+\dots+\beta^{n-2}x+\beta^{n-1})$ vanishes at all the elements of ${\mathbb{F}}_{q}\setminus\{0\}$ except at $\beta$ , where it evaluates to $1$ . Finally, $-\beta(x^{n-1}+\beta x^{n-2}+\beta^{2}x^{n-3}+\dots+\beta^{n-2}x+\beta^{n-1})=-(\beta x^{n-1}+\beta^{2}x^{n-2}+\beta^{3}x^{n-3}+\beta^{4}x^{n-4}+\dots+\beta^{(n-1)}x+\beta^{n})$ .

If we take $\beta=\alpha^{i}$ then $f_{i}$ and the expression have degree $q-2$ and take the same values at $q-1$ points, hence are equal. ∎

The main result relating the last two definitions of Reed–Solomon codes is the following lemma.

Lemma 18.

The inverse of the map

[TABLE]

is

[TABLE]

where $v(\beta)$ is the evaluation of $v_{0}+v_{1}x+\dots+v_{n-1}x^{n-1}$ at $\beta$ and $u(\beta)$ is the evaluation of $u_{0}+u_{1}x+\dots+u_{n-1}x^{n-1}$ at $\beta$ .

Proof.

The inverse map is giving the coefficients of the interpolation polynomial $f_{u}$ . By Lemma 17 we have that $f_{u}=\sum_{i=0}^{n-1}u_{i}f_{i}$ , where $f_{i}=-(\alpha^{i}x^{n-1}+\alpha^{2i}x^{n-2}+\alpha^{3i}x^{n-3}+\dots+\alpha^{(n-1)i}x+\alpha^{ni}).$ Now,

[TABLE]

So, $f_{u}=-u(\alpha)x^{n-1}-u(\alpha^{2})x^{n-2}-\dots-u(\alpha^{n})=\sum_{i=0}^{n-1}(-u(\alpha^{n-i}))x^{i}$ . ∎

Example 19.

Consider the word $(u_{0},u_{1},u_{2},u_{3},u_{4},u_{5})=(5,4,0,1,2,0)$ and the related polynomial $u=5+4x+x^{3}+2x^{4}$ . Its evaluation at the powers of $5$ is

[TABLE]

What Lemma 18 says is that the polynomial $v(x)=-5-3x-4x^{2}-2x^{3}-2x^{4}=2+4x+3x^{2}+5x^{3}+5x^{4}$ satisfies that $u=(v(1),v(5),v(5^{2}),v(5^{3}),v(5^{4}),v(5^{5})).$ Indeed,

[TABLE]

and it follows that $(v(1),v(5),v(5^{2}),v(5^{3}),v(5^{4}),v(5^{5}))=(5,4,0,1,2,0)=(u_{0},u_{1},u_{2},u_{3},u_{4},u_{5})$ .

4 New decoding approach.

We approach decoding from the point of view of Definition 11. However, we use Definition 15 for the proofs.

Let ${\mathbb{F}}_{q}[x]^{<d}$ be the set of polynomials with coefficients in ${\mathbb{F}}_{q}$ and degree strictly less than $d$ , and let ${\mathbb{F}}_{q}[x]^{<d}_{\geq d^{\prime}}$ be the set of polynomials with coefficients in ${\mathbb{F}}_{q}$ and with only terms of degrees at least $d^{\prime}$ and at most $d-1$ .

Suppose we receive $u\in{\mathbb{F}}_{q}^{n}$ . Let $f_{u}$ be the interpolation polynomial of $u$ . Decoding $u$ is the same as finding $c\in RS_{q,\alpha}(k)$ such that $u$ and $c$ are at minimum Hamming distance. Since words $c\in RS_{q,\alpha}(k)$ are the evaluation of polynomials of degrees smaller than $k$ at the nonzero elements of ${\mathbb{F}}_{q}$ , decoding $u$ is equivalent to finding $g_{c}\in{\mathbb{F}}_{q}[x]^{<k}$ such that $f_{u}-g_{c}$ has maximum number of nonzero roots. In fact, $g_{c}$ is then the interpolation polynomial of $c$ .

The monomials of $f_{u}$ can be split into those that have degree less than $k$ and those having degree at least $k$ . Let $h_{u},g_{u}$ be the unique polynomials with $h_{u}\in{\mathbb{F}}_{q}[x]^{<n}_{\geq k}$ , $g_{u}\in{\mathbb{F}}_{q}[x]^{<k}$ such that $f_{u}=h_{u}+g_{u}$ . Once $f_{u}$ is fixed, and so is $h_{u}$ , consider, from all the polynomials in ${\mathbb{F}}_{q}[x]^{<k}$ , a polynomial $g_{h_{u}}$ that maximizes the number of nonzero roots of $h_{u}+g_{h_{u}}$ . That is, the number of nonzero roots of $h_{u}+g_{h_{u}}$ is larger than or equal to the number of nonzero roots of $h_{u}+g^{\prime}$ for any $g^{\prime}\in{\mathbb{F}}_{q}[x]^{<k}$ . Then,

[TABLE]

Notice that if $e$ is the minimum weight word such that $u-e\in RS_{q,\alpha}(k)$ , then $e=u-c$ and its interpolation polynomial is $f_{e}=f_{u}-g_{c}=h_{u}+g_{h_{u}}$ .

Consider the set

[TABLE]

Because of the fact that $x^{n}-1=\prod_{\gamma\in{\mathbb{F}}_{q}\setminus\{0\}}(x-\gamma)$ , an equivalent definition is

[TABLE]

Notice that $\Lambda$ is not empty because $x^{n}-1$ belongs to $\Lambda$ .

Theorem 20.

Let $\lambda_{u}$ be a monic polynomial with minimum degree among the polynomials in $\Lambda$ . For a polynomial $g\in{\mathbb{F}}_{q}[x]^{<k}$ , if $(x^{n}-1)$ divides $\lambda_{u}(h_{u}+g)$ , then the number of nonzero roots of $h_{u}+g$ is greater than or equal to the number of nonzero roots of $h_{u}+g^{\prime}$ for any $g^{\prime}\in{\mathbb{F}}_{q}[x]^{<k}$ .

Proof.

For a fixed $g\in{\mathbb{F}}_{q}[x]^{<k}$ , the set of polynomials

[TABLE]

is, since $x^{n}-1=\prod_{\gamma\in{\mathbb{F}}_{q}\setminus\{0\}}(x-\gamma)$ , the set of polynomials that are multiples of

[TABLE]

The monic polynomial with minimum degree among $\Lambda_{g}$ is then the polynomial (7) itself. Now, $\Lambda=\cup_{g\in{\mathbb{F}}_{q}[x]^{<k}}\Lambda_{g}$ . So a monic polynomial with minimum degree among $\Lambda$ must be one of the polynomials as in (7) for some $g\in{\mathbb{F}}_{q}[x]^{<k}$ . The minimality of the degree of $\lambda_{u}$ implies the maximality of the number of nonzero roots of $h_{u}+g$ , where $g$ is such that $\lambda_{u}\in\Lambda_{g}$ . ∎

Let $\lambda_{u}$ be as in Theorem 20 and suppose $\mu\in{\mathbb{F}}_{q}[x]$ is such that $\lambda_{u}(h_{u}+g)=\mu(x^{n}-1)$ for some $g\in{\mathbb{F}}_{q}[x]^{<k}$ . Suppose that $\deg(\lambda_{u})=t$ and $\deg(h_{u})=d_{u}$ . In particular, $t\leq n$ and $d_{u}\leq n-1$ . Now, $\deg(\mu)=t+d_{u}-n$ .

Let the coefficients of $\lambda_{u}(h_{u}+g)$ be $\xi_{0},\dots,\xi_{d_{u}+t}$ . If $h_{u}=a_{d_{u}}x^{d_{u}}+a_{d_{u}-1}x^{d_{u}-1}+\dots+a_{k}x^{k}$ and $\lambda_{u}=x^{t}+l_{t-1}x^{t-1}+\dots+l_{1}x+l_{0},$ then, letting $a_{j}=0$ for all $j>d_{u}$ and $\xi_{j}=0$ for all $j>d_{u}+t$ , we have for all $i\geq 0$ ,

[TABLE]

which, by Lemma 18, is equivalent to

[TABLE]

Since $\deg(\mu)=t+d_{u}-n<n$ , the coefficients of $\lambda_{u}(h_{u}+g)=\mu(x^{n}-1)=\mu x^{n}-\mu$ satisfy $\xi_{i}=-\xi_{n}$ , $\xi_{1}=-\xi_{n+1}$ , …, $\xi_{t+d_{u}-n}=-\xi_{t+d_{u}}$ and $\xi_{t+d_{u}-n+1}=\xi_{t+d_{u}-n+2}=\dots=\xi_{n-1}=0.$

Lemma 21.

Let $\lambda_{u}$ be as in Theorem 20. The nonleading coefficients of $\lambda_{u}$ give a solution to the linear system

[TABLE]

Proof.

The lemma is a consequence of equation (8) and the fact that $\xi_{k+t},\dots,\xi_{n-1}=0,$ since $k+t\geq d_{u}+t-n+1$ . ∎

Lemma 22.

Let $t$ be the weight of a minimum weight vector $e\in{\mathbb{F}}_{q}^{n}$ such that $u-e\in RS_{q,\alpha}(k)$ and consider the linear system

[TABLE]

If $t\leq\frac{n-k}{2}$ and $t^{\prime}=t$ , then the linear system has a unique solution, which can be found as a solution to the square system

[TABLE] 2. 2.

If $t\leq\frac{n-k}{2}$ and $t^{\prime}=t$ , then the unique solution to the previous system satisfies $l_{0}\neq 0$ . 3. 3.

If $t\leq\frac{n-k}{2}$ and $t^{\prime}<t$ , then the system has no solution.

Proof.

The existence of a solution is a consequence of Lemma 21. For the uniqueness, we will see that the square submatrix

[TABLE]

has nonzero determinant. As a consequence of Definition 15 of $RS_{q,\alpha}(k)$ and the fact that $2t-1\leq n-k$ ,

[TABLE]

Suppose that the nonzero positions of $e$ are $i_{1},\dots,i_{t}$ , with $0\leq i_{1}<i_{2}<\dots<i_{t}\leq n-1$ . Then it is easy to check that, letting

[TABLE]

we have

[TABLE]

which clearly has nonzero determinant because $W$ is a Vandermonde matrix. 2. 2.

Suppose that

[TABLE]

Then, rearranging the columns, and considering Definition 15 of $RS_{q,\alpha}(k)$ together with the fact that $2t\leq n-k$ , we obtain

[TABLE]

but

[TABLE]

which, again, has nonzero determinant. This contradicts (10). 3. 3.

Suppose $t^{\prime}<t$ and $t-t^{\prime}=\delta$ . If

[TABLE]

then, supressing the first $\delta$ rows we obtain

[TABLE]

and adding $\delta$ columns at the beginning,

[TABLE]

This contradicts the two previous points. ∎

We obtain the following decoding algorithm for a $RS_{q,\alpha}(k)$ code, where $n=q-1$ .

Input: $u\in{\mathbb{F}}_{q}^{n}$ .

Let $t$ be the minimum integer such that

[TABLE]

For $t=0$ , the first matrix is the null matrix. In this case we consider $\mbox{rank}()=0$ . 2. 2.

Solve the linear system

[TABLE]

for $l_{0}\ldots,l_{t-1}$ and denote by $\lambda_{u}$ the polynomial $x^{t}+l_{t-1}x^{t-1}+\dots+l_{1}x+l_{0}$ . 3. 3.

Obtain as in Lemma 18 the interpolation polynomial $f_{u}$ of $u$ , and let $d_{u}$ be its degree. 4. 4.

Let $\zeta_{0},\dots,\zeta_{d_{u}+t}$ be the coefficients of $\lambda_{u}f_{u}$ ; that is, $\lambda_{u}f_{u}=\zeta_{0}+\zeta_{1}x+\dots+\zeta_{d_{u}+t}x^{d_{u}+t}.$

Let $g_{c}=f_{u}-\frac{(x^{n}-1)(\zeta_{n}+\zeta_{n+1}x+\dots+\zeta_{d_{u}+t}x^{d_{u}+t-n})}{\lambda_{u}}$ . 5. 5.

Output: $(g_{c}(1),g_{c}(\alpha),g_{c}(\alpha^{2}),\dots,g_{c}(\alpha^{n-1}))$ .

Theorem 23.

Suppose we received $u\in{\mathbb{F}}_{q}^{n}$ . Let $t$ be the weight of a minimum weight vector $e\in{\mathbb{F}}_{q}^{n}$ such that $u-e\in RS_{q,\alpha}(k)$ . If $t\leq\frac{n-k}{2}$ , then the previous algorithm outputs $u-e$ .

Proof.

By Lemma 22, step 1 gives the actual number of errors $t$ . By Lemma 22 again, the system in step 2 has a unique solution and, by Lemma 21, the polyomial one obtains is exactly the polynomial $\lambda_{u}$ in Theorem 20. After step 3 we get the interpolation polynomial $f_{u}$ . Let $h_{u},g_{u}$ be the unique polynomials with $h_{u}\in{\mathbb{F}}_{q}[x]^{<n}_{\geq k}$ , $g_{u}\in{\mathbb{F}}_{q}[x]^{<k}$ such that $f_{u}=h_{u}+g_{u}$ . By Theorem 20, $x^{n}-1$ divides $\lambda_{u}(h_{u}+g_{h_{u}})$ for some $g_{h_{u}}$ maximizing the number of nonzero roots of $h_{u}+{\mathbb{F}}_{q}[x]^{<k}$ . In particular, there exists $\mu$ such that

[TABLE]

and $\mu$ must have degree less than $n$ . Hence, the degrees of the monomials in $x^{n}\mu$ and those in $-\mu$ do not overlap. On the other hand, the monomials of $\lambda_{u}g_{h_{u}}$ have degree less than $t+k\leq\frac{n-k}{2}+k=\frac{n+k}{2}\leq n$ . So, the monomials in $x^{n}\mu$ of degrees at least $n$ coincide with the monomials in $\lambda_{u}h_{u}$ of degrees at least $n$ . That is, $x^{n}\mu=\zeta_{n}x^{n}+\zeta_{n+1}x^{n+1}+\dots+\zeta_{d_{u}+t}x^{d_{u}+t}$ , and we deduce that $\mu=\zeta_{n}+\zeta_{n+1}x+\dots+\zeta_{d_{u}+t}x^{d_{u}+t-n}$ . Now, from (12), we deduce that $g_{h_{u}}=\frac{(x^{n}-1)(\zeta_{n}+\zeta_{n+1}x+\dots+\zeta_{d_{u}+t}x^{d_{u}+t-n})}{\lambda_{u}}-h_{u}$ . Now, as explained in equation (6), the polynomial $g_{c}$ interpolating the code word $c\in RS_{q,\alpha}(k)$ at minimum distance of $u$ is $g_{c}=g_{u}-g_{h_{u}}=f_{u}-\frac{(x^{n}-1)(\zeta_{n}+\zeta_{n+1}x+\dots+\zeta_{d_{u}+t}x^{d_{u}+t-n})}{\lambda_{u}}$ . From here it follows that the output is, indeed, the code word $c\in RS_{q,\alpha}(k)$ at minimum distance of $u$ . ∎

Remark.

Steps 3 and 4 of the algorithm can be replaced by the equivalent steps in the Peterson–Gorenstein–Zierler algorithm, that is, we can find the error positions by means of the roots of $\lambda_{u}$ and then obtain the error values by means of the linear system

[TABLE]

Example 24.

Consider the same code as in Example 2, that is, the code $C=RS_{7,5}(2)$ . Suppose that after transmission of three code words we receive $u=421632$ , $v=342650$ , $w=025606$ .

Denote by the same symbol $u$ the vector $421632$ and the polynomial $4+2x+x^{2}+6x^{3}+3x^{4}+2x^{5}$ . The syndromes of $u$ are

[TABLE]

Since the syndromes are nonzero we deduce that there is at least one error. We have $\mbox{rank}\left(\begin{array}[]{c}\\ \\ \\ \end{array}\right)\neq\mbox{rank}\left(\begin{array}[]{c}3\\ 1\\ 5\\ 4\end{array}\right)$ , but $\mbox{rank}\left(\begin{array}[]{c}3\\ 1\\ 5\end{array}\right)=\mbox{rank}\left(\begin{array}[]{cc}3&1\\ 1&5\\ 5&4\end{array}\right)$ . So $t=1$ and there is only one error. We solve the system $3l_{0}=-1$ , whose solution is $l_{0}=2$ . We deduce that the error locator polynomial is $\lambda=x+2$ . We compute $f_{u}$ as in Example 13, obtaining $f_{u}=4x^{5}+6x^{4}+2x^{3}+3x^{2}+3$ . Now, since $f_{u}\cdot\lambda=4x^{6}+6x^{2}+3x+6$ , we deduce that $\zeta_{n}+\dots+\zeta_{d_{u}+t}x^{d_{u}+t-n}=4$ and $g_{c}=6x+5$ , so that the corrected word is $(401632)$ . We could also have found the single root of $\lambda$ , which is $5=5^{1}$ , and then deduce that there is an error at the second position (first position, if we start counting by [math]). Then, to find the error value we could have solved the system $5^{1}e_{1}=u(a)=3$ , whose solution is $e_{1}=2$ . The corrected word is then $u-(020000)=(421632)-(020000)=(401632)$ .

Denote by the same symbol $v$ the vector $342650$ and the polynomial $3+4x+2x^{2}+6x^{3}+5x^{4}$ . The syndromes of $v$ are

[TABLE]

Since the syndromes are all zero we deduce that there is no error.

Denote by the same symbol $w$ the vector $025606$ and the polynomial $2x+5x^{2}+6x^{3}+6x^{5}$ . The syndromes of $w$ are

[TABLE]

Since the syndromes are nonzero we deduce that there is at least one error. We have $\mbox{rank}\left(\begin{array}[]{c}\\ \\ \\ \end{array}\right)\neq\mbox{rank}\left(\begin{array}[]{c}0\\ 1\\ 5\\ 5\end{array}\right)$ , $\mbox{rank}\left(\begin{array}[]{c}0\\ 1\\ 5\end{array}\right)\neq\mbox{rank}\left(\begin{array}[]{cc}0&1\\ 1&5\\ 5&5\end{array}\right)$ , while $\mbox{rank}\left(\begin{array}[]{cc}0&1\\ 1&5\end{array}\right)=\mbox{rank}\left(\begin{array}[]{ccc}0&1&5\\ 1&5&5\end{array}\right)=2$ . So, $t=2$ . We solve the system

[TABLE]

whose solution is $l_{0}=6$ , $l_{1}=2$ . We deduce that the error locator polynomial is $\lambda=x^{2}+2x+6.$ We compute $f_{w}$ as in Example 13, obtaining $f_{w}=6x^{4}+2x^{3}+2x^{2}+2x+2$ . Now, since $f_{w}\cdot\lambda=6x^{6}+4x^{3}+4x^{2}+2x+5$ , we deduce that $\zeta_{n}+\dots+\zeta_{d_{u}+t}x^{d_{u}+t-n}=6$ and $g_{c}=4x+3$ , so that the corrected word is $(025641)$ . We could also have found the roots of $\lambda$ , which are $2=5^{4}$ and $3=5^{5}$ and then deduce that the error positions are the fifth and sixth positions (fourth and fifth positions, if we start counting by [math]). Then, to find the error value we could have solved the system

[TABLE]

whose solution is $e_{4}=3$ , $e_{5}=5$ . The corrected word is then $w-(000035)=(025606)-(000035)=(025641)$ .

5 A glimpse of the Peterson–Gorenstein–Zierler algorithm.

Peterson [20] and Gorenstein and Zierler [10] proposed a decoding algorithm which is very similar to the one we just presented. It is based on the following lemma.

For all $h$ with $t\leq h\leq\frac{n-k}{2}$ , define

[TABLE]

Lemma 25.

If $t<h\leq\frac{n-k}{2}$ , then $\det(A_{h})=0$ , while $\det(A_{t})\not=0.$ That is, the number of errors (if it is at most $\frac{n-k}{2}$ ) is the maximum integer $h\leq\frac{n-k}{2}$ such that $\det(A_{h})\neq 0$ .

Proof.

Since $2h-1\leq n-k$ ,

[TABLE]

As before, let us denote the error positions as $i_{1},\dots,i_{t}$ and let $M=\{m_{1},...,m_{h}\}\subseteq\{0,...,n-1\}$ be any subset containing all the error positions. Let $D$ be the diagonal matrix

[TABLE]

Clearly, $\left|D\right|\neq 0$ if $h=t$ and $\left|D\right|=0$ if $h>t$ . Let

[TABLE]

Since $W$ is a Vandermonde matrix and the indices in $M$ are all different, $\left|W\right|\neq 0$ . We have

[TABLE]

Now it is straightforward to check that this product of matrices has zero determinant if and only if $M$ contains no error positions, that is, if $e_{s}=0$ for some $s\in M$ . ∎

The Peterson–Gorenstein–Zierler algorithm is as follows.

Input: $u\in{\mathbb{F}}_{q}^{n}$ .

Let $t$ be the maximum integer smaller than or equal to $\frac{n-k}{2}$ such that

[TABLE] 2. 2.

Solve the linear system

[TABLE]

for $l_{0},\ldots,l_{t-1}$ and denote by $\lambda_{u}$ the polynomial $x^{t}+l_{t-1}x^{t-1}+\dots+l_{1}x+l_{0}$ . 3. 3.

Find the error positions by means of the roots of $\lambda_{u}$ . 4. 4.

Find the error values by means of the linear system

[TABLE] 5. 5.

Output: $c=u-e$ .

6 Comparison of our algorithm with the Peterson–Gorenstein–Zierler algorithm.

The main differences between our proposed algorithm and the Peterson–Gorenstein–Zierler algorithm are

•

Computation of the number of errors (step 1 in both algorithms);

•

Computation of the error values (steps 3–4 in both algorithms).

Error location (step 2) is done exactly in the same way.

As for the computation of the error values, step 4 in our algorithm needs two polynomial multiplications and one division (all of them of order $t+n$ ), while steps 3 and 4 in the Peterson–Gornestein–Zierler algorithm involve two linear square systems of $t$ equations. This already makes our algorithm simpler than the Peterson–Gorenstein–Zierler algorithm.

But the main difference is in the computation of the number of errors. In the Peterson–Gorenstein–Zierler algorithm we start computing the determinant of a (large) $\frac{n-k}{2}\times\frac{n-k}{2}$ matrix and continue computing determinants of decreasing order, while in our algorithm we start computing the rank of a (small) $2\times(n-k-1)$ matrix and continue computing ranks of $(h+1)\times(n-k-h)$ matrices with an increasing value of $h$ . In the Peterson–Gorenstein–Zierler algorithm, the smaller the number of errors, the more determinant computations will be needed. In our algorithm, the smaller the number of errors, the fewer rank computations will be needed. Furthermore, in the Peterson–Gorenstein–Zierler algorithm we start with the most complex determinants and then they get simpler, while in our algorithm we start with the simpler rank computations and, as the number of errors increases we get more complex rank computations.

7 The optimistic view of best case decoding.

There are many scenarios where a high reliability is required but errors rarely occur. In this case, error correcting codes with a high error correction capability are required although the expected number of errors is low. In terms of error correction, the decoding approach that takes this perspective into consideration is called best case decoding. See [4] for a deep analysis. In Berlekamp’s clarifying words,

…a best case decoder is philosophically analogous to a small child who continually asks, “Are we almost there now?” This question may occur at many places in a long decoding program. But, in a high-reliability application, the odds are quite favorable that any time the question is asked, the answer is likely to be “YES”.

Our algorithm is very well suited for best case decoding because, in contrast to the Peterson–Gorenstein–Zierler algorithm, it is fast when the number of errors is small and it is only a bit slower when the number of errors approaches the correction capability.

Acknowledgments.

The author would like to thank the anonymous referees for deeply reading the manuscript and for making very useful comments. She would specially like to thank the editor Susan Jane Colley for her careful reading.

Bibliography27

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] E. R. Berlekamp, Algebraic Coding Theory . Mc Graw-Hill Book Co., New York, 1968.
2[2] , The technology of error-correcting codes, Proceedings of the IEEE 68 (1980) 564–593.
3[3] , Bit-serial Reed-Solomon encoders, IEEE Trans. Information Theory IT 28 (1982) 869–874.
4[4] , Bounded distance+1 soft-decision reed-solomon decoding, IEEE Trans. Information Theory IT 42 (1996) 704–720.
5[5] J. Bierbrauer, Introduction to Coding Theory . Chapman & Hall/CRC, Boca Raton, FL, 2005.
6[6] M. Bossert, S. Bezzateev, A unified view on known algebraic decoding algorithms and new decoding concepts, IEEE Trans. Inform. Theory 59 (2013) 7320–7336.
7[7] W. G. Chambers, Solution of Welch-Berlekamp key equation by Euclidean algorithm, Electronics Letters 29 (1993).
8[8] J.-L. Dornstetter, On the equivalence between Berlekamp’s and Euclid’s algorithms, IEEE Trans. Inform. Theory 33 (1987) 428–431.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

A Decoding Approach to Reed–Solomon Codes

Abstract

1 Introduction.

2 Some background on coding theory.

The alphabet Fq{\mathbb{F}}_{q}Fq​.

Example 1**.**

Linear codes.

Generator matrices.

Example 2**.**

Dual code and parity-check matrices.

Example 3**.**

Hamming distance, correction capability, and Singleton bound.

Example 4**.**

Vandermonde matrices.

3 Four definitions of Reed–Solomon codes.

3.1 Reed–Solomon codes from generator matrices.

Definition 5**.**

Example 6**.**

3.2 Reed–Solomon codes from parity-check matrices.

Definition 7**.**

Example 8**.**

Lemma 9**.**

Proof.

Example 10**.**

3.3 Reed–Solomon codes and interpolation polynomials.

Definition 11**.**

Example 12**.**

Example 13**.**

Code word checking.

Example 14**.**

3.4 Reed–Solomon codes and polynomial evaluation.

Definition 15**.**

Code word checking.

Example 16**.**

3.5 Connection of the coefficients of an interpolation polynomial and its evaluation at all points.

Lemma 17**.**

Proof.

Lemma 18**.**

Proof.

Example 19**.**

4 New decoding approach.

Theorem 20**.**

Proof.

Lemma 21**.**

Proof.

Lemma 22**.**

Proof.

Theorem 23**.**

Proof.

Remark**.**

Example 24**.**

5 A glimpse of the Peterson–Gorenstein–Zierler algorithm.

Lemma 25**.**

Proof.

6 Comparison of our algorithm with the Peterson–Gorenstein–Zierler algorithm.

7 The optimistic view of best case decoding.

Acknowledgments.

The alphabet ${\mathbb{F}}_{q}$ .

Example 1.

Example 2.

Example 3.

Example 4.

Definition 5.

Example 6.

Definition 7.

Example 8.

Lemma 9.

Example 10.

Definition 11.

Example 12.

Example 13.

Example 14.

Definition 15.

Example 16.

Lemma 17.

Lemma 18.

Example 19.

Theorem 20.

Lemma 21.

Lemma 22.

Theorem 23.

Remark.

Example 24.

Lemma 25.