Characteristic Matrices and Trellis Reduction for Tail-Biting   Convolutional Codes

Masato Tajima

arXiv:1705.03982·cs.IT·May 25, 2017

Characteristic Matrices and Trellis Reduction for Tail-Biting Convolutional Codes

Masato Tajima

PDF

Open Access

TL;DR

This paper explores the properties of characteristic matrices for tail-biting convolutional codes and demonstrates how cyclic transformations and polynomial matrix reductions can lead to trellis complexity reduction.

Contribution

It introduces a cyclic structure-based analysis of characteristic matrices and proposes a method for trellis reduction using polynomial matrix transformations and partial cyclic shifts.

Findings

01

Characteristic matrices have a cyclic structure related to the code's generator matrix.

02

Trellis reduction can be achieved through polynomial matrix reduction and cyclic shifts.

03

Partial cyclic shifts of code sequences facilitate trellis complexity reduction.

Abstract

Basic properties of a characteristic matrix for a tail-biting convolutional code are investigated. A tail-biting convolutional code can be regarded as a linear block code. Since the corresponding scalar generator matrix Gt has a kind of cyclic structure, an associated characteristic matrix also has a cyclic structure, from which basic properties of a characteristic matrix are obtained. Next, using the derived results, we discuss the possibility of trellis reduction for a given tail-biting convolutional code. There are cases where we can find a scalar generator matrix Gs equivalent to Gt based on a characteristic matrix. In this case, if the polynomial generator matrix corresponding to Gs has been reduced, or can be reduced by using appropriate transformations, then trellis reduction for the original tail-biting convolutional code is realized. In many cases, the polynomial generator…

Equations339

[a,b]\stackrel{{\scriptstyle\triangle}}{{=}}\left\{\begin{array}[]{ll}\{a,a+1,\cdots,b\},&\quad\mbox{if $a\leq b$}\\ \{a,a+1,\cdots,n-1,0,1,\cdots,b\},&\quad\mbox{if $a>b$}\end{array}\right.

[a,b]\stackrel{{\scriptstyle\triangle}}{{=}}\left\{\begin{array}[]{ll}\{a,a+1,\cdots,b\},&\quad\mbox{if $a\leq b$}\\ \{a,a+1,\cdots,n-1,0,1,\cdots,b\},&\quad\mbox{if $a>b$}\end{array}\right.

X = △ X_{0}^{*} \cup ρ_{1} (X_{1}^{*}) \cup \dots \cup ρ_{n - 1} (X_{n - 1}^{*}) .

X = △ X_{0}^{*} \cup ρ_{1} (X_{1}^{*}) \cup \dots \cup ρ_{n - 1} (X_{n - 1}^{*}) .

X=\left(\begin{array}[]{c}x_{1}\\ x_{2}\\ \cdots\\ x_{n}\end{array}\right)

X=\left(\begin{array}[]{c}x_{1}\\ x_{2}\\ \cdots\\ x_{n}\end{array}\right)

T = {(a_{l}, b_{l}] : l = 1, 2, \dots, n}

T = {(a_{l}, b_{l}] : l = 1, 2, \dots, n}

G (D) = G_{0} + G_{1} D + \dots + G_{L} D^{L}

G (D) = G_{0} + G_{1} D + \dots + G_{L} D^{L}

G_{N}^{tb}\stackrel{{\scriptstyle\triangle}}{{=}}\left(\begin{array}[]{cccccccc}G_{0}&G_{1}&\scriptstyle{\ldots}&G_{L-1}&G_{L}&&&\\ &G_{0}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&G_{L-1}&G_{L}&&\\ &&\scriptstyle{\ldots}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&\\ &&&G_{0}&G_{1}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&G_{L}\\ G_{L}&&&&G_{0}&G_{1}&\scriptstyle{\ldots}&G_{L-1}\\ G_{L-1}&G_{L}&&&&G_{0}&\scriptstyle{\ldots}&\scriptstyle{\ldots}\\ \scriptstyle{\ldots}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&&&&\scriptstyle{\ldots}&G_{1}\\ G_{1}&G_{2}&\scriptstyle{\ldots}&G_{L}&&&&G_{0}\end{array}\right)

G_{N}^{tb}\stackrel{{\scriptstyle\triangle}}{{=}}\left(\begin{array}[]{cccccccc}G_{0}&G_{1}&\scriptstyle{\ldots}&G_{L-1}&G_{L}&&&\\ &G_{0}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&G_{L-1}&G_{L}&&\\ &&\scriptstyle{\ldots}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&\\ &&&G_{0}&G_{1}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&G_{L}\\ G_{L}&&&&G_{0}&G_{1}&\scriptstyle{\ldots}&G_{L-1}\\ G_{L-1}&G_{L}&&&&G_{0}&\scriptstyle{\ldots}&\scriptstyle{\ldots}\\ \scriptstyle{\ldots}&\scriptstyle{\ldots}&\scriptstyle{\ldots}&&&&\scriptstyle{\ldots}&G_{1}\\ G_{1}&G_{2}&\scriptstyle{\ldots}&G_{L}&&&&G_{0}\end{array}\right)

X

X

ρ_{n_{0}} (X_{n_{0}}^{*})

ρ_{n_{0}} (X_{n_{0}}^{*})

ρ_{n_{0} + 1} (X_{n_{0} + 1}^{*})

ρ_{2 n_{0} - 1} (X_{2 n_{0} - 1}^{*})

\left\{\begin{array}[]{l}\tilde{X}_{1}^{*}\stackrel{{\scriptstyle\triangle}}{{=}}\rho_{1}(X_{1}^{*})\\ \tilde{X}_{2}^{*}\stackrel{{\scriptstyle\triangle}}{{=}}\rho_{2}(X_{2}^{*})\\ \cdots\\ \tilde{X}_{n_{0}-1}^{*}\stackrel{{\scriptstyle\triangle}}{{=}}\rho_{n_{0}-1}(X_{n_{0}-1}^{*}).\end{array}\right.

\left\{\begin{array}[]{l}\tilde{X}_{1}^{*}\stackrel{{\scriptstyle\triangle}}{{=}}\rho_{1}(X_{1}^{*})\\ \tilde{X}_{2}^{*}\stackrel{{\scriptstyle\triangle}}{{=}}\rho_{2}(X_{2}^{*})\\ \cdots\\ \tilde{X}_{n_{0}-1}^{*}\stackrel{{\scriptstyle\triangle}}{{=}}\rho_{n_{0}-1}(X_{n_{0}-1}^{*}).\end{array}\right.

ρ_{2 n_{0}} (X_{2 n_{0}}^{*})

ρ_{2 n_{0}} (X_{2 n_{0}}^{*})

ρ_{2 n_{0} + 1} (X_{2 n_{0} + 1}^{*})

ρ_{3 n_{0} - 1} (X_{3 n_{0} - 1}^{*})

ρ_{i n_{0}} (X_{i n_{0}}^{*})

ρ_{i n_{0}} (X_{i n_{0}}^{*})

ρ_{i n_{0} + 1} (X_{i n_{0} + 1}^{*})

ρ_{(i + 1) n_{0} - 1} (X_{(i + 1) n_{0} - 1}^{*})

X

X

X

X

X_{0}^{*} \cup ρ_{1} (X_{1}^{*}) \cup \dots \cup ρ_{n_{0} - 1} (X_{n_{0} - 1}^{*}) \subseteq X_{0}^{*} \cup ρ_{n_{0}} (X_{0}^{*})

X_{0}^{*} \cup ρ_{1} (X_{1}^{*}) \cup \dots \cup ρ_{n_{0} - 1} (X_{n_{0} - 1}^{*}) \subseteq X_{0}^{*} \cup ρ_{n_{0}} (X_{0}^{*})

X = X_{0}^{*} \cup ρ_{n_{0}} (X_{0}^{*}) \cup ρ_{2 n_{0}} (X_{0}^{*}) \cup \dots \cup ρ_{(N - 1) n_{0}} (X_{0}^{*}) .

X = X_{0}^{*} \cup ρ_{n_{0}} (X_{0}^{*}) \cup ρ_{2 n_{0}} (X_{0}^{*}) \cup \dots \cup ρ_{(N - 1) n_{0}} (X_{0}^{*}) .

ρ_{n_{0}} (X_{0}^{*}) \cup ρ_{n_{0}} (\tilde{X}_{1}^{*}) \cup \dots \cup ρ_{n_{0}} (\tilde{X}_{n_{0} - 1}^{*}) \subseteq ρ_{n_{0}} (X_{0}^{*}) \cup ρ_{2 n_{0}} (X_{0}^{*}) .

ρ_{n_{0}} (X_{0}^{*}) \cup ρ_{n_{0}} (\tilde{X}_{1}^{*}) \cup \dots \cup ρ_{n_{0}} (\tilde{X}_{n_{0} - 1}^{*}) \subseteq ρ_{n_{0}} (X_{0}^{*}) \cup ρ_{2 n_{0}} (X_{0}^{*}) .

ρ_{2 n_{0}} (X_{0}^{*}) \cup ρ_{2 n_{0}} (\tilde{X}_{1}^{*}) \cup \dots \cup ρ_{2 n_{0}} (\tilde{X}_{n_{0} - 1}^{*})

ρ_{2 n_{0}} (X_{0}^{*}) \cup ρ_{2 n_{0}} (\tilde{X}_{1}^{*}) \cup \dots \cup ρ_{2 n_{0}} (\tilde{X}_{n_{0} - 1}^{*})

ρ_{(N - 1) n_{0}} (X_{0}^{*}) \cup ρ_{(N - 1) n_{0}} (\tilde{X}_{1}^{*}) \cup \dots \cup ρ_{(N - 1) n_{0}} (\tilde{X}_{n_{0} - 1}^{*})

X = X_{0}^{*} \cup ρ_{n_{0}} (X_{0}^{*}) \cup ρ_{2 n_{0}} (X_{0}^{*}) \cup \dots \cup ρ_{(N - 1) n_{0}} (X_{0}^{*}) .

X = X_{0}^{*} \cup ρ_{n_{0}} (X_{0}^{*}) \cup ρ_{2 n_{0}} (X_{0}^{*}) \cup \dots \cup ρ_{(N - 1) n_{0}} (X_{0}^{*}) .

G (D)

G (D)

= △

G_{3}^{t b}

G_{3}^{t b}

=

X_{0}^{*}=\left(\begin{array}[]{ccccccccc}\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&0&0&0\\ 0&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}\\ 0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0\\ \end{array}\right)\begin{array}[]{c}$(0, 5]$\\ $(3, 8]$\\ $(1, 7]$\end{array}

X_{0}^{*}=\left(\begin{array}[]{ccccccccc}\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&0&0&0\\ 0&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}\\ 0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0\\ \end{array}\right)\begin{array}[]{c}$(0, 5]$\\ $(3, 8]$\\ $(1, 7]$\end{array}

\tilde{X}_{1}^{*}=\left(\begin{array}[]{ccccccccc}\mbox{\boldmath$1$}&0&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}\\ 0&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}\\ 0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0\\ \end{array}\right)\begin{array}[]{l}$(2, 0]$\\ $(3, 8]$\\ $(1, 7]$\end{array}

\tilde{X}_{1}^{*}=\left(\begin{array}[]{ccccccccc}\mbox{\boldmath$1$}&0&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}\\ 0&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}\\ 0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0\\ \end{array}\right)\begin{array}[]{l}$(2, 0]$\\ $(3, 8]$\\ $(1, 7]$\end{array}

\tilde{X}_{2}^{*}=\left(\begin{array}[]{ccccccccc}\mbox{\boldmath$1$}&0&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}\\ 0&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}\\ \mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}\\ \end{array}\right)\begin{array}[]{l}$(2, 0]$\\ $(3, 8]$\\ $(4, 1]$.\end{array}

\tilde{X}_{2}^{*}=\left(\begin{array}[]{ccccccccc}\mbox{\boldmath$1$}&0&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}\\ 0&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}\\ \mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}\\ \end{array}\right)\begin{array}[]{l}$(2, 0]$\\ $(3, 8]$\\ $(4, 1]$.\end{array}

X=\left(\begin{array}[]{ccccccccc}\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&0&0&0\\ 0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0\\ \mbox{\boldmath$1$}&0&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}\\ 0&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}\\ \mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}\\ \mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}\\ \mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&0&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}\\ \mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}\\ \mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0&\mbox{\boldmath$1$}\end{array}\right)\begin{array}[]{l}$(0, 5]$\\ $(1, 7]$\\ $(2, 0]$\\ $(3, 8]$\\ $(4, 1]$\\ $(5, 3]$\\ $(6, 2]$\\ $(7, 4]$\\ $(8, 6]$.\end{array}

X=\left(\begin{array}[]{ccccccccc}\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&0&0&0\\ 0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0\\ \mbox{\boldmath$1$}&0&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}\\ 0&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}\\ \mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}\\ \mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}\\ \mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&0&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}\\ \mbox{\boldmath$0$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0&0&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}\\ \mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$1$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$0$}&\mbox{\boldmath$1$}&0&\mbox{\boldmath$1$}\end{array}\right)\begin{array}[]{l}$(0, 5]$\\ $(1, 7]$\\ $(2, 0]$\\ $(3, 8]$\\ $(4, 1]$\\ $(5, 3]$\\ $(6, 2]$\\ $(7, 4]$\\ $(8, 6]$.\end{array}

X_{0}^{*} \cup ρ_{1} (X_{1}^{*}) \cup ρ_{2} (X_{2}^{*}) ⊈ X_{0}^{*} \cup ρ_{3} (X_{0}^{*}) .

X_{0}^{*} \cup ρ_{1} (X_{1}^{*}) \cup ρ_{2} (X_{2}^{*}) ⊈ X_{0}^{*} \cup ρ_{3} (X_{0}^{*}) .

X=\left(\begin{array}[]{c}x_{1}\\ x_{2}\\ \cdots\\ x_{n}\end{array}\right)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCoding theory and cryptography · Advanced Wireless Communication Techniques · Error Correcting Code Techniques

Full text

Characteristic Matrices and Trellis Reduction for Tail-Biting Convolutional Codes

Masato Tajima M. Tajima was with the graduate School of Science and Engineering, University of Toyama, 3190 Gofuku, Toyama 930-8555, Japan (e-mail: [email protected]).Manuscript received April 19, 2005; revised August 26, 2015. This paper was presented in part at the IEICE Technical Committee IT Conference in March 2017.

Abstract

Basic properties of a characteristic matrix for a tail-biting convolutional code are investigated. A tail-biting convolutional code can be regarded as a linear block code. Since the corresponding scalar generator matrix $G^{tb}$ has a kind of cyclic structure, an associated characteristic matrix also has a cyclic structure, from which basic properties of a characteristic matrix are obtained. Next, using the derived results, we discuss the possibility of trellis reduction for a given tail-biting convolutional code. There are cases where we can find a scalar generator matrix $G^{\prime}$ equivalent to $G^{tb}$ based on a characteristic matrix. In this case, if the polynomial generator matrix corresponding to $G^{\prime}$ has been reduced, or can be reduced by using appropriate transformations, then trellis reduction for the original tail-biting convolutional code is realized. In many cases, the polynomial generator matrix corresponding to $G^{\prime}$ has a monomial factor in some column and is reduced by dividing the column by the factor. Note that this transformation corresponds to cyclically shifting the associated code subsequence (a tail-biting path is regarded as a code sequence) to the left. Thus if we allow partial cyclic shifts of a tail-biting path, then trellis reduction is accomplished.

Index Terms:

Tail-biting convolutional codes, tail-biting trellis, characteristic matrix, cyclic shift, trellis reduction.

I Introduction

From the 1980s to 1990s, trellis representations of linear block codes were studied with a great interest [2, 7, 8, 14, 17, 18, 19]. Subsequently, tail-biting trellises of linear block codes have received much attention. Given a linear block code, there exists a unique minimal conventional trellis. This trellis simultaneously minimizes all measures of trellis complexity. However, tail-biting trellises do not have such a property. That is, minimality of tail-biting trellises depends on the measure being used [13]. In general, the complexity of a tail-biting trellis may be much lower than that of the minimal conventional trellis. There have been many contributions to the subject, including [3, 4, 9, 10, 11, 13, 15, 20, 22, 32]. The works [3, 13] had a strong influence on the subsequent studies. A remarkable progress has been made by Koetter and Vardy in their paper [13]. They showed that for a $k$ -dimensional linear block code of length $n$ with full support, there exists a list of $n$ characteristic generators (i.e., a characteristic matrix [13]) from which all minimal tail-biting minimal trellises can be obtained. A different method of producing tail-biting trellises was proposed by Nori and Shankar [20]. They used the Bahl-Cocke-Jelinek-Raviv (BCJR) construction [2]. These works were further investigated by Gluesing-Luerssen and Weaver [9, 10]. In particular, noting that a characteristic matrix for a given code is not necessarily unique, they have refined and generalized the previous works. More recent works [4, 11] provide further research on the subject.

On the other hand, tail-biting convolutional codes were proposed by Ma and Wolf in 1986 [16] (tail-biting representations of block codes were introduced by Solomon and van Tilborg [24]). Tail-biting (abbreviated TB) is a technique by which a convolutional code can be used to construct a block code without any loss of rate. In connection with the subject, there have been also many works, including [1, 16, 23, 25, 28, 33]. Since a TB convolutional code is identified with a linear block code, the results on TB trellises for linear block codes can be used. In particular, we can think of a characteristic matrix of a given TB convolutional code. In this paper, we first investigate a characteristic matrix for a TB convolutional code. And then, based on the derived results, we discuss the possibility of trellis reduction for a given TB convolutional code. An outline of the rest of the paper is as follows:

In Section II, we review the basic notions needed for this paper.

In Section III, we investigate the basic properties of a characteristic matrix for a TB convolutional code. When a TB convolutional code with generator matrix $G(D)$ is regarded as a linear block code $C$ , a (scalar) generator matrix (denoted by $G^{tb}$ ) for $C$ is constructed using the coefficients which appear in the polynomial expansion of $G(D)$ . We see that $G^{tb}$ has a kind of cyclic structure. Then it is shown that the (characteristic) span list associated with a characteristic matrix for $C$ consists of some basic spans and their right cyclic shifts, from which basic properties of a characteristic matrix are derived.

In Section IV, we deal with transformations of $G(D)$ and discuss the relationship between these transformations and the corresponding scalar generator matrices $G^{tb}\mbox{'}s$ . We see that dividing a column of $G(D)$ by a monomial factor corresponds to cyclically shifting a column subsequence of $G^{tb}$ to the left, whereas multiplying a column of $G(D)$ by a monomial corresponds to cyclically shifting a column subsequence of $G^{tb}$ to the right. These properties are essentially used for trellis reduction to be discussed in Section V.

In Section V, we discuss the possibility of trellis reduction for a given TB convolutional code (we identify the code with an $(n,k)$ block code $C$ ). As is stated above, we can think of a characteristic matrix for $C$ . Consider the case where some $k$ characteristic generators, which consist of some basic generators and their right cyclic shifts, can generate the same code $C$ . We see that these characteristic generators form a (scalar) generator matrix associated with a (polynomial) generator matrix of another convolutional code. In this case, if the constraint length of the obtained generator matrix is smaller than that of the original one, then trellis reduction is realized. Even if this kind of reduction is not possible, there are cases where a newly obtained generator matrix contains a monomial factor in some column. Then there is a possibility that the generator matrix is reduced by sweeping the monomial factor out of the column. Note that this operation corresponds to cyclically shifting the corresponding code subsequence to the left. In this way, trellis reduction can be accomplished. We also present a trellis reduction method for high rate codes which uses a reciprocal dual encoder. We remark that the (trellis) section length is an important parameter and the proposed method is restricted to TB convolutional codes with short to moderate section length. We give an upper bound for the section length by evaluating the span lengths of characteristic generators.

Finally, conclusions are provided in Section VI.

II Preliminaries

We begin with the basic notions needed in this paper, where the underlying field is assumed to be $F=\mbox{GF}(2)$ . Let $C$ be an $(n,k)$ linear block code, where the set of indices for a codeword in $C$ is denoted by $I\stackrel{{\scriptstyle\triangle}}{{=}}\{0,1,\cdots,n-1\}$ . Then a codeword $x\in C$ is expressed as $x=(x_{0},x_{1},\cdots,x_{n-1})$ . $I$ is also regarded as the time axis for TB trellises for $C$ . Since TB trellises for $C$ are considered in this paper, it is convenient to identify $I$ with $Z_{n}$ , the ring of integers modulo $n$ . Hence, when dealing with TB trellises, all index arithmetic will be implicitly performed modulo $n$ [13].

The notion of span is fundamental in trellis theory. Given a codeword $x\in C$ , a span of $x$ , denoted by $[x]$ , is a semiopen interval $(a,b]\in I$ such that the corresponding closed interval $[a,b]$ contains all the nonzero positions of $x$ [13]. Due to the cyclic structure of the time axis $I$ , we adopt the following interpretation of intervals [9, 10, 13]. For $a,b\in I$ , we define

[TABLE]

and $(a,b]\stackrel{{\scriptstyle\triangle}}{{=}}[a,b]\backslash\{a\}$ . We call the intervals $(a,b]$ and $[a,b]$ conventional if $a\leq b$ and circular otherwise.

In connection with the construction of minimal TB trellises for $C$ , Koetter and Vardy [13] introduced the notion of characteristic generator for $C$ . Denote by $\sigma_{j}(\cdot)$ a cyclic shift to the left by $j$ positions [13]. Similarly, denote by $\rho_{j}(\cdot)$ a cyclic shift to the right by $j$ positions. Let $X_{j}^{*}$ be a basis in minimal-span form [17] for the code $C_{j}\stackrel{{\scriptstyle\triangle}}{{=}}\sigma_{j}(C)$ . A characteristic generator for $C$ is a pair consisting of a codeword $x=(x_{0},x_{1},\cdots,x_{n-1})\in C$ and a span $[x]=(a,b]$ such that $x_{a},x_{b}$ are nonzero. The set of all the characteristic generators for $C$ is given by

[TABLE]

Here we have an understanding that if $x^{*}\in X_{j}^{*}$ , then $[\rho_{j}(x^{*})]=(a+j,b+j]$ , where $[x^{*}]=(a,b]$ .

Assume that $C$ has a full support. Then a characteristic matrix for $C$ is the $n\times n$ matrix having the elements of $X$ as its rows. The above definition implies that when we refer to a characteristic matrix, the associated spans are taken into account. Here note that a basis in minimal-span form is not necessarily unique. Hence, $X$ may not be uniquely determined. On the other hand, the set of spans (denoted by $T$ ) accompanied by $X$ is, up to ordering, uniquely determined by the code $C$ [9, 10, 13]. $T$ is called the characteristic span list of $C$ (an element $\in T$ is called a characteristic span of $C$ ) [9, 10]. In order to clarify this fact, Gluesing-Luerssen and Weaver introduced the notion of characteristic pair $(X,T)$ of $C$ [9, Definition III.8], where $X$ is a generating set of $C$ and $T$ represents the associated spans. In this paper, we basically follow the definition of Gluesing-Luerssen and Weaver, but in order to emphasize the fact that a characteristic matrix inherently assumes the associated spans, we leave the term characteristic matrix for the definition. Thus we define as follows (cf. [9, Definition III.8]).

Definition II.1

Let $C$ be an $(n,k)$ linear block code with support $I$ . A characteristic matrix for $C$ with (characteristic) span list $T$ is defined to be a pair $(X,T)$ , where

[TABLE]

have the properties:

$\{x_{1},\cdots,x_{n}\}$ * generates $C$ .*

2)

$(a_{l},b_{l}]$ * is a span of $x_{l},~{}l=1,\cdots,n$ .*

3)

$a_{1},\cdots,a_{n}$ * are distinct and $b_{1},\cdots,b_{n}$ are distinct.*

4)

For all $j\in I$ , there exist exactly $n-k$ row indices, $l_{1},\cdots,l_{n-k}$ , such that $j\in(a_{l_{i}},b_{l_{i}}]$ for $i=1,\cdots,n-k$ .

Remark: Property 3) is derived from [13, Lemma 5.7] and the related remarks. Also, Property 4) is derived from the proof of [13, Theorem 5.10].

In the following, when there is no danger of confusion, we shall use the terms characteristic matrix $X$ and characteristic matrix $X$ with span list $T$ interchangeably.

III Characteristic Matrices for a Tail-Biting Convolutional Code

Let $G(D)$ be a polynomial generator matrix of size $k_{0}\times n_{0}$ . Denote by $H(D)$ a corresponding polynomial check matrix. Both $G(D)$ and $H(D)$ are assumed to be canonical [18]. Consider a standard trellis of $N$ sections for a convolutional code defined by $G(D)$ . Here $\max(L,M)+1\leq N$ is assumed, where $L$ and $M$ are the memory lengths of $G(D)$ and $H(D)$ , respectively. The TB condition is a restriction that the encoder starts and ends in the same state. That is, only those paths in the trellis that start and end in the same state are admissible. We call such paths TB paths. Let $C$ be the set of all TB paths. In the following, we call $C$ a TB convolutional code of section length $N$ defined by $G(D)$ (cf. Fig.1 in Section V). When there is no danger of confusion, we will omit the phrase of section length $N$ . $C$ can be regarded as a linear block code $B^{tb}$ of length $n=n_{0}N$ . To simplify the notations, $B^{tb}$ is identified with $C$ and is denoted simply by $C$ . Let

[TABLE]

be the polynomial expansion of $G(D)$ , where $G_{i}~{}(0\leq i\leq L)$ are $k_{0}\times n_{0}$ matrices. Then the scalar generator matrix for $C~{}(=B^{tb})$ is given by

[TABLE]

with size $k\times n=k_{0}N\times n_{0}N$ [12]. Hence, we can say that a TB convolutional code $C$ is generated by $G_{N}^{tb}$ . In the following, we call $G_{N}^{tb}$ the tail-biting generator matrix (abbreviated TBGM) associated with a TB convolutional code $C$ defined by $G(D)$ , or simply the TBGM associated with $G(D)$ .

III-A Computation of Characteristic Matrices

Koetter and Vardy [13] have given an algorithm which can compute a characteristic matrix for a linear block code. Consider a TB convolutional code $C$ generated by $G_{N}^{tb}$ . Note that $\sigma_{n_{0}}(G_{N}^{tb})$ is equivalent to $G_{N}^{tb}$ . That is, $G_{N}^{tb}$ has a periodic structure of period $n_{0}$ . Using this property, a characteristic matrix $X$ for $C$ can be computed efficiently.

Let $C_{j}\stackrel{{\scriptstyle\triangle}}{{=}}\sigma_{j}(C)$ . $C_{j}$ is the code generated by $\sigma_{j}(G_{N}^{tb})$ . Let $X_{j}^{*}$ be a basis in minimal-span form for the code $C_{j}$ . Then a characteristic matrix $X$ for $C$ is defined as follows [13]:

[TABLE]

Since $\sigma_{n_{0}}(G_{N}^{tb})$ is equivalent to $G_{N}^{tb}$ , we have

[TABLE]

where

[TABLE]

Similarly, we have

[TABLE]

In general, for $i=1,\cdots,N-1$ , we have

[TABLE]

Hence,

[TABLE]

is obtained. Thus we have shown the following.

Proposition III.1

A characteristic matrix $X$ for a TB convolutional code $C$ generated by $G_{N}^{tb}$ is given by

[TABLE]

Corollary III.1

If the relation

[TABLE]

holds, then a characteristic matrix $X$ is given by

[TABLE]

Proof:

From the assumption, we have

[TABLE]

Similarly, we have

[TABLE]

Then, from Proposition 3.1, it follows that

[TABLE]

∎

We remark that in many practical applications, a characteristic matrix for a TB convolutional code is obtained based on the above corollary.

Example 1: Consider the TB convolutional code of section length $N=3$ defined by

[TABLE]

The associated TBGM is given by

[TABLE]

In this case, we have

[TABLE]

By applying $\rho_{3}$ and $\rho_{6}$ to these matrices, a characteristic matrix $X$ is obtained as follows:

[TABLE]

Note that the spans $(0,5],~{}(1,7],~{}(2,0],~{}(3,8],~{}(4,1]$ are connected with $X_{0}^{*}\cup\rho_{1}(X_{1}^{*})\cup\rho_{2}(X_{2}^{*})$ , whereas the spans $(0,5],~{}(1,7],~{}(3,8],~{}(4,1],~{}(6,2]$ are connected with $X_{0}^{*}\cup\rho_{3}(X_{0}^{*})$ . Hence,

[TABLE]

We see that a characteristic matrix $X$ cannot be obtained simply by applying $\rho_{3}$ and $\rho_{6}$ to $X_{0}^{*}$ .

III-B Structure of the Characteristic Span List

Let $(X,T)$ , where

[TABLE]

be a characteristic matrix for $C$ with span list $T$ , then $(\sigma_{1}(X),\sigma_{1}(T))$ is a characteristic matrix for $\sigma_{1}(C)$ with span list $\sigma_{1}(T)$ [9, Remark III.9 (b)], where

[TABLE]

Using repeatedly this relation, we see that $(\sigma_{j}(X),\sigma_{j}(T))$ is a characteristic matrix for $\sigma_{j}(C)$ with span list $\sigma_{j}(T)$ . Consider a TB convolutional code $C$ generated by $G_{N}^{tb}$ and set $j$ to $n_{0}$ . Since $\sigma_{n_{0}}(G_{N}^{tb})$ is equivalent to $G_{N}^{tb}$ , $\sigma_{n_{0}}(C)=C$ holds. Thus we have the following.

Lemma III.1

Let $C$ be a TB convolutional code generated by $G_{N}^{tb}$ . If $(X,T)$ is a characteristic matrix for $C$ with span list $T$ , then $(\sigma_{n_{0}}(X),\sigma_{n_{0}}(T))$ is also a characteristic matrix for $C$ with span list $\sigma_{n_{0}}(T)$ . Let

[TABLE]

Then $\sigma_{n_{0}}(T)$ is given by

[TABLE]

Since the characteristic span list is uniquely determined, $T$ and $\sigma_{n_{0}}(T)$ coincide up to ordering.

Proposition III.2

The characteristic span list $T$ of a TB convolutional code $C$ generated by $G_{N}^{tb}$ consists of the set of basic spans

[TABLE]

and $\rho_{in_{0}}(T_{0})~{}(i=1,2,\cdots,N-1)$ .

Proof:

Suppose that the spans in $T$ are sorted such that

[TABLE]

Then we have

[TABLE]

Here take notice of the following set of spans in $T$ :

[TABLE]

In $\sigma_{n_{0}}(T)$ , it is transformed to

[TABLE]

Since $T$ and $\sigma_{n_{0}}(T)$ coincide up to ordering,

[TABLE]

holds. Hence, we have

[TABLE]

Similarly, the set of spans

[TABLE]

is transformed to

[TABLE]

Then for the same reason,

[TABLE]

holds. Hence, we have

[TABLE]

Continuing the same argument, we have

[TABLE]

for $i=1,\cdots,N-1$ . ∎

Example 2: Consider the TB convolutional code of section length $N=3$ defined by the rate $R=2/3$ encoder

[TABLE]

Using the associated TBGM, i.e.,

[TABLE]

a charactreristic matrix $X$ is computed as follows:

[TABLE]

We see that the characteristic span list $T$ consists of the set of basic spans

[TABLE]

and its right cyclic shifts by $3$ and $6$ positions.

III-C Counting Characteristic Matrices

Recall the definition of a characteristic matrix $X$ for a given code $C$ , i.e.,

[TABLE]

where $X_{j}^{*}$ is a basis in minimal-span form for the code $C_{j}=\sigma_{j}(C)$ . Note that $X_{j}^{*}$ is not necessarily unique. Hence, $X$ is not uniquely determined [9]. With respect to this subject, Weaver [32] discussed the relationship between the characteristic span list of $C$ and the number of characteristic matrices for $C$ .

Let $T=\{(a_{l},b_{l}]:~{}l=1,2,\cdots,n\}$ be the characteristic span list of $C$ . Define the set $\Theta_{l}$ as follows [32]:

[TABLE]

$|\Theta_{l}|$ represents the number of spans (in $T$ ) included in a specified span $(a_{l},b_{l}]$ . Weaver [32] proved the following.

Lemma III.2 (Weaver [32])

Let $(a_{l},b_{l}]$ be a characteristic span of $C$ . Then there exist $2^{|\Theta_{l}|}$ characteristic generators for $C$ having this span.

This fact is derived from the next observation:

Let $(a_{r},b_{r}]\subsetneq(a_{l},b_{l}]$ and consider two characteristic generators $x_{r}$ and $x_{l}$ with spans $(a_{r},b_{r}]$ and $(a_{l},b_{l}]$ , respectively. Then $x_{l}+x_{r}$ is also a characteristic generator with span $(a_{l},b_{l}]$ .

Consider a TB convolutional code $C$ generated by $G_{N}^{tb}$ . We have already shown that the characteristic span list $T$ of $C$ consists of the set of basic spans

[TABLE]

and $\rho_{in_{0}}(T_{0})~{}(i=1,2,\cdots,N-1)$ . Hence, it suffices to consider the spans in $T_{0}$ for the purpose of counting the number of characteristic matrices. Define $\Theta_{i}~{}(1\leq i\leq n_{0})$ as follows:

[TABLE]

Also, let

[TABLE]

Then we have the following:

•

There exist $2^{\theta_{1}}$ characteristic generators having span $(0,b_{0}]$ .

•

There exist $2^{\theta_{2}}$ characteristic generators having span $(1,b_{1}]$ .

$\cdots$
•

There exist $2^{\theta_{n_{0}}}$ characteristic generators having span $(n_{0}-1,b_{n_{0}-1}]$ .

As a result, the degree of freedom related to the spans in $T_{0}$ is given by

[TABLE]

Since this degree of freedom is common to other $(N-1)$ blocks of spans in $T$ , the overall degree of freedom related to $T$ becomes

[TABLE]

Thus we have shown the following.

Proposition III.3

Let $C$ be a TB convolutional code generated by $G_{N}^{tb}$ . Let $\theta_{i}~{}(1\leq i\leq n_{0})$ and $\theta$ be as above. Then there exist $2^{\theta N}$ characteristic matrices for $C$ .

Example 2 (Continued): Take notice of the first three rows of the characteristic matrix $X$ . We have

[TABLE]

Hence, $\theta=1+0+0=1$ and there exist $2^{1\times 3}=8$ characteristic matrices.

III-D Span Lengths of Characteristic Generators

Let $[x]=(a,b]$ be a span of a codeword $x$ . Then the span length of $x$ is defined by $|[a,b]|$ , i.e., the number of elements in the closed interval $[a,b]$ . When a span alone is referred to without specifying the accompanied codeword, we use the term the span length of a span $(a,b]$ . Let $T$ be the characteristic span list of a TB convolutional code $C$ generated by $G_{N}^{tb}$ . Suppose that the spans in $T$ are sorted such that

[TABLE]

Then by [13, Theorem 5.10],

[TABLE]

holds. Due to the structure of $T$ (see Proposition 3.2), the left-hand side of the above equality becomes

[TABLE]

where

[TABLE]

In the derivation, we also used the relation

[TABLE]

Replacing $n$ and $k$ by $n_{0}N$ and $k_{0}N$ , respectively, the above equality reduces to

[TABLE]

Thus we have shown the following.

Proposition III.4

Let $T$ be the characteristic span list of a TB convolutional code $C$ generated by $G_{N}^{tb}$ . Denote by $T_{0}$ the set of basic spans in $T$ . Then the sum $\ell$ of span lengths of spans in $T_{0}$ is given by

[TABLE]

Example 2 (Continued): We have

[TABLE]

Also, we have

[TABLE]

IV Transformations of $G(D)$ and the Corresponding TBGM’s

In this section, we discuss the relationship between transformations of a generator matrix $G(D)$ and the corresponding TBGM’s ( $G_{N}^{tb}\mbox{'}s$ ). We consider the following transformations of $G(D)$ :

a)

Dividing the $j$ th column by $D^{p}$ .

b)

Multiplying the $j$ th column by $D^{q}$ .

c)

Adding the $i$ th row multiplied by $D^{q}$ to the $j$ th row.

d)

Implicit transformations.

In the next section, we will see that these transformations play an essential role in trellis reduction for TB convolutional codes.

IV-A Dividing a Column of $G(D)$ by $D^{p}$

Suppose that the $j$ th column of $G(D)$ has a monomial factor $D^{p}~{}(1\leq p\leq L)$ . We can assume without loss of generality that $j=1$ and $p=1$ . Hence, $G(D)$ has the form

[TABLE]

Let

[TABLE]

be the polynomial expansion of $G(D)$ . Comparing the $(i,1)~{}(1\leq i\leq k_{0})$ entries, we have

[TABLE]

By these equations, we have

[TABLE]

Dividing the first column of $G(D)$ by $D$ , let the resulting matrix be $G^{\prime}(D)$ . Then $G^{\prime}(D)$ has the polynomial expansion:

[TABLE]

Consider the TBGM associated with $G^{\prime}(D)$ , denoted by $G_{N}^{\prime tb}$ , where

[TABLE]

Note that both $G_{N}^{tb}$ and $G_{N}^{\prime tb}$ can be regarded as matrices having $N$ blocks of $n_{0}$ columns. Then in view of the entries of $G_{i}^{\prime}~{}(0\leq i\leq L)$ and the relation

[TABLE]

$G_{N}^{\prime tb}$ is obtained from $G_{N}^{tb}$ by cyclically shifting the first column of each block to the left by $n_{0}$ positions. Thus we have the following.

Proposition IV.1

Regard $G_{N}^{tb}$ as a matrix having $N$ blocks of $n_{0}$ columns. Suppose that the $j$ th column of $G(D)$ has a monomial factor $D^{p}~{}(1\leq p\leq L)$ . Then dividing the $j$ th column of $G(D)$ by $D^{p}$ is equivalent to cyclically shifting the $j$ th column of each block of $G_{N}^{tb}$ to the left by $pn_{0}$ positions.

Let $C$ be a TB convolutional code of section length $N$ defined by $G(D)$ . Note that each codeword in $C$ consists of $N$ blocks of $n_{0}$ components. Here let us cyclically shift the $j$ th component of each block to the left by $pn_{0}$ positions. Denote by $C^{\prime}$ the set of resulting (modified) codewords. We have already shown that $G_{N}^{\prime tb}$ is obtained from $G_{N}^{tb}$ by cyclically shifting the $j$ th column of each block to the left by $pn_{0}$ positions. Hence, $C^{\prime}$ is generated by $G_{N}^{\prime tb}$ . In words, $C^{\prime}$ is represented as a TB convolutional code defined by $G^{\prime}(D)$ .

IV-B Multiplying a Column of $G(D)$ by $D^{q}$

Consider multiplication of the $j$ th column of $G(D)$ by $D^{q}$ , where $q+L+1\leq N$ . In the following, we assume without loss of generality that $j=1$ C $q=1~{}(L+2\leq N)$ . Hence, the resulting matrix $G^{\prime}(D)$ has the form

[TABLE]

Then we have

[TABLE]

Accordingly, the polynomial expansion of $G^{\prime}(D)$ becomes

[TABLE]

Consider the TBGM associated with $G^{\prime}(D)$ , denoted by $G_{N}^{\prime tb}$ , where

[TABLE]

Note that $G_{N}^{tb}$ and $G_{N}^{\prime tb}$ consist of $N$ blocks of $n_{0}$ columns as above. In view of the entries of $G_{i}^{\prime}~{}(0\leq i\leq L+1)$ , we see that $G_{N}^{\prime tb}$ is obtained from $G_{N}^{tb}$ by cyclically shifting the first column of each block to the right by $n_{0}$ positions. Thus we have the following.

Proposition IV.2

Regard $G_{N}^{tb}$ as a matrix having $N$ blocks of $n_{0}$ columns. Suppose that $q+L+1\leq N$ . Then multiplying the $j$ th column of $G(D)$ by $D^{q}$ is equivalent to cyclically shifting the $j$ th column of each block of $G_{N}^{tb}$ to the right by $qn_{0}$ positions.

Remark: In order for $G_{N}^{\prime tb}$ to be defined, the condition $q+L+1\leq N$ is required.

Let $C$ be a TB convolutional code of section length $N$ with generator matrix $G(D)$ . Let $C^{\prime}$ be as in the previous section. In this case, however, the $j$ th component of each block is cyclically shifted to the right by $qn_{0}$ positions. We have shown that $G_{N}^{\prime tb}$ is obtained from $G_{N}^{tb}$ by cyclically shifting the $j$ th column of each block to the right by $qn_{0}$ positions. Hence, $C^{\prime}$ is generated by $G_{N}^{\prime tb}$ . In words, $C^{\prime}$ is represented as a TB convolutional code defined by $G^{\prime}(D)$ .

IV-C $g_{j}(D)\leftarrow g_{j}(D)+D^{q}g_{i}(D)$

Consider addition of the $i$ th row $g_{i}(D)$ multiplied by $D^{q}$ to the $j$ th row $g_{j}(D)$ , denoted by $g_{j}(D)\leftarrow g_{j}(D)+D^{q}g_{i}(D)$ , where $q+L+1\leq N$ . In the following, we assume without loss of generality that $i=1$ and $j=2$ . Let the first row of $G(D)$ be

[TABLE]

Also, let

[TABLE]

be the polynomial expansion, where the size of $g_{i}~{}(0\leq i\leq L)$ is $1\times n_{0}$ . Then the polynomial expansion of $g_{1}(D)D^{q}$ becomes

[TABLE]

Note that the first row of $G_{N}^{tb}$ is expressed as

[TABLE]

Hence, its right cyclic shift by $qn_{0}$ positions, i.e.,

[TABLE]

coincides with the $(qk_{0}+1)$ th row of $G_{N}^{tb}$ . That is, $g_{2}(D)\leftarrow g_{2}(D)+D^{q}g_{1}(D)$ corresponds to addition of the $(qk_{0}+1)$ th row to the second row within the matrix $G_{N}^{tb}$ . Note that this is an elementary row operation. Thus we have the following.

Proposition IV.3

Suppose that $q+L+1\leq N$ . Consider the operation $g_{j}(D)\leftarrow g_{j}(D)+D^{q}g_{i}(D)$ . Let the resulting matrix be $G^{\prime}(D)$ and the associated TBGM be $G_{N}^{\prime tb}$ . Then $G_{N}^{\prime tb}$ is equivalent to $G_{N}^{tb}$ .

Taking into consideration Proposition 4.3, let us introduce a useful notion. Let $C$ and $C^{\prime}$ be TB convolutional codes of section length $N$ defined by $G(D)$ and $G^{\prime}(D)$ , respectively. Denote by $L$ and $L^{\prime}$ the memory lengths of $G(D)$ and $G^{\prime}(D)$ , respectively, where $\max(L,L^{\prime})+1\leq N$ . Let $G_{N}^{tb}$ and $G_{N}^{\prime tb}$ be the TBGM’s associated with $C$ and $C^{\prime}$ , respectively. We see that if $G_{N}^{tb}$ and $G_{N}^{\prime tb}$ are equivalent, then $C=C^{\prime}$ . All of this leads to the following definition.

Definition IV.1

When $G_{N}^{tb}$ and $G_{N}^{\prime tb}$ are equivalent, we say that $G(D)$ and $G^{\prime}(D)$ are “TB-equivalent”.

Thus we have the following.

Proposition IV.4

If $G(D)$ and $G^{\prime}(D)$ are TB-equivalent, then a TB convolutional code defined by $G(D)$ is represented as a TB convolutional code defined by $G^{\prime}(D)$ , and vice versa.

Proof:

A direct consequence of the definition of TB-equivalent. ∎

V Trellis Reduction for TB Convolutional Codes

In this section, we will show that for a TB convolutional code of short to moderate section length, the associated TB trellis can be reduced. We begin with an example.

V-A An Example of Trellis Reduction

Consider the TB convolutional code $C$ defined by $G(D)=(1+D+D^{2},1+D^{2})$ , where the section length $N$ is set to $5$ . The corresponding TB trellis is shown in Fig.1, where the paths which start and end in the same state are TB paths (i.e., valid codewords). Then $C$ is the set of all TB paths. Since $G(D)$ has the polynomial expansion

[TABLE]

the TBGM associated with $C$ is given by

[TABLE]

Based on $G_{5}^{tb}$ , a characteristic matrix $X$ for $C$ is computed as follows:

[TABLE]

Choosing $5$ rows from $X$ , let

[TABLE]

We see that the rows of $G^{\prime}$ are linearly independent and thus generate $C$ , i.e., $G^{\prime}$ is equivalent to $G_{5}^{tb}$ .

Here note that $G^{\prime}$ consists of the first row and its right cyclic shifts by $i\times 2~{}(1\leq i\leq 4)$ positions. Accordingly, $G^{\prime}$ can be regarded as the TBGM associated with

[TABLE]

Hence, $C$ is equally represented as a TB convolutional code defined by $G^{\prime}(D)=(D^{3},1+D)$ . We remark that the constraint length of $G^{\prime}(D)$ is $\nu^{\prime}=3$ and is greater than that of $G(D)$ .

On the other hand, observe that the first column of $G^{\prime}(D)$ has a factor $D^{2}$ . Then dividing the first column by $D^{2}$ , we have

[TABLE]

Note that this transformation corresponds to cyclically shifting the first component of each branch (of a TB path) to the left by two branches (cf. Proposition 4.1). By this transformation, the original TB convolutional code is represented using a trellis associated with $G^{\prime\prime}(D)$ as well. The trellis for $G^{\prime\prime}(D)=(D,1+D)$ is shown in Fig.2. For example, take notice of the TB path in Fig.1 which starts and ends in state $(00)$ :

[TABLE]

Cyclically shifting the first component of each branch to the left by two branches, it becomes

[TABLE]

We see that the modified path is represented as a path which starts and ends in state (1) in Fig.2.

This example shows that there are cases where a given TB convolutional code is represented using a reduced trellis with less state complexity, if we allow partial cyclic shifts of a TB path.

V-B Trellis Reduction for TB Convolutional Codes

The argument in the previous section, though it was presented in terms of a specific example, is entirely general. Then the method can be directly extended to a general case. Let $G(D)$ be as in Section III. Denote by $\nu$ the constraint length of $G(D)$ . Consider a TB convolutional code $C$ of section length $N$ defined by $G(D)$ . The trellis reduction procedure becomes as follows.

Procedure for trellis reduction:

i)

Compute a characteristic matrix $X$ for $C$ based on the TBGM $G_{N}^{tb}$ , where $X$ consists of $n_{0}$ rows and their right cyclic shifts by integer multiple of $n_{0}$ .

ii)

Choosing $k$ rows from $X$ , form $G^{\prime}$ , where $G^{\prime}$ has the properties:

The rows of $G^{\prime}$ are linearly independent and thus generate $C$ .

2)

$G^{\prime}$ consists of $k_{0}$ rows and their right cyclic shifts by integer multiple of $n_{0}$ .

iii)

(Direct reduction) $G^{\prime}$ is regarded as the TBGM associated with another generator matrix $G^{\prime}(D)$ . Let $\nu^{\prime}$ be the constraint length of $G^{\prime}(D)$ . If $\nu^{\prime}<\nu$ , then trellis reduction for $C$ is realized.

iv)

(Indirect reduction) Even if $\nu^{\prime}\geq\nu$ , there are cases where $G^{\prime}(D)$ has a monomial factor $D^{p}$ in some ( $j$ th) column. Then there is a possibility that $\nu^{\prime}$ is reduced by dividing the $j$ th column of $G^{\prime}(D)$ by $D^{p}$ (the resulting matrix is denoted by $G^{\prime\prime}(D)$ ). Let $\nu^{\prime\prime}$ be the constraint length of $G^{\prime\prime}(D)$ . If $\nu^{\prime\prime}<\nu$ , then the original TB trellis can be reduced. That is, by cyclically shifting the $j$ th component of each branch of a TB path ( $\in C$ ) to the left by $p$ branches, the set of modified paths are equally represented as a TB convolutional code defined by $G^{\prime\prime}(D)$ (this is justified by Proposition 4.1). Thus trellis reduction is accomplished.

v)

$X$ is not necessarily unique [9]. Hence, if necessary, try i) $\sim$ iv) using another characteristic matrix $X^{\prime}$ for $C$ .

Remark: For row rate codes, it is rather easy to find $G^{\prime}$ which is equivalent to $G_{N}^{tb}$ . Also, row rate codes make it easy to determine whether $G^{\prime}(D)$ can be reduced or not.

As is stated above, there are some restrictions on the selection of $X$ and $G^{\prime}$ . We have the following.

Proposition V.1

The number of characteristic matrices $X\mbox{'}s$ in i) is given by $2^{\theta}$ , where $\theta$ is defined in Section III-C. For a fixed $X$ , the number of $G^{\prime}\mbox{'}s$ which satisfy the condition 2) in ii) is given by ${}_{n_{0}}\mbox{C}_{k_{0}}$ .

Proof:

$G^{\prime}$ is a candidate for a TBGM associated with an encoder. Hence, the above is a consequence of the structure of TBGM. ∎

Example 3: Consider the TB convolutional code $C$ of section lengh $N=6$ defined by

[TABLE]

Using the associated TBGM, i.e., $G_{6}^{tb}$ , a characteristic matrix $X$ for $C$ is computed as follows:

[TABLE]

Choosing $6$ rows from $X$ , define $G^{\prime}$ as

[TABLE]

We see that $G^{\prime}$ is equivalent to $G_{6}^{tb}$ . Also, we see that $G^{\prime}$ is the TBGM associated with

[TABLE]

Note that the constraint length $\nu^{\prime}=3$ of $G^{\prime}(D)$ is not reduced compared to that of $G(D)$ . On the other hand, observe that the second column of $G^{\prime}(D)$ has a factor $D$ . Then dividing the column by $D$ , we have

[TABLE]

This transformation corresponds to cyclically shifting the second component of each branch of a TB path to the left by one branch (cf. Proposition 4.1). As a result, the modified paths are represented using the trellis for $G^{\prime\prime}(D)$ . Thus trellis reduction for $C$ is accomplished.

Remark: As is stated above, $X$ is not necessarily unique. For example, if a characteristic matrix

[TABLE]

is used, then trellis reduction cannot be realized using the above procedure.

Using appropriate characteristic matrices, the above reduction method can also be applied to the following cases:

(1)

$R=1/2,~{}\nu=4,~{}N=6:$

[TABLE]

(2)

$R=1/2,~{}\nu=5,~{}N=10:$

[TABLE]

(3)

$R=1/2,~{}\nu=6,~{}N=8:$

[TABLE]

(4)

$R=1/2,~{}\nu=6,~{}N=8:$

[TABLE]

(5)

$R=1/3,~{}\nu=3,~{}N=5:$

[TABLE]

(6)

$R=2/3,~{}\nu=4,~{}N=6:$

[TABLE]

V-C Trellis Reduction Using a Reciprocal Dual Encoder

For high rate codes, $G^{\prime}(D)$ may not have a monomial factor in any columns. Then it is not easily determined whether $G^{\prime}(D)$ can be reduced or not. In such cases, it is useful to consider a reciprocal dual encoder $\tilde{H}^{\prime}(D)$ associated with $G^{\prime}(D)$ . A reciprocal dual encoder [21] is defined as follows: Let $G(D)$ be as in Section III. Also, let $H(D)$ be a corresponding check matrix with size $(n_{0}-k_{0})\times n_{0}$ . A reciprocal dual encoder $\tilde{H}(D)$ is obtained by substituting $D^{-1}$ for $D$ in $H(D)$ and by multiplying the $i$ th ( $1\leq i\leq n_{0}-k_{0}$ ) row of the resulting matrix by $D^{\nu_{i}^{\perp}}$ , where $\nu_{i}^{\perp}$ is the degree of the $i$ th row of $H(D)$ .

Definition V.1 (McEliece and Lin [18])

Let $G_{scalar}$ be a scalar generator matrix for a terminated convolutional code defined by $G(D)$ [18, 21]. $G_{scalar}$ is given by

[TABLE]

The $(L+1)k_{0}\times n_{0}$ matrix

[TABLE]

which repeatedly appears as a vertical slice in $G_{scalar}$ except initial and final transient sections, is called the matrix module. Then the trellis module $\mathcal{T}_{0}$ for the trellis associated with $G_{scalar}$ corresponds to $\hat{G}$ . If $G_{scalar}$ is in minimal-span form, then $\mathcal{T}_{0}$ is minimal. The state complexity profile of $\mathcal{T}_{0}$ is an $n_{0}$ -tuple consisting of the dimensions of state spaces $V_{i}~{}(0\leq i\leq n_{0}-1)$ of $\mathcal{T}_{0}$ .

The meaning of obtaining a reciprocal dual encoder is based on the following result [26, 30, 31].

Proposition V.2 (Tang and Lin [30])

Consider a minimal trellis module of $G(D)$ and that of an associated reciprocal dual encoder $\tilde{H}(D)$ . Then their state complex profiles are identical.

Hence, in order to determine whether $G^{\prime}(D)$ is reduced or not, we can compute a reciprocal dual encoder $\tilde{H}^{\prime}(D)$ associated with $G^{\prime}(D)$ . In connection with an encoder $G(D)$ and an associated reciprocal dual encoder $\tilde{H}(D)$ , we have the following.

Proposition V.3

Let $G_{N}^{tb}$ be the TBGM associated with $G(D)$ . Then a check matrix corresponding to $G_{N}^{tb}$ is obtained as the TBGM (denoted by $\tilde{H}_{N}^{tb}$ ) associated with a reciprocal dual encoder $\tilde{H}(D)$ .

Proof:

Let the polynomial expansion of $H(D)$ be

[TABLE]

where $M$ is the memory length of $H(D)$ and $H_{i}~{}(0\leq i\leq M)$ are $(n_{0}-k_{0})\times n_{0}$ matrices. It is known (e.g., [28]) that a check matrix corresponding to $G_{N}^{tb}$ is given by

[TABLE]

with size $(n_{0}-k_{0})N\times n_{0}N$ . On the other hand, let the polynomial expansion of $\tilde{H}(D)$ be

[TABLE]

Then the TBGM associated with $\tilde{H}(D)$ (denoted by $\tilde{H}_{N}^{tb}$ ) is defined by

[TABLE]

with size $(n_{0}-k_{0})N\times n_{0}N$ .

Here take notice of the $i$ th ( $1\leq i\leq n_{0}-k_{0}$ ) row of

[TABLE]

We see that the row is identical to the $i$ th row of

[TABLE]

Similarly, the $i$ th ( $1\leq i\leq n_{0}-k_{0}$ ) row of

[TABLE]

is identical to the $i$ th row of

[TABLE]

Due to the cyclic structures of $H^{tb}$ and $\tilde{H}_{N}^{tb}$ , similar correspondences hold successively. Hence, $\tilde{H}_{N}^{tb}$ is given as a row permutation of $H^{tb}$ . ∎

A procedure for computing $\tilde{H}^{\prime}(D)$ is obtained based on the above proposition.

Procedure for computing $\tilde{H}^{\prime}(D)$ :

i)

Compute a characteristic matrix $Y$ for the dual code $C^{\perp}$ based on $\tilde{H}_{N}^{tb}$ , where $Y$ consists of $n_{0}$ rows and their right cyclic shifts by integer multiple of $n_{0}$ .

ii)

Choosing $(n-k)$ rows from $Y$ , form $\tilde{H}^{\prime}$ , where $\tilde{H}^{\prime}$ has the properties:

The rows of $\tilde{H}^{\prime}$ are linearly independent and thus generate $C^{\perp}$ .

2)

$\tilde{H}^{\prime}$ consists of $(n_{0}-k_{0})$ rows and their right cyclic shifts by integer multiple of $n_{0}$ .

iii)

Let $\tilde{H}^{\prime}(D)$ be the polynomial matrix whose TBGM is $\tilde{H}^{\prime}$ .

iv)

Note that $G^{\prime}$ and $\tilde{H}^{\prime}$ are equivalent to $G_{N}^{tb}$ and $\tilde{H}_{N}^{tb}$ , respectively. Hence, $\tilde{H}^{\prime}$ is a check matrix corresponding to $G^{\prime}$ . Then it follows from Proposition 5.3 that $\tilde{H}^{\prime}(D)$ is a reciprocal dual encoder associated with $G^{\prime}(D)$ .

v)

$Y$ is not necessarily unique. Hence, if necessary, try i) $\sim$ iv) using another characteristic matrix $Y^{\prime}$ for $C^{\perp}$ .

The following is an example where trellis reduction is realized using a reciprocal dual encoder.

Example 4: Consider the rate $R=2/3$ TB convolutional code $C$ of section length $N=5$ with generator matrix

[TABLE]

Based on the associated TBGM, i.e.,

[TABLE]

a characteristic matrix $X$ for $C$ is computed as follows:

[TABLE]

The span list for $X$ is given by

[TABLE]

Choosing $10$ rows from $X$ , let

[TABLE]

The span list for $G^{\prime}$ is given by

[TABLE]

We see that the rows of $G^{\prime}$ are linearly independent and thus generate $C$ , i.e., $G^{\prime}$ is equivalent to $G_{5}^{tb}$ . Also, note that $G^{\prime}$ is the TBGM associated with

[TABLE]

Hence, the original TB convolutional code is equally represented as a TB convolutional code defined by $G^{\prime}(D)$ .

Observe that the constraint length of $G^{\prime}(D)$ is $\nu^{\prime}=3$ and is equal to that of $G(D)$ . Also, notice that the second column of $G^{\prime}(D)$ has a factor $D$ . However, $\nu^{\prime}$ is not reduced by dividing the column by $D$ . In general, it is difficult to tell a possibility of reduction of $G^{\prime}(D)$ just by looking at its entries. So, we will compute a reciprocal dual encoder $\tilde{H}^{\prime}(D)$ associated with $G^{\prime}(D)$ .

We begin with a reciprocal dual encoder $\tilde{H}(D)$ associated with $G(D)$ . $\tilde{H}(D)$ is given by

[TABLE]

Based on $\tilde{H}_{5}^{tb}$ , a characteristic matrix $Y$ for $C^{\perp}$ is computed as follows:

[TABLE]

The span list for $Y$ is given by

[TABLE]

Note that if the span list for $X$ is $T=\{(a_{l},b_{l}],~{}l=1,\cdots,15\}$ , then the span list for $Y$ is given by $\hat{T}=\{(b_{l},a_{l}],~{}l=1,\cdots,15\}$ [10, 13].

Next, choosing $5$ rows from $Y$ , let

[TABLE]

The span list for $\tilde{H}^{\prime}$ is given by

[TABLE]

We see that $\tilde{H}^{\prime}$ is equivalent to $\tilde{H}_{5}^{tb}$ . Thus $\tilde{H}^{\prime}$ is a scalar check matrix corresponding to $G^{\prime}$ . Also, note that $\tilde{H}^{\prime}$ is the TBGM associated with $\tilde{H}^{\prime}(D)=(D+D^{2},D^{3},1)$ . We already know that $G^{\prime}$ is the TBGM associated with $G^{\prime}(D)$ . Hence, by Proposition 5.3, a reciprocal dual encoder associated with $G^{\prime}(D)$ is given by

[TABLE]

Observe that $\tilde{H}^{\prime}(D)=(D+D^{2},D^{3},1)$ has a factor $D$ in the first column and a factor $D^{2}$ in the second column. Then sweeping these factors out of the corresponding columns, the constraint length of $\tilde{H}^{\prime}(D)$ is reduced to one. This fact implies that the constraint length of $G^{\prime}(D)$ can also be reduced.

In the following, we will show that reduction of $G^{\prime}(D)$ is actually realized. For the purpose, a check matrix corresponding to $G^{\prime}(D)$ , i.e.,

[TABLE]

is used.

Let $G(D)$ and $H(D)$ be a generator matrix and a corresponding check matrix for a convolutional code, respectively. In the following, this relation is denoted by $G(D)\Leftrightarrow H(D)$ . It is shown [27] that $G(D)$ and $H(D)$ can be reduced simultaneously, if reduction is possible, where the relation $\Leftrightarrow$ is retained in the whole reduction process. We apply the method to our case under consideration.

Step 1: For $G^{\prime}(D)$ , add the first row multiplied by $D$ to the second row. By Proposition 4.3, this is a TB-equivalent transformation. As a result, we have

[TABLE]

Step 2: Divide the second column of $G^{\prime\prime}(D)$ by $D$ , while divide the first and third columns of $H^{\prime}(D)$ by $D$ . Then we have

[TABLE]

Step 3: Multiply the third column of $G^{\prime\prime\prime}(D)$ by $D$ , while divide the third column of $H^{\prime\prime\prime}(D)$ by $D$ . Then we have

[TABLE]

Step 4: Note that $G^{(4)}(D)=\left(\begin{array}[]{ccc}1+D&1&D\\ D&D&D\end{array}\right)$ is not basic [5]. Using an invariant-factor decomposition [5] of $G^{(4)}(D)$ , an equivalent basic matrix

[TABLE]

is obtained. Note that $G^{(4)}(D)$ and $G^{(5)}(D)$ are TB-equivalent (cf. Proposition 4.4).

In the above reduction process for $G^{\prime}(D)$ , except for TB-equivalent transformations, the second column is divided by $D$ , whereas the third column is multiplied by $D$ . Accordingly, for each TB path, let us cyclically shift the second component of each branch to the left by one branch and cyclically shift the third component of each branch to the right by one branch. Then the modified TB paths are represented as a TB convolutional code defined by $G^{(5)}(D)$ (see Propositions 4.1 and 4.2). The trellis for $G^{(5)}(D)$ is shown in Fig.3. For example, take an information sequence

[TABLE]

and the corresponding TB path

[TABLE]

By cyclically shifting the second component of each branch to the left by one branch, and by cyclically shifting the third component of each branch to the right by one branch, we have

[TABLE]

We see that $\mbox{\boldmath$ w $}_{m}$ is a TB path which starts and ends in state $(0)$ in Fig.3.

Remark: We remark that in the above argument, it is assumed that $G^{\prime}$ is equivalent to $G_{5}^{tb}$ (i.e., the equivalence has been checked beforehand). In general, however, $k=k_{0}N$ is relatively large for high rate codes. Hence, it is preferable that the equivalence of $G^{\prime}$ and $G_{5}^{tb}$ is derived without checking it beforehand. Actually, the equivalence of $G^{\prime}$ and $G_{5}^{tb}$ is derived from the equivalence of $\tilde{H}^{\prime}$ and $\tilde{H}_{5}^{tb}$ using the result of Gluesing-Luerssen and Weaver [10, Theorem IV.3] (see Appendix A).

V-D Relation Between Trellis Reduction and Section Length

In the proposed trellis reduction method, the section length $N$ is an important parameter. Actually, the method is effective for TB convolutional codes of short to moderate section length. This is because the span lengths of characteristic generators increase as $N$ grows (see Section III-D). We have already shown that a TB trellis with generator matrix $G(D)=(1+D+D^{2},1+D^{2})$ can be reduced for the case of $N=5$ . Consider the same trellis. This time, however, $N$ is set to $6$ . Then $G_{6}^{tb}$ is given by

[TABLE]

Note that to each generator in $G_{6}^{tb}$ , its span is assigned in the natural manner. Observe that the span lengths of these spans are the same, i.e., $6$ . A characteristic matrix is computed as follows:

[TABLE]

Thus the set of basic spans is given by

[TABLE]

With respect to $G^{\prime}$ , there are two cases. When the first row of $X$ is used as a basic generator of $G^{\prime}$ , $G^{\prime}$ is identical to $G_{6}^{tb}$ . When the second row of $X$ is used as a basic generator of $G^{\prime}$ , the span lengths of rows of $G^{\prime}$ are $8$ and are greater than $6$ . These facts mean that in either case, trellis reduction is not realized using the proposed method. On the other hand, this example implies that the upper bound for $N$ can be estimated by comparing the span lengths of generators in $G^{\prime}$ with those of generators in $G_{N}^{tb}$ .

Let $X$ be a characteristic matrix for a TB convolutional code of section length $N$ . We already know that the associated span list $T$ consists of the set of basic spans

[TABLE]

and $\rho_{in_{0}}(T_{0})~{}(i=1,2,\cdots,N-1)$ . Also, the sum of span lengths of spans in $T_{0}$ is given by

[TABLE]

In the proposed method, $G^{\prime}$ consists of $k$ generators in $X$ . From a span viewpoint, this corresponds to choosing $k_{0}$ spans from $T_{0}$ . Accordingly, the sum of span lengths of these $k_{0}$ spans, denoted by $\ell^{\prime}$ , is approximated by

[TABLE]

On the other hand, consider $G_{N}^{tb}$ , where to each generator, its span is assigned in the natural manner. Then the span list consists of the set of basic spans $\hat{T}_{0}$ and $\rho_{in_{0}}(\hat{T}_{0})~{}(i=1,2,\cdots,N-1)$ . We evaluate the sum of span lengths of spans in $\hat{T}_{0}$ , denoted by $\hat{\ell}$ . Let $\nu_{i}$ be the degree of the $i$ th row of $G(D)$ . Here take notice of the first block of $k_{0}$ rows in $G_{N}^{tb}$ , i.e.,

[TABLE]

The span length of the $i$ th ( $1\leq i\leq k_{0}$ ) row is approximated by $n_{0}(\nu_{i}+1)$ . Hence, we have

[TABLE]

where $\nu\stackrel{{\scriptstyle\triangle}}{{=}}\nu_{1}+\nu_{2}+\cdots+\nu_{k_{0}}$ is the constraint length of $G(D)$ . Since trellis reduction is realized in the case where $G^{\prime}$ consists of generators with short span length, we can take the inequality

[TABLE]

as a criterion for trellis reduction. That is, we can estimate the upper bound for $N$ using the inequality

[TABLE]

For several concrete cases, we will show the condition $(\sharp)$ .

(1)

$R=1/2$ :

[TABLE]

(2)

$R=1/3$ :

[TABLE]

(3)

$R=2/3$ :

[TABLE]

We observe that the TB convolutional codes presented in Section V-B all satisfy the condition $(\sharp)$ . Also, the rate $R=2/3$ TB convolutional code discussed in the previous section satisfies the condition $(\sharp)$ .

VI Conclusion

In this paper, we have derived several basic properties of a characteristic matrix for a TB convolutional code. We have shown that the characteristic span list consists of some basic spans and their right cyclic shifts. Using the derived results, we have shown that a trellis associated with a given TB convolutional code can be reduced in some cases. As candidates for trellis reduction, we have taken the generator matrices from the tables in [12, Chapter 8] in principle. For example, the rate $R=1/2$ encoders in Section V-B were chosen from [12, TABLE 8.1]. On the other hand, good TB convolutional encoders have been obtained [12, 25]. Here, for a given rate $R=k_{0}/n_{0}$ , the optimal encoder of memory length $L$ produces the largest minimum distance $d$ for each section length $N$ . We have applied the proposed reduction method to some of such encoders (see [12, TABLE 8.19]). As a result, for example, we have obtained $G=(6,7)$ ( $\nu=2,~{}N=5$ ) from $G=(50,64)$ ( $\nu=3,~{}N=5$ ), where the octal notation for generator matrices is used. Similarly, we have obtained $G=(54,60)$ ( $\nu=3,~{}N=6$ ) from $G=(46,60)$ ( $\nu=4,~{}N=6$ ). Note that both $G=(6,7)$ ( $\nu=2,~{}N=5$ ) and $G=(54,60)$ ( $\nu=3,~{}N=6$ ) are listed in the same table.

Finally, we remark that the proposed trellis reduction method depends on the choice of a characteristic matrix for a given convolutional code. Though the number of characteristic matrices to be examined is rather restricted (cf. Proposition 5.1), the method is not fully constructive. Also, a detailed condition that trellis reduction is realized has to be clarified.

Appendix A Proof of the equivalence of $G^{\prime}$ and $G_{5}^{tb}$

We first prove the following.

Proposition A.1

Let $G(D)$ be as in Section III. Consider a TB convolutional code $C$ generated by $G_{N}^{tb}$ and the corresponding dual code $C^{\perp}$ . Let $X$ be a characteristic matrix for $C$ with span list $T$ . Also, let $Y$ be a characteristic matrix for $C^{\perp}$ with span list $\hat{T}$ . Let $\tilde{X}$ and $\tilde{Y}$ be submatrices of $X$ and $Y$ , respectively, where $\tilde{X}$ consists of $k$ rows in $X$ , whereas $\tilde{Y}$ consists of $(n-k)$ rows in $Y$ . Denote by $S$ and $\hat{S}$ the span lists for $\tilde{X}$ and $\tilde{Y}$ , respectively. Here assume the following:

i)

$\tilde{X}$ * consists of $k_{0}$ rows and their right cyclic shifts by integer multiple of $n_{0}$ .*

ii)

Each span in $S$ does not include any spans in $T$ except itself.

iii)

The rows of $\tilde{Y}$ are linearly independent and thus generate $C^{\perp}$ .

iv)

$\tilde{Y}$ * consists of $(n_{0}-k_{0})$ rows and their right cyclic shifts by integer multiple of $n_{0}$ .*

v)

Each span in $\hat{S}$ does not include any spans in $\hat{T}$ except itself.

vi)

$S$ * and $\hat{S}$ satisfy $(b,a]\in\hat{S}\leftrightarrow(a,b]\notin S$ (that is, $\hat{S}$ consists of the spans in $\hat{T}$ whose reverse is not in $S$ ).*

Then the rows of $\tilde{X}$ are linearly independent, thus generate $C$ , i.e., $\tilde{X}$ is equivalent to $G_{N}^{tb}$ .

Remark: When $\tilde{X}$ consists of generators in $X$ with short span length, it is probable that the condition ii) holds. Similarly, when $\tilde{Y}$ consists of generators in $Y$ with short span length, it is probable that the condition v) holds.

Proof:

From ii), it follows that $\tilde{X}$ is common to all the characteristic matrices for $C$ . Similarly, from v), it follows that $\tilde{Y}$ is common to all the characteristic matrices for $C^{\perp}$ . Also, by vi), $(\tilde{X},\tilde{Y})$ is a dual selection of $(X,Y)$ [10, Definition IV.2]. As a result [10, Theorem IV.3], we have

$\mbox{rank}~{}\tilde{X}=k\leftrightarrow\mbox{rank}~{}\tilde{Y}=n-k$

2)

Let $\mbox{rank}~{}\tilde{X}=k$ . Then the KV trellises [10] based on $(\tilde{X},S)$ and $(\tilde{Y},\hat{S})$ are dual to each other.

By iii), $\mbox{rank}~{}\tilde{Y}=n-k$ . Hence, by 1), $\mbox{rank}~{}\tilde{X}=k$ . ∎

Let us go back to Example 4. In this example, the code $C$ generated by $G_{5}^{tb}$ and the dual code $C^{\perp}$ generated by $\tilde{H}_{5}^{tb}$ are considered. $X$ is a characteristic matrix for $C$ , whereas $Y$ is a characteristic matrix for $C^{\perp}$ . Note that neither $X$ nor $Y$ are unique. These are observed from the relation of inclusion in the associated span lists $T$ and $\hat{T}$ , where

[TABLE]

Next, take notice of the matrices $G^{\prime}$ and $\tilde{H}^{\prime}$ , which are submatrices of $X$ and $Y$ , respectively. The corresponding span lists are given by

[TABLE]

Here note the following:

•

Each span in $S$ does not include any spans in $T$ except itself.

•

Each span in $\hat{S}$ does not include any spans in $\hat{T}$ except itself.

Moreover, $\hat{S}$ consists of the spans in $\hat{T}$ whose reverse is not in $S$ . Actually, by reversing the spans in $\hat{S}$ , we have

[TABLE]

We see that these spans are not in $S$ .

All these facts show that the conditions in Proposition A.1 are satisfied, when $\tilde{X}$ and $\tilde{Y}$ are replaced by $G^{\prime}$ and $\tilde{H}^{\prime}$ , respectively. Hence, the equivalence of $G^{\prime}$ and $G_{5}^{tb}$ is derived.

Remark: [10, Theorem IV.3] holds only for a pair $(X,Y)$ , where $X$ is a characteristic matrix for $C$ and $Y$ is the corresponding dual one for $C^{\perp}$ (see [10]). On the other hand, the pair $(X,Y)$ computed above may not be in the duality relation. However, $G^{\prime}$ is common to all the characteristic matrices for $C$ , and $\tilde{H}^{\prime}$ is common to all the characteristic matrices for $C^{\perp}$ as well. Hence, the theorem can be applied to our case.

Acknowledgment

The author would like to thank Prof. Heide Gluesing-Luerssen for valuable comments on the duality of Koetter-Vardy (KV) trellises.

Bibliography33

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] J. B. Anderson and S. M. Hladik, “An optimal circular Viterbi decoder for the bounbed distance criterion,” IEEE Trans. Commun. , vol. 50, no. 11, pp. 1736–1742, Nov. 2002.
2[2] L. R. Bahl, J. Cocke, F. Jelinek, and J. Raviv, “Optimal decoding of linear codes for minimizing symbol error rate,” IEEE Trans. Inform. Theory , vol. IT-20, no. 2, pp. 284–287, March 1974.
3[3] A. R. Calderbank, G. D. Forney. Jr., and A. Vardy, “Minimal tail-biting trellises: The Golay code and more,” IEEE Trans. Inform. Theory , vol. 45, no. 5, pp. 1435–1455, July 1999.
4[4] D. Conti and N. Boston, “On the algebraic structure of linear tail-biting trellises,” IEEE Trans. Inform. Theory , vol. 61, no. 5, pp. 2283–2299, May 2015.
5[5] G. D. Forney, Jr., “Convolutional codes I: Algebraic structure,” IEEE Trans. Inform. Theory , vol. IT-16, no. 6, pp. 720–738, Nov. 1970.
6[6] , “Structural analysis of convolutional codes via dual codes,” IEEE Trans. Inform. Theory , vol. IT-19, no. 4, pp. 512–518, July 1973.
7[7] , “Coset codes–Part II: Binary lattices and related codes,” IEEE Trans. Inform. Theory , vol. 34, no. 5, pp. 1152–1187 (Appendix A), Sept. 1988.
8[8] , “Dimension/length profiles and trellis complexity of linear block codes,” IEEE Trans. Inform. Theory , vol. 40, no. 6, pp. 1741–1752, Nov. 1994.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Characteristic Matrices and Trellis Reduction for Tail-Biting Convolutional Codes

Abstract

Index Terms:

I Introduction

II Preliminaries

Definition II.1

III Characteristic Matrices for a Tail-Biting Convolutional Code

III-A Computation of Characteristic Matrices

Proposition III.1

Corollary III.1

Proof:

III-B Structure of the Characteristic Span List

Lemma III.1

Proposition III.2

Proof:

III-C Counting Characteristic Matrices

Lemma III.2** (Weaver [32])**

Proposition III.3

III-D Span Lengths of Characteristic Generators

Proposition III.4

IV Transformations of G(D)G(D)G(D) and the Corresponding TBGM’s

IV-A Dividing a Column of G(D)G(D)G(D) by DpD^{p}Dp

Proposition IV.1

IV-B Multiplying a Column of G(D)G(D)G(D) by DqD^{q}Dq

Proposition IV.2

IV-C gj(D)←gj(D)+Dqgi(D)g_{j}(D)\leftarrow g_{j}(D)+D^{q}g_{i}(D)gj​(D)←gj​(D)+Dqgi​(D)

Proposition IV.3

Definition IV.1

Proposition IV.4

Proof:

V Trellis Reduction for TB Convolutional Codes

V-A An Example of Trellis Reduction

V-B Trellis Reduction for TB Convolutional Codes

Proposition V.1

Proof:

V-C Trellis Reduction Using a Reciprocal Dual Encoder

Definition V.1** (McEliece and Lin [18])**

Proposition V.2** (Tang and Lin [30])**

Proposition V.3

Proof:

V-D Relation Between Trellis Reduction and Section Length

VI Conclusion

Appendix A Proof of the equivalence of G′G^{\prime}G′ and G5tbG_{5}^{tb}G5tb​

Proposition A.1

Proof:

Acknowledgment

Lemma III.2 (Weaver [32])

IV Transformations of $G(D)$ and the Corresponding TBGM’s

IV-A Dividing a Column of $G(D)$ by $D^{p}$

IV-B Multiplying a Column of $G(D)$ by $D^{q}$

IV-C $g_{j}(D)\leftarrow g_{j}(D)+D^{q}g_{i}(D)$

Definition V.1 (McEliece and Lin [18])

Proposition V.2 (Tang and Lin [30])

Appendix A Proof of the equivalence of $G^{\prime}$ and $G_{5}^{tb}$