High-Throughput and Memory-Efficient Parallel Viterbi Decoder for   Convolutional Codes on GPU

Alireza Mohammadidoost; Matin Hashemi

arXiv:2011.09337·cs.DC·November 19, 2020·1 cites

High-Throughput and Memory-Efficient Parallel Viterbi Decoder for Convolutional Codes on GPU

Alireza Mohammadidoost, Matin Hashemi

PDF

Open Access

TL;DR

This paper presents a GPU-based parallel Viterbi decoder that significantly improves throughput and memory efficiency for convolutional codes, enhancing wireless communication system performance.

Contribution

It introduces a novel parallel implementation with unified kernel and parallel traceback, optimizing both throughput and memory usage over existing GPU solutions.

Findings

01

Higher throughput than previous GPU solutions

02

Reduced memory usage through optimization

03

Effective parallel traceback implementation

Abstract

This paper describes a parallel implementation of Viterbi decoding algorithm. Viterbi decoder is widely used in many state-of-the-art wireless systems. The proposed solution optimizes both throughput and memory usage by applying optimizations such as unified kernel implementation and parallel traceback. Experimental evaluations show that the proposed solution achieves higher throughput compared to previous GPU-accelerated solutions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Wireless Communication Techniques · Error Correcting Code Techniques · Wireless Communication Networks Research