Flexible Bit-Truncation Memory for Approximate Applications on the Edge
William Oswald, Mario Renteria-Pinon, Md. Sajjad Hossain, Kyle Mooney, Md. Bipul Hossain, Destinie Diggs, Yiwen Xu, Mohamed Shaban, Jinhui Wang, and Na Gong

TL;DR
This paper introduces a flexible bit-truncation memory that dynamically adjusts data precision at runtime, significantly improving power efficiency for various approximate applications on edge devices.
Contribution
It presents a novel, adaptable bit-truncation memory design that supports multiple data bit-widths at runtime, unlike prior fixed or application-specific solutions.
Findings
Supports three video application types with up to 47.02% power savings
Achieves up to 51.69% power reduction in deep learning models
Low implementation cost with 2.89% silicon area overhead
Abstract
Bit truncation has demonstrated great potential to enable run-time quality-power adaptive data storage, thereby optimizing the power/energy efficiency of approximate applications and supporting their deployment in edge environments. However, existing bit-truncation memories require custom designs for a specific application. In this paper, we present a novel bit-truncation memory with full adaptation flexibility, which can truncate any number of data bits at run time to meet different quality and power trade-off requirements for various approximate applications. The developed bit-truncation memory has been applied to two representative data-intensive approximate applications: video processing and deep learning. Our experiments show that the proposed memory can support three different video applications (including luminance-aware, content-aware, and region-of-interest-aware) with enhanced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Big Data and Digital Economy · Network Packet Processing and Optimization
