AQuRate: MRAM-based Stochastic Oscillator for Adaptive Quantization Rate Sampling of Sparse Signals
Soheil Salehi, Ramtin Zand, Alireza Zaeemzadeh, Nazanin Rahnavard,, Ronald F. DeMara

TL;DR
This paper introduces AQuRate, an MRAM-based stochastic oscillator that efficiently generates adaptive, non-uniform sampling clocks for sparse signals, significantly reducing area and power consumption compared to existing solutions.
Contribution
The paper presents a novel MRAM-based stochastic oscillator for adaptive quantization rate sampling, achieving substantial reductions in area and power over prior non-uniform clock generators.
Findings
~25-fold reduction in area
~6-fold reduction in power dissipation
Effective adaptive sampling for sparse signals
Abstract
Recently, the promising aspects of compressive sensing have inspired new circuit-level approaches for their efficient realization within the literature. However, most of these recent advances involving novel sampling techniques have been proposed without considering hardware and signal constraints. Additionally, traditional hardware designs for generating non-uniform sampling clock incur large area overhead and power dissipation. Herein, we propose a novel non-uniform clock generator called Adaptive Quantization Rate (AQR) generator using Magnetic Random Access Memory (MRAM)-based stochastic oscillator devices. Our proposed AQR generator provides ~25-fold reduction in area, on average, while offering ~6-fold reduced power dissipation, on average, compared to the state-of-the-art non-uniform clock generators.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Image and Signal Denoising Methods · Blind Source Separation Techniques
AQuRate: MRAM-based Stochastic Oscillator for Adaptive
Quantization Rate Sampling of Sparse Signals
Soheil Salehi, Ramtin Zand, Alireza Zaeemzadeh, Nazanin Rahnavard, Ronald F. DeMara
Department of Electrical and Computer Engineering, University of Central Florida, Orlando, FL, 32816 USA
(2019)
Abstract.
Recently, the promising aspects of compressive sensing have inspired new circuit-level approaches for their efficient realization within the literature. However, most of these recent advances involving novel sampling techniques have been proposed without considering hardware and signal constraints. Additionally, traditional hardware designs for generating non-uniform sampling clock incur large area overhead and power dissipation. Herein, we propose a novel non-uniform clock generator called Adaptive Quantization Rate (AQR) generator using Magnetic Random Access Memory (MRAM)-based stochastic oscillator devices. Our proposed AQR generator provides -fold reduction in area, on average, while offering -fold reduced power dissipation, on average, compared to the state-of-the-art non-uniform clock generators.
Analog to Digital Converter, Adaptive Sampling Rate, Non-uniform Clock Generator, MRAM-based Stochastic Oscillator, Compressive Sensing.
††journalyear: 2019††copyright: acmcopyright††conference: Great Lakes Symposium on VLSI 2019; May 9–11, 2019; Tysons Corner, VA, USA††booktitle: Great Lakes Symposium on VLSI 2019 (GLSVLSI ’19), May 9–11, 2019, Tysons Corner, VA, USA††price: 15.00††doi: 10.1145/3299874.3318037††isbn: 978-1-4503-6252-8/19/05††ccs: Hardware Data conversion††ccs: Hardware Spintronics and magnetic technologies††ccs: Hardware Emerging architectures
1. Introduction
Recently, non-uniform sampling approaches such as Compressive Sensing (CS) have been proposed to reduce the energy consumption of sampling operation by reducing number of samples in each frame, reduce required storage to save the sampled data, and reduce the data transmission due to lower number of samples taken (Sarvotham et al., 2006; Zaeemzadeh et al., 2017; Zaeemzadeh et al., 2018). Additionally, event-driven sampling, such as level-crossing sampling, has been widely adopted as a promising CS technique to maximize the performance of sampling operation while reducing energy consumption (Wu et al., 2017). Furthermore, CS techniques are utilized to sample spectrally sparse wide-band signals close to their information rate rather than their Nyquist rate, which can be a challenge using conventional uniform sampling techniques due to the high cost of the hardware that is capable of performing the sampling operation at a high Nyquist rate.
Despite all the benefits that CS techniques offer, they are typically realized oblivious to the hardware limitations such as energy, bandwidth, and battery capacity. Additionally, signal-dependent constraints such as sparsity and noise level are ignored while studying the quantization rate and resolution trade-off. The aforementioned hardware-dependent and signal-dependent constraints alter during the sampling operation. Thus, an adaptive quantization rate and resolution optimization circuitry is required to maximize sampling performance while minimizing the number of samples to reduce energy consumption, data transmission, and storage. Adaptive quantization rate and resolution sampling might be readily achieved from the algorithm perspective, however it requires a hardware platform that is capable of real-time adaptation according to certain signal behavior such as sparsity rate. Recently, an adaptive optimization of the quantization rate and resolution during signal acquisition has been investigated in (Salehi et al., 2018).
Previous works on adaptive quantization rate and resolution ADCs have been implemented using Complementary Metal Oxide Semiconductor (CMOS) technology and considering a low-pass signal model (Bellasi et al., 2014; Wu et al., 2017). Herein, we propose an spin-based Adaptive quantization rate (AQR) generator circuit that considers the signal dependent constraint as well as hardware limitations. The proposed AQR generator circuit utilized Magnetic Random Access Memory (MRAM)-based stochastic oscillator devices, which offer miniaturization and significant energy savings (Camsari et al., 2017b).
2. Background and Related Work
Recently researchers have achieved significant performance improvements using sparse signal recovery techniques. Spectrally sparse signals are utilized in many applications such as frequency hopping communications, musical audio signals, cognitive radio networks, and radar/sonar imaging systems (Salehi et al., 2018). Additionally, a major challenge in spectrum sensing is that in most cases, the sparse components of the signal are spread over a wide-band spectrum and need to be acquired without prior knowledge of their frequencies. Moreover, spectrum-aware communication networks require Radio Frequency and mixed-signal hardware architectures that can achieve very wide-band but energy-efficient spectrum sensing (Salehi et al., 2018).
The cornerstone to achieving CS approaches and non-uniform sampling techniques is the utilization of an asynchronous pseudo-random clock generator, usually referred to as non-uniform clock generator, which is consisted of a Linear Feedback Shift Register (LFSR) that selects a clock signal at random from a series of ring oscillators with different frequency and phases (Lee et al., 2017; Osama et al., 2016; Bhatti et al., 2007; Bellasi et al., 2014; Kose et al., 2008). In most cases, these circuits require a large number of CMOS transistors and incur significant area overhead and power dissipation. Recently, a novel approach for generating the non-uniform clock using VCMA-MTJ devices is proposed in (Lee et al., 2017) and the authors have shown that their proposed design can achieve significant area and power dissipation reduction compared to the previous CMOS-based pseudo-random clock generators. However, the authors in (Lee et al., 2017) considered the frequency of the signal in order to generate the sampling clock, which limits the bandwidth and in case of spectrally sparse signals, where no prior knowledge of frequency is available, their proposed approach will face challenges. Herein, we consider the sparsity rate of the signal to generate the sampling clock. This will minimize the number of samples and results in more energy savings. Furthermore, our proposed design has reduced complexity compared to other designs proposed in the literature due to significant reduction in the CMOS circuit elements.
3. Adaptive quantization rate Generator
3.1. MRAM-based Stochastic Device as a Building Block for AQR generator
In this section, we show that a recently proposed building block with embedded MRAM technology can enable the hardware realization of an AQR generator. The structure of the MRAM-based stochastic device is shown in Fig. 1, which includes a magnetic tunnel junction (MTJ) that is a 2-terminal device with two different resistive levels depending on the orientation of its ferromagnetic (FM) layers, called fixed layer and free layer. The fixed layer is designed to have a fixed magnetic orientation, while the magnetization orientation of the free layer can be switched. In MRAM-based memory devices, a thermally-stable nanomagnet with a large energy barrier with respect to the thermal energy (kT) is utilized for free layer so that the fixed layer can function as a non-volatile memory. In recent years the use of superparamagnetic MTJs that are not thermally stable have been experimentally and theoretically investigated in search of functional spintronic devices (Locatelli et al., 2014; Choi et al., 2014; Fukushima et al., 2014; Sutton et al., 2017; Debashis et al., 2016; Camsari et al., 2017a; Zand et al., 2018b; Zand et al., 2018a).
In this paper, we use an MRAM device with a low energy-barrier nanomagnet (), which is thermally unstable (Camsari et al., 2017b). The resistance of an MTJ with such a low energy barrier nanomagnet randomly fluctuates between high () and low resistance states (). This creates a fluctuating output voltage at the drain of the NMOS transistor, which can be amplified by an inverter circuit to produce a stochastic output that can be modulated by the input voltage. In particular, the output voltage at the drain of the NMOS transistor can be shorted to the ground by reducing its drain-source resistance () through increasing the input voltage (), or it can be near by increasing the through decreasing . The device operation can be comprehended by considering the MTJ conductance (Camsari et al., 2017b):
[TABLE]
where is the free layer magnetization that is stochastically fluctuating due to the thermal noise, is the average MTJ conductance, , and is the tunneling magnetoresistance ratio. The drain voltage can be expressed as:
[TABLE]
where is the ratio of the transistor conductance () to the average MTJ conductance (). The maximum fluctuations at the drain occurs when , thus the MTJ resistance is approximately equal to the NMOS resistance when . Since the drain voltage fluctuations are in the order of hundreds of mV for typical TMR values, an additional inverter is used to amplify the noise to produce output voltages ranging from 0 to .
3.2. AQR Generator Circuit
To realize an effective hybrid emerging device and CMOS circuit, one useful approach can be to consider stochastic and deterministic attributes separately. For instance, Fig. 2 depicts the proposed AQR generator circuit wherein a -terminal MTJ realizes stochastic behavior to provide the non-uniform clock generation capability.
The quantized Sparsity Rate Estimator (SRE) module shown in Fig. 2 estimates the sparsity rate of the digital output bit-stream by estimating the sparse spectral components of the digital output using an iterative algorithm. Recently, rapid and optimized sparse component estimation method is proposed in (Salehi et al., 2018). In the approach proposed in (Salehi et al., 2018) in order to minimize the computational complexity of the sparse component estimation, an sliding window approach is utilized and the algorithm operates only one iteration on each frame of the input by utilizing the previous estimate as an initial value. This will result in gradual convergence of the sparse components to the actual values across iterations. These algorithms can be employed to find the sparsity rate of the signal. In most cases, sparsity rate of analog signals, which can be described as the number of non-zero elements in divided by the total number of elements the sparse representation of the signal, is between to in many applications including those targeted herein.
When the SRE module estimates the sparsity rate of the signal based on the digital output of the previous frame, it will then generate a voltage level according to that sparsity rate of the input analog signal. This voltage, referred to as , will be applied to the gate of the NMOS transistor shown in Fig. 2 and results in an stochastic bit-stream generated by the MRAM-based stochastic oscillator device. The stochastic bit-stream output generated by the MRAM-based stochastic oscillator device will be forwarded to the D-Flip-Flop (D-FF) as shown in Fig. 2 and the result of the -input NAND gate between the output of the D-FF and the actual clock of the circuit will generate the required quantization rate to be used for the following frame of the signal acquisition, referred to as Asynchronous Clock (A-Clk) in Fig. 2. Additionally, the SRE module can also used by the recovery algorithms to efficiently recover the sampled signal (Salehi et al., 2018). Additionally, the A-Clk will be forwarded to the sparse recovery algorithm to provide necessary information about the samples taken from the signal to assist with the signal reconstruction.
To obtain the relation between the output probability of the stochastic MRAM-based AQR generator and its input voltage, we have applied an input pulse that its amplitude starts from and is increased by mV every ns until it reaches . The output of the building block is sampled with a GHz clock frequency using a D-FF circuit, as shown in Fig. 3.
4. Simulation Results
In order to evaluate and validate the behavior and functionality of the proposed AQR generator circuit, SPICE and MATLAB simulations were performed. We have utilized the nm High Performance FinFET Predictive Technology Model (PTM) ((ASU), [n. d.]) as well as the MRAM-based stochastic oscillator device model and parameters represented in (Camsari et al., 2017b) to implement and evaluate the proposed AQR generator circuit.
According to our results, AQR provides significant power dissipation and area reductions compared to the state-of-the-art nonuniform clock generators listed in Table 1 (Lee et al., 2017; Osama et al., 2016; Bhatti et al., 2007; Bellasi et al., 2014). According to our simulation results, power dissipation of the proposed AQR generator circuit is W on average. With respect to area utilization, our proposed AQR design requires only FinFET transistors, which attains a significant reduction in the transistor count and complexity of the non-uniform clock generator circuit present in state-of-the-art designs (Lee et al., 2017; Osama et al., 2016; Bhatti et al., 2007; Bellasi et al., 2014). Thus, AQR avoids high transistor counts while making it unnecessary to use of large LFSR circuits that contain numerous D-FFs as well as several logic gates and multiplexers. For a more equitable comparison in terms of area and power dissipation, we have derived (3) and (4) considering General Scaling method (Stillmaker and Baas, 2017) to normalize the power dissipation and area of the designs listed in Table 1. Based on the General Scaling method, voltage and area scale at different rate of and , respectively. Thus, the power dissipation is scaled with respect to and area per device is scaled according to (Stillmaker and Baas, 2017).
[TABLE]
[TABLE]
where, is the nominal voltage of the technology model, refers to the technology node in nanometers, and subscript refers to the design that we want to scale its power dissipation and area according to the technology models. According to (3) and (4), AQR provides power dissipation reduction up to one-order-of-magnitude compared to the state-of-the-art nonuniform clock generators as listed in Table 1. Additionally, AQR offers up to one-orders-of-magnitude area reduction compared to the designs provided in Table 1 using the scaling comparison trends accepted in the literature.
As described in Section 3.2, sparsity rate of analog signals is usually within the range of . Fig. 4 depicts an example output of the AQR generator for sampling of a sparse signal with sparsity rate. Moreover, we have embedded our proposed AQR generator within CS recovery algorithms called Orthogonal Matching Pursuit (OMP) and Compressive Sampling Matching Pursuit (CoSaMP) (200, 2009) in order to evaluate the architectural simulation results and in order to recover the signal from the samples taken using the AQR generator. According to the results, the mean normalized errors of the reconstruction of the signals with , , and sparsity rates using OMP are , , and , respectively. Moreoever, the mean normalized errors of the reconstruction of the signals with , , and sparsity rates using CoSaMP are , , and , respectively.
5. Conclusions
We have devised a novel non-uniform clock generator called Adaptive quantization rate (AQR) generator using MRAM-based stochastic oscillator devices. Our proposed AQR generator considers signal constraints, such as sparsity rate, as well as hardware constraints, such as area and power dissipation, in order to generate the non-uniform clock for the asynchronous CS-ADC. Compared to similar non-uniform clock generators presented in the literature, AQR generator provides significant area reduction of -fold on average, while achieving power dissipation reduction of -fold, on average.
Acknowledgement
This work was supported in part by the Center for Probabilistic Spin Logic for Low-Energy Boolean and Non-Boolean Computing (CAPSL), one of the Nanoelectronic Computing Research (nCORE) Centers as task 2759.006, a Semiconductor Research Corporation (SRC) program sponsored by the NSF through CCF 1739635, and by NSF through ECCS 1810256.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1)
- 2200 (2009) 2009. Co Sa MP: Iterative signal recovery from incomplete and inaccurate samples. Applied and Computational Harmonic Analysis 26, 3 (2009), 301–321.
- 3(ASU) ([n. d.]) Arizona State University (ASU). [n. d.]. 14nm HP-Fin FET Predictive Technology Model (PTM), accessed on 26 November 2018, available at: http://ptm.asu.edu/. http://ptm.asu.edu/. ([n. d.]). http://ptm.asu.edu/
- 4Bellasi et al . (2014) David Bellasi, Luca Bettini, Thomas Burger, Qiuting Huang, Christian Benkeser, and Christoph Studer. 2014. A 1.9 GS/s 4-bit sub-Nyquist flash ADC for 3.8 G Hz compressive spectrum sensing in 28 nm CMOS. In 2014 IEEE 57th International Midwest Symposium on Circuits and Systems (MWSCAS) . IEEE, 101–104. https://doi.org/10.1109/MWSCAS.2014.6908362 · doi ↗
- 5Bhatti et al . (2007) Rashed Zafar Bhatti, Keith M. Chugg, and Jeff Draper. 2007. Standard cell based pseudo-random clock generator for statistical random sampling of digital signals. In 2007 50th Midwest Symposium on Circuits and Systems . IEEE, 1110–1113. https://doi.org/10.1109/MWSCAS.2007.4488752 · doi ↗
- 6Camsari et al . (2017 a) Kerem Yunus Camsari, Rafatul Faria, Brian M Sutton, and Supriyo Datta. 2017 a. Stochastic p-bits for invertible logic. Physical Review X 7, 3 (2017), 031014.
- 7Camsari et al . (2017 b) Kerem Yunus Camsari, Sayeef Salahuddin, and Supriyo Datta. 2017 b. Implementing p-bits with embedded mtj. IEEE Electron Device Letters 38, 12 (2017), 1767–1770.
- 8Choi et al . (2014) Won Ho Choi, Yang Lv, Jongyeon Kim, Abhishek Deshpande, Gyuseong Kang, Jian-Ping Wang, and Chris H Kim. 2014. A magnetic tunnel junction based true random number generator with conditional perturb and real-time output probability tracking. In Electron Devices Meeting (IEDM), 2014 IEEE International . IEEE, 12–5.
