November 2024
·
1 Read
This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.
November 2024
·
1 Read
September 2024
·
32 Reads
August 2024
·
19 Reads
IEEE Transactions on Circuits and Systems I Regular Papers
This article presents a novel dual 7T static random-access memory (SRAM)-based compute-in-memory (CIM) macro for processing quantized neural networks. The proposed SRAM-based CIM macro decouples read/write operations and employs a zero-input/weight skipping scheme. A 65nm test chip with integrated dual 7T bitcells demonstrated reconfigurable precision multiply and accumulate operations with 384 binary inputs (0/1) and programmable multi-bit weights (3/7/15-levels). Each column comprises 384 bitcells for a dot product, 48 bitcells for offset calibration, and 96 bitcells for binary-searching analog-to-digital conversion. The analog-to-digital converter (ADC) converts a voltage difference between two read bitlines (i.e., an analog dot-product result) to a 1-6b digital output code using binary searching in 1-6 conversion cycles using replica bitcells. The test chip with 66Kb embedded dual SRAM bitcells was evaluated for processing neural networks, including the MNIST image classifications using a multi-layer perceptron (MLP) model with its layer configuration of 784-256-256-256-10 The measured classification accuracies are 97.62%, 97.65%, and 97.72% for the 3, 7, and 15 level weights, respectively. The accuracy degradations are only 0.58 to 0.74% off the baseline with software simulations. For the VGG6 model using the CIFAR-10 image dataset, the accuracies are 88.59%, 88.21%, and 89.07% for the 3, 7, and 15 level weights, with degradations of only 0.6 to 1.32% off the software baseline. The measured energy efficiencies are 258.5, 67.9, and 23.9 TOPS/W for the 3, 7, and 15 level weights, respectively, measured at 0.45/0.8V supplies.
October 2023
·
13 Reads
·
1 Citation
October 2023
·
8 Reads
·
4 Citations
IEEE Transactions on Circuits and Systems II: Express Briefs
This brief presents a low-power and low-noise K-band (18-21.2GHz) analog beamforming receiver (BFIC-Rx) IC for Satcom on the Move (SOTM) applications. The BFIC-Rx employs a 2-stage cascode LNA, a modified Gilbert-cell type dB-linear VGA and a passive phase shifter (PS) in the signal chain for low cascaded noise figure (NF) and low power consumption. The cascode LNA has low NF and a high gain of 24dB for RF input isolation to the subsequent stages. The VGA has a 15dB dB-linear gain, controllable in 0.5dB fine steps, and a 180° phase shift feature. The passive PS has 5-bit control plus 2 bits for calibration and together with VGA’s 180°, achieves a total phase shift of 360°. The BFIC-Rx achieves a state-of-art NF of 2.32dB, a total signal gain of 31dB, IP1dB of −35.8 dBm, phase, and amplitude error of 1.1° and < 0.5dB respectively while consuming only 35mW from a 1.8V supply GHz. The BFIC-Rx was fabricated on a 130nm SiGe process and occupies a core area of mm2. The low NF, high signal gain, and low power are critical parameters to enable large-scale efficient phased arrays for SOTM applications in the toughest conditions.
September 2023
·
5 Reads
·
1 Citation
September 2023
·
9 Reads
September 2023
·
8 Reads
September 2023
·
3 Reads
August 2023
·
17 Reads
·
2 Citations
IEEE Journal of Solid-State Circuits
This work presents a hardware accelerator realizing true time-domain wavefront computing in a massive parallel two-dimensional (2-D) processing element (PE) array. The proposed 2-D time-domain PE array is designed for multiple applications based on its scalable and reconfigurable architecture. The shortest path problem (a classical problem in graph theory) is one of the critical problems to solve using the proposed accelerator. Unlike the A search algorithm, a heuristic method widely used in shortest path searching problems, the proposed accelerator requires only the propagation of rising-edge signals through the PE array without calculating or estimating the distances from the start to the goal. Hence, a single execution of the proposed time-domain wavefront computing provides all the optimal paths from a start point to an arbitrary goal. Besides the King’s graph model used for solving the shortest path searching, the PE array is reconfigured to a simpler lattice graph model and solves other problems, such as maze solving we used in this article as a benchmark. In addition, we used the proposed accelerator to demonstrate a scientific simulation. The propagation of circular or planar wavefronts was simulated using single or multiple start points using King’s graph configuration. A 1 1 mm test chip with a 32 32 reconfigurable time-domain PE array is fabricated using a 65-nm process. For a 2-D map with 32 32 vertices, the proposed PE array consumes 776 pJ per task and achieves 1.6 G edges/second search rate using 1.2-/1.0-V core supply voltages.
... Low Earth orbit (LEO) satellites undergo a period of rapid development driven by ever-increasing user demands, reduced costs, and technological progress. Electronically scanned phasedarray antennas (ESAs), offer fast electronic beam steering, which is essential for maintaining uninterrupted communication links with the rapidly moving LEO satellites [1][2][3][4][5][6][7][8][9][10]. Large-scale ESAs with thousands of antenna elements have become cost-effective due to the availability of low-cost silicon beamforming chipsets [1][2][3][4][5][6]. ...
October 2023
IEEE Transactions on Circuits and Systems II: Express Briefs
... Reliable MAJX: PUD's MAJX operations inherently contain errors in some columns of commercial DRAM modules [32], [36]. To address this reliability challenge, MV-DRAM employs Frac operations [34] and calibration techniques [48] to increase the number of reliable columns, which achieves error-free computation. The number of reliable columns is shown in Table I. ...
November 2022
IEEE Journal of Solid-State Circuits
... Recently, an emerging ultrasonic wavefront computing (WFC) technique was proposed to compute the FFT 11,12 . This method uses the principles of wave mechanics in the acoustic domain by implementing the Fourier transform through ultrasonic waves propagating within Silicon. ...
January 2022
... ADCs utilizing Binary Search Comparison (BSC) have become popular due to their low power and fast comparison times. Yu et al. [8] first applied BSC ADCs in a CIM architecture, shown in Fig. 5. Here, reference bitcells (RBCs) are attached to bitlines as ADC reference voltage sources, and offset bitcells (OBCs) are included for correction. and denote the number of CIM bitcells (CBCs) and the original bitline length, respectively, while and represent the total bitcells and bitline length with RBCs and OBCs added. ...
September 2021
... It is a common and widely used unsupervised learning rule for SNNs [8]. Its bio-plausibility and fast learning speed make it useful for updating synaptic weights in SNNs [9,26]. Figure 3 shows the relation between synaptic weight changes and time differences of spikes in STDP. ...
September 2021
... Several works have explored the FPGA and ASIC acceleration of the conventional graph and sampling-based methods, e.g., A* [23], [24], [25], [26] and RRT [27], [28], [29], [30]. Kosuge et al. [24] develops an accelerator for A* graph construction and search on the Xilinx ZCU102. ...
April 2021
... Their robust performance ensures the faithful representation of intricate cardiac patterns, facilitating precise analysis and detection of biomarkers associated with cardiovascular health. The modulators' inherent oversampling and noiseshaping capabilities further enhance sensitivity, enabling early detection of abnormalities and variations in biomarker levels [2]. The integration of sigma-delta ADC modulators in biomedical applications, particularly in cardiac monitoring, holds great promise for advancing diagnostic tools and wearable devices, ultimately improving the monitoring and management of cardiac conditions [3]. ...
March 2021
IEEE Sensors Journal