P.G. Gulak’s research while affiliated with University of Toronto and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (52)


CMOS 3-T digital pixel sensor with in-pixel shared comparator
  • Conference Paper

May 2012

·

36 Reads

·

11 Citations

·

P.G. Gulak

·

Roman Genov

A CMOS digital pixel sensor (DPS) VLSI architecture with in-pixel one-bit quantization is presented. A single column-parallel comparator is shared by all pixels in the column. This results in a compact 3-T pixel implementation. By eliminating the in-pixel source follower the pixel effective power dissipation is reduced by over two orders of magnitude compared to a conventional 3-T pixel. A 64×64 DPS test prototype with 10μm pixel pitch has been fabricated in 0.35μm standard CMOS and experimentally characterized.


Single-filter multi-color CMOS fluorescent contact sensing microsystem

May 2012

·

41 Reads

·

5 Citations

·

·

Ulrich J. Krull

·

[...]

·

Roman Genov

A multi-color fluorescent contact sensing microsystem is presented. The microsystem employs a CMOS field-modulated color sensor (FCS) to spectrally detect and differentiate among multiple emission bands, requiring only one on-CMOS longpass filter. A FCS prototype has been fabricated in a standard 0.35μm CMOS technology. The multi-color imaging capability of the FCS microsystem has been validated in the detection of green-emitting and red-emitting quantum dots (QDs) with QD concentration detection limits of 313nM and 78nM, respectively.


A 0.13µm CMOS 655Mb/s 4×4 64-QAM K-Best MIMO detector

March 2009

·

77 Reads

·

49 Citations

The high spectral efficiency offered by multiple-input-multiple-output (MIMO) technology has made it the technology-of-choice in many standards like IEEE 802.16e/m (WiMAX) and the long term evolution (LTE) project and emerging 4G systems. One challenge to the widespread adoption of MIMO technology is the design of high-throughput detectors with near maximum-likelihood (ML) performance and cost-effective VLSI realization for a large number of antennas and constellation order.


A CMOS Integrated Bacterial Sensor for Rapid Detection of Pseudomonas aeruginosa

December 2008

·

25 Reads

·

2 Citations

An integrated bacterial detection chip is implemented in 0.18 mum CMOS technology. The chip has been tested using pyocins as biological detecting elements along with the study of electrical noise generated in an integrated nanowell to detect the presence of two different bacterial clinical isolates of Pseudomonas aeruginosa . The chip successfully identifies the presence of bacterial strains sensitive to the pyocin in less than 10 minutes. The effect of bacterial cell concentration is also presented in the experimental results. The chip consumes 122 muW from 3.3 V supply for two recording channels and occupies an area of 0.48 mm2 in a 0.18 mum standard CMOS process.


A 0.18¿m CMOS Integrated Sensor for the Rapid Identification of Bacteria

March 2008

·

20 Reads

·

5 Citations

This paper presents an integrated sensor for identifying bacteria that addresses drawbacks of traditional detection techniques by combining the specificity of phages with the sensitivity of integrated electronic circuits. This is the first silicon-based implementation known to date; it uses standard CMOS technology, does not require noble-metal electrodes and co-integrates active circuitry on the same substrate as the measurement site containing the bacteria. The chip was fabricated in 0.18 mum CMOS technology using thick-oxide transistors and consumes 122 muW with a 3.3 V supply for two recording channels.


Fig. 1. The considered system block diagram.  
Fig. 2. Calculated polynomials of order 3, 5, 7.  
Fig. 3. The signal constellation for (a) the original signal and (b) its compressed version in the frequency domain.  
Fig. 4. The CCDF of PAPR for original and compressed signals.  
Fig. 5. Effect of iteration on improving performance.  

+1

Papr Reduction in OFDM Systems Using Polynomial-Based Compressing and Iterative Expanding
  • Conference Paper
  • Full-text available

June 2006

·

81 Reads

·

4 Citations

Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on

In this paper we propose a novel algorithm for PAPR reduction of an OFDM system, based on a companding scheme. In this method a compressing polynomial is appended to the IFFT block at the transmitter and at the receiver the FFT block is combined with a reverse expanding function where the iterative Jacobi's method is used for solving equations. The proposed method entails less complexity at the transmitter in comparison with other PAPR reduction algorithms. It also requires less increase in SNR for the same BER compared to other companding methods. A trade off between complexity and performance can set the order of compressing polynomial and the number of iterations for the proposed algorithm at the receiver

Download

VLSI architecture of a wireless channel estimator using sequential Monte Carlo methods

July 2005

·

52 Reads

·

5 Citations

The regular and repetitive nature of the sequential Monte Carlo (SMC) method makes it very attractive for implementation using parallel and pipelined architectures. This paper develops a VLSI architecture for the hardware implementation of the SMC algorithm using bootstrap filter. A flat fading wireless channel is considered as our framework on which the channel estimator is designed and implemented using the SMC method. The design and verification activities at the algorithm level, architecture level, and the circuit level are reviewed. The proposed architecture is verified with an FPGA implementation.


An FPGA interpolation processor for soft-decision Reed-Solomon decoding

May 2004

·

16 Reads

·

9 Citations

We propose a parallel architecture for implementing the interpolation step in the Koetter-Vardy soft-decision Reed-Solomon decoding algorithm. The key feature is the embedding of both a binary tree and a linear array into a two-dimensional array processor, enabling fast polynomial evaluation operations. An FPGA interpolation processor was implemented and demonstrated at a clock frequency of 23 MHz, corresponding to decoding rates of 10-15 Mbps.


A 13.3Mb/s 0.35μm CMOS analog turbo decoder IC with a configurable interleaver

February 2003

·

26 Reads

·

52 Citations

IEEE Journal of Solid-State Circuits

A 0.35μm CMOS analog decoder for a 4-state, rate 1/3, block length 16 turbo code operates at 13.3Mb/s and latency of 1.2μs and consumes 13.9nJ per decoded bit with a 3.3V supply. The 1.42mm2 core IC implements two logarithmic domain MAP decoders and a fully programmable analog interleaver that is configured at power-up.


An assessment of VLSI and embedded software implementations for Reed-Solomon decoders

November 2002

·

12 Reads

·

10 Citations

This paper examines Reed-Solomon time-domain and frequency-domain decoder implementations in both software and hardware. The focus was on designing area-efficient, low-power and low-complexity decoders suitable for today's moderate data-rate applications. Two decoder chips were designed using a synthesized standard cell library in a 0.18 μm CMOS process, targeting a 160 Mbps decoding rate. The time-domain decoder was fabricated with a core area of 1.50 mm2.


Citations (44)


... This section covers some famous FPAA architectures presented in the past decades, highlighting the different CAB building blocks, switching networks, and overall architecture. Some CABs were designed based on different active circuits such as op-amps [9]- [11], operational transconductance amplifiers (OTA) [8], [12]- [25], current conveyors (CC) [26]- [31], current feedback amplifier (CFOA) [32], and some built using simple circuits like differential amplifier and current mirrors [33]- [35]. As mentioned earlier, the FPAA is composed of the CABs and the interconnecting routing network between them, both of which are interdependent on the function desired. ...

Reference:

09286441
A CMOS Field-programmable Analog Array
  • Citing Conference Paper
  • January 1991

... Previous work on graphs of higher co-ordination is to be found in [16], where the Shuffle-Exchange network was generalized to the case of a ternary alphabet for a small example. The difficulty of a more general analysis was noted in [17]. In this paper the authors developed the idea of using Cartesian products of DeBruijn graphs. ...

Decoding of Rate k/n Convolutional Codes in VLSI
  • Citing Chapter
  • January 1988

... Moreover, top-down approaches lack the physical connection with reality, thus they do not describe signal propagation in the network. Transmission line theory-based models are also common [19][20][21][22][23][24][25][26][27][28][29][30][31], which describes the PLC channel as a series of cascaded two-port networks and can thus be adapted to different network topologies using popular transmission matrices [13]. Though this approach has a direct relationship with signal propagation in the network, determination of such matrices is usually complex and the complexity grows with the complexity of the given powerline network. ...

An In-building Power Line Channel Simulator
  • Citing Article

... The main part of an image sensor chip is a 2D pixel array 17 . The basic building unit of a pixel array is typically a 3-transistor (3-T) pixel circuit as shown in Fig. 1(d) 18 . This circuit consists of a reverse-biased photodetector and three transistors: X1, X2, and X3, which are referred to as the reset, source follower, and pixel selector transistors, respectively. ...

CMOS 3-T digital pixel sensor with in-pixel shared comparator
  • Citing Conference Paper
  • May 2012

... The most recently reported prototype is fabricated in a 5 μm custom process [88]. Figure 8c depicts a new approach that employs a voltage to tune the spectral response of a detector [89][90][91]. Sensing of a small set of well separated wavelengths (e.g., >50 nm apart) is achieved by tuning the spectral response of the device with a bias voltage. Termed the CMOS photogate (CPG), it employs the polysilicon gate as an optical filter, which eliminates the need for an external color filter. ...

Single-filter multi-color CMOS fluorescent contact sensing microsystem
  • Citing Conference Paper
  • May 2012

... The parallel implementation of decoders has been widely studied. For example, the flooding decoding schedule of an LDPC code is highly parallel [15] and the parallel decoding of turbo type codes is studied in [26][29]. The following example compares the combined effect of serial and parallel processing for both equalization and decoding . ...

Concurrent turbo-decoding
  • Citing Conference Paper
  • January 1997

... L'architecture des turbo-codes dite "Turbo-codes à roulettes", [GNA03a] [GNA03b], est conçue pour résoudre ce type de problème. Les auteurs proposent une nouvelle famille de turbo-codes dans laquelle le brassage se fait dans les deux dimensions (cf. Figure 2.16) et est réalisé par un ensemble de codes Convolutifs Récursifs Systématiques Circulaires (CRSC) [BER99], indépendants appelés "roulettes". ...

Les Turbo-Codes à Roulettes(1)(2)

... Hitherto, a lot of models have been put forward in the literature to characterize signal variations in both amplitudes and phases under the context of fading signals. For instance, the Rayleigh, Lth-order Rice and analog erasure fading models [7], [8], [106]. Based on such models, various control/filtering problems have been broadly studied in recent decades with some typical works presented in [78], [80], [81], [146], [147]. ...

Joint Data and Kalman Estimation for Rayleigh Fading Channels
  • Citing Article
  • August 1999

Wireless Personal Communications

... Viterbi decoders can be classified according to the number of ACS units as follows: state serial decoders (in which a single ACS unit is used to perform the ACS computation for all states), fully parallel decoders (in which each state has an ACS unit assigned to it), and intermediate decoders (in which several states share the same ACS unit). In addition, there are cascaded pipelined structures and parallel pipelined cascades, in which each ACS unit is responsible for one of stages [129], [130]. ...

A VLSI implementation of a cascade Viterbi decoder with traceback