Statistical Methods of Background Correction for Illumina BeadArray Data

Division of Biostatistics, Department of Clinical Sciences, University of Texas Southwestern Medical Center, Dallas, USA.
Bioinformatics (Impact Factor: 4.98). 03/2009; 25(6):751-7. DOI: 10.1093/bioinformatics/btp040
Source: PubMed


Advances in technology have made different microarray platforms available. Among the many, Illumina BeadArrays are relatively new and have captured significant market share. With BeadArray technology, high data quality is generated from low sample input at reduced cost. However, the analysis methods for Illumina BeadArrays are far behind those for Affymetrix oligonucleotide arrays, and so need to be improved.
In this article, we consider the problem of background correction for BeadArray data. One distinct feature of BeadArrays is that for each array, the noise is controlled by over 1000 bead types conjugated with non-specific oligonucleotide sequences. We extend the robust multi-array analysis (RMA) background correction model to incorporate the information from negative control beads, and consider three commonly used approaches for parameter estimation, namely, non-parametric, maximum likelihood estimation (MLE) and Bayesian estimation. The proposed approaches, as well as the existing background correction methods, are compared through simulation studies and a data example. We find that the maximum likelihood and Bayes methods seem to be the most promising.
Supplementary data are available at Bioinformatics online.

Download full-text


Available from: Michael D Story, Oct 02, 2015
41 Reads
  • Source
    • "Hence, the two main steps implemented in our preprocessing of microarray array data were background correction and normalization. In this study, we used a non-parametric version of model-based background correction method (MBCB), which uses an extended model of robust multiarray analysis (RMA), to incorporate the information from negative control beads [15]. These background-corrected data were subjected to quantile normalization to obtain identical sample distributions in terms of their statistical properties. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Integrating the analysis of the cistrome of a transcription factor by ChIP-Seq with the study of its transcriptional output by microarray or RNA-Seq analysis is a powerful approach to elucidate the genomic functions of a transcription factor. Recently, we employed this approach to determine the mechanism of action by which the nuclear receptor PPARγ elicits its antitumorigenic effects in lung cancer cells upon activation by TZDs (1). Here we describe in detail the design, contents and quality controls for the gene expression and cistrome analyses associated with our study published in Cell Metabolism in 2014.
    Genomics Data 03/2015; 3:80-86. DOI:10.1016/j.gdata.2014.11.015
  • Source
    • "Compared with similar models previously proposed (Bolstad et al., 2003; Xie et al., 2009), our model takes into account the contribution of noise peaks by the second term in Eqn (6) and the influence of noise on the detection of small signal peaks by the error function in Eqn (7). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Simultaneous recordings of multiple neuron activities with multi-channel extracellular electrodes are widely used for studying information processing by the brain's neural circuits. In this method, the recorded signals containing the spike events of a number of adjacent or distant neurons must be correctly sorted into spike trains of individual neurons, and a variety of methods have been proposed for this spike sorting. However, spike sorting is computationally difficult because the recorded signals are often contaminated by biological noise. Here, we propose a novel method for spike detection, which is the first stage of spike sorting and hence crucially determines overall sorting performance. Our method utilizes a model of extracellular recording data that takes into account variations in spike waveforms, such as the widths and amplitudes of spikes, by detecting the peaks of band-pass-filtered data. We show that the new method significantly improves the cost-performance of multi-channel electrode recordings by increasing the number of cleanly sorted neurons.
    European Journal of Neuroscience 06/2014; 39(11):1943-1950. DOI:10.1111/ejn.12614 · 3.18 Impact Factor
  • Source
    • "Expression level data from the Illumina Bead Studio software were normalized using a quantile normalization [43]. Probes whose expression level exceeds a threshold value in at least one sample are called detected. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Aortic valve calcification is a significant and serious clinical problem for which there are no effective medical treatments. Individuals born with bicuspid aortic valves, 1-2% of the population, are at the highest risk of developing aortic valve calcification. Aortic valve calcification involves increased expression of calcification and inflammatory genes. Bicuspid aortic valve leaflets experience increased biomechanical strain as compared to normal tricuspid aortic valves. The molecular pathogenesis involved in the calcification of BAVs are not well understood, especially the molecular response to mechanical stretch. HOTAIR is a long non-coding RNA (lncRNA) that has been implicated with cancer but has not been studied in cardiac disease. We have found that HOTAIR levels are decreased in BAVs and in human aortic interstitial cells (AVICs) exposed to cyclic stretch. Reducing HOTAIR levels via siRNA in AVICs results in increased expression of calcification genes. Our data suggest that β-CATENIN is a stretch responsive signaling pathway that represses HOTAIR. This is the first report demonstrating that HOTAIR is mechanoresponsive and repressed by WNT β-CATENIN signaling. These findings provide novel evidence that HOTAIR is involved in aortic valve calcification.
    PLoS ONE 05/2014; 9(5):e96577. DOI:10.1371/journal.pone.0096577 · 3.23 Impact Factor
Show more