Microarray scanner calibration curves: characteristics and implications

National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, Arkansas 72079, USA.
BMC Bioinformatics (Impact Factor: 2.67). 08/2005; 6 Suppl 2(Suppl 2):S11. DOI: 10.1186/1471-2105-6-S2-S11
Source: PubMed

ABSTRACT Microarray-based measurement of mRNA abundance assumes a linear relationship between the fluorescence intensity and the dye concentration. In reality, however, the calibration curve can be nonlinear.
By scanning a microarray scanner calibration slide containing known concentrations of fluorescent dyes under 18 PMT gains, we were able to evaluate the differences in calibration characteristics of Cy5 and Cy3. First, the calibration curve for the same dye under the same PMT gain is nonlinear at both the high and low intensity ends. Second, the degree of nonlinearity of the calibration curve depends on the PMT gain. Third, the two PMTs (for Cy5 and Cy3) behave differently even under the same gain. Fourth, the background intensity for the Cy3 channel is higher than that for the Cy5 channel. The impact of such characteristics on the accuracy and reproducibility of measured mRNA abundance and the calculated ratios was demonstrated. Combined with simulation results, we provided explanations to the existence of ratio underestimation, intensity-dependence of ratio bias, and anti-correlation of ratios in dye-swap replicates. We further demonstrated that although Lowess normalization effectively eliminates the intensity-dependence of ratio bias, the systematic deviation from true ratios largely remained. A method of calculating ratios based on concentrations estimated from the calibration curves was proposed for correcting ratio bias.
It is preferable to scan microarray slides at fixed, optimal gain settings under which the linearity between concentration and intensity is maximized. Although normalization methods improve reproducibility of microarray measurements, they appear less effective in improving accuracy.

Download full-text


Available from: Roger G Perkins, Jul 07, 2015
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: In this study, we propose a calibration method for preprocessing spiked-in microarray experiments based on nonlinear mixed-effects models. This method uses a spike-in calibration curve to estimate normalized absolute expression values. Moreover, using the asymptotic properties of the calibration estimate, 100(1-α)% confidence intervals for the estimated expression values can be constructed. Simulations are used to show that the approximations on which the construction of the confidence intervals are based are sufficiently accurate to reach the desired coverage probabilities. We illustrate applicability of our method, by estimating the normalized absolute expression values together with the corresponding confidence intervals for two publicly available cDNA microarray experiments (Hilson et al., 2004; Smets et al., 2008). This method can easily be adapted to preprocess one-color oligonucleotide microarray data with a slight adjustment to the mixed model.
    Statistical Applications in Genetics and Molecular Biology 02/2009; 8(1):5-5. DOI:10.2202/1544-6115.1401 · 1.52 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: This investigation deals with a new distance measure for genes using their microarray expressions and a new algorithm for fast gene ordering without clustering. This distance measure is called "Maxrange distance," where the distance between two genes corresponding to a particular type of experiment is computed using a normalization factor, which is dependent on the dynamic range of the gene expression values of that experiment. The new gene-ordering method called "Minimal Neighbor" is based on the concept of nearest neighbor heuristic involving O(n2) time complexity. The superiority of this distance measure and the comparability of the ordering algorithm have been extensively established on widely studied microarray data sets by performing statistical tests. An interesting application of this ordering algorithm is also demonstrated for finding useful groups of genes within clusters obtained from a nonhierarchical clustering method like the self-organizing map.
    IEEE TRANSACTIONS ON CYBERNETICS 07/2007; 37(3):742-9. DOI:10.1109/TSMCB.2006.889812 · 3.47 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: DNA microarray technologies are used in a variety of biological disciplines. The diversity of platforms and analytical methods employed has raised concerns over the reliability, reproducibility and correlation of data produced across the different approaches. Initial investigations (years 2000-2003) found discrepancies in the gene expression measures produced by different microarray technologies. Increasing knowledge and control of the factors that result in poor correlation among the technologies has led to much higher levels of correlation among more recent publications (years 2004 to present). Here, we review the studies examining the correlation among microarray technologies. We find that with improvements in the technology (optimization and standardization of methods, including data analysis) and annotation, analysis across platforms yields highly correlated and reproducible results. We suggest several key factors that should be controlled in comparing across technologies, and are good microarray practice in general.
    Environmental and Molecular Mutagenesis 06/2007; 48(5):380-94. DOI:10.1002/em.20290 · 2.55 Impact Factor