Article

Borrowing information across genes and experiments for improved error variance estimation in microarray data analysis.

Iowa State University, USA.
Statistical Applications in Genetics and Molecular Biology (impact factor: 1.52). 01/2012; 11(3):Article 12. DOI:10.1515/1544-6115.1806
Source: PubMed

ABSTRACT Statistical inference for microarray experiments usually involves the estimation of error variance for each gene. Because the sample size available for each gene is often low, the usual unbiased estimator of the error variance can be unreliable. Shrinkage methods, including empirical Bayes approaches that borrow information across genes to produce more stable estimates, have been developed in recent years. Because the same microarray platform is often used for at least several experiments to study similar biological systems, there is an opportunity to improve variance estimation further by borrowing information not only across genes but also across experiments. We propose a lognormal model for error variances that involves random gene effects and random experiment effects. Based on the model, we develop an empirical Bayes estimator of the error variance for each combination of gene and experiment and call this estimator BAGE because information is Borrowed Across Genes and Experiments. A permutation strategy is used to make inference about the differential expression status of each gene. Simulation studies with data generated from different probability models and real microarray data show that our method outperforms existing approaches.

0 0
 · 
0 Bookmarks

Keywords

borrowing information
 
different probability models
 
differential expression status
 
empirical Bayes approaches
 
empirical Bayes estimator
 
error variance
 
error variances
 
genes
 
involves random gene effects
 
microarray experiments
 
random experiment effects
 
real microarray data
 
recent years
 
sample size available
 
Shrinkage methods
 
Simulation studies
 
stable estimates
 
study similar biological systems
 
usual unbiased estimator
 
variance estimation