A response to information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments

BMC Bioinformatics (Impact Factor: 2.58). 12/2009; 10(1):438; author reply 438. DOI: 10.1186/1471-2105-10-438
Source: PubMed


For gene expression data obtained from a time-course microarray experiment, Liu et al. developed a new algorithm for clustering genes with similar expression profiles over time. Performance of their proposal was compared with three other methods including the order-restricted inference based methodology of Peddada et al. In this note we point out several inaccuracies in Liu et al. and conclude that the order-restricted inference based methodology of Peddada et al. (programmed in the software ORIOGEN) indeed operates at the desired nominal Type 1 error level, an important feature of a statistical decision rule, while being computationally substantially faster than indicated by Liu et al.
Application of ORIOGEN to the well-known breast cancer cell line data of Lobenhofer et al. revealed that ORIOGEN software took only 21 minutes to run (using 100,000 bootstraps with p = 0.0025), substantially faster than the 72 hours found by Liu et al. using Matlab. Also, based on a data simulated according to the model and parameters of simulation 1 (sigma2 = 1, M = 5) in [1] we found that ORIOGEN took less than 30 seconds to run in stark contrast to Liu et al. who reported that their implementation of the same algorithm in R took 2979.29 seconds. Furthermore, for the simulation studies reported in [1], unlike the claims made by Liu et al., ORIOGEN always maintained the desired false positive rate. According to Figure three in Liu et al. their algorithm had a false positive rate ranging approximately from 0.20 to 0.70 for the scenarios that they simulated.
Our comparisons of run times indicate that the implementations of ORIOGEN's algorithm in Matlab and R by Liu et al. is inefficient compared to the publicly available JAVA implementation. Our results on the false positive rate of ORIOGEN suggest some error in Figure three of Liu et al., perhaps due to a programming error.

5 Reads
  • [Show abstract] [Hide abstract]
    ABSTRACT: Fourier Transform InfraRed (FTIR) spectroscopy is an increasingly used technique in biology, especially for whole cell metabolomic fingerprint. The reproducibility of this technique is influenced by a large number of factors such as the physiological state of cells, sample manipulation and growth conditions. Evidence exists suggesting that the cell shape and dimension can be further elements to consider in whole cell FTIR analysis. In this study we aimed to address the effect of cell geometry on the FTIR spectra and to define the extent of variability occurring between machine and biological replicas with a standardized protocol. The yeast species Saccharomyces cerevisiae (large oval-shaped cells) and Debaryomyces hansenii (small round shaped cells) were employed for their different morphology. Thirty machine replicas of each were analyzed separately and after averaging in groups of three, showing a three to four-fold reduction of the variability. Similarly, a two-fold reduction of variability was observed when thirty biological replicas of the two yeast species were analyzed. The optimal number of replicas to average was then estimated with a bootstrap-like procedure in which biological and machine replicas were randomly resampled 2000 times and averaged in groups spanning from 2 to 12 replicas. This simulation has shown that little if any advantage can be obtained by increasing the number of replicas over five and that the variability exhibited by the small regular cells of D. hansenii was always roughly half of that displayed by the large S. cerevisiae cells, confirming the results obtained with standard non-bootstrapped averages.
    The Analyst 08/2010; 135(8):2099-105. DOI:10.1039/c0an00127a · 4.11 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: A major challenge in genomic research is identifying significant biological processes and generating new hypotheses from large gene sets. Gene sets often consist of multiple separate biological pathways, controlled by distinct regulatory mechanisms. Many of these pathways and the associated regulatory mechanisms might be obscured by a large number of other significant processes and thus not identified as significant by standard gene set enrichment analysis tools. We present a novel method called Independent Enrichment Analysis (IEA) and software TAFFEL that eases the task by clustering genes to subgroups using Gene Ontology categories and transcription regulators. IEA indicates transcriptional regulators putatively controlling biological functions in studied condition. We demonstrate that the developed method and TAFFEL tool give new insight to the analysis of differentially expressed genes and can generate novel hypotheses. Our comparison to other popular methods showed that the IEA method implemented in TAFFEL can find important biological phenomena, which are not reported by other methods.
    BMC Bioinformatics 05/2011; 12:171. DOI:10.1186/1471-2105-12-171 · 2.58 Impact Factor

Preview (2 Sources)

5 Reads
Available from