Article

Avoiding inconsistencies over time and tracking difficulties in Applied Biosystems AB1700/Panther probe-to-gene annotations.

Institut des Hautes Etudes Scientifiques, CNRS/INSERM, 35, route de Chartres, 91440 Bures sur Yvette, France.
BMC Bioinformatics (impact factor: 2.75). 02/2005; 6:307. DOI:10.1186/1471-2105-6-307 pp.307
Source: PubMed

ABSTRACT Significant inconsistencies between probe-to-gene annotations between different releases of probe set identifiers by commercial microarray platform solutions have been reported. Such inconsistencies lead to misleading or ambiguous interpretation of published gene expression results.
We report here similar inconsistencies in the probe-to-gene annotation of Applied Biosystems AB1700 data, demonstrating that this is not an isolated concern. Moreover, the online information source PANTHER does not provide information required to track such inconsistencies, hence, even correctly annotated datasets, when resubmitted after PANTHER was updated to a new probe-to-gene annotation release, will generate differing results without any feedback on the origin of the change.
The importance of unequivocal annotation of microarray experiments can not be underestimated. Inconsistencies greatly diminish the usefulness of the technology. Novel methods in the analysis of transcriptome profiles often rely on large disparate datasets stemming from multiple sources. The predictive and analytic power of such approaches rapidly diminishes if only least-common subsets can be used for analysis. We present here the information that needs to be provided together with the raw AB1700 data, and the information required together with the biologic interpretation of such data to avoid inconsistencies and tracking difficulties.

0 0
 · 
0 Bookmarks
 · 
26 Views
  • Source
    Article: Applications of DNA microarrays in biology.
    [show abstract] [hide abstract]
    ABSTRACT: DNA microarrays have enabled biology researchers to conduct large-scale quantitative experiments. This capacity has produced qualitative changes in the breadth of hypotheses that can be explored. In what has become the dominant mode of use, changes in the transcription rate of nearly all the genes in a genome, taking place in a particular tissue or cell type, can be measured in disease states, during development, and in response to intentional experimental perturbations, such as gene disruptions and drug treatments. The response patterns have helped illuminate mechanisms of disease and identify disease subphenotypes, predict disease progression, assign function to previously unannotated genes, group genes into functional pathways, and predict activities of new compounds. Directed at the genome sequence itself, microarrays have been used to identify novel genes, binding sites of transcription factors, changes in DNA copy number, and variations from a baseline sequence, such as in emerging strains of pathogens or complex mutations in disease-causing human genes. They also serve as a general demultiplexing tool to sort spatially the sequence-tagged products of highly parallel reactions performed in solution. A brief review of microarray platform technology options, and of the process steps involved in complete experiment workflows, is included.
    Annual Review of Biochemistry 02/2005; 74:53-82. · 34.32 Impact Factor
  • Source
    Article: NCBI GEO: mining tens of millions of expression profiles--database and tools update.
    [show abstract] [hide abstract]
    ABSTRACT: The Gene Expression Omnibus (GEO) repository at the National Center for Biotechnology Information (NCBI) archives and freely disseminates microarray and other forms of high-throughput data generated by the scientific community. The database has a minimum information about a microarray experiment (MIAME)-compliant infrastructure that captures fully annotated raw and processed data. Several data deposit options and formats are supported, including web forms, spreadsheets, XML and Simple Omnibus Format in Text (SOFT). In addition to data storage, a collection of user-friendly web-based interfaces and applications are available to help users effectively explore, visualize and download the thousands of experiments and tens of millions of gene expression patterns stored in GEO. This paper provides a summary of the GEO database structure and user facilities, and describes recent enhancements to database design, performance, submission format options, data query and retrieval utilities. GEO is accessible at http://www.ncbi.nlm.nih.gov/geo/
    Nucleic Acids Research 02/2007; 35(Database issue):D760-5. · 8.03 Impact Factor
  • Article: Linking experimental results, biological networks and sequence analysis methods using Ontologies and Generalized Data Structures.
    In Silico Biology. 01/2004; 5.

Full-text

View
0 Downloads
Available from

Keywords

ambiguous interpretation
 
annotated datasets
 
Applied Biosystems AB1700 data
 
biologic interpretation
 
commercial microarray platform solutions
 
different releases
 
gene expression results
 
identifiers
 
inconsistencies
 
inconsistencies lead
 
large disparate datasets
 
microarray experiments
 
misleading
 
multiple sources
 
new probe-to-gene annotation release
 
Novel methods
 
online information source PANTHER
 
probe-to-gene annotation
 
probe-to-gene annotations
 
Significant inconsistencies
 

Sebastian Noth