Meta-Analysis of Microarray Studies Reveals a Novel Hematopoietic Progenitor Cell Signature and Demonstrates Feasibility of Inter-Platform Data Integration

Albert Einstein College of Medicine, Bronx, New York, United States of America.
PLoS ONE (Impact Factor: 3.53). 02/2008; 3(8):e2965. DOI: 10.1371/journal.pone.0002965
Source: PubMed

ABSTRACT Microarray-based studies of global gene expression (GE) have resulted in a large amount of data that can be mined for further insights into disease and physiology. Meta-analysis of these data is hampered by technical limitations due to many different platforms, gene annotations and probes used in different studies. We tested the feasibility of conducting a meta-analysis of GE studies to determine a transcriptional signature of hematopoietic progenitor and stem cells. Data from studies that used normal bone marrow-derived hematopoietic progenitors was integrated using both RefSeq and UniGene identifiers. We observed that in spite of variability introduced by experimental conditions and different microarray platforms, our meta-analytical approach can distinguish biologically distinct normal tissues by clustering them based on their cell of origin. When studied in terms of disease states, GE studies of leukemias and myelodysplasia progenitors tend to cluster with normal progenitors and remain distinct from other normal tissues, further validating the discriminatory power of this meta-analysis. Furthermore, analysis of 57 normal hematopoietic stem and progenitor cell GE samples was used to determine a gene expression signature characteristic of these cells. Genes that were most uniformly expressed in progenitors and at the same time differentially expressed when compared to other normal tissues were found to be involved in important biological processes such as cell cycle regulation and hematopoiesis. Validation studies using a different microarray platform demonstrated the enrichment of several genes such as SMARCE, Septin 6 and others not previously implicated in hematopoiesis. Most interestingly, alpha-integrin, the only common stemness gene discovered in a recent comparative murine analysis (Science 302(5644):393) was also enriched in our dataset, demonstrating the usefulness of this analytical approach.

Download full-text


Available from: Tushar Bhagat, Aug 15, 2015
  • Source
    • "MK5 seems to be ubiquitously expressed because MK5 transcripts and proteins have been detected in all cell types and tissues examined. MK5 seems to be most abundantly expressed in heart, brain, and hematopoietic progenitors (New et al. , 1998 ; Ni et al. , 1998 ; Sohal et al. , 2008 ; Gerits et al. , 2009 ). The primary sequence shows that the protein is evolutionarily highly conserved with 87 – 98% amino acid identity between hMK5 and fish MK5, except for lamprey ( Petromyzon marinus ) MK5, which shares only 146/188 (78%) identical residues with hMK5 in its N-terminal part. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Abstract Mitogen-activated protein kinase (MAPK) pathways are important signal transduction pathways that control pivotal cellular processes including proliferation, differentiation, survival, apoptosis, gene regulation and motility. MAPK pathways consist of a relay of consecutive phosphorylation events exerted by MAPK kinase kinases, MAPK kinases and MAPKs. Conventional MAPKs are characterized by a conserved Thr-X-Tyr motif in the activation loop of the kinase domain, while atypical MAPKs lack this motif and do not seem to be organized into the classical three-tiered kinase cascade. One functional group of conventional and atypical MAPK substrates consists of protein kinases known as MAPK-activated protein kinases. Eleven mammalian MAPK-activated protein kinases have been identified and they are divided into five subgroups: the ribosomal-S6-kinases RSK1-4, the MAPK-interacting kinases MNK1 and 2, the mitogen-and stress-activated kinases MSK1 and 2, the MAPK-activated protein kinases MK2 and 3, and the MAPK-activated protein kinase MK5 (also referred to as PRAK). MK5/PRAK is the only MAPK-activated protein kinase that is substrate for both conventional and atypical MAPK, while all other MAPKAPKs are exclusively phosphorylated by conventional MAPKs. This review focuses on the structure, activation, substrates, functions and possible implications of MK5/PRAK in malignant and non-malignant diseases.
    Biological Chemistry 05/2013; 394(9). DOI:10.1515/hsz-2013-0149 · 2.69 Impact Factor
  • Source
    • "Over the past few years, a number of smaller scale efforts have attempted to define the specialised gene expression profiles of cells within the hemopoietic lineages in various states of differentiation and activation. Amongst these efforts, there have been several studies of isolated progenitor cells, aiming to identify the genes associated with "stemness", the capacity for self-renewal, to find additional markers that would enable isolation of these rare cells in high yield and purity for transplantation or in vitro regeneration [12]. Others have compared profiles of cells of the innate and acquired immune system in various states of activation or differentiation [13]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Very large microarray datasets showing gene expression across multiple tissues and cell populations provide a window on the transcriptional networks that underpin the differences in functional activity between biological systems. Clusters of co-expressed genes provide lineage markers, candidate regulators of cell function and, by applying the principle of guilt by association, candidate functions for genes of currently unknown function. We have analysed a dataset comprising pure cell populations from hemopoietic and non-hemopoietic cell types ( Using a novel network visualisation and clustering approach, we demonstrate that it is possible to identify very tight expression signatures associated specifically with embryonic stem cells, mesenchymal cells and hematopoietic lineages. Selected examples validate the prediction that gene function can be inferred by co-expression. One expression cluster was enriched in phagocytes, which, alongside endosome-lysosome constituents, contains genes that may make up a 'pathway' for phagocyte differentiation. Promoters of these genes are enriched for binding sites for the ETS/PU.1 and MITF families. Another cluster was associated with the production of a specific extracellular matrix, with high levels of gene expression shared by cells of mesenchymal origin (fibroblasts, adipocytes, osteoblasts and myoblasts). We discuss the limitations placed upon such data by the presence of alternative promoters with distinct tissue specificity within many protein-coding genes.
    Genomics 03/2010; 95(6):328-38. DOI:10.1016/j.ygeno.2010.03.002 · 2.79 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: A wide variety of stem cells has been reported to exist and renew several adult tissues, raising the question of the existence of a stemness signature-that is, a common molecular program of differentiation. To detect such a signature, we applied a data integration algorithm on several DNA microarray datasets generated by the Stem Cell Genome Anatomy Project (SCGAP) Consortium on several mouse and human tissues, to generate a cross-organism compendium that we submitted to a single layer artificial neural network (ANN) trained to attribute differentiation labels-from totipotent stem cells to differentiated ones (five labels in total were used). The inherent architecture of the system allowed studing the biology behind stem cells differentiation stages and the ANN isolated a 63 gene stemness signature. This chapter presents technological details on DNA microarray integration, ANN training through leave-one-out cross-validation, and independent testing on uncharacterized adult tissues by automated detection of differentiation capabilities on human prostate and mouse stomach progenitors. All scripts of the Stem Cell Analysis and characterization by Neural Networks (SCANN) project are available on the SourceForge Web site:
    Methods in enzymology 01/2009; 467:229-45. DOI:10.1016/S0076-6879(09)67009-9 · 2.19 Impact Factor
Show more