[Show abstract][Hide abstract] ABSTRACT: The woodland strawberry, Fragaria vesca (2n = 2x = 14), is a versatile experimental plant system. This diminutive herbaceous perennial has a small genome (240 Mb), is amenable to genetic transformation and shares substantial sequence identity with the cultivated strawberry (Fragaria × ananassa) and other economically important rosaceous plants. Here we report the draft F. vesca genome, which was sequenced to ×39 coverage using second-generation technology, assembled de novo and then anchored to the genetic linkage map into seven pseudochromosomes. This diploid strawberry sequence lacks the large genome duplications seen in other rosids. Gene prediction modeling identified 34,809 genes, with most being supported by transcriptome mapping. Genes critical to valuable horticultural traits including flavor, nutritional value and flowering time were identified. Macrosyntenic relationships between Fragaria and Prunus predict a hypothetical ancestral Rosaceae genome that had nine chromosomes. New phylogenetic analysis of 154 protein-coding genes suggests that assignment of Populus to Malvidae, rather than Fabidae, is warranted.
[Show abstract][Hide abstract] ABSTRACT: MicroRNAs (miRNA) have been globally profiled in cancers but there tends to be poor agreement between studies including in the same cancers. In addition, few putative miRNA targets have been validated. To overcome the lack of reproducibility, we profiled miRNAs by next generation sequencing and locked nucleic acid miRNA microarrays and verified concordant changes by quantitative RT-PCR. Notably, miR-125b and the miR-99 family members miR-99a, -99b, and -100 were downregulated in all assays in advanced prostate cancer cell lines relative to the parental cell lines from which they were derived. All four miRNAs were also downregulated in human prostate tumor tissue compared with normal prostate. Transfection of miR-99a, -99b, or -100 inhibited the growth of prostate cancer cells and decreased the expression of prostate-specific antigen (PSA), suggesting potential roles as tumor suppressors in this setting. To identify targets of these miRNAs, we combined computational prediction of potential targets with experimental validation by microarray and polyribosomal loading analysis. Three direct targets of the miR-99 family that were validated in this manner were the chromatin-remodeling factors SMARCA5 and SMARCD1 and the growth regulatory kinase mTOR. We determined that PSA is posttranscriptionally regulated by the miR-99 family members, at least partially, by repression of SMARCA5. Together, our findings suggest key functions and targets of miR-99 family members in prostate cancer suppression and prognosis.
Cancer Research 02/2011; 71(4):1313-24. · 9.28 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: In the oral cavity, chronic inflammation has been observed at various stages of oral squamous cell carcinomas (OSCC). Such inflammation could result from persistent mucosal or epithelial cell colonization by microorganisms. There is increasing evidence of the involvement of oral bacteria in inflammation, warranting further studies on the association of bacteria with the progression of OSCC. The objective of this study was to evaluate the diversity and relative abundance of bacteria in the saliva of subjects with OSCC. Using 454 parallel DNA sequencing, ∼58,000 PCR amplicons that span the V4-V5 hypervariable region of rRNAs from five subjects were sequenced. Members of eight phyla (divisions) of bacteria were detected. The majority of classified sequences belonged to the phyla Firmicutes (45%) and Bacteroidetes (25%). Further, 52 different genera containing approximately 860 (16.51%) known species were identified and 1077 (67%) sequences belonging to various uncultured bacteria or unclassified groups. The species diversity estimates obtained with abundance-based coverage estimators and Chao1 were greater than published analyses of other microbial profiles from the oral cavity. Fifteen unique phylotypes were present in all three OSCC subjects.
[Show abstract][Hide abstract] ABSTRACT: Entamoeba histolytica contains a large and novel family of transmembrane kinases (TMKs). The expression patterns of the E. histolytica TMKs in individual trophozoites and the roles of the TMKs for sensing and responding to extracellular cues were incompletely characterised. Here we provide evidence that single cells express multiple TMKs and that TMK39 and TMK54 likely serve non-redundant cellular functions. Laser-capture microdissection was used in conjunction with microarray analysis to demonstrate that single trophozoites express more than one TMK gene. Anti-peptide antibodies were raised against unique regions in the extracellular domains of TMK39, TMK54 and PaTMK, and TMK expression was analysed at the protein level. Flow cytometric assays revealed that populations of trophozoites homogeneously expressed TMK39, TMK54 and PaTMK, while confocal microscopy identified different patterns of cell surface expression for TMK39 and TMK54. The functions of TMK39 and TMK54 were probed by the inducible expression of dominant-negative mutants. While TMK39 co-localised with ingested beads and expression of truncated TMK39 interfered with trophozoite phagocytosis of apoptotic lymphocytes, expression of a truncated TMK54 inhibited growth of amoebae and altered the surface expression of the heavy subunit of the E. histolytica Gal/GalNAc lectin. Overall, our data indicates that multiple members of the novel E. histolytica TMK family are utilised for non-redundant functions by the parasite.
International journal for parasitology 06/2010; 40(7):833-43. · 3.39 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: It has been shown previously that administration of Francisella tularensis (Ft) Live Vaccine Strain (LVS) lipopolysaccharide (LPS) protects mice against subsequent challenge with Ft LVS and blunts the pro-inflammatory cytokine response.
To further investigate the molecular mechanisms that underlie Ft LVS LPS-mediated protection, we profiled global hepatic gene expression following Ft LVS LPS or saline pre-treatment and subsequent Ft LVS challenge using Affymetrix arrays.
A large number of genes (> 3,000) were differentially expressed at 48 hours post-infection. The degree of modulation of inflammatory genes by infection was clearly attenuated by pre-treatment with Ft LVS LPS in the surviving mice. However, Ft LVS LPS alone had a subtle effect on the gene expression profile of the uninfected mice. By employing gene set enrichment analysis, we discovered significant up-regulation of the fatty acid metabolism pathway, which is regulated by peroxisome proliferator activated receptors (PPARs).
We hypothesize that the LPS-induced blunting of pro-inflammatory response in mouse is, in part, mediated by PPARs (alpha and gamma).
[Show abstract][Hide abstract] ABSTRACT: A synergistic combination of two next-generation sequencing platforms with a detailed comparative BAC physical contig map provided a cost-effective assembly of the genome sequence of the domestic turkey (Meleagris gallopavo). Heterozygosity of the sequenced source genome allowed discovery of more than 600,000 high quality single nucleotide variants. Despite this heterozygosity, the current genome assembly (∼1.1 Gb) includes 917 Mb of sequence assigned to specific turkey chromosomes. Annotation identified nearly 16,000 genes, with 15,093 recognized as protein coding and 611 as non-coding RNA genes. Comparative analysis of the turkey, chicken, and zebra finch genomes, and comparing avian to mammalian species, supports the characteristic stability of avian genomes and identifies genes unique to the avian lineage. Clear differences are seen in number and variety of genes of the avian immune system where expansions and novel genes are less frequent than examples of gene loss. The turkey genome sequence provides resources to further understand the evolution of vertebrate genomes and genetic variation underlying economically important quantitative traits in poultry. This integrated approach may be a model for providing both gene and chromosome level assemblies of other species with agricultural, ecological, and evolutionary interest.
[Show abstract][Hide abstract] ABSTRACT: Transcriptome sequencing using next-generation sequencing platforms will soon be competing with DNA microarray technologies for global gene expression analysis. As a preliminary evaluation of these promising technologies, we performed deep sequencing of cDNA synthesized from the Microarray Quality Control (MAQC) reference RNA samples using Roche's 454 Genome Sequencer FLX.
We generated more that 3.6 million sequence reads of average length 250 bp for the MAQC A and B samples and introduced a data analysis pipeline for translating cDNA read counts into gene expression levels. Using BLAST, 90% of the reads mapped to the human genome and 64% of the reads mapped to the RefSeq database of well annotated genes with e-values </= 10-20. We measured gene expression levels in the A and B samples by counting the numbers of reads that mapped to individual RefSeq genes in multiple sequencing runs to evaluate the MAQC quality metrics for reproducibility, sensitivity, specificity, and accuracy and compared the results with DNA microarrays and Quantitative RT-PCR (QRTPCR) from the MAQC studies. In addition, 88% of the reads were successfully aligned directly to the human genome using the AceView alignment programs with an average 90% sequence similarity to identify 137,899 unique exon junctions, including 22,193 new exon junctions not yet contained in the RefSeq database.
Using the MAQC metrics for evaluating the performance of gene expression platforms, the ExpressSeq results for gene expression levels showed excellent reproducibility, sensitivity, and specificity that improved systematically with increasing shotgun sequencing depth, and quantitative accuracy that was comparable to DNA microarrays and QRTPCR. In addition, a careful mapping of the reads to the genome using the AceView alignment programs shed new light on the complexity of the human transcriptome including the discovery of thousands of new splice variants.
[Show abstract][Hide abstract] ABSTRACT: High throughput methods, such as high density oligonucleotide microarray measurements of mRNA levels, are popular and critical to genome scale analysis and systems biology. However understanding the results of these analyses and in particular understanding the very wide range of levels of transcriptional changes observed is still a significant challenge. Many researchers still use an arbitrary cut off such as two-fold in order to identify changes that may be biologically significant. We have used a very large-scale microarray experiment involving 72 biological replicates to analyze the response of soybean plants to infection by the pathogen Phytophthora sojae and to analyze transcriptional modulation as a result of genotypic variation.
With the unprecedented level of statistical sensitivity provided by the high degree of replication, we show unambiguously that almost the entire plant genome (97 to 99% of all detectable genes) undergoes transcriptional modulation in response to infection and genetic variation. The majority of the transcriptional differences are less than two-fold in magnitude. We show that low amplitude modulation of gene expression (less than two-fold changes) is highly statistically significant and consistent across biological replicates, even for modulations of less than 20%. Our results are consistent through two different normalization methods and two different statistical analysis procedures.
Our findings demonstrate that the entire plant genome undergoes transcriptional modulation in response to infection and genetic variation. The pervasive low-magnitude remodeling of the transcriptome may be an integral component of physiological adaptation in soybean, and in all eukaryotes.
[Show abstract][Hide abstract] ABSTRACT: The unicellular eukaryote Entamoeba histolytica is a human parasite that causes amebic dysentery and liver abscess. A genome-wide analysis of gene expression modulated by intestinal colonization and invasion identified an upregulated transcript that encoded a putative high-mobility-group box (HMGB) protein, EhHMGB1. We tested if EhHMGB1 encoded a functional HMGB protein and determined its role in control of parasite gene expression. Recombinant EhHMGB1 was able to bend DNA in vitro, a characteristic of HMGB proteins. Core conserved residues required for DNA bending activity in other HMGB proteins were demonstrated by mutational analysis to be essential for EhHMGB1 activity. EhHMGB1 was also able to enhance the binding of human p53 to its cognate DNA sequence in vitro, which is expected for an HMGB1 protein. Confocal microscopy, using antibodies against the recombinant protein, confirmed its nuclear localization. Overexpression of EhHMGB1 in HM1:IMSS trophozoites led to modulation of 33 transcripts involved in a variety of cellular functions. Of these, 20 were also modulated at either day 1 or day 29 in the mouse model of intestinal amebiasis. Notably, four transcripts with known roles in virulence, including two encoding Gal/GalNAc lectin light chains, were modulated in response to EhHMGB1 overexpression. We concluded that EhHMGB1 was a bona fide HMGB protein with the capacity to recapitulate part of the modulation of parasite gene expression seen during adaptation to the host intestine.
[Show abstract][Hide abstract] ABSTRACT: BACKGROUND:Sinorhizobium meliloti is an agriculturally important model symbiont. There is an ongoing need to update and improve its genome annotation. In this study, we used a high-throughput pyrosequencing approach to sequence the transcriptome of S. meliloti, and search for new bacterial genes missed in the previous genome annotation. This is the first report of sequencing a bacterial transcriptome using the pyrosequencing technology.
RESULTS:Our pilot sequencing run generated 19,005 reads with an average length of 136 nucleotides per read. From these data, we identified 20 new genes. These new gene transcripts were confirmed by RT-PCR and their possible functions were analyzed.
CONCLUSION:Our results indicate that high-throughput sequence analysis of bacterial transcriptomes is feasible and next-generation sequencing technologies will greatly facilitate the discovery of new genes and improve genome annotation.
[Show abstract][Hide abstract] ABSTRACT: The Brucella abortus strain S19, a spontaneously attenuated strain, has been used as a vaccine strain in vaccination of cattle against brucellosis for six decades. Despite many studies, the physiological and molecular mechanisms causing the attenuation are not known. We have applied pyrosequencing technology together with conventional sequencing to rapidly and comprehensively determine the complete genome sequence of the attenuated Brucella abortus vaccine strain S19. The main goal of this study is to identify candidate virulence genes by systematic comparative analysis of the attenuated strain with the published genome sequences of two virulent and closely related strains of B. abortus, 9-941 and 2308. The two S19 chromosomes are 2,122,487 and 1,161,449 bp in length. A total of 3062 genes were identified and annotated. Pairwise and reciprocal genome comparisons resulted in a total of 263 genes that were non-identical between the S19 genome and any of the two virulent strains. Amongst these, 45 genes were consistently different between the attenuated strain and the two virulent strains but were identical amongst the virulent strains, which included only two of the 236 genes that have been implicated as virulence factors in literature. The functional analyses of the differences have revealed a total of 24 genes that may be associated with the loss of virulence in S19. Of particular relevance are four genes with more than 60 bp consistent difference in S19 compared to both the virulent strains, which, in the virulent strains, encode an outer membrane protein and three proteins involved in erythritol uptake or metabolism.
PLoS ONE 02/2008; 3(5):e2193. · 3.53 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: The design and construction of novel biological systems by combining basic building blocks represents a dominant paradigm in synthetic biology. Creating and maintaining a database of these building blocks is a way to streamline the fabrication of complex constructs. The Registry of Standard Biological Parts (Registry) is the most advanced implementation of this idea.
By analyzing inclusion relationships between the sequences of the Registry entries, we build a network that can be related to the Registry abstraction hierarchy. The distribution of entry reuse and complexity was extracted from this network. The collection of clones associated with the database entries was also analyzed. The plasmid inserts were amplified and sequenced. The sequences of 162 inserts could be confirmed experimentally but unexpected discrepancies have also been identified.
Organizational guidelines are proposed to help design and manage this new type of scientific resources. In particular, it appears necessary to compare the cost of ensuring the integrity of database entries and associated biological samples with their value to the users. The initial strategy that permits including any combination of parts irrespective of its potential value leads to an exponential and economically unsustainable growth that may be detrimental to the quality and long-term value of the resource to its users.
PLoS ONE 01/2008; 3(7):e2671. · 3.53 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: The Entamoeba histolytica transcription factor Upstream Regulatory Element 3-Binding Protein (URE3-BP) is a calcium-responsive regulator of two E. histolytica virulence genes, hgl5 and fdx1. URE3-BP was previously identified by a yeast one-hybrid screen of E. histolytica proteins capable of binding to the sequence TATTCTATT (Upstream Regulatory Element 3 (URE3)) in the promoter regions of hgl5 and fdx1. In this work, precise definition of the consensus URE3 element was performed by electrophoretic mobility shift assays (EMSA) using base-substituted oligonucleotides, and the consensus motif validated using episomal reporter constructs. Transcriptome profiling of a strain induced to produce a dominant-positive URE3-BP was then used to identify additional genes regulated by URE3-BP. Fifty modulated transcripts were identified, and of these the EMSA defined motif T[atg]T[tc][cg]T[at][tgc][tg] was found in over half of the promoters (54% p<0.0001). Fifteen of the URE3-BP regulated genes were potential membrane proteins, suggesting that one function of URE3-BP is to remodel the surface of E. histolytica in response to a calcium signal. Induction of URE3-BP leads to an increase in tranwell migration, suggesting a possible role in the regulation of cellular motility.
[Show abstract][Hide abstract] ABSTRACT: Mathematical modelling of cellular processes is crucial for the understanding of the cell or organism as a whole. Genome-wide observations, at the levels of the transcriptome, proteome and metabolome, provide a high coverage of the molecular constituents of the system in study. Time-course experiments are important for gaining insight into a system's dynamics and are needed for mathematical modelling. In time-course experiments it is crucial to use efficient and fast sampling techniques. We evaluated several techniques to sample and process yeast cultures for parallel analysis of the transcriptome and metabolome. The evaluation was made by measuring the quality of the RNA obtained with UV-spectroscopy, capillary electrophoresis and microarray hybridization. The protocol developed involves rapid collection by spraying the sample into -40 degrees C tricine-buffered methanol (as previously described for yeast metabolome analysis), followed by the separation of cells from the culture medium in low-temperature rapid centrifugation. Removal of the residual methanol is carried out by freeze-drying the pellet at -35 degrees C. RNA and metabolites can then be extracted from the same freeze-dried sample obtained with this procedure.
[Show abstract][Hide abstract] ABSTRACT: A genome-wide transcriptional analysis of Entamoeba histolytica was performed on trophozoites isolated from the colon of six infected mice and from in vitro culture. An Affymetrix platform gene expression array was designed for this analysis that included probe sets for 9435 open reading frames (ORFs) and 9066 5' and 3' flanking regions. Transcripts were detected for > 80% of all ORFs. A total of 523 transcripts (5.2% of all E. histolytica genes) were significantly changed in amebae isolated from the intestine on Days 1 and 29 after infection: 326 and 109 solely on Days 1 and 29, and 88 on both days. Quantitative real-time reverse transcriptase PCR confirmed these changes in 11/12 genes tested using mRNA isolated from an additional six mice. Adaptation to the intestinal environment was accompanied by increases in a subset of cell signaling genes including transmembrane kinases, ras and rho family GTPases, and calcium binding proteins. Significant decreases in mRNA abundance for genes involved in glycolysis and concomitant increases in lipases were consistent with a change in energy metabolism. Defense against bacteria present in the intestine (but lacking from in vitro culture) was suggested by alterations in mRNA levels of genes similar to the AIG1 plant antibacterial proteins. Decreases in oxygen detoxification pathways were observed as expected in the anaerobic colonic lumen. Of the known virulence factors the most remarkable changes were a 20-35-fold increase in a cysteine proteinase four-like gene, and a 2-3-fold decrease in two members of the Gal/GalNAc lectin light subunit family. Control of the observed changes in mRNA abundance in the intestine might potentially rest with four related proteins with DNA binding domains that were down-regulated 6-16-fold in the intestinal environment. In conclusion, the first genome-wide analysis of the transcriptome of E. histolytica demonstrated that the vast majority of genes are transcribed in trophozoites, and that in the host intestine trophozoites altered the expression of mRNAs for genes implicated in metabolism, oxygen defense, cell signaling, virulence, antibacterial activity, and DNA binding.
Molecular and Biochemical Parasitology 06/2006; 147(2):163-76. · 2.73 Impact Factor