Publications (17)102.04 Total impact
-
Article: Compression of next-generation sequencing reads aided by highly efficient de novo assembly.
[show abstract] [hide abstract]
ABSTRACT: We present Quip, a lossless compression algorithm for next-generation sequencing data in the FASTQ and SAM/BAM formats. In addition to implementing reference-based compression, we have developed, to our knowledge, the first assembly-based compressor, using a novel de novo assembly algorithm. A probabilistic data structure is used to dramatically reduce the memory required by traditional de Bruijn graph assemblers, allowing millions of reads to be assembled very efficiently. Read sequences are then stored as positions within the assembled contigs. This is combined with statistical compression of read identifiers, quality scores, alignment information and sequences, effectively collapsing very large data sets to <15% of their original size with no loss of information. Availability: Quip is freely available under the 3-clause BSD license from http://cs.washington.edu/homes/dcjones/quip.Nucleic Acids Research 08/2012; · 8.03 Impact Factor -
Article: A new approach to bias correction in RNA-Seq.
[show abstract] [hide abstract]
ABSTRACT: Quantification of sequence abundance in RNA-Seq experiments is often conflated by protocol-specific sequence bias. The exact sources of the bias are unknown, but may be influenced by polymerase chain reaction amplification, or differing primer affinities and mixtures, for example. The result is decreased accuracy in many applications, such as de novo gene annotation and transcript quantification. We present a new method to measure and correct for these influences using a simple graphical model. Our model does not rely on existing gene annotations, and model selection is performed automatically making it applicable with few assumptions. We evaluate our method on several datasets, and by multiple criteria, demonstrating that it effectively decreases bias and increases uniformity. Additionally, we provide theoretical and empirical results showing that the method is unlikely to have any effect on unbiased data, suggesting it can be applied with little risk of spurious adjustment. The method is implemented in the seqbias R/Bioconductor package, available freely under the LGPL license from http://bioconductor.org dcjones@cs.washington.edu Supplementary data are available at Bioinformatics online.Bioinformatics 01/2012; 28(7):921-8. · 5.47 Impact Factor -
Article: SAP1 is a critical post-transcriptional regulator of infectivity in malaria parasite sporozoite stages.
[show abstract] [hide abstract]
ABSTRACT: Plasmodium salivary gland sporozoites upregulate expression of a unique subset of genes, collectively called the UIS (upregulated in infectious sporozoites). Many UIS were shown to be essential for early liver stage development, although little is known about their regulation. We previously identified a conserved sporozoite-specific protein, SAP1, which has an essential role in Plasmodium liver infection. Targeted deletion of SAP1 in Plasmodium yoelii caused the depletion of a number of selectively tested UIS transcripts in sporozoites, resulting in a complete early liver stage arrest. Here, we use a global gene expression survey to more comprehensively identify transcripts that are affected by SAP1 deletion. We find an effect upon both the transcript abundance of UIS genes, as well as of select genes previously not grouped as UIS. Importantly, we show that the lack of SAP1 causes the specific degradation of these transcripts. Collectively, our data suggest that SAP1 is involved in a selective post-transcriptional mechanism to regulate the abundance of transcripts critical to the infectivity of sporozoites. Although Pysap1(-) sporozoites are depleted of many of these important transcripts, they confer long-lasting sterile protection against wild-type sporozoite challenge in mice. SAP1 is therefore an appealing candidate locus for attenuation of Plasmodium falciparum.Molecular Microbiology 02/2011; 79(4):929-39. · 5.01 Impact Factor -
Article: Next-generation sequencing reveals HIV-1-mediated suppression of T cell activation and RNA processing and regulation of noncoding RNA expression in a CD4+ T cell line.
[show abstract] [hide abstract]
ABSTRACT: Next-generation sequencing (NGS) enables the highly sensitive measurement of whole transcriptomes. We report the first application to our knowledge of this technology to the analysis of RNA from a CD4+ T cell line infected with intact HIV. We sequenced the total mRNA from infected cells and detected differences in the expression of both host and viral mRNA. Viral reads represented a large portion of the total mapped sequencing reads: approximately 20% at 12 h postinfection (hpi) and 40% at 24 hpi. We also detected a small but significant suppression of T cell activation-related genes at 12 hpi. This suppression persisted and expanded by 24 hpi, providing new possible markers of virus-induced T cell cytopathology. By 24 hpi, the expression of over 50% of detectable host loci was also altered, indicating widespread alteration of host processes, including RNA processing, splicing, and transport to an extent not previously reported. In addition, next-generation sequencing provided insights into alternative viral RNA splice events and the expression of noncoding RNAs, including microRNA host genes. IMPORTANCE: Recent advances in sequencing technology now allow the measurement of effectively all the RNA in a cell. This approach is especially useful for studying models of virus infection, as it allows the simultaneous measurement of both host and viral RNA. Using next-generation sequencing (NGS), we measured changes in total mRNA from a HIV-infected T cell line. To our knowledge, this is the first application of this technology to the investigation of HIV-host interactions involving intact HIV. We directly measured the amount of viral mRNA in infected cells and detected novel viral RNA splice variants and changes in the host expression of noncoding RNA species. We also detected small changes in T cell activation and other host processes during the early stages of viral replication that increased near the peak of viral replication, providing new candidate biomarkers of T cell death.mBio 01/2011; 2(5). · 5.31 Impact Factor -
Article: Integrative deep sequencing of the mouse lung transcriptome reveals differential expression of diverse classes of small RNAs in response to respiratory virus infection.
[show abstract] [hide abstract]
ABSTRACT: We previously reported widespread differential expression of long non-protein-coding RNAs (ncRNAs) in response to virus infection. Here, we expanded the study through small RNA transcriptome sequencing analysis of the host response to both severe acute respiratory syndrome coronavirus (SARS-CoV) and influenza virus infections across four founder mouse strains of the Collaborative Cross, a recombinant inbred mouse resource for mapping complex traits. We observed differential expression of over 200 small RNAs of diverse classes during infection. A majority of identified microRNAs (miRNAs) showed divergent changes in expression across mouse strains with respect to SARS-CoV and influenza virus infections and responded differently to a highly pathogenic reconstructed 1918 virus compared to a minimally pathogenic seasonal influenza virus isolate. Novel insights into miRNA expression changes, including the association with pathogenic outcomes and large differences between in vivo and in vitro experimental systems, were further elucidated by a survey of selected miRNAs across diverse virus infections. The small RNAs identified also included many non-miRNA small RNAs, such as small nucleolar RNAs (snoRNAs), in addition to nonannotated small RNAs. An integrative sequencing analysis of both small RNAs and long transcripts from the same samples showed that the results revealing differential expression of miRNAs during infection were largely due to transcriptional regulation and that the predicted miRNA-mRNA network could modulate global host responses to virus infection in a combinatorial fashion. These findings represent the first integrated sequencing analysis of the response of host small RNAs to virus infection and show that small RNAs are an integrated component of complex networks involved in regulating the host response to infection. IMPORTANCE: Most studies examining the host transcriptional response to infection focus only on protein-coding genes. However, mammalian genomes transcribe many short and long non-protein-coding RNAs (ncRNAs). With the advent of deep-sequencing technologies, systematic transcriptome analysis of the host response, including analysis of ncRNAs of different sizes, is now possible. Using this approach, we recently discovered widespread differential expression of host long (>200 nucleotide [nt]) ncRNAs in response to virus infection. Here, the samples described in the previous report were again used, but we sequenced another fraction of the transcriptome to study very short (about 20 to 30 nt) ncRNAs. We demonstrated that virus infection also altered expression of many short ncRNAs of diverse classes. Putting the results of the two studies together, we show that small RNAs may also play an important role in regulating the host response to virus infection.mBio 01/2011; 2(6). · 5.31 Impact Factor -
Article: Lethal dissemination of H5N1 influenza virus is associated with dysregulation of inflammation and lipoxin signaling in a mouse model of infection.
[show abstract] [hide abstract]
ABSTRACT: Periodic outbreaks of highly pathogenic avian H5N1 influenza viruses and the current H1N1 pandemic highlight the need for a more detailed understanding of influenza virus pathogenesis. To investigate the host transcriptional response induced by pathogenic influenza viruses, we used a functional-genomics approach to compare gene expression profiles in lungs from 129S6/SvEv mice infected with either the fully reconstructed H1N1 1918 pandemic virus (1918) or the highly pathogenic avian H5N1 virus Vietnam/1203/04 (VN/1203). Although the viruses reached similar titers in the lung and caused lethal infections, the mean time of death was 6 days for VN/1203-infected animals and 9 days for mice infected with the 1918 virus. VN/1203-infected animals also exhibited an earlier and more potent inflammatory response. This response included induction of genes encoding components of the inflammasome. VN/1203 was also able to disseminate to multiple organs, including the brain, which correlated with changes in the expression of genes associated with hematological functions and lipoxin biogenesis and signaling. Both viruses elicited expression of type I interferon (IFN)-regulated genes in wild-type mice and to a lesser extent in mice lacking the type I IFN receptor, suggesting alternative or redundant pathways for IFN signaling. Our findings suggest that VN/1203 is more pathogenic in mice as a consequence of several factors, including the early and sustained induction of the inflammatory response, the additive or synergistic effects of upregulated components of the immune response, and inhibition of lipoxin-mediated anti-inflammatory responses, which correlated with the ability of VN/1203 to disseminate to extrapulmonary organs.Journal of Virology 08/2010; 84(15):7613-24. · 5.40 Impact Factor -
Article: MicroRNA expression and virulence in pandemic influenza virus-infected mice.
[show abstract] [hide abstract]
ABSTRACT: The worst known H1N1 influenza pandemic in history resulted in more than 20 million deaths in 1918 and 1919. Although the underlying mechanism causing the extreme virulence of the 1918 influenza virus is still obscure, our previous functional genomics analyses revealed a correlation between the lethality of the reconstructed 1918 influenza virus (r1918) in mice and a unique gene expression pattern associated with severe immune responses in the lungs. Lately, microRNAs have emerged as a class of crucial regulators for gene expression. To determine whether differential expression of cellular microRNAs plays a role in the host response to r1918 infection, we compared the lung cellular "microRNAome" of mice infected by r1918 virus with that of mice infected by a nonlethal seasonal influenza virus, A/Texas/36/91. We found that a group of microRNAs, including miR-200a and miR-223, were differentially expressed in response to influenza virus infection and that r1918 and A/Texas/36/91 infection induced distinct microRNA expression profiles. Moreover, we observed significant enrichment in the number of predicted cellular target mRNAs whose expression was inversely correlated with the expression of these microRNAs. Intriguingly, gene ontology analysis revealed that many of these mRNAs play roles in immune response and cell death pathways, which are known to be associated with the extreme virulence of r1918. This is the first demonstration that cellular gene expression patterns in influenza virus-infected mice may be attributed in part to microRNA regulation and that such regulation may be a contributing factor to the extreme virulence of the r1918.Journal of Virology 03/2010; 84(6):3023-32. · 5.40 Impact Factor -
Article: Unique signatures of long noncoding RNA expression in response to virus infection and altered innate immune signaling.
[show abstract] [hide abstract]
ABSTRACT: Studies of the host response to virus infection typically focus on protein-coding genes. However, non-protein-coding RNAs (ncRNAs) are transcribed in mammalian cells, and the roles of many of these ncRNAs remain enigmas. Using next-generation sequencing, we performed a whole-transcriptome analysis of the host response to severe acute respiratory syndrome coronavirus (SARS-CoV) infection across four founder mouse strains of the Collaborative Cross. We observed differential expression of approximately 500 annotated, long ncRNAs and 1,000 nonannotated genomic regions during infection. Moreover, studies of a subset of these ncRNAs and genomic regions showed the following. (i) Most were similarly regulated in response to influenza virus infection. (ii) They had distinctive kinetic expression profiles in type I interferon receptor and STAT1 knockout mice during SARS-CoV infection, including unique signatures of ncRNA expression associated with lethal infection. (iii) Over 40% were similarly regulated in vitro in response to both influenza virus infection and interferon treatment. These findings represent the first discovery of the widespread differential expression of long ncRNAs in response to virus infection and suggest that ncRNAs are involved in regulating the host response, including innate immunity. At the same time, virus infection models provide a unique platform for studying the biology and regulation of ncRNAs.mBio 01/2010; 1(5). · 5.31 Impact Factor -
Article: The alpha/beta interferon receptor provides protection against influenza virus replication but is dispensable for inflammatory response signaling.
[show abstract] [hide abstract]
ABSTRACT: The innate immune response provides the first line of defense against foreign pathogens by responding to molecules that are a signature of a pathogenic infection. Certain RNA viruses, such as influenza virus, produce double-stranded RNA as an intermediate during the replication life cycle, which activates pathogen recognition receptors capable of inducing interferon production. By engaging interferon receptors, interferon activates the JAK-STAT pathway and results in the positive feedback of interferon production, amplifying the response to viral infection. To examine how deficiencies in interferon signaling affect the cellular response to infection, we performed influenza virus infections of mouse embryonic fibroblasts lacking the alpha/beta interferon receptor, the gamma interferon receptor, or both. In the absence of the alpha/beta interferon receptor, we observed increased viral replication but decreased activation of PKR, Stat1, and NF-kappaB; the presence or absence of the gamma interferon receptor did not exhibit discernible differences in these readouts. Analysis of gene expression profiles showed that while cells lacking the alpha/beta interferon receptor exhibited decreased levels of transcription of antiviral genes, genes related to inflammatory and apoptotic responses were transcribed to levels similar to those of cells containing the receptor. These results indicate that while the alpha/beta interferon receptor is needed to curb viral replication, it is dispensable for the induction of certain inflammatory and apoptotic genes. We have identified potential pathways, via interferon regulatory factor 3 (IRF3) activation or Hoxa13, Polr2a, Nr4a1, or Ing1 induction, that contribute to this redundancy. This study illustrates another way in which the host has evolved to establish several overlapping mechanisms to respond to viral infections.Journal of Virology 11/2009; 84(4):2027-37. · 5.40 Impact Factor -
Article: Lethal influenza virus infection in macaques is associated with early dysregulation of inflammatory related genes.
[show abstract] [hide abstract]
ABSTRACT: The enormous toll on human life during the 1918-1919 Spanish influenza pandemic is a constant reminder of the potential lethality of influenza viruses. With the declaration by the World Health Organization of a new H1N1 influenza virus pandemic, and with continued human cases of highly pathogenic H5N1 avian influenza virus infection, a better understanding of the host response to highly pathogenic influenza viruses is essential. To this end, we compared pathology and global gene expression profiles in bronchial tissue from macaques infected with either the reconstructed 1918 pandemic virus or the highly pathogenic avian H5N1 virus A/Vietnam/1203/04. Severe pathology was observed in respiratory tissues from 1918 virus-infected animals as early as 12 hours after infection, and pathology steadily increased at later time points. Although tissues from animals infected with A/Vietnam/1203/04 also showed clear signs of pathology early on, less pathology was observed at later time points, and there was evidence of tissue repair. Global transcriptional profiles revealed that specific groups of genes associated with inflammation and cell death were up-regulated in bronchial tissues from animals infected with the 1918 virus but down-regulated in animals infected with A/Vietnam/1203/04. Importantly, the 1918 virus up-regulated key components of the inflammasome, NLRP3 and IL-1beta, whereas these genes were down-regulated by A/Vietnam/1203/04 early after infection. TUNEL assays revealed that both viruses elicited an apoptotic response in lungs and bronchi, although the response occurred earlier during 1918 virus infection. Our findings suggest that the severity of disease in 1918 virus-infected macaques is a consequence of the early up-regulation of cell death and inflammatory related genes, in which additive or synergistic effects likely dictate the severity of tissue damage.PLoS Pathogens 10/2009; 5(10):e1000604. · 9.13 Impact Factor -
Article: Computational identification of hepatitis C virus associated microRNA-mRNA regulatory modules in human livers.
[show abstract] [hide abstract]
ABSTRACT: Hepatitis C virus (HCV) is a major cause of chronic liver disease by infecting over 170 million people worldwide. Recent studies have shown that microRNAs (miRNAs), a class of small non-coding regulatory RNAs, are involved in the regulation of HCV infection, but their functions have not been systematically studied. We propose an integrative strategy for identifying the miRNA-mRNA regulatory modules that are associated with HCV infection. This strategy combines paired expression profiles of miRNAs and mRNAs and computational target predictions. A miRNA-mRNA regulatory module consists of a set of miRNAs and their targets, in which the miRNAs are predicted to coordinately regulate the level of the target mRNA. We simultaneously profiled the expression of cellular miRNAs and mRNAs across 30 HCV positive or negative human liver biopsy samples using microarray technology. We constructed a miRNA-mRNA regulatory network, and using a graph theoretical approach, identified 38 miRNA-mRNA regulatory modules in the network that were associated with HCV infection. We evaluated the direct miRNA regulation of the mRNA levels of targets in regulatory modules using previously published miRNA transfection data. We analyzed the functional roles of individual modules at the systems level by integrating a large-scale protein interaction network. We found that various biological processes, including some HCV infection related canonical pathways, were regulated at the miRNA level during HCV infection. Our regulatory modules provide a framework for future experimental analyses. This report demonstrates the utility of our approach to obtain new insights into post-transcriptional gene regulation at the miRNA level in complex human diseases.BMC Genomics 09/2009; 10:373. · 4.07 Impact Factor -
Article: Virus-host interactions: from systems biology to translational research.
[show abstract] [hide abstract]
ABSTRACT: Research embracing systems biology approaches and careful analysis of the critical host response has greatly expanded our understanding of infectious diseases. First-generation studies based on genomics and proteomics have made significant progress in establishing the foundation for network-based investigations on virus-host interactions. More recently, data from complementary high-throughput technologies, such as siRNA and microRNA screens and next-generation sequencing, are augmenting systems level analyses and are providing a more detailed and insightful multidimensional view of virus-host networks. Together with advances in data integration, systems biology approaches now have the potential to provide profound impacts on translational research, leading to the more rapid development of new therapeutics and vaccines for infectious diseases. In this review, we highlight new high-throughput technologies, a new philosophy for studying virus-host interactions, and discuss the potential of systems biology to facilitate bench-to-bedside research and create novel strategies to combat disease. Can we save the world using these approaches? Read on.Current opinion in microbiology 08/2009; 12(4):432-8. · 7.87 Impact Factor -
Article: P58(IPK): a novel "CIHD" member of the host innate defense response against pathogenic virus infection.
[show abstract] [hide abstract]
ABSTRACT: To support their replication, viruses take advantage of numerous cellular factors and processes. Recent large-scale screens have identified hundreds of such factors, yet little is known about how viruses exploit any of these. Influenza virus infection post-translationally activates P58(IPK), a cellular inhibitor of the interferon-induced, dsRNA-activated eIF2alpha kinase, PKR. Here, we report that infection of P58(IPK) knockout mice with influenza virus resulted in increased lung pathology, immune cell apoptosis, PKR activation, and mortality. Analysis of lung transcriptional profiles, including those induced by the reconstructed 1918 pandemic virus, revealed increased expression of genes associated with the cell death, immune, and inflammatory responses. These experiments represent the first use of a mammalian infection model to demonstrate the role of P58(IPK) in the antiviral response. Our results suggest that P58(IPK) represents a new class of molecule, a cellular inhibitor of the host defense (CIHD), as P58(IPK) is activated during virus infection to inhibit virus-induced apoptosis and inflammation to prolong host survival, even while prolonging viral replication.PLoS Pathogens 06/2009; 5(5):e1000438. · 9.13 Impact Factor -
Article: Distinct malaria parasite sporozoites reveal transcriptional changes that cause differential tissue infection competence in the mosquito vector and mammalian host.
[show abstract] [hide abstract]
ABSTRACT: The malaria parasite sporozoite transmission stage develops and differentiates within parasite oocysts on the Anopheles mosquito midgut. Successful inoculation of the parasite into a mammalian host is critically dependent on the sporozoite's ability to first infect the mosquito salivary glands. Remarkable changes in tissue infection competence are observed as the sporozoites transit from the midgut oocysts to the salivary glands. Our microarray analysis shows that compared to oocyst sporozoites, salivary gland sporozoites upregulate expression of at least 124 unique genes. Conversely, oocyst sporozoites show upregulation of at least 47 genes (upregulated in oocyst sporozoites [UOS genes]) before they infect the salivary glands. Targeted gene deletion of UOS3, encoding a putative transmembrane protein with a thrombospondin repeat that localizes to the sporozoite secretory organelles, rendered oocyst sporozoites unable to infect the mosquito salivary glands but maintained the parasites' liver infection competence. This phenotype demonstrates the significance of differential UOS expression. Thus, the UIS-UOS gene classification provides a framework to elucidate the infectivity and transmission success of Plasmodium sporozoites on a whole-genome scale. Genes identified herein might represent targets for vector-based transmission blocking strategies (UOS genes), as well as strategies that prevent mammalian host infection (UIS genes).Molecular and cellular biology 09/2008; 28(20):6196-207. · 6.06 Impact Factor -
Article: Assessment and improvement of the Plasmodium yoelii yoelii genome annotation through comparative analysis.
[show abstract] [hide abstract]
ABSTRACT: The sequencing of the Plasmodium yoelii genome, a model rodent malaria parasite, has greatly facilitated research for the development of new drug and vaccine candidates against malaria. Unfortunately, only preliminary gene models were annotated on the partially sequenced genome, mostly by in silico gene prediction, and there has been no major improvement of the annotation since 2002. Here we report on a systematic assessment of the accuracy of the genome annotation based on a detailed analysis of a comprehensive set of cDNA sequences and proteomics data. We found that the coverage of the current annotation tends to be biased toward genes expressed in the blood stages of the parasite life cycle. Based on our proteomic analysis, we estimate that about 15% of the liver stage proteome data we have generated is absent from the current annotation. Through comparative analysis we identified and manually curated a further 510 P. yoelii genes which have clear orthologs in the P. falciparum genome, but were not present or incorrectly annotated in the current annotation. This study suggests that improvements of the current P. yoelii genome annotation should focus on genes expressed in stages other than blood stages. Comparative analysis will be critically helpful for this re-annotation. The addition of newly annotated genes will facilitate the use of P. yoelii as a model system for studying human malaria. Supplementary data are available at Bioinformatics online.Bioinformatics 08/2008; 24(13):i383-9. · 5.47 Impact Factor -
Article: A combined transcriptome and proteome survey of malaria parasite liver stages.
[show abstract] [hide abstract]
ABSTRACT: For 50 years since their discovery, the malaria parasite liver stages (LS) have been difficult to analyze, impeding their utilization as a critical target for antiinfection vaccines and drugs. We have undertaken a comprehensive transcriptome analysis in combination with a proteomic survey of LS. Green fluorescent protein-tagged Plasmodium yoelii (PyGFP) was used to efficiently isolate LS-infected hepatocytes from the rodent host. Genome-wide LS gene expression was profiled and compared with other parasite life cycle stages. The analysis revealed approximately 2,000 genes active during LS development, and proteomic analysis identified 816 proteins. A subset of proteins appeared to be expressed in LS only. The data revealed exported parasite proteins and LS metabolic pathways including expression of FASII pathway enzymes. The FASII inhibitor hexachlorophene and the antibiotics, tetracycline and rifampicin, that target the apicoplast inhibited LS development, identifying FASII and other pathways localized in the apicoplast as potential drug targets to prevent malaria infection.Proceedings of the National Academy of Sciences 02/2008; 105(1):305-10. · 9.68 Impact Factor -
Conference Proceeding: Assessment and improvement of the
Proceedings 16th International Conference on Intelligent Systems for Molecular Biology (ISMB), Toronto, Canada, July 19-23, 2008; 01/2008
Top Journals
Institutions
-
2012
-
Fred Hutchinson Cancer Research Center
Seattle, WA, USA
-
-
2009–2012
-
University of Washington Seattle
- • Department of Computer Science and Engineering
- • Department of Microbiology
Seattle, WA, USA
-
-
2008–2011
-
Seattle Institute for Biomedical and Clinical Research
Seattle, WA, USA
-