Publications (11)40.77 Total impact
-
Article: Comparative genomics of early-diverging Brucella strains reveals a novel lipopolysaccharide biosynthesis pathway.
[show abstract] [hide abstract]
ABSTRACT: Brucella species are Gram-negative bacteria that infect mammals. Recently, two unusual strains (Brucella inopinata BO1(T) and B. inopinata-like BO2) have been isolated from human patients, and their similarity to some atypical brucellae isolated from Australian native rodent species was noted. Here we present a phylogenomic analysis of the draft genome sequences of BO1(T) and BO2 and of the Australian rodent strains 83-13 and NF2653 that shows that they form two groups well separated from the other sequenced Brucella spp. Several important differences were noted. Both BO1(T) and BO2 did not agglutinate significantly when live or inactivated cells were exposed to monospecific A and M antisera against O-side chain sugars composed of N-formyl-perosamine. While BO1(T) maintained the genes required to synthesize a typical Brucella O-antigen, BO2 lacked many of these genes but still produced a smooth LPS (lipopolysaccharide). Most missing genes were found in the wbk region involved in O-antigen synthesis in classic smooth Brucella spp. In their place, BO2 carries four genes that other bacteria use for making a rhamnose-based O-antigen. Electrophoretic, immunoblot, and chemical analyses showed that BO2 carries an antigenically different O-antigen made of repeating hexose-rich oligosaccharide units that made the LPS water-soluble, which contrasts with the homopolymeric O-antigen of other smooth brucellae that have a phenol-soluble LPS. The results demonstrate the existence of a group of early-diverging brucellae with traits that depart significantly from those of the Brucella species described thus far. IMPORTANCE: This report examines differences between genomes from four new Brucella strains and those from the classic Brucella spp. Our results show that the four new strains are outliers with respect to the previously known Brucella strains and yet are part of the genus, forming two new clades. The analysis revealed important information about the evolution and survival mechanisms of Brucella species, helping reshape our knowledge of this important zoonotic pathogen. One discovery of special importance is that one of the strains, BO2, produces an O-antigen distinct from any that has been seen in any other Brucella isolates to date.mBio 01/2012; 3(5):e00246-12. · 5.31 Impact Factor -
Article: A Rickettsia genome overrun by mobile genetic elements provides insight into the acquisition of genes characteristic of an obligate intracellular lifestyle.
[show abstract] [hide abstract]
ABSTRACT: We present the draft genome for the Rickettsia endosymbiont of Ixodes scapularis (REIS), a symbiont of the deer tick vector of Lyme disease in North America. Among Rickettsia species (Alphaproteobacteria: Rickettsiales), REIS has the largest genome sequenced to date (>2 Mb) and contains 2,309 genes across the chromosome and four plasmids (pREIS1 to pREIS4). The most remarkable finding within the REIS genome is the extraordinary proliferation of mobile genetic elements (MGEs), which contributes to a limited synteny with other Rickettsia genomes. In particular, an integrative conjugative element named RAGE (for Rickettsiales amplified genetic element), previously identified in scrub typhus rickettsiae (Orientia tsutsugamushi) genomes, is present on both the REIS chromosome and plasmids. Unlike the pseudogene-laden RAGEs of O. tsutsugamushi, REIS encodes nine conserved RAGEs that include F-like type IV secretion systems similar to that of the tra genes encoded in the Rickettsia bellii and R. massiliae genomes. An unparalleled abundance of encoded transposases (>650) relative to genome size, together with the RAGEs and other MGEs, comprise ~35% of the total genome, making REIS one of the most plastic and repetitive bacterial genomes sequenced to date. We present evidence that conserved rickettsial genes associated with an intracellular lifestyle were acquired via MGEs, especially the RAGE, through a continuum of genomic invasions. Robust phylogeny estimation suggests REIS is ancestral to the virulent spotted fever group of rickettsiae. As REIS is not known to invade vertebrate cells and has no known pathogenic effects on I. scapularis, its genome sequence provides insight on the origin of mechanisms of rickettsial pathogenicity.Journal of bacteriology 11/2011; 194(2):376-94. · 3.94 Impact Factor -
Article: Integration and visualization of host-pathogen data related to infectious diseases.
[show abstract] [hide abstract]
ABSTRACT: Infectious disease research is generating an increasing amount of disparate data on pathogenic systems. There is a growing need for resources that effectively integrate, analyze, deliver and visualize these data, both to improve our understanding of infectious diseases and to facilitate the development of strategies for disease control and prevention. We have developed Disease View, an online host-pathogen resource that enables infectious disease-centric access, analysis and visualization of host-pathogen interactions. In this resource, we associate infectious diseases with corresponding pathogens, provide information on pathogens, pathogen virulence genes and the genetic and chemical evidences for the human genes that are associated with the diseases. We also deliver the relationships between pathogens, genes and diseases in an interactive graph and provide the geolocation reports of associated diseases around the globe in real time. Unlike many other resources, we have applied an iterative, user-centered design process to the entire resource development, including data acquisition, analysis and visualization. Freely available at http://www.patricbrc.org; all major web browsers supported. cmao@vbi.vt.edu Supplementary data are available at Bioinformatics online.Bioinformatics 06/2011; 27(16):2279-87. · 5.47 Impact Factor -
Article: Data integration for dynamic and sustainable systems biology resources: challenges and lessons learned.
[show abstract] [hide abstract]
ABSTRACT: Systems-biology and infectious-disease (host-pathogen-environment) research and development is becoming increasingly dependent on integrating data from diverse and dynamic sources. Maintaining integrated resources over long periods of time presents distinct challenges. This review describes experiences and lessons learned from integrating data in two five-year projects focused on pathosystems biology: the Pathosystems Resource Integration Center (PATRIC, http://patric.vbi.vt.edu/), with a goal of developing bioinformatics resources for the research and countermeasures-development communities based on genomics data, and the Resource Center for Biodefense Proteomics Research (RCBPR, http://www.proteomicsresource.org/), with a goal of developing resources based on the experiment data such as microarray and proteomics data from diverse sources and technologies. Some challenges include integrating genomic sequence and experiment data, data synchronization, data quality control, and usability engineering. We present examples of a variety of data-integration problems drawn from our experiences with PATRIC and RBPRC, as well as open research questions related to long-term sustainability, and describe the next steps to meeting these challenges. Novel contributions of this work include 1) an approach for addressing discrepancies between experiment results and interpreted results, and 2) expanding the range of data-integration techniques to include usability engineering at the presentation level.Chemistry & Biodiversity 05/2010; 7(5):1124-41. · 1.80 Impact Factor -
Article: Analysis of ten Brucella genomes reveals evidence for horizontal gene transfer despite a preferred intracellular lifestyle.
[show abstract] [hide abstract]
ABSTRACT: The facultative intracellular bacterial pathogen Brucella infects a wide range of warm-blooded land and marine vertebrates and causes brucellosis. Currently, there are nine recognized Brucella species based on host preferences and phenotypic differences. The availability of 10 different genomes consisting of two chromosomes and representing six of the species allowed for a detailed comparison among themselves and relatives in the order Rhizobiales. Phylogenomic analysis of ortholog families shows limited divergence but distinct radiations, producing four clades as follows: Brucella abortus-Brucella melitensis, Brucella suis-Brucella canis, Brucella ovis, and Brucella ceti. In addition, Brucella phylogeny does not appear to reflect the phylogeny of Brucella species' preferred hosts. About 4.6% of protein-coding genes seem to be pseudogenes, which is a relatively large fraction. Only B. suis 1330 appears to have an intact beta-ketoadipate pathway, responsible for utilization of plant-derived compounds. In contrast, this pathway in the other species is highly pseudogenized and consistent with the "domino theory" of gene death. There are distinct shared anomalous regions (SARs) found in both chromosomes as the result of horizontal gene transfer unique to Brucella and not shared with its closest relative Ochrobactrum, a soil bacterium, suggesting their acquisition occurred in spite of a predominantly intracellular lifestyle. In particular, SAR 2-5 appears to have been acquired by Brucella after it became intracellular. The SARs contain many genes, including those involved in O-polysaccharide synthesis and type IV secretion, which if mutated or absent significantly affect the ability of Brucella to survive intracellularly in the infected host.Journal of bacteriology 05/2009; 191(11):3569-79. · 3.94 Impact Factor -
Article: Atomic level computational identification of ligand migration pathways between solvent and binding site in myoglobin.
[show abstract] [hide abstract]
ABSTRACT: Myoglobin is a globular protein involved in oxygen storage and transport. No consensus yet exists on the atomic level mechanism by which oxygen and other small nonpolar ligands move between the myoglobin's buried heme, which is the ligand binding site, and surrounding solvent. This study uses room temperature molecular dynamics simulations to provide a complete atomic level picture of ligand migration in myoglobin. Multiple trajectories--providing a cumulative total of 7 micros of simulation--are analyzed. Our simulation results are consistent with and tie together previous experimental findings. Specifically, we characterize: (i) Explicit full trajectories in which the CO ligand shuttles between the internal binding site and the solvent and (ii) pattern and structural origins of transient voids available for ligand migration. The computations are performed both in sperm whale myoglobin wild-type and in sperm whale V68F myoglobin mutant, which is experimentally known to slow ligand-binding kinetics. On the basis of these independent, but mutually consistent ligand migration and transient void computations, we find that there are two discrete dynamical pathways for ligand migration in myoglobin. Trajectory hops between these pathways are limited to two bottleneck regions. Ligand enters and exits the protein matrix in common identifiable portals on the protein surface. The pathways are located in the "softer" regions of the protein matrix and go between its helices and in its loop regions. Localized structural fluctuations are the primary physical origin of the simulated CO migration pathways inside the protein.Proceedings of the National Academy of Sciences 08/2008; 105(27):9204-9. · 9.68 Impact Factor -
Article: Rickettsia phylogenomics: unwinding the intricacies of obligate intracellular life.
[show abstract] [hide abstract]
ABSTRACT: Completed genome sequences are rapidly increasing for Rickettsia, obligate intracellular alpha-proteobacteria responsible for various human diseases, including epidemic typhus and Rocky Mountain spotted fever. In light of phylogeny, the establishment of orthologous groups (OGs) of open reading frames (ORFs) will distinguish the core rickettsial genes and other group specific genes (class 1 OGs or C1OGs) from those distributed indiscriminately throughout the rickettsial tree (class 2 OG or C2OGs). We present 1823 representative (no gene duplications) and 259 non-representative (at least one gene duplication) rickettsial OGs. While the highly reductive (approximately 1.2 MB) Rickettsia genomes range in predicted ORFs from 872 to 1512, a core of 752 OGs was identified, depicting the essential Rickettsia genes. Unsurprisingly, this core lacks many metabolic genes, reflecting the dependence on host resources for growth and survival. Additionally, we bolster our recent reclassification of Rickettsia by identifying OGs that define the AG (ancestral group), TG (typhus group), TRG (transitional group), and SFG (spotted fever group) rickettsiae. OGs for insect-associated species, tick-associated species and species that harbor plasmids were also predicted. Through superimposition of all OGs over robust phylogeny estimation, we discern between C1OGs and C2OGs, the latter depicting genes either decaying from the conserved C1OGs or acquired laterally. Finally, scrutiny of non-representative OGs revealed high levels of split genes versus gene duplications, with both phenomena confounding gene orthology assignment. Interestingly, non-representative OGs, as well as OGs comprised of several gene families typically involved in microbial pathogenicity and/or the acquisition of virulence factors, fall predominantly within C2OG distributions. Collectively, we determined the relative conservation and distribution of 14354 predicted ORFs from 10 rickettsial genomes across robust phylogeny estimation. The data, available at PATRIC (PathoSystems Resource Integration Center), provide novel information for unwinding the intricacies associated with Rickettsia pathogenesis, expanding the range of potential diagnostic, vaccine and therapeutic targets.PLoS ONE 02/2008; 3(4):e2018. · 4.09 Impact Factor -
Article: Photosynthetic acclimation is reflected in specific patterns of gene expression in drought-stressed loblolly pine.
[show abstract] [hide abstract]
ABSTRACT: Because the product of a single gene can influence many aspects of plant growth and development, it is necessary to understand how gene products act in concert and upon each other to effect adaptive changes to stressful conditions. We conducted experiments to improve our understanding of the responses of loblolly pine (Pinus taeda) to drought stress. Water was withheld from rooted plantlets of to a measured water potential of -1 MPa for mild stress and -1.5 MPa for severe stress. Net photosynthesis was measured for each level of stress. RNA was isolated from needles and used in hybridizations against a microarray consisting of 2173 cDNA clones from five pine expressed sequence tag libraries. Gene expression was estimated using a two-stage mixed linear model. Subsequently, data mining via inductive logic programming identified rules (relationships) among gene expression, treatments, and functional categories. Changes in RNA transcript profiles of loblolly pine due to drought stress were correlated with physiological data reflecting photosynthetic acclimation to mild stress or photosynthetic failure during severe stress. Analysis of transcript profiles indicated that there are distinct patterns of expression related to the two levels of stress. Genes encoding heat shock proteins, late embryogenic-abundant proteins, enzymes from the aromatic acid and flavonoid biosynthetic pathways, and from carbon metabolism showed distinctive responses associated with acclimation. Five genes shown to have different transcript levels in response to either mild or severe stress were chosen for further analysis using real-time polymerase chain reaction. The real-time polymerase chain reaction results were in good agreement with those obtained on microarrays.Plant physiology 01/2004; 133(4):1702-16. · 6.53 Impact Factor -
Article: Expresso and Chips: Creating a Next Generation
[show abstract] [hide abstract]
ABSTRACT: Expresso is an experiment management system that is designed to assist biologists in planning, executing, and interpreting microarray experiments. It serves as a unifying framework to study data-driven application composition systems, as envisaged under the NSF Next Generation Software (NGS) program. Physical and analytical stages of the microarray process are mirrored in Expresso with computational models from biophysics, molecular biology, biochemistry, robotics, image processing, statistics, and knowledge representation. These models are pushed deeper (earlier) into the design process to help avoid costly design errors and to provide, as needed, surrogate functions for the traditional stages of microarray experiments. In this paper, we describe ongoing work in the design of Expresso, with specific reference to application composition, application optimization, experiment protocol design, and `closing the loop.' 1.02/2003; -
Conference Proceeding: Expresso and Chips: Creating a Next Generation Microarray Experiment Management System.
17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 22-26 April 2003, Nice, France, CD-ROM/Abstracts Proceedings; 01/2003 -
Article: Expresso and Chips: Creating a Next Generation Microarray Experiment Management System (PDF)
[show abstract] [hide abstract]
ABSTRACT: Expresso is an experiment management system that is designed to assist biologists in planning, executing, and interpreting microarray experiments. It serves as a unifying framework to study data-driven application composition systems, as envisaged under the NSF Next Generation Software (NGS) program. Physical and analytical stages of the microarray process are mirrored in Expresso with computational models from biophysics, molecular biology, biochemistry, robotics, image processing, statistics, and knowledge representation. These models are pushed deeper (earlier) into the design process to help avoid costly design errors and to provide, as needed, surrogate functions for the traditional stages of microarray experiments. In this paper, we describe ongoing work in the design of Expresso, with specific reference to application composition, application optimization, experiment protocol design, and ?closing the loop.?Parallel and Distributed Processing Symposium, International.
Top Journals
Institutions
-
2011
-
University of Maryland-School of Medicine
Baltimore, MD, USA
-