-
Matthew B Rogers,
James D Hilley,
Nicholas J Dickens,
Jon Wilkes,
Paul A Bates,
Daniel P Depledge,
David Harris,
Yerim Her,
Pawel Herzyk,
Hideo Imamura,
Thomas D Otto,
Mandy Sanders,
Kathy Seeger,
Jean-Claude Dujardin,
Matthew Berriman,
Deborah F Smith,
Christiane Hertz-Fowler,
Jeremy C Mottram
[show abstract]
[hide abstract]
ABSTRACT: Leishmania parasites cause a spectrum of clinical pathology in humans ranging from disfiguring cutaneous lesions to fatal visceral leishmaniasis. We have generated a reference genome for Leishmania mexicana and refined the reference genomes for Leishmania major, Leishmania infantum, and Leishmania braziliensis. This has allowed the identification of a remarkably low number of genes or paralog groups (2, 14, 19, and 67, respectively) unique to one species. These were found to be conserved in additional isolates of the same species. We have predicted allelic variation and find that in these isolates, L. major and L. infantum have a surprisingly low number of predicted heterozygous SNPs compared with L. braziliensis and L. mexicana. We used short read coverage to infer ploidy and gene copy numbers, identifying large copy number variations between species, with 200 tandem gene arrays in L. major and 132 in L. mexicana. Chromosome copy number also varied significantly between species, with nine supernumerary chromosomes in L. infantum, four in L. mexicana, two in L. braziliensis, and one in L. major. A significant bias against gene arrays on supernumerary chromosomes was shown to exist, indicating that duplication events occur more frequently on disomic chromosomes. Taken together, our data demonstrate that there is little variation in unique gene content across Leishmania species, but large-scale genetic heterogeneity can result through gene amplification on disomic chromosomes and variation in chromosome number. Increased gene copy number due to chromosome amplification may contribute to alterations in gene expression in response to environmental conditions in the host, providing a genetic basis for disease tropism.
Genome Research 12/2011; 21(12):2129-42. · 13.61 Impact Factor
-
Flora J Logan-Klumpler,
Nishadi De Silva,
Ulrike Boehme, Matthew B Rogers,
Giles Velarde,
Jacqueline A McQuillan,
Tim Carver,
Martin Aslett,
Christian Olsen,
Sandhya Subramanian, [......],
Julian Parkhill,
Matthew Holden,
Omar S Harb,
Brian P Brunk,
Peter J Myler,
David Roos,
Mark Carrington,
Deborah F Smith,
Christiane Hertz-Fowler,
Matthew Berriman
[show abstract]
[hide abstract]
ABSTRACT: GeneDB (http://www.genedb.org) is a genome database for prokaryotic and eukaryotic pathogens and closely related organisms. The resource provides a portal to genome sequence and annotation data, which is primarily generated by the Pathogen Genomics group at the Wellcome Trust Sanger Institute. It combines data from completed and ongoing genome projects with curated annotation, which is readily accessible from a web based resource. The development of the database in recent years has focused on providing database-driven annotation tools and pipelines, as well as catering for increasingly frequent assembly updates. The website has been significantly redesigned to take advantage of current web technologies, and improve usability. The current release stores 41 data sets, of which 17 are manually curated and maintained by biologists, who review and incorporate data from the scientific literature, as well as other sources. GeneDB is primarily a production and annotation database for the genomes of predominantly pathogenic organisms.
Nucleic Acids Research 11/2011; 40(Database issue):D98-108. · 8.03 Impact Factor
-
Martin Aslett,
Cristina Aurrecoechea,
Matthew Berriman,
John Brestelli,
Brian P. Brunk,
Mark Carrington,
Daniel P. Depledge,
Steve Fischer,
Bindu Gajria,
Xin Gao, [......],
Dhileep Sivam,
Deborah F. Smith,
Ganesh Srinivasamoorthy,
Christian J. Stoeckert Jr,
Sandhya Subramanian,
Ryan Thibodeau,
Adrian Tivey,
Charles Treatman,
Giles Velarde,
Haiming Wang
Nucleic Acids Research. 01/2010; 38:457-462.
-
Martin Aslett,
Cristina Aurrecoechea,
Matthew Berriman,
John Brestelli,
Brian P Brunk,
Mark Carrington,
Daniel P Depledge,
Steve Fischer,
Bindu Gajria,
Xin Gao, [......],
Dhileep Sivam,
Deborah F Smith,
Ganesh Srinivasamoorthy,
Christian J Stoeckert,
Sandhya Subramanian,
Ryan Thibodeau,
Adrian Tivey,
Charles Treatman,
Giles Velarde,
Haiming Wang
[show abstract]
[hide abstract]
ABSTRACT: TriTrypDB (http://tritrypdb.org) is an integrated database providing access to genome-scale datasets for kinetoplastid parasites, and supporting a variety of complex queries driven by research and development needs. TriTrypDB is a collaborative project, utilizing the GUS/WDK computational infrastructure developed by the Eukaryotic Pathogen Bioinformatics Resource Center (EuPathDB.org) to integrate genome annotation and analyses from GeneDB and elsewhere with a wide variety of functional genomics datasets made available by members of the global research community, often pre-publication. Currently, TriTrypDB integrates datasets from Leishmania braziliensis, L. infantum, L. major, L. tarentolae, Trypanosoma brucei and T. cruzi. Users may examine individual genes or chromosomal spans in their genomic context, including syntenic alignments with other kinetoplastid organisms. Data within TriTrypDB can be interrogated utilizing a sophisticated search strategy system that enables a user to construct complex queries combining multiple data types. All search strategies are stored, allowing future access and integrated searches. 'User Comments' may be added to any gene page, enhancing available annotation; such comments become immediately searchable via the text search, and are forwarded to curators for incorporation into the reference annotation when appropriate.
Nucleic Acids Research 10/2009; 38(Database issue):D457-62. · 8.03 Impact Factor