Publications (61) View all
-
Article: The Genomes of the Fungal Plant Pathogens Cladosporium fulvum and Dothistroma septosporum Reveal Adaptation to Different Hosts and Lifestyles But Also Signatures of Common Ancestry.
Pierre J G M de Wit, Ate van der Burgt, Bilal Okmen, Ioannis Stergiopoulos, Kamel A Abd-Elsalam, Andrea L Aerts, Ali H Bahkali, Henriek G Beenen, Pranav Chettri, Murray P Cox, [......], Arne Schwelm, Elio Schijlen, Hui Sun, Harrold A van den Burg, Roeland C H J van Ham, Shuguang Zhang, Stephen B Goodwin, Igor V Grigoriev, Jérôme Collemare, Rosie E Bradshaw[show abstract] [hide abstract]
ABSTRACT: We sequenced and compared the genomes of the Dothideomycete fungal plant pathogens Cladosporium fulvum (Cfu) (syn. Passalora fulva) and Dothistroma septosporum (Dse) that are closely related phylogenetically, but have different lifestyles and hosts. Although both fungi grow extracellularly in close contact with host mesophyll cells, Cfu is a biotroph infecting tomato, while Dse is a hemibiotroph infecting pine. The genomes of these fungi have a similar set of genes (70% of gene content in both genomes are homologs), but differ significantly in size (Cfu >61.1-Mb; Dse 31.2-Mb), which is mainly due to the difference in repeat content (47.2% in Cfu versus 3.2% in Dse). Recent adaptation to different lifestyles and hosts is suggested by diverged sets of genes. Cfu contains an α-tomatinase gene that we predict might be required for detoxification of tomatine, while this gene is absent in Dse. Many genes encoding secreted proteins are unique to each species and the repeat-rich areas in Cfu are enriched for these species-specific genes. In contrast, conserved genes suggest common host ancestry. Homologs of Cfu effector genes, including Ecp2 and Avr4, are present in Dse and induce a Cf-Ecp2- and Cf-4-mediated hypersensitive response, respectively. Strikingly, genes involved in production of the toxin dothistromin, a likely virulence factor for Dse, are conserved in Cfu, but their expression differs markedly with essentially no expression by Cfu in planta. Likewise, Cfu has a carbohydrate-degrading enzyme catalog that is more similar to that of necrotrophs or hemibiotrophs and a larger pectinolytic gene arsenal than Dse, but many of these genes are not expressed in planta or are pseudogenized. Overall, comparison of their genomes suggests that these closely related plant pathogens had a common ancestral host but since adapted to different hosts and lifestyles by a combination of differentiated gene content, pseudogenization, and gene regulation.PLoS Genetics 11/2012; 8(11):e1003088. · 8.69 Impact Factor -
Article: Sequencing the Potato Genome: Outline and First Results to Come from the Elucidation of the Sequence of the World’s Third Most Important Food Crop
Richard G. F. Visser, Christian W. B. Bachem, Jan M. de Boer, Glenn J. Bryan, Swarup K. Chakrabati, Sergio Feingold, Robert Gromadka, Roeland C. H. J. van Ham, Sanwen Huang, Jeanne M. E. Jacobs, Boris Kuznetsov, Paulo E. de Melo, Dan Milbourne, Gisella Orjeda, Boris Sagredo, Xiaomin Tang[show abstract] [hide abstract]
ABSTRACT: Potato is a member of the Solanaceae, a plant family that includes several other economically important species, such as tomato, eggplant, petunia, tobacco and pepper. The Potato Genome Sequencing Consortium (PGSC) aims to elucidate the complete genome sequence of potato, the third most important food crop in the world. The PGSC is a collaboration between 13 research groups from China, India, Poland, Russia, the Netherlands, Ireland, Argentina, Brazil, Chile, Peru, USA, New Zealand and the UK. The potato genome consists of 12 chromosomes and has a (haploid) length of approximately 840 million base pairs, making it a medium-sized plant genome. The sequencing project builds on a diploid potato genomic bacterial artificial chromosome (BAC) clone library of 78000 clones, which has been fingerprinted and aligned into ~7000 physical map contigs. In addition, the BAC-ends have been sequenced and are publicly available. Approximately 30000 BACs are anchored to the Ultra High Density genetic map of potato, composed of 10000 unique AFLPTM markers. From this integrated genetic-physical map, between 50 to 150 seed BACs have currently been identified for every chromosome. Fluorescent in situ hybridization experiments on selected BAC clones confirm these anchor points. The seed clones provide the starting point for a BAC-by-BAC sequencing strategy. This strategy is being complemented by whole genome shotgun sequencing approaches using both 454 GS FLX and Illumina GA2 instruments. Assembly and annotation of the sequence data will be performed using publicly available and tailor-made tools. The availability of the annotated data will help to characterize germplasm collections based on allelic variance and to assist potato breeders to more fully exploit the genetic potential of potato. La papa es un miembro de las Solanaceae, una familia de plantas que incluye varias otras especies económicamente importantes, tales como tomate, berenjena, petunia, tabaco y ají o chili. El consorcio de secuenciación del genoma de la papa (PGSC) tiene por objeto dilucidar la secuencia completa del genoma de la papa, el tercer cultivo alimentario más importante del mundo. El PGSC es una colaboración entre 13 grupos de investigación procedentes de China, India, Polonia, Rusia, los Países Bajos, Irlanda, Argentina, Brasil, Chile, Perú, EE.UU., Nueva Zelanda y el Reino Unido. El genoma de la papa consiste de 12 cromosomas y tiene una longitud (haploide) de aproximadamente 840 millones de pares de bases, por lo que es una planta con un genoma de tamaño mediano. El proyecto de secuenciación se basa en una biblioteca de 78000 clones de cromosoma artificial bacteriano genomico de papa diploide (BAC), del que se ha obtenido la huella genética y alineado en 7000 ~ contigs de mapa físico. Además, los extremos terminales BAC se han secuenciado y están a disposición del público. Aproximadamente 30000 BACS están anclados al mapa genético de ultra alta densidad de la papa, compuesto de 10000 marcadores AFLPTM únicos. De esta mapa genético-físico integrado, entre 50 a 150 semillas BACs han sido identificadas para cada cromosoma. Experimentos de hibridación in situ fluorescente en clones BAC selectos confirman estos puntos de anclaje. La clones semilla proveen el punto de partida para la estrategia de secuenciación de BAC a BAC. Esta estrategia se complementa con los enfoques de secuenciación escopeta del genoma completo usando los instrumentos 454 GS FLX e Illumina GA2. El ensamblaje y anotación de los datos de la secuencia será realizados mediante herramientas publicas disponibles y hechas a la medida. La disponibilidad de los datos anotados ayudarán a caracterizar las colecciones de germoplasma basándose en variación alélica y ayudará a los fitomejoradores de papa a explotar más plenamente el potencial genético de la papa.American Journal of Potato Research 04/2012; 86(6):417-429. · 1.23 Impact Factor -
SourceAvailable from: Aalt - Jan Van Dijk
Article: Predicting the impact of alternative splicing on plant MADS domain protein function.
Edouard I Severing, Aalt D J van Dijk, Giuseppa Morabito, Jacqueline Busscher-Lange, Richard G H Immink, Roeland C H J van Ham[show abstract] [hide abstract]
ABSTRACT: Several genome-wide studies demonstrated that alternative splicing (AS) significantly increases the transcriptome complexity in plants. However, the impact of AS on the functional diversity of proteins is difficult to assess using genome-wide approaches. The availability of detailed sequence annotations for specific genes and gene families allows for a more detailed assessment of the potential effect of AS on their function. One example is the plant MADS-domain transcription factor family, members of which interact to form protein complexes that function in transcription regulation. Here, we perform an in silico analysis of the potential impact of AS on the protein-protein interaction capabilities of MIKC-type MADS-domain proteins. We first confirmed the expression of transcript isoforms resulting from predicted AS events. Expressed transcript isoforms were considered functional if they were likely to be translated and if their corresponding AS events either had an effect on predicted dimerisation motifs or occurred in regions known to be involved in multimeric complex formation, or otherwise, if their effect was conserved in different species. Nine out of twelve MIKC MADS-box genes predicted to produce multiple protein isoforms harbored putative functional AS events according to those criteria. AS events with conserved effects were only found at the borders of or within the K-box domain. We illustrate how AS can contribute to the evolution of interaction networks through an example of selective inclusion of a recently evolved interaction motif in the MADS AFFECTING FLOWERING1-3 (MAF1-3) subclade. Furthermore, we demonstrate the potential effect of an AS event in SHORT VEGETATIVE PHASE (SVP), resulting in the deletion of a short sequence stretch including a predicted interaction motif, by overexpression of the fully spliced and the alternatively spliced SVP transcripts. For most of the AS events we were able to formulate hypotheses about the potential impact on the interaction capabilities of the encoded MIKC proteins.PLoS ONE 01/2012; 7(1):e30524. · 4.09 Impact Factor -
Article: Mutational robustness of gene regulatory networks.
Aalt D J van Dijk, Simon van Mourik, Roeland C H J van Ham[show abstract] [hide abstract]
ABSTRACT: Mutational robustness of gene regulatory networks refers to their ability to generate constant biological output upon mutations that change network structure. Such networks contain regulatory interactions (transcription factor-target gene interactions) but often also protein-protein interactions between transcription factors. Using computational modeling, we study factors that influence robustness and we infer several network properties governing it. These include the type of mutation, i.e. whether a regulatory interaction or a protein-protein interaction is mutated, and in the case of mutation of a regulatory interaction, the sign of the interaction (activating vs. repressive). In addition, we analyze the effect of combinations of mutations and we compare networks containing monomeric with those containing dimeric transcription factors. Our results are consistent with available data on biological networks, for example based on evolutionary conservation of network features. As a novel and remarkable property, we predict that networks are more robust against mutations in monomer than in dimer transcription factors, a prediction for which analysis of conservation of DNA binding residues in monomeric vs. dimeric transcription factors provides indirect evidence.PLoS ONE 01/2012; 7(1):e30591. · 4.09 Impact Factor -
SourceAvailable from: Roeland van Ham
Article: Correlated mutations via regularized multinomial regression.
[show abstract] [hide abstract]
ABSTRACT: In addition to sequence conservation, protein multiple sequence alignments contain evolutionary signal in the form of correlated variation among amino acid positions. This signal indicates positions in the sequence that influence each other, and can be applied for the prediction of intra- or intermolecular contacts. Although various approaches exist for the detection of such correlated mutations, in general these methods utilize only pairwise correlations. Hence, they tend to conflate direct and indirect dependencies. We propose RMRCM, a method for Regularized Multinomial Regression in order to obtain Correlated Mutations from protein multiple sequence alignments. Importantly, our method is not restricted to pairwise (column-column) comparisons only, but takes into account the network nature of relationships between protein residues in order to predict residue-residue contacts. The use of regularization ensures that the number of predicted links between columns in the multiple sequence alignment remains limited, preventing overprediction. Using simulated datasets we analyzed the performance of our approach in predicting residue-residue contacts, and studied how it is influenced by various types of noise. For various biological datasets, validation with protein structure data indicates a good performance of the proposed algorithm for the prediction of residue-residue contacts, in comparison to previous results. RMRCM can also be applied to predict interactions (in addition to only predicting interaction sites or contact sites), as demonstrated by predicting PDZ-peptide interactions. A novel method is presented, which uses regularized multinomial regression in order to obtain correlated mutations from protein multiple sequence alignments. R-code of our implementation is available via http://www.ab.wur.nl/rmrcm.BMC Bioinformatics 11/2011; 12:444. · 2.75 Impact Factor