Chris Duran

University of Melbourne, Melbourne, Victoria, Australia

Are you Chris Duran?

Claim your profile

Publications (14)51.64 Total impact

  • Article: Predicting polymorphic EST-SSRs in silico.
    [show abstract] [hide abstract]
    ABSTRACT: The public availability of large quantities of gene sequence data provides a valuable resource of the mining of Simple Sequence Repeat (SSR) molecular genetic markers for genetic analysis. These markers are inexpensive, require minimal labour to produce and can frequently be associated with functionally annotated genes. This study presents the characterization of barley EST-SSRs and the identification of putative polymorphic SSRs from EST data. Polymorphic SSRs are distinguished from monomorphic SSRs by the representation of varying motif lengths within an alignment of sequence reads. Two measures of confidence are calculated, redundancy of a polymorphism and co-segregation with accessions. The utility of this method is demonstrated through the discovery of 597 candidate polymorphic SSRs, from a total of 452 642 consensus expressed sequences. PCR amplification primers were designed for the identified SSRs. Ten primer pairs were validated for polymorphism in barley and for transferability across species. Analysis of the polymorphisms in relation to SSR motif, length, position and annotation is discussed.
    Molecular Ecology Resources 02/2013; · 3.06 Impact Factor
  • Source
    Article: Single nucleotide polymorphism discovery from wheat next-generation sequence data.
    [show abstract] [hide abstract]
    ABSTRACT: Single nucleotide polymorphisms (SNPs) are the most abundant type of molecular genetic marker and can be used for producing high-resolution genetic maps, marker-trait association studies and marker-assisted breeding. Large polyploid genomes such as wheat present a challenge for SNP discovery because of the potential presence of multiple homoeologs for each gene. AutoSNPdb has been successfully applied to identify SNPs from Sanger sequence data for several species, including barley, rice and Brassica, but the volume of data required to accurately call SNPs in the complex genome of wheat has prevented its application to this important crop. DNA sequencing technology has been revolutionized by the introduction of next-generation sequencing, and it is now possible to generate several million sequence reads in a timely and cost-effective manner. We have produced wheat transcriptome sequence data using 454 sequencing technology and applied this for SNP discovery using a modified autoSNPdb method, which integrates SNP and gene annotation information with a graphical viewer. A total of 4,694,141 sequence reads from three bread wheat varieties were assembled to identify a total of 38 928 candidate SNPs. Each SNP is within an assembly complete with annotation, enabling the selection of polymorphism within genes of interest.
    Plant Biotechnology Journal 07/2012; 10(6):743-9. · 5.44 Impact Factor
  • Article: WheatGenome.info: an integrated database and portal for wheat genome information.
    [show abstract] [hide abstract]
    ABSTRACT: Bread wheat (Triticum aestivum) is one of the most important crop plants, globally providing staple food for a large proportion of the human population. However, improvement of this crop has been limited due to its large and complex genome. Advances in genomics are supporting wheat crop improvement. We provide a variety of web-based systems hosting wheat genome and genomic data to support wheat research and crop improvement. WheatGenome.info is an integrated database resource which includes multiple web-based applications. These include a GBrowse2-based wheat genome viewer with BLAST search portal, TAGdb for searching wheat second-generation genome sequence data, wheat autoSNPdb, links to wheat genetic maps using CMap and CMap3D, and a wheat genome Wiki to allow interaction between diverse wheat genome sequencing activities. This system includes links to a variety of wheat genome resources hosted at other research organizations. This integrated database aims to accelerate wheat genome research and is freely accessible via the web interface at http://www.wheatgenome.info/.
    Plant and Cell Physiology 02/2012; 53(2):e2. · 4.70 Impact Factor
  • Article: Bioinformatics tools and databases for analysis of next-generation sequence data.
    [show abstract] [hide abstract]
    ABSTRACT: Genome sequencing has been revolutionized by next-generation technologies, which can rapidly produce vast quantities of data at relatively low cost. With data production now no longer being limited, there is a huge challenge to analyse the data flood and interpret biological meaning. Bioinformatics scientists have risen to the challenge and a large number of software tools and databases have been produced and these continue to evolve with this rapidly advancing field. Here, we outline some of the tools and databases commonly used for the analysis of next-generation sequence data with comment on their utility.
    Briefings in functional genomics 12/2011; 11(1):12-24. · 4.13 Impact Factor
  • Source
    Article: Sequencing and assembly of low copy and genic regions of isolated Triticum aestivum chromosome arm 7DS.
    [show abstract] [hide abstract]
    ABSTRACT: The genome of bread wheat (Triticum aestivum) is predicted to be greater than 16 Gbp in size and consist predominantly of repetitive elements, making the sequencing and assembly of this genome a major challenge. We have reduced genome sequence complexity by isolating chromosome arm 7DS and applied second-generation technology and appropriate algorithmic analysis to sequence and assemble low copy and genic regions of this chromosome arm. The assembly represents approximately 40% of the chromosome arm and all known 7DS genes. Comparison of the 7DS assembly with the sequenced genomes of rice (Oryza sativa) and Brachypodium distachyon identified large regions of conservation. The syntenic relationship between wheat, B. distachyon and O. sativa, along with available genetic mapping data, has been used to produce an annotated draft 7DS syntenic build, which is publicly available at http://www.wheatgenome.info. Our results suggest that the sequencing of isolated chromosome arms can provide valuable information of the gene content of wheat and is a step towards whole-genome sequencing and variation discovery in this important crop.
    Plant Biotechnology Journal 02/2011; 9(7):768-75. · 5.44 Impact Factor
  • Article: Future tools for association mapping in crop plants.
    [show abstract] [hide abstract]
    ABSTRACT: Association mapping currently relies on the identification of genetic markers. Several technologies have been adopted for genetic marker analysis, with single nucleotide polymorphisms (SNPs) being the most popular where a reasonable quantity of genome sequence data are available. We describe several tools we have developed for the discovery, annotation, and visualization of molecular markers for association mapping. These include autoSNPdb for SNP discovery from assembled sequence data; TAGdb for the identification of gene specific paired read Illumina GAII data; CMap3D for the comparison of mapped genetic and physical markers; and BAC and Gene Annotator for the online annotation of genes and genomic sequences.
    Genome 11/2010; 53(11):1017-23. · 1.65 Impact Factor
  • Source
    Article: Targeted identification of genomic regions using TAGdb.
    [show abstract] [hide abstract]
    ABSTRACT: The introduction of second generation sequencing technology has enabled the cost effective sequencing of genomes and the identification of large numbers of genes and gene promoters. However, the assembly of DNA sequences to create a representation of the complete genome sequence remains costly, especially for the larger and more complex plant genomes. We have developed an online database, TAGdb, that enables researchers to identify paired read sequences that share identity with a submitted query sequence. These tags can be used to design oligonucleotide primers for the PCR amplification of the region in the target genome. The ability to produce large numbers of paired read genome tags using second generation sequencing provides a cost effective method for the identification of genes and promoters in large, complex or orphan species without the need for whole genome assembly.
    Plant Methods 01/2010; 6:19. · 2.83 Impact Factor
  • Article: CMap3D: a 3D visualization tool for comparative genetic maps.
    [show abstract] [hide abstract]
    ABSTRACT: Genetic linkage mapping enables the study of genome organization and the association of heritable traits with regions of sequenced genomes. Comparative genetic mapping is particularly powerful as it allows translation of information between related genomes and gives an insight into genome evolution. A common tool for the storage, comparison and visualization of genetic maps is CMap. However, current visualization in CMap is limited to the comparison of adjacent aligned maps. To overcome this limitation, we have developed CMap3D, a tool to compare multiple genetic maps in three-dimensional space. CMap3D is based on a client/server model ensuring operability with current CMap data repositories. This tool can be applied to any species where genetic map information is available and enables rapid, direct comparison between multiple aligned maps. Availability and Implementation: The software is a stand-alone application written in Processing and Java. Binaries are available for Windows, OSX and Linux, and require Sun Microsystems Java Runtime Environment 1.6 or later. The software is freely available for non-commercial use from http://flora.acpfg.com.au/.
    Bioinformatics 11/2009; 26(2):273-4. · 5.47 Impact Factor
  • Article: Discovering genetic polymorphisms in next-generation sequencing data.
    [show abstract] [hide abstract]
    ABSTRACT: The ongoing revolution in DNA sequencing technology now enables the reading of thousands of millions of nucleotide bases in a single instrument run. However, this data quantity is often compromised by poor confidence in the read quality. The identification of genetic polymorphisms from this data is therefore problematic and, combined with the vast quantity of data, poses a major bioinformatics challenge. However, once these difficulties have been addressed, next-generation sequencing will offer a means to identify and characterize the wealth of genetic polymorphisms underlying the vast phenotypic variation in biological systems. We describe the recent advances in next-generation sequencing technology, together with preliminary approaches that can be applied for single nucleotide polymorphism discovery in plant species.
    Plant Biotechnology Journal 06/2009; 7(4):312-7. · 5.44 Impact Factor
  • Article: Single nucleotide polymorphism discovery in barley using autoSNPdb.
    [show abstract] [hide abstract]
    ABSTRACT: Molecular markers are used to provide the link between genotype and phenotype, for the production of molecular genetic maps and to assess genetic diversity within and between related species. Single nucleotide polymorphisms (SNPs) are the most abundant molecular genetic marker. SNPs can be identified in silico, but care must be taken to ensure that the identified SNPs reflect true genetic variation and are not a result of errors associated with DNA sequencing. The SNP detection method autoSNP has been developed to identify SNPs from sequence data for any species. Confidence in the predicted SNPs is based on sequence redundancy, and haplotype co-segregation scores are calculated for a further independent measure of confidence. We have extended the autoSNP method to produce autoSNPdb, which integrates SNP and gene annotation information with a graphical viewer. We have applied this software to public barley expressed sequences, and the resulting database is available over the Internet. SNPs can be viewed and searched by sequence, functional annotation or predicted synteny with a reference genome, in this case rice. The correlation between SNPs and barley cultivar, expressed tissue type and development stage has been collated for ease of exploration. An average of one SNP per 240 bp was identified, with SNPs more prevalent in the 5' regions and simple sequence repeat (SSR) flanking sequences. Overall, autoSNPdb can provide a wealth of genetic polymorphism information for any species for which sequence data are available.
    Plant Biotechnology Journal 06/2009; 7(4):326-33. · 5.44 Impact Factor
  • Chapter: Molecular Marker Discovery and Genetic Map Visualisation
    [show abstract] [hide abstract]
    ABSTRACT: The bulk of variation at the nucleotide level is often not visible at the phenotypic level. However, this variation can be exploited using molecular genetic marker systems. Molecular genetic markers represent one of the most powerful tools for genome analysis and permit the association of heritable traits with underlying genomic variation. Molecular marker technology has developed rapidly over the last decade, with the development of high-throughput genotyping methods and the availability of large amounts of sequence data for automated marker discovery. Two forms of sequence based marker, Simple Sequence Repeats (SSRs), also known as microsatellites, and Single Nucleotide Polymorphisms (SNPs) are the principal markers currently applied in modern genetic analysis. This are supplemented with anonymous marker systems such as Amplified Fragment Length Polymorphisms (AFLPs; Vos et al. 1995), and Diversity Array Technology (DArT; Jaccoud et al. 2001). The reducing cost of DNA sequencing has led to the availability of large sequence data sets that enable the mining of sequence based markers, such as SSRs and SNPs, which may then be applied to diversity analysis, genetic trait mapping, association studies, and marker assisted selection.
    03/2009: pages 165-189;
  • Article: Genetic maps and the use of synteny.
    [show abstract] [hide abstract]
    ABSTRACT: Genetic linkage maps represent the order of known molecular genetic markers along a given chromosome for a given species. This provides an insight into the organisation of a plant genome. In comparative genomics, synteny is the preserved order of genes on chromosomes of related species which results from descent from a common ancestor. Comparative mapping is a valuable technique to identify similarities and differences between species and enables the transfer of information from one map to another and assists in the reconstruction of ancestral genomes. This chapter demonstrates the application of online resources to identify candidate genes underlying a QTL, conduct genome comparisons, identify syntenic regions and view comparative genetic maps in grass and Brassica species.
    Methods in molecular biology (Clifton, N.J.) 02/2009; 513:41-55.
  • Article: net.researchgate.refind.jaxb.schema.dblp.I@159c0a5d
    Nucleic Acids Research. 01/2009; 37:951-953.
  • Source
    Article: AutoSNPdb: an annotated single nucleotide polymorphism database for crop plants.
    [show abstract] [hide abstract]
    ABSTRACT: Single nucleotide polymorphisms (SNPs) may be considered the ultimate genetic marker as they represent the finest resolution of a DNA sequence (a single nucleotide), are generally abundant in populations and have a low mutation rate. Analysis of assembled EST sequence data provides a cost-effective means to identify large numbers of SNPs associated with functional genes. We have developed an integrated SNP discovery pipeline, which identifies SNPs from assembled EST sequences. The results are maintained in a custom relational database along with EST source and annotation information. The current database hosts data for the important crops rice, barley and Brassica. Users may rapidly identify polymorphic sequences of interest through BLAST sequence comparison, keyword searches of annotations derived from UniRef90 and GenBank comparisons, GO annotations or in genes corresponding to syntenic regions of reference genomes. In addition, SNPs between specific varieties may be identified for targeted mapping and association studies. SNPs are viewed using a user-friendly graphical interface. The database is freely accessible at http://autosnpdb.qfab.org.au/.
    Nucleic Acids Research 11/2008; 37(Database issue):D951-3. · 8.03 Impact Factor