Bos taurus genome assembly

Department of Molecular and Human Genetics, Human Genome Sequencing Center, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA.
BMC Genomics (Impact Factor: 3.99). 05/2009; 10(1):180. DOI: 10.1186/1471-2164-10-180
Source: PubMed


We present here the assembly of the bovine genome. The assembly method combines the BAC plus whole genome shotgun (WGS) local assembly used for the rat and sea urchin with the WGS-only assembly used for many other animal genomes, including the rhesus macaque.
The assembly process consisted of multiple phases. First, BACs were assembled with BAC-generated sequence, and subsequently in combination with the individual overlapping WGS reads. Different assembly parameters were tested to optimize the assembly of the BAC and WGS reads separately for each BAC. In parallel, a second assembly was produced using only the WGS sequences and a global whole-genome assembly method. The two assemblies were combined to create a more complete genome representation that retained the high-quality BAC-based local assembly information, but with gaps between BACs filled in with the WGS-only assembly. Finally, the entire assembly was placed on chromosomes using the available map information.
Over 90% of the assembly is now placed on chromosomes. The estimated genome size is 2.87 Gb, which represents a high degree of completeness, with 95% of the available EST sequences found in assembled contigs. The quality of the assembly was evaluated by comparison to 73 finished BACs, where the draft assembly covers between 92.5 and 100% (average 98.5%) of the finished BACs. The assembly contigs and scaffolds align linearly to the finished BACs, suggesting that misassemblies are rare. Genotyping and genetic mapping of 17,482 SNPs revealed that more than 99.2% were correctly positioned within the Btau_4.0 assembly, confirming the accuracy of the assembly.
The biological analysis of this bovine genome assembly is being published, and the sequence data is available to support future bovine research.
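The per-BAC coverage figures quoted above (92.5 to 100% of each finished BAC) are the kind of number that can be derived from alignment intervals. The following is a minimal sketch of that arithmetic only, not the authors' evaluation pipeline: it assumes that alignments of draft contigs to one finished BAC are already available as half-open (start, end) coordinates on that BAC, and the function names `merge_intervals` and `percent_covered` are hypothetical.

```python
from typing import Iterable, List, Tuple

# Half-open interval [start, end) in finished-BAC coordinates (an assumption
# of this sketch; real alignment output would need converting to this form).
Interval = Tuple[int, int]


def merge_intervals(intervals: Iterable[Interval]) -> List[Interval]:
    """Merge overlapping or touching intervals so no base is counted twice."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if end <= start:
            continue  # skip empty or inverted intervals
        if merged and start <= merged[-1][1]:
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


def percent_covered(intervals: Iterable[Interval], bac_length: int) -> float:
    """Percentage of a finished BAC covered by at least one aligned contig."""
    if bac_length <= 0:
        raise ValueError("bac_length must be positive")
    covered = 0
    for start, end in merge_intervals(intervals):
        # Clip to the BAC boundaries before counting covered bases.
        start, end = max(start, 0), min(end, bac_length)
        if end > start:
            covered += end - start
    return 100.0 * covered / bac_length


# Illustrative, made-up alignments on a hypothetical 100 kb finished BAC.
example_alignments = [(0, 50_000), (40_000, 90_000), (95_000, 100_000)]
example_coverage = percent_covered(example_alignments, 100_000)
```

Merging before summing matters because overlapping contig alignments would otherwise inflate the covered length; averaging such percentages over all finished BACs would give a summary comparable in form to the 98.5% reported above.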



Available from: George Weinstock,
    • "The pair-end reads from male and female libraries were merged and subjected to genome assembly. SOAPdenovo [21] was used for contig assembly and scaffolding with k-mer size of 23 bp. DNA paired-end reads were used for bridging scaffold gaps by using GapCloser [21]. "
    ABSTRACT: Asparagus officinalis is an economically and nutritionally important vegetable crop that is widely cultivated and is used as a model dioecious species to study plant sex determination and sex chromosome evolution. To improve our understanding of its genome composition, especially with respect to transposable elements (TEs), which make up the majority of the genome, we performed Illumina HiSeq2000 sequencing of both male and female asparagus genomes followed by bioinformatics analysis. We generated 17 Gb of sequence (12× coverage) and assembled it into 163,406 scaffolds with a total cumulative length of 400 Mbp, which represents about 30% of the asparagus genome. Overall, TEs masked about 53% of the A. officinalis assembly. The majority of the identified TEs belonged to LTR retrotransposons, which constitute about 28% of genomic DNA, with Ty1/copia elements being more diverse and accumulating to higher copy numbers than Ty3/gypsy. Compared with LTR retrotransposons, non-LTR retrotransposons and DNA transposons were relatively rare. In addition, comparison of the abundance of the TE groups between male and female genomes showed that the overall TE composition was highly similar, with only slight differences in the abundance of several TE groups, which is consistent with the relatively recent origin of asparagus sex chromosomes. This study greatly improves our knowledge of the repetitive sequence construction of asparagus, which facilitates the identification of TEs responsible for the early evolution of plant sex chromosomes and is helpful for further studies on this dioecious plant.
    PLoS ONE 05/2014; 9(5):e97189. DOI:10.1371/journal.pone.0097189 · 3.23 Impact Factor
    • "At the same time as genome scans were beginning to yield causal mutations in non-laboratory animals, the first draft genome sequence assemblies were published for humans (Lander et al. 2001; Venter et al. 2001) and mice (Waterston et al. 2002), ushering in a new revolution. Draft genome sequence assemblies soon became available for the main non-laboratory animals, and genome-assembly publications followed for chicken (International Chicken Genome Sequencing Consortium 2004), dog (Lindblad-Toh et al. 2005), cat (Pontius et al. 2007), sheep (Dalrymple et al. 2007), cattle (Elsik et al. 2009; Liu et al. 2009; Zimin et al. 2009), horse (Wade et al. 2009) and pig (Archibald et al. 2010; Groenen et al. 2012). The continual improvements in the quality and the extent of annotation of these genome assemblies saw a consequential decrease in the need for comparative mapping analysis."
    ABSTRACT: Within two years of the re-discovery of Mendelism, Bateson and Saunders had described six traits in non-laboratory animals (five in chickens and one in cattle) that show single-locus (Mendelian) inheritance. In the ensuing decades, much progress was made in documenting an ever-increasing number of such traits. In 1987 came the first discovery of a causal mutation for a Mendelian trait in non-laboratory animals: a non-sense mutation in the thyroglobulin gene (TG), causing familial goitre in cattle. In the years that followed, the rate of discovery of causal mutations increased, aided mightily by the creation of genome-wide microsatellite maps in the 1990s and even more mightily by genome assemblies and single-nucleotide polymorphism (SNP) chips in the 2000s. With sequencing costs decreasing rapidly, by 2012 causal mutations were being discovered in non-laboratory animals at a rate of more than one per week. By the end of 2012, the total number of Mendelian traits in non-laboratory animals with known causal mutations had reached 499, which was half the number of published single-locus (Mendelian) traits in those species. The distribution of types of mutations documented in non-laboratory animals is fairly similar to that in humans, with almost half being missense or non-sense mutations. The ratio of missense to non-sense mutations in non-laboratory animals to the end of 2012 was 193:78. The fraction of non-sense mutations (78/271 = 0.29) was not very different from the fraction of non-stop codons that are just one base substitution away from a stop codon (21/61 = 0.34).
    Animal Genetics 12/2013; 45(2). DOI:10.1111/age.12103 · 2.21 Impact Factor
    • "The SNP positions within a chromosome were based on the Bos taurus genome assembly (Btau_4.0) [19]. "
    ABSTRACT: For several years, in human nutrition there has been a focus on the proportion of unsaturated fatty acids (UFA) and saturated fatty acids (SFA) found in bovine milk. The positive health-related properties of UFA versus SFA have increased the demand for food products with a higher proportion of UFA. To be able to change the UFA and SFA content of the milk by breeding it is important to know whether there is a genetic component underlying the individual FA in the milk. We have estimated the heritability for individual FA in the milk of Danish Holstein. For this purpose we used information of SNP markers instead of the traditional pedigree relationships. Estimates of heritability were moderate within the range of 0.10 for C18:1 trans-11 to 0.34 for C8:0 and C10:0, whereas the estimates for saturated fatty acids and unsaturated fatty acids were 0.14 and 0.18, respectively. Posterior standard deviations were in the range from 0.07 to 0.17. The correlation estimates showed a general pattern of two groups, one group mainly consisting of saturated fatty acids and one group mainly consisting of unsaturated fatty acids. The phenotypic correlation ranged from -0.95 (saturated fatty acids and unsaturated fatty acids) to 0.99 (unsaturated fatty acids and monounsaturated fatty acids) and the genomic correlation for fatty acids ranged from -0.29 to 0.91. The heritability estimates obtained in this study are in general accordance with heritability estimates from studies using pedigree data and/or a genomic relationship matrix in the context of a REML approach. SFA and UFA expressed a strong negative phenotypic correlation and a weaker genetic correlation. This is in accordance with the theory that SFA is synthesized de novo, while UFA can be regulated independently from the regulation of SFA by the feeding regime.
    BMC Genetics 09/2013; 14(1):79. DOI:10.1186/1471-2156-14-79 · 2.40 Impact Factor