Increased accuracy of artificial selection by using the realized relationship matrix.

Biosciences Research Division, Department of Primary Industries Victoria, 1 Park Drive, Bundoora 3083, Australia.
Genetics Research (Impact Factor: 2.2). 03/2009; 91(1):47-60. DOI: 10.1017/S0016672308009981
Source: PubMed

ABSTRACT Dense marker genotypes allow the construction of the realized relationship matrix between individuals, with elements the realized proportion of the genome that is identical by descent (IBD) between pairs of individuals. In this paper, we demonstrate that by replacing the average relationship matrix derived from pedigree with the realized relationship matrix in best linear unbiased prediction (BLUP) of breeding values, the accuracy of the breeding values can be substantially increased, especially for individuals with no phenotype of their own. We further demonstrate that this method of predicting breeding values is exactly equivalent to the genomic selection methodology where the effects of quantitative trait loci (QTLs) contributing to variation in the trait are assumed to be normally distributed. The accuracy of breeding values predicted using the realized relationship matrix in the BLUP equations can be deterministically predicted for known family relationships, for example half sibs. The deterministic method uses the effective number of independently segregating loci controlling the phenotype that depends on the type of family relationship and the length of the genome. The accuracy of predicted breeding values depends on this number of effective loci, the family relationship and the number of phenotypic records. The deterministic prediction demonstrates that the accuracy of breeding values can approach unity if enough relatives are genotyped and phenotyped. For example, when 1000 full sibs per family were genotyped and phenotyped, and the heritability of the trait was 0.5, the reliability of predicted genomic breeding values (GEBVs) for individuals in the same full sib family without phenotypes was 0.82. These results were verified by simulation. A deterministic prediction was also derived for random mating populations, where the effective population size is the key parameter determining the effective number of independently segregating loci. 
If the effective population size is large, a very large number of individuals must be genotyped and phenotyped in order to accurately predict breeding values for unphenotyped individuals from the same population. If the heritability of the trait is 0.3 and N_e = 100, approximately 12474 individuals with genotypes and phenotypes are required to predict GEBVs of unphenotyped individuals in the same population with an accuracy of 0.7 [corrected].
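The equivalence claimed in the abstract, between BLUP with a realized relationship matrix and a genomic selection model with normally distributed marker effects, can be illustrated numerically. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function names are hypothetical, the relationship matrix is built from centred 0/1/2 allele counts scaled by 2·Σp(1−p) (an assumed marker-based estimate of IBD sharing), phenotypes are mean-centred in place of fitting fixed effects, and variance components are treated as known.

```python
import numpy as np

def realized_relationships(M):
    """Return (G, W, c) for an n x m matrix M of 0/1/2 allele counts.

    Assumption: G = W W' / c, with W the centred genotypes and
    c = 2 * sum(p * (1 - p)); the source does not specify this scaling.
    """
    p = M.mean(axis=0) / 2.0            # observed allele frequency per marker
    W = M - 2.0 * p                     # centred genotypes
    c = 2.0 * np.sum(p * (1.0 - p))     # scaling constant
    return W @ W.T / c, W, c

def gblup(G, y, phenotyped, var_g, var_e):
    """Breeding values for all individuals, using records of the phenotyped subset.

    `phenotyped` holds integer row indices; entries of y elsewhere are ignored.
    """
    t = np.asarray(phenotyped)
    yc = y[t] - y[t].mean()             # simplification: mean-centre the records
    lhs = G[np.ix_(t, t)] + (var_e / var_g) * np.eye(t.size)
    return G[:, t] @ np.linalg.solve(lhs, yc)

def snp_blup(W, y, phenotyped, var_snp, var_e):
    """Same prediction via ridge regression on marker effects (normal prior)."""
    t = np.asarray(phenotyped)
    yc = y[t] - y[t].mean()
    Wt = W[t]
    lhs = Wt.T @ Wt + (var_e / var_snp) * np.eye(W.shape[1])
    effects = np.linalg.solve(lhs, Wt.T @ yc)
    return W @ effects                  # includes unphenotyped individuals

# Toy data: 30 individuals, 200 markers, the last 10 without phenotypes.
rng = np.random.default_rng(0)
M = rng.integers(0, 3, size=(30, 200)).astype(float)
y = rng.normal(size=30)
G, W, c = realized_relationships(M)
train = np.arange(20)
var_snp, var_e = 0.005, 1.0
a = gblup(G, y, train, c * var_snp, var_e)   # total genetic variance = c * var_snp
b = snp_blup(W, y, train, var_snp, var_e)
print("largest difference between the two predictions:", np.max(np.abs(a - b)))
```

The two routes agree because the genetic variance implied by the marker model is c times the per-marker variance, so both solve the same system in different dimensions (individuals versus markers). Which one is cheaper depends on whether there are more individuals or more markers.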

  • Source
    ABSTRACT: The application of quantitative genetics in plant and animal breeding has largely focused on additive models, which may also capture dominance and epistatic effects. Partitioning genetic variance into its additive and non-additive components using pedigree-based models (P-BLUP) is difficult with most commonly available family structures. However, the availability of dense panels of molecular markers makes possible the use of additive and dominance realized genomic relationships for the estimation of variance components and the prediction of genetic values (G-BLUP). We evaluated height data from a multi-family population of the tree species Pinus taeda with a systematic series of models accounting for additive, dominance and first-order epistatic interactions (additive-by-additive, dominance-by-dominance, and additive-by-dominance), using either pedigree- or marker-based information. We show that, compared with the pedigree, use of realized genomic relationships in marker-based models yields a substantially more precise separation of additive and non-additive components of genetic variance. We conclude that the marker-based relationship matrices in a model including additive and non-additive effects performed better, improving breeding value prediction. Moreover, our results suggest that, for tree height in this population, the additive and non-additive components of genetic variance are similar in magnitude. This novel result improves our current understanding of the genetic control and architecture of a quantitative trait and should be considered when developing breeding strategies.
    Genetics 10/2014; DOI:10.1534/genetics.114.171322 · 4.87 Impact Factor
  • Source
    ABSTRACT: Various models have been used for genomic prediction. Bayesian variable selection models often predict more accurate genomic breeding values than genomic BLUP (GBLUP), but GBLUP is generally preferred for routine genomic evaluations because of low computational demand. The objective of this study was to achieve the benefits of both models using results from Bayesian models and genome-wide association studies as weights on single nucleotide polymorphism (SNP) markers when constructing the genomic matrix (G-matrix) for genomic prediction. The data comprised 5,221 progeny-tested bulls from the Nordic Holstein population. The animals were genotyped using the Illumina Bovine SNP50 BeadChip (Illumina Inc., San Diego, CA). Weighting factors in this investigation were the posterior SNP variance, the square of the posterior SNP effect, and the corresponding minus base-10 logarithm of the marker association P-value [−log10(P)] of a t-test obtained from the analysis using a Bayesian mixture model with 4 normal distributions, the square of the estimated SNP effect, and the corresponding −log10(P) of a t-test obtained from the analysis using a classical genome-wide association study model (linear regression model). The weights were derived from the analysis based on data sets that were 0, 1, 3, or 5 yr before performing genomic prediction. In building a G-matrix, the weights were assigned either to each marker (single-marker weighting) or to each group of approximately 5 to 150 markers (group-marker weighting). The analysis was carried out for milk yield, fat yield, protein yield, fertility, and mastitis. Deregressed proofs (DRP) were used as response variables to predict genomic estimated breeding values (GEBV). Averaging over the 5 traits, the Bayesian model led to 2.0% higher reliability of GEBV than the GBLUP model with an original unweighted G-matrix. 
    The superiority of GBLUP with a weighted G-matrix over GBLUP with the original unweighted G-matrix was largest when the weighting factor was the posterior SNP variance, resulting in 1.7 percentage points higher reliability. The second best weighting factors were the −log10(P) of a t-test corresponding to the square of the posterior SNP effect from the Bayesian model and the −log10(P) of a t-test corresponding to the square of the estimated SNP effect from the linear regression model, followed by the square of the estimated SNP effect and the square of the posterior SNP effect. In addition, group-marker weighting performed better than single-marker weighting in terms of reducing bias of GEBV, and also slightly increased prediction reliability. The differences between weighting factors and scenarios were larger in prediction bias than in prediction accuracy. Finally, weights derived from a data set having a lag of up to 3 yr did not reduce reliability of GEBV. The results indicate that posterior SNP variance estimated from a Bayesian mixture model is a good alternative weighting factor, and that applying common weights to groups of 30 markers is a good strategy when using markers of the 50,000-marker (50K) chip. In a population with gradually increasing reference data, the weights can be updated once every 3 yr.
    Journal of Dairy Science 10/2014; 97(10). DOI:10.3168/jds.2014-8210 · 2.55 Impact Factor
  • Source
    ABSTRACT: In this review, we argue that breeding schemes need well-designed breeding plans to maximise long-term genetic gains from genomic information. Genomic information has been implemented in livestock breeding schemes with ad hoc breeding plans, suggesting that the potential benefits of genomic information are not being fully exploited. Breeding schemes need well-designed breeding plans to exploit the benefits of genomic information for two reasons. First, there are several components of breeding schemes with genomic information that impact on long-term genetic gains. Second, these components interact, which implies that breeding schemes need to optimise components simultaneously in order to maximise long-term genetic gains. Designing breeding plans that optimise components simultaneously is a complex task. In more cases than not, breeding schemes, their components, and interactions between these components do not allow optimum breeding plans to be designed by mere reasoning. We recommend using decision frameworks to design breeding plans for schemes that use genomic information: testing sound hypotheses by designing and executing controlled experiments using decision tools, such as mathematical-statistical models. These decision frameworks enable us to design optimum breeding plans by providing an objective and theoretical basis to make and validate breeding decisions, enabling us to understand the underlying mechanisms of breeding schemes with genomic information, and allowing us to test the practical implementation of breeding decisions against theoretical models. Genomic information is an exciting prospect for animal breeding, and there is clearly an important role for breeding plans that maximise long-term genetic gains in breeding schemes using genomic information.
    Livestock Science 08/2014; 166. DOI:10.1016/j.livsci.2014.06.016 · 1.10 Impact Factor
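The weighted G-matrix idea described in the Journal of Dairy Science abstract above (per-marker or per-group weights applied when building the genomic relationship matrix) can be sketched roughly as follows. This is an illustration under assumptions, not the cited study's procedure: the function names are hypothetical, the group weight is taken to be the mean weight of a block of consecutive markers, weights are rescaled to average 1, and the scaling 2·Σd·p(1−p) is an assumed normalization. In practice the weights would come from something like posterior SNP variances; here they are plain input numbers.

```python
import numpy as np

def group_weights(weights, group_size):
    """Replace each marker weight by the mean of its block of consecutive markers.

    Assumption: the abstract does not say how a common group weight is formed;
    the block mean is used here for illustration.
    """
    w = np.asarray(weights, dtype=float)
    out = np.empty_like(w)
    for start in range(0, w.size, group_size):
        block = slice(start, start + group_size)
        out[block] = w[block].mean()
    return out

def weighted_g_matrix(M, weights):
    """Weighted genomic relationship matrix G = W D W' / c.

    M is an n x m matrix of 0/1/2 allele counts, D = diag(weights) after
    rescaling the weights to mean 1, and c = 2 * sum(d * p * (1 - p)).
    With equal weights this reduces to the unweighted matrix.
    """
    p = M.mean(axis=0) / 2.0                 # observed allele frequencies
    W = M - 2.0 * p                          # centred genotypes
    d = np.asarray(weights, dtype=float)
    d = d * d.size / d.sum()                 # rescale so the weights average 1
    c = 2.0 * np.sum(d * p * (1.0 - p))
    return (W * d) @ W.T / c                 # W * d scales each marker column

# Toy usage: hypothetical per-marker weights shared within groups of 30 markers.
rng = np.random.default_rng(0)
M = rng.integers(0, 3, size=(10, 90)).astype(float)
raw = rng.gamma(shape=1.0, scale=1.0, size=90)
G_w = weighted_g_matrix(M, group_weights(raw, 30))
```

The resulting matrix can be used in place of the unweighted G-matrix in the usual GBLUP equations, which is what keeps the computational cost close to that of standard GBLUP.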

