Bayes factor between Student t and Gaussian mixed models within an animal breeding context

Genètica i Millora Animal, IRTA-Lleida, 25198 Lleida, Spain.
Genetics Selection Evolution (Impact Factor: 3.82). 07/2008; 40(4):395-413. DOI: 10.1051/gse:2008007
Source: PubMed


The implementation of Student t mixed models in animal breeding has been suggested as a useful statistical tool to effectively mute the impact of preferential treatment or other sources of outliers in field data. Nevertheless, these additional sources of variation are undeclared and we do not know whether a Student t mixed model is required or if a standard, and less parameterized, Gaussian mixed model would be sufficient to serve the intended purpose. Within this context, our aim was to develop the Bayes factor between two nested models that only differed in a bounded variable in order to easily compare a Student t and a Gaussian mixed model. It is important to highlight that the Student t density converges to a Gaussian process when degrees of freedom tend to infinity. The two models can then be viewed as nested models that differ in terms of degrees of freedom. The Bayes factor can be easily calculated from the output of a Markov chain Monte Carlo sampling of the complex model (Student t mixed model). The performance of this Bayes factor was tested under simulation and on a real dataset, using the deviation information criterion (DIC) as the standard reference criterion. The two statistical tools showed similar trends along the parameter space, although the Bayes factor appeared to be the more conservative. There was considerable evidence favoring the Student t mixed model for data sets simulated under Student t processes with limited degrees of freedom, and moderate advantages associated with using the Gaussian mixed model when working with datasets simulated with 50 or more degrees of freedom. For the analysis of real data (weight of Pietrain pigs at six months), both the Bayes factor and DIC slightly favored the Student t mixed model, with there being a reduced incidence of outlier individuals in this population.


Available from: Luis Varona
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Transmission ratio distortion (TRD) is the departure from the expected genotypic frequencies under Mendelian inheritance. This departure can be due to multiple physiological mechanisms during gametogenesis, fertilization, fetal and embryonic development, and early neonatal life. Although a few TRD loci have been reported in mouse, inheritance patterns have never been evaluated for TRD. In this article, we developed a Bayesian binomial model accounting for additive and dominant deviation TRD mechanisms. Moreover, this model was used to perform genome-wide scans for TRD quantitative trait loci (QTL) on six F2 mouse crosses involving between 296 and 541 mice and between 72 and 1854 genetic markers. Statistical significance of each model was checked at each genetic marker with Bayes factors. Genome scans revealed overdominance TRD QTL located in mouse chromosomes 1, 2, 12, 13, and 14 and additive TRD QTL in mouse chromosomes 2, 3, and 15, although these results did not replicate across mouse crosses. This research contributes new statistical tools for the analysis of specific genetic patterns involved in TRD in F2 populations, our results suggesting a relevant incidence of TRD phenomena in mouse with important implications for both statistical analyses and biological research.
    Genetics 02/2012; 191(1):247-59. DOI:10.1534/genetics.111.135988 · 5.96 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: We discuss the problem of selecting among alternative parametric models within the Bayesian framework. For model selection problems, which involve non-nested models, the common objective choice of a prior on the model space is the uniform distribution. The same applies to situations where the models are nested. It is our contention that assigning equal prior probability to each model is over simplistic. Consequently, we introduce a novel approach to objectively determine model prior probabilities, conditionally, on the choice of priors for the parameters of the models. The idea is based on the notion of the worth of having each model within the selection process. At the heart of the procedure is the measure of this worth using the Kullback–Leibler divergence between densities from different models.
    Scandinavian Journal of Statistics 03/2015; DOI:10.1111/sjos.12145 · 0.87 Impact Factor