Bert Kappen

Radboud Universiteit Nijmegen, Nijmegen, Provincie Gelderland, Netherlands

Are you Bert Kappen?

Claim your profile

Publications (15)2.85 Total impact

  • Source
    Article: KL-learning: Online solution of Kullback-Leibler control problems
    Joris Bierkens, Bert Kappen
    [show abstract] [hide abstract]
    ABSTRACT: We introduce a stochastic approximation method for the solution of an ergodic Kullback-Leibler control problem. A Kullback-Leibler control problem is a Markov decision process on a finite state space in which the control cost is proportional to a Kullback-Leibler divergence of the controlled transition probabilities with respect to the uncontrolled transition probabilities. The algorithm discussed in this work allows for a sound theoretical analysis using the ODE method. In a numerical experiment the algorithm is shown to be comparable to the power method and the related Z-learning algorithm in terms of convergence speed. It may be used as the basis of a reinforcement learning style algorithm for Markov decision problems.
    12/2011;
  • Article: Stochastic Optimal Control of State Constrained Systems
    [show abstract] [hide abstract]
    ABSTRACT: In this article we consider the problem of stochastic optimal control in continuous-time and state-action space of systems with state constraints. These systems typically appear in the area of robotics, where hard obstacles constrain the state space of the robot. A common approach is to solve the problem locally using a linear-quadratic Gaussian (LQG) method. We take a different approach and apply path integral control as introduced by Kappen (Kappen, H.J. (2005a), ‘Path Integrals and Symmetry Breaking for Optimal Control Theory’, Journal of Statistical Mechanics: Theory and Experiment, 2005, P11011; Kappen, H.J. (2005b), ‘Linear Theory for Control of Nonlinear Stochastic Systems’, Physical Review Letters, 95, 200201). We use hybrid Monte Carlo sampling to infer the control. We introduce an adaptive time discretisation scheme for the simulation of the controlled dynamics. We demonstrate our approach on two examples, a simple particle in a halfspace and a more complex two-joint manipulator, and we show that in a high noise regime our approach outperforms the iterative LQG method.
    International Journal of Control 04/2011; 84(3). · 0.98 Impact Factor
  • Source
    Chapter: Bayesian Networks for Expert Systems: Theory and Practical Applications
    [show abstract] [hide abstract]
    ABSTRACT: Bayesian networks are widely accepted as models for reasoning with uncertainty. In this chapter, we focus on models that are created using domain expertise only. After a short review of Bayesian network models and common Bayesian network modeling approaches, we will discuss in more detail three applications of Bayesian networks.With these applications, we aim to illustrate the modeling power and flexibility of the Bayesian networks, which go beyond the standard textbook applications. The first network is applied in a system for medical diagnostic decision support. A distinguishing feature of this network is the large amount of variables in the model. The second one involves an application for petrophysical decision support to determine the mineral content of a well, based on borehole measurements. This model differs from standard Bayesian networks in terms of its continuous variables and nonlinear relations. Finally, we will discuss an application for victim identification by kinship analysis based on DNA profiles. The distinguishing feature in this application is that Bayesian networks are generated and computed on-the-fly based on case information.
    03/2010: pages 547-578;
  • Source
    Article: A Bayesian petrophysical decision support system for estimation of reservoir compositions.
    Expert Syst. Appl. 01/2010; 37:7526-7532.
  • Source
    Article: Graphical Model Inference in Optimal Control of Stochastic Multi-Agent Systems.
    J. Artif. Intell. Res. (JAIR). 01/2008; 32:95-122.
  • Source
    Article: Loop corrections for message passing algorithms in continuous variable models
    Bastian Wemmenhove, Bert Kappen
    [show abstract] [hide abstract]
    ABSTRACT: In this paper we derive the equations for Loop Corrected Belief Propagation on a continuous variable Gaussian model. Using the exactness of the averages for belief propagation for Gaussian models, a different way of obtaining the covariances is found, based on Belief Propagation on cavity graphs. We discuss the relation of this loop correction algorithm to Expectation Propagation algorithms for the case in which the model is no longer Gaussian, but slightly perturbed by nonlinear terms.
    07/2007;
  • Source
    Conference Proceeding: Optimal Control in Large Stochastic Multi-agent Systems.
    Adaptive Agents and Multi-Agent Systems III. Adaptation and Multi-Agent Learning, 5th, 6th, and 7th European Symposium, ALAMAS 2005-2007 on Adaptive and Learning Agents and Multi-Agent Systems, Revised Selected Papers; 01/2007
  • Source
    Article: Loop Corrected Belief Propagation.
    Journal of Machine Learning Research - Proceedings Track. 01/2007; 2:331-338.
  • Source
    Conference Proceeding: Optimal on-line scheduling in stochastic multiagent systems in continuous space-time.
    6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), Honolulu, Hawaii, USA, May 14-18, 2007; 01/2007
  • Source
    Article: Loop corrections for approximate inference
    Joris Mooij, Bert Kappen
    [show abstract] [hide abstract]
    ABSTRACT: We propose a method for improving approximate inference methods that corrects for the influence of loops in the graphical model. The method is applicable to arbitrary factor graphs, provided that the size of the Markov blankets is not too large. It is an alternative implementation of an idea introduced recently by Montanari and Rizzo (2005). In its simplest form, which amounts to the assumption that no loops are present, the method reduces to the minimal Cluster Variation Method approximation (which uses maximal factors as outer clusters). On the other hand, using estimates of the effect of loops (obtained by some approximate inference algorithm) and applying the Loop Correcting (LC) method usually gives significantly better results than applying the approximate inference algorithm directly without loop corrections. Indeed, we often observe that the loop corrected error is approximately the square of the error of the approximate inference method used to estimate the effect of loops. We compare different variants of the Loop Correcting method with other approximate inference methods on a variety of graphical models, including "real world" networks, and conclude that the LC approach generally obtains the most accurate results.
    12/2006;
  • Article: Improving Cox survival analysis with a neural-Bayesian approach.
    [show abstract] [hide abstract]
    ABSTRACT: In this article we show that traditional Cox survival analysis can be improved upon when supplemented with sensible priors and analysed within a neural Bayesian framework. We demonstrate that the Bayesian method gives more reliable predictions, in particular for relatively small data sets. The obtained posterior (the probability distribution of network parameters given the data) which in itself is intractable, can be made accessible by several approximations. We review approximations by Hybrid Markov Chain Monte Carlo sampling, a variational method and the Laplace approximation. We argue that although each Bayesian approach circumvents the shortcomings of the original Cox analysis, and therefore yields better predictive results, in practice the use of variational methods or Laplace is preferable. Since Cox survival analysis is infamous for its poor results with (too) many inputs, we use the Bayesian posterior to estimate p-values on the inputs and to formulate an algorithm for backward elimination. We show that after removal of irrelevant inputs Bayesian methods still achieve significantly better results than classical Cox.
    Statistics in Medicine 11/2004; 23(19):2989-3012. · 1.88 Impact Factor
  • Conference Proceeding: Approximate Inference and Constrained Optimization.
    Tom Heskes, Kees Albers, Bert Kappen
    UAI '03, Proceedings of the 19th Conference in Uncertainty in Artificial Intelligence, Acapulco, Mexico, August 7-10 2003; 01/2003
  • Source
    Article: Approximations of Bayesian networks through KL minimisation
    Wim Wiegerinck, Bert Kappen
    [show abstract] [hide abstract]
    ABSTRACT: Exact inference in large, complex Bayesian networks is computationally intractable. Approximate schemes are therefore of great importance for real world computation. In this paper we consider an approximation scheme in which the original Bayesian network is approximated by another Bayesian network. The approximating network is optimised by an iterative procedure, which minimises the Kullback-Leibler divergence between the two networks. The procedure is guaranteed to converge to a local minimum of the Kullback-Leibler divergence. An important question in this scheme is how to choose the structure of the approximating network. In this paper we show how redundant structures of the approximating model can be pruned in advance. Simulation results of model selection and model optimisation are provided to illustrate the methods.
    03/2000;
  • Source
    Article: A Gaussian approximation for stochastic non-linear dynamical processes with annihilation
    Wim Wiegerinck, Bert Kappen
    [show abstract] [hide abstract]
    ABSTRACT: We consider a stochastic nonlinear dynamical process with annihilation of parti-cles. This process can be viewed as the continuous time version of the extended Kalman filter/smoother. It also plays an important role in stochastic optimal con-trol theory. We derive a Gaussian approximation for this process. With the use of the path integral formalism we derive Euler-Lagrange equations for the mode. Furthermore, we derive a linear noise approximation to estimate the size of the fluctuations around the mode, and estimates of the partition function, based on the mode and Gaussian corrections. Numerical experiments confirm the validity of the approximation method. In addition, they show that the Gaussian correction provides a significant improvement of the estimate of the partition function.
  • Article: Approximate algorithms for neural-Bayesian approaches
    Tom Heskes, Bart Bakker, Bert Kappen
    [show abstract] [hide abstract]
    ABSTRACT: We describe two specific examples of neural-Bayesian approaches for complex modeling tasks: survival analysis and multitask learning. In both cases, we can come up with reasonable priors on the parameters of the neural network. As a result, the Bayesian approaches improve their (maximum likelihood) frequentist counterparts dramatically. By illustrating their application on the models under study, we review and compare algorithms that can be used for Bayesian inference: Laplace approximation, variational algorithms, Monte Carlo sampling, and empirical Bayes.
    Theoretical Computer Science.