David Bolin

David Bolin
  • PhD
  • Professor (Associate) at King Abdullah University of Science and Technology

About

114
Publications
22,807
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
2,087
Citations
Introduction
I am leading a research group in statistics at KAUST. If you are interested in joining as a PhD student or PostDoc, send me an email for further details.
Current institution
King Abdullah University of Science and Technology
Current position
  • Professor (Associate)
Additional affiliations
May 2016 - October 2019
University of Gothenburg
Position
  • Professor (Associate)
January 2014 - April 2016
Chalmers University of Technology
Position
  • Professor (Assistant)
December 2012 - December 2013
Umeå University
Position
  • PostDoc Position

Publications

Publications (114)
Preprint
The R software package rSPDE contains methods for approximating Gaussian random fields based on fractional-order stochastic partial differential equations (SPDEs). A common example of such fields are Whittle-Mat\'ern fields on bounded domains in $\mathbb{R}^d$, manifolds, or metric graphs. The package also implements various other models which are...
Preprint
Full-text available
The modeling of spatial point processes has advanced considerably, yet extending these models to non-Euclidean domains, such as road networks, remains a challenging problem. We propose a novel framework for log-Gaussian Cox processes on general compact metric graphs by leveraging the Gaussian Whittle-Mat\'ern fields, which are solutions to fraction...
Article
Full-text available
Spatial statistics is traditionally based on stationary models on $${\mathbb{R}}^{d}$$ like Matérn fields. The adaptation of traditional spatial statistical methods, originally designed for stationary models in Euclidean spaces, to effectively model phenomena on linear networks such as stream systems and urban road networks is challenging. The curr...
Preprint
The increasing availability of network data has driven the development of advanced statistical models specifically designed for metric graphs, where Gaussian processes play a pivotal role. While models such as Whittle-Mat\'ern fields have been introduced, there remains a lack of practically applicable options that accommodate flexible non-stationar...
Preprint
We derive an explicit link between Gaussian Markov random fields on metric graphs and graphical models, and in particular show that a Markov random field restricted to the vertices of the graph is, under mild regularity conditions, a Gaussian graphical model with a distribution which is faithful to its pairwise independence graph, which coincides w...
Article
Model checking is essential to evaluate the adequacy of statistical models and the validity of inferences drawn from them. Particularly, hierarchical models such as latent Gaussian models (LGMs) pose unique challenges as it is difficult to check assumptions on the latent parameters. Diagnostic statistics are often used to quantify the degree to whi...
Preprint
The computational cost for inference and prediction of statistical models based on Gaussian processes with Mat\'ern covariance functions scales cubicly with the number of observations, limiting their applicability to large data sets. The cost can be reduced in certain special cases, but there are currently no generally applicable exact methods with...
Preprint
Full-text available
Scoring rules are aimed at evaluation of the quality of predictions, but can also be used for estimation of parameters in statistical models. We propose estimating parameters of multivariate spatial models by maximising the average leave-one-out cross-validation score. This method, LOOS, thus optimises predictions instead of maximising the likeliho...
Preprint
Full-text available
In econometrics, the Efficient Market Hypothesis posits that asset prices reflect all available information in the market. Several empirical investigations show that market efficiency drops when it undergoes extreme events. Many models for multivariate extremes focus on positive dependence, making them unsuitable for studying extremal dependence in...
Article
Full-text available
Trees in agroforestry parklands influence crops both through competitive and facilitative mechanism, but the effects are challenging to disentangle due to the complexity of the system with high variability in tree cover structure and species diversity and crop combinations. Focusing on a landscape in central Burkina Faso dominated by Vitellaria par...
Preprint
Full-text available
Moving average processes driven by exponential-tailed L\'evy noise are important extensions of their Gaussian counterparts in order to capture deviations from Gaussianity, more flexible dependence structures, and sample paths with jumps. Popular examples include non-Gaussian Ornstein--Uhlenbeck processes and type G Mat\'ern stochastic partial diffe...
Preprint
Full-text available
Model checking is essential to evaluate the adequacy of statistical models and the validity of inferences drawn from them. Particularly, hierarchical models such as latent Gaussian models (LGMs) pose unique challenges as it is difficult to check assumptions about the distribution of the latent parameters. Discrepancy measures are often used to quan...
Preprint
Statistical analysis of extremes can be used to predict the probability of future extreme events, such as large rainfalls or devastating windstorms. The quality of these forecasts can be measured through scoring rules. Locally scale invariant scoring rules put equal importance on the forecasts at different locations regardless of differences in the...
Preprint
The Whittle-Mat\'ern fields are a recently introduced class of Gaussian processes on metric graphs, which are specified as solutions to a fractional-order stochastic differential equation on the metric graph. Contrary to earlier covariance-based approaches for specifying Gaussian fields on metric graphs, the Whittle-Mat\'ern fields are well-defined...
Preprint
There has recently been much interest in Gaussian processes on linear networks and more generally on compact metric graphs. One proposed strategy for defining such processes on a metric graph $\Gamma$ is through a covariance function that is isotropic in a metric on the graph. Another is through a fractional order differential equation $L^\alpha (\...
Preprint
Full-text available
The fractional differential equation $L^\beta u = f$ posed on a compact metric graph is considered, where $\beta>\frac14$ and $L = \kappa - \frac{\mathrm{d}}{\mathrm{d} x}(H\frac{\mathrm{d}}{\mathrm{d} x})$ is a second-order elliptic operator equipped with certain vertex conditions and sufficiently smooth and positive coefficients $\kappa,H$. We de...
Chapter
Spatial fields, aka random fields, are important models for analyzing data collected at different spatial locations. This article provides a short introduction to spatial fields for continuously indexed data, focusing on how to construct flexible Gaussian and non‐Gaussian fields. A brief introduction to spatial prediction of random fields is given,...
Preprint
Latent Gaussian models (LGMs) are perhaps the most commonly used class of models in statistical applications. Nevertheless, in areas ranging from longitudinal studies in biostatistics to geostatistics, it is easy to find datasets that contain inherently non-Gaussian features, such as sudden jumps or spikes, that adversely affect the inferences and...
Preprint
Full-text available
Task functional magnetic resonance imaging (fMRI) is a type of neuroimaging data used to identify areas of the brain that activate during specific tasks or stimuli. These data are conventionally modeled using a massive univariate approach across all data locations, which ignores spatial dependence at the cost of model power. We previously developed...
Preprint
Full-text available
The stochastic partial differential equation (SPDE) approach is widely used for modeling large spatial datasets. It is based on representing a Gaussian random field $u$ on $\mathbb{R}^d$ as the solution of an elliptic SPDE $L^\beta u = \mathcal{W}$ where $L$ is a second-order differential operator, $2\beta$ (belongs to natural number starting from...
Article
Independent component analysis is commonly applied to functional magnetic resonance imaging (fMRI) data to extract independent components (ICs) representing functional brain networks. While ICA produces reliable group-level estimates, single-subject ICA often produces noisy results. Template ICA is a hierarchical ICA model using empirical populatio...
Preprint
We define a new class of Gaussian processes on compact metric graphs such as street or river networks. The proposed models, the Whittle-Matérn fields, are defined via a fractional stochastic partial differential equation on the compact metric graph and are a natural extension of Gaussian fields with Matérn covariance functions on Euclidean domains...
Preprint
The normal inverse Gaussian (NIG) and generalized asymmetric Laplace (GAL) distributions can be seen as skewed and heavy-tailed extensions of the Gaussian distribution. Models driven by these more flexible noise distributions are then regarded as generalizations of simpler Gaussian models. Inferential procedures tend to overestimate the degree of n...
Preprint
Full-text available
Analysis of brain imaging scans is critical to understanding the way the human brain functions, which can be leveraged to treat injuries and conditions that affect the quality of life for a significant portion of the human population. In particular, functional magnetic resonance imaging (fMRI) scans give detailed data on a living subject at high sp...
Article
The ocean wave distribution in a specific region of space and time is described by its sea state. Knowledge about the sea states a ship encounters on a journey can be used to assess various parameters of risk and wear associated with this journey. Two important characteristics of the sea state are significant wave height and mean wave period. We pr...
Article
Gaussian processes and random fields have a long history, covering multiple approaches to representing spatial and spatio-temporal dependence structures, such as covariance functions, spectral representations, reproducing kernel Hilbert spaces, and graph based models. This article describes how the stochastic partial differential equation approach...
Article
Full-text available
The general linear model (GLM) is a widely popular and convenient tool for estimating the functional brain response and identifying areas of significant activation during a task or stimulus. However, the classical GLM is based on a massive univariate approach that does not explicitly leverage the similarity of activation patterns among neighboring...
Preprint
Various natural phenomena exhibit spatial extremal dependence at short distances only, while it usually vanishes as the distance between sites increases arbitrarily. However, models proposed in the literature for spatial extremes, which are based on max-stable or Pareto processes or comparatively less computationally demanding ``sub-asymptotic'' mo...
Preprint
Full-text available
Gaussian processes and random fields have a long history, covering multiple approaches to representing spatial and spatio-temporal dependence structures, such as covariance functions, spectral representations, reproducing kernel Hilbert spaces, and graph based models. This article describes how the stochastic partial differential equation approach...
Article
Both intensities of individual extreme rainfall events and the frequency of such events are important for infrastructure planning. We develop a new statistical extreme value model, the PGEV model, which makes it possible to use high quality annual maximum series data instead of lesswell checked daily data to estimate trends in intensity and frequen...
Preprint
Full-text available
The general linear model (GLM) is a popular and convenient tool for estimating the functional brain response and identifying areas of significant activation during a task or stimulus. However, the classical GLM is based on a massive univariate approach that does not leverage the similarity of activation patterns among neighboring brain locations. A...
Preprint
Full-text available
Methods for inference and simulation of linearly constrained Gaussian Markov Random Fields (GMRF) are computationally prohibitive when the number of constraints is large. In some cases, such as for intrinsic GMRFs, they may even be unfeasible. We propose a new class of methods to overcome these challenges in the common case of sparse constraints, w...
Article
Full-text available
Pore geometry characterization-methods are important tools for understanding how pore structure influences properties such as transport through a porous material. Bottlenecks can have a large influence on transport and related properties. However, existing methods only catch certain types of bottleneck effects caused by variations in pore size. We...
Preprint
Full-text available
We consider Gaussian measures $\mu, \tilde{\mu}$ on a separable Hilbert space, with fractional-order covariance operators $A^{-2\beta}$ resp. $\tilde{A}^{-2\tilde{\beta}}$, and derive necessary and sufficient conditions on $A, \tilde{A}$ and $\beta, \tilde{\beta} > 0$ for I. equivalence of the measures $\mu$ and $\tilde{\mu}$, and II. uniform asymp...
Article
Full-text available
The relation between pedestrian flows, the structure of the city and the street network is of central interest in urban research. However, studies of this have traditionally been based on small data sets and simplistic statistical methods. Because of a recent large‐scale cross‐country pedestrian survey, there is now enough data available to study t...
Article
Full-text available
We consider the analysis of continuous repeated measurement outcomes that are collected longitudinally. A standard framework for analysing data of this kind is a linear Gaussian mixed effects model within which the outcome variable can be decomposed into fixed effects, time invariant and time-varying random effects, and measurement noise. We develop...
Preprint
Full-text available
The Mat\' ern field is the most well known family of covariance functions used for Gaussian processes in spatial models. We build upon the original research of Whittle (1953, 1964) and develop the diffusion-based extension of the Mat\' ern field to space-time (DEMF). We argue that this diffusion-based extension is the natural extension of these pro...
Preprint
ICA is commonly applied to fMRI data to extract independent components (ICs) representing functional brain networks. While ICA produces highly reliable results at the group level, single-subject ICA often produces inaccurate results due to high noise levels. Previously, we proposed template ICA (tICA), a hierarchical ICA model using empirical popul...
Preprint
Full-text available
Optimal linear prediction (also known as kriging) of a random field $\{Z(x)\}_{x\in\mathcal{X}}$ indexed by a compact metric space $(\mathcal{X},d_{\mathcal{X}})$ can be obtained if the mean value function $m\colon\mathcal{X}\to\mathbb{R}$ and the covariance function $\varrho\colon\mathcal{X}\times\mathcal{X}\to\mathbb{R}$ of $Z$ are known. We cons...
Article
Full-text available
A non-stationary Gaussian random field model is developed based on a combination of the stochastic partial differential equation (SPDE) approach and the classical deformation method. With the deformation method, a stationary field is defined on a domain which is deformed so that the field becomes non-stationary. We show that if the stationary field...
Article
Full-text available
Vehicle-to-vehicle (V2V) wireless communications can improve traffic safety at road intersections and enable congestion avoidance. However, detailed knowledge about the wireless propagation channel is needed for the development and realistic assessment of V2V communication systems. In this paper, we present a novel geometry-based stochastic MIMO ch...
Article
For many applications with multivariate data, random‐field models capturing departures from Gaussianity within realizations are appropriate. For this reason, we formulate a new class of multivariate non‐Gaussian models based on systems of stochastic partial differential equations with additive type G noise whose marginal covariance functions are of...
Preprint
Full-text available
Averages of proper scoring rules are often used to rank probabilistic forecasts. In many cases, the individual observations and their predictive distributions in these averages have variable scale (variance). We show that some of the most popular proper scoring rules, such as the continuous ranked probability score (CRPS), up-weight observations wi...
Article
Full-text available
Plot systems (also referred to as "property", "parcel", or "lot") are generally recognised as the organisational framework of urban form that contributes to the economic performance of cities. However, studies that link the spatial form of plots to economic data are limited. The paper builds on the theory of Webster and Lai, which argues that the p...
Conference Paper
Full-text available
This work follows a long line of studies and empirical investigations in Space Syntax research, that, in general, try to conceptualise, describe and quantify the relation between physical space and human agency. How many people share public space is known to affect many socioeconomic processes in cities, such as segregation, vitality and local comm...
Preprint
Bayesian whole-brain functional magnetic resonance imaging (fMRI) analysis with three-dimensional spatial smoothing priors have been shown to produce state-of-the-art activity maps without pre-smoothing the data. The proposed inference algorithms are computationally demanding however, and the proposed spatial priors have several less appealing prop...
Preprint
Full-text available
The ocean wave distribution in a specific region of space and time is described by its sea state. Knowledge about the sea states a ship encounters on a journey can be used to assess various parameters of risk and wear associated with the journey. Two important characteristics of the sea state are the significant wave height and mean wave period. We...
Preprint
The general condition of the ocean surface at a certain location in space and time is described by the sea state. Knowing the distribution of the sea state is, for example, important when estimating the wear and risks associated with a planned journey of a ship. One important characteristic of the sea state is the significant wave height. We propos...
Preprint
Vehicle-to-vehicle (V2V) wireless communications can improve traffic safety at road intersections and enable congestion avoidance. However, detailed knowledge about the wireless propagation channel is needed for the development and realistic assessment of V2V communication systems. In this paper, we present a novel geometry-based stochastic MIMO ch...
Article
Full-text available
The numerical approximation of solutions to stochastic partial differential equations with additive spatial white noise on bounded domains in $\mathbb{R}^d$ is considered. The differential operator is given by the fractional power $L^\beta$, $\beta\in(0,1)$, of an integer order elliptic differential operator $L$ and is therefore non-local. Its inve...
Article
Full-text available
Diffusive transport through porous materials is to a large extent determined by the microstructure of the material. To design materials with controlled transport properties, it is hence important to connect properties of the pore geometry to diffusive transport rates. Different kinds of microstructures from a stochastic model are generated and mult...
Article
Full-text available
The R software package excursions contains methods for calculating probabilistic excursion sets, contour credible regions, and simultaneous confidence bands for latent Gaussian stochastic processes and fields. It also contains methods for uncertainty quantification of contour maps and computation of Gaussian integrals. This article describes the th...
Article
For many problems in geostatistics, land cover classification, and brain imaging the classical Gaussian process models are unsuitable due to sudden, discontinuous, changes in the data. To handle data of this type, we introduce a new model class that combines discrete Markov random fields (MRFs) with Gaussian Markov random fields. The model is defin...
Preprint
Full-text available
There is an interest to replace computed tomography (CT) images with magnetic resonance (MR) images for a number of diagnostic and therapeutic workflows. In this article, predicting CT images from a number of magnetic resonance imaging (MRI) sequences using regression approach is explored. Two principal areas of application for estimated CT images...
Article
Full-text available
Coming up with Bayesian models for spatial data is easy, but performing inference with them can be challenging. Writing fast inference code for a complex spatial model with realistically‐sized datasets from scratch is time‐consuming, and if changes are made to the model, there is little guarantee that the code performs well. The key advantages of R...
Article
The performance of Markov chain Monte Carlo (MCMC) algorithms like the Metropolis Hastings Random Walk (MHRW) is highly dependent on the choice of scaling matrix for the proposal distributions. A popular choice of scaling matrix in adaptive MCMC methods is to use the empirical covariance matrix (ECM) of previous samples. However, this choice is pro...
Preprint
We consider the analysis of continuous repeated measurement outcomes that are collected through time, also known as longitudinal data. A standard framework for analysing data of this kind is a linear Gaussian mixed-effects model within which the outcome variable can be decomposed into fixed-effects, time-invariant and time-varying random-effects, a...
Article
Full-text available
Coming up with Bayesian models for spatial data is easy, but performing inference with them can be challenging. Writing fast inference code for a complex spatial model with realistically-sized datasets from scratch is time-consuming, and if changes are made to the model, there is little guarantee that the code performs well. The key advantages of R...
Article
Full-text available
The numerical approximation of the solution $u$ to a stochastic partial differential equation with additive spatial white noise on a bounded domain in $\mathbb{R}^d$ is considered. The differential operator is assumed to be a fractional power $L^\beta$ of an integer order elliptic differential operator $L$, where $\beta\in(0,1)$. The solution $u$ i...
Preprint
The numerical approximation of the solution to a stochastic partial differential equation with additive spatial white noise on a bounded domain is considered. The differential operator is assumed to be a fractional power of an integer order elliptic differential operator. The solution is approximated by means of a finite element discretization in s...
Article
Full-text available
A popular approach for modeling and inference in spatial statistics is to represent Gaussian random fields as solutions to stochastic partial differential equations (SPDEs) of the form Lβu=W, where W is Gaussian white noise, L is a second-order differential operator, and β>0 is a parameter that determines the smoothness of u. However, this approach...
Preprint
A popular approach for modeling and inference in spatial statistics is to represent Gaussian random fields as solutions to stochastic partial differential equations (SPDEs) of the form $L^{\beta}u = \mathcal{W}$, where $\mathcal{W}$ is Gaussian white noise, $L$ is a second-order differential operator, and $\beta>0$ is a parameter that determines th...
Article
Full-text available
The log-Gaussian Cox process (LGCP) is a popular point process for modeling non interacting spatial point patterns. This paper extends the LGCP model to data exhibiting a finite number of fundamentally different behaviors partitioned on the spatial domain into an unknown set of subregions. The aim of the analyst might be either to identify and clas...
Preprint
The log-Gaussian Cox process (LGCP) is a popular point process for modeling non-interacting spatial point patterns. This paper extends the LGCP model to handle data exhibiting fundamentally different behaviors in different subregions of the spatial domain. The aim of the analyst might be either to identify and classify these regions, to perform kri...
Article
Full-text available
Cortical surface fMRI (cs-fMRI) has recently grown in popularity versus traditional volumetric fMRI, as it allows for more meaningful spatial smoothing and is more compatible with the common assumptions of isotropy and stationarity in Bayesian spatial models. However, as no Bayesian spatial model has been proposed for cs-fMRI data, most analyses co...
Preprint
Cortical surface fMRI (cs-fMRI) has recently grown in popularity versus traditional volumetric fMRI, as it allows for more meaningful spatial smoothing and is more compatible with the common assumptions of isotropy and stationarity in Bayesian spatial models. However, as no Bayesian spatial model has been proposed for cs-fMRI data, most analyses co...
Article
Full-text available
The use of sparse precision (inverse covariance) matrices has become popular because they allow for efficient algorithms for joint inference in high-dimensional models. Many applications require the computation of certain elements of the covariance matrix, such as the marginal variances, which may be non-trivial to obtain when the dimension is larg...
Preprint
The use of sparse precision (inverse covariance) matrices has become popular because they allow for efficient algorithms for joint inference in high-dimensional models. Many applications require the computation of certain elements of the covariance matrix, such as the marginal variances, which may be non-trivial to obtain when the dimension is larg...
Preprint
The numerical approximation of solutions to stochastic partial differential equations with additive spatial white noise on bounded domains in $\mathbb{R}^d$ is considered. The differential operator is given by the fractional power $L^\beta$, $\beta\in(0,1)$, of an integer order elliptic differential operator $L$ and is therefore non-local. Its inve...
Article
A thresholded Gaussian random field model is developed for the microstructure of porous materials. Defining the random field as a solution to a stochastic partial differential equation allows for flexible modelling of non-stationarities in the material and facilitates computationally efficient methods for simulation and model fitting. A Markov Chai...
Preprint
Full-text available
A thresholded Gaussian random field model is developed for the microstructure of porous materials. Defining the random field as a solution to stochastic partial differential equation allows for flexible modelling of non-stationarities in the material and facilitates computationally efficient methods for simulation and model fitting. A Markov Chain...
Article
Full-text available
There is an interest to replace computed tomography (CT) images with magnetic resonance (MR) images for a number of diagnostic and therapeutic workflows. In this article, predicting CT images from a number of magnetic resonance imaging (MRI) sequences using regression approach is explored. Two principal areas of application for estimated CT images...
Preprint
There is an interest to replace computed tomography (CT) images with magnetic resonance (MR) images for a number of diagnostic and therapeutic workflows. In this article, predicting CT images from a number of magnetic resonance imaging (MRI) sequences using regression approach is explored. Two principal areas of application for estimated CT images...
Article
Full-text available
Spatial whole-brain Bayesian modeling of task-related functional magnetic resonance imaging (fMRI) is a serious computational challenge. Most of the currently proposed methods therefore do inference in subregions of the brain separately or do approximate inference without comparison to the true posterior distribution. A popular such method, which i...
Article
Full-text available
Computed tomography (CT) equivalent information is needed for attenuation correction in PET imaging and for dose planning in radiotherapy. Prior work has shown that Gaussian mixture models can be used to generate a substitute CT (s-CT) image from a specific set of MRI modalities. This work introduces a more flexible class of mixture models for s-CT...
Preprint
For many applications with multivariate data, random field models capturing departures from Gaussianity within realisations are appropriate. For this reason, we formulate a new class of multivariate non-Gaussian models based on systems of stochastic partial differential equations with additive type G noise whose marginal covariance functions are of...
Article
Full-text available
Dynamical downscaling of earth system models is intended to produce high-resolution climate information at regional to local scales. Current models, while adequate for describing temperature distributions at relatively small scales, struggle when it comes to describing precipitation distributions. In order to better match the distribution of observ...
Preprint
Spatial whole-brain Bayesian modeling of task-related functional magnetic resonance imaging (fMRI) is a great computational challenge. Most of the currently proposed methods therefore do inference in subregions of the brain separately or do approximate inference without comparison to the true posterior distribution. A popular such method, which is...
Article
Full-text available
Contour maps are widely used to display estimates of spatial fields. Instead of showing the estimated field, a contour map only shows a fixed number of contour lines for different levels. However, despite the ubiquitous use these maps, the uncertainty associated with them has been given a surprisingly small amount of attention. We derive measures o...
Article
Covariance tapering is a popular approach for reducing the computational cost of spatial prediction and parameter estimation for Gaussian process models. However, previous work has shown that tapering can have poor performance for example when the data is sampled at spatially irregular locations. In this work we introduce a computationally convenie...
Article
Full-text available
The pathloss exponent and the variance of the large-scale fading are two parameters that are of great importance when modeling and characterizing wireless propagation channels. The pathloss is typically modeled using a log-distance power law with a large-scale fading term that is log-normal. In practice, the received signal is affected by the dynam...

Network

Cited By