Matthias C.M. Troffaes’s research while affiliated with Durham University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (128)


Data-driven estimation of the amount of under frequency load shedding in small power systems
  • Article

January 2025

·

15 Reads

·

1 Citation

Engineering Applications of Artificial Intelligence

·

Matthias C.M. Troffaes

·

·

[...]

·


Figure 1: A biomarker influencing both the treatment decision and the treatment outcome, thereby acting as confounder. Solid arrows indicate causation, whilst the dashed arrow indicates correlation without causation.
Figure 3: Probabilistic graphical representation for causal inference with our Bayesian hierarchical model.
Figure 4: Diagram of the simulation setup for comparative study.
Figure 5: Comparison of different methods in estimating the causal effect for varying number of observations. The top (bottom) row represents case 1a (case 1b). The left(right) images show the average(median) causal effects obtained from 20 replications.
Figure 6: Comparison of different methods in identifying the confounders for varying number of observations. One the left (right) we present case 1a (1b). The red line presents RBCE; blue line presents SSCE; green line presents BSSCE; and purple line presents BSSL

+7

Robust Bayesian causal estimation for causal inference in medical diagnosis
  • Preprint
  • File available

November 2024

·

23 Reads

Causal effect estimation is a critical task in statistical learning that aims to find the causal effect on subjects by identifying causal links between a number of predictor (or, explanatory) variables and the outcome of a treatment. In a regressional framework, we assign a treatment and outcome model to estimate the average causal effect. Additionally, for high dimensional regression problems, variable selection methods are also used to find a subset of predictor variables that maximises the predictive performance of the underlying model for better estimation of the causal effect. In this paper, we propose a different approach. We focus on the variable selection aspects of high dimensional causal estimation problem. We suggest a cautious Bayesian group LASSO (least absolute shrinkage and selection operator) framework for variable selection using prior sensitivity analysis. We argue that in some cases, abstaining from selecting (or, rejecting) a predictor is beneficial and we should gather more information to obtain a more decisive result. We also show that for problems with very limited information, expert elicited variable selection can give us a more stable causal effect estimation as it avoids overfitting. Lastly, we carry a comparative study with synthetic dataset and show the applicability of our method in real-life situations.

Download



Figure 1. Bathtub curve Figure 2. Two state Markov chain
Figure 3. Simulated prior predictive data with an exponential likelihood function
Figure 4. Simulated prior predictive data with a Weibull likelihood function
Figure 7. Exponential and two parameter Weibull distribution parameter estimation using Maximum Likelihood Estimation
Figure 8. Parameter estimation for í µí¼ƒ in case of an exponential curve
Data-Driven Infrastructure Planning for Offshore Wind Farms

June 2024

·

24 Reads

Journal of Physics Conference Series

Offshore wind farms are one of the major renewable energy resources that can help the UK to reach its net zero target. Under the 10 point plan of the green revolution, the UK is set to quadruple its wind energy production by increasing its offshore wind capacity to 40GW by 2030 [1]. Research needs to be conducted to study the failure and repair processes of wind turbines under various conditions as the current models make a simplifying assumption that the failure/repair rate remains constant over time. This research aims to create a more accurate model using SCADA data. In this research, different mathematical models are fitted to the time to failure and time to repair data of wind turbine components using frequentist methods (such as Maximum Likelihood Estimation) and Bayesian methods. Further analysis will be conducted using complex system analysis considering the failures of each electrical and mechanical component of the wind turbine. The aim of this project is to perform a more accurate reliability analysis that can help to further drive down costs of wind energy by potentially reducing the downtimes of the wind turbines.




A Robust Bayesian Approach for Causal Inference Problems

November 2023

·

31 Reads

Lecture Notes in Computer Science

Causal inference concerns finding the treatment effect on subjects along with causal links between the variables and the outcome. However, the underlying heterogeneity between subjects makes the problem practically unsolvable. Additionally, we often need to find a subset of explanatory variables to understand the treatment effect. Currently, variable selection methods tend to maximise the predictive performance of the underlying model, and unfortunately, under limited data, the predictive performance is hard to assess, leading to harmful consequences. To address these issues, in this paper, we consider a robust Bayesian analysis which accounts for abstention in selecting explanatory variables in the high dimensional regression model. To achieve that, we consider a set of spike and slab priors through prior elicitation to obtain a set of posteriors for both the treatment and outcome model. We are specifically interested in the sensitivity of the treatment effect in high dimensional causal inference as well as identifying confounder variables. However, confounder selection can be deceptive in this setting, especially when a predictor is strongly associated with either the treatment or the outcome. To avoid that we apply a post-hoc selection scheme, attaining a smaller set of confounders as well as separate sets of variables which are only related to treatment or outcome model. Finally, we illustrate our method to show its applicability.



ROBUST BAYESIAN ANALYSIS OF CAUSAL INFERENCE PROBLEMS

July 2023

·

144 Reads

Causal inference using observational data is an important aspect in many fields such as epidemiology, social science, economics, etc. In particular, our goal is to find the treatment effect on the subjects along with the causal links between the variables and the outcome. However, estimation for such problems are extremely difficult as the treatment effects may vary from subject to subject and modelling the underlying heterogeneity explicitly makes the problem practically unsolvable. Another issue we often face is the dimensionality of the problem and we need to find a subset of explanatory variables to initiate the treatment. However, currently variable selection methods tend to maximise the predictive performance of the outcome model only. This can be problematic in the case of limited information. As the consequence of mistreatment can be harmful. So, in this paper, we suggest a general framework with robust Bayesian analysis which accounts for abstention in deciding an explanatory variable in the high dimensional regression model. To achieve that, we consider a set of spike and slab priors through prior elicitation to obtain robust estimates for both the treatment and outcome model. We are specifically interested in the sensitivity of the treatment effect in the high dimensional causal inference as well as the identifying the confounder variables by means of variable selection. However, indicator based confounder selection can be deceptive in some cases. Especially, when the predictor is strongly associated with either the treatment or the outcome. This increases the posterior expectation of the selection indicators. To avoid that we apply a post-hoc selection scheme which successfully remove negligible non-zero effects from the model attaining a smaller set of confounders. Finally, we illustrate our result using synthetic dataset.


Citations (45)


... In general, AUFLS implementation involves calculating power shortages and disconnecting certain loads 3 . Previous studies have explored various methods for assessing power deficits, including the rate of frequency change 4 , the second derivative of frequency, and the gradient method. Accurate power estimation is crucial for determining the precise amount of load to shed 3 . ...

Reference:

Adaptive non-parametric kernel density estimation for under-frequency load shedding with electric vehicles and renewable power uncertainty
Data-driven estimation of the amount of under frequency load shedding in small power systems
  • Citing Article
  • January 2025

Engineering Applications of Artificial Intelligence

... From an operational planning perspective, SOs must have robust tools and strategies to handle these frequency excursions, particularly in low-inertia systems. To address these issues, there have been various studies incorporating frequency stability constraints, such as-rate of change of frequency (RoCoF), frequency nadir, steady state frequency (QSS) within the unit commitment (UC) problem deduced from the swing equation, which is a vital tool used by SOs to determine the optimal set of generating units to meet demand while ensuring system security [2], [3], [5], [6]. When the security of the system is factored in, the UC problem becomes a Security-Constrained Unit Commitment (SCUC), which includes constraints related to operational requirements such as nodal power balance, transmission line thermal limits, and generation unit limits (e.g., capacity, ramp rates, and minimum up/down times). ...

Inclusion of frequency nadir constraint in the unit commitment problem of small power systems using machine learning
  • Citing Article
  • September 2023

Sustainable Energy Grids and Networks

... This motivates us to investigate the role and applicability of prior sensitivity analysis in high dimensional causal estimation problems. This is particularly beneficial, as in high dimensional problems, we have to rely on very limited observations to perform our Bayesian analysis and as a result variable selection with a single prior can be unreliable [18] in many cases, Moreover, in causal effect estimation, failing to correctly identify a relation between the treatment effect and predictor can lead to harmful side-effects. Therefore, it is extremely important to adopt a cautious approach in selecting or rejecting a variable. ...

A robust Bayesian analysis of variable selection under prior ignorance
  • Citing Article
  • June 2022

Sankhya A

... Now, instead of a set of functions as in the previous section, we are interested in a vector of integrals induced by having a set of target pdfs and a fixed scalar function. This setting corresponds to, e.g., robust Bayesian analysis, where one is interested in computing a lower bound on expectations of a specific function with respect to a family of posterior distributions Cruz et al. (2022). Let us denote withπ m (θ) for m = 1, . . . ...

Iterative importance sampling with Markov chain Monte Carlo sampling in robust Bayesian analysis
  • Citing Article
  • July 2022

Computational Statistics & Data Analysis

... Using a synthetic control group can have large impact on the supply of medical research and new drugs by decreasing research costs (Jahanshahi et al., 2021;Food and Drug Administration, 2023;Wong et al., 2014). However, using a synthetic control group may increase the estimate's mean-squared error due to lack of randomization (see Raices Cruz et al., 2022;Rhys Bernard et al., 2024), raising the question of whether a social planner should allow or incentivize their use. (Another relevant application is choosing between two experiments with different number of participants. ...

A robust Bayesian bias‐adjusted random effects model for consideration of uncertainty about bias terms in evidence synthesis

Statistics in Medicine

... Model-based uncertainty quantification using sampling is now more often used in geohazard assessments, e.g. Uzielli and Lacasse (2007), Wellmann and Regenauer-Lieb (2012), Rodríguez-Ochoa et al. (2015), Pakyuz-Charrier et al. (2018), Huang et al. (2021), Luo et al. (2021), andSun et al. (2021a). ...

A suggestion for the quantification of precise and bounded probability to quantify epistemic uncertainty in scientific assessments

... As a result, System Operators (SOs) are facing fresh difficulties in maintaining balance between generation and demand mismatch, primarily due to the intrinsic unpredictability of RES. A key challenge is the reduction of system inertia, which arises from the largescale incorporation of converter-interfaced RES that do not inherently provide inertia, alongside the displacement of conventional generation to meet environmental targets [1], [2]. Inertia, is inherently provided by large rotating machines in synchronous power plants directly coupled to the grid, and is crucial for maintaining frequency stability following contingency events. ...

Inclusion of frequency stability constraints in unit commitment using separable programming
  • Citing Article
  • February 2022

Electric Power Systems Research

... The theory suggests that the central nervous system needs to construct estimates of sensorimotor transformations, in the form of internal models, and represent the structure of uncertainty in the inputs, outputs, and transformations themselves to respond optimally to environmental stimuli. Bayesian decision theory has been used to model the behavior of the sensorimotor system and investigate the mechanisms used by the nervous system to solve estimation and decision problems [26] . The formula for Bayesian decision theory is: P(Ci|X) = P(Ci)P(X|Ci) / P(X), where: P(Ci) is the prior probability, P(X|Ci) is the likelihood probability, P(X) is the evidence. ...

Robust Decision Analysis under Severe Uncertainty and Ambiguous Tradeoffs: An Invasive Species Case Study

... Furthermore, although Guillaume et al. [6] showed that in general it is intractable to find best solutions, our results indicate that it can nonetheless be done when focal sets are of a particular kind. Finally, we may note that the particular optimization problems that we consider allow us to take advantage of specialized (SPP-related) algorithms, in contrast to [11] which also provides means to find best elements according to some criteria, such as strong dominance, but which cannot benefit from such specialized algorithms as it is framed in a more general setting. ...

Improving and benchmarking of algorithms for Γ-maximin, Γ-maximax and interval dominance
  • Citing Article
  • March 2021

International Journal of Approximate Reasoning

... Thus, posterior distributions represent our updated beliefs about regression coefficients after having observed the data. The range of a posterior distribution allows quantifying the uncertainty regarding whether the corresponding regression coefficient is probably zero or not [73,74]. Moreover, by simulating samples from these posterior distributions, we computed (posterior) medians of these distributions [75]. ...

Uncertainty Quantification in Lasso-Type Regularization Problems