Daniel Tabas’s research while affiliated with University of Washington and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (9)


Fig. 3: An illustrative example of the gauge map from B8 to a polyhedral C-set Q. The 1, 3 4 , 1 2 and 1 5 level curves of each set are plotted in blue. For each point in B8, it is transformed to its image (marked using the same color) in Q with the same level curve.
An Efficient Learning-Based Solver for Two-Stage DC Optimal Power Flow with Feasibility Guarantees
  • Preprint
  • File available

April 2023

·

61 Reads

Ling Zhang

·

Daniel Tabas

·

Baosen Zhang

In this paper, we consider the scenario-based two-stage stochastic DC optimal power flow (OPF) problem for optimal and reliable dispatch when the load is facing uncertainty. Although this problem is a linear program, it remains computationally challenging to solve due to the large number of scenarios needed to accurately represent the uncertainties. To mitigate the computational issues, many techniques have been proposed to approximate the second-stage decisions so they can dealt more efficiently. The challenge of finding good policies to approximate the second-stage decisions is that these solutions need to be feasible, which has been difficult to achieve with existing policies. To address these challenges, this paper proposes a learning method to solve the two-stage problem in a more efficient and optimal way. A technique called the gauge map is incorporated into the learning architecture design to guarantee the learned solutions' feasibility to the network constraints. Namely, we can design policies that are feed forward functions that only output feasible solutions. Simulation results on standard IEEE systems show that, compared to iterative solvers and the widely used affine policy, our proposed method not only learns solutions of good quality but also accelerates the computation by orders of magnitude.

Download


Figure 1: Building energy management with a voltage constraint at the point of common coupling.
Figure 2: Example of the occupation measure for various levels of γ.
Figure 3: Effective horizon length as a function of γ.
Figure 4: Example of VaR and CVaR at risk level β = 0.9.
Figure 6: Pr{C(x) ≥ 0.1 | x ∼ µ γ } measured throughout training. Key: SC = structured critic, MP = modified penalty (Prop. 6). Both modifications speed convergence to a safe policy. The shaded region represents ±1 standard deviation across 5 training runs.
Interpreting Primal-Dual Algorithms for Constrained MARL

November 2022

·

81 Reads

Constrained multiagent reinforcement learning (C-MARL) is gaining importance as MARL algorithms find new applications in real-world systems ranging from energy systems to drone swarms. Most C-MARL algorithms use a primal-dual approach to enforce constraints through a penalty function added to the reward. In this paper, we study the structural effects of the primal-dual approach on the constraints and value function. First, we show that using the constraint evaluation as the penalty leads to a weak notion of safety, but by making simple modifications to the penalty function, we can enforce meaningful probabilistic safety constraints. Second, we exploit the structural effects of primal-dual methods on value functions, leading to improved value estimates. Simulations in a simple constrained multiagent environment show that our reinterpretation of the primal-dual method in terms of probabilistic constraints is meaningful, and that our proposed value estimation procedure improves convergence to a safe joint policy.



Safe and Efficient Model Predictive Control Using Neural Networks: An Interior Point Approach

March 2022

·

113 Reads

Model predictive control (MPC) provides a useful means for controlling systems with constraints, but suffers from the computational burden of repeatedly solving an optimization problem in real time. Offline (explicit) solutions for MPC attempt to alleviate real time computational challenges using either multiparametric programming or machine learning. The multiparametric approaches are typically applied to linear or quadratic MPC problems, while learning-based approaches can be more flexible and are less memory-intensive. Existing learning-based approaches offer significant speedups, but the challenge becomes ensuring constraint satisfaction while maintaining good performance. In this paper, we provide a neural network parameterization of MPC policies that explicitly encodes the constraints of the problem. Therefore, the constraints are satisfied by design. By exploring the interior of the MPC feasible set in an unsupervised learning paradigm, the neural network finds better policies faster than projection-based methods and exhibits substantially faster solve times. We use the proposed policy to solve both a robust (tube-based) and a scenario-based MPC problem, and demonstrate the performance and computational gains on two standard test systems.


Fig. 3. In the policy network, the gauge map is used to map virtual actions to safe actions.
Computationally Efficient Safe Reinforcement Learning for Power Systems

October 2021

·

81 Reads

We propose a computationally efficient approach to safe reinforcement learning (RL) for frequency regulation in power systems with high levels of variable renewable energy resources. The approach draws on set-theoretic control techniques to craft a neural network-based control policy that is guaranteed to satisfy safety-critical state constraints, without needing to solve a model predictive control or projection problem in real time. By exploiting the properties of robust controlled-invariant polytopes, we construct a novel, closed-form "safety-filter" that enables end-to-end safe learning using any policy gradient-based RL algorithm. We then apply the safety filter in conjunction with the deep deterministic policy gradient (DDPG) algorithm to regulate frequency in a modified 9-bus power system, and show that the learned policy is more cost-effective than robust linear feedback control techniques while maintaining the same safety guarantee. We also show that the proposed paradigm outperforms DDPG augmented with constraint violation penalties.


Consensus-Based Set-Theoretic Control in Inverter-Dominated Power Systems

November 2020

·

19 Reads

We propose a novel consensus-based approach for set-theoretic frequency control in power systems. A robust controlled-invariant set (RCI) for the system is generated by composing RCIs for each bus in the network. The process of generating these sets uses a consensus-based approach in order to facilitate discovery of mutually compatible subsystem RCIs when each bus seeks to maximize the size of its own RCI while treating the effects of coupling as an unknown-but-bounded disturbance. The consensus routine, which demonstrates linear convergence, is embedded into a backwards reachability analysis of initial safe sets. Results for a 9-bus test case show that the controllers associated with the resulting RCIs maintain safe operation when the system is subjected to worst-case (adversarial) fluctuations in net demand.


Optimal L-Infinity Frequency Control in Microgrids Considering Actuator Saturation

October 2019

·

8 Reads

Inverter-connected resources can improve transient stability in low-inertia grids by injecting active power to minimize system frequency deviations following disturbances. In practice, most generation and load disturbances are step changes and the engineering figure-of-merit is often the peak overshoot in frequency resulting from these step disturbances. In addition, the inverter-connected resources tend to saturate much more easily than conventional synchronous machines. However, despite these challenges, standard controller designs must deal with averaged quantities through H2H_2 or HH_\infty norms and must account for saturation in ad hoc manners. In this paper, we address these challenges by explicitly considering LL_\infty control with saturation using a linear matrix inequality-based approach. We show that this approach leads to significant improvements in stability performance.


Wind Energy Prediction in Highly Complex Terrain by Computational Fluid Dynamics

April 2019

·

66 Reads

·

30 Citations

With rising levels of wind power penetration in global electricity production, the relevance of wind power prediction is growing. More accurate forecasts reduce the required total amount of energy reserve capacity needed to ensure grid reliability and the risk of penalty for wind farm operators. This study analyzes the Computational Fluid Dynamics (CFD) software WindSim regarding its ability to perform accurate wind power predictions in complex terrain. Simulations of the wind field and wind farm power output in the Swiss Jura Mountains at the location of the Juvent Wind Farm during winter were performed. The study site features the combined presence of three complexities: topography, heterogeneous vegetation including forest, and interactions between wind turbine wakes. Hence, it allows a comprehensive evaluation of the software. Various turbulence models, forest models, and wake models, as well as the effects of domain size and grid resolution were evaluated against wind and power observations from nine Vestas V90’s 2.0-MW turbines. The results show that, with a proper combination of modeling options, WindSim is able to predict the performance of the wind farm with sufficient accuracy.

Citations (3)


... Gauge functions These works, which are non-iterative, are based on gauge functions that are a generalization of norms [16]. Tabas and Zhang [17,18] propose gauge mapping, which maps a hypercube to a polytope, ensuring that the network output remains within the target polytope. One of the limiting assumptions of this method is that a feasible point of the target polytope is known. ...

Reference:

Enforcing Hard Linear Constraints in Deep Learning Models with Decision Rules
Safe and Efficient Model Predictive Control Using Neural Networks: An Interior Point Approach
  • Citing Conference Paper
  • December 2022

... To test whether the XAI technique shows similar patterns in other datasets, simulations are conducted on a real wind power dataset from JUVENT [30]. After data preprocessing, the time resolution is 1 h, and the look-ahead time is 24 h. ...

Wind Energy Prediction in Highly Complex Terrain by Computational Fluid Dynamics

... Current safeguards are applied in conjunction with reinforcement learning algorithms that rely on the policy gradient theorem to estimate reward landscapes [11][12][13][14][15][16]. With the advent of differentiable physics simulators [17][18][19][20][21], analytically computing the gradient of the reward with respect to the actions has become possible. ...

Computationally Efficient Safe Reinforcement Learning for Power Systems
  • Citing Conference Paper
  • June 2022