Sadegh Soudjani’s research while affiliated with Max Planck Institute for Software Systems and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (140)


Intent-Aware MPC for Aircraft Detect-and-Avoid with Response Delay: A Comparative Study with ACAS Xu
  • Preprint
  • File available

March 2025

·

24 Reads

·

·

·

Sadegh Soudjani

In this paper, we propose an intent-aware Model Predictive Control (MPC) approach for the remain-well-clear (RWC) functionality of a multi-agent aircraft detect-and-avoid (DAA) system and compare its performance with the standardized Airborne Collision Avoidance System Xu (ACAS Xu). The aircraft system is modeled as a linear system for horizontal maneuvering, with advisories on the rate of turn as the control input. Both deterministic and stochastic time delays are considered to account for the lag between control guidance issuance and the response of the aircraft. The capability of the MPC scheme in producing an optimal control profile over the entire horizon is used to mitigate the impact of the delay. We compare the proposed MPC method with ACAS Xu using various evaluation metrics, including loss of DAA well-clear percentage, near mid-air collision percentage, horizontal miss distance, and additional flight distance across different encounter scenarios. It is shown that the MPC scheme achieves better evaluation metrics than ACAS Xu for both deterministic and stochastic scenarios.

Download

Fig. 1. The controlled agent in blue, aims to reach the location indicated by a green circle while avoiding collisions with the other agent. The other agent in black is modeled as an uncontrollable agent with an uncertainty tube from the perspective of the controlled agent. The red color indicates collisions along the path of the controlled agent indicated by blue circles.
Fig. 2. Illustration of the different risk measures E[R], VaR 1−ε and CVaR 1−ε for random variable R with probability density function f R .
Fig. 3. Feasible domains for the original CCP (green), the CVaR-based approximation (blue), and the CoM-based approximation (red).
Fig. 4. Feasible domain for the DRP-CoM (in red) and DRP-CVaR (in blue) optimizations for varying probabilistic thresholds ε and fixed Wasserstein radius r = 1 (top), and for varying Wasserstein radii r and fixed probabilistic threshold ε = 0.6 (bottom). It can be observed that, depending on the problem parameters such as ε and r, either the CVaR or the CoM approach may be employed effectively.
Data-Driven Distributionally Robust Control for Interacting Agents under Logical Constraints

March 2025

·

40 Reads

·

·

·

[...]

·

Sadegh Soudjani

In this paper, we propose a distributionally robust control synthesis for an agent with stochastic dynamics that interacts with other agents under uncertainties and constraints expressed by signal temporal logic (STL). We formulate the control synthesis as a chance-constrained program (CCP) with STL specifications that must be satisfied with high probability under all uncertainty tubes induced by the other agents. To tackle the CCP, we propose two methods based on concentration of measure (CoM) theory and conditional value at risk (CVaR) and compare the required assumptions and resulting optimizations. These approaches convert the CCP into an expectation-constrained program (ECP), which is simpler to solve than the original CCP. To estimate the expectation using a finite set of observed data, we adopt a distributionally robust optimization (DRO) approach. The underlying DRO can be approximated as a robust data-driven optimization that provides a probabilistic under-approximation to the original ECP, where the probability depends on the number of samples. Therefore, under feasibility, the original STL constraints are satisfied with two layers of designed confidence: the confidence of the chance constraint and the confidence of the approximated data-driven optimization, which depends on the number of samples. We then provide details on solving the resulting robust data-driven optimization numerically. Finally, we compare the two proposed approaches through case studies.


Fig. 1. A visual representation of the convergence of u i 9 , u i 17 , u i 21
Fig. 2. A visual representation of the convergence of res(z) and res(˜ z) during the execution of Algorithm 1.
Fig. 3. Aggregative demand and profile of power exchange of all the households with the grid.
Stochastic Generalized Dynamic Games with Coupled Chance Constraints

January 2025

·

16 Reads

Designing multi-agent systems with safety constraints and uncertain dynamics is a challenging problem. This paper studies a stochastic dynamic non-cooperative game with coupling safety chance constraints. The uncertainty is assumed to satisfy a concentration of measure property. Firstly, due to the non-convexity of chance constraints, a convex under-approximation of chance constraints is given using constraints on the expectation. Then, the conditions for the existence of the stochastic generalized Nash equilibrium (SGNE) of the under-approximated game are investigated, and the relation between the ε\varepsilon-SGNE of the original game and the under-approximated one is derived. A sampling-based algorithm is proposed for the SGNE seeking of the under-approximated game that does not require knowing the distribution of the uncertainty nor the analytical computation of expectations. Finally, under some assumptions on the game's pseudo-gradient mapping, the almost sure convergence of the algorithm to SGNE is proven. A numerical study is carried out on demand-side management in microgrids with shared battery to demonstrate the applicability of the proposed scheme.


Data-Driven Yet Formal Policy Synthesis for Stochastic Nonlinear Dynamical Systems

January 2025

·

11 Reads

The automated synthesis of control policies for stochastic dynamical systems presents significant challenges. A standard approach is to construct a finite-state abstraction of the continuous system, typically represented as a Markov decision process (MDP). However, generating abstractions is challenging when (1) the system's dynamics are nonlinear, and/or (2) we do not have complete knowledge of the dynamics. In this work, we introduce a novel data-driven abstraction technique for nonlinear dynamical systems with additive stochastic noise that addresses both of these issues. As a key step, we use samples of the dynamics to learn the enabled actions and transition probabilities of the abstraction. We represent abstractions as MDPs with intervals of transition probabilities, known as interval MDPs (IMDPs). These abstractions enable the synthesis of control policies for the concrete nonlinear system, with probably approximately correct (PAC) guarantees on the probability of satisfying a specified control objective. Through numerical experiments, we illustrate the effectiveness and robustness of our approach in achieving reliable control under uncertainty.





Figure 2: Variations of the empirical normalized regret (R(K)/K) when our proposed algorithm is implemented for the gridworld example with l = 6.
Figure 3: Comparison of the empirical normalized regret between our proposed regret-free algorithm and the ω-PAC algorithm (Perez et al., 2023) for the gridworld example with l = 4
Figure 4: Variations of the computed deadline (H k ) for the gridworld example with l = 6.
Figure 5: Comparison of the theoretical sample complexities for our proposed algorithm and the ω-PAC algorithm (Perez et al., 2023), when applied to the gridworld example with various sizes (4 ≤ l ≤ 16).
Regret-Free Reinforcement Learning for LTL Specifications

November 2024

·

7 Reads

Reinforcement learning (RL) is a promising method to learn optimal control policies for systems with unknown dynamics. In particular, synthesizing controllers for safety-critical systems based on high-level specifications, such as those expressed in temporal languages like linear temporal logic (LTL), presents a significant challenge in control systems research. Current RL-based methods designed for LTL tasks typically offer only asymptotic guarantees, which provide no insight into the transient performance during the learning phase. While running an RL algorithm, it is crucial to assess how close we are to achieving optimal behavior if we stop learning. In this paper, we present the first regret-free online algorithm for learning a controller that addresses the general class of LTL specifications over Markov decision processes (MDPs) with a finite set of states and actions. We begin by proposing a regret-free learning algorithm to solve infinite-horizon reach-avoid problems. For general LTL specifications, we show that the synthesis problem can be reduced to a reach-avoid problem when the graph structure is known. Additionally, we provide an algorithm for learning the graph structure, assuming knowledge of a minimum transition probability, which operates independently of the main regret-free algorithm.


Fig. 2: A motivating example of the workspace consisting of an autonomous car, traffic signals, a target destination, and obstacles.
Logic-based Knowledge Awareness for Autonomous Agents in Continuous Spaces

November 2024

·

15 Reads

This paper presents a step towards a formal controller design method for autonomous agents based on knowledge awareness to improve decision-making. Our approach is to first create an organized repository of information (a knowledge base) for autonomous agents which can be accessed and then translated into temporal specifications. Secondly, to develop a controller with formal guarantees that meets a combination of mission-specific objective and the specification from the knowledge base, we utilize an abstraction-based controller design (ABCD) approach, capable of managing both nonlinear dynamics and temporal requirements. Unlike the conventional offline ABCD approach, our method dynamically updates the controller whenever the knowledge base prompts changes in the specifications. A three-dimensional nonlinear car model navigating an urban road scenario with traffic signs and obstacles is considered for validation. Results show the effectiveness of the method in guiding the autonomous agents to the target while complying with the knowledge base and the mission-specific objective.



Citations (45)


... The current paper extends the preliminary results presented in the conference paper [39] in the following directions: (a) we address controlled agents in a dynamically changing environment while [39] is limited to static environments; (b) we propose to use CVaR as an alternative approach for tackling CCPs; (c) we make the computations robust to uncertainty tubes in the behavior of uncontrollable agents and provide a comparison between the results based on CoM and CVaR through detailed case studies; and (d) we provide the proofs of statements with additional numerical examples to illustrate the proposed techniques. ...

Reference:

Data-Driven Distributionally Robust Control for Interacting Agents under Logical Constraints
Distributionally Robust Control for Chance-Constrained Signal Temporal Logic Specifications
  • Citing Conference Paper
  • December 2024

... The possibility of artificial consciousness (roughly, subjective awareness in a human-designed artificial system) has been assumed within certain theoretical frameworks [1]. However, whether artificial consciousness is indeed theoretically possible, much less empirically feasible, is not self-evident, and neither proposition should be taken for granted. ...

Awareness in Robotics: An Early Perspective from the Viewpoint of the EIC Pathfinder Challenge “Awareness Inside”

... Prior work has exploited the connection between infinite-horizon specifications and Lévy's 0-1 Law (cf. [ [75,Theorem 3.2]). Under the assumption of a countable state space and bounded discrete probabilistic choices, recent work has introduced a sound and complete supermartingale proof rule for almost-sure termination [74,Lemma 3.4], that is applicable to programs that are almost-surely terminating but not with finite expected time [44]. ...

Necessary and Sufficient Certificates for Almost Sure Reachability
  • Citing Article
  • January 2024

IEEE Control Systems Letters

... Results obtained on this model hence rely on quantifying the probabilistic deviation to the true system to transfer guarantees (Abate et al., 2008;Haesaert et al., 2018). For example, probabilistic coupling relations can be used to transfer guarantees obtained on a parameterized surrogate model of the unknown system back to the original latent system by quantifying the expected parametric uncertainty (Schön et al., 2023). Ver-ification is conducted using a robust version of dynamic programming (Haesaert and Soudjani, 2020). ...

Bayesian Formal Synthesis of Unknown Systems via Robust Simulation Relations
  • Citing Article
  • January 2024

IEEE Transactions on Automatic Control

... Prediction intervals that are valid under distribution shift are designed in [20] using conformal prediction. In [21], barrier certificates are constructed with guarantees under distribution shift. Robust prediction under distribution shift is proposed in autonomous driving [22] and epistemic uncertainty-aware planning is considered in [23]. ...

Data-Driven Distributionally Robust Safety Verification Using Barrier Certificates and Conditional Mean Embeddings

... Inspired by the successful use of reinforcement learning in state preparation, researchers have also explored GAs for optimizing quantum circuits. GAs excel at navigating large search spaces, making them promising candidates for quantum state preparation tasks [6,20,29]. Miranda et al. [15] notably demonstrated the effectiveness of an island-model GA, in which the population splits into partially isolated subpopulations. This division helps avoid premature convergence by maintaining diversity across subgroups. ...

T-Count Optimizing Genetic Algorithm for Quantum State Preparation
  • Citing Conference Paper
  • July 2024

... We are only interested in quantum circuits without measurement. However, for our purposes, we only need to be concerned that a basis state, | ⟩, of a quantum state, | ⟩, is measured with a probability based on its amplitude: (| ⟩ measured from | ⟩) = ( ) 2 . See Nielsen and Chuang's volume [36] for details on measurement. ...

Safe Reach Set Computation via Neural Barrier Certificates
  • Citing Article
  • January 2024

IFAC-PapersOnLine

... In a nutshell, a CBC is analogous to a Lyapunov function defined over the state space of a dynamical system, establishing a framework of inequality constraints that apply to both its value and its evolution over time, governed by the system's dynamics. Therefore, if an appropriate level set of the CBC can delineate an unsafe region from all possible system trajectories originating from a specified initial set, the existence of such a CBC provides a formal (probabilistic) safety guarantee for the system (see e.g., [6][7][8][9][10][11][12][13][14]). However, these works fundamentally rely on the assumption that an accurate mathematical model of the system dynamics is available. ...

Multiplicative Barrier Certificates for Probabilistic Safety of Markov Jump Systems
  • Citing Article
  • January 2024

IFAC-PapersOnLine

... Fuelled by increasing data availability and advances in machine learning, data-driven abstractions have emerged as an alternative to conventional model-based abstractions (Makdesi et al., 2021;Coppola et al., 2023;Kazemi et al., 2022;Hashimoto et al., 2022;Devonport et al., 2021;Banse et al., 2023;Schön et al., 2024). These techniques generate abstractions by sampling system trajectories, often obtained from (black-box) simulation models. ...

Data-Driven Abstractions via Binary-Tree Gaussian Processes for Formal Verification

IFAC-PapersOnLine

... This assumption allows us to establish the Lipschitz continuity of robustness functions, formulate a well-defined chance constraint for the robustness function, and utilize the CoM property. Note that, unlike restrictive assumptions in the literature, such as the linearity of predicate functions [45], which lead to sets composed of multiple hyperplanes, the assumption above does not impose such restrictions on the shape of the sets. For example, the predicate function α(z) = 1 − ∥z∥, which describes the interior of a circle centered at the origin with a radius of one, is Lipschitz continuous with a Lipschitz constant of one. ...

Control Barrier Functions for Stochastic Systems under Signal Temporal Logic Tasks