ArticlePDF Available

Abstract and Figures

Model-based systems engineering approaches are commonly used to develop safety-critical mechatronic systems. Recently, a new SysML-based method for the dependability analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of three main steps: (i) creation of a structural SysML model using building blocks from the underlying UAV dependability profile that extends the model with block-level reliability and time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph Error Propagation Model (DEPM) that captures relevant structural and behavioral properties of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov chain models and state-of-the-art probabilistic model checking techniques. This paper describes the practitioner experiences and lessons learned after the application of the aforementioned method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The case study revealed two particular limitations that have been overcome by the optimization of the method against the state-space explosion of underlying Markov chain models and the introduction of a new computation algorithm for DEPMs with realistic extremely low fault activation probabilities.
IFAC PapersOnLine 52-12 (2019) 394–399
ScienceDirect
Available online at www.sciencedirect.com
2405-8963 Copyright © 2019. The Authors. Published by Elsevier Ltd. All rights reserved.
Peer review under responsibility of International Federation of Automatic Control.
10.1016/j.ifacol.2019.11.275
10.1016/j.ifacol.2019.11.275 2405-8963
Model-based Dependability Analysis of
Fault-tolerant Inertial Navigation System:
A Practical Experience Report
Mikael Steurer Andrey Morozov Klaus Janschek
Klaus-Peter Neitzke ∗∗
Institute of Automation, Technische Universit¨at Dresden, Dresden,
Germany, (e-mail: [mikael.steurer, andrey.morozov,
klaus.janschek]@tu-dresden.de).
∗∗ Institute for Informatics, Automation and Electronics, Hochschule
Nordhausen, Nordhausen, Germany, (e-mail:
klaus-peter.neitzke@hs-nordhausen.de
Abstract: Model-based systems engineering approaches are commonly used to develop safety-
critical mechatronic systems. Recently, a new SysML-based method for the dependability
analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of
three main steps: (i) creation of a structural SysML model using building blocks from the
underlying UAV dependability profile that extends the model with block-level reliability and
time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph
Error Propagation Model (DEPM) that captures relevant structural and behavioral properties
of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov
chain models and state-of-the-art probabilistic model checking techniques. This paper describes
the practitioner experiences and lessons learned after the application of the aforementioned
method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The
case study revealed two particular limitations that have been overcome by the optimization
of the method against the state-space explosion of underlying Markov chain models and the
introduction of a new computation algorithm for DEPMs with realistic extremely low fault
activation probabilities.
Keywords: system analysis and design, systems architecture, reliability engineering,
dependability, SysML, Markov chains, microsensors, space technology
1. INTRODUCTION
Safety is defined as the absence of catastrophic conse-
quences on the user(s) and the environment Aviˇzienis
et al. (2004). The measurement of angular velocity and
acceleration is an important task of every Unmanned
Aerial Vehicle (UAV). Usually, an Inertial Measurement
Unit (IMU) fulfills this task. A failure in the measurement
results in a high risk to human and environment. Also,
the integrated IMU data, merged with the data from other
sensors, e.g. magnetoresistive sensors, help to navigate the
UAV in case of degraded Global Navigation Satellite Sys-
tem (GNSS) or during indoor missions. The IMUFUSION
case study system, presented in this paper, was developed
in order to satisfy the following objectives: (1) record
inertial measurement data in a near space vehicle with
a specially designed, robust, and miniaturized system, (2)
estimate the flight trajectory including orientation by the
recorded inertial measurement data, and (3) integrate a
redundancy concept for higher reliability, diagnosis capa-
bility, and accuracy. A prototypical IMU with redundant
sensors, microcontrollers, and memory modules has been
developed and tested in the gondola of the stratospheric
research balloon of the Balloon EXperiments for Uni-
versity Students (BEXUS) program BEXUS (2018) and
REXUS/BEXUS (2018). The IMUFUSION project was
one of seven BEXUS experiments carried in two BEXUS
balloons which are lifted to a payload depending altitude
of 30 km with a flight duration of about five hours. The
main hardware components and the wiring concept are
shown in Fig. 1. Two Micro Controller Units (MCUs)
with redundant memory modules, 3-axis gyroscopes, 3-axis
acceleration sensors, and 3-axis magnetic field sensors are
the central parts of the IMUFUSION system. These com-
ponents are distributed on three Printed Circuit Boards
(PCBs): Master PCBA-1,Slave PCBA-2, and PCBA-4.
PCBA-3 is responsible for the power management. During
the experiment, the system recorded inertial data in a
near space vehicle with a specially designed robust and
miniaturized system and computed the time-dependent
attitude and position of the gondola.
2. STATE OF THE ART
The classical Fault Tree Analysis (FTA) Ruijters and
Stoelinga (2015) and the Reliability Block Diagram (RBD)
Kim (2011) are the most common quantitative system-
level reliability evaluation techniques. The FTA is a top-
down approach in which an undesired state of a system
21st IFAC Symposium on Automatic Control in Aerospace
August 27-30, 2019. Cranfield, UK
Copyright © 2019 IFAC 394
Model-based Dependability Analysis of
Fault-tolerant Inertial Navigation System:
A Practical Experience Report
Mikael Steurer Andrey Morozov Klaus Janschek
Klaus-Peter Neitzke ∗∗
Institute of Automation, Technische Universit¨at Dresden, Dresden,
Germany, (e-mail: [mikael.steurer, andrey.morozov,
klaus.janschek]@tu-dresden.de).
∗∗ Institute for Informatics, Automation and Electronics, Hochschule
Nordhausen, Nordhausen, Germany, (e-mail:
klaus-peter.neitzke@hs-nordhausen.de
Abstract: Model-based systems engineering approaches are commonly used to develop safety-
critical mechatronic systems. Recently, a new SysML-based method for the dependability
analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of
three main steps: (i) creation of a structural SysML model using building blocks from the
underlying UAV dependability profile that extends the model with block-level reliability and
time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph
Error Propagation Model (DEPM) that captures relevant structural and behavioral properties
of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov
chain models and state-of-the-art probabilistic model checking techniques. This paper describes
the practitioner experiences and lessons learned after the application of the aforementioned
method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The
case study revealed two particular limitations that have been overcome by the optimization
of the method against the state-space explosion of underlying Markov chain models and the
introduction of a new computation algorithm for DEPMs with realistic extremely low fault
activation probabilities.
Keywords: system analysis and design, systems architecture, reliability engineering,
dependability, SysML, Markov chains, microsensors, space technology
1. INTRODUCTION
Safety is defined as the absence of catastrophic conse-
quences on the user(s) and the environment Aviˇzienis
et al. (2004). The measurement of angular velocity and
acceleration is an important task of every Unmanned
Aerial Vehicle (UAV). Usually, an Inertial Measurement
Unit (IMU) fulfills this task. A failure in the measurement
results in a high risk to human and environment. Also,
the integrated IMU data, merged with the data from other
sensors, e.g. magnetoresistive sensors, help to navigate the
UAV in case of degraded Global Navigation Satellite Sys-
tem (GNSS) or during indoor missions. The IMUFUSION
case study system, presented in this paper, was developed
in order to satisfy the following objectives: (1) record
inertial measurement data in a near space vehicle with
a specially designed, robust, and miniaturized system, (2)
estimate the flight trajectory including orientation by the
recorded inertial measurement data, and (3) integrate a
redundancy concept for higher reliability, diagnosis capa-
bility, and accuracy. A prototypical IMU with redundant
sensors, microcontrollers, and memory modules has been
developed and tested in the gondola of the stratospheric
research balloon of the Balloon EXperiments for Uni-
versity Students (BEXUS) program BEXUS (2018) and
REXUS/BEXUS (2018). The IMUFUSION project was
one of seven BEXUS experiments carried in two BEXUS
balloons which are lifted to a payload depending altitude
of 30 km with a flight duration of about five hours. The
main hardware components and the wiring concept are
shown in Fig. 1. Two Micro Controller Units (MCUs)
with redundant memory modules, 3-axis gyroscopes, 3-axis
acceleration sensors, and 3-axis magnetic field sensors are
the central parts of the IMUFUSION system. These com-
ponents are distributed on three Printed Circuit Boards
(PCBs): Master PCBA-1,Slave PCBA-2, and PCBA-4.
PCBA-3 is responsible for the power management. During
the experiment, the system recorded inertial data in a
near space vehicle with a specially designed robust and
miniaturized system and computed the time-dependent
attitude and position of the gondola.
2. STATE OF THE ART
The classical Fault Tree Analysis (FTA) Ruijters and
Stoelinga (2015) and the Reliability Block Diagram (RBD)
Kim (2011) are the most common quantitative system-
level reliability evaluation techniques. The FTA is a top-
down approach in which an undesired state of a system
21st IFAC Symposium on Automatic Control in Aerospace
August 27-30, 2019. Cranfield, UK
Copyright © 2019 IFAC 394
Model-based Dependability Analysis of
Fault-tolerant Inertial Navigation System:
A Practical Experience Report
Mikael Steurer Andrey Morozov Klaus Janschek
Klaus-Peter Neitzke ∗∗
Institute of Automation, Technische Universit¨at Dresden, Dresden,
Germany, (e-mail: [mikael.steurer, andrey.morozov,
klaus.janschek]@tu-dresden.de).
∗∗ Institute for Informatics, Automation and Electronics, Hochschule
Nordhausen, Nordhausen, Germany, (e-mail:
klaus-peter.neitzke@hs-nordhausen.de
Abstract: Model-based systems engineering approaches are commonly used to develop safety-
critical mechatronic systems. Recently, a new SysML-based method for the dependability
analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of
three main steps: (i) creation of a structural SysML model using building blocks from the
underlying UAV dependability profile that extends the model with block-level reliability and
time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph
Error Propagation Model (DEPM) that captures relevant structural and behavioral properties
of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov
chain models and state-of-the-art probabilistic model checking techniques. This paper describes
the practitioner experiences and lessons learned after the application of the aforementioned
method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The
case study revealed two particular limitations that have been overcome by the optimization
of the method against the state-space explosion of underlying Markov chain models and the
introduction of a new computation algorithm for DEPMs with realistic extremely low fault
activation probabilities.
Keywords: system analysis and design, systems architecture, reliability engineering,
dependability, SysML, Markov chains, microsensors, space technology
1. INTRODUCTION
Safety is defined as the absence of catastrophic conse-
quences on the user(s) and the environment Aviˇzienis
et al. (2004). The measurement of angular velocity and
acceleration is an important task of every Unmanned
Aerial Vehicle (UAV). Usually, an Inertial Measurement
Unit (IMU) fulfills this task. A failure in the measurement
results in a high risk to human and environment. Also,
the integrated IMU data, merged with the data from other
sensors, e.g. magnetoresistive sensors, help to navigate the
UAV in case of degraded Global Navigation Satellite Sys-
tem (GNSS) or during indoor missions. The IMUFUSION
case study system, presented in this paper, was developed
in order to satisfy the following objectives: (1) record
inertial measurement data in a near space vehicle with
a specially designed, robust, and miniaturized system, (2)
estimate the flight trajectory including orientation by the
recorded inertial measurement data, and (3) integrate a
redundancy concept for higher reliability, diagnosis capa-
bility, and accuracy. A prototypical IMU with redundant
sensors, microcontrollers, and memory modules has been
developed and tested in the gondola of the stratospheric
research balloon of the Balloon EXperiments for Uni-
versity Students (BEXUS) program BEXUS (2018) and
REXUS/BEXUS (2018). The IMUFUSION project was
one of seven BEXUS experiments carried in two BEXUS
balloons which are lifted to a payload depending altitude
of 30 km with a flight duration of about five hours. The
main hardware components and the wiring concept are
shown in Fig. 1. Two Micro Controller Units (MCUs)
with redundant memory modules, 3-axis gyroscopes, 3-axis
acceleration sensors, and 3-axis magnetic field sensors are
the central parts of the IMUFUSION system. These com-
ponents are distributed on three Printed Circuit Boards
(PCBs): Master PCBA-1,Slave PCBA-2, and PCBA-4.
PCBA-3 is responsible for the power management. During
the experiment, the system recorded inertial data in a
near space vehicle with a specially designed robust and
miniaturized system and computed the time-dependent
attitude and position of the gondola.
2. STATE OF THE ART
The classical Fault Tree Analysis (FTA) Ruijters and
Stoelinga (2015) and the Reliability Block Diagram (RBD)
Kim (2011) are the most common quantitative system-
level reliability evaluation techniques. The FTA is a top-
down approach in which an undesired state of a system
21st IFAC Symposium on Automatic Control in Aerospace
August 27-30, 2019. Cranfield, UK
Copyright © 2019 IFAC 394
Model-based Dependability Analysis of
Fault-tolerant Inertial Navigation System:
A Practical Experience Report
Mikael Steurer Andrey Morozov Klaus Janschek
Klaus-Peter Neitzke ∗∗
Institute of Automation, Technische Universit¨at Dresden, Dresden,
Germany, (e-mail: [mikael.steurer, andrey.morozov,
klaus.janschek]@tu-dresden.de).
∗∗ Institute for Informatics, Automation and Electronics, Hochschule
Nordhausen, Nordhausen, Germany, (e-mail:
klaus-peter.neitzke@hs-nordhausen.de
Abstract: Model-based systems engineering approaches are commonly used to develop safety-
critical mechatronic systems. Recently, a new SysML-based method for the dependability
analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of
three main steps: (i) creation of a structural SysML model using building blocks from the
underlying UAV dependability profile that extends the model with block-level reliability and
time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph
Error Propagation Model (DEPM) that captures relevant structural and behavioral properties
of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov
chain models and state-of-the-art probabilistic model checking techniques. This paper describes
the practitioner experiences and lessons learned after the application of the aforementioned
method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The
case study revealed two particular limitations that have been overcome by the optimization
of the method against the state-space explosion of underlying Markov chain models and the
introduction of a new computation algorithm for DEPMs with realistic extremely low fault
activation probabilities.
Keywords: system analysis and design, systems architecture, reliability engineering,
dependability, SysML, Markov chains, microsensors, space technology
1. INTRODUCTION
Safety is defined as the absence of catastrophic conse-
quences on the user(s) and the environment Aviˇzienis
et al. (2004). The measurement of angular velocity and
acceleration is an important task of every Unmanned
Aerial Vehicle (UAV). Usually, an Inertial Measurement
Unit (IMU) fulfills this task. A failure in the measurement
results in a high risk to human and environment. Also,
the integrated IMU data, merged with the data from other
sensors, e.g. magnetoresistive sensors, help to navigate the
UAV in case of degraded Global Navigation Satellite Sys-
tem (GNSS) or during indoor missions. The IMUFUSION
case study system, presented in this paper, was developed
in order to satisfy the following objectives: (1) record
inertial measurement data in a near space vehicle with
a specially designed, robust, and miniaturized system, (2)
estimate the flight trajectory including orientation by the
recorded inertial measurement data, and (3) integrate a
redundancy concept for higher reliability, diagnosis capa-
bility, and accuracy. A prototypical IMU with redundant
sensors, microcontrollers, and memory modules has been
developed and tested in the gondola of the stratospheric
research balloon of the Balloon EXperiments for Uni-
versity Students (BEXUS) program BEXUS (2018) and
REXUS/BEXUS (2018). The IMUFUSION project was
one of seven BEXUS experiments carried in two BEXUS
balloons which are lifted to a payload depending altitude
of 30 km with a flight duration of about five hours. The
main hardware components and the wiring concept are
shown in Fig. 1. Two Micro Controller Units (MCUs)
with redundant memory modules, 3-axis gyroscopes, 3-axis
acceleration sensors, and 3-axis magnetic field sensors are
the central parts of the IMUFUSION system. These com-
ponents are distributed on three Printed Circuit Boards
(PCBs): Master PCBA-1,Slave PCBA-2, and PCBA-4.
PCBA-3 is responsible for the power management. During
the experiment, the system recorded inertial data in a
near space vehicle with a specially designed robust and
miniaturized system and computed the time-dependent
attitude and position of the gondola.
2. STATE OF THE ART
The classical Fault Tree Analysis (FTA) Ruijters and
Stoelinga (2015) and the Reliability Block Diagram (RBD)
Kim (2011) are the most common quantitative system-
level reliability evaluation techniques. The FTA is a top-
down approach in which an undesired state of a system
21st IFAC Symposium on Automatic Control in Aerospace
August 27-30, 2019. Cranfield, UK
Copyright © 2019 IFAC 394
Model-based Dependability Analysis of
Fault-tolerant Inertial Navigation System:
A Practical Experience Report
Mikael Steurer Andrey Morozov Klaus Janschek
Klaus-Peter Neitzke ∗∗
Institute of Automation, Technische Universit¨at Dresden, Dresden,
Germany, (e-mail: [mikael.steurer, andrey.morozov,
klaus.janschek]@tu-dresden.de).
∗∗ Institute for Informatics, Automation and Electronics, Hochschule
Nordhausen, Nordhausen, Germany, (e-mail:
klaus-peter.neitzke@hs-nordhausen.de
Abstract: Model-based systems engineering approaches are commonly used to develop safety-
critical mechatronic systems. Recently, a new SysML-based method for the dependability
analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of
three main steps: (i) creation of a structural SysML model using building blocks from the
underlying UAV dependability profile that extends the model with block-level reliability and
time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph
Error Propagation Model (DEPM) that captures relevant structural and behavioral properties
of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov
chain models and state-of-the-art probabilistic model checking techniques. This paper describes
the practitioner experiences and lessons learned after the application of the aforementioned
method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The
case study revealed two particular limitations that have been overcome by the optimization
of the method against the state-space explosion of underlying Markov chain models and the
introduction of a new computation algorithm for DEPMs with realistic extremely low fault
activation probabilities.
Keywords: system analysis and design, systems architecture, reliability engineering,
dependability, SysML, Markov chains, microsensors, space technology
1. INTRODUCTION
Safety is defined as the absence of catastrophic conse-
quences on the user(s) and the environment Aviˇzienis
et al. (2004). The measurement of angular velocity and
acceleration is an important task of every Unmanned
Aerial Vehicle (UAV). Usually, an Inertial Measurement
Unit (IMU) fulfills this task. A failure in the measurement
results in a high risk to human and environment. Also,
the integrated IMU data, merged with the data from other
sensors, e.g. magnetoresistive sensors, help to navigate the
UAV in case of degraded Global Navigation Satellite Sys-
tem (GNSS) or during indoor missions. The IMUFUSION
case study system, presented in this paper, was developed
in order to satisfy the following objectives: (1) record
inertial measurement data in a near space vehicle with
a specially designed, robust, and miniaturized system, (2)
estimate the flight trajectory including orientation by the
recorded inertial measurement data, and (3) integrate a
redundancy concept for higher reliability, diagnosis capa-
bility, and accuracy. A prototypical IMU with redundant
sensors, microcontrollers, and memory modules has been
developed and tested in the gondola of the stratospheric
research balloon of the Balloon EXperiments for Uni-
versity Students (BEXUS) program BEXUS (2018) and
REXUS/BEXUS (2018). The IMUFUSION project was
one of seven BEXUS experiments carried in two BEXUS
balloons which are lifted to a payload depending altitude
of 30 km with a flight duration of about five hours. The
main hardware components and the wiring concept are
shown in Fig. 1. Two Micro Controller Units (MCUs)
with redundant memory modules, 3-axis gyroscopes, 3-axis
acceleration sensors, and 3-axis magnetic field sensors are
the central parts of the IMUFUSION system. These com-
ponents are distributed on three Printed Circuit Boards
(PCBs): Master PCBA-1,Slave PCBA-2, and PCBA-4.
PCBA-3 is responsible for the power management. During
the experiment, the system recorded inertial data in a
near space vehicle with a specially designed robust and
miniaturized system and computed the time-dependent
attitude and position of the gondola.
2. STATE OF THE ART
The classical Fault Tree Analysis (FTA) Ruijters and
Stoelinga (2015) and the Reliability Block Diagram (RBD)
Kim (2011) are the most common quantitative system-
level reliability evaluation techniques. The FTA is a top-
down approach in which an undesired state of a system
21st IFAC Symposium on Automatic Control in Aerospace
August 27-30, 2019. Cranfield, UK
Copyright © 2019 IFAC 394
Copyright © 2019. The Authors. Published by Elsevier Ltd. All rights reserved.
Mikael Steurer et al. / IFAC PapersOnLine 52-12 (2019) 394–399 395
Model-based Dependability Analysis of
Fault-tolerant Inertial Navigation System:
A Practical Experience Report
Mikael Steurer Andrey Morozov Klaus Janschek
Klaus-Peter Neitzke ∗∗
Institute of Automation, Technische Universit¨at Dresden, Dresden,
Germany, (e-mail: [mikael.steurer, andrey.morozov,
klaus.janschek]@tu-dresden.de).
∗∗ Institute for Informatics, Automation and Electronics, Hochschule
Nordhausen, Nordhausen, Germany, (e-mail:
klaus-peter.neitzke@hs-nordhausen.de
Abstract: Model-based systems engineering approaches are commonly used to develop safety-
critical mechatronic systems. Recently, a new SysML-based method for the dependability
analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of
three main steps: (i) creation of a structural SysML model using building blocks from the
underlying UAV dependability profile that extends the model with block-level reliability and
time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph
Error Propagation Model (DEPM) that captures relevant structural and behavioral properties
of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov
chain models and state-of-the-art probabilistic model checking techniques. This paper describes
the practitioner experiences and lessons learned after the application of the aforementioned
method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The
case study revealed two particular limitations that have been overcome by the optimization
of the method against the state-space explosion of underlying Markov chain models and the
introduction of a new computation algorithm for DEPMs with realistic extremely low fault
activation probabilities.
Keywords: system analysis and design, systems architecture, reliability engineering,
dependability, SysML, Markov chains, microsensors, space technology
1. INTRODUCTION
Safety is defined as the absence of catastrophic conse-
quences on the user(s) and the environment Aviˇzienis
et al. (2004). The measurement of angular velocity and
acceleration is an important task of every Unmanned
Aerial Vehicle (UAV). Usually, an Inertial Measurement
Unit (IMU) fulfills this task. A failure in the measurement
results in a high risk to human and environment. Also,
the integrated IMU data, merged with the data from other
sensors, e.g. magnetoresistive sensors, help to navigate the
UAV in case of degraded Global Navigation Satellite Sys-
tem (GNSS) or during indoor missions. The IMUFUSION
case study system, presented in this paper, was developed
in order to satisfy the following objectives: (1) record
inertial measurement data in a near space vehicle with
a specially designed, robust, and miniaturized system, (2)
estimate the flight trajectory including orientation by the
recorded inertial measurement data, and (3) integrate a
redundancy concept for higher reliability, diagnosis capa-
bility, and accuracy. A prototypical IMU with redundant
sensors, microcontrollers, and memory modules has been
developed and tested in the gondola of the stratospheric
research balloon of the Balloon EXperiments for Uni-
versity Students (BEXUS) program BEXUS (2018) and
REXUS/BEXUS (2018). The IMUFUSION project was
one of seven BEXUS experiments carried in two BEXUS
balloons which are lifted to a payload depending altitude
of 30 km with a flight duration of about five hours. The
main hardware components and the wiring concept are
shown in Fig. 1. Two Micro Controller Units (MCUs)
with redundant memory modules, 3-axis gyroscopes, 3-axis
acceleration sensors, and 3-axis magnetic field sensors are
the central parts of the IMUFUSION system. These com-
ponents are distributed on three Printed Circuit Boards
(PCBs): Master PCBA-1,Slave PCBA-2, and PCBA-4.
PCBA-3 is responsible for the power management. During
the experiment, the system recorded inertial data in a
near space vehicle with a specially designed robust and
miniaturized system and computed the time-dependent
attitude and position of the gondola.
2. STATE OF THE ART
The classical Fault Tree Analysis (FTA) Ruijters and
Stoelinga (2015) and the Reliability Block Diagram (RBD)
Kim (2011) are the most common quantitative system-
level reliability evaluation techniques. The FTA is a top-
down approach in which an undesired state of a system
21st IFAC Symposium on Automatic Control in Aerospace
August 27-30, 2019. Cranfield, UK
Copyright © 2019 IFAC 394
Model-based Dependability Analysis of
Fault-tolerant Inertial Navigation System:
A Practical Experience Report
Mikael Steurer Andrey Morozov Klaus Janschek
Klaus-Peter Neitzke ∗∗
Institute of Automation, Technische Universit¨at Dresden, Dresden,
Germany, (e-mail: [mikael.steurer, andrey.morozov,
klaus.janschek]@tu-dresden.de).
∗∗ Institute for Informatics, Automation and Electronics, Hochschule
Nordhausen, Nordhausen, Germany, (e-mail:
klaus-peter.neitzke@hs-nordhausen.de
Abstract: Model-based systems engineering approaches are commonly used to develop safety-
critical mechatronic systems. Recently, a new SysML-based method for the dependability
analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of
three main steps: (i) creation of a structural SysML model using building blocks from the
underlying UAV dependability profile that extends the model with block-level reliability and
time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph
Error Propagation Model (DEPM) that captures relevant structural and behavioral properties
of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov
chain models and state-of-the-art probabilistic model checking techniques. This paper describes
the practitioner experiences and lessons learned after the application of the aforementioned
method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The
case study revealed two particular limitations that have been overcome by the optimization
of the method against the state-space explosion of underlying Markov chain models and the
introduction of a new computation algorithm for DEPMs with realistic extremely low fault
activation probabilities.
Keywords: system analysis and design, systems architecture, reliability engineering,
dependability, SysML, Markov chains, microsensors, space technology
1. INTRODUCTION
Safety is defined as the absence of catastrophic conse-
quences on the user(s) and the environment Aviˇzienis
et al. (2004). The measurement of angular velocity and
acceleration is an important task of every Unmanned
Aerial Vehicle (UAV). Usually, an Inertial Measurement
Unit (IMU) fulfills this task. A failure in the measurement
results in a high risk to human and environment. Also,
the integrated IMU data, merged with the data from other
sensors, e.g. magnetoresistive sensors, help to navigate the
UAV in case of degraded Global Navigation Satellite Sys-
tem (GNSS) or during indoor missions. The IMUFUSION
case study system, presented in this paper, was developed
in order to satisfy the following objectives: (1) record
inertial measurement data in a near space vehicle with
a specially designed, robust, and miniaturized system, (2)
estimate the flight trajectory including orientation by the
recorded inertial measurement data, and (3) integrate a
redundancy concept for higher reliability, diagnosis capa-
bility, and accuracy. A prototypical IMU with redundant
sensors, microcontrollers, and memory modules has been
developed and tested in the gondola of the stratospheric
research balloon of the Balloon EXperiments for Uni-
versity Students (BEXUS) program BEXUS (2018) and
REXUS/BEXUS (2018). The IMUFUSION project was
one of seven BEXUS experiments carried in two BEXUS
balloons which are lifted to a payload depending altitude
of 30 km with a flight duration of about five hours. The
main hardware components and the wiring concept are
shown in Fig. 1. Two Micro Controller Units (MCUs)
with redundant memory modules, 3-axis gyroscopes, 3-axis
acceleration sensors, and 3-axis magnetic field sensors are
the central parts of the IMUFUSION system. These com-
ponents are distributed on three Printed Circuit Boards
(PCBs): Master PCBA-1,Slave PCBA-2, and PCBA-4.
PCBA-3 is responsible for the power management. During
the experiment, the system recorded inertial data in a
near space vehicle with a specially designed robust and
miniaturized system and computed the time-dependent
attitude and position of the gondola.
2. STATE OF THE ART
The classical Fault Tree Analysis (FTA) Ruijters and
Stoelinga (2015) and the Reliability Block Diagram (RBD)
Kim (2011) are the most common quantitative system-
level reliability evaluation techniques. The FTA is a top-
down approach in which an undesired state of a system
21st IFAC Symposium on Automatic Control in Aerospace
August 27-30, 2019. Cranfield, UK
Copyright © 2019 IFAC 394
Model-based Dependability Analysis of
Fault-tolerant Inertial Navigation System:
A Practical Experience Report
Mikael Steurer Andrey Morozov Klaus Janschek
Klaus-Peter Neitzke ∗∗
Institute of Automation, Technische Universit¨at Dresden, Dresden,
Germany, (e-mail: [mikael.steurer, andrey.morozov,
klaus.janschek]@tu-dresden.de).
∗∗ Institute for Informatics, Automation and Electronics, Hochschule
Nordhausen, Nordhausen, Germany, (e-mail:
klaus-peter.neitzke@hs-nordhausen.de
Abstract: Model-based systems engineering approaches are commonly used to develop safety-
critical mechatronic systems. Recently, a new SysML-based method for the dependability
analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of
three main steps: (i) creation of a structural SysML model using building blocks from the
underlying UAV dependability profile that extends the model with block-level reliability and
time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph
Error Propagation Model (DEPM) that captures relevant structural and behavioral properties
of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov
chain models and state-of-the-art probabilistic model checking techniques. This paper describes
the practitioner experiences and lessons learned after the application of the aforementioned
method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The
case study revealed two particular limitations that have been overcome by the optimization
of the method against the state-space explosion of underlying Markov chain models and the
introduction of a new computation algorithm for DEPMs with realistic extremely low fault
activation probabilities.
Keywords: system analysis and design, systems architecture, reliability engineering,
dependability, SysML, Markov chains, microsensors, space technology
1. INTRODUCTION
Safety is defined as the absence of catastrophic conse-
quences on the user(s) and the environment Aviˇzienis
et al. (2004). The measurement of angular velocity and
acceleration is an important task of every Unmanned
Aerial Vehicle (UAV). Usually, an Inertial Measurement
Unit (IMU) fulfills this task. A failure in the measurement
results in a high risk to human and environment. Also,
the integrated IMU data, merged with the data from other
sensors, e.g. magnetoresistive sensors, help to navigate the
UAV in case of degraded Global Navigation Satellite Sys-
tem (GNSS) or during indoor missions. The IMUFUSION
case study system, presented in this paper, was developed
in order to satisfy the following objectives: (1) record
inertial measurement data in a near space vehicle with
a specially designed, robust, and miniaturized system, (2)
estimate the flight trajectory including orientation by the
recorded inertial measurement data, and (3) integrate a
redundancy concept for higher reliability, diagnosis capa-
bility, and accuracy. A prototypical IMU with redundant
sensors, microcontrollers, and memory modules has been
developed and tested in the gondola of the stratospheric
research balloon of the Balloon EXperiments for Uni-
versity Students (BEXUS) program BEXUS (2018) and
REXUS/BEXUS (2018). The IMUFUSION project was
one of seven BEXUS experiments carried in two BEXUS
balloons which are lifted to a payload depending altitude
of 30 km with a flight duration of about five hours. The
main hardware components and the wiring concept are
shown in Fig. 1. Two Micro Controller Units (MCUs)
with redundant memory modules, 3-axis gyroscopes, 3-axis
acceleration sensors, and 3-axis magnetic field sensors are
the central parts of the IMUFUSION system. These com-
ponents are distributed on three Printed Circuit Boards
(PCBs): Master PCBA-1,Slave PCBA-2, and PCBA-4.
PCBA-3 is responsible for the power management. During
the experiment, the system recorded inertial data in a
near space vehicle with a specially designed robust and
miniaturized system and computed the time-dependent
attitude and position of the gondola.
2. STATE OF THE ART
The classical Fault Tree Analysis (FTA) Ruijters and
Stoelinga (2015) and the Reliability Block Diagram (RBD)
Kim (2011) are the most common quantitative system-
level reliability evaluation techniques. The FTA is a top-
down approach in which an undesired state of a system
21st IFAC Symposium on Automatic Control in Aerospace
August 27-30, 2019. Cranfield, UK
Copyright © 2019 IFAC 394
Model-based Dependability Analysis of
Fault-tolerant Inertial Navigation System:
A Practical Experience Report
Mikael Steurer Andrey Morozov Klaus Janschek
Klaus-Peter Neitzke ∗∗
Institute of Automation, Technische Universit¨at Dresden, Dresden,
Germany, (e-mail: [mikael.steurer, andrey.morozov,
klaus.janschek]@tu-dresden.de).
∗∗ Institute for Informatics, Automation and Electronics, Hochschule
Nordhausen, Nordhausen, Germany, (e-mail:
klaus-peter.neitzke@hs-nordhausen.de
Abstract: Model-based systems engineering approaches are commonly used to develop safety-
critical mechatronic systems. Recently, a new SysML-based method for the dependability
analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of
three main steps: (i) creation of a structural SysML model using building blocks from the
underlying UAV dependability profile that extends the model with block-level reliability and
time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph
Error Propagation Model (DEPM) that captures relevant structural and behavioral properties
of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov
chain models and state-of-the-art probabilistic model checking techniques. This paper describes
the practitioner experiences and lessons learned after the application of the aforementioned
method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The
case study revealed two particular limitations that have been overcome by the optimization
of the method against the state-space explosion of underlying Markov chain models and the
introduction of a new computation algorithm for DEPMs with realistic extremely low fault
activation probabilities.
Keywords: system analysis and design, systems architecture, reliability engineering,
dependability, SysML, Markov chains, microsensors, space technology
1. INTRODUCTION
Safety is defined as the absence of catastrophic conse-
quences on the user(s) and the environment Aviˇzienis
et al. (2004). The measurement of angular velocity and
acceleration is an important task of every Unmanned
Aerial Vehicle (UAV). Usually, an Inertial Measurement
Unit (IMU) fulfills this task. A failure in the measurement
results in a high risk to human and environment. Also,
the integrated IMU data, merged with the data from other
sensors, e.g. magnetoresistive sensors, help to navigate the
UAV in case of degraded Global Navigation Satellite Sys-
tem (GNSS) or during indoor missions. The IMUFUSION
case study system, presented in this paper, was developed
in order to satisfy the following objectives: (1) record
inertial measurement data in a near space vehicle with
a specially designed, robust, and miniaturized system, (2)
estimate the flight trajectory including orientation by the
recorded inertial measurement data, and (3) integrate a
redundancy concept for higher reliability, diagnosis capa-
bility, and accuracy. A prototypical IMU with redundant
sensors, microcontrollers, and memory modules has been
developed and tested in the gondola of the stratospheric
research balloon of the Balloon EXperiments for Uni-
versity Students (BEXUS) program BEXUS (2018) and
REXUS/BEXUS (2018). The IMUFUSION project was
one of seven BEXUS experiments carried in two BEXUS
balloons which are lifted to a payload depending altitude
of 30 km with a flight duration of about five hours. The
main hardware components and the wiring concept are
shown in Fig. 1. Two Micro Controller Units (MCUs)
with redundant memory modules, 3-axis gyroscopes, 3-axis
acceleration sensors, and 3-axis magnetic field sensors are
the central parts of the IMUFUSION system. These com-
ponents are distributed on three Printed Circuit Boards
(PCBs): Master PCBA-1,Slave PCBA-2, and PCBA-4.
PCBA-3 is responsible for the power management. During
the experiment, the system recorded inertial data in a
near space vehicle with a specially designed robust and
miniaturized system and computed the time-dependent
attitude and position of the gondola.
2. STATE OF THE ART
The classical Fault Tree Analysis (FTA) Ruijters and
Stoelinga (2015) and the Reliability Block Diagram (RBD)
Kim (2011) are the most common quantitative system-
level reliability evaluation techniques. The FTA is a top-
down approach in which an undesired state of a system
21st IFAC Symposium on Automatic Control in Aerospace
August 27-30, 2019. Cranfield, UK
Copyright © 2019 IFAC 394
Model-based Dependability Analysis of
Fault-tolerant Inertial Navigation System:
A Practical Experience Report
Mikael Steurer Andrey Morozov Klaus Janschek
Klaus-Peter Neitzke ∗∗
Institute of Automation, Technische Universit¨at Dresden, Dresden,
Germany, (e-mail: [mikael.steurer, andrey.morozov,
klaus.janschek]@tu-dresden.de).
∗∗ Institute for Informatics, Automation and Electronics, Hochschule
Nordhausen, Nordhausen, Germany, (e-mail:
klaus-peter.neitzke@hs-nordhausen.de
Abstract: Model-based systems engineering approaches are commonly used to develop safety-
critical mechatronic systems. Recently, a new SysML-based method for the dependability
analysis of Unmanned Aerial Vehicles (UAVs) has been introduced. The method consists of
three main steps: (i) creation of a structural SysML model using building blocks from the
underlying UAV dependability profile that extends the model with block-level reliability and
time properties, (ii) transformation of the semi-formal SysML model into a formal Dual-Graph
Error Propagation Model (DEPM) that captures relevant structural and behavioral properties
of the system, (iii) DEPM-based evaluation of system dependability metrics using Markov
chain models and state-of-the-art probabilistic model checking techniques. This paper describes
the practitioner experiences and lessons learned after the application of the aforementioned
method to a sophisticated real-world embedded fault-tolerant inertial navigation system. The
case study revealed two particular limitations that have been overcome by the optimization
of the method against the state-space explosion of underlying Markov chain models and the
introduction of a new computation algorithm for DEPMs with realistic extremely low fault
activation probabilities.
Keywords: system analysis and design, systems architecture, reliability engineering,
dependability, SysML, Markov chains, microsensors, space technology
1. INTRODUCTION
Safety is defined as the absence of catastrophic conse-
quences on the user(s) and the environment Aviˇzienis
et al. (2004). The measurement of angular velocity and
acceleration is an important task of every Unmanned
Aerial Vehicle (UAV). Usually, an Inertial Measurement
Unit (IMU) fulfills this task. A failure in the measurement
results in a high risk to human and environment. Also,
the integrated IMU data, merged with the data from other
sensors, e.g. magnetoresistive sensors, help to navigate the
UAV in case of degraded Global Navigation Satellite Sys-
tem (GNSS) or during indoor missions. The IMUFUSION
case study system, presented in this paper, was developed
in order to satisfy the following objectives: (1) record
inertial measurement data in a near space vehicle with
a specially designed, robust, and miniaturized system, (2)
estimate the flight trajectory including orientation by the
recorded inertial measurement data, and (3) integrate a
redundancy concept for higher reliability, diagnosis capa-
bility, and accuracy. A prototypical IMU with redundant
sensors, microcontrollers, and memory modules has been
developed and tested in the gondola of the stratospheric
research balloon of the Balloon EXperiments for Uni-
versity Students (BEXUS) program BEXUS (2018) and
REXUS/BEXUS (2018). The IMUFUSION project was
one of seven BEXUS experiments carried in two BEXUS
balloons which are lifted to a payload depending altitude
of 30 km with a flight duration of about five hours. The
main hardware components and the wiring concept are
shown in Fig. 1. Two Micro Controller Units (MCUs)
with redundant memory modules, 3-axis gyroscopes, 3-axis
acceleration sensors, and 3-axis magnetic field sensors are
the central parts of the IMUFUSION system. These com-
ponents are distributed on three Printed Circuit Boards
(PCBs): Master PCBA-1,Slave PCBA-2, and PCBA-4.
PCBA-3 is responsible for the power management. During
the experiment, the system recorded inertial data in a
near space vehicle with a specially designed robust and
miniaturized system and computed the time-dependent
attitude and position of the gondola.
2. STATE OF THE ART
The classical Fault Tree Analysis (FTA) Ruijters and
Stoelinga (2015) and the Reliability Block Diagram (RBD)
Kim (2011) are the most common quantitative system-
level reliability evaluation techniques. The FTA is a top-
down approach in which an undesired state of a system
21st IFAC Symposium on Automatic Control in Aerospace
August 27-30, 2019. Cranfield, UK
Copyright © 2019 IFAC 394
VCC
GND
2 x PT1000
Mainbox
PCBA-3 Power
PCBA-2 Slave
PCBA-1 Master
PCBA-4 Magnet
PT02E8-4P
62GB-16F10-07SN
Cable Glands
to GPS antennas
to E-Link
RJF21B
TEN 8-2412WI
MPU-6050
LIS3MDLTR
PT1000
conditioning
circuit
NEO-M8N
Memory
LM 2937 IMP-3.3
5.0
LM 2937 IMP-5.0
5.0
GMR100HTBFER015
TMP116AIDRVR
MS5607-02BA03-50
Color-coding:
GND
3.3 V
5.0 V
UART
Analog signal
I2C
RF cabling
Ethernet
Sensorbox
to Sensorbox
STM32F429ZIT6
Fig. 1. The IMUFUSION system architecture: Electronic components and the wiring concept.
is analyzed using the Boolean logic that combines a
series of lower-level events such as failures of individual
components. RBDs have a similar underlying concept, but
model the system success while fault trees model a system
failure.
Model-based systems engineering approaches such as
UML, SysML, AADL, and Simulink UML (2001); OMG
(2008); Feiler et al. (2006); Ong (1998) are common to
develop safety-critical systems. The system presented in
this paper has been designed using the SysML. Reliability
models can be generated automatically from various base-
line system models, including SysML diagrams. Methods
for the automatic generation of fault trees from SysML di-
agrams are proposed in Mhenni et al. (2014) and Machida
et al. (2013) respectively. Other SysML-based methods
for quantitative reliability analysis exploit various types
of Markov chain models Debbabi et al. (2010); Ouchani
et al. (2014); Jarraya et al. (2007); Ali et al. (2015); Baouya
et al. (2015). Markov chain based methods provide access
to powerful Probabilistic Model Checking (PMC) Baier
and Katoen (2008) techniques that allow the evaluation
of advanced, highly customizable, time-related reliability
properties.
Besides the discussed general methods, there are more spe-
cific ones. A SysML-based method for the dependability
analysis of UAVs is introduced in Steurer et al. (2018). The
new domain-specific SysML UAV Dependability Profile
(UDP), that captures reliability-related properties of UAV
components and the transformation algorithm from pro-
filed SysML models to the formal Dual-graph Error Prop-
agation Models (DEPM) Morozov and Janschek (2014) for
further extensive analysis are two key parts of this method.
The UDP allows annotating SysML blocks and flows with
update frequencies and reliability properties. The update
frequencies are defined based on the technical description
and the configuration of the deployed components. The
reliability properties are defined using expert experience,
common guidelines for part reliability evaluation, or the
combination of both. The annotated SysML model is
transformed into a DEPM using the algorithms presented
in Steurer et al. (2018). The DEPM captures system and
data flow structures, and reliability properties of system
components and enables the computation of reliability
metrics using underlying Discrete-Time Markov Chain
(DTMC) models. Fig. 5 shows an example of a DEPM.
The DEPM combines two directed graph models: a control
flow graph and a data flow graph. The nodes of the graphs
represent executable system elements (rounded rectangles)
and data storages (rectangles with blue borders). Con-
trol flow arcs (black lines) model control flow transitions
between the elements. Data flow arcs (blue lines) model
data transfer between the elements and data storages. The
reliability metrics are computed for the defined failures
(highlighted in red) using automatically generated DTMC
models whose states describe the current control flow state
(which element will be executed next) and the states of all
data storages. Technically a DEPM is transformed into
one or several PRISM Kwiatkowska et al. (2002) models
for numerical evaluation with stochastic model checkers
like PRISM or STORM Dehnert et al. (2017).
The method presented in Steurer et al. (2018) has been
demonstrated with a rather simple quadcopter case study.
This paper describes the practitioner experiences and
lessons learned after the full-scale method application to a
real sophisticated inertial navigation system. The applica-
tion has proven the feasibility of the method and revealed
two particular limitations, which have been overcome by
optimizing the method against the state-space explosion
of underlying Markov chain models and introducing a new
computation algorithm for DEPMs with extremely low
fault activation probabilities.
3. SYSML AND UDP SYSTEM MODELING
The creation of a profiled SysML model is the first
step of the analytical workflow as illustrated in Fig. 2.
We used the UML and SysML modeling tool Papyrus
Lanusse et al. (2009). The SysML Internal Block Diagram
(IBD) in Fig. 3 shows how the instances of the top-
level blocks are interconnected. The system consists of
seven sub-components: two SensorArray (UDP stereotype
<<Sensor>>), two Mcu (UDP stereotype <<MCU>>),
PowerSys (UDP stereotype <<PowerSystem>>), and two
Gps (UDP stereotype <<GPS>>.The PowerSys part is
powered by the external battery pack of the gondola and
2019 IFAC ACA
August 27-30, 2019. Cranfield, UK
395
396 Mikael Steurer et al. / IFAC PapersOnLine 52-12 (2019) 394–399
SysML + UDP
[structural Model; annotated BDDs and IBDs]
DEPM
[Control flow, Data flow, Fault probabilities]
Analysis results
[MTTF, Probability (t), ...]
Transformation
Computation
Fig. 2. The method workflow.
provides 3.3 V to the sa1,sa2,master, and slave parts and
additionally 5 V to the master and slave parts. The master
and slave parts distribute the 3.3 V power supply to the
sensors and the GPS receivers. The sa1 and sa2 parts
send the measured data via analog and digital interfaces to
master and slave respectively. Each SensorArray contains
all sensors that are necessary for the computation of the
current pose and position. To achieve higher accuracy
the data of both SensorArrays are shared via Universal
Asynchronous Receiver Transmitter (UART) connection
between master and slave and merged in both. The gps1
and gps2 components share their data also with the master
and the slave via UART. Similar IBDs have been created
for all lower hierarchical levels.
The power system mainly contains two TEN 8-2412WI
dc/dc converters, one GMR100HTBFER015 high power
low ohmic chip shunt resistor, one InputCurrentVoltage
power measurement conditioning circuit, four LM 2937
IMP-3.3 3.3 V linear voltage regulators, and two LM
2937 IMP-5.0 5.0 V linear voltage regulators. Each Sen-
sorArray primarily consists of one MPU-6050 combined
3-axis accelerometer and gyroscope, one MS5607-02BA03-
50 barometric pressure sensor, one TMP116AIDRVR dig-
ital pressure sensor, one CiSPresSens special for the
prevailing conditions calibrated CiS pressure sensor, one
LIS3MDLTR 3-axis magnetometer, and one Pt1000 out-
side temperature sensor with the related conditioning cir-
cuit OutsideTempCircuit. The essential parts of the Mcu
are the STM32F429ZIT6 32-bit Arm Cortex-M4 based
microcontroller with a frequency of up to 180 MHz, the
surrounding Printed Circuit Board Assembly (PCBA) in-
cluding two separated micro SD-card memories per con-
troller and an ethernet interface. Only the instance master
is connected to the ground station for the purpose of
communication. The Gps part mainly consists of the NEO-
M8N GPS-module and the Taoglas Titan GPS RG-174
GPS-antenna.
All atomic (indivisible) components are annotated with
update frequencies and failure rates, see Table 1. The
related flow arcs are annotated with update frequencies.
The update frequencies are defined using the component
specifications and refer to the update of the values of
the flow model components. The failure rates are speci-
fied using reliability guidelines such as the FIDES Guide
et al. (2009), or the Nonelectronic Parts Reliability Data
(NRPD) Denson et al. (1994).
4. SYSML TO DEPM TRANSFORMATION
A systematic approach for the transformation of the
SysML model into a DEPM is the second step and de-
scribed in Steurer et al. (2018). This method is partially
reused in our case study. However, it was modified in order
to exploit new features of the latest version of the DEPM-
based tool OpenErrorPro Morozov et al. (2015).
1) Frequency classes: System components modeled
with the SysML UDP blocks are classified according to
their update frequencies. The case-study system contains
the three frequency classes: f1,f2, and f3whereby the
fastest class f3(1 MHz) contains 13 components, f2(1
KHz) contains 12 components, and the slowest class f1
(10 Hz) contains 4 components. Analog components with
a theoretically infinite frequency are classified into the
highest frequency class.
2) DEPM elements and control flow arcs: For each
atomic SysML UDP block, we create a DEPM element.
Based on the defined frequency classes the elements are
grouped into three hierarchical DEPMs. The top-level
DEPM, see Fig. 5, contains the elements with the lowest
update frequencies as well as the compound element f2
that contains the DEPM of the second frequency class. The
same structure is created for the second-level DEPM and
so on. The number of repetitions represents the iterations
of the sub model until the superordinate level is executed
according to repi=fi/fi1. Where, fiis the frequency
of the higher frequency class and fi1is the frequency
of the lower frequency class. For instance, the number
of repetitions of the compound element f2of the top-
level DEPM is equal to rep2= 1000Hz/10Hz = 100.
The elements of the top-level DEPM are connected into
a loop structure with DEPM control flow arcs based on
the attributes informationSource and informationTarget
of the UDP Flows. The elements of the other DEPMs are
connected sequentially.
3) DEPM data storages and data flow arcs: Data
storages are created and connected via DEPM data flow
arcs with the corresponding DEPM elements, based on
the UDP Flow attributes such as informationSource,in-
formationTarget, and name. Also, backward data flow arcs
from a data storage to the source element are added
in order to model permanent faults. Such data storages
connected via incoming and outgoing data flow arcs with
the same element are called state data. In Fig. 5, for
example the state data dGpsAnt1 models both the state
and the output of the element GpsAnt1. Redundant ele-
ments require an additional data comparison element. For
example, Fig. 4 shows two redundant voltage regulators
Reg12V1,Reg12V2, and the following comparison element
Reg12Comp with their associated error propagation com-
mands. All state data of sub DEPMs f3and f2also
appear in the top-level DEPM f1in order to model error
propagation between the frequency classes, as shown in
Fig. 5.
4) Probabilistic commands: Probabilistic commands in
PRISM format are created to describe the fault activation
and error propagation processes in the DEPM elements.
The probabilities of errors in the outputs of the elements
are calculated as follows:
EPc=λc[FIT]
fc[Hz]·3600 [ s
h]·109(1)
Where λcis the component failure rate and fcthe up-
date frequency of the component. The computed error
2019 IFAC ACA
August 27-30, 2019. Cranfield, UK
396
Mikael Steurer et al. / IFAC PapersOnLine 52-12 (2019) 394–399 397
SysML + UDP
[structural Model; annotated BDDs and IBDs]
DEPM
[Control flow, Data flow, Fault probabilities]
Analysis results
[MTTF, Probability (t), ...]
Transformation
Computation
Fig. 2. The method workflow.
provides 3.3 V to the sa1,sa2,master, and slave parts and
additionally 5 V to the master and slave parts. The master
and slave parts distribute the 3.3 V power supply to the
sensors and the GPS receivers. The sa1 and sa2 parts
send the measured data via analog and digital interfaces to
master and slave respectively. Each SensorArray contains
all sensors that are necessary for the computation of the
current pose and position. To achieve higher accuracy
the data of both SensorArrays are shared via Universal
Asynchronous Receiver Transmitter (UART) connection
between master and slave and merged in both. The gps1
and gps2 components share their data also with the master
and the slave via UART. Similar IBDs have been created
for all lower hierarchical levels.
The power system mainly contains two TEN 8-2412WI
dc/dc converters, one GMR100HTBFER015 high power
low ohmic chip shunt resistor, one InputCurrentVoltage
power measurement conditioning circuit, four LM 2937
IMP-3.3 3.3 V linear voltage regulators, and two LM
2937 IMP-5.0 5.0 V linear voltage regulators. Each Sen-
sorArray primarily consists of one MPU-6050 combined
3-axis accelerometer and gyroscope, one MS5607-02BA03-
50 barometric pressure sensor, one TMP116AIDRVR dig-
ital pressure sensor, one CiSPresSens special for the
prevailing conditions calibrated CiS pressure sensor, one
LIS3MDLTR 3-axis magnetometer, and one Pt1000 out-
side temperature sensor with the related conditioning cir-
cuit OutsideTempCircuit. The essential parts of the Mcu
are the STM32F429ZIT6 32-bit Arm Cortex-M4 based
microcontroller with a frequency of up to 180 MHz, the
surrounding Printed Circuit Board Assembly (PCBA) in-
cluding two separated micro SD-card memories per con-
troller and an ethernet interface. Only the instance master
is connected to the ground station for the purpose of
communication. The Gps part mainly consists of the NEO-
M8N GPS-module and the Taoglas Titan GPS RG-174
GPS-antenna.
All atomic (indivisible) components are annotated with
update frequencies and failure rates, see Table 1. The
related flow arcs are annotated with update frequencies.
The update frequencies are defined using the component
specifications and refer to the update of the values of
the flow model components. The failure rates are speci-
fied using reliability guidelines such as the FIDES Guide
et al. (2009), or the Nonelectronic Parts Reliability Data
(NRPD) Denson et al. (1994).
4. SYSML TO DEPM TRANSFORMATION
A systematic approach for the transformation of the
SysML model into a DEPM is the second step and de-
scribed in Steurer et al. (2018). This method is partially
reused in our case study. However, it was modified in order
to exploit new features of the latest version of the DEPM-
based tool OpenErrorPro Morozov et al. (2015).
1) Frequency classes: System components modeled
with the SysML UDP blocks are classified according to
their update frequencies. The case-study system contains
the three frequency classes: f1,f2, and f3whereby the
fastest class f3(1 MHz) contains 13 components, f2(1
KHz) contains 12 components, and the slowest class f1
(10 Hz) contains 4 components. Analog components with
a theoretically infinite frequency are classified into the
highest frequency class.
2) DEPM elements and control flow arcs: For each
atomic SysML UDP block, we create a DEPM element.
Based on the defined frequency classes the elements are
grouped into three hierarchical DEPMs. The top-level
DEPM, see Fig. 5, contains the elements with the lowest
update frequencies as well as the compound element f2
that contains the DEPM of the second frequency class. The
same structure is created for the second-level DEPM and
so on. The number of repetitions represents the iterations
of the sub model until the superordinate level is executed
according to repi=fi/fi1. Where, fiis the frequency
of the higher frequency class and fi1is the frequency
of the lower frequency class. For instance, the number
of repetitions of the compound element f2of the top-
level DEPM is equal to rep2= 1000Hz/10Hz = 100.
The elements of the top-level DEPM are connected into
a loop structure with DEPM control flow arcs based on
the attributes informationSource and informationTarget
of the UDP Flows. The elements of the other DEPMs are
connected sequentially.
3) DEPM data storages and data flow arcs: Data
storages are created and connected via DEPM data flow
arcs with the corresponding DEPM elements, based on
the UDP Flow attributes such as informationSource,in-
formationTarget, and name. Also, backward data flow arcs
from a data storage to the source element are added
in order to model permanent faults. Such data storages
connected via incoming and outgoing data flow arcs with
the same element are called state data. In Fig. 5, for
example the state data dGpsAnt1 models both the state
and the output of the element GpsAnt1. Redundant ele-
ments require an additional data comparison element. For
example, Fig. 4 shows two redundant voltage regulators
Reg12V1,Reg12V2, and the following comparison element
Reg12Comp with their associated error propagation com-
mands. All state data of sub DEPMs f3and f2also
appear in the top-level DEPM f1in order to model error
propagation between the frequency classes, as shown in
Fig. 5.
4) Probabilistic commands: Probabilistic commands in
PRISM format are created to describe the fault activation
and error propagation processes in the DEPM elements.
The probabilities of errors in the outputs of the elements
are calculated as follows:
EPc=λc[FIT]
fc[Hz]·3600 [ s
h]·109(1)
Where λcis the component failure rate and fcthe up-
date frequency of the component. The computed error
2019 IFAC ACA
August 27-30, 2019. Cranfield, UK
396
<<Block>>
IMUFUSION
+powerman:PowerSys [1]
+slave:Mcu [1]
+sa1:SensorArray [1]
+gps2:Gps [1]
+sa2:SensorArray [1]
PowerToPm
Power3.3ToMag1
DigData1ToMaster
AnData1ToMaster
Power3.3ToSa1
Power3.3ToMag2
Power5ToSa1
Power5ToMaster
Power3.3ToMaster
+gps1:Gps [1]
Power3.3Gps1
Power3.3Gps2
+master:Mcu [1]
DigData2ToSlave
AnData2ToSlave
Power3.3ToSa2
Power5ToSa2
Power5ToSlave
Power3.3ToSlave
GpsData1ToSlave
GpsData1ToMaster
GpsData2ToMaster
GpsData2ToSlave
SlaveToMaster
MasterToSlave
DataToGs
AnDataPowerToMaster
Fig. 3. The IMUFUSION top-level Internal Block Diagram.
Reg12V1 (Error propagation commands)
(dReg12V1=ok) ->
0.99999999999999999441:(dReg12V1'=ok) +
0.00000000000000000559:(dReg12V1'=error);
(dReg12V1!=ok) ->
1.0:(dReg12V1'=error);
Reg12Comp (Error propagation commands)
(dReg12V1=ok) | (dReg12V2=ok) ->
(dReg12Comp'=ok);
(dReg12V1!=ok) & (dReg12V2!=ok) ->
(dReg12Comp'=error);
Reg12V2 (Error propagation commands)
(dReg12V2=ok) ->
0.99999999999999999441:(dReg12V2'=ok) +
0.00000000000000000559:(dReg12V2'=error);
(dReg12V2!=ok) ->
1.0:(dReg12V2'=error);
Fig. 4. A fragment of the frequency class f3DEPM.
probabilities PFare listed in the fourth column of Ta-
ble 1. The probabilistic commands for comparison ele-
ments are created separately, see the commands of the
element Reg12Comp in Fig. 4.
5) Number of steps and element timing properties:
The DEPM analysis is based on automatically generated
DTMC models. Therefore, numerical results are related to
the number of steps. Each step corresponds to a DTMC
transition from one state to another that is equivalent to
the execution of one DEPM element. The number of steps
for the top-level DEPM that corresponds to a time unit
should be computed in order to match analysis results to
the system operation time. The number of steps per hour
of the top-level DEPM is calculated as follows:
NS[h1]=NE·fi[Hz]·3600 [ s
h] = 252000 h1(2)
Where fiis the frequency of the ith frequency class and
NEis the number of elements in the ith frequency class.
We set the execution time of all elements to zero and add
the additional element Clock to the top-level DEPM and
specify its execution time to 0.1 s that corresponds to the
first frequency class.
Table 1. The component failure rates λc, the
frequency fc, and the specific error probabili-
ties EPc.
Name λc[FIT] fc[Hz] EPc
Shunt 1.98 1000000 5.507e-19
Voltage regulator 20.14 1000000 5.594e-18
Accel/Gyro 3648.35 1000 1.013e-12
Temperature sensor 10.20 1000 2.833e-15
Magnetometer 3624.45 1000 1.007e-12
Pressure sensor 1728.95 1000 4.803e-13
Memory 12.53 1000 3.481e-15
PCBA 496.92 1000000 1.380e-16
Microcontroller 32.00 1000000 8.889e-18
GPS module 12.71 10 3.530e-13
GPS antenna 20.65 10 5.735e-13
5. DEPENDABILITY ANALYSIS
The next step is the numerical dependability analysis of
the generated DEPM using OpenErrorPro. The tool au-
tomatically creates a PRISM DTMC model for the lowest
hierarchical level, computes it using the built-in PRISM
interface, and uses the results for the computation of the
next hierarchical level. For instance, Fig. 5 shows the top-
level DEPM with already computed second and third level
DEPMs and defined system failures. The probabilistic
commands of the element f2are automatically generated
based on the submodels computation results. For example,
the F dM emAll =”dMem11=error &dM em12 =
error &dM em21 = error &dMem22 = error” failure
models simultaneous errors in all four memory modules.
The computed probability of this failure for one hour
mission time is equal to 2.07e09.
After this verification, we performed an extensive DEPM-
based sensitivity analysis of the FdMemAll failure to an in-
crease of the failure rates of all component classes listed in
Table 1. The results are shown in the last column of Table 1
and visualized with the diagram in Fig. 5. The axis ”None”
represents the originally computed FdMemAll probability.
2019 IFAC ACA
August 27-30, 2019. Cranfield, UK
397
398 Mikael Steurer et al. / IFAC PapersOnLine 52-12 (2019) 394–399
“dMem11=error“
0.1 s
0 s
0 s
0 s
0 s
0 s
0 s
“dMem11=error &
dMem12=error &
dMem21=error &
dMem22=error
f2 (excerpt error propagation commands)
(dSens2=ok) & (dSens1=ok) & (dGpsComp=ok) &
(dMem21=ok) & (dReg12V2=ok) & (dMem11=ok) &
(dReg12V1=ok) & (dMCU2=ok) & (dMCU1=ok) &
(dMem12=ok) & (dMem22=ok) ->
1.0690732583455869e-48:(dSens2'=ok) &
(dSens1'=ok) & (dMem21'=error) &
(dReg12V2'=error) & (dMem11'=error) &
(dReg12V1'=ok) & (dMCU1'=error) &
(dMCU2'=ok) & (dMem12'=error) &
(dMem22'=error) +
2.3558603315428984e-50:(dSens2'=ok) &
(dSens1'=ok) & (dMem21'=error) &
(dReg12V2'=ok) & (dMem11'=error) &
(dReg12V1'=error) & (dMCU1'=ok) &
(dMCU2'=ok) & (dMem12'=ok) &
(dMem22'=error) +
4.868816127347023e-37:(dSens2'=error) &
(dSens1'=ok) & (dMem21'=ok) &
(dReg12V2'=error) & (dMem11'=error) &
(dReg12V1'=ok) & (dMCU1'=ok) & (dMCU2'=ok)
& (dMem12'=ok) & (dMem22'=ok) +
1.3851776349443909e-22:
GpsComp (Error propagation commands)
(dGpsMod2=ok) | (dGpsMod1=ok) ->
(dGpsComp'=ok);
(dGpsMod2!=ok) & (dGpsMod1!=ok) ->
(dGpsComp'=error);
Fig. 5. The top-level DEPM model for the frequency class f1with defined failures and the PF dM emAll(5h) with Individual
tenfold increase of component failure rate.
The other axes show the probability of the FdMemAll
failure after a tenfold increase of the corresponding com-
ponent failure rate. Three significant deteriorations of the
PF dM emAll (5h) appear in the component classes Shunt,
Accel/Gyro, and Magnetometer. This indicates that these
components shall not be replaced by less reliable ones.
Vice versa, all other component classes are insensitive to
a tenfold component failure rate increase. This type of
sensitivity analysis is very useful for further development
because it helps to reduce costs while preserving the re-
quired reliability level.
6. CHALLENGES
Challenge 1: We noticed that the PRISM model checker
(v4.4) used by OpenErroPro can not process correctly
the probabilities that are close to one (double-precision
floating-point format). Extremely low probabilities of er-
rors that are less than e15 are processed correctly, but the
opposite probabilities with more than 15 decimal places
of nines are rounded to one. We have encountered this
problem only for the third-level DEPM because the prob-
abilities of error in the frequency class f3(1000000 Hz) are
in the magnitude of e19. Therefore, the original compo-
nent failure rates of the third-level DEPM components are
multiplied by 105and 106. The result of the computation
of a sub DEPM is basically a number of probabilities for
possible combinations of ’ok’ and ’error’ values of the ex-
ternal data outputs that are used on higher DEPM levels.
After that, we compute factors for each combination as the
ratio of the x106and x105results and use this factor for the
computation of the real probabilities in the last column.
Note that this method can be applied only if the relations
between the component-level error probabilities and the
computed probabilities are linear. This was checked and
appeared to be true for the IMUFUSION system.
Challenge 2: The State Space Explosion (SSE) is the
problem of all Markov chain based methods. OpenError-
Pro and PRISM support several inherent optimizations.
However, additional DEPM-level optimizations based on
expert knowledge are required. The maximum possible
number of DTMC states for a DEPM model is
S=E·
D
i=1
Vi(3)
Where Eis the number of DEPM elements, Dis the
number of data storages, and Viis the number of modeled
values of the data storage i. The number of data storages is
a crucial part of the equation because it causes exponential
growth of the state space. For the computation of the
DEPM shown in Fig. 5 with 7 elements, 15 data storages
that can take values from {’ok’,’error’}, in the worst case,
7·215 DTMC states have to be generated. To cope with this
problem the practical solution was to reduce the number
of state data that are not taking part in failure definitions
and are not directly relevant for the analysis. In particular,
we identified sequential control flow structures where the
state data of a preceding element is used only as input for
a single succeeding element. After that, we removed the
backward data flow arc from the state data to its parent
element for all elements in the sequential structure except
the last one. By doing this we reduced the number of state
data preserving the same behavior of the model.
2019 IFAC ACA
August 27-30, 2019. Cranfield, UK
398
Mikael Steurer et al. / IFAC PapersOnLine 52-12 (2019) 394–399 399
“dMem11=error“
0.1 s
0 s
0 s
0 s
0 s
0 s
0 s
“dMem11=error &
dMem12=error &
dMem21=error &
dMem22=error
f2 (excerpt error propagation commands)
(dSens2=ok) & (dSens1=ok) & (dGpsComp=ok) &
(dMem21=ok) & (dReg12V2=ok) & (dMem11=ok) &
(dReg12V1=ok) & (dMCU2=ok) & (dMCU1=ok) &
(dMem12=ok) & (dMem22=ok) ->
1.0690732583455869e-48:(dSens2'=ok) &
(dSens1'=ok) & (dMem21'=error) &
(dReg12V2'=error) & (dMem11'=error) &
(dReg12V1'=ok) & (dMCU1'=error) &
(dMCU2'=ok) & (dMem12'=error) &
(dMem22'=error) +
2.3558603315428984e-50:(dSens2'=ok) &
(dSens1'=ok) & (dMem21'=error) &
(dReg12V2'=ok) & (dMem11'=error) &
(dReg12V1'=error) & (dMCU1'=ok) &
(dMCU2'=ok) & (dMem12'=ok) &
(dMem22'=error) +
4.868816127347023e-37:(dSens2'=error) &
(dSens1'=ok) & (dMem21'=ok) &
(dReg12V2'=error) & (dMem11'=error) &
(dReg12V1'=ok) & (dMCU1'=ok) & (dMCU2'=ok)
& (dMem12'=ok) & (dMem22'=ok) +
1.3851776349443909e-22:
GpsComp (Error propagation commands)
(dGpsMod2=ok) | (dGpsMod1=ok) ->
(dGpsComp'=ok);
(dGpsMod2!=ok) & (dGpsMod1!=ok) ->
(dGpsComp'=error);
Fig. 5. The top-level DEPM model for the frequency class f1with defined failures and the PF dM emAll(5h) with Individual
tenfold increase of component failure rate.
The other axes show the probability of the FdMemAll
failure after a tenfold increase of the corresponding com-
ponent failure rate. Three significant deteriorations of the
PF dM emAll (5h) appear in the component classes Shunt,
Accel/Gyro, and Magnetometer. This indicates that these
components shall not be replaced by less reliable ones.
Vice versa, all other component classes are insensitive to
a tenfold component failure rate increase. This type of
sensitivity analysis is very useful for further development
because it helps to reduce costs while preserving the re-
quired reliability level.
6. CHALLENGES
Challenge 1: We noticed that the PRISM model checker
(v4.4) used by OpenErroPro can not process correctly
the probabilities that are close to one (double-precision
floating-point format). Extremely low probabilities of er-
rors that are less than e15 are processed correctly, but the
opposite probabilities with more than 15 decimal places
of nines are rounded to one. We have encountered this
problem only for the third-level DEPM because the prob-
abilities of error in the frequency class f3(1000000 Hz) are
in the magnitude of e19. Therefore, the original compo-
nent failure rates of the third-level DEPM components are
multiplied by 105and 106. The result of the computation
of a sub DEPM is basically a number of probabilities for
possible combinations of ’ok’ and ’error’ values of the ex-
ternal data outputs that are used on higher DEPM levels.
After that, we compute factors for each combination as the
ratio of the x106and x105results and use this factor for the
computation of the real probabilities in the last column.
Note that this method can be applied only if the relations
between the component-level error probabilities and the
computed probabilities are linear. This was checked and
appeared to be true for the IMUFUSION system.
Challenge 2: The State Space Explosion (SSE) is the
problem of all Markov chain based methods. OpenError-
Pro and PRISM support several inherent optimizations.
However, additional DEPM-level optimizations based on
expert knowledge are required. The maximum possible
number of DTMC states for a DEPM model is
S=E·
D
i=1
Vi(3)
Where Eis the number of DEPM elements, Dis the
number of data storages, and Viis the number of modeled
values of the data storage i. The number of data storages is
a crucial part of the equation because it causes exponential
growth of the state space. For the computation of the
DEPM shown in Fig. 5 with 7 elements, 15 data storages
that can take values from {’ok’,’error’}, in the worst case,
7·215 DTMC states have to be generated. To cope with this
problem the practical solution was to reduce the number
of state data that are not taking part in failure definitions
and are not directly relevant for the analysis. In particular,
we identified sequential control flow structures where the
state data of a preceding element is used only as input for
a single succeeding element. After that, we removed the
backward data flow arc from the state data to its parent
element for all elements in the sequential structure except
the last one. By doing this we reduced the number of state
data preserving the same behavior of the model.
2019 IFAC ACA
August 27-30, 2019. Cranfield, UK
398
7. CONCLUSION
The article presents the practitioner experiences and
lessons learned after the application of a recently intro-
duced method for the model-based dependability analysis
to an inertial navigation system for near-space vehicles.
The case study system was developed within the con-
text of the Balloon Experiments for University Students
(BEXUS) campaign. The SysML UAV Dependability Pro-
file (UDP) and the transformation algorithm from SysML
to the Dual-graph Error Propagation Model (DEPM) are
two major parts of this method. With the optimization
against the state-space explosion of underlying Markov
chain models and the introduction of a new computation
algorithm for DEPMs with extremely low fault activation
probabilities, it was demonstrated that the method copes
with this type of mechatronic system and is appropriate
for prospective tasks of dependability analysis in early
development phases. The analytical results help to eval-
uate overall system reliability, probabilities of particular
failures, and to identify critical components. The presented
approach is systematic and partially automated. A future
goal is the complete automation of the method, especially
of the transformation step which will also increase the
consistency. The DEPM capabilities allow more complex
and realistic system analysis, that can be fully exploited
with further investigations on possible extensions of the
UDP.
REFERENCES
Ali, S., Basit-Ur-Rahim, M.A., and Arif, F. (2015). Formal
verification of internal block diagram of sysml for model-
ing real-time system. In Software Engineering, Artificial
Intelligence, Networking and Parallel/Distributed Com-
puting (SNPD), 2015 16th IEEE/ACIS International
Conference on, 1–6. IEEE.
Aviˇzienis, A., Laprie, J.C., and Randell, B. (2004). De-
pendability and its threats: a taxonomy. In Building the
Information Society, 91–120. Springer.
Baier, C. and Katoen, J.P. (2008). Principles of model
checking. MIT press.
Baouya, A., Bennouar, D., Mohamed, O.A., and Ouchani,
S. (2015). A probabilistic and timed verification ap-
proach of sysml state machine diagram. In Programming
and Systems (ISPS), 2015 12th International Sympo-
sium on, 1–9. IEEE.
BEXUS (2018). Balloon experiments for university stu-
dents. URL http://rexusbexus.net/bexus/.
Debbabi, M., Hassa¨ıne, F., Jarraya, Y., Soeanu, A., and
Alawneh, L. (2010). Verification and Validation in
Systems Engineering - Assessing UML / SysML Design
Models.
Dehnert, C., Junges, S., Katoen, J.P., and Volk, M.
(2017). A storm is coming: A modern probabilistic
model checker. In International Conference on Com-
puter Aided Verification, 592–600. Springer.
Denson, W., Chandler, G., Crowell, W., Clark, A., and
Jaworski, P. (1994). Nonelectronic parts reliability data
1995. Technical report, RELIABILITY ANALYSIS
CENTER GRIFFISS AFB NY.
Feiler, P.H., Gluch, D.P., and Hudak, J.J. (2006). The
architecture analysis & design language (aadl): An in-
troduction. Technical report, Carnegie-Mellon Univ
Pittsburgh PA Software Engineering Inst.
Guide, F. et al. (2009). Reliability methodology for
electronic systems. FIDES group.
Jarraya, Y., Soeanu, A., Debbabi, M., and Hassaine, F.
(2007). Automatic verification and performance analysis
of time-constrained sysml activity diagrams. In Engi-
neering of Computer-Based Systems, 2007. ECBS’07.
14th Annual IEEE International Conference and Work-
shops on the, 515–522. IEEE.
Kim, M.C. (2011). Reliability block diagram with general
gates and its application to system reliability analysis.
Annals of Nuclear Energy, 38(11), 2456–2461.
Kwiatkowska, M., Norman, G., and Parker, D. (2002).
Prism: Probabilistic symbolic model checker. In In-
ternational Conference on Modelling Techniques and
Tools for Computer Performance Evaluation, 200–204.
Springer.
Lanusse, A., Tanguy, Y., Espinoza, H., Mraidha, C., Ger-
ard, S., Tessier, P., Schnekenburger, R., Dubois, H., and
Terrier, F. (2009). Papyrus uml: an open source toolset
for mda. In Proc. of the Fifth European Conference
on Model-Driven Architecture Foundations and Appli-
cations (ECMDA-FA 2009), 1–4.
Machida, F., Xiang, J., Tadano, K., and Maeno, Y. (2013).
Composing hierarchical stochastic model from sysml for
system availability analysis. In 2013 IEEE 24th Inter-
national Symposium on Software Reliability Engineering
(ISSRE), 51–60. IEEE.
Mhenni, F., Nguyen, N., and Choley, J.Y. (2014). Au-
tomatic fault tree generation from sysml system mod-
els. In Advanced Intelligent Mechatronics (AIM), 2014
IEEE/ASME International Conference on, 715–720.
IEEE.
Morozov, A., Tuk, R., and Janschek, K. (2015). Errorpro:
Software tool for stochastic error propagation analysis.
In 1st International Workshop on Resiliency in Embed-
ded Electronic Systems, Amsterdam, The Netherlands,
59–60.
Morozov, A. and Janschek, K. (2014). Probabilistic error
propagation model for mechatronic systems. Mechatron-
ics, 24(8), 1189–1202.
OMG, H. (2008). Omg systems modeling language (omg
sysml)-version 1.4. Juni-2015.
Ong, C.M. (1998). Dynamic simulation of electric machin-
ery: using MATLAB/SIMULINK, volume 5. Prentice
hall PTR Upper Saddle River, NJ.
Ouchani, S., Mohamed, O.A., and Debbabi, M. (2014).
A formal verification framework for sysml activity dia-
grams. Expert Systems with Applications, 41(6), 2713–
2728.
REXUS/BEXUS (2018). Rexus/bexus rocket and
balloon experiments for university students. URL
http://rexusbexus.net/.
Ruijters, E. and Stoelinga, M. (2015). Fault tree analysis:
A survey of the state-of-the-art in modeling, analysis
and tools. Computer science review, 15, 29–62.
Steurer, M., Morozov, A., Janschek, K., and Neitzke, K.P.
(2018). Sysml-based profile for dependable uav design.
IFAC-PapersOnLine, 51(24), 1067–1074.
UML, O. (2001). Unified modeling language. Object
Management Group.
2019 IFAC ACA
August 27-30, 2019. Cranfield, UK
399
... The main reasons for these faults happen actually when the UAV is moving different operational environment. To observe the effect of the error propagation in the inertial navigation system, the work in [91] proposed two models that evaluate overall system reliability, probabilities of particular failures such as accelerators, gyroscope, temperature, and pressure sensors, memory, GPS, etc., which also identify critical components. ...
... In addition, they also proposed a reliability synthesis method that allows quickly making rational choices for fault-tolerant systems to meet the required level. Another research work [91] proposed MCM based on designing the flight control system for IMU, in which the authors initially created a system modeling language model and then transformed it into a Dual-Graph Error Propagation Model. Finally, the MCM model was used to evaluate the system dependability matrices. ...
... The reliability threats for computing platforms are mainly due to radiation-induced faults such as SEUs (soft errors) and hardware permanent errors, e.g., the ones due to the nanoelectronics aging phenomenon [87][88][89]. Another important threat may happen in the sensors and actuator layers due to the electro-mechanical fault and harsh environment [86,91]. A UAV system may face system failure due to the effect of any of the above two failures or both. ...
Article
Full-text available
This study describes the Computing Platforms (CPs) and the hardware reliability issues of Unmanned Aerial Vehicles (UAVs), or drones, which recently attracted significant attention in mission and safety-critical applications demanding a failure-free operation. While the rapid development of the UAV technologies was recently reviewed by survey reports focusing on the architecture, cost, energy efficiency, communication, and civil application aspects, the computing platforms’ reliability perspective was overlooked. Moreover, due to the rising complexity and diversity of today’s UAV CPs, their reliability is becoming a prominent issue demanding up-to-date solutions tailored to the UAV specifics. The objective of this work is to address this gap, focusing on the hardware reliability aspect. This research studies the UAV CPs deployed for representative applications, specific fault and failure modes, and existing approaches for reliability assessment and enhancement in CPs for failure-free UAV operation. This study indicates how faults and failures occur in the various system layers of UAVs and analyzes open challenges. We advocate a concept of a cross-layer reliability model tailored to UAVs’ onboard intelligence and identify directions for future research in this area.
... These faults primarily occur when the UAV operates in various environments. The authors of [17] presented two models to evaluate the system's overall reliability and investigate the consequences of error propagation in the inertial navigation system. These models assess the likelihood of various failures such as accelerometers, gyroscopes, temperature and pressure sensors, memory, GPS, etc., and also identify the critical components. ...
... Faults in these external sensors cause mission failure when the UAV is operating in a different environment [9]. These EIs can access the external sensor data to monitor EF information [17]. ...
Preprint
Cyber-Physical Systems (CPSs), such as Unmanned Aerial Vehicles (UAVs), use System-on-Chip (SoC) based computing platforms to perform multiple complex tasks in safety-critical applications that require a highly dependable operation. Due to continuous technological manufacturing miniaturization SoCs face a wide spectrum of chip-level reliability issues such as aging, soft and hard errors during the operational lifetime of a UAV. In addition, external (off-chip) faults in the sensors, actuators, and motors are another cause of UAV failures. While existing works examine either on-chip faults (internal) or sensors/actuators faults (external) separately, this research proposes a UAV health monitoring infrastructure considering both external and internal faults holistically. The proposed method relies on the IEEE 1687 standard (IJTAG) and employs on-chip embedded instruments as health monitors to instantly access external and internal sensor data. Experimental results for functional simulation of a real-life case-study design demonstrate both types of fault detection by serving only three clock cycles and the localization process using 16 and 30 clock cycles for the case of single and double faults, respectively.
... The proposed method offered a systematic way to derive DEPM graphs from corresponding system representations, fundamental to model-based systems engineering (MBSE) [5]. DEPM has since been applied in domains including avionics ( [6], [7], [8], [9], [10], [11]), automotive and robotics ( [5], [12], [13], [14]), embedded software( [15], [16], [17], [18]), and medical systems( [19], [20]). Within nuclear safety, recent endeavors have leveraged DEPM for reliability analysis [21], software CCF modeling [22] and as a hardware-software failure simulation tool for dynamic PRA( [23], [24]). ...
Conference Paper
Advanced nuclear reactors are ushering in a transformative era in the nuclear energy sector, characterized by generational advancements in safety, efficiency, and sustainability. Central to realizing the advanced reactor philosophy is the integration of digital instrumentation and control (I&C) systems, which are critical for enhancing the operational integrity, economic viability, and safety of advanced reactor designs. Designed to be vastly more complex than their analog counterparts, digital I&Cs offer comparatively superior control, diagnostic capabilities, and adaptability. However, added complexity makes failures of such systems inherently hard to describe, let alone predict, due in part to their potential for propagating internal errors in manners that are often unintuitive and opaque. Consequently, the use of digital I&Cs poses new challenges to qualifying reactor safety. Addressing these challenges requires a fundamental rethinking of probabilistic failure modeling. The dual error propagation method (DEPM) is a stochastic technique that allows us to induce and track failure behavior by explicitly representing a system in terms of its control and data flows – two attributes that are sufficient to adequately describe combinatorial logic. However, the physical processes underlying digital systems like all natural processes are inherently coincident. So far the nature of abstraction requires DEPM models to be sequential and time-agnostic leading to fundamental inconsistencies that are accepted nonetheless by relaxing modeling assumptions. This paper is an extension to our previous work introducing the concept of multiple control paths within DEPM aimed at addressing its sequential limitations. Through a case study on basic digital logic building blocks we showcase the improved expressivity afforded by multi-control DEPM emphasizing its potential and limitations in supporting the reliability analysis of larger more complex digital I&Cs. Our analysis demonstrates that while multi-control DEPM is feasible it is fundamentally limited by its inability to explicitly model time consequently accurately modeling race conditions concurrency and synchronization without compromising modeling assumptions remains unachievable. Furthermore multi-control DEPM is computationally expensive due to an exponential increase in the number of modeled states with each added control flow the issue of this so-called state-space explosion stays largely unresolved within DEPM as a whole. Given these limitations we conclude by proposing future research directions including the exploration of alternative time-explicit modeling techniques and strategies for managing model complexity with a deeper case study to follow.
... The proposed method offered a systematic way to derive DEPM graphs from corresponding system representations, fundamental to model-based systems engineering (MBSE) [5]. DEPM has since been applied in domains including avionics ( [6], [7], [8], [9], [10], [11]), automotive and robotics ( [5], [12], [13], [14]), embedded software( [15], [16], [17], [18]), and medical systems( [19], [20]). Within nuclear safety, recent endeavors have leveraged DEPM for reliability analysis [21], software CCF modeling [22] and as a hardware-software failure simulation tool for dynamic PRA( [23], [24]). ...
Preprint
Advanced nuclear reactors are ushering in a transformative era in the nuclear energy sector, characterized by generational advancements in safety, efficiency, and sustainability. Central to realizing the advanced reactor philosophy is the integration of digital instrumentation and control (I&C) systems, which are critical for enhancing the operational integrity, economic viability, and safety of advanced reactor designs. Designed to be vastly more complex than their analog counterparts, digital I&Cs offer comparatively superior control, diagnostic capabilities, and adaptability. However, added complexity makes failures of such systems inherently hard to describe, let alone predict, due, in part, to their potential for propagating internal errors in manners that are often unintuitive and opaque. Consequently, the use of digital I&Cs poses new challenges to qualifying reactor safety. Addressing these challenges requires a fundamental rethinking of probabilistic failure modeling. The dual error propagation method (DEPM) is a stochastic technique that allows us to induce and track failure behavior by explicitly representing a system in terms of its control and data flows-two attributes that are sufficient to adequately describe combinatorial logic. However, the physical processes underlying digital systems, like all natural processes, are inherently co-incident. So far, the nature of abstraction requires DEPM models to be sequential and time-agnostic, leading to fundamental inconsistencies that are accepted nonetheless by relaxing modeling assumptions. This paper is an extension to our previous work introducing the concept of multiple control paths within DEPM, aimed to address its sequential limitations. Through a case study on basic digital logic building blocks, we showcase the improved expressivity afforded by multi-control DEPM, emphasizing its potential and limitations in supporting the reliability analysis of larger, more complex digital I&Cs. Our analysis demonstrates that while multi-control DEPM is feasible, it is fundamentally limited by its inability to explicitly model time. Consequently, accurately modeling race conditions, concurrency, and synchronization without compromising modeling assumptions remains unachievable. Furthermore, multi-control DEPM is computationally expensive due to an exponential increase in the number of modeled states with each added control flow. The issue of this so-called state-space explosion stays largely unresolved within DEPM as a whole. Given these limitations, we conclude by proposing future research directions, including the exploration of alternative time-explicit modeling techniques and strategies for managing model complexity, with a deeper case study to follow.
... Steurer et al. [130] proposes a new Systems Modeling Language (SysML)-based method for the analysis of the reliability of unmanned aerial vehicles (UAVs). This method consists of three main stages, one of which the Dual-Graph Error Propagation Model (DEPM)-based evaluation of system reliability criteria using Markov chain models and advanced techniques for probabilistic model evaluation is the third stage. ...
Chapter
Full-text available
One of the most critical factors in using systems is their reliability. The reliability study aims to help understand the probable nature of equipment failure during use and establish reliability models. Predicting the reliability of a system is related to the future of the components. Therefore, it is probable, so reliability is mainly modeled in the literature using the concepts of probabilities and stochastic processes. The Markov process usually describes the time-dependent behavior of systems. Markov and semi-Markov models have been used in various fields for reliability widely. In this chapter, reliability in different systems is first described, and then the system failure process is defined. The following is a review of the literature on reliability focusing on Markov and semi-Markov models. At the end of the chapter, the conclusions and future research trends are stated.
... Technically a DEPM is transformed into one or several PRISM models for numerical evaluation with stochastic model checkers like PRISM [15] or STORM [9]. UAV-specific model-based dependability analysis: In our previous work, we presented how the SysML model can be transformed into the DEPM with the help of the SysML UDP [30] and proved the scalability to complex systems [31]. Since UAVs are often used to fulfill phased missions, we extended the approach towards phased-mission systems [32]. ...
Preprint
A failure in the behavior of safety-critical systems such as Unmanned Aerial Vehicles (UAV) may lead to hazards and serious consequences. Therefore UAVs must perform correctly throughout their intended life cycle. However, the functional system specification is not sufficient to ensure proper performance. The assessment of non-functional reliability, safety, or timing requirements in the early development phases helps to prevent conceptually wrong decisions. UAVs usually contain complex and tightly coupled hardware and software components that influence each other. Hence, a holistic approach to analyze the non-functional properties is preferable. The concept of phased mission systems enables the specification of individual requirements for different phases. This allows more accurate predictions of the success or failure of a mission. This paper presents a holistic model-based dependability assessment approach that supports system-software co-engineering for UAVs. The approach combines stochastic timing analysis and the data error propagation analysis and enables the consideration of phase-specific requirements as well as the identification, formalization, and analysis of timing properties. The solution proposed in this paper is centered around the Systems Modeling Language (SysML) and the systematic transformation of annotated SysML models to Probabilistic Model Checking (PMC) accessible Markov chain models.
Preprint
Assessment of non-functional reliability and safety requirements in the early development phases helps to prevent conceptually wrong decisions and, as a consequence, significantly reduces overall development costs. The application of model-based system analysis techniques demonstrates promising results for complex avionics systems, especially software-intensive Unmanned Aerial Vehicles (UAV). Such systems are commonly designed to accomplish a specific mission consisting of multiple mission phases. The concept of phased mission systems enables the specification of individual requirements for different phases. For instance, the reliability requirements or system specifications are different for UAV flights over an agricultural field and a highway. Therefore, modern analytical methods have to distinguish between different mission phases and enable the analysis of phased missions. In this paper, we propose a new model-based method that allows system engineers to assess a conceptional design specification of the UAV concerning the fulfillment of phase-specific requirements. The proposed approach exploits modern probabilistic model checking techniques for the quantification of several dependability metrics. The method supports the systematic analysis of system specifications that contain both structural and behavioral system properties. A case study demonstrates the feasibility of the proposed method.
Conference Paper
Full-text available
Model-based systems engineering approaches are commonly used for the development of various heterogeneous mechatronic systems including nowadays popular Unmanned Aerial Vehicles (UAV). As a rule, the dependability analysis is carried out during the system integration phase of the UAV development. Therefore, fundamental design drawbacks might be uncovered too late leading to serious and costly rework. This paper presents a new methodology for the early dependability analysis of a UAV system applicable during the design phase. The proposed dependability analysis is based on the Dual-graph Error Propagation Model (DEPM), a stochastic model of system design aspects that influence error propagation processes: control flow, data flow, and component-level reliability properties. A new domain-specific Systems Modeling Language (SysML) profile and the transformation algorithm from the SysML model to the DEPM are introduced as two key parts of this methodology as well as the main research results of this paper. The relevant technical details of the modeling workflow are demonstrated with a case study UAV that explains how to design a UAV using the introduced SysML profile, transform the SysML model into the DEPM, and evaluate dependability properties.
Conference Paper
Full-text available
SysML is a graphical modeling language that is mostly used for the graphical representation of real-time systems, complex systems, safely critical systems, and embedded systems. In this paper, we present a methodology based on model checking tool for the correction and verification of SysML internal block diagram with discrete time constraint. We describe the mapping of SysML internal block diagram to PRISM input language and use Probabilistic Computational Tree Logic (PCTL) for the verification of properties. The methodology provides more reliable and quick results for the development of real time systems as PRISM supports parallel composition of components. Finally, we present the effectiveness of our approach with the help of a case study of real-time system. The discrete time factor is included in the case study to evaluate the performance characteristics of system functionality.
Conference Paper
Full-text available
In this paper, a methodology is proposed to integrate safety analysis within a systems engineering approach. This methodology is based on SysML models and aims at generating (semi-) automatically safety analysis artifacts, mainly FMEA and FTA, from system models. Preliminary functional and component FMEA are automatically generated from the functional and structural models respectively, then completed by safety experts. By representing SysML structural diagram as a directed multi-graph, through a graph traversal algorithm and some identified patterns, generic fault trees are automatically derived with corresponding logic gates and events. The proposed methodology provides the safety expert with assistance during safety analysis. It helps reducing time and error proneness of the safety analysis process. It also helps ensuring consistency since the safety analysis artifacts are automatically generated from the latest system model version. The methodology is applied to a real case study, the electromechanical actuator EMA.
Conference Paper
We launch the new probabilistic model checker Storm. It features the analysis of discrete- and continuous-time variants of both Markov chains and MDPs. It supports the Prism and JANI modeling languages, probabilistic programs, dynamic fault trees and generalized stochastic Petri nets. It has a modular set-up in which solvers and symbolic engines can easily be exchanged. It offers a Python API for rapid prototyping by encapsulating Storm’s fast and scalable algorithms. Experiments on a variety of benchmarks show its competitive performance.
Chapter
Systems modeling language (SysML) [187] is a modeling language dedicated to systems engineering applications . It is a UML profile that not only reuses a subset of UML 2.1.1 [186] but also provides additional extensions to better fit SE’s specific needs. These extensions are mainly meant to address the requirements stated in the UML for SE request for proposal (RFP) [177]. It is intended to help specify and architect complex systems and their components and enable their analysis, design, and verification and validation . These systems may consist of heterogeneous components such as hardware , software , information, processes, personnel, and facilities [187].
Article
Fault tree analysis (FTA) is a very prominent method to analyze the risks related to safety and economically critical assets, like power plants, airplanes, data centers and web shops. FTA methods comprise of a wide variety of modeling and analysis techniques, supported by a wide range of software tools. This paper surveys over 150 papers on fault tree analysis, providing an in-depth overview of the state-of-the-art in FTA. Concretely, we review standard fault trees, as well as extensions such as dynamic FT, repairable FT, and extended FT. For these models, we review both qualitative analysis methods, like cut sets and common cause failures, and quantitative techniques, including a wide variety of stochastic methods to compute failure probabilities. Numerous examples illustrate the various approaches, and tables present a quick overview of results.
Article
This paper addresses a probabilistic approach to error propagation analysis of a mechatronic system. These types of systems require highly abstractive models for the proper mapping of the mutual interaction of heterogeneous system components such as software, hardware, and physical parts. A literature overview reveals a number of appropriate error propagation models that are based on Markovian representation of control flow. However, these models imply that data errors always propagate through the control flow. This assumption limits their application to systems, in which components can be triggered in arbitrary order with non-sequential data flow. A motivational example, discussed in this paper, shows that control and data flows must be considered separately for an accurate description of an error propagation process.
Conference Paper
Comprehensive analytic model for system availability analysis often confronts the largeness issue where a system designer cannot easily handle the model and the solution is not given in a feasible solution time. Hierarchical decomposition of a large state-space model gives a promising solution to the largeness issue when the model is decomposable. However, the decomposability of analytic model is not always manually tractable especially when the model is generated in an automated manner. In this paper, we propose an automated model composition technique from a system design to a hierarchical stochastic model which is the judicious combination of combinatorial and state-space models. In particular, from SysML-based system specifications, a top-level fault tree and associated stochastic reward nets are automatically generated in hierarchical manner. The obtained hierarchical stochastic model can be solved analytically considerably faster than monolithic state-space models. Through an illustrative example of three-tier web application system on a virtualized infrastructure, the accuracy and efficiency of the solution are evaluated in comparison to a monolithic state space model and a static fault tree.