Conference PaperPDF Available

Scenario Factory: Creating Safety-Critical Traffic Scenarios for Automated Vehicles

Authors:

Abstract and Figures

The safety validation of motion planning algorithms for automated vehicles requires a large amount of data for virtual testing. Currently, this data is often collected through real test drives, which is expensive and inefficient, given that only a minority of traffic scenarios pose challenges to motion planners. We present a workflow for generating a database of challenging and safety-critical test scenarios that is not dependent on recorded data. First, we extract a large variety of road networks across the globe from OpenStreetMap. Subsequently, we generate traffic scenarios for these road networks using the traffic simulator SUMO. In the last step, we increase the criticality of these scenarios using nonlinear optimization. Our generated scenarios are publicly available on the CommonRoad website.
Content may be subject to copyright.
Scenario Factory: Creating Safety-Critical Traffic Scenarios
for Automated Vehicles
Moritz Klischat*, Edmond Irani Liu*, Fabian H¨
oltke, and Matthias Althoff
Abstract The safety validation of motion planning algo-
rithms for automated vehicles requires a large amount of
data for virtual testing. Currently, this data is often collected
through real test drives, which is expensive and inefficient,
given that only a minority of traffic scenarios pose challenges
to motion planners. We present a workflow for generating a
database of challenging and safety-critical test scenarios that
is not dependent on recorded data. First, we extract a large
variety of road networks across the globe from OpenStreetMap.
Subsequently, we generate traffic scenarios for these road
networks using the traffic simulator SUMO. In the last step,
we increase the criticality of these scenarios using nonlinear
optimization. Our generated scenarios are publicly available
on the CommonRoad website.
I. INTRODUCTION
Virtual testing is an important tool for validating the
safety of automated vehicles, as it exposes potential defects
of the algorithms under test. Having a large variety of
challenging traffic scenarios is vital for effective and efficient
testing of motion planning algorithms. While carrying out
simulations using data recorded from test drives provides us
with realistic scenarios, the required data collection is often
overly expensive and time-consuming [1]. Even though the
number of publicly-available datasets has increased over the
last few years, e.g., [2]–[5], they usually feature only a small
number of maps and require much effort to record.
Our framework generates a database of safety-critical
scenarios for scenario-based testing [6] of motion planing
algorithms for automated vehicles. It consists of
1) Extracting interesting road intersections worldwide
from OpenStreetMap (OSM) [7] by using our Globe-
trotter tool (see Sec. III).
2) Generating safety-critical test scenarios by first pop-
ulating the extracted intersections with traffic partici-
pants through the traffic simulator SUMO [8].
3) Optimizing the criticality of the obtained scenarios by
using a generalizable criticality criterion (see Sec. IV).
A. Related Work
Below, we concisely review related works on approaches
towards automatically creating virtual representations of road
networks and generating critical test scenarios for automated
vehicles.
*The first two authors have contributed equally to this work.
All authors are with the Department of Informatics, Technical University
of Munich, 85748 Garching, Germany.
{moritz.klischat, edmond.irani, fabian.hoeltke,
althoff}@tum.de
1) Creating road networks: Generative approaches con-
struct road networks, e.g., based on abstract specifications
[9]. Moreover, a suite of road networks with a defined cover-
age of road curvatures is generated in [10] using satisfiability
modulo theories. Road networks that lead to the failure of
lane-keeping assistants are generated procedurally in [11]
through mutating road networks using genetic algorithms.
Alternatively, road networks can also be created from
external sources. In [12]–[16], the authors extract road
networks from aerial and satellite images with the help of
computer vision techniques. These works are capable of ex-
tracting high-level geometric information of road networks;
however, lane-level information concerning motion planners
of automated vehicles is not reconstructed. Promising works
towards the reconstruction of road networks with lane-level
detail from aerial images can be seen in [17], [18]. Similarly,
while creating road networks for single lanes from OSM data
is straightforward [19], creating those with lane-level detail
is a more challenging problem [20].
2) Generating critical test scenarios: Creating test sce-
narios for automated vehicles using traffic simulators is
proposed, e.g., in [21], [22]. Realistic scenarios can be
obtained by calibrating their simulations through real-world
measurements. By using criticality metrics, safety-critical
scenarios are filtered [22]; however, these situations occur
only rarely, in the main.
To efficiently obtain critical scenarios, importance sam-
pling from large databases of recorded traffic data is pro-
posed [23]. Criticality metrics are combined with the occur-
rence rates to efficiently sample critical scenarios representa-
tive of real-world driving conditions [24]. Other approaches
use optimization to create critical scenarios based on these
metrics [25], [26]. Similarly, falsification methods can detect
scenarios that falsify a motion planner with respect to a given
safety specification [27], [28]. Parameter regions for critical
scenarios based on constraint satisfaction are computed in
[29]. In our previous work, we presented an optimization-
based method to increase the criticality of initially uncritical
traffic scenarios [30], [31] by decreasing the space of possible
solutions for the vehicle under test, called the drivable area.
B. Contributions
In contrast to previous work on generating test scenarios
through simulation, our approach is particularly efficient
since we combine the automatic extraction of road networks
from OSM with a traffic simulation followed by an increase
of its criticality. By exploiting the large variety of road
networks around the world, we create complex, yet realistic
road networks which currently no procedural map generator
is capable of producing. In contrast to existing work, our
approach does not rely on test drives nor other real-world
traffic data, thus it is able to efficiently create many traffic
scenarios at low costs. The resulting scenarios are indepen-
dent of the vehicle under test, due to our criticality metric
based on the drivable area.
II. OVERVIEW
An overview of our approach is presented in Fig. 1. In the
following subsections, we introduce each component.
Fig. 1. Our pipeline for generating safety-critical scenarios.
A. Platforms
1) CommonRoad: The CommonRoad (CR) benchmark
suite1is an open-source framework that provides a collection
of traffic scenarios for motion planning algorithms. Scenarios
in CommonRoad consist of road networks,static obstacles,
and dynamic obstacles that represent all possible types of
traffic participants. In this work, we focus on cars, trucks,
and bicycles. Road networks in CommonRoad are described
by lanelets [32] (see Fig. 2). Lanelets are defined by their left
and right bounds, which are modeled by polylines. Further-
more, lanelets are connected through successor-predecessor
and lateral-adjacency relations and contain additional in-
formation such as the speed limit. Additionally, we define
forking points as the points on the centerlines of lanelets
where lanelets split or multiple lanelets merge.
2) OpenStreetMap: OSM is an open-source project that
provides geographic data worldwide. The main structure
of OSM data is defined by three elements: nodes,ways,
and relations.Nodes are geographic points defined by their
latitude and longitude; ways are tuples of nodes and represent
elements such as roads and boundaries of areas; relations are
groups of nodes,ways and other relations. Fig. 3a shows a
map taken from OpenStreetMap.
1https://commonroad.in.tum.de/
Fig. 2. Lanelet network representation.
3) GeoNames: GeoNames2is a free geographic database
which covers all countries and contains over 11 million
placenames of cities from all over the world. The provided
geographical information includes global coordinates, postal
codes, population, etc.
4) SUMO: This open-source microscopic traffic-
simulation package is designed to handle large road
networks. SUMO models individual vehicles and their
interactions using models for car-following, lane-changing,
and intersection behavior.
B. Converters
Since most of the software platforms mentioned above
have individual formats and map representations, we use
different converters and interfaces to bridge these platforms.
1) OSM2CR converter: It converts OSM maps to Com-
monRoad lanelet networks. While OSM provides map data
for almost any place in the world, their level of detail is
not yet suited for automated vehicles: The motion planners
of automated vehicles and traffic simulators typically require
lane-level information. To resolve this issue, in the first step,
the topology of the lanelet network, i.e., the connections at
intersections, needs to be estimated. Next, spatial information
of individual lanes is deducted accordingly. Fig. 3b shows a
converted lanelet network via this converter.
2) CR-SUMO interface: It enables the communication
between CommonRoad and SUMO by a) converting the CR
road network to SUMO format, b) generating configuration
files for the simulation, and c) converting simulated vehicle
trajectories to the CR format. We refer the interested reader
to [33] for more details regarding this interface.
III. GLO BET ROTTE R
To automatically extract interesting road networks from
all over the world, we have developed the Globetrotter
tool, which takes the road network data from OSM as its
underlying input. As we want to create scenarios on distinct
road networks, we mainly focus on extracting intersections.
Below, we explain the major steps for extracting the inter-
sections from OSM.
2https://www.geonames.org/
(a)
(b)
Fig. 3. (a) Map of Encamp, Andorra taken from OSM. (b) Conversion
result into CommonRoad lanelet network via the OSM2CR converter.
A. Retrieving Candidate Regions
Clearly, there are intersections all over the globe, and they
mostly vary according to region. Given that only 29%of
the Earth’s surface is covered by land3, and that 10%of
these regions accommodate 95%of the human population4,
sampling the Earth’s surface with random coordinates is
not very efficient. Assuming that intersections mostly occur
near populated areas, we retrieve these populated candidate
regions from GeoNames. To speed up the processing in the
next steps, we can also divide the region into smaller subre-
gions if the area of the region exceeds a certain value. The
retrieved candidate (sub)regions are converted into lanelet
networks via the OSM2CR converter.
B. Extracting Intersections
We denote the n-th forking point and the tuple of all
forking points in a lanelet network as Pnand P, respectively.
For a given lanelet network, it is usually difficult to determine
beforehand the number of intersections to be extracted. For
this reason, instead of k-means-like algorithms [34], we
apply the hierarchical agglomerative clustering (HAC) algo-
rithm [35] to P. HAC only requires a distance threshold dth
to limit the distances between clusters: a higher dth entails
larger intersections. Alg. 1 describes how the intersections
are extracted from P.
1) Clustering forking points (Alg. 1, lines 2-4): Initially,
each forking point forms a cluster Cnwith it being the only
3https://www.noaa.gov/
4https://ec.europa.eu/jrc/en
Algorithm 1 Extracting Intersections
Inputs: forking points P, distance threshold dth
Output: extracted intersections I
1: I ← ∅
2: Clustering forking points
3: C ← INITIALIZE(P)
4: C ← HAC(C,dth )
5: Creating intersections
6: for C0∈ C do
7: ICUT LANE LETS(C0)
8: IPOS TPROC ESS (I)
9: I ← I ∪ {I}
10: end for
11: return I
member: Cn={Pn}. We denote the tuple of clusters by
C:= hC1, C2, . . . i, and the distance between two clusters
Ci, Cjwith single-linkage setting [35] by di,j :
di,j = min{dist(a, b)|aCi, b Cj},
where the operator dist(·)returns the Euclidean distance
between two given forking points. In each iteration, the two
clusters Ci, Cjwith the minimum distance di,j < dth are
merged into a new cluster C0=CiCj. This process is
repeated until no more clusters can be merged. Fig. 4a-4b
show the dendrogram for clustering an exemplary lanelet
network and the clustered forking points.
2) Creating intersections (Alg. 1, lines 5-10): For each
remaining cluster C0∈ C, we determine the minimum radius
rmin of a circle enclosing all forking points within the cluster.
We enlarge this radius by a user-defined margin rmgn to
span a region of interest. We cut out all lanelets from this
region, resulting in an intersection I, and additionally apply
the following steps:
1) Lanelets that are not within the region of interest are
removed.
2) Due to removed lanelets, we update the successor-
predecessor and lateral-adjacency relations.
Fig. 4c shows the extracted intersections I.
C. Selecting Interesting Intersections
Given the intersections Iextracted from a lanelet network,
we only keep those that are particularly interesting or distinct
according to the following features:
number of forking points;
number of lanelets;
number of crossing lanelets;
number of predecessors and successors;
area of lanelets;
density of lanelets;
angle between lanelets; and
mean distance between forking points and their cen-
troid.
We associate the interestingness of intersections with
the dissimilarity between their features and those of other
dth
0
10
20
30
40
50
60
70
5 6 720 3
1 4 15 16 11 12 13 14 10 89
ID of forking points
Distance [m]
(a)
0
1
34
2
5
678
10
11
12
13 14
15
16
radius = rmin radius = rmin +rmgn
9
(b)
(c)
Fig. 4. (a) Dendrogram of the clustering result. Three clusters are generated
with dth set to 35 meters. (b) Forking points within one cluster have the
same color. rmgn is set to 15 meters. (c) Intersections extracted from the
input lanelet network.
intersections. By doing so, we turn the selection of interesting
intersections into a multivariate outlier (anomaly) detection
problem. To solve this problem, we use the isolation forest
(iForest) algorithm [36], since it is unsupervised, capable
of efficiently handling multiple dimensions of features, and
requires limited effort to hand-tune its parameters. In the
training phase, a total of kisolation trees (iTrees) are trained
with sets of randomly-selected intersections; in the detection
phase, an anomaly score s[0,1] is assigned to each
intersection by the iTrees [36], where an intersection with
a score above a threshold sth is considered an outlier. Fig. 5
presents a collection of distinct intersections. It should be
recalled that we divide the candidate region into subregions
if it is overly large, thus rendering the adopted iForest
algorithm computationally tractable.
IV. GEN ERATI ON OF SA FET Y-CRITICAL SCENA RIO S
On the extracted maps, we simulate traffic participants
using our previously-introduced CR-SUMO interface next.
Since scenarios simulated with SUMO often yield uncritical,
easy-to-solve motion planning problems, we subsequently
increase their criticality using our approach [30], [31] for
reducing the solution space. We first parametrize the initially
obtained trajectories of other traffic participants in Sec. IV-
B and, following that, formulate a nonlinear optimization
problem with a criticality criterion specified in Sec. IV-D.
A. Motion Planning Problem
The system dynamics of the ego vehicle is defined by
˙xe(t) = f(xe(t), u(t)),
where xe(t)Rnis the state vector and u(t)∈ U is the
input vector with the set of admissible inputs U Rm.
The trajectories of ntp other traffic participants are given
by xi(t;p),i∈ {1, ..., ntp}with parameters pRnp,
initial time t0, and final time tf. Initial candidates for these
trajectories are obtained from SUMO; their parametrization
is explained in more detail in Sec. IV-B. The occupied space
Oi(t;p)R2of a traffic participant is obtained through the
occ(·)operator, i.e., Oi(t;p) = occ(xi(t;p)). We define the
motion planning problem for the ego vehicle as a classical
reach-avoid problem: given an initial state xe,0=xe(t0),
an input trajectory u(t)has to be found to steer the ego
vehicle into a goal region while not leaving the road surface
Wlanes R2and avoiding the space O(t;p)occupied by all
obstacles, i.e.,
t[t0, tf] : occ(xe(t)) ⊆ Wlanes\O(t;p).(1)
We obtain a motion planning problem by deleting a
selected vehicle in a scenario simulated with SUMO and
storing the initial state xe,0of this vehicle. Vehicles with
interesting maneuvers are automatically selected by using
thresholds on the velocity and acceleration profiles or by
identifying lane changes, turns or vehicles driving nearby.
B. Scenario Parametrization
In order to optimize the criticality of scenarios, we
parametrize trajectories by the parameter vector p. We de-
scribe trajectories in lane-based coordinate systems, in which
a state is defined as x= [sξ,˙sξ, sη,˙sη]T. The subscripts ξ
and ηdenote the longitudinal and lateral coordinates with
respect to the centerline, respectively (see Fig. 2).
For the ntp traffic participants, we only parametrize the
longitudinal trajectory using translations psRntp , initial
velocity variations pvRntp, and acceleration variations
paRntp , yielding p=ps, pv, paT. The parametrized
longitudinal position trajectory is given by
sξ,i(t;pi) = ˆsξ,i(t) + ps
i+pv
it+1
2pa
it2.(2)
From (2), the centerlines, and the dimensions of the vehi-
cle, we obtain the occupied space Oi(t, p)of each traffic
participant.
Fig. 5. Selected road intersections generated by Globetrotter (sth = 0.9).
C. Drivable Area
We denote a feasible solution to the motion planning
problem defined in Sec. IV-A as χ(t;x0, u(·)), where u(·)
refers to the entire trajectory instead of a particular value
u(t)at time t. To quantify the criticality of a scenario, we use
the solution space, which corresponds to the set of reachable
states for t[t0, tf]without collisions:
R(t;x0,O(·;p)) = χ(t;x0, u(·))
τ[t0, tf] : u(τ)∈ U,
occχ(τ;x0, u(·))⊆ Wlanes\O(τ;p).
By applying the projection operator proj(x) : RnR2,
which projects the state space to the position domain, we
obtain the drivable area
D(t;x0,O(·;p)) = [
x∈R(t;x0,O(·;p))
proj(x).(3)
An example for the drivable area in presence of an obstacle
is depicted in Fig. 6.
drivable area D(t;x0,O(·;p)) obstacle Oi(t)
initial state ego xe,0
feasible trajectory χ(t;x0, u(·))
Fig. 6. Example of a drivable area for the time interval [t0, tf].
To quantify the solution space, we introduce the function
area(X)returning the area of a set. We write
A(t;p):= areaD(t;x0,O(·;p))
to obtain the area profile of the drivable area over time. We
compute the drivable area using our approach as in [37].
D. Optimization Problem
For increasing the criticality of the motion planning prob-
lem, we optimize the parameter vector pto obtain a desired,
critical area profile Acrit(t):
argmin
p
κ(p), κ(p) = Ztf
0A(t;p)Acrit(t)2
(4)
subject to t, i, j6=i:Oi(t;p)∩ Oj(t;p) = .
(5)
The constraint in (5) ensures that no traffic participants
collide with each other. In this work, we use the drivable
area computed without any traffic participants and the scalar
γ]0,1[ which quantifies the reduction of the drivable area:
Acrit(t) = γ·area(D(t;x0,)).
Since the drivable area is highly nonlinear with respect
to the trajectories of other traffic participants and possibly
subjected to local minima, we use particle swarm optimiza-
tion [38] as in our previous work [30]. Furthermore, we
implement a repair algorithm that enforces the collision
constraint (5). To that end, we formulate the collision con-
straints as linear inequality constraints and correct infeasible
solutions by computing the closest feasible solution using
linear programming. A more efficient optimization is ensured
by an a priori computation of relevant parameter intervals as
presented in [30].
V. EVAL UATI ON
We demonstrate our approach by generating scenarios
on a large variety of road networks from various places
across the world. First, we obtain 576 road networks from
8 countries and 46 cities from Globetrotter, for each of
which we simulate multiple scenarios using our CR-SUMO
interface. After selecting interesting ego vehicles, we obtain
1402 scenarios for which we optimize the criticality. The
resulting scenarios are added to our website5.
5https://commonroad.in.tum.de/
In Fig. 7a we compare the area profiles A(t;p)of the
drivable area in the optimized scenarios against the initial
scenario obtained from SUMO: our approach is able to
significantly decrease the drivable area, and it thus increases
the criticality. Fig. 7b shows the distribution of the achieved
reduction of the critical area. For most of the optimized
scenarios, the drivable area ranges between 0.20.3of its
initial size.
TABLE I
PARAMETERS FOR CRITICALITY OPTIMIZATION
Drivable area computation
max. acceleration ego vehicle |amax|5.0 m/s2
time step size t0.1 s
time horizon tf3.4 s
Constraints for optimization
initial velocity variation [3,3] m/s
acceleration variation [5,2] m/s2
0 1 2 3
time [s]
0
50
100
drivable area [m2]
initial area
optimized area
(a)
0.00 0.25 0.50 0.75 1.00
relative size of the drivable area
0
100
200
300
frequency
(b)
Fig. 7. (a) Size of the drivable area A(t;p)over time, averaged over all
scenarios. (b) Histogram of the size of the drivable area in the optimized
scenarios relative to the initial scenarios.
Let us present some concrete examples for demonstrating
our algorithm. The first example is an intersection from
the town Pula, Croatia. In Fig. 8, we compare the drivable
area of the initial scenario obtained from SUMO with the
optimized scenario. Note that we restrict the allowed road
surface Wlanes to lanelets that the ego vehicle is allowed
to drive in. In the initial scenario, the ego vehicle could
either turn freely to the right or drive straight. However, after
the optimization, two turning vehicles and a bicycle restrict
possible maneuvers of the ego vehicle.
The second example is a four-way intersection from the
town Putte, Belgium. In the optimized scenario, the ego vehi-
cle must either respect an oncoming vehicle when turning left
or a bicycle when driving straight. As a result, the drivable
area is split into two parts, as shown in Fig. 9.
VI. CONCLUSIONS
We present an approach to automatically generate a large
number of test scenarios for automated vehicles. Our results
show that we are able to extract a large number of distinct
road networks from OpenStreetMaps, for which we simulate
traffic scenarios using the traffic simulator SUMO. Our
approach subsequently yields challenging scenarios by de-
creasing the solution space for motion planning algorithms.
The generated, publicly-available scenarios render the
virtual testing of motion-planning algorithms in challenging
initial position
ego vehicle
t= 0.0 s
t= 1.5 s
drivable areavelocity
t= 3.2 s
(a) Initial scenario from simulation. (b) Optimized, more critical scenario.
Fig. 8. Example 1: Comparison of the drivable areas at different times.
t= 0.0 s
initial position
ego vehicle
t= 1.7 s
drivable area
velocity
t= 3.3 s
(a) Initial scenario from simulation. (b) Optimized, more critical scenario.
Fig. 9. Example 2: Comparison of the drivable areas at different times.
situations easier. In the future, the explicit consideration of
traffic rules during the generation of our critical scenarios
will further improve our test cases.
ACK NO WL EDG MEN TS
The authors would like to thank Chuxuan Li for her
work on the SUMO interface and Maximilian Rieger for
his work on the OSM2CR converter. We gratefully ac-
knowledge the financial support from the Central Innovation
Programme of the German Federal Government under grant
no. ZF4086007BZ8 and the German Research Foundation
(DFG) within the Priority Programme SPP 1835 Cooperative
Interacting Automobiles under grant no. AL 1185/4-2.
REFERENCES
[1] N. Kalra and S. M. Paddock, “Driving to safety: How many miles of
driving would it take to demonstrate autonomous vehicle reliability?”
Transp. Res. Part A: Policy Pract., vol. 94, pp. 182–193, 2016.
[2] W. Zhan, L. Sun, D. Wang, H. Shi, A. Clausse, M. Naumann,
J. K¨
ummerle, H. K¨
onigshof, C. Stiller, A. de La Fortelle, and
M. Tomizuka, “INTERACTION dataset: An INTERnational, Adver-
sarial and Cooperative moTION dataset in interactive driving scenarios
with semantic maps,” arXiv:1910.03088, 2019.
[3] R. Kesten, M. Usman, J. Houston, T. Pandya, K. Nadhamuni, A. Fer-
reira, M. Yuan, B. Low, A. Jain, P. Ondruska, S. Omari, S. Shah,
A. Kulkarni, A. Kazakova, C. Tao, L. Platinsky, W. Jiang, and V. Shet,
“Lyft Level 5 AV dataset 2019,” https://level5.lyft.com/dataset/, 2019.
[4] P. Sun, H. Kretzschmar, X. Dotiwalla, A. Chouard, V. Patnaik,
P. Tsui, J. Guo, Y. Zhou, Y. Chai, B. Caine, V. Vasudevan, W. Han,
J. Ngiam, H. Zhao, A. Timofeev, S. Ettinger, M. Krivokon, A. Gao,
A. Joshi, Y. Zhang, J. Shlens, Z. Chen, and D. Anguelov, “Scala-
bility in perception for autonomous driving: Waymo Open dataset,”
arXiv:1912.04838, 2019.
[5] R. Krajewski, J. Bock, L. Kloeker, and L. Eckstein, “The highD
dataset: A drone dataset of naturalistic vehicle trajectories on German
highways for validation of highly automated driving systems,” in Proc.
of the IEEE Int. Conf. Intell. Transp. Syst., 2018, pp. 2118–2125.
[6] S. Riedmaier, T. Ponn, D. Ludwig, B. Schick, and F. Diermeyer,
“Survey on scenario-based safety assessment of automated vehicles,
IEEE Access, vol. 8, pp. 87 456–87 477, 2020.
[7] M. Haklay and P. Weber, “OpenStreetMap: User-generated street
maps,” IEEE Pervasive Comput., vol. 7, no. 4, pp. 12–18, 2008.
[8] M. Behrisch, L. Bieker, J. Erdmann, and D. Krajzewicz, “SUMO–
simulation of urban mobility: an overview,” in Proc. of Int. Conf. Adv.
Syst. Simul., 2011, pp. 63–68.
[9] C. Campos, J. M. Leit˜
ao, J. P. Pereira, A. Ribas, and A. F. Coelho,
“Procedural generation of topologic road networks for driving simu-
lation,” in Iberian Conf. Inf. Syst. Technol., 2015, pp. 1–6.
[10] B. Kim, A. Jarandikar, J. Shum, S. Shiraishi, and M. Yamaura, “The
SMT-based automatic road network generation in vehicle simulation
environment,” in Proc. of the ACM Int. Conf. Embed. Softw., 2016,
pp. 1–10.
[11] A. Gambi, M. Mueller, and G. Fraser, “Automatically testing self-
driving cars with search-based procedural content generation,” in Proc.
of the 28th ACM SIGSOFT Int. Symposium on Software Testing and
Analysis, 2019, pp. 318–328.
[12] G. M´
attyus, W. Luo, and R. Urtasun, “DeepRoadMapper: Extracting
road topology from aerial images,” in Proc. of the IEEE Int. Conf.
Comput. Vision, 2017, pp. 3438–3446.
[13] M. Maboudi, J. Amini, M. Hahn, and M. Saati, “Road network
extraction from VHR satellite images using context aware object
feature integration and tensor voting,Remote Sens., vol. 8, no. 8,
2016.
[14] P. Li, Y. Zang, C. Wang, J. Li, M. Cheng, L. Luo, and Y. Yu, “Road
network extraction via deep learning and line integral convolution,” in
Proc. of the Int. Geosci. Remote Sens. Symp., 2016, pp. 1599–1602.
[15] Y. Zang, C. Wang, Y. Yu, L. Luo, K. Yang, and J. Li, “Joint enhancing
filtering for road network extraction,IEEE Trans. Geosci. Remote
Sens., vol. 55, no. 3, pp. 1511–1525, 2016.
[16] Y. Y. Chiang and C. A. Knoblock, “Automatic extraction of road
intersection position, connectivity, and orientations from raster maps,
in Proc. of the ACM Int. Symp. Adv. Geogr. Inf. Syst., 2008, pp. 183–
192.
[17] P. Fischer, S. M. Azimi, R. Roschlaub, and T. Krauß, “Towards HD
maps from aerial imagery: Robust lane marking segmentation using
country-scale imagery,” Int. J. Geo-Inf., vol. 7, no. 12, 2018.
[18] A. Zang, Z. Li, R. Xu, and D. Doria, “Lane boundary extraction from
satellite imagery,” in Proc. of the ACM SIGSPATIAL Workshop High-
Precis. Maps Intell. Appl. Auton. Veh., 2017, pp. 1–8.
[19] A. Artunedo, J. Godoy, and J. Villagra, “Smooth path planning for
urban autonomous driving using OpenStreetMaps,” in Proc. of the
IEEE Intell. Veh. Symp., 2017, pp. 837–842.
[20] D. Krajzewicz, G. Hertkorn, and J. Ringel, “Preparation of digital
maps for traffic simulation; part 1: approach and algorithms,” in Proc.
Ind. Simul. Conf., 2005, pp. 285–290.
[21] D. Nalic, A. Eichberger, G. Hanzl, M. Fellendorf, and B. Rogic,
“Development of a co-simulation framework for systematic generation
of scenarios for testing and validation of automated driving systems,
in Proc. of the IEEE Int. Conf. Intell. Transp. Syst., 2019, pp. 1895–
1901.
[22] P. Riegl, A. Gaull, and M. Beitelschmidt, “A tool chain for generating
critical traffic situations for testing vehicle safety functions,” in IEEE
Int. Conf. on Vehicular Electronics and Safety, 2019, pp. 1–6.
[23] D. Zhao, H. Lam, H. Peng, S. Bao, D. J. LeBlanc, K. Nobukawa,
and C. S. Pan, “Accelerated evaluation of automated vehicles safety
in lane-change scenarios based on importance sampling techniques,”
IEEE Trans. Intell. Transp. Syst., vol. 18, no. 3, pp. 595–607, 2017.
[24] S. Feng, Y. Feng, C. Yu, Y. Zhang, and H. X. Liu, “Testing scenario
library generation for connected and automated vehicles, part I:
Methodology,arXiv:1905.03419, 2020.
[25] F. Hauer, A. Pretschner, and B. Holzm¨
uller, “Fitness functions for
testing automated and autonomous driving systems,” in Proc. of the
Int. Conf. Comput. Safety, Rel., Security, 2019, pp. 69–84.
[26] H. Beglerovic, M. Stolz, and M. Horn, “Testing of autonomous
vehicles using surrogate models and stochastic optimization,” in Proc.
of the IEEE Int. Conf. Intell. Transp. Syst., 2018, pp. 1129–1134.
[27] C. E. Tuncali, T. P. Pavlic, and G. Fainekos, “Utilizing S-TaLiRo as
an automatic test generation framework for autonomous vehicles,” in
Proc. of the IEEE Int. Conf. Intell. Transp. Syst., 2016, pp. 1470–1475.
[28] M. Koschi, C. Pek, S. Maierhofer, and M. Althoff, “Computationally
efficient safety falsification of adaptive cruise control systems,” in
Proc. of the IEEE Int. Conf. Intell. Transp. Syst., 2019, pp. 2879–
2886.
[29] A. Nonnengart, M. Klusch, and M. Christian, “CriSGen : Constraint-
based generation of critical scenarios for autonomous vehicles,” in
Proc. of the Int. Workshop on Formal Methods for Autonomous
Systems, 2019.
[30] M. Klischat and M. Althoff, “Generating critical test scenarios for
automated vehicles with evolutionary algorithms,” in Proc. of the IEEE
Intell. Veh. Symp., 2019, pp. 2352–2358.
[31] M. Althoff and S. Lutz, “Automatic generation of safety-critical test
scenarios for collision avoidance of road vehicles,” in Proc. of the
IEEE Intell. Veh. Symp., 2018, pp. 1326–1333.
[32] P. Bender, J. Ziegler, and C. Stiller, “Lanelets: Efficient map repre-
sentation for autonomous driving,” in Proc. of the IEEE Intell. Veh.
Symp., 2014, pp. 420–425.
[33] M. Klischat, O. Dragoi, M. Eissa, and M. Althoff, “Coupling SUMO
with a motion planning framework for automated vehicles,” in SUMO:
Simulating Connected Urban Mobility, 2019.
[34] A. K. Jain, “Data clustering: 50 years beyond K-means,” Pattern
Recognit. Lett., vol. 31, no. 8, pp. 651–666, 2010.
[35] F. Murtagh and P. Legendre, “Ward’s hierarchical agglomerative
clustering method: which algorithms implement Ward’s criterion?” J.
Classif., vol. 31, no. 3, pp. 274–295, 2014.
[36] F. T. Liu, K. M. Ting, and Z.-H. Zhou, “Isolation forest,” in Proc. of
the IEEE Int. Conf. Data Mining, 2008, pp. 413–422.
[37] M. Klischat and M. Althoff, “A multi-step approach to accelerate the
computation of reachable sets for road vehicles,” in Proc. of the IEEE
Int. Conf. Intell. Transp. Syst., 2020.
[38] R. Eberhart and J. Kennedy, “A new optimizer using particle swarm
theory,” in Proc. of the Int. Symp. Micro Mach. Human Sci., 1995, pp.
39–43.
... Conversely, the achieved realism might be limited (see Sec. VII [3]). In microscopic traffic simulation [11], surveyed in Sec. 2.2, knowledge is provided through human driver models. ...
... Due to the transient and interfering nature of traffic, the mean velocity of vehicles v(t) does not converge until then (see Fig. 8). The duration is extended in Eq. (11), and simulation results are exported only after t warm−up = 170.4 s to ensure that settling processes are completed. ...
Article
Full-text available
Scenario-based testing plays a pivotal role in the development and validation of automated vehicles. Its main challenge is to efficiently generate realistic and relevant test scenarios to identify and analyze shortcomings of automated driving systems. The Scenario Factory 2.0 unifies several scenario generation techniques from the open-source CommonRoad framework and introduces simulation modes for coupling with the traffic simulators OpenTrafficSim and SUMO. The simulation modes enable generating scenarios with a tunable similarity to existing ones. As existing approaches, the Scenario Factory 2.0 integrates scenario generation from formal specifications and falsification techniques. Scenario Factory 2.0 has a modular structure and the modules can be easily rearranged for creating required scenarios. We evaluate the effectiveness of the novel simulation modes for various traffic scenarios and demonstrate the scenario generation with Scenario Factory 2.0 in a use case. The open-source code is provided at https://commonroad.in.tum.de/tools/scenario-factory .
... Three main approaches exist to generate test scenarios for autonomous driving: knowledge-based, data-driven, and adversarial. Knowledge-based methods depend on expert rules, e.g., using crowd models [19] or traffic simulators with certain constraints [20]. The data-driven method for scenario generation relies on real-world data [21]. ...
Preprint
Full-text available
Large Language Models (LLMs) are taking many industries by storm. They possess impressive reasoning capabilities and are capable of handling complex problems, as shown by their steadily improving scores on coding and mathematical benchmarks. However, are the models currently available truly capable of addressing real-world challenges, such as those found in the automotive industry? How well can they understand high-level, abstract instructions? Can they translate these instructions directly into functional code, or do they still need help and supervision? In this work, we put one of the current state-of-the-art models to the test. We evaluate its performance in the task of translating abstract requirements, extracted from automotive standards and documents, into configuration code for CARLA simulations.
... To mitigate this, integrated end-to-end methods using neural networks have been developed [17,19]. These methods are typically designed to extract high-level geometric information of road networks, and they do not provide the level of detail and accuracy required to plan trajectories useful for automated driving [20][21][22]. Compared to aerial photography, panoramic-view images captured by vehicle-mounted cameras can better describe local road scenes. ...
Article
Full-text available
Maps of road layouts play an essential role in autonomous driving, and it is often advantageous to represent them in a compact form, using a sparse set of surveyed points of the lane boundaries. While lane centerlines are valuable references in the prediction and planning of trajectories, most centerline extraction methods only achieve satisfactory accuracy with high computational cost and limited performance in sparsely described scenarios. This paper explores the problem of centerline extraction based on a sparse set of border points, evaluating the performance of different approaches on both a self-created and a public dataset, and proposing a novel method to extract the lane centerline by searching and linking the internal maximal circles along the lane. Compared with other centerline extraction methods producing similar numbers of center points, the proposed approach is significantly more accurate: in our experiments, based on a self-created dataset of road layouts, it achieves a max deviation below 0.15 m and an overall RMSE less than 0.01 m, against the respective values of 1.7 m and 0.35 m for a popular approach based on Voronoi tessellation, and 1 m and 0.25 m for an alternative approach based on distance transform.
... Building on this approach, [27] leverages optimization methods to automatically create diverse and high-stakes scenarios, focusing on uncovering system vulnerabilities during regression testing and ensuring robustness across software iterations. Similarly, [28] introduces a workflow that extracts diverse road networks from OpenStreetMap 2 and uses the SUMO [29] traffic simulator to generate traffic scenarios. By employing nonlinear optimization, it identifies safety-critical situations that can challenge autonomous systems, providing a scalable approach to scenario generation without relying on realworld data. ...
Preprint
Full-text available
The validation of autonomous driving systems benefits greatly from the ability to generate scenarios that are both realistic and precisely controllable. Conventional approaches, such as real-world test drives, are not only expensive but also lack the flexibility to capture targeted edge cases for thorough evaluation. To address these challenges, we propose a controllable latent diffusion that guides the training of diffusion models via reinforcement learning to automatically generate a diverse and controllable set of driving scenarios for virtual testing. Our approach removes the reliance on large-scale real-world data by generating complex scenarios whose properties can be finely tuned to challenge and assess autonomous vehicle systems. Experimental results show that our approach has the lowest collision rate of 0.098 and lowest off-road rate of 0.096, demonstrating superiority over existing baselines. The proposed approach significantly improves the realism, stability and controllability of the generated scenarios, enabling more nuanced safety evaluation of autonomous vehicles.
... (i) High-risk scenarios are situations that could endanger vehicles and road users, possibly resulting in severe accidents if they were to happen in reality. Identifying high-risk scenarios can be achieved through various methods, such as binary search techniques [28], evolutionary algorithms [29], or by amplifying the criticality of typical traffic scenarios using nonlinear optimization methods [30]. However, there is still no uniform standard for defining high-risk scenarios. ...
Article
Full-text available
Highly automated vehicles are complex systems, and ensuring their safe operation within their Operational Design Domain (ODD) presents significant challenges. Diagnosing failure modes and updating these systems are even more demanding tasks. This paper introduces a method to assist with assessing, diagnosing, and updating these systems by developing a stochastic model that predicts safety outcomes (collision, near-miss, or safe state) with quantified uncertainty in any parametrized scenario. The approach uses bootstrapping aggregation to create an ensemble of predictive models, leveraging fully connected feed-forward neural networks. These networks are designed with a flexible number of trainable parameters and hidden layers, requiring minimal computational resources. The model is trained on a small set of examples obtained through direct simulations that randomly sample the parametric scenario, bypassing the traditional test matrix definition. Once trained, the bootstrapped model serves as an identity card for the system under test, allowing continuous performance evaluation across the parametric scenario. The paper demonstrates applications, including safety assessment, failure mode identification, and developing a safe speed recommendation function. The model's compact size ensures rapid execution, facilitating extensive post-analysis for safety argumentation and diagnosis and real-time online use to extend the system's abilities.
Article
Autonomous driving is developing rapidly and has become a hot topic in both industry and research. The planning system plays a crucial role in meeting the requirements of autonomous driving. However, current planning system designs may not effectively serve planning tasks. A typical modular planning system offers high interpretability and flexibility. However, it may cause task-agnostic problems between the upstream predictor and the downstream planner. End-to-end driving systems have a natural advantage in achieving system-wide integration, but their poor interpretability poses safety risks. Therefore, in this paper, a task-unified planning framework is proposed to inspire the current prediction-planning paradigm. In this architecture, driving tasks are first modeled. Then, the predictor and planner are jointly designed and optimized based on these tasks. Finally, during the actual planning process, the upstream and downstream components remain relatively independent to allow for flexible adjustments. The core of this architecture is a planning-oriented predictor named POP, which fully retains the advantages of modular systems by optimizing the predictor to meet driving requirements. Comprehensive experiments demonstrate its effectiveness. Compared to typical modular systems, POP-based framework shows significant improvements in planning tasks, particularly in collision avoidance, ensuring system safety without compromising driving efficiency or comfort.
Chapter
Full-text available
Ensuring pedestrian-safety is paramount to the acceptance and success of autonomous cars. The scenario-based training and testing of such self-driving vehicles in virtual driving simulation environments has increasingly gained attention in the past years. A key challenge is the automated generation of critical traffic scenarios which usually are rare in real-world traffic, while computing and testing all possible scenarios is infeasible in practice. In this paper, we present a formal method-based approach CriSGen for an automated and complete generation of critical traffic scenarios for virtual training of self-driving cars. These scenarios are determined as close variants of given but uncritical and formally abstracted scenarios via reasoning on their non-linear arithmetic constraint formulas, such that the original maneuver of the self-driving car in them will not be pedestrian-safe anymore, enforcing it to further adapt the behavior during training.
Conference Paper
Full-text available
We propose an approach for the fast computation of reachable sets of road vehicles while considering dynamic obstacles. The obtained reachable sets contain all possible behaviors of vehicles and can be used for motion planning, verification, and criticality assessment. The proposed approach precomputes computationally expensive parts of the reachability analysis. Further, we partition the reachable set into cells and construct a directed graph storing which cells are reachable from which cells at preceding time steps. Using this approach, considering obstacles reduces to deleting nodes from the directed graph. Although this simple idea ensures an efficient computation, the discretization can introduce considerable over-approximations. Thus, the main novelty of this paper is to reduce the over-approximations by intersecting reachable sets propagated from multiple points in time. We demonstrate our approach on a large range of scenarios for automated vehicles showing a faster computation time compared to previous approaches while providing the same level of accuracy.
Article
Full-text available
When will automated vehicles come onto the market? This question has puzzled the automotive industry and society for years. The technology and its implementation have made rapid progress over the last decade, but the challenge of how to prove the safety of these systems has not yet been solved. Since a market launch without proof of safety would neither be accepted by society nor by legislators, much time and many resources have been invested into safety assessment in recent years in order to develop new approaches for an efficient assessment. This paper therefore provides an overview of various approaches, and gives a comprehensive survey of the so-called scenario-based approach. The scenario-based approach is a promising method, in which individual traffic situations are typically tested by means of virtual simulation. Since an infinite number of different scenarios can theoretically occur in real-world traffic, even the scenario-based approach leaves the question unanswered as to how to break these down into a finite set of scenarios, and find those which are representative in order to render testing more manageable. This paper provides a comprehensive literature review of related safety-assessment publications that deal precisely with this question. Therefore, this paper develops a novel taxonomy for the scenario-based approach, and classifies all literature sources. Based on this, the existing methods will be compared with each other and, as one conclusion, the alternative concept of formal verification will be combined with the scenario-based approach. Finally, future research priorities are derived.
Conference Paper
Full-text available
Due to the sheer infinite number of test scenarios for test and validation of automated driving, stand-alone onroad testing of these systems is not reasonable, calling for the development of X-in-the-loop test methods. Recent advances in simulation methods are often based on simulation techniques where test scenarios are built considering stochastic traffic or deterministic predefined manoeuvres. To ensure realism, numerical robustness and usability of the test scenarios for both approaches, increasing effort must be invested in modelling the driving environment as well as vehicle and traffic dynamics. Especially traffic models are rarely realistically modelled in most current scenario generation and testing techniques. The goal of the present paper is to introduce a co-simulation framework for automated scenario generation with calibrated traffic flow models using measured data from an official test road in Austria and modelled in PTV Vissim. Combined with the Multi-Body-System vehicle development software IPG CarMaker, the presented co-simulation framework provides an approach for generation of realistic scenarios. This approach is demonstrated for a Highway Chauffeur function and allows future systematic testing.
Chapter
Full-text available
Functional specifications and real drive data are typically used to derive parameterized scenarios for scenario-based testing of driving systems. The domains of the parameters span a huge space of possible test cases, from which “good” ones have to be selected. Heuristic search, guided by fitness functions, has been proposed as a suitable technique in the past. However, the methodological challenge of creating suitable fitness functions has not been addressed yet. We provide templates to formulate fitness functions for testing automated and autonomous driving systems. Those templates ensure correct positioning of scenario objects in space, yield a suitable ordering of maneuvers in time, and enable the search for scenarios in which the system leaves its safe operating envelope. We show how to compose them into fitness functions for heuristic search. Collision and close-to-collision scenarios from real drive data serve as a use case to show the applicability of the presented templates.
Conference Paper
Full-text available
Falsification aims to disprove the safety of systems by providing counterexamples that lead to a violation of safety properties. In this work, we present two novel falsification methods to reveal safety flaws in adaptive cruise control (ACC) systems of automated vehicles. Our methods use rapidly-exploring random trees to generate motions for a leading vehicle such that the ACC under test causes a rear-end collision. By considering unsafe states and searching backward in time, we are able to drastically improve computation times and falsify even sophisticated ACC systems. The obtained collision scenarios reveal safety flaws of the ACC under test and can be directly used to improve the system's design. We demonstrate the benefits of our methods by successfully falsifying the safety of state-of-the-art ACC systems and comparing the results to that of existing approaches.
Conference Paper
Full-text available
Self-driving cars rely on software which needs to be thoroughly tested. Testing self-driving car software in real traffic is not only expensive but also dangerous, and has already caused fatalities. Virtual tests, in which self-driving car software is tested in computer simulations, offer a more efficient and safer alternative compared to naturalistic field operational tests. However, creating suitable test scenarios is laborious and difficult. In this paper we combine procedural content generation, a technique commonly employed in modern video games, and search-based testing, a testing technique proven to be effective in many domains, in order to automatically create challenging virtual scenarios for testing self-driving car soft- ware. Our AsFault prototype implements this approach to generate virtual roads for testing lane keeping, one of the defining features of autonomous driving. Evaluation on two different self-driving car software systems demonstrates that AsFault can generate effective virtual road networks that succeed in revealing software failures, which manifest as cars departing their lane. Compared to random testing AsFault was not only more efficient, but also caused up to twice as many lane departures.
Conference Paper
Full-text available
Testing motion planning algorithms for automated vehicles in realistic simulation environments accelerates their development compared to performing real-world test drives only. In this work, we combine the open-source microscopic traffic simulator SUMO with our software framework CommonRoad to test motion planning of automated vehicles. Since SUMO is not originally designed for simulating automated vehicles, we present an interface for exchanging the trajectories of vehicles controlled by a motion planner and the trajectories of other traffic participants between SUMO and CommonRoad. Furthermore, we ensure realistic dynamic behavior of other traffic participants by extending the lane changing model in SUMO to implement more realistic lateral dynamics. We demonstrate our SUMO interface with a highway scenario.
Article
Testing and evaluation is a critical step in the development and deployment of connected and automated vehicles (CAVs), and yet there is no systematic framework to generate testing scenario library. This study aims to provide a general framework for the testing scenario library generation (TSLG) problem with different operational design domains (ODDs), CAV models, and performance metrics. Given an ODD, the testing scenario library is defined as a critical set of scenarios that can be used for CAV test. Each testing scenario is evaluated by a newly proposed measure, scenario criticality, which can be computed as a combination of maneuver challenge and exposure frequency. To search for critical scenarios, an auxiliary objective function is designed, and a multi-start optimization method along with seed-filling is applied. Theoretical analysis suggests that the proposed framework can obtain accurate evaluation results with much fewer number of tests, if compared with the on-road test method. In part II of the study, three case studies are investigated to demonstrate the proposed method. Reinforcement learning based technique is applied to enhance the searching method under high-dimensional scenarios.