Conference PaperPDF Available

Atmospheric Blocking Pattern Recognition in Global Climate Model Simulation Data

Authors:
Atmospheric Blocking Pattern Recognition
in Global Climate Model Simulation Data
Grzegorz Muszynski∗†, Prabhat, Jan Balewski, Karthik Kashinath, Michael Wehnerand Vitaliy Kurlin
Department of Computer Science
University of Liverpool, Liverpool, L69 3BX, UK
National Energy Research Scientific Computing Center
Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
Computational Research Division
Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
Abstract—In this paper, we address a problem of atmospheric
blocking pattern recognition in global climate model simulation
data. Understanding blocking events is a crucial problem to
society and natural infrastructure, as they often lead to weather
extremes, such as heat waves, heavy precipitation, and the
unusually poor air condition. Moreover, it is very challenging
to detect these events as there is no physics-based model of
blocking dynamic development that could account for their
spatiotemporal characteristics. Here, we propose a new two-
stage hierarchical pattern recognition method for detection and
localisation of atmospheric blocking events in different regions
over the globe. For both the detection stage and localisation
stage, we train five different architectures of a convolutional
neural network (CNN) based classifier and regressor. The results
show the general pattern of the atmospheric blocking detection
performance increasing significantly for the deep CNN architec-
tures. In contrast, we see the estimation error of event location
decreasing significantly in the localisation problem for the shallow
CNN architectures. We demonstrate that CNN architectures tend
to achieve the highest accuracy for blocking event detection
and the lowest estimation error of event localisation in regions
of the Northern Hemisphere than in regions of the Southern
Hemisphere.
I. INTRODUCTION
Deviations from the normal atmospheric circulation in the
mid-latitudes of both hemispheres usually lead to the forma-
tion of climate patterns that are known as atmospheric blocks
(ABs) [1], [2]. Blocking is a large-scale climate pattern that is
often correlated with extreme weather events, such as floods,
cold spells, short-lived droughts, heat waves, and extremely
poor air conditions in summer and winter seasons [3]–[6]
These weather extremes connected with AB phenomena pose
a high risk to society and natural infrastructure. For example,
the summer heat wave, combined with a drought in 2003 that
claimed tens of thousands of lives in Europe, was partially
caused by the resilient AB [7]. In 2012, AB in the North
Atlantic steered hurricane Sandy into the north-eastern coast of
the United States, and its heavy precipitation resulted in severe
flooding and brought the high economic cost to society [8].
Therefore, the importance of characterising and understanding
of the evolution of atmospheric patterns, including ABs, in a
changing climate cannot be overstated. To analyse how these
patterns change under different carbon emission scenarios or
large-scale climatic variations, ABs have first to be identified.
Objective identification of climate patterns, including AB
phenomena, has progressively emerged as one of the most
active research areas in climate science over the last decades.
However, it remains a challenge for the climate science
community. Existing AB pattern recognition methods are engi-
neered heuristics based on human expertise in defining specific
atmospheric phenomena. In other words, these conventional
methods are built upon hard constraints on subjective thresh-
olds of relevant physical variables, such as surface pressure,
temperature, and wind speed. Moreover, AB patterns do have
neither a unique theoretical model nor a clear empirical
definition that is universally accepted by all climate science
community [2]. ABs can take many forms in terms of the
shape, the size, and the location of patterns over the globe.
For all these reasons, various sets of criteria and thresholds are
used to characterise blocking patterns. This non-uniformity in
research on ABs usually results in the discrepancies between
outputs of different identification methods [9]. What is more,
the existing heuristics do not perform as well as visual
inspection of climate experts for identifying these patterns.
There are many ongoing efforts to search for alternative
identification methods of climate patterns or extreme weather
events, such as atmospheric rivers, extra-tropical cyclones, and
hurricanes. These methods include an unsupervised segmenta-
tion and a discovery of coherent structures in spatiotemporal
systems [10], topological methods [11], and machine learning
methods along with deep learning techniques [12]. The above
mentioned methods by their inherent design circumvent a crit-
ical selection of suitable thresholds of different physical vari-
ables to characterise atmospheric patterns. In particular, deep-
learning-based approaches can learn robust representations of
images from raw pixels, outperforming handcrafted features.
Furthermore, the recent advances in deep neural networks,
especially deep convolutional neural networks (CNNs), have
demonstrated a significant improvement in achieving the state-
of-the-art results on image recognition tasks [13]–[15], such
as object detection and object segmentation. Despite all of
these deep learning breakthroughs, the adoption of deep neural
networks in climate science research is still unexplored [16],
[17]. Although the atmospheric phenomena pattern recognition
task in climate data is similar to the task of finding objects
in natural RGB images, there are important differences, as
follows:
Climate model simulation products have more ‘channels’
of information than RGB images;
There is lack of publicly available data annotations or
labelled data for supervised machine learning methods;
Climate data do not share the same statistics as RGB
images; hence CNN architectures cannot be pre-trained
on publicly available databases (e.g., ImageNet [18]).
In this paper, we address the problem of identifying climate
patterns, that is formulated as a pattern recognition task. The
key contributions of this work are, as follows:
We develop a new two-stage hierarchical pattern recogni-
tion method for the identification of atmospheric blocking
events (ABs) in different regions over the globe, as is
shown in Figure 2. The first stage is an AB pattern detec-
tion problem defined as a binary classification task, and
the second stage is an AB pattern localisation problem
defined as a regression task. In the detection stage, a
CNN-based classifier distinguishes the AB images from
the non-AB images. Those images with detected AB
events are passed to the localisation stage. In that stage,
a CNN-based regressor predicts AB location parameters
in the image, i.e. a mass centre (a latitudinal position
and a longitudinal position), and a minimum enclosing
circular box. For both the detection step and localisation
step, we propose and train customised CNN architectures
inspired by the Oxford Visual Geometry Group (VGG)
architectures [14].
We investigate five different architectures of the generic
CNN-based classifier and regressor respectively designed
for each stage of the AB identification method, as shown
in Table I and II. We fix other parameters of the CNN
architectures and we steadily increase the network depth
by stacking more two-dimensional convolutional layers.
Furthermore, we observe the general pattern of the AB
detection performance increasing significantly for the
deep CNN architectures. In contrast, we see the estima-
tion error of event location decreasing significantly in the
localisation problem for the shallow CNN architectures.
We demonstrate that the proposed CNN architectures
tend to achieve better AB detection and localisation
performances in regions of the Northern Hemisphere than
in regions of the Southern Hemisphere. Moreover, we
observe that the classification accuracy gradually declines
for each CNN architecture over a period of time. We also
indicate that the value of estimation error in the regression
task strongly depends on geographical location and the
associated climatic variability.
In Section II, a brief overview of AB phenomena re-
search and the existing AB pattern detection methods are
described. In Section III, we detail climate data and the created
benchmark dataset. Section IV describes the proposed AB
pattern recognition method and CNN classifier and regressor,
including their architectures. In Section VI, we present and
discuss all the results. Section VII presents conclusions and
future work.
II. RE LATED WO RK
In this section, we briefly review a related work on at-
mospheric blocking (AB) patterns and the existing objective
identification methods of them.
A. Research on Atmospheric Blocking
An atmospheric blocking pattern is a resilient obstruction
of the normal west-to-east atmospheric circulation in the
mid-latitudes of both hemispheres [19]. In other words, this
pattern temporally redirects the jet stream and can cause
one of the most dangerous extreme weather events, such
as heat waves, cold spells, and many others [7]. An AB
phenomenon has important consequences on the weather of
densely populated areas [8], and that is why early blocking
studies have been mainly focused on the Northern Hemisphere.
Since the middle of the last century, there have been many
observational works on the AB phenomena [1], [20]. The
first AB climatology analyses by the atmospheric science
community were to investigate the geographical distribution
of its observational statistics and find a commonly accepted
structural definition of this phenomenon [2]. In the second half
of the 20th century, many theoretical studies started looking for
a physics-based model of AB dynamic development that could
account for its spatiotemporal characteristics. However, to
date, there is no theoretical model of AB that can constitute all
of its observational characteristics [21]. More recently, much
attention has been paid to the evaluation of AB occurrence in
global climate models. AB patterns still remain a challenge for
numerical simulations, although numerous improvements in
atmospheric modelling have been made by the climate science
community [22].
B. Atmospheric Blocking Pattern Recognition Methods
Turning all broad statements about the qualitative features
of AB patterns into an objective identification method is not
straightforward. For example, to compare climate simulations
with the observational records of AB or non-AB, these patterns
have to be first found in climate model products. However,
certain features of AB occur in the majority of these patterns,
ABs may have various spatial structures or shapes that depend
on a specific geographic location and season.
The first attempts at identifying ABs required comprehen-
sive visual inspection of the atmospheric flow patterns and
were also limited in their scope [1]. Such simple analysis
usually leads to potentially different conclusions about AB
patterns. For this reason, a wide range of objective identifi-
cation methods has been proposed over the years. Many of
them are based on 1D analysis techniques called blocking
indices [23]–[25]. Some of these 1D blocking indices have
been modified into 2D indices of flow fields [9], [26]. A
couple of intercomparison studies have been done to quantify
if both 1D and 2D identification techniques produce consistent
AB characteristics, such as blocking frequency, duration and
location [9].
III. DATA
A. Climate Dataset
In climate science research, one can distinguish three cate-
gories of global datasets: model simulation products, observa-
tional data (e.g., satellite images), and reanalysis products (i.e.,
generated by combining climate models with observations). In
this study, we analyze the last category because the reanalysis
provides a comprehensive historical record of the Earth’s
climate and gives a reliable way to monitor how fast it is
changing. That is why it fits the problem of studying climate
patterns, i.e. atmospheric blocking (AB) phenomena.
In our analysis, we use the ERA-Interim reanalysis product
from the European Center for Medium-Range Weather Fore-
casting. We use five physical variables on a regular grid: tem-
perature, meridional and zonal wind, geopotential height, and
potential vorticity variables at eight different pressure levels
in millibars (mb): {150mb, 200mb, 250mb, 300mb, 350mb,
400mb, 450mb, 500mb}, 6-hourly timesteps at approximately
80 km spatial resolution (180 pixels ×360 pixels based on
T119 spectral model grid resolution) in the period of January
1, 1980 - December 31, 2016.
Fig. 1. An example of the world map (globe) with six defined regions of
interest: the North Pacific Region (NP); the North Atlantic Region (NA); the
North Continental Region (NC); the South Pacific Region (SP); the South
Atlantic Region (SA), and the South Indian Ocean Region (SI).
B. Generated Dataset
The publicly available dataset of identified AB patterns
is based on a climatological field diagnostic procedure of
Schwierz, et al. [26] and is provided by the Institute for
Atmospheric and Climate Science at the ETH Zurich, Switzer-
land. The procedure consists of a few steps: a semi-automatic
detection algorithm, and 2D blocking index method that was
applied to the ERA-Interim reanalysis data [27]. The output of
this procedure yields 2D binary fields (masks) with the value
one at pixels that meet the procedure criteria for AB and the
value zero at pixels that do not belong (non-AB).
We generated the ground-truth labels (i.e., AB class vs non-
AB class) based on the output of the procedure mentioned
above. The global image of 180 pixels ×360 pixels ×40
channels is divided into six images (regions) of size 60 pixels
×120 pixels ×40 channels. Location of regions and extent
are based on the studies of [28]. We can distinguish six
regions: the North Pacific Region (NP); the North Atlantic
Region (NA); the North Continental Region (NC); the South
Pacific Region (SP); the South Atlantic Region (SA), and the
South Indian Ocean Region (SI), as shown in Figure 1. Each
region is roughly centred over a local maximum of AB main
frequency occurrence. We use only images of mid-latitudes
regions of both hemispheres on the globe. The positive class
label is assigned to an image if it contains a compact binary
blob of size greater or equal to the average AB blob size.
The negative class label is assigned to an image if it does
not have a binary blob. The generated labelled dataset has
approximately 140K images of both AB and non-AB events,
where the number of samples per class is almost balanced.
For the localisation (regression) problem, we select the images
containing AB phenomena. Then we calculate the centroid or
the mass centre of a binary blob in the region and a radius of
a minimum bounding circular box.
IV. METHODOLOGY
In this section, we describe a hierarchical pattern recognition
method for the identification of atmospheric blocking events
(ABs). As shown in Figure 2, the hierarchical method consists
of two stages:
In the first stage, AB pattern detection is formulated as a
binary classification task. A convolutional neural network
(CNN) based classifier distinguishes the AB images from
the non-AB images in different regions over the globe.
Those images with detected AB events are passed to the
second stage.
In the second stage, AB pattern localisation is defined as
a regression problem. A CNN-based regressor predicts
AB location parameters in the images, i.e. a mass centre
(a latitudinal position and a longitudinal position), and
a radius of a minimum enclosing circular box of these
events in different regions of the globe.
Both stages employ customised CNN architectures inspired
by VGG architectures (VGGs) [14]. Because the VGGs are the
most commonly referred ones in the deep learning literature,
and one of the state-of-the-art architectures in classification
task and localisation task. In both tasks, we investigate ve
different architectures of the generic CNNs designed for each
stage of the method, as outlined in Table I and Table II.
A. Detection of Atmospheric Blocking Events
In this stage, we develop a generic CNN-based classifier to
which we refer as the architecture A, as outlined in Table I.
The architecture has eight weight layers, including four con-
volutional layers (conv) and four fully connected layers (FC).
Each convolutional layers is followed by a max pool layer
(Maxpooling). The width of convs starts from 64 filters in the
first layer and then increases by a factor of two after each
Maxpooling layer until it reaches 512 filters. In contrast, the
width of FC layers starts from 512 channels and then decreases
by a factor of two, until it reaches 64 channels. The output FC
layer performs a binary classification and therefore contains
one value, i.e. an AB label or a non-AB label.
Fig. 2. The big picture of a hierarchical atmospheric blocking pattern recognition method. The method consists of two stages: (a) illustrates an architecture of
convolutional neural network (CNN) based classifier that distinguishes atmospheric blocking events (AB label) and otherwise non-blocking events (non-AB
label); and (b) illustrates an architecture of CNN-based regressor that predicts three values describing the AB location, i.e. a latitudinal position, a longitudinal
position, and a radius of a minimum enclosing circular box.
All five architectures of a CNN-based classifier follow the
architecture A and gradually increase the number of conv
layers (their depth). The configuration of the FC layers is the
same in all architectures. All five architectures are referred by
their names (A-E) and all details on the architectures are listed
in Table I, one per column.
TABLE I
CONVOLUTIONAL NEURAL NETWORK (CNN) ARCHITECTURES ARE
SHOWN IN COLUMNS. THE DEPTH OF THE ARCHITECTURES INCREASES
FRO M TH E LE FT (A) TO T HE R IG HT (E). TH E PARA ME TE RS O F BOT H
TY PES O F LAY ER S ARE D EN OT ED AS F OL LO WS :CON V-(NUMBER OF
FILTE RS ); FC-(NU MB ER O F CH ANN EL S) .
CNN Architectures
A B C D E
Input: (60 ×120 ×40)
conv-64 conv-64 conv-64 conv-64 conv-64
conv-64 conv-64 conv-64 conv-64
Maxpooling
conv-128 conv-128 conv-128 conv-128 conv-128
conv-128 conv-128 conv-128
Maxpooling
conv-256 conv-256 conv-256 conv-256 conv-256
conv-256 conv-256
Maxpooling
conv-512 conv-512 conv-512 conv-512 conv-512
conv-512
Maxpooling
FC-512
FC-256
FC-128
FC-64
Output: FC-1
B. Localisation of Atmospheric Blocking Events
In this stage, we develop a generic CNN-based regressor to
which we refer as the architecture A, as outlined in Table II.
The architecture has totally seven weight layers, including four
convolutional layers (conv) and three fully connected layers
(FC). Each convolutional layers is followed by a max pool
layer (Maxpooling). The width of convs starts from 64 filters
in the first layer and then increases by a factor of two after each
Maxpooling layer, until it reaches 512 filters. In contrast, the
width of FC layers starts from 256 channels and then decreases
by a factor of two, until it reaches 64 channels. The output FC
layer performs a multi-regression task and therefore contains
three values, i.e. a latitudinal position, longitudinal position,
and a radius of a minimum enclosing circular box.
All five architectures of a CNN-based regressor follow the
architecture A and gradually increase the number of conv
layers (their depth). The configuration of the FC layers is the
same in all architectures. All five architectures are referred by
their names (A-E) and all details on the architectures are listed
in Table II, one per column.
TABLE II
CONVOLUTIONAL NEURAL NETWORK (CNN) ARCHITECTURES ARE
SH OWN I N CO LU MN S. THE DEPTH OF THE ARCHITECTURES INCREASES
FRO M TH E LE FT (A) TO T HE R IG HT (E). TH E PARA ME TE RS O F BOT H
TY PES O F LAY ER S ARE D EN OT ED AS F OL LO WS :CON V-(NUM BE R OF
FILTE RS ); FC-(NU MB ER O F CH ANN EL S) .
CNN Architectures
A B C D E
Input: (60 ×120 ×40)
conv-64 conv-64 conv-64 conv-64 conv-64
conv-64 conv-64 conv-64 conv-64
Maxpooling
conv-128 conv-128 conv-128 conv-128 conv-128
conv-128 conv-128 conv-128
Maxpooling
conv-256 conv-256 conv-256 conv-256 conv-256
conv-256 conv-256
Maxpooling
conv-512 conv-512 conv-512 conv-512 conv-512
conv-512
Maxpooling
FC-256
FC-128
FC-64
Output: FC-3
V. EXP ER IME NTAL SETTINGS
A. The Setting of Experiments
We perform 10-fold-cross-validation (CV) on the dataset,
where eight folds are for training, the one fold is for validation,
and the one fold is for testing. In each CV-round, five convolu-
tional neural networks (CNNs) with different hyperparameters
are trained on a training set, evaluated on a validation set,
and tested on a testing set. There are two hyperparameters
that we search, i.e. dropout rate of neurons in fully connected
layers (i.e., [0.2; 0.6]) and batch size (i.e., {32,64,128,256}).
The hyperparameters are selected through a random search
procedure.
CNNs are trained using Adam optimizer, which is the adap-
tive learning rate optimization algorithm. For the regression
problem, we used the mean squared error loss function, and for
the classification task, we used the binary cross-entropy loss
function. The rectified linear activation function is used in all
convolutional layers and all fully connected layers. The output
layer of CNN-based classifier uses the sigmoid activation
function and the output layer of CNN-based regressor uses
the hyperbolic tangent activation function.
B. Computational Platform
We performed our data processing, CNN architectures train-
ing and testing on Cori, a Cray XC40 supercomputing system
at the National Energy Research Scientific Computing Center
(NERSC). Each of Cori computing node has 32 2.3 GHz Intel
Haswell processors or has 68 1.4 GHz Intel KNL processors.
In our computations, we used single node TensorFlow backend
of Keras. The random search procedure of hyper-parameter
optimisation was distributed across Cori compute nodes with
tasks fully parallel on 32 cores and 68 cores.
C. Evaluation Metrics
To assess the performance of architectures of a CNN-based
classifier and a CNN-based regressor, we use the following
metrics: a classification accuracy (ACC) and F1 score for a de-
tection task; Lin’s concordance correlation coefficient (CCC)
and the mean percentage error (MPE) for a regression problem.
The F1 score measures a weighted harmonic mean of the
precision and sensitivity of the classifier in case of imbalanced
class problem in data of different regions over the globe. The
CCC value is the measure of agreement between the true
variables and the predicted variables by the regressor. The
MPE measures if the regressor systematically underestimates
or overestimates the predicted variables. We use McNemar’s
statistical test at a significance level of 0.05 on the output
of classification task [29] and Wilcoxon Signed-Rank test at
a significance level of 0.05 for the output regression task
[30]. Both tests evaluate which CNN architecture performs
the best, i.e. we observe a statically significant increase and
decrease in classification accuracy and error of predicting
localisation parameters of AB patterns, respectively. In order to
facilitate the comparison, CNN architectures have been ranked
according to the ACC values for the classification task and the
respective MPE values for the regression task.
VI. RE SULTS A ND DISCUSSION
In this section, we present and discuss the obtained results
in detecting and localising atmospheric blocking (AB) events
in the dataset generated from the ERA-Interim climate model
simulation product, as described in Section III-B.
A. Results
Figure 3 (a) and Figure 3 (b) show a bar chart of classifi-
cation accuracy measure (ACC) and a bar chart of the mean
F1 score of both classes, respectively. The ACC and F1 score
values are computed for five architectures outlined in Table I,
and for all six regions over the globe shown in Figure 1.
What can be seen in these charts is the sharp decrease in
the architectures ACC and F1 score values in the regions of
the Southern Hemisphere (i.e., SA, SI, and SP) in comparison
to the architectures ACC values and F1 scores in the regions
of the Northern Hemisphere (i.e., NA, NC, and NP). Both
bar charts reveal that the architecture D of the proposed
classifier outperforms the other architectures in regions of
both hemispheres, regarding ACC and F1 score values. The
only exception to this is the SA region in which values of
performance metrics are slightly higher for the architecture B
than the architecture D. It can be seen that the architecture A
reached the lowest performance for all six regions. Overall,
we can observe a statistically significant increase in the ACC
values (p-value 0.05) for: architecture E in the NA region;
architecture D in the NC region; architecture D in the NP
region; architecture E in the SA region; architecture D in the
SI region, and architecture B in the SP region.
Figure 4 (a) and Figure 4 (b) display the mean values of
ACCs and F1 scores (the mean of both classes) for the entire
globe over time (i.e., averages of ve year periods). If we
look at the trends over time, we can see the fluctuation in
the ACC and F1 score values for all architectures. However,
there is no sign of levelling off; we can observe that for some
architectures (i.e., architecture B and architecture A) the scores
drop gradually over time after the initial surge. Furthermore,
we can see that architecture B, architecture C, architecture D,
and architecture E converge moderately at similar ACC and
F1 score values in most of the periods.
Figure 5 displays a bar chart of Lin’s concordance correla-
tion coefficient (CCC) for all architectures in Table II, and all
regions over the globe shown in Figure 1. What stands out in
this chart is a steep decline in CCC values for the architectures
in regions of the Southern Hemisphere (i.e., SA, SI, and SP)
in comparison to CCC values for the architectures in regions
of the Northern Hemisphere (i.e., NA, NC, and NP). The chart
shows the general dominance pattern of CCC values for the
architecture A in regions of both hemispheres, except for the
SA region. In contrast, it can be seen that CCC values for the
architecture E are the lowest for all the regions. In general, we
can observe more complex architecture is, more CCC values
decrease.
In Figure 6 (a), (b), and (c) show bar charts of mean per-
centage error (MPE) of three predicted parameters describing
the location of AB patterns for all architectures in Table II,
(a) (b)
Fig. 3. Performance of convolutional neural network (CNN) architectures: architecture A, architecture B, architecture C, architecture D, and architecture E;
for regions of the North Atlantic Region (NA), the North Continental Region (NC), the North Pacific Region (NP), the South Atlantic Region (SA), the South
Indian Ocean Region (SI) and the South Pacific Region (SP). Left bar chart (a) illustrates classification accuracy for each architecture per region and the right
chart (b) displays F1 score for each architecture per region. The ?symbol stands for a p-value 0.05.
(a) (b)
Fig. 4. Performance of convolutional neural network (CNN) architectures: architecture A, architecture B, architecture C, architecture D, and architecture E;
for regions of the North Atlantic Region (NA), the North Continental Region (NC), the North Pacific Region (NP), the South Atlantic Region (SA), the South
Indian Ocean Region (SI) and the South Pacific Region (SP). Left bar chart (a) illustrates the mean classification accuracy of each architecture per region
over five year periods and the right chart (b) displays the mean F1 score for each architecture per region over five year periods.
Fig. 5. Performance of convolutional neural network (CNN) architectures:
architecture A, architecture B, architecture C, architecture D, and architecture
E; for the North Atlantic Region (NA), the North Continental Region (NC),
the North Pacific Region (NP), the South Atlantic Region (SA), the South
Indian Ocean Region (SI) and the South Pacific Region (SP). The bar chart
illustrates the Lin’s concordance correlation coefficient for each architecture
per region.
and all regions over the globe shown in Figure 1.
Figure 6 (a) displays the MPE of the latitudinal mass
centre position of AB events. We can see that all architectures
overestimate the latitudinal parameter (positive error) in all the
regions. The chart reveals that there has been a steep decrease
in the overestimating the parameter by all architectures in
the NP region and SI region. It can be observed that the
architecture E has the highest MPE in four out of six regions
(i.e., NA, SA, SI, and SP). Overall, the MPE of latitudinal
parameter decreases statistically significantly (p-value 0.05)
for: architecture A in the SA region, architecture B in the SP
region; architecture C in the NA region, and architecture D in
the NC, NP and SI regions.
Figure 6 (b) shows the MPE of the longitudinal mass centre
position of AB events. It can be seen that different architec-
tures underestimate the longitudinal parameter in five out of six
regions (i.e., NA, NC, NP, SA, and SI). The exception to this
is the architecture B in the region SA, which the MPE value
is overestimated. What stands out in this chart is that MPE
values are positive for the most architectures in one out of six
regions, i.e. the SP region, except the architecture A which
has a small negative MPE value. The MPE of longitudinal
(a) (b) (c)
Fig. 6. Performance of convolutional neural network (CNN) architectures: architecture A, architecture B, architecture C, architecture D, and architecture E;
for regions of the North Atlantic Region (NA), the North Continental Region (NC), the North Pacific Region (NP), the South Atlantic Region (SA), the South
Indian Ocean Region (SI) and the South Pacific Region (SP). Top left bar chart (a) illustrates mean percentage error for each architecture in estimating the
latitudinal position of the mass centre of atmospheric blocks (ABs) per region and the top right chart (b) displays mean percentage error for each architecture
in estimating the longitudinal position of the mass centre of ABs per region. The bottom chart shows mean percentage error for each architecture in estimating
the radius of the mass centre of ABs per region. The ?symbol stands for a p-value 0.05.
parameter decreases statistically significantly (p-value 0.05)
for architecture A in the NP and SI regions; architecture B in
the SP region, and architecture E in the NA and SA regions.
The NC region is the exception to this in which there is no
sign of a significant decrease in the MPE values.
Figure 6 (c) displays the MPE of a radius of a minimum
bounding box located in the mass centre of AB events. We
can observe low MPE values for architectures in four out of
six regions (i.e., NA, NC, SA, and SI). In contrast, it can be
seen that the MPE values for all architectures are high in NP
and SP regions. In general, the chart reveals that architectures
tend to underestimate the radial parameter in five out of six
regions. The MPE of the parameter decreases statistically
significantly (p-value 0.05) for: architecture A in the NA
region; architecture B in the NP region; architecture C in the
SA region; and architecture D in the NC and SP regions.
B. Discussion
The results that we obtained indicate that it is possible
to accurately detect AB events in climate model simulation
products by using CNN architectures.
The ACC and F1 score values are the highest for all
architectures in three regions of the Northern Hemisphere (N-
H). That can be justified by the fact that AB events are much
more frequent in the N-H than in the Southern Hemisphere (S-
H). Moreover, ABs has important consequences on the weather
of densely populated Europe and North America, that is why
innumerable observational data improve the ability of climate
models to simulate AB patterns in the N-H correctly.
The results of the mean ACC and F1 score values fluctuate
over time. This variability in the values can be explained
by AB number of occurrence that is sometimes modulated
by other periodic climate regimes, for example, El Ni˜
no-
Southern Oscillation. We observe that the values of ACC or
F1 score increase significantly for CNN architectures with a
large number of parameters for various regions on the globe.
The values of the CCC are generally higher for all archi-
tectures in all regions of the N-H than for models in the
Southern Hemisphere. That can also be partially explained
by better observational characteristics of AB events and more
improvements in the ability of climate models to represent
correctly atmospheric dynamics of these events.
The values of the MPE of the latitudinal mass centre
position of AB events and the values of the longitudinal mass
centre position of these events are usually overestimated and
underestimated, respectively. Two reasons can cause that: in
the N-H, AB events tend to occur in particular longitudinal
and latitudinal localisation, however; in the S-H, these events
often occur at slightly lower latitude positions than in the N-
H. It can be observed that the values of the MPE decrease
statistically significantly for CNN detection models without
any particular trend in some geographical regions.
The values of the MPE of a radius of a minimum bounding
box located in the mass centre of AB events are relatively low
in most of the regions. That can suggest that despite a broad
spectrum of spatial location, AB events are probably more
similar in size in both hemispheres. Moreover, the MPE values
are much higher in the Pacific Ocean regions, which could
suggest that the climate model does not correctly represent
ABs spatial size due to the vast extent of the ocean.
VII. CONCLUSIONS AND FUTURE WOR K
In this paper, we study an identification problem of atmo-
spheric blocking patterns (ABs) in the climate dataset created
from a global climate model simulation output. To identify AB
events, we propose to apply a hierarchical pattern recognition
method that consists of two stages: the detection stage that is
formulated as a binary classification task and the localisation
stage that is defined as a regression problem. We explore five
different convolutional neural network (CNN) architectures
and evaluate their performance in six geographical regions
over the globe.
We show that the proposed method overall achieves high
detection accuracy of AB events and low estimation error of
their localisation in different regions. We observe a strong
relationship between the CNN architecture depth and the
performance in a specific geographical region, i.e. the Northern
Hemisphere (N-H) vs the Southern Hemisphere (S-H). That
can be explained by the fact that AB events tend to occur
much more frequently in the N-H than the S-H.
In the future work, we will explore 3D CNN architectures
to study the AB identification problem. We also plan to use
customised loss function that can be beneficial in this problem.
We will also attempt to reveal more statistical features of AB
events as more data will be collected.
ACKNOWLEDGMENT
We thank Marie McGraw (Colorado State University) for
sharing her expertise on atmospheric blocking phenomena,
and Steinfeld Daniel (ETH Zurich) and Stephan Pfahl (Freie
Universit¨
at Berlin) for sharing the blocking index dataset.
Grzegorz Muszynski and Vitaliy Kurlin would like to ac-
knowledge Intel for supporting the IPCC at the University of
Liverpool, UK. Prabhat, Jan Balewski, and Karthik Kashinath
were supported by the Intel Big Data Center at NERSC,
and Michael Wehner was supported by the Regional and
Global Climate Modeling Program of the Office of Biological
and Environmental Research in the Department of Energy
Office of Science under contract no. DE-AC02-05CH11231.
This research used resources of the National Energy Research
Scientific Computing Center, a DOE Office of Science User
Facility supported by the Office of Science of the US Depart-
ment of Energy under contract no. DE-AC02-05CH11231.
REFERENCES
[1] D. F. Rex, “Blocking action in the middle troposphere and its effect
upon regional climate,” Tellus, vol. 2, no. 4, pp. 275–301, 1950.
[2] S. Tibaldi and F. Molteni, “Atmospheric blocking in observation and
models,” in Oxford Research Encyclopedia of Climate Science, 2018.
[3] S. Pfahl and H. Wernli, “Quantifying the relevance of atmospheric
blocking for co-located temperature extremes in the northern hemisphere
on (sub-) daily time scales,” Geophysical Research Letters, vol. 39,
no. 12, 2012.
[4] T. Buehler, C. C. Raible, and T. F. Stocker, “The relationship of winter
season north atlantic blocking frequencies to extreme cold or dry spells
in the era-40,” Tellus A: Dynamic Meteorology and Oceanography,
vol. 63, no. 2, pp. 174–187, 2011.
[5] G. Gangoiti, L. Alonso, M. Navazo, A. Albizuri, G. Perez-Landa,
M. Matabuena, V. Valdenebro, M. Maruri, J. A. Garc´
ıa, and M. M.
Mill´
an, “Regional transport of pollutants over the bay of biscay: analysis
of an ozone episode under a blocking anticyclone in west-central
europe,” Atmospheric Environment, vol. 36, no. 8, pp. 1349–1361, 2002.
[6] C.-C. Hong, H.-H. Hsu, N.-H. Lin, and H. Chiu, “Roles of european
blocking and tropical-extratropical interaction in the 2010 pakistan
flooding,” Geophysical Research Letters, vol. 38, no. 13, 2011.
[7] R. Garc´
ıa-Herrera, J. D´
ıaz, R. M. Trigo, J. Luterbacher, and E. M.
Fischer, A review of the european summer heat wave of 2003, Critical
Reviews in Environmental Science and Technology, vol. 40, no. 4, pp.
267–306, 2010.
[8] T. M. Hall and A. H. Sobel, “On the impact angle of hurricane sandy’s
new jersey landfall, Geophysical Research Letters, vol. 40, no. 10, pp.
2312–2315, 2013.
[9] M. C. Pinheiro, P. Ullrich, and R. Grotjahn, “Atmospheric blocking and
intercomparison of objective detection methods: flow field characteris-
tics,” Climate Dynamics, pp. 1–28, 2019.
[10] A. Rupe, N. Kumar, V. Epifanov, K. Kashinath, O. Pavlyk, F. Schlim-
bach, M. Patwary, S. Maidanov, V. Lee, M. Prabhat et al., “Disco:
Physics-based unsupervised discovery of coherent structures in spa-
tiotemporal systems,” in 2019 IEEE/ACM Workshop on Machine Learn-
ing in High Performance Computing Environments (MLHPC). IEEE,
2019, pp. 75–87.
[11] G. Muszynski, K. Kashinath, V. Kurlin, M. Wehner et al., “Topological
data analysis and machine learning for recognizing atmospheric river
patterns in large climate datasets,” Geoscientific Model Development,
vol. 12, no. 2, pp. 613–628, 2019.
[12] Y. Liu, E. Racah, J. Correa, A. Khosrowshahi, D. Lavers, K. Kunkel,
M. Wehner, W. Collins et al., Application of deep convolutional neural
networks for detecting extreme weather in climate datasets, arXiv
preprint arXiv:1605.01156, 2016.
[13] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification
with deep convolutional neural networks, in Advances in neural infor-
mation processing systems, 2012, pp. 1097–1105.
[14] K. Simonyan and A. Zisserman, “Very deep convolutional networks
for large-scale image recognition,” in Proceedings of the International
Conference on Learning Representation (ICLR, 2015.
[15] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan,
V. Vanhoucke, and A. Rabinovich, “Going deeper with convolutions,
in Proceedings of the IEEE conference on computer vision and pattern
recognition, 2015, pp. 1–9.
[16] E. Racah, C. Beckham, T. Maharaj, S. E. Kahou, M. Prabhat, and C. Pal,
“Extremeweather: A large-scale climate dataset for semi-supervised
detection, localization, and understanding of extreme weather events,
in Advances in Neural Information Processing Systems, 2017, pp. 3402–
3413.
[17] M. Mudigonda, S. Kim, A. Mahesh, S. Kahou, K. Kashinath,
D. Williams, V. Michalski, T. O’Brien, and M. Prabhat, “Segmenting
and tracking extreme climate events using neural networks, in Deep
Learning for Physical Sciences (DLPS) Workshop, held with NIPS
Conference, 2017.
[18] O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma,
Z. Huang, A. Karpathy, A. Khosla, M. Bernstein et al., “Imagenet large
scale visual recognition challenge,” International journal of computer
vision, vol. 115, no. 3, pp. 211–252, 2015.
[19] T. Glickman, American meteorological society, Glossary of Meteorol-
ogy. 2nd ed. Boston, MA: American Meteorological Society, 2000.
[20] C.-G. Rossby, “On the dynamics of certain types of blocking waves,”
Journal of the Chinese Geophysical Society, vol. 2, pp. 1–13, 1951.
[21] N. Nakamura and C. S. Huang, “Atmospheric blocking as a traffic jam
in the jet stream,” Science, vol. 361, no. 6397, pp. 42–47, 2018.
[22] P. Davini and F. D’Andrea, “Northern hemisphere atmospheric blocking
representation in global climate models: Twenty years of improve-
ments?” Journal of Climate, vol. 29, no. 24, pp. 8823–8840, 2016.
[23] R. M. Dole and N. D. Gordon, “Persistent anomalies of the extratropical
northern hemisphere wintertime circulation: Geographical distribution
and regional persistence characteristics,” Monthly Weather Review, vol.
111, no. 8, pp. 1567–1586, 1983.
[24] S. Tibaldi and F. Molteni, “On the operational predictability of block-
ing,” Tellus A, vol. 42, no. 3, pp. 343–365, 1990.
[25] A. A. Scaife, T. Woollings, J. Knight, G. Martin, and T. Hinton,
“Atmospheric blocking and mean biases in climate models, Journal
of Climate, vol. 23, no. 23, pp. 6143–6152, 2010.
[26] C. Schwierz, M. Croci-Maspoli, and H. Davies, “Perspicacious indi-
cators of atmospheric blocking,” Geophysical research letters, vol. 31,
no. 6, 2004.
[27] M. Sprenger, G. Fragkoulidis, H. Binder, M. Croci-Maspoli, P. Graf,
C. M. Grams, P. Knippertz, E. Madonna, S. Schemm, B. ˇ
Skerlak et al.,
“Global climatologies of eulerian and lagrangian flow features based on
era-interim,” Bulletin of the American Meteorological Society, vol. 98,
no. 8, pp. 1739–1748, 2017.
[28] J. M. Wiedenmann, A. R. Lupo, I. I. Mokhov, and E. A. Tikhonova,
“The climatology of blocking anticyclones for the northern and southern
hemispheres: Block intensity as a diagnostic,” Journal of Climate,
vol. 15, no. 23, pp. 3459–3473, 2002.
[29] Q. McNemar, “Note on the sampling error of the difference between
correlated proportions or percentages,” Psychometrika, vol. 12, no. 2,
pp. 153–157, 1947.
[30] F. Wilcoxon, “Probability tables for individual comparisons by ranking
methods,” Biometrics, vol. 3, no. 3, pp. 119–122, 1947.
... As another approach, a growing number of machine learning methods have been developed in recent years, * shihori-koyama@mosk.tytlabs.co.jp which learn from observable time-series data and subsequently predict unobserved climatic elements and their future values. For example, they have been used for the prediction of climatic variables such as temperature and pressure [2][3][4][5][6][7][8][9][10][11], the detection of phenomena and patterns associated with extreme weather such as cyclones and atmospheric blocking phenomena [12][13][14][15][16][17], and the applications related to climatic factors such as predictions of crop yield [18,19] and water temperature in lakes [20]. These approaches are well suited to climate data, where immediacy of prediction is required, because once model parameters are learned from the data they can be used to make fast predictions. ...
Preprint
Collecting time series data spatially distributed in many locations is often important for analyzing climate change and its impacts on ecosystems. However, comprehensive spatial data collection is not always feasible, requiring us to predict climate variables at some locations. This study focuses on a prediction of climatic elements, specifically near-surface temperature and pressure, at a target location apart from a data observation point. Our approach uses two prediction methods: reservoir computing (RC), known as a machine learning framework with low computational requirements, and vector autoregression models (VAR), recognized as a statistical method for analyzing time series data. Our results show that the accuracy of the predictions degrades with the distance between the observation and target locations. We quantitatively estimate the distance in which effective predictions are possible. We also find that in the context of climate data, a geographical distance is associated with data correlation, and a strong data correlation significantly improves the prediction accuracy with RC. In particular, RC outperforms VAR in predicting highly correlated data within the predictive range. These findings suggest that machine learning-based methods can be used more effectively to predict climatic elements in remote locations by assessing the distance to them from the data observation point in advance. Our study on low-cost and accurate prediction of climate variables has significant value for climate change strategies.
... Furthermore, they can handle mixed data types, i.e. they could also include qualitative data on past weather in a categorised form. Nowadays, artificial intelligence is commonly used for 50 classification and pattern recognition in meteorological and climatological research, ranging from detection of extreme events (Racah et al., 2017;Chattopadhyay et al., 2020), frontal systems (Dagon et al., 2022;Bochenek et al., 2021;Biard and Kunkel, 2019), blocking situations (Muszynski et al., 2021;Thomas et al., 2021) or storms and cyclone tracks (Accarino et al., 2023;Kumler-Bonfanti et al., 2020;Mittermeier et al., 2019;Williams et al., 2008). In the context of WT classifications, ML is still a rather novel approach. ...
Preprint
Full-text available
Weather types are used to characterise large–scale synoptic weather patterns over a region. Long–standing records of weather types hold important information about day–to–day variability and changes of atmospheric circulation and the associated effects on the surface. However, most weather type reconstructions are restricted in their temporal extent as well as in the accuracy of the used methods. In our study, we assess various machine learning approaches for station–based weather type reconstruction over Europe based on the CAP9 weather type classification. With a common feedforward neural network performing best in this model comparison, we reconstruct a daily CAP9 weather type series back to 1728. The new reconstructions constitute the longest daily weather type series available. A detailed validation shows considerably better performance compared to previous statistical approaches and good agreement with the reference series for various climatological analyses. Our approach may serve as a guide for other weather type classifications.
Article
Full-text available
Objective methods for identifying and quantifying atmospheric blocking have been developed over recent decades, primarily targeting North Atlantic blocks. Differences arise from these methods, leading to changes in the resultant blocking climatology. To understand these differences, and better inform future assessments built on quantitative detection of blocks, this paper examines blocking properties produced by three different objective detection algorithms over the global extratropics. Blocking criteria examined include 500 hPa geopotential height anomaly (ZZ^*), column-averaged potential vorticity anomaly (PVPV^*), and 500 hPa geopotential height gradient (AGP). Results are analyzed for blocking climatologies and for instantaneous blocking patterns, as well as distributions of block size, speed, duration, and distance traveled. The results emphasize physical characteristics of the flow field and the subsequent blocking regions that emerge; overall, PVPV^* and ZZ^* blocked regions often have higher pattern correlation and spatial similarity, though these two methods also display high agreement with AGP in some instances.ZZ^* finds the largest (and greatest number of) blocked regions, while PVPV^*-detected regions are smallest in all instances except Southern Hemisphere winter. In some cases, PVPV^* tracks a nearby jet streak, leading to differences with height-based algorithms. All three algorithms detect some questionable low-latitude blocks that are stationary and persist but do not impair zonal flow, although at different times. Therefore, careful consideration of the algorithm biases is important in future blocking studies. For example, linking extreme weather to detected blocking could vary substantially depending on the algorithm used.
Article
Full-text available
Identifying weather patterns that frequently lead to extreme weather events is a crucial first step in understanding how they may vary under different climate change scenarios. Here, we propose an automated method for recognizing atmospheric rivers (ARs) in climate data using topological data analysis and machine learning. The method provides useful information about topological features (shape characteristics) and statistics of ARs. We illustrate this method by applying it to outputs of version 5.1 of the Community Atmosphere Model version 5.1 (CAM5.1) and the reanalysis product of the second Modern-Era Retrospective Analysis for Research and Applications (MERRA-2). An advantage of the proposed method is that it is threshold-free – there is no need to determine any threshold criteria for the detection method – when the spatial resolution of the climate model changes. Hence, this method may be useful in evaluating model biases in calculating AR statistics. Further, the method can be applied to different climate scenarios without tuning since it does not rely on threshold conditions. We show that the method is suitable for rapidly analyzing large amounts of climate model and reanalysis output data.
Article
Full-text available
The results of a semi-statistical study of 112 cases of blocking action in the upper westerlies are presented; areas of most frequent occurrence, characteristic movement and persistence, and seasonal and yearly trends of blocking activity are determined. The regional precipitation and mean surface-temperature anomalies associated with blocking over Europe are illustrated and compared with those produced by strong zonal flow aloft. The effects produced by variations in blocking activity upon European climatic trends are discussed. The influence of blocking action upon Scandinavian glaciation and certain indications as to the role played by blocking in an ice-age climatic cycle are discussed. DOI: 10.1111/j.2153-3490.1950.tb00339.x
Article
Full-text available
This paper introduces a newly compiled set of feature-based climatologies identified from ERA-Interim reanalyses (1979–2014). Two categories of flow features are considered: (i) Eulerian climatologies of jet streams, tropopause folds, surface fronts, cyclones and anticyclones, blocks, and potential vorticity (PV) streamers and cutoffs, and (ii) Lagrangian climatologies, based on a large ensemble of air-parcel trajectories, of stratosphere-troposphere exchange (STE), warm conveyor belts (WCB), and tropical moisture exports (TME). Monthly means of these feature climatologies are openly available at the ETH Zurich webpage http://eraiclim.ethz.ch and annually updated. Datasets at higher resolution can be obtained from the authors on request. These feature climatologies allow studying the frequency, variability, and trend of atmospheric phenomena and their inter-relationships across temporal scales. In order to illustrate the potential of this dataset, boreal winter climatologies of selected features are presented and, as a first application, the very unusual Northern Hemispheric winter of 2009/2010 is identified as the season when most of the considered features show maximum deviations from climatology. The second application considers dry winters in the western U.S. and reveals fairly localized anomalies in the eastern North Pacific of enhanced blocking and surface anticyclones, and reduced cyclones.
Article
A traffic jam of air Persistent meandering of the jet stream can cause atmospheric blocking of prevailing eastward winds and result in weather extremes such as heat waves in the midlatitudes. Nakamura and Huang interpret the poorly understood origins of these systems as the meteorological equivalents of traffic congestion on a highway and show how they can be described by analogous mathematical theory. Climate change may affect the frequency of blocking as well as its geographic distribution, reflecting a simultaneous shift in the structure of the stationary atmospheric waves and the regional capacity of the jet stream. Science , this issue p. 42
Conference Paper
We trained a large, deep convolutional neural network to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 dif- ferent classes. On the test data, we achieved top-1 and top-5 error rates of 37.5% and 17.0% which is considerably better than the previous state-of-the-art. The neural network, which has 60 million parameters and 650,000 neurons, consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax. To make training faster, we used non-saturating neurons and a very efficient GPU implemen- tation of the convolution operation. To reduce overfitting in the fully-connected layers we employed a recently-developed regularization method called dropout that proved to be very effective. We also entered a variant of this model in the ILSVRC-2012 competition and achieved a winning top-5 test error rate of 15.3%, compared to 26.2% achieved by the second-best entry
Article
Atmospheric blocking is an important source of low-frequency variability. We apply a blocking detection and tracking method to ERA-40 data for the Atlantic-European region to assess linkages to extreme events, that is, cold and dry spells in the extended winter season (November-April). The method is feature-oriented, identifies 500-hPa geopotential height maxima, and connects them with a next-neighbourhood search in time. The analysis reveals a statistically significant decrease of number of blocking events over the period of ERA-40. Winters with an increased number of blocking events are associated with negative temperature anomalies over Central to Eastern Europe and dryer conditions, whereas Southern Europe experiences warmer and wetter conditions during such episodes. Using extreme value statistics we show evidence that cold spells, and to some extent dry spells, are strongly associated with extremes in blocking frequency. The number of cold spell days increases with the duration of blocking events indicating that cold spells need time to establish during blocking events, thus return periods of cold spells are longer than those for blocking events. This behaviour is not found for the relation of dry spells with blocking events. Still, blocking episodes have a higher occurrence of dry spells than climatology.
Article
The correct simulation of midlatitude atmospheric blocking has always been a main concern since the earliest days of numerical modeling of Earth's atmosphere. To this day blocking represents a considerable source of error for general circulation models from both a numerical weather prediction and a climate perspective. In the present work, 20 years of global climate model (GCM) developments are analyzed from the special point of view of Northern Hemisphere atmospheric blocking simulation. Making use of a series of equivalent metrics, three generations of GCMs are compared. This encompasses a total of 95 climate models, many of which are different-successive-versions of the same model. Results from model intercomparison projects AMIP1 (1992), CMIP3 (2007), and CMIP5 (2012) are taken into consideration. Although large improvements are seen over the Pacific Ocean, only minor advancements have been achieved over the Euro-Atlantic sector. Some of the most recent GCMs still exhibit the same negative bias as 20 years ago in this region, associated with large geopotential height systematic errors. Some individual models, nevertheless, have improved and do show good performances in both sectors. Negligible differences emerge among ocean-coupled or atmosphere-only simulations, suggesting weak relevance of sea surface temperature biases. Conversely, increased horizontal resolution seems to be able to alleviate the Euro-Atlantic blocking bias.