Conference PaperPDF Available

Abstract and Figures

The forecast of the occurrence of voltage sags at the sites of a system is nowadays feasible thanks to the availability of huge quantity of recorded data. To forecast future performance from the statistical analysis of recorded sags, the stochastic modelling of the voltage sags is required since the events are not statistically time independent. The presence of groups of sags, named clusters, brings the phenomenon far from the conditions of Poisson model. This paper proposes the Gamma distribution to model the sags, which also include the clusters. Different techniques for assessing the parameters of the Gamma distribution are presented and applied to forecast the number of sags expected at selected sites in the year 2018, i.e., the year successive to those when the sags were measured. The outcomes of the forecast are compared with the sags effectively occurred in those sites in the year 2018, using different criteria for evaluating the forecast error. The results showed the viability of the approach and encourage further studies to improve the accuracy and extend the forecast to entire systems.
Content may be subject to copyright.
978-88-87237-50-4 ©2021 AEIT
Stochastic Model to Forecast the Voltage Sags
in Real Power Systems
Leonardo Di Stasio, Paola Verde,
Pietro Varilone
Department of Ingegneria elettrica e
dell’informazione “Maurizio Scarano”
Università degli studi di Cassino e del
Lazio Meridionale
Cassino
leonardo.distasio@unicas.it,
verde@unicas.it, varilone@unicas.it
Michele De Santis
Department of Ingegneria
Università Niccolò Cusano di Roma
Roma
michele.desantis@unicusano.it
Christian Noce
Enel X SrL
Roma
christian.noce@enel.com
Abstract— The forecast of the occurrence of voltage sags at
the sites of a system is nowadays feasible thanks to the
availability of huge quantity of recorded data. To forecast future
performance from the statistical analysis of recorded sags, the
stochastic modelling of the voltage sags is required since the
events are not statistically time independent. The presence of
groups of sags, named clusters, brings the phenomenon far from
the conditions of Poisson model. This paper proposes the
Gamma distribution to model the sags, which also include the
clusters. Different techniques for assessing the parameters of the
Gamma distribution are presented and applied to forecast the
number of sags expected at selected sites in the year 2018, i.e.,
the year successive to those when the sags were measured. The
outcomes of the forecast are compared with the sags effectively
occurred in those sites in the year 2018, using different criteria
for evaluating the forecast error. The results showed the
viability of the approach and encourage further studies to
improve the accuracy and extend the forecast to entire systems.
Keywords— Voltage Sag, Voltage sag monitoring, Poisson
process, Gamma distribution
I. INTRODUCTION
he voltage sags are among the most critical power1
quality (PQ) disturbances affecting the voltage [1].
The most frequent causes of a voltage sag in the
networks are the faults. The short circuit current flowing in
the system until the action of the protection system produces
the rapid decrease of the voltage, which can propagate in
large sections of the system.
The relevant literature recognized from several years the
severity of the phenomenon, proposing methods and tools
mainly aimed to estimate the average performance expected
at the system sites in terms of sag frequency, usually
computed on yearly basis. The largest part of the papers of
the literature proposed model-based simulations of the
systems in short circuit conditions, due to both symmetrical
and unsymmetrical faults. The voltages during the faults
obtained by the simulations are then correlated with the
average values of the fault rates of the busses or of the lines
to obtain the expected frequencies of the sags at the sites of
the system [2-12].
The authors L.Di Stasio, P.Varilone, P.Verde acknowledge the financial
support by MIUR through the Special Grant “Dipartimenti di eccellenza”
Lex. N. 232/31.12.2016. The paper was also developed inside the grant
Recently, some papers have approached the problem of
assessing the future occurrence of the sags at network sites
from a completely different point of view [13-15]. The main
novelty was to pass from the estimation of the average
performance of a system obtained from simulations to the
forecast of the number of sags at the sites of a system obtained
from the statistics of recorded sags. Statistical approaches
used the observed history to describe and forecast the future
performance. The statistical models take into account the
effect of external random events on the variable of interest.
Considering the voltage sags as a random phenomenon
which, depends on multiple conditions, also on time, the
statistical approaches are the best way for forecasting voltage
sags as they are built based on previous experiences. This
new approach was made possible, above all, by the ever-
increasing availability of data measured from the field. All
around the world, several campaigns of sag measurement are
active [16-17]. They are aimed to regulate the PQ levels of
the system, driven by rules and resolutions of the national
energy Authorities, and/or to assess the real conditions of
own networks, driven by internal targets of DSOs
(Distribution System Operators) and TSOs (Transmission
System Operators). In Italy, for example, the resolutions [18-
19] required all the DSOs to install at all the MV (Medium
Voltage) busbars of all the HV/MV (High Voltage/Medium
Voltage) stations measurement systems to measure and
record the occurred voltage sags. This produced large data
sets of measured sags in Italy from 2014, which allowed
developing methods to ascertain the origin of voltage sags
[20] also in the presence of GD [21].
The analysis of the sags recorded at the sites of a real
system evidenced that the sags are composed not only of rare
events but also of groups of events close each other in the
time.
Concerning the rare events, they are sags with a frequency
in each site in the range from some sags at year to some sags
at month. All the relevant literature was aimed to estimate the
average performance of a systems, focusing the attention on
rare sags [7-15].
Concerning the groups of sags, we can distinguish them
in two main categories as function of the time between every
“Attività di supporto scientifico sulle tematiche di Power Quality nelle reti
di distribuzione” with E-Distribuzione.
T
events. The first category encloses groups of sags close each
other in the time with a distance less than few seconds. They
are due to automatic closure maneuvers of the protection
systems and/or to incorrect recordings. This typology of
groups of sags are well known and are manipulated by the
time aggregation. All the sags, grouped within a given time
interval, are considered as a single event, which is the first of
the group [22-23]. The second category of groups of sags,
called sag clusters includes groups of sags close each other in
the time with a distance larger than few seconds up to some
hours. They can be due to exogenic factors, such as rain,
lightning strikes, wind, and other adverse weather conditions.
The sag clusters are not affected by the time aggregation and,
consequently, they are present in all the recorded sags even
after the time aggregation was carried out.
The presence of the sag clusters [13-16] makes the sag
frequency dependent on the time. The sags, which contain
both rare sags and sag clusters, present the features of a
stochastic process. They do not have the characteristics of
memoryless events and, consequently, cannot be modeled as
a Poisson process. To make successful forecast, filtering the
sag clusters out from all the measured sags is crucial [14-15].
The use of adequate intermittent indices to detect and filter
out the sag clusters [15] leaves only rare voltage sags which
can be used to forecast the future occurrence at the nodes of
a real system with an accuracy up to 10%. This important
result was obtained also thanks to the choice of the time to
next event, namely ttne, as the statistical variable to describe
the phenomenon.
In this paper, we present the first approach aimed to
predict all the sags, which contain both rare sags and sag
clusters. The statistical model of such time-dependent events
must respect the stochastic nature of the phenomenon. It is
possible by means of the Gamma distribution of ttne. This
model was first proposed in [16] to describe the recorded
data. In this paper, we extend this model to the statistical
analysis of the sags recorded from 1 January 2015 to 31
December 2017 with the specific target to forecast the
expected number of voltage sags in 2018. The sags were
recorded at a regional electric system, whose main
characteristics are reported in [15]. The availability of the
sags really occurred in the year 2018 allowed us to evaluate
the errors of the forecast.
Modelling a process by the Gamma distribution requires
to select two parameters, that is the shape parameter α, and
the scale parameter β. In this paper, we present the application
of both analytical and graphical methods to establish the
values of
α
and
β
, which statistically describe the random
variable ttne with minimum error.
The paper is organized as follows: Section II presents the
approach that was used to describe the stochastic model of all
the measured sags. Section III illustrates the methods adopted
for establishing the parameters of the Gamma distribution.
Section IV describes the methods used to assess theoretical
hypothesis. Section V shows the results of the statistical
validation and of the forecasting activity.
II. STOCHASTIC MODELING THE SAGS BY THE GAMMA
DISTRIBUTION
Most of the papers dealing with the estimation of average
performance of a system in terms of voltage sags proposed
methods and tools based on the simulation of the system in
short circuit conditions [2-7]. The basic assumption is that the
sags are due to faults in the nodes and/or along the lines. The
expected annual frequency of the sags is successively derived
correlating the frequency of the causes, the faults, with the
frequency of the effects, the sags. The subjects of all these
methods are the rare voltage sags, i.e. sudden under-voltages
lower that the 90% threshold that occur with a temporal
distance ranging from a few hours to a few months. The rare
voltage sags can be statistically modelled as a Poisson process,
a series of discrete memory-less and time independent events.
For this type of sags, the distribution of the random variable
ttne, which measured the waiting time until the occurrence of
the next event, follows the exponential density function, that
is:
 (1)
where λ is the sag rate constant with the time.
The measurements of the sags in real systems reveal that
the sags are not only rare, but there are also clusters of sags,
i.e. groups of sags close each other in time with a distance
between them from some seconds to some hours, as said in the
Introduction. As an example, Fig.1 shows the scatter plot of
the voltage sags measured on 17 January 2017 from 00:00 am
to 24:00 pm at the busbar #38 of the regional system described
in [15]. Every blue dot represents the time of occurrence of the
measured voltage sags. The rare sags are highlighted with
black circles and the sag clusters with red circles.
The sags belonging to clusters do not have the statistical
characteristics of a Poisson process since they are events
related to each other, often they have a common origin, for
example an adverse atmospheric phenomenon or fires, and for
these reasons, their time independence cannot be assumed. If
the statistical analysis is conducted on the measured sags, the
only way to apply the distribution given by (1) is to filter and
remove the clusters of sags, leaving only the rare sags in the
data to be analysed, as done in [13-15]. Without applying
these filters, the voltage sags present the characteristics of a
stochastic process, that is a collection of indexed random
variables, ttne in this case, defined on a common probability
sample space which depends on the time. A stochastic model
of voltage sag occurrences is mandatory to describe their
random appearance in the time, with large periods of absences,
when only rare sags occur, and others when sags appear as
clusters. To model all the recorded sags, the assumption of
constant sag rate is not valid, and the distribution of the
random variable ttne is given by [15-16]:


 (2)
where λ(t) is the sag rate variable with the time.
Fig.2 shows the probability density function (pdf), of ttne of
the voltage sags recorded at the site of example #38. It is
Fig. 1. Scatter plot of the voltage sags measured on 17/01/2017 from
00:00 am to 24 pm at the site #38.
Fig. 2. Histogram of probability density function of the random variable
ttne measured in the site #38.
evident a thickening of the ttne values in the first class. The
width of the first class is equal to 2.25 × 105s (62.5 hours), so
the probability that the value of the random variable ttne is
between 3s2 and 2.25 × 105s is equal to 81%. The first class of
the histogram in Fig. 2 is linked to the presence of clusters of
sags [15]. For successive classes, there is a decay of the pdf
values. Fig.3 shows the empirical cumulative density function
(ecdf) corresponding to the pdf of Fig.2; the plot of Fig.3 (b)
is zoomed in the range of low ttne.
From Fig. 3 (a), it is evident that the values of ttne range
from 3 s to 2.3 × 106 s (639 hours), but Fig. 3 (b) revealed that
the greatest probability, up to 60%, is already reached by ttne
less than 5 × 103s (1.4 hours), leaving the remaining 40% to
the ttne up to 2.3 × 106 s. This feature of the recorded data,
evident also for other sites beside site #38, proved that the sag
rate is not constant.
As proposed in [16], the random variable ttne of all the
sags, both rare and clusters, can be described by the Gamma
distribution as:
 !""#$!%&$%'(()*
+, (3)
where α and β are the shape and the scale parameters,
respectively.
The corresponding cdf is given by:
-./0%&
1023456
17

(4)

831
23
where γ (
α
,
β,
ttne) is the incomplete Gamma function and
Γ(
α
) is the Gamma function. The mean value of the Gamma
distribution is equal to:
9
:
;
<
.
/
(5)
For β = 1, and for different values of the shape parameter
α, Fig. 4 shows the theoretical plots of the pdf and of the cdf
of the Gamma distribution of a generic random variable t in
seconds. Fig. 4 shows that, for a given value of
β
, the
variation of
α
implies completely different waveshapes of the
Gamma pdf and cdf.
For
α
from 0.1 to 0.5, the pdf in Fig. 4 (a) presents a
thickening of observations close to zero, as derived by the
2 3 s is the minimum value of ttne after the time aggregation [9].
Fig. 3. Empirical cumulative density function of the random variable ttne
for the site #38; (a): complete plot; (b) plot zoomed in the range [0,5000]s.
Fig. 4. Theoretical probability density function (a), and cumulative
function (b) of the Gamma distribution.
empirical distribution of all the measured sags at the site #38
showed in Fig.2. For
α
equal to 1, the Gamma cdf is equal to
the exponential cdf with mean parameter λ equal to the scale
parameter β of the Gamma distribution.
Table I shows different percentiles of the theoretical
Gamma cdf of a random variable t, with β =1 and
α
ranging
from 0.1 to 2. For
α
equal to 0.1, 10% of t is less than 2.6 × 10 -
5, i.e. very close to zero, and 70% of the values are not greater
than 1.7 h. As the shape parameter
α
increases, the 10th
percentile increases too. For
α
equal to 2, 10% of t is less than
about 42 h, and 70% of the values are not greater than 194.4
h.
In conclusion, the use of Gamma distribution requires the
correct estimation of the parameters
α
and
β
, as shown in the
successive section.
III. STATISTICAL ANALYSIS OF DATA TO DETERMINE THE
PARAMETERS OF THE GAMMA DISTRIBUTION
Modeling ttne of all the measured sags by the Gamma
distribution requires to establish the two parameters
α
and
β
.
TABLE I. PERCENTILE VALUES OF THE GAMMA DISTRIBUTION WITH
β =1 AND α RANGING FROM 0.1 TO 2.
Percentile
α
0.1
0.5
1
1.5
2
P10%
[s]
2.6 × 10-5 2.4 × 103 3.1 × 104 8.4 × 104 1.5 × 105
[h]
7.2 × 10-9 0.7 8.6 23.3 41.7
P30%
[s]
4.3 2.8 × 104 1.2 × 105 2.3 × 105 3.4 × 105
[h]
1.2 × 10-3 7.8 33.3 63.9 94.4
P50%
[s]
303.2 7.3 × 104 2.1 × 105 3.5 × 105 5.0 × 105
[h]
8.4 × 10-2 20.3 58.3 97.2 138.9
P70%
[s]
6.0 × 103 1.6 × 105 3.5 × 105 5.3 × 105 7.0 × 105
[h]
1.7 44.4 97.2 147.2 194.4
P90%
[s]
7.3 × 104 3.8 × 105 6.4 × 105 8.7 × 105 1.1 × 106
[h]
20.3
105.6
177.8
241.7
305.6
Different techniques are available for the optimal selection of
statistical parameters of an assumed pdf [24-27]. The
Maximum Likelihood Estimation (MLE) is a very popular
technique used for determining the parameters of a
distribution chosen to describe observed data. The obtained
parameters obtained by MLE can be successively optimized in
function of an analytical criterium or applying a graphical
method, as done in this paper.
In the following, we first show the application of MLE to
the case of Gamma distribution, and after we describe two
methods, one analytical and one graphical, which we used to
optimize the parameters obtained by MLE. These methods are
the Hill Climbing Algorithm (HCA) and the Quantile-Quantile
plot (Q-Q plot), respectively.
- MLE method
The searched values of the parameters
α
and
β
are found
such that they maximise the likelihood that the process
described by the pdf produced the data that were observed.
Let:
- 4=
> = (xi,1, xi,2, …, xi,N) be the vector of the N observations
at the site i of the random variable xi;
- f0 be the pdf which describes the observations and depends
on the parameters included in the vector ?
>;
- Ө be the space of all the possible values of the vector ?
>,
i.e., ?
> ϵ Ө;
the application of MLE implies that f0 belongs to the family of
Gamma distributions obtainable by different values of the
parameters α and β, which are the components of the vector
?
>.
The likelihood function @4=
>A?
> is the product of the
individual pdf of all the N samples of the random variables,
assumed independent, that is:
@
4
=
>
A
?
>
B
C
D
4
=
>
E
F
G
H
?
>
).
(6)
For the case of Gamma distribution, the likelihood
function is equal to:
@
4
=
>
A
.
/
B
1
0
%
I
J
K
L
M
K
L
0
%
&
2
3
E
F
G
H
.
(7)
The Log-likelihood function, N4O
>A?
>, is the natural
logarithm of @4O
>A?
>, i.e.:
N
4
=
>
A
?
>
PQ
@
4
=
>
A
?
>
.
(8)
Putting equation (7) in (8), the Log-likelihood function for
the Gamma distribution with shape parameter α and scale
parameter βN4=
>A./ is defined as:
N
4
=
>
A
.
/
PQ
B
1
0
%
I
J
K
L
M
K
L
0
%
&
2
3
E
F
G
H
.
(9)
Applying the logarithm properties and after some
simplifications, the final expression N4=
>A./ is equal to:
N
4
=
>
A
.
/
R.
PQ
/
-
/
S
4
T
F
E
F
G
H
+(α-
1)×
S
PQ
4
T
F
E
F
G
H
- N
PQ
U
.
.
(10)
Maximizing equation (10) with respect of the parameters
α and β, we can derive the so called MLE estimators of the
parameters. With this aim, equalling to zero the derivative of
N4=
>A./ in (10) with respect to
β
, the maximum likelihood
estimator of the parameter β is:
β =
E
3
S
M
K
L
V
L
W
&
.
(11)
Substituting equation (11) in (10), we obtain
N
.
R.
N
R.
6
R.
N
S
4
T
F
E
F
G
H
6
R.
+
.
S
N
E
F
G
H
4
T
F
6
S
N
4
T
F
E
F
G
H
6
RN
U
.
.
(12)
Equalling to zero the derivative of l(α) with respect to
α
:
X
Y
X
3
RN
R.
Z
R.
H
3
6
R
N
S
4
T
F
E
F
G
H
6
R
Z
S
N
E
F
G
H
4
T
F
– Nψ(α) = 0
(13)
where ψ (α) 23[
23 is the Digamma function.
After some simplifications on (13), we obtain:
PQ
R.
– ψ(α) =
PQ
S
4
T
F
E
F
G
H
-
S
Y
V
L
W
&
M
K
L
E
\
(14)
Equation (14) gives the optimal value of α in implicit form,
which can be solved by a numerical method.
- Hill Climbing Algorithm (HCA)
HCA is an iterative algorithm that starts with an arbitrary
solution and, in accordance with a function to be maximized
or minimized, attempts to find a better solution by making an
incremental change to the parameters. The optimal value of
the parameters is found by using a test function [27]. In the
case of this study, the arbitrary solution is given by the cdf of
the Gamma distribution with
α
and
β
obtained by MLE, and
the test function is the Chi Squared Test (CS). Let observations
be divided into M bins:
- F is the theoretical cdf;
- Oi,1, Oi,2,…,Oi,N are the N observed values of each
bins,
- the χ2 value is equal to:
]
^
=
S
S
_
K
`
V
K
W
&
S
ab
K
`
V
K
W
&
c
S
ab
K
`
V
K
W
&
d
e
G
H
(15)
where 9fe are the expected values for each bin computed as:
9f
e
=
C
g
h
R
6
C
i
h
R
(16)
where N is the data size, a is the start value of the kth bin and
b is the end value of the kth bin, (b-a) is the length of the kth
bin, F(a) and F(b) are the values of the cdf of the Gamma
distribution computed in a and b, respectively.
The searched optimal values of the parameters
α
and
β
are those corresponding to the minimum value of χ2.
- Quantile-Quantile plots (Q-Q plots)
Q-Q plot is one of the graphical tools used to make
distributional analyses between an empirical distribution
derived from a dataset and a theoretical distribution. It is based
on the plots of the quantiles of an empirical distribution
derived from a dataset against the corresponding quantiles of
the theoretical distribution. The two main analyses allowed by
the Q-Q plots are the distributional comparisons between the
two distributions [14-16], and the estimation of one
distribution parameter of the theoretical distribution. This
second target was used in this study.
The theoretical distribution was the Gamma distribution
with
α
obtained by MLE, and we used the Q-Q plot to estimate
the value of
β
. To this aim, the values of the quantiles of the
data Qd are equal to:
j
k

l
(17)
and the quantiles of the theoretical distribution Qt are equal to:
j
C
H
H
m
(18)
where F-1 is the inverse cdf of the standard Gamma function
with β = 1 and
α
estimated with MLE.
The searched value of the parameter β of the Gamma
distribution is equal to the slope of the regression line:
/
n
m
(19)
Table II shows the results for three sites of example
obtained applying the three different methods above recalled,
that is α and β estimated by MLE, α and β estimated by HCA,
α estimated by MLE and β estimated by Q-Q plot.
TABLE II. PARAMETERS OF THE CDF OF THE GAMMA FUNCTION
ESTIMATED BY MLE, HCA, AND Q-Q PLOT.
Gamma distribution
#
Site
#
o
:
ppqr
<
[s]
s
β
[s-1]
Parameters estimated
by MLE
38 1.44 × 10
5
0.18 1.25 × 10
-
6
39 1.29 × 10
5
0.21 1.63 × 10
-
6
95 2.10 × 10
5
0.20 9.52 × 10
-
7
Parameters estimated
by HCA
38 1.58 × 10
5
0.18 1.14 × 10
-
6
39 2.02 × 10
5
0.23 1.63 × 10
-
6
95 2.32 × 10
5
0.20 8.62 × 10
-
7
Parameters estimated
by Q-Q plot
38 1.52 × 10
5
0.18 1.18 × 10
-
6
39 5.45 × 10
5
0.21 1.45 × 10
-
6
95 2.36 × 10
5
0.20 8.47 × 10
-
7
From Table II, it is evident that the shape parameter
changes from MLE to HCA only for the site #39, instead the
scale parameter appreciably changes for all the sites using the
Q-Q plot. These changes of the parameters provided different
values of the expected value of the Gamma distribution
computed by the relation (5). Fig. 5 presents, for the site #38,
the histogram of the pdf of the measured ttne (blue bars) and
the pdfs with parameters estimated by MLE (red curve), by
HCA (green curve), and by Q-Q plot (blue curve).
From the curves in Fig.5, all the estimation methods
provided very similar plots, analogous results have been
obtained for the other sites of the system (#39 and #95)
considered as example. It is important to evidence that the
resolution of Fig.5 does not allow appreciating the slight
differences among the curves. However, as Table II
evidenced, the curves give different values of
β
, which causes
diverse value of E[ttne] in accordance with the relation (5).
Therefore, it was mandatory to perform a numerical
quantification of the goodness of the assumed fit of the data to
the Gamma distribution. To this aim, we used three statistical
tools: the R2-test, the Root Mean Squared Error (RMSE test),
and Chi-Squared test (χ2 test).
The R2-test is based on the computation of R2 by the
relation:
t
^
S
C
T
6
C
uv
^
E
T
G
H
S
C
T
6
C
uv
^
E
T
G
H
Z
S
C
T
6
C
T
^
E
T
G
H
(20)
where Fmean = H
E SCT
E
TGH , CTare the values of the cdf of the
Gamma distribution andCT are the values of the ecdf.
The RMSE test provides a term-by-term comparison of the
actual deviation between observed probabilities and
theoretical probabilities; RMSE is computed by the relation:
twx9
y
z
R
{
C
T
E
T
G
H
6
C
T
|
^
(21)
Fig.5
Histogram of the probability density function (blue bars) and
gamma pdfs with parameters estimated by MLE (red curve), by HCA
(green curve), and by Q-Q plot (black curve)– Site #38.
Finally, the χ2 test is based on the computation of χ2 already
described by relation (15). Table III presents the results
obtained applying these three tools on the Gamma distribution
of ttne of the sags measured at sites #38, #39, and #95 having
the parameters estimated by MLE, HCA, and Q-Q plot.
Table III evidenced that the χ2 test for the MLE method has
the highest value for all the three sites considered, instead the
R2 test gives values of errors lowest for the site #38, for any
method, and for the site #39, for the method HCA. RMSE test,
finally, gives the lowest value for the site #95 for any method.
Summarising, the fit of the data to the Gamma distribution
presented an adequate goodness for all the methods used to
estimate the
α
and
β
parameters.
IV. FORECAST OF THE SAGS
The final goal was to forecast the mean number of events
per year at each site k, Nf,k. We obtained this value by
correlating the average sag rate at the site k, 1/E[ttnek], with
the duration of one year, P, by the relation:
R
}
F
~
9
:

F
<
(22)
where the average sag rate is expressed in number of sags per
seconds and P is expressed in seconds (31.5 × 106 s).
E[ttnek] in (22) is the expected value of the Gamma
function given by (5). For every method used to estimate the
parameters of the Gamma distribution, i.e. MLE, HCA, and Q-
Q plot, we have a different value of E[ttnek] and, then a
different value of forecasted Nf,k.
Thanks to the availability of the voltage sags really
measured in 2018, we were able to quantify the error of the
forecast in each of the considered sites. Let Nm,k be the number
of voltage sags measured at the site k in the year 2018 and Nf,k
be the number of sags forecasted at the same site k the relation
(22) using the sags measured in the preceding three years
(2015, 2016, 2017). The error of the forecast at each site k, εf,k,
was computed as the percentage value of the difference
between Nf,k and Nm,k, in per unit of Nm,k. For the considered
sites #38, #39, #95, Table IV shows the results of the forecast
and the computed values of the error εf,k for the three presented
methods (MLE, HCA, Q-Q plot).
The forecast error of all the sags comprehensive of the
clusters ranges from 0% to a maximum value lightly greater
than 10%. This performance of the forecast is really close to
the one obtained in [13-15] where only the rare sags were
forecasted. The accuracies of the references in the literature
are those reported in [12] where, with the highest frequency of
sags (1 per day), the error of 2% would require registering sags
for 25 years. The errors in Table IV are of the same order of
magnitude, but they were obtained for only three years.
TABLE III. STATISTICAL ERRORS OF THE CDF OF THE GAMMA
FUNCTION ESTIMATED BY MLE, HCA, AND Q-Q PLOT
Gamma distribution Site #
test
ƒ„…ƒ
†‡ˆ‰
ƒ„…ƒ
Parameters estimated
by MLE
38 3.22 0.88 2.19
39 3.59 0.92 1.87
95 3.28 0.93 1.53
Parameters estimated
by HCA
38 2.15 0.87 2.25
39 2.03 0.88 2.30
95 2.32 0.92 1.60
Parameters estimated
by Q-Q plot
38 2.33 0.87 2.29
39 2.68 0.91 1.98
95 2.28 0.92 1.62
TABLE IV. FORECASTED SAGS, MEASURED SAGS, AND FORECAST
ERROR USING THE GAMMA FUNCTION WITH PARAMETERS ESTIMATED BY
MLE, HCA, AND Q-Q PLOT.
Gamma distribution Site #
Š
Œ
Nm,k εf,k
Parameters estimated
by MLE
38 222 211 5.2%
39 248 248 0.0%
95 147 135 8.8%
Parameters estimated
by HCA
38 202 211 4.2%
39 225 248 9.2%
95 134 135 0.7%
Parameters estimated
by Q-Q plot
38 209 211 0.9%
39 220 248 11.3%
95 132 135 2.2%
We cannot ascertain the method, among MLE, HCA and Q-Q
plot, which gives the best performance of forecast in all the
sites. For the site #38, Q-Q plot gives the minimum forecast
error value, instead the minimum forecast error is obtained by
MLE and by HCA for the site #39 and #95, respectively. The
different performances of the methods at the various busses of
example could be linked to a different share of the sags
between rare sags and clusters. This analysis was not
performed in this first stage of the research. However, for the
considered sites of example, the different methods used to
estimate the parameters of the Gamma distribution are
effective to provide the range of the mean number of events
per year with acceptable errors. Further studies are in progress
to extend the forecast to all the regional system in study.
V. CONCLUSIONS
The paper faces the problem of forecasting the voltage sags
at the sites of a real system in terms of number of occurrences
expected in those sites in the successive year. The forecast is
based on the statistics of measured sags in three preceding
years, adopting a data-driven approach rather than model
based one. Since, the sags measured in real systems are
composed not only by rare sags, but also of groups of sags,
the paper proposed the Gamma distribution to properly model
them in view of the forecast. The use of Gamma distribution
requires to estimate two parameters, i.e. the shape parameter
and the scale parameter. The paper shows three alternative
methods to estimate their values, quantifies the estimation
errors and the forecast errors in three sites of example of a
real regional system. For these cases, it is not possible to
establish which method gives the best performance in terms
of forecast. However, even in the worst case, the forecast
presented an error lightly greater than 10%. This value is in
line with the error obtainable when only rare sags were
estimated. It is also considered acceptable, accounting that
only three years of measuring on field were used. Further
studies are ongoing to extend the approach to all the sites of
the regional system, also to verify if a method, among those
proposed in this paper, could be chosen as the one of the best
forecast performances.
REFERENCES
[1] M. H. Bollen, I. Gu, Signal Processing of Power Quality Disturbances.
Hoboken, NJ, USA: Wiley, 2006, pp. 766–771.
[2] G. Carpinelli, P. Caramia, C. Di Perna, P. Varilone, P. Verde,
“Complete matrix formulation of fault-position method for voltage-dip
characterisation,” IET G.T.D. 2007, 1, 56, doi:10.1049/iet-
gtd:20050544.
[3] A. Dos Santos, M.C. De Barros, “Voltage sag prediction for network
planning,” Electr. Power Syst. Res. 2016, 140, 976–983,
doi:10.1016/j.epsr.2016.03.033.
[4] G. Olguin, M. Aedo, M. Arias, A. Ortiz, “A Monte Carlo Simulation
Approach to the Method of Fault Positions for Stochastic Assessment
of Voltage Dips (Sags),” IEEE/PES T & D Conf. & Exp: Asia and
Pacific, Dalian, China, 18 August 2005, pp. 1–6.
[5] P. Caramia, P. Varilone, P. Verde, L. Vitale, “Tools for Assessing the
Robustness of Electrical System against Voltage Dips in terms of
Amplitude, Duration and Frequency,” Renew. Energy Power Qual. J.
2014, 1, 177–182, doi:10.24084/repqj12.275.
[6] Y. Lim, G. Strbac, “Analytical approach to probabilistic prediction of
voltage sags on transmission networks,” IEEE Proc. G.T.D. 2002, 149,
7–14, doi:10.1049/ip-gtd:20020008.
[7] E. Espinosa Juarez, A. Hernandez, “An Analytical Approach for
Stochastic Assessment of Balanced and Unbalanced Voltage Sags in
Large Systems,” IEEE Trans.on PD. 2006, 21, 1493–1500,
doi:10.1109/tpwrd.2005.860266.
[8] E. Di Mambro, P. Caramia, P. Varilone and P. Verde, “Voltage sag
estimation of real transmission systems for faults along the lines,” 18th
ICHQP, Ljubljana, 2018, pp. 1-6.
[9] P. Caramia, E. Di Mambro, P. Varilone, P. Verde, “Impact of
Distributed Generation on the Voltage Sag Performance of
Transmission Systems,” Energies. 10 (959), 2017, pp.1-19.
[10] Y. S. Lim and G. Strbac, “Analytical approach to probabilistic
prediction of voltage sags on transmission networks,” IEE Proc. G,T
and D. 149(1), 2002, pp. 7-14.
[11] A. dos Santos and M. T. Correia de Barros, “Predicting Equipment
Outages Due to Voltage Sags,” IEEE Trans. on PD. 31(4), 2016, pp.
1683-1691.
[12] M.R. Qader, M. H. J. Bollen, and R. N. Allan, “Stochastic prediction
of voltage sags in a large transmission system,” IEEE Trans. Ind. Appl.,
vol. 35, no. 1, pp. 152–162, Jan./Feb. 1999.
[13] C. Noce, L. Di Stasio, P. Varilone, P. Verde and M. De Santis, “On the
Forecast of the Voltage Sags: First Stages of Analysis on Real
Systems,” in 55th UPEC, Torino, Italy, 2020, pp. 1-6.
[14] M. De Santis, L. Di Stasio, C. Noce, P. Verde, P. Varilone, “Initial
Results of an Extensive, Long-Term Study of the Forecasting of
Voltage Sags,” Energies, 2021; 14, (5):1264.
https://doi.org/10.3390/en14051264.
[15] M. De Santis, L. Di Stasio, C. Noce, P. Verde and P. Varilone, "Indices
of Intermittence to Improve the Forecasting of the Voltage Sags
Measured in Real Systems," in IEEE Trans on PD, 2021, doi:
10.1109/TPWRD.2021.3082280.
[16] A. d. Santos, T. Rosa and M. T. Correia de Barros, “Stochastic
Characterization of Voltage Sag Occurrence Based on Field Data,”
IEEE Trans on PD. 34 (2), 2019, pp. 496-504.
[17] Council of European Energy Regulators (CEER), 6th CEER
Benchmarking Report on the Quality of Electricity and Gas Supply,
Brussels, August 2016, www.ceer.eu.
[18] ARERA (formerly AEEGSI), Resolution ARG/elt 198/11. Italy, 2011,
www.arera.it/it/docs/11/198-11arg.htm.
[19] ARERA, Resolution 646/2015/R/eel. Italy, 2015.
[20] M. De Santis, C. Noce, P. Varilone, P. Verde, “Analysis of the Origin
of Measured Voltage Sags in InterconnectedNetworks”, Electric Power
Systems Research, vol. 154, 2018, pp. 391–400.
[21] C. Noce, M. De Santis, L. Di Stasio, P. Varilone, P. Verde, “Detecting
the Origin of the Voltage Sags Measured in the Smart Grids”, 7°
ICCEP, Otranto (LE), Italy, 2-4 July 2019.
[22] IEEE Guide for Voltage Sag Indices. In IEEE Std 1564–2014; IEEE:
Piscataway, NJ, USA,2014; pp. 1–59
[23] Joint Working Group Cigre/Cired C4.110. Voltage Dip Immunity of
Equipment and Installations; Report 412; Cigré, Paris, France, 2010.
[24] S. Kadry, Stochastic Methods for Estimation and Problem Solving in
Engineering. IGI Global, March 2018. ISBN: 978-152255046-4; pp.
1-275.
[25] R. J. Rossi, Mathematical Statistics: An Introduction to Likelihood
Based Inference. Wiley, October 2018, ISBN: 978-1-118-77104-4, pp.
1-448.
[26] P. Olofsson, M. Andersson, Probability, Statistics, and Stochastic
Processes. 2nd Edition, John Wiley & Sons, Inc, May 2012, ISBN:
9780470889749, pp. 1-558.
[27] Z. Klaić, D. Šljivac and Z. Baus "Probability Density Functions of
Voltage Sags Measured Indices," Journal of Electrical Engineering,
vol.62, no.6, 2011, pp.335-341.https://doi.org/10.2478/v10187-011-
0053-8
... This feature was first evidenced in [16]. This constitutes a crucial aspect that influences the statistical modeling of measured voltage sags and their practical application in forecasting sags at every site within an entire electrical system, as demonstrated in [17][18][19][20][21]. Rare voltage sags can be modeled effectively as a Poisson process with a constant and positive rate of occurrence owing to their features of timeinvariance, memory-independence, and independence from each other. ...
... For example, a strong correlation of sag clusters with significant lightning activity in some areas of the system may suggest an improvement in the performance of the lines related to lightning in those areas. In all the papers [16][17][18][19][20][21], the statistics of the measured sag were modeled by the random variable tim to th n xt v nt of th sit , ttn , rather than the numb r of sags, N. The random variable ttn , which measures the time between successive voltage sags at each site denoted by , provided a significantly larger database compared to N. This allowed forecasting rare sags with errors lower than 10% using the sags measured in only 3 years with any frequency. Given that clusters continue to play a crucial role in representing the severity of voltage sags at various nodes within the power system, in successive papers [20,21], the forecast of all the sags (named total sags), that is rare sags and clusters of sags, was first approached. ...
... In all the papers [16][17][18][19][20][21], the statistics of the measured sag were modeled by the random variable tim to th n xt v nt of th sit , ttn , rather than the numb r of sags, N. The random variable ttn , which measures the time between successive voltage sags at each site denoted by , provided a significantly larger database compared to N. This allowed forecasting rare sags with errors lower than 10% using the sags measured in only 3 years with any frequency. Given that clusters continue to play a crucial role in representing the severity of voltage sags at various nodes within the power system, in successive papers [20,21], the forecast of all the sags (named total sags), that is rare sags and clusters of sags, was first approached. Considering the set of all the measured sags as total sags, they were modeled as a stochastic process in which the rate of sag changes over time. ...
Article
Full-text available
The field measurement campaigns have revealed that voltage sags also occur as clusters and not only as rare phenomena. The clusters of sags represent a stochastic process due to their time dependence; the rare satisfy the requirements for a Poisson distribution process. To forecast both kinds of sags using the statistics of the measurements, different approaches are required. In this study, a general method for predicting both types of sags is proposed with a procedure that can be implemented automatically. The method uses intermittent indices to distinguish between the sites that have a prevalent number of rare sags and the sites where rare sags and clusters occurred. Based on this means of identification, the technique offers two distinct models for predicting each kind of sag. The final goal is to implement the procedure in a measurement system that can automatically pre-analyze the recorded sags and choose the best technique for prediction depending on the type of sag. The first results were satisfying with forecast errors reduced in comparison with those obtained without the proposed procedure.
... i.e. wind, storms, fires [12], [16]. Rare voltage sags meet the conditions of a Poisson process. ...
... 16 shows the difference between the residual voltages, ΔV, measured at the two busbars, RE and GR, of the substation connected to the busbar #7 versus the value of the residual voltage at the GR busbar where the DG was connected. The obtained points were presented also with the interpolating curve. ...
... 16 Example plot of TNSkΔT versus ΔT computed at an actual site k with measured voltage sags with two possible values of ΔTLimit k 16 indicates that TNS k ΔT had a significant increase for low values of ΔT, but from ΔT values around 10000 s (about 3 hours), the index had a slight increase, but it was not strictly constant. This trend emerged in most of the actual cases. ...
Thesis
Full-text available
The availability of data measured in the field is one of the greatest advances in the last decades for electric power systems planning, management and control. The technological innovations of systems and devices for the measurement of a wide range of quantities allowed their installation at almost all levels of voltages, from distribution to transmission. Increasing number of proposals of new techniques are coming from researchers and system operators for facing the various problems of system operation relying on the availability of measured data in real systems. The data from the field enable data-driven approaches, which can be integrated into traditional model-based methods constructing the so-called digital twin of a system. It is a system digital model whose parameters and linkages are continuously changed and tuned in accordance with the measurements from the actual system in service. New and attractive possibilities for addressing the problems of planning, managing, and controlling the power systems are opening for researchers and system operators. The studies presented in this thesis used the voltage sags measured in the medium voltage (MV) regional systems of E-Distribuzione for four years. Two main problems were faced by data-driven approaches: the ascertaining the origin of the measured voltage sags, and the forecasting the voltage sags which will occur in the site of the system. The correct ascertaining of the origin of voltage sags is crucial in view of future economic regulation by the national energy Authorities. It gives to the system operator the correct indication if the measured sags were or not due to faults in his own network interconnected with other systems. The possibility of forecasting the future performance in terms of voltage sags per year was a challenge never dealt with in the literature for the absence for field data. The literature, till now, proposed methods, models, and tools to estimate the average performance of a system, derived from model-based approaches. Regarding the origin of the voltage sags measured at the MV busbars of the High Voltage/Medium Voltage (HV/MV) stations, this thesis analyses and compares two methods which use only the residual voltage, the time the voltage sags occurred, and their duration. The analysis was conducted also studying the effect of the presence of DG (Distributed Generation). The data from field, moreover, revealed that in some specific cases the sags caused by faults in the MV systems propagated to HV networks. This problem was also studied simulating a portion of a real system, which presents the interconnections between HV and MV network. Regarding the forecast of voltage sags, the thesis proposed two main methods which use at least three years of measurement. The common choice of these two methods is the selection of the random variable, different from the statistical variable used by all the methods in literature. The random variable used in this study is the time to next event, that is the time intercurred between each couple of sags, instead of the variable number of voltage sags. This choice allows a huge increase of the data sets with the positive consequence of reasonable measurements time to obtain a forecast with acceptable accuracy. The first method, based on Poisson model, is suitable for rare sags, that are the sags occurred with a time to next event of the dimension of hours. The second method, based on Gamma model, is suitable for all the voltage sags, comprehensive of sags occurred close each other as groups. The latter, named clusters, are typically due to exogenous causes from the power systems, like adverse climatic conditions or fires. Intermittent indices are also proposed introduced for an initial screening of the measured sags to focus if, and how many, clusters are present in the data base of the measured sags. Such analysis drives the successive steps of the statistical analyses for discriminating the adequacy of Poisson and Gamma models.
... Model-based simulations, both numerical [14], [15], and analytical methods [16], [17] give the expected performance at the nodes of the systems in terms of the average annual number of voltage sags, which is not a forecast in strict sense. Some recent papers [18]- [21] addressed the forecast of the expected number of voltage sags. The key factor of such approaches was not only the availability of large amounts of measured sags, but also a different choice of the random variable to describe the statistics of the sags. ...
... This choice was first proposed in the paper [22], even if not aimed to forecast the sags in a power system. This paper follows the preceding papers [20], [21] and addresses the problem of forecasting the sags considering not only the rare sags but also the clusters of sags, which the data from the field confirmed to be frequent. A cluster of sags is a group of voltage sags that occur close to each other in time with a time between the events from some seconds to some hours. ...
... In such a case, the Gamma distribution can properly describe the stochastic nature of the variable ttne. In [21], the forecast of the total sags was referred to some few real sites; in this paper the prediction was extended to an entire regional system. Using three years of measurements, we show the forecast of expected number of the total sags in the 4th year. ...
... The electrical power system and associated data linked to component reliability help to assess voltage sag. Component reliability data have the same uncertainties as the results of PQ monitoring since they can only be acquired by carefully examining how a system component behaves over an extended period [21][22][23]. The various uncertainties are wind speed on conductors and attachments, faults caused by cattle, animals, mice, rats, birds, bats, snakes, and unpredictable weather, aging of equipment, interruption by cattle, snakes, and so on. ...
... Consequently, the component failure rate has greatly decreased. In general, stochastic evaluation gives a more accurate estimate of how different uncertainties related to component models and reliability affect the system [21][22][23][24][25]. The fault locations and critical distances methods are often used to predict stochastic voltage sag [26][27][28]. ...
Article
Full-text available
A voltage sag is a major power quality problem at any load location, mainly due to short circuit faults. Its effects are especially clear in the industrial sector because they lead to direct financial losses. The first and most important step to getting rid of or at least reducing voltage sag is to find the places in the network where it can happen. So, this article discusses a creative way to find the weak spots in a network where the voltage is more likely to drop. This innovative strategy is based on correlation and a stochastic normal probability distribution. The voltage sag is evaluated using an analytical method. The method simulates the number, type, and location of faults randomly throughout the system. After gathering voltage dip data, it is analyzed using normal probability distribution and correlation concepts. The correlation concept provides the relationship between the sag frequency and other parameters, and the sag occurrence area was indicated by the normal probability distribution. After that, the region of vulnerability (ROV) is developed using the ROV flow chart. This paper uses an IEEE 30-bus RTS system as a case study to demonstrate the usefulness of the suggested strategy using MATLAB software.
... In [59], some ideas for voltage sags forecast using the parameter time to the next event as the probabilistic variable are given. Further elaboration is provided in [60]. In this paper, a gamma distribution is proposed to properly model voltage sags based on the forecast. ...
Article
Full-text available
In this paper, a novel method for real-time prediction of voltage sag duration is proposed. It is based on the recently introduced new characteristic of voltage sag, named harmonic footprint, and is formulated using a logistic regression model. The concept is mathematically formulated and statistically analyzed using an extensive set of real grid measurement data which are recorded in distribution grids. Furthermore, the proposed method is applied as a part of an advanced grid-tie converter control. It is included in previously developed methods for fast sag detection and magnitude of voltage sag prediction. The algorithm is applied to the control of grid-tie converters used in distributed generators and tested with real/grid measurement data in the IEEE 13-bus test grid by simulations and in the IEEE 33-bus test grid using a hardware-in-the-loop (HIL) microgrid laboratory testbed. It is shown that this method can prevent unnecessary tripping of distributed generators (DG) and improve low-voltage ride-through (LVRT) support. In addition, the model has the potential to be applied to a wide range of devices or algorithms for the protection, monitoring, and control systems of distribution grids.
Article
Full-text available
This paper presents the preliminary results of our research activity aimed at forecasting the number of voltage sags in distribution networks. The final goal of the research is to develop proper algorithms that the network operators could use to forecast how many voltage sags will occur at a given site. The availability of four years of measurements at Italian Medium Voltage (MV) networks allowed the statistical analyses of the sample voltage sags without performing model-based simulations of the electric systems in short-circuit conditions. The challenge we faced was to overcome the barrier of the extremely long measurement times that are considered mandatory to obtain a forecast with adequate confidence. The method we have presented uses the random variable time to next event to characterize the statistics of the voltage sags instead of the variable number of sags, which usually is expressed on an annual basis. The choice of this variable allows the use of a large data set, even if only a few years of measurements are available. The statistical characterization of the measured voltage sags by the variable time to next event requires preliminary data-conditioning steps, since the voltage sags that are measured can be divided in two main categories, i.e., rare voltage sags and clusters of voltage sags. Only the rare voltage sags meet the conditions of a Poisson process, and they can be used to forecast the performance that can be expected in the future. However, the clusters do not have the characteristics of memoryless events because they are sequential, time-dependent phenomena the occurrences of which are due to exogenic factors, such as rain, lightning strikes, wind, and other adverse weather conditions. In this paper, we show that filtering the clusters out from all the measured sags is crucial for making successful forecast. In addition, we show that a filter, equal for all of the nodes of the system, represents the origin of the most important critical aspects in the successive steps of the forecasting method. In the paper, we also provide a means of tracking the main problems that are encountered. The initial results encouraged the future development of new efficient techniques of filtering on a site-by-site basis to eliminate the clusters.
Conference Paper
Full-text available
This paper refers to the voltage sags, which are among the most critical Power Quality disturbances. Thanks to the availability of measured voltage sags for four years in the same regional electric system, the work is our first attempt of using the registered voltage sags in three years for forecasting the expected voltage sags in the fourth year. The approach of the paper follows that proposed very recently by the professor M.T. Correia de Barros. Several intermediate results are similar to those obtained by her researches. The final results in terms of forecasts reveal that some improvements are needed for obtaining predictions with increased precision. The paper also presents the first improved results.
Book
Full-text available
Utilizing mathematical algorithms is an important aspect of recreating real-world problems in order to make important decisions. By generating a randomized algorithm that produces statistical patterns, it becomes easier to find solutions to countless situations. Stochastic Methods for Estimation and Problem Solving in Engineering provides emerging research on the role of random probability systems in mathematical models used in various fields of research. While highlighting topics, such as random probability distribution, linear systems, and transport profiling, this book explores the use and behavior of uncertain probability methods in business and science. This book is an important resource for engineers, researchers, students, professionals, and practitioners seeking current research on the challenges and opportunities of non-deterministic probability models.
Article
This paper provides a means of forecasting the future average performance a regional electric power system in terms of sags per year based on data accumulated over a four-year period. The paper also presents statistical analyses of the measured data. The sags measured in real systems consist of both rare voltage sags and grouped voltage sags (clusters); clusters are stochastic events that must be detected and removed to allow the estimation of the future performance based on a Poisson process. To detect and remove the clusters, new site indices of intermittence are proposed. These indices were measured in every site at which clusters were present, and the measurements included the frequency, time of grouping, and the large number of sags that constituted them. The performance of the forecast was evaluated by comparing the results obtained using the proposed indices with the results derived using a time filter equal for every site. The results showed that the forecast of the rare voltage using a few years' worth of field data had acceptable errors and was not prohibitive. The main aspect, which must be investigated further, is related to the choice of the best threshold on the intermittence site indices.
Article
Computational methods for predicting voltage sags consider their occurrence as a Poisson process, although without confirmation from monitoring data, until now. In this paper, the stochastic nature of voltage sags is analyzed and discussed using monitoring data from 60 sites of the Portuguese Transmission Network, covering the years 2011- 2015 (a total of 17,157 recorded voltage sags). A mathematical model for describing the voltage sag occurrence as a stochastic process is presented. The assumption of constant failure rates of the network elements implies that the occurrence of voltage sags is a Poisson process. However, that assumption is not valid if voltage sag clusters are included, as these imply considering time dependent failure rates. Then, the time between voltage sags is described by an exponential distribution, if clusters are not included, and may be described by the gamma distribution, if including clusters. The boundaries of the adequacy of exponential and gamma distributions are assessed, based on monitoring data. The time of occurrence of monitored voltage sags are analyzed and results confirm that the Poisson process describes the occurrence of voltage sags when voltage sag clusters are disregarded. The gamma distribution fitting is also confirmed when clusters are included in the analysis.
Article
This paper deals with the assessment of the origin of voltage sags due to faults in real, interconnected networks. The first step in regulating such faults is to determine the origins of measured voltage sags. Thus, the challenge of this study was to use only the main measurements associated with voltage sags, i.e., the residual voltage, the time the voltage sags occurred, and their duration to detect their origins. No correlations with data from the existing protection systems were used because they were not available. Different operators who do not exchange any data with each other manage the interconnected networks that we considered, and this increased the complexity of the study. In this paper, we analysed the main aspects of the propagation of voltage sags in interconnected networks, and we focused our attention on High Voltage (HV) and Medium Voltage (MV) systems interconnected by HV/MV stations. Two methods were used to determine where the voltage sags originated in the network, and the methods used only the measurements associated with the voltage sags. Both methods were used to evaluate the huge number of measured voltage sags that occurred in an MV distribution system over a period of one year. These two methods: are presented after a brief review of the theoretical methods for simulating the propagation of voltage sags among the nodes of an electrical network. This review is valuable for defining the rules associated with each method. The paper contributes to the literature by providing analyses of a huge number of voltage sags measured in interconnected systems. The main outcomes show that it is possible to detect the origin of the sags using the main measurements of the voltage during the fault only on the MV side of the transformers that are installed in the HV/MV stations. The statistical analysis of these measurements allowed us to classify the origins of the sags better than imposing any conditions on the value of the residual voltages of the sags. Conversely, the origin of the sags in the HV network can be identified better if limits are assigned to the residual voltages.