ArticlePDF Available

Size agnostic change point detection framework for evolving networks

PLOS
PLOS One
Authors:

Abstract and Figures

Changes in the structure of observed social and complex networks can indicate a significant underlying change in an organization, or reflect the response of the network to an external event. Automatic detection of change points in evolving networks is rudimentary to the research and the understanding of the effect of such events on networks. Here we present an easy-to-implement and fast framework for change point detection in evolving temporal networks. Our method is size agnostic, and does not require either prior knowledge about the network’s size and structure, nor does it require obtaining historical information or nodal identities over time. We tested it over both synthetic data derived from dynamic models and two real datasets: Enron email exchange and AskUbuntu forum. Our framework succeeds with both precision and recall and outperforms previous solutions.
This content is subject to copyright.
RESEARCH ARTICLE
Size agnostic change point detection
framework for evolving networks
Hadar MillerID
, Osnat MokrynID
*
Information Systems, University of Haifa, Haifa, Israel
These authors contributed equally to this work.
*ossimo@gmail.com
Abstract
Changes in the structure of observed social and complex networks can indicate a significant
underlying change in an organization, or reflect the response of the network to an external
event. Automatic detection of change points in evolving networks is rudimentary to the
research and the understanding of the effect of such events on networks. Here we present
an easy-to-implement and fast framework for change point detection in evolving temporal
networks. Our method is size agnostic, and does not require either prior knowledge about
the network’s size and structure, nor does it require obtaining historical information or nodal
identities over time. We tested it over both synthetic data derived from dynamic models and
two real datasets: Enron email exchange and AskUbuntu forum. Our framework succeeds
with both precision and recall and outperforms previous solutions.
Introduction
Complex systems of interacting elements, from human (social and organizational) to physical
and biological ones, can be modeled as interaction networks, with nodes representing the ele-
ments and edges representing their interactions. When the interactions are dynamic, i.e.,
human and social interactions, a complete model that captures the longitudinal evolution of
the system is comprised of a sequence of networks, each portraying a snapshot of the system at
a single point in time [1,2].
Of specific interest recently is the analysis of changes in dynamic social and complex net-
works in response to events, and the automatic detection of these points of change termed
Change Point Detection (CPD). Recent works identified changes in the community partition-
ing of the Enron email exchange immediately after the Californian blackouts [3], and a tur-
tling-up of conversation networks between traders in response to significant stock price
changes [4]. Understanding the network’s reaction to unusual events provides improved abili-
ties to analyze, understand and possibly take actions in a given system, infer its response to
external shocks, and aid in predicting organizational and behavioral changes.
Past research for identifying change points used stochastic models, of either scalar values
representing the longitudinal data [5], or probabilistic and model-based representations of the
PLOS ONE
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 1 / 23
a1111111111
a1111111111
a1111111111
a1111111111
a1111111111
OPEN ACCESS
Citation: Miller H, Mokryn O (2020) Size agnostic
change point detection framework for evolving
networks. PLoS ONE 15(4): e0231035. https://doi.
org/10.1371/journal.pone.0231035
Editor: Baruch Barzel, Bar-Ilan University, ISRAEL
Received: September 15, 2019
Accepted: March 15, 2020
Published: April 10, 2020
Copyright: ©2020 Miller, Mokryn. This is an open
access article distributed under the terms of the
Creative Commons Attribution License, which
permits unrestricted use, distribution, and
reproduction in any medium, provided the original
author and source are credited.
Data Availability Statement: Enron dataset: Klimt
B, Yang Y. Introducing the Enron Corpus. Machine
Learning. 2004;. AskUbuntu dataset: http://snap.
stanford.edu/data/sx-askubuntu.html.
Funding: OM received funding from the Israeli
Science Foundation Grant 328/17. The funder had
no role in study design, data collection and
analysis, decision to publish, or preparation of the
manuscript.
Competing interests: The authors have declared
that no competing interests exist.
network [3,68]. However, the works mentioned above did not examine the complex net-
work’s structure as manifested through distributions.
The structural properties that are in the focus of our work here are the network’s native
statistical distribution, i.e., its degree distribution measure. Distribution functions are a
measure of the division of resources within the network, and their relative positions, and
are considered a fundamental tool in the understanding of complex systems. Stumpf and
Porter [9] have discussed this notable role, claiming that degree distributions serve as an
aiding tool for understanding, interpreting, and even predicting behaviors in a given
system. Bhamidi et al. [10] further showed that degree-distribution measures reflect
changes in the underlying structure better than the hyper-parameters of the corresponding
network models. An additional valuable advantage of a degree distribution-based event
detection is that it eliminates the need to know in advance the number of nodes in the net-
work at each point in time and can work with as little information as the sequence of inter-
actions for the periods under inspection. Thus, unlike other CPD schemes, the proposed
solution here assumes no prior knowledge of the network, does not require pre-processing,
and can be used in an online manner, where new network snapshots are generated
on-the-fly.
Here, we devise an online fast change point detection mechanism, utilizing the degree
distribution of snapshots of networks in time. The detection mechanism is planned in a
manner that does not require to determine exact theoretical fits to the distributions. We
conduct a hypothesis testing to assess the significance of the change and differentiate a
change signal from local fluctuations. Our study here centers on interaction networks, such
as phone calls, text messages, emails, and online social network postings. These networks,
also termed temporal asynchronous human communication networks [11], can be charac-
terized by the intertwining of the temporal topological structure and the interaction
dynamics.
The contributions of the work are the following:
1. Taking a sliding-window approach for the network interactions, this method can address
both the anomaly detection problem, in which there is a significant variation from a norm,
and the change point detection problem, which considers a significant change to the norm
itself, by computing the significance measure of the change (calculated p-value) over differ-
ent window sizes.
2. The approach is the first that does not require to know in advance the number of interact-
ing nodes in each stage of the network’s life, and hence can be used online.
3. We investigate the performance of the scheme over both synthetic data and real-world data.
For the synthetic data, we conduct a thorough investigation of several generative models,
i.e., random networks and small-world networks, with varying rates of events and over dif-
ferent network sizes. This enables us to quantify the reaction of different network models to
events. We further show that over two real datasets, the scheme performs better than exist-
ing detection schemes, while being faster.
4. The hypothesis testing we conduct enables a sensitivity measure for a change. First, it
enables to set the level of sensitivity of a change according to need. Then, it opens the possi-
bility to detect changes with decreasing sensitivity during a window of time. While current
schemes detect reactions to shocks, this scheme can detect gradual changes that follow a
clear trend of increasing probability of a change and can be utilized as a predictive
framework.
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 2 / 23
Related work
In this work, we investigate the effect of events on social networks. Romero, Uzzy, and Klein-
berg in a recent novel work [4] defined these events as mostly exogenous events that are either
unexpected or are extreme, relative to the average [4,12]. They found a turtling-up of the net-
work as a reaction to an external shock, and measured changes in the clustering coefficient, tie
strength and percentage of border edges. Kondor et al. [13] researched the longitudinal struc-
ture of the network of the most active Bitcoin users for two years and searched for important
changes in the graph structure by comparing successive snapshots of the active core of the
transaction network using principal component analysis (PCA). They found a clear correspon-
dence with the market price of Bitcoin. McCulloh & Carley [5] included in their study of
change points also cases of endogenous changes and showed that their detection system could
determine that a change has occurred from a longitudinal analysis of the network itself. Using
their method, Tambayong [14] examined Sudan’s political networks and found that foreign-
brokered signings of multiple peace agreements served as a political solidification point for
political actors of Sudan during the recent violent domestic conflict. According to their analy-
sis, this was a catalyst that caused three leaders to have emerged and lead the more compart-
mentalized yet faction-cohesive political networks of Sudan. Peel and Clauset [3] were able to
detect external changes during the Enron crisis through a stochastic analysis of the Enron
organizational email exchange [15].
In stochastic models of networks, change points are points in time where a change in the
system’s norm is detected in a manner that can be significantly differentiated from plain sto-
chastic noise [3,1619]. McCulloh & Carley [5] convert the series of networks to a time series
of scalar values for different network measures, looking for a stable change in these values (as
opposed to temporal change, when looking for anomaly detection) using process mining tech-
niques for change points detection [5,20,21].
Methods for CPD differ mainly by the graph features they compute. A model-based
approach fits each snapshot to a generative model. Examples for models used are the General
Hierarchical Random graph (GHRG), Generalized Two Block Erdos-Renyi (GBTER), and
Kronecker Product Graph Model (KPGM) [3,22,23]. Structural entropy is a new measure
suggested by Almog and Shmueli [8] that takes into account the number of communities and
their sizes, and is a revised interpretation of the Structural Diversity metric. It was utilized to
monitor changes in the structure of correlation-based networks over time. A model-based
approach requires a pre-processing phase, for which enough history is pre-known. Some
works further require that labeled nodal information is known. When taking the degree distri-
bution, we eliminate the need for this extended information, as degree distribution does not
require historical information, nor the node names. Moreover, a recent analysis found that the
degree distribution better detects structural changes than the hyper-parameters of the genera-
tive model for the PA case [10].
A complementary approach, similar to ours, is to extract a large number of features from
each consecutive graph snapshots and find the distance between them [6,7,16,24]. A change
is determined if a predefined threshold for the distance is crossed. Anomaly detection analo-
gous techniques suggest eigenvalue-based detection [25] and matrix factorization [26], to
name a few. The use of the Relative Hausdorff (RH) distance metric for anomaly detection in
temporal networks with long-tail distance distribution was recently discussed in [27]. Indeed,
different distance metrics can be used in our framework. We later discuss in more detail the
differences between distance metrics, including RH, and suggest when to use which.
In this work, we focus on temporal asynchronous human communication networks [11].
Previous research by Braha and Bar-Yam found, surprisingly, that “the existence of a link
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 3 / 23
between two individuals at one time does not make it more likely that the link will appear at
another time” [28,29]. They found a weak correlation between consecutive networks of inter-
actions, while the degree distributions of these networks remained similar. Further, Palla et al.
[30] found that while small groups can stay stable in time if their membership is constant, the
opposite is true for larger communities: “The condition for stability for large communities is
continuous changes in their membership, allowing for the possibility that after some time
practically all members are exchanged.” These findings motivate our choice of utilizing the
networks’ degree distributions for change detection.
Unlike previous works that consider graph features, in our work, we conduct a hypothesis
test to provide certainty for a change point detection. When used gradually, it can be further
used to detect an incremental change.
Method
We explain our method, following Fig 1. A sequence of networks is presented, where a change
in the generative model occurs. The change is not tied to a specific structural characteristic.
Our framework computes the cumulative distribution function of the degrees (CDF) for each
graph, computes the distance, and performs a hypothesis testing to infer how probable is a
change given the measured distance between the two CDF’s. Here, we chose the non-paramet-
ric Kolmogorov-Smirnov (KS) two-sample test to measure the distance, as discussed later.
Other non-parametric statistical methods for measuring the distance may be applied. We dis-
cuss alternatives in the Results Section.
Frameworks for change point detection divide the data to consecutive snapshots according
to a natural division derived from the nature of the data, such as daily or weekly snapshots of
organizational frameworks, or monthly graphs of votes. In methods measuring the distance
between features extracted from two consecutive graph snapshots [6,7,16], a change is
Fig 1. Framework for detection of changes. Our Window size defines the stability of the change over time.
Hypothesis testing over a distance measure is used to determine whether the underlying model has changed. On the
left graphs generated from the same model, on the right a graph generated from a changed model.
https://doi.org/10.1371/journal.pone.0231035.g001
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 4 / 23
detected if the measured distance is bigger than an arbitrarily predefined threshold value. A
drawback of distance measures is that they work well mainly for large sample sizes. When the
sample size is small, a large distance can be measured, crossing the predefined threshold value.
This large measure can lead to a false-positive result, i.e., an inference of a change when there
is merely a fluctuation. Fig 2 demonstrate such a false-positive scenario. To avoid these types
of false inferences, we suggest the use of a sliding window over several graph snapshots and
computing the CDF across the entire window, as is the case in Fig 4. A complimentary situa-
tion occurs when windows that are set too large, conceal an event within them, thus hiding the
point of change. This would correspond with a false negative inference and is demonstrated in
Fig 3. A solution to this problem is the use of a sliding window to find the exact point of change
within the window, as is used in [3].
An alternative approach to measuring a distance between windows is to try and fit a theo-
retical statistical distribution to each network snapshot, and determine whether they are
derived from the same model. This is, however, a rather time and computational-intensive
approach. Fitting data to a statistical theoretical model requires both to find a fit and to reject
other possible theoretical statistical distributions [31]. Hence, we compare distances across
windows, as described in Fig 4, and perform a hypothesis testing of a change.
Fig 2. False positive. Distance measure is large as the sample size is too small, although graphs come from the same
generative model.
https://doi.org/10.1371/journal.pone.0231035.g002
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 5 / 23
We conduct a hypothesis testing to learn whether the distance between the degree distribu-
tions affirms that they come from the same model or two different generative models. We
measure the distance between the cumulative degree distributions of consecutive snapshots.
For any two consecutive windows, let us define their graphs representations as g
i
,g
i+1
. The null
hypothesis is that the cumulative distributions measured for any two consecutive snapshots, g
i
,
g
i+1
, are drawn from the same distribution, G
Null
, in which case no change has occurred
between the windows. To test the hypothesis, we generate synthetic datasets from the distribu-
tion of g
i
and find their distributions. The standard approach for generating samples for
hypothesis testing is a Monte-Carlo bootstrap process, which generates samples by randomly
re-sampling (with replacement) the data [32].
We proceed to explain the process in more detail. We use here the non-parametric Kolmo-
gorov-Smirnov (KS) two-sample test. The method is considered robust and is widely used (as
discussed at the end of this section further). Yet, when comparing two distributions using too
few samples, it can fail to reject a false null hypothesis. The probability for that diminishes as
the number of nodes interacting in each snapshot increases.
For two consecutive graph snapshots g
i
,g
i+1
, (i2{1, 2, . . .}) we denote the two generated
corresponding cumulative degree distribution functions by S
i
(x), S
i+1
(x). Given the CDF
degree distribution S
j
(x), j2i,i+ 1 for graph g
j
:S
j
(x) = P
j
(xX) we compute the KS statistic
Fig 3. False negative. Fluctuations conceal each other and decrease the measured distance between two networks.
https://doi.org/10.1371/journal.pone.0231035.g003
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 6 / 23
D, defined as the maximal difference between the two empirical distributions, as described by
Eq 1. The KS null hypothesis is that the two samples were drawn from the same distribution.
DðSi;Siþ1Þ ¼ sup
xjSiðxÞ Siþ1ðxÞj ð1Þ
The KS null hypothesis is rejected if the computed distance D(S
i
,S
i+1
) is greater than some
critical value, usually a predefined threshold.
In our framework, we suggest a hypothesis testing that determines for each distance D(S
i
,
S
i+1
) the confidence with which the null hypothesis can be rejected for it, as follows. As
explained before, a large KS distance D(S
i
,S
i+1
) measured between S
i
(x) and S
i+1
(x) doesn’t
necessarily indicate that the model has changed. We would like to test how rare such distance
(D(S
i
,S
i+1
)) is, given S
i
(x). To do so, we define g
i
as the base model graph and conduct a
hypothesis testing, with a null hypothesis that the distance D(S
i
,S
i+1
) between the base model
graph distribution and the consecutive one, g
i+1
’s distribution, is not rare for samples taken
from the same statistical model. Our null hypothesis then assumes that the distance between
the model graph distribution CDF, S
i
(x), and the consecutive graph’s distribution CDF, S
i+1
(x),
is typical for distances between distributions sampled from the same base model graph distri-
bution. The null hypothesis is rejected with significance pif in α= (1 p) of the times the
Fig 4. Sliding window. The use of a sliding window over several graph snapshots decreases the probability of a false positive estimation of a change.
https://doi.org/10.1371/journal.pone.0231035.g004
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 7 / 23
measured distance between S
i
(x) and the sampled distributions is smaller than D(S
i
,S
i+1
), as
depicted in Eq 2.
Following the Monte-Carlo bootstrap procedure [32] we generate k= 1000 >>1 new sam-
ples from S
i
and compute the distance DðSi;SikÞbetween S
i
(x) and each of its bootstrap sam-
ples, fSikðxÞ;k21::1000g. This results in a group of size n= 1000 of distances from S
i
to its
samples, namely the group of distances fdikg;k2 ½1::n.Fig 5 gives a visual description of this
process. Then, given k2[1n]:
8a<1;9di;a;s:t:jfDðSi;SikÞ<di;aganð2Þ
Namely, there exists a distance d
i,α
that is greater than the distance between S
i
(x) and αof its
bootstrap-generated samples. If D(S
i
,S
i+1
)>d
i,α
then we can reject the null hypothesis with
confidence α.
For example, let us assume that α= 0.95. Then, following the definition in Eq 2,d
i,0.95
is
greater than the distances between S
i
(x) and 95% of its bootstrap-generated samples. If, then,
the distance between S
i
(x) and the subsequent graph’s distribution, S
i+1
(x), D(S
i
,S
i+1
), is
greater than d
i,0.95
, then we can reject the null hypothesis and claim with confidence of 95%
that g
i
,g
i+1
are graphs that are generated by difference models.
Next, we continue to evaluate the performance of our framework over both synthetic and
real-world networks and discuss alternatives to the KS distance metric.
Results
We conduct several experiments to demonstrate the performance of our framework on both
large synthetic datasets and real networks. First, we investigate the performance of the frame-
work on sampled synthetic networks generated by known generative models and then follow a
rigorous evaluation to evaluate the sensitivity of our framework with the suggested distance
metric over synthetic datasets. We then continue to evaluate the framework over two real data-
sets and compare our results against a known baseline solution. We conclude this section by
Fig 5. Monte-Carlo bootstrap procedure: S
i
is sampled n= 1000 times (with repetitions) to create the series Sik;k2 ½1::n.Then, we measure the KS
distance from S
i
to each of the sampled CDFs, yielding the group of distances fdikg;k2 ½1::n.
https://doi.org/10.1371/journal.pone.0231035.g005
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 8 / 23
evaluating the performance of different distance metrics that can be used with our framework
and discuss their differences.
Synthetic datasets
We conduct a series of experiments over synthetic datasets generated by known generative
models. Each such generative model enables us to investigate the framework’s behavior for dif-
ferent structural characteristics. As our method is based on the degree distribution of the net-
work, it is agnostic to any changes in the network size. Hence, we expect our framework to
detect changes across network snapshots that may gain or lose nodes during the network’s life-
time. At first, we considered using a preferential-attachment growing network as one of the
models. However, this model is specifically designed to explain the emergence of hubs in net-
works and the long tail distribution of real-world network degrees, and thus is designed to cre-
ate a specific degree distribution, which is what we try to find, resulting in a trivial test.For
generative models, we, therefore, employ the Erdo¨s-Re
´nyi (ER) random networks model and
the Caveman model. First, we conduct three experiments over these networks, and then we
perform a large scale sensitivity test for both models.
Designed experiments. In each experiment, the network model alternates between two
configurations that differ in their hyper-parameters. The number of changes is set to 100, dis-
tributed randomly. Then, the number of consecutive snapshots of the network drawn from the
model configuration, x, is chosen from a normal distribution x*N(μ= 4, σ
2
= 2), such that
each point is the averaged results of 400 runs.
Random graphs. We start with the Erdo¨s-Re
´nyi (ER) random graph model [33]. The
model for random graphs G(n,p) assumes a fixed number of nodes n(Recently Zhang et al.
[2] suggested a generalization for dynamic random networks, in which the dynamic process is
governed by a continuous Markov-process. As we need to study the change point detection
problem, requiring a change in the generative model hyper-parameters, we could not employ
their model.) Edges connect node pairs independently with probability p. Low values of p
entail that the number of edges is substantially lower than the number of nodes and the model
generates small components in tree forms. As pincreases, and reaches p>o1
n
, the network
changes to suddenly form a giant component, a phase transition that has a distinct influence
on the structure of the network.
We then perform two experiments for this model type, as described here, and detailed in
Figs 6and 7:
Experiment 1: A change in the hyper-parameters of the ER model transitions the network
between the two network states of fragmented (p<< o1
n
) and connected (p>o1
n
). The
network configurations are the following. Each configuration consists of 200 nodes, and the
model’s hyper-parameter is either p= 0.003, i.e., fragmented, or p= 0.01, i.e., connected.
Experiment 2: The ER networks consist of 200 nodes each, and the model’s hyper-parameter
is either p= 0.1, p= 0.15, i.e., both times the network is connected, and there is a slight
change in the connectedness. It is safe to assume that the subtleness of the change in the gen-
erative model of the random network will make it harder to identify.
Caveman model. The ER model generates a graph with small clustering coefficients,
which lack the capability to represent communities. Social networks are often characterized as
having highly connected communities that form rare interactions in between and form a
Small World. For example, in an organization, one may expect intensive interactions between
actors within departments and sparse interactions between actors belonging to different
departments. A generative model for a small-world network is the Caveman [34].
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 9 / 23
To test our framework against networks with varying sizes we generated a sequence of unla-
beled networks, g
i
2G, while using the Caveman model. The number of nodes for each snap-
shot was randomly selected from a uniform distribution ||g
i
|| *U(200, 1000). To prevent a
sample size bias while calculating the KS distance we randomly sampled 200 nodes from each
network and calculated the distance between the two samples degree distribution.
Experiment 3: The Caveman-based networks are drawn from a model containing 200
nodes, as explained above, and C= 5 communities each. The change in the hyper-parameter
Fig 6. Random networks: Experiment 1 (200 nodes). Experiment 1 1
st
configuration (p= .003, fragmented) degree
CDF, visualization of largest component.
https://doi.org/10.1371/journal.pone.0231035.g006
Fig 7. Random networks: Experiment 1 (200 nodes). Experiment 1 2
nd
configuration (p= .009, connected) degree
CDF, visualization of largest component.
https://doi.org/10.1371/journal.pone.0231035.g007
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 10 / 23
between the two configurations is in the rewire probability p. In the 1
st
configuration, visual-
ized in Fig 8 p= 0.4. In the 2
nd
p= 0.7, leading to a more inter-connected network, as is visu-
alized in Fig 9.
CPD performance for the synthetic networks examples. Table 1 describes the perfor-
mance of our detection framework for the three described experiments. Note that for the ER
networks (exp1, exp2), we get a perfect recall. The degree distribution of an ER random graph
Fig 8. Caveman: Experiment 3 (200 nodes, C = 5). Experiment 3 1
st
configuration (p= 0.4) CDF, visualization.
https://doi.org/10.1371/journal.pone.0231035.g008
Fig 9. Caveman: Experiment 3 (200 nodes, C = 5). Experiment 3 2
nd
configuration (p= 0.7) CDF, visualization.
https://doi.org/10.1371/journal.pone.0231035.g009
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 11 / 23
with edge probability p¼l
nfollows a Poisson distribution with probability mass function:
el lk
k!, with mean λand skewness λ
0.5
. A change in λdifferentiates two ER generative models
and will be projected to the networks’ CDF, thus detectable by our model. This may explain
the perfect detection (Recall = 1) of all events in our synthetic data tests. However, the variance
of a Poison distribution is λas well. As the variance λincreases, the chances of mistakenly find
two samples drawn from the same model as not sharing the same distribution increase. This
explains our relatively low precision.
As true positive events (change points) were detected with significance that exceeds 99%
We repeated the experiments while increasing the CPD threshold from 90% to 99%. This test
resulted with Recall = 1.0 and Precision = 0.89. This corresponds to changing the sensitivity of
the framework, as discussed before.
The results for the Caveman model (exp 3) yield excellent results of perfect precision
(100%) and near-perfect recall (96%), showing that a community structure of networks lends
itself naturally to our detection framework.
CPD framework’s performance—Sensitivity test. We continue to understand the sensi-
tivity of the framework, given the KS metric (As noted before, we will examine other distance
metrics’ performance in a follow-up section.) to changes in the models’ hyperparameters. We
examine both models as before, the ER random networks, and Caveman. The ER network’s
hyperparameter, p
ER
, is the edge probability, and thus dictates the network’s density. The
larger p
ER
, the larger the network’s average degree. As for the Caveman model, the correspond-
ing hyperparameter, p
CV
, indicates the amount of rewiring and is thus inversely correlated
with the network’s cluster-coefficient. The temporal networks’ sizes are randomly picked from
a normal distribution: n*N(μ= 100, σ
2
= 10). The Normal distribution with these parame-
ters was chosen as it represents the sizes of the temporal weekly networks of real-world datasets
used in this research.
Fig 10 depicts the results of the f
1
measureof the performance of our framework with the KS
distance metric for all permutations of ER networks that entail change. The networks were
modeled with the edge connectivity probability, p
1,ER
,p
2,ER
2[0.05, 0.1, 0.15, . . .1], p
1,ER
p
2,ER
. Overall, the sensitivity graph depicts 380 experiments, in which each point is the average
of 2x100 random networks. The framework excels in finding the hyper-parameter change.
Hence, it is very sensitive to changes in the network’s density, and fails to find a change only
when the changes are very small, i.e., |p
1,ER
p
2,ER
| = 0.05, as can be seen by the low f
1
value at
the narrow diagonal line.
We repeated the experiment above for the Caveman model, depicted in Fig 11. The net-
works were generated with the rewiring parameters p
1,CV
,p
2,CV
2[0.05, 0.1, 0.15, . . .1], p
1,CV
p
2,CV
. Similarly, each point is the result of 100 experiments. The rewiring parameter in the
model correlates negatively with the network’s clustering coefficient, as it determines the prob-
ability of edge rewiring. As the network is formed with a low number of communities that are
fully intra-connected, higher rewiring probability corresponds to lower values of the clustering
Table 1. CPD framework performance for synthetic networks examples.
Experiment Model & Main Structural Property Precision Recall
Mean, Std Mean, Std
exp1 ER: Phase transition p= {0.003, 0.009} 0.767, 0.03 1.0, 0.0
exp2 ER: Connected p= {0.1, 0.15} 0.671, 0.02 1.0, 0.0
exp3 Caveman: Communities p= {0.4, 0.7} 1.0, 0 0.961, 0.01
https://doi.org/10.1371/journal.pone.0231035.t001
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 12 / 23
coefficient. The results of our framework with the KS distance metric are moderate. It is not
sensitive to small changes (fails to find a change when |p
1,CV
p
2,CV
| = 0.1|), and as the p
CV
value increases, the framework’s ability to detect changes decreases. Hence, we can conclude
that the framework with the KS metric fails to detect changes between networks that exhibit
low clustering coefficient values.
Fig 10. The framework with KS distance—ER networks. Sensitivity test for ER networks with sizes *N(μ= 100, σ
2
= 10).
https://doi.org/10.1371/journal.pone.0231035.g010
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 13 / 23
Real-world datasets
We tested our framework against two real-world datasets. The first, the Enron email exchange
between 151 employees, mostly managers [15]. We generated weekly networks from the email
interactions similar to [3,7].
Fig 11. The framework with KS distance—Caveman networks. Sensitivity test for Caveman networks with sizes *N(μ= 100, σ
2
= 10).
https://doi.org/10.1371/journal.pone.0231035.g011
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 14 / 23
Fig 12 describes our framework’s performance, compared to both the real events, and to the
GHRG-based detection framework by Peel and Clauset [3]. Our framework detected 13 out of
14 change points, resulting in recall and precision of 0.9.
The second dataset is the interactions on the stack-exchange website AskUbuntu [35]. The
periods are of months. We assume that a new Ubuntu release might affect the community, and
extracted the release dates from the Ubuntu site. The number of interacting nodes (partici-
pants) varies greatly in the measured period, from 4095 to 6202, as is the amount of interac-
tions. As described before, our method allows for different participants and interactions
between windows while measuring the distribution. The results are described in Fig 13. Our
framework detects almost all the events with high confidence: Recall = 0.8, and Precision = 0.57.
There might be external events (i.e., are not version releases) that we are not aware of that
could have affected the network as well.
Considerations in distance metric choices
The KS test statistic uses the difference of greatest magnitude between two distributions and is
more sensitive to discrepancies between the distributions around the middle. Additionally, it
Fig 12. Enron emails exchange. The Enron email exchange during the second half of 2001, where many events took
place. Real events denoted by blue rhombuses, True positive detection events by a green star. In grey, the events
detected by the GHRG framework, which is used as a baseline for comparison. Our framework outperforms with
Recall = 0.9 and Precision = 0.9.
https://doi.org/10.1371/journal.pone.0231035.g012
Fig 13. AskUbuntu forum exchange. Release events denoted by blue rhombuses; True positive detection events by a
green star. Our framework performance yields Recall = 0.8 and Precision = 0.57, number of nodes in each window is
denoted on the graph per detected event.
https://doi.org/10.1371/journal.pone.0231035.g013
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 15 / 23
is efficient, fast to compute, and was found useful for change detection in data streams [36]. In
the case of moderate-sized asynchronous human communication networks of the type exam-
ined in our work, the KS distance metric yields very good results. However, Granger [37] fur-
ther showed that a large amount of data might not be advantageous for inferential statistics
such as the KS test. When the temporal degree in the different snapshots follows a long-tail dis-
tribution, a distance metric that takes into account small changes in the tail of the distribution
would be preferred. Examples for such networks are the Kullback-Leibler divergence (KL) [38]
and Relaxed Hausdorff (RH) [27] distance metrics. The RH is a relative new distance metric,
and was found to outperform the KL distance metric. It is defined as follows. Two graphs G,F
are defined to be (,δ)close if:
8d;9d02 ½ð1Þd;ð1þÞd;s:t:jFðdÞ Gðd0Þj dFðdÞ ð3Þ
Where if F,Gare close then RH(F,G) = 1.The metric is defined in a manner that does not
limit the distance, hence the threshold is data-driven. It is tailored to find small discrepancies
between long-tailed streams for anomaly detection and requires that enough samples would
exist in each graph for change detection. In our real datasets, this is not the case. For example,
in the Enron dataset, there are around a hundred nodes in each of the snapshots. As RH does
not work on the CDF of the graph, it requires padding if the number of nodes differs between
the measured graphs. Still, the RH metric outperformed all other metrics for change point
detection in long-tail networks that we checked. We continue here to discuss the performance
of RH and KL for moderate-sized networks.
Figs 14 and 15 depict the change detection performance of the RH and KL distance metrics,
correspondingly, in detecting changes between ER networks of sizes *N(μ= 100, σ
2
= 10).
Recall that the normal distribution with these parameters was chosen as it represents the
majority of the weekly temporal network snapshots of the real-world datasets we use in this
research. When the changes are small, both metrics fail to detect changes. However, when the
density increases, KL can better detect small changes between ER networks. Figs 16 and 17
depict the change detection performance of the RH and KL distance metrics, correspondingly,
in detecting changes between Caveman networks of sizes *N(μ= 100, σ
2
= 10). KL performs
well and can detect rather small changes. RH is tailored for detecting changes between net-
works with a long-tail degree distribution and thus does not perform well for networks of
small size.
Results summary
Table 2 Summarizes the framework’s performance compared to the relevant competition for
the real-wold dataset for which we had ground truth, and for the synthetic networks sensitivity
tests. For the real dataset (Enron) our framework found 13 out of the existing 14 points of
change in the data, and achieved recall and precision of 0.9, compared to the GHRG-based
solution suggested in [3] that did not have false positives but failed to identify some of the
change-points. We further compared our framework performance over the synthetic datasets
to the performance of two alternative distance metrics between the degree distributions of net-
works: KL and RH. For both random networks (ER) and Caveman-based networks of mid-
sizes, our framework outperformed the competition.
Discussion
It is widely accepted that the structural properties of a network play a significant role in deter-
mining its actors’ behavior [3944]. The last decade’s abundance of temporal information
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 16 / 23
paved the path to a further understanding of the dynamics of networks [45], and findings cor-
roborate that structural properties have a prominent effect on the longitudinal dynamics of
networks and their actors [43,44,4648].
Considering that distributions in complex systems have practical importance as an aiding
tool for data interpretation and event prediction [9,10], this work explores the interplay
between points of change and this fundamental structural distribution in social organizations
Fig 14. CPD for random networks with RH distance. Sensitivity test for ER networks with sizes *N(μ= 100, σ
2
= 10).
https://doi.org/10.1371/journal.pone.0231035.g014
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 17 / 23
and systems. Social interactions are dynamic, and in organizations and social venues (as is the
case with the AskUbuntu forum), different subsets of the networks interact at different times.
A distribution-based framework like the one presented in this work enables a variable number
of nodes at each window of time and hence is size agnostic. It also does not require historical
information and can be used for online detection. This is in contrast to model-based frame-
works, which are limited by definition to a stringent subset of traceable interacting players
Fig 15. CPD for random networks with KL distance. Sensitivity test for ER networks with sizes *N(μ= 100, σ
2
= 10).
https://doi.org/10.1371/journal.pone.0231035.g015
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 18 / 23
over time. The framework presented here captures the distribution of interactions in each win-
dow of time, regardless of the participants’ history. It also does not assume past correlations
between the parties involved.
The framework can be employed with different distance metrics. Our results demonstrate
that for moderate-size networks, the KS distance metric yields good performance, better than
KL and RH. It is also widely known to be fast [36]. For networks with long-tail distributions,
Fig 16. CPD for caveman networks with RH distance. Sensitivity test for Caveman networks with sizes *N(μ= 100, σ
2
= 10).
https://doi.org/10.1371/journal.pone.0231035.g016
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 19 / 23
the RH distance metric gives good results, as it is sensitive to accumulated changes in the
tail [27].
Conclusion
Our framework for size-agnostic detection of changes works across different generative mod-
els and real datasets, achieving very high recall and good precision of detection. During the
Fig 17. CPD for caveman networks with KL distance. Sensitivity test for Caveman networks with sizes *N(μ= 100, σ
2
= 10).
https://doi.org/10.1371/journal.pone.0231035.g017
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 20 / 23
work, we have identified an interesting trade-off between precision and recall of detection
when considering the size of the network and detectability. We intend to study this tradeoff in
future research further. An additional interesting line of research is to quantify the nature of
the change in the distribution in response to different events.
Author Contributions
Conceptualization: Hadar Miller, Osnat Mokryn.
Data curation: Hadar Miller.
Formal analysis: Hadar Miller, Osnat Mokryn.
Methodology: Hadar Miller, Osnat Mokryn.
Software: Hadar Miller.
Supervision: Osnat Mokryn.
Writing original draft: Osnat Mokryn.
Writing review & editing: Osnat Mokryn.
References
1. Gauvin L, Panisson A, Cattuto C. Detecting the Community Structure and Activity Patterns of Temporal
Networks: A Non-Negative Tensor Factorization Approach. PLOS ONE. 2014; 9:1–13.
2. Zhang X, Moore C, Newman ME. Random graph models for dynamic networks. The European Physical
Journal B. 2017; 90(10):200.
3. Peel L, Clauset A. Detecting change points in the large-scale structure of evolving networks. 29th AAAI
Conference on Artificial Intelligence (AAAI). 2015; p. 1–11.
4. Romero DM, Uzzi B, Kleinberg J. Social Networks Under Stress. In: Proceedings of the 25th Interna-
tional Conference on World Wide Web. International World Wide Web Conferences Steering Commit-
tee; 2016. p. 9–20.
5. McCulloh I, Carley K. Detecting change in longitudinal social networks. Journal of Social Structure.
2011; 12:1–37.
6. Koutra D, Vogelstein JT, Faloutsos C. Deltacon: A principled massive-graph similarity function. In: Pro-
ceedings of the 2013 SIAM International Conference on Data Mining. SIAM; 2013. p. 162–170.
7. Wang Y, Chakrabarti A, Sivakoff D, Parthasarathy S. Fast Change Point Detection on Dynamic Social
Networks. arXiv preprint arXiv:170507325. 2017.
8. Almog A, Shmueli E. Structural Entropy: Monitoring Correlation-Based Networks Over Time With Appli-
cation To Financial Markets. Scientific reports. 2019; 9(1):1–13.
9. Stumpf MP, Porter MA. Critical truths about power laws. Science. 2012; 335(6069):665–666. https://
doi.org/10.1126/science.1216142 PMID: 22323807
Table 2. CPD KS framework performance summary.
Dataset CPD framework GHRG [3]
Prec. Recall Prec. Recall
Enron 0.9 0.9 1.0 0.36
Solution ER sensitivity Caveman sensitivity
Precision Recall Precision Recall
Mean Std Mean Std Mean Std Mean Std
Framework 0.94 0.08 0.96 0.08 0.78 0.41 0.59 0.45
RH CPD 0.19 0.05 0.75 0.28 0.23 0.02 0.91 0.13
KL CPD 0.65 0.39 0.73 0.41 0.86 0.26 0.79 0.36
https://doi.org/10.1371/journal.pone.0231035.t002
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 21 / 23
10. Bhamidi S, Jin J, Nobel A, et al. Change point detection in network models: Preferential attachment and
long range dependence. The Annals of Applied Probability. 2018; 28(1):35–78.
11. Lehmann S. In: Holme P, Sarama
¨ki J, editors. Fundamental Structures in Temporal Communication
Networks. Cham: Springer International Publishing; 2019. p. 25–48. Available from: https://doi.org/10.
1007/978-3-030-23495-9_2.
12. Gilbert CG. Unbundling the structure of inertia: Resource versus routine rigidity. Academy of Manage-
ment Journal. 2005; 48(5):741–763.
13. Kondor D, Csabai I, Szu¨le J, Po
´sfai M, Vattay G. Inferring the interplay between network structure and
market effects in Bitcoin. New Journal of Physics. 2014; 16(12):125003.
14. Tambayong L. Change detection in dynamic political networks: the case of Sudan. In: Theories and
Simulations of Complex Social Systems. Springer; 2014. p. 43–59.
15. Klimt B, Yang Y. Introducing the Enron Corpus. Machine Learning. 2004.
16. Akoglu L, Faloutsos C. Event detection in time series of mobile communication graphs. In: Army Sci-
ence Conference; 2010. p. 77–79.
17. Hirose S, Yamanishi K, Nakata T, Fujimaki R. Network anomaly detection based on eigen equation
compression. In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discov-
ery and data mining. ACM; 2009. p. 1185–1194.
18. Gupta M, Gao J, Aggarwal C, Han J. Outlier detection for temporal data. Synthesis Lectures on Data
Mining and Knowledge Discovery. 2014; 5(1):1–129.
19. Akoglu L, Tong H, Koutra D. Graph based anomaly detection and description: a survey. Data Mining
and Knowledge Discovery. 2015; 29(3):626–688.
20. Hawkins DM, Qiu P, Kang CW. The changepoint model for statistical process control. Journal of quality
technology. 2003; 35(4):355–366.
21. Priebe CE, Conroy JM, Marchette DJ, Park Y. Scan statistics on enron graphs. Computational & Mathe-
matical Organization Theory. 2005; 11(3):229–247.
22. Bridges RA, Collins JP, Ferragut EM, Laska JA, Sullivan BD. Multi-level anomaly detection on time-
varying graph data. In: Advances in Social Networks Analysis and Mining (ASONAM), 2015 IEEE/ACM
International Conference on. IEEE; 2015. p. 579–583.
23. Moreno S, Neville J. Network hypothesis testing using mixed Kronecker product graph models. In: Data
Mining (ICDM), 2013 IEEE 13th International Conference on. IEEE; 2013. p. 1163–1168.
24. Donnat C, Holmes S, et al. Tracking network dynamics: A survey using graph distances. The Annals of
Applied Statistics. 2018; 12(2):971–1012.
25. Facchetti G, Iacono G, Altafini C. Computing global structural balance in large-scale signed social net-
works. Proceedings of the National Academy of Sciences. 2011; 108(52):20953–20958.
26. Tong H, Lin CY. Non-negative residual matrix factorization with application to graph anomaly detec-
tion. In: Proceedings of the 2011 SIAM International Conference on Data Mining. SIAM; 2011. p. 143–
153.
27. Aksoy SG, Nowak KE, Purvine E, Young SJ. Relative Hausdorff distance for network analysis. Applied
Network Science. 2019; 4(1):80.
28. Braha D, Bar-Yam Y. From centrality to temporary fame: Dynamic centrality in complex networks. Com-
plexity. 2006; 12(2):59–63.
29. Braha D, Bar-Yam Y. Time-dependent complex networks: Dynamic centrality, dynamic motifs, and
cycles of social interactions. In: Adaptive Networks. Springer; 2009. p. 39–50.
30. Palla G, Pollner P, Baraba
´si AL, Vicsek T. Social group dynamics in networks. In: Adaptive Networks.
Springer; 2009. p. 11–38.
31. Clauset A, Shalizi CR, Newman MEJ. Power-law distributions in empirical data. SIAM review. 2009; 51
(4):661–703.
32. Efron B, Tibshirani RJ. An introduction to the bootstrap. CRC press; 1994.
33. Erdo
¨s P, Re
´nyi A. On the evolution of random graphs. Publ Math Inst Hungar Acad Sci. 1960; 5:17–61.
34. Watts DJ. Networks, dynamics, and the small-world phenomenon. American Journal of sociology.
1999; 105(2):493–527.
35. Paranjape A, Benson AR, Leskovec J. Motifs in temporal networks. In: Proceedings of the Tenth ACM
International Conference on Web Search and Data Mining. ACM; 2017. p. 601–610.
36. Glazer A, Lindenbaum M, Markovitch S. Learning high-density regions for a generalized kolmogorov-
smirnov test in high-dimensional data. In: Advances in neural information processing systems; 2012. p.
728–736.
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 22 / 23
37. Granger CWJ. Extracting information from mega-panels and high-frequency data. Statistica Neerlan-
dica. 1998; 52(3):258–272.
38. Kullback S, Leibler RA. On information and sufficiency. The annals of mathematical statistics. 1951;
22(1):79–86.
39. Granovetter M. The strength of weak ties: A network theory revisited. Sociological theory. 1983; 1
(1):201–233.
40. Burt RS. The network structure of social capital. Research in organizational behavior. 2000; 22:345–
423.
41. Haynie DL. Delinquent peers revisited: Does network structure matter? 1. American journal of sociol-
ogy. 2001; 106(4):1013–1057.
42. Spencer JW. Global gatekeeping, representation, and network structure: a longitudinal analysis of
regional and global knowledge-diffusion networks. Journal of International Business Studies. 2003;
34(5):428–442.
43. Kossinets G, Watts DJ. Empirical analysis of an evolving social network. Science. 2006; 311(5757):88–
90. https://doi.org/10.1126/science.1116869 PMID: 16400149
44. Fowler JH, Christakis NA, et al. Dynamic spread of happiness in a large social network: longitudinal
analysis over 20 years in the Framingham Heart Study. Bmj. 2008; 337:a2338. https://doi.org/10.1136/
bmj.a2338 PMID: 19056788
45. Lazer D, Pentland AS, Adamic L, Aral S, Barabasi AL, Brewer D, et al. Life in the network: the coming
age of computational social science. Science (New York, NY). 2009; 323(5915):721.
46. Leskovec J, Kleinberg J, Faloutsos C. Graph evolution: Densification and Shrinking Diameters. ACM
Transactions on Knowledge Discovery from Data. 2007; 1(1):2–es.
47. Phelps CC. A longitudinal study of the influence of alliance network structure and composition on firm
exploratory innovation. Academy of Management Journal. 2010; 53(4):890–913.
48. Ilany A, Booms AS, Holekamp KE. Topological effects of network structure on long-term social network
dynamics in a wild mammal. Ecology letters. 2015; 18(7):687–695. https://doi.org/10.1111/ele.12447
PMID: 25975663
PLOS ONE
Size agnostic CPD framework
PLOS ONE | https://doi.org/10.1371/journal.pone.0231035 April 10, 2020 23 / 23
... The obtained control chart for the observed events on the Ask Ubuntu website is shown in Fig. 11. After a thorough analysis of the found change points, it was discovered that they were in line with the release of new version modifications that came in September, and October 2015 of Ubuntu (Miller and Mokryn 2020). The suggested chart, which is based on the MENPC technique, shows how effective it is at detecting changes. ...
Article
Full-text available
Online social networks have become indispensable in modern life, facilitating knowledge sharing, social communication, and business marketing. To gain a deeper understanding of individuals' behavior within social networks, researchers have undertaken essential analytical tasks such as change point detection. Recently, nonparametric change point detection methods have attracted researchers' attention due to their generality and flexibility. However, existing methods exhibit limitations, including overlooking network structure, reliance on case-based network attributes, and neglecting the dynamic nature of data, which may have correlations in evolving social networks. In this study, we propose a novel multivariate mixed-effects nonparametric profile control (MENPC) algorithm to address these limitations. The advantage of MENPC relies on its unique point of view in approaching network data, where it incorporates the dynamic nature of data into the monitoring process without assuming internal independence of networks over time. Additionally, it takes into account the network structure by considering both nodal and network-level attributes. Furthermore, by introducing an updating trick formula, the proposed algorithm simplifies computations, effectively balancing memory and speed for online monitoring. We evaluate the effectiveness of MENPC through comprehensive numerical experiments using the degree correlated stochastic block model to simulate interactions in evolving online social networks. The results demonstrate MENPC's superior performance in terms of expected detection delay, showcasing its accuracy and efficiency in comparison to competing approaches including Wilson, and eigenvalue methods. Applying MENPC to the Enron email network dataset further confirms its significant progress in social network monitoring, expanding its potential for various applications.
... Fig. 13 illustrates the obtained control chart for the observed events in Ask Ubuntu website. Upon conducting a detailed investigation of the identified change points, it was revealed that they correlate with the changes corresponding to new releases of Ubuntu in April, July, September, and October of 2015 (Miller & Mokryn, 2020). The proposed chart based on ACLR method demonstrates its efficacy in facilitating change point detection analysis within social network platforms. ...
Article
Full-text available
Nowadays, online social networks play a crucial role in shaping human communication in various life activities. Social Network Analysis (SNA) provides valuable insights for businesses, authorities, and platform owners. One of the challenging tasks in SNA is detecting sequential change points in observed events in social networks when the parameters of statistical distribution of post-change networks are unknown. This challenging problem is particularly prominent in various real-world network systems, especially when the events in the networks can be modeled through a Hawkes process. Identifying change points in the stream of social network data, where the underlying statistical properties undergo significant changes, necessitates the development of adaptive online algorithms. Additionally, in cases where the use of maximum likelihood estimators is impractical or when no exact recursive function for likelihood is available, addressing this issue becomes more complex. This paper proposes likelihood estimators using online convex optimization methods, incorporating the adaptive moment estimation (ADAM) algorithm. The proposed method is seamlessly integrated into the sequential anomaly detection procedure for events in social networks. Experimental results on monitoring time between events demonstrate lower Expected Delay Detection (EDD), indicating the superiority of the proposed algorithm in both synthetic and real-world datasets such as Facebook and contact networks of individuals causing disease transmission. The proposed robust solution provides an efficient practical tool in situations where traditional methods face limitations in swift detection with high accuracy.
... To relax the model assumptions, many model-free approaches for NCPD extract graph features and use classical discrepancy measures to quantify the amount of change between two subsets of snapshots. For instance, Miller and Mokryn (2020) use the degree distribution as the snapshots' features, while Wang et al. (2017) and Huang et al. (2020) respectively choose the joint distribution of a set of edges, and the Laplacian eigenvectors as graph features. For directly comparing pairs of snapshots, several graph similarity functions, (pseudo)-distances, and kernels, have been used, such as the DeltaCon metric (Koutra et al., 2016), the Hamming distance (Donnat & Holmes, 2018), the Frobenius distance (Barnett & Onnela, 2016), the Laplacian spectral distance (Cribben & Yu, 2017;Hewapathirana et al., 2020), the 2 or ∞ distances , and graph kernels (Desobry et al., 2005;Gretton et al., 2008;Harchaoui et al., 2009). ...
Article
Full-text available
Dynamic networks are ubiquitous for modelling sequential graph-structured data, e.g., brain connectivity, population migrations, and social networks. In this work, we consider the discrete-time framework of dynamic networks and aim at detecting change-points, i.e., abrupt changes in the structure or attributes of the graph snapshots. This task is often termed network change-point detection and has numerous applications, such as market phase discovery, fraud detection, and activity monitoring. In this work, we propose a data-driven method that can adapt to the specific network domain, and be used to detect distribution changes with no delay and in an online setting. Our algorithm is based on a siamese graph neural network, designed to learn a graph similarity function on the graph snapshots from the temporal network sequence. Without any prior knowledge on the network generative distribution and the type of change-points, our learnt similarity function allows to more effectively compare the current graph and its recent history, compared to standard graph distances or kernels. Moreover, our method can be applied to a large variety of network data, e.g., networks with edge weights or node attributes. We test our method on synthetic and real-world dynamic network data, and demonstrate that it is able to perform online network change-point detection in diverse settings. Besides, we show that it requires a shorter data history to detect changes than most existing state-of-the-art baselines.
... The last decade's abundance of temporal information paved the path to further understanding of the temporal dynamics of networks 8,42,43 . Temporal networks have become the playground for inferring behavior in a plethora of areas, including, but not limited to, the inference of the effects of changes on the evolution of networks 44,45 ; revealing hidden structures, such as the revelation of the structure of "co-presence" in a metropolitan or university from temporal daily encounters 46 ; the temporal, spatial diffusion of information in social media 47 , and the behavior of viral processes over it 48 . Incorporating temporal dynamics into existing viral models has proven www.nature.com/scientificreports/ ...
Article
Full-text available
Interaction-driven modeling of diseases over real-world contact data has been shown to promote the understanding of the spread of diseases in communities. This temporal modeling follows the path-preserving order and timing of the contacts, which are essential for accurate modeling. Yet, other important aspects were overlooked. Various airborne pathogens differ in the duration of exposure needed for infection. Also, from the individual perspective, Covid-19 progression differs between individuals, and its severity is statistically correlated with age. Here, we enrich an interaction-driven model of Covid-19 and similar airborne viral diseases with (a) meetings duration and (b) personal disease progression. The enriched model enables predicting outcomes at both the population and the individual levels. It further allows predicting individual risk of engaging in social interactions as a function of the virus characteristics and its prevalence in the population. We further showed that the enigmatic nature of asymptomatic transmission stems from the latent effect of the network density on this transmission and that asymptomatic transmission has a substantial impact only in sparse communities.
... The last decade's abundance of temporal information paved the path to further understanding of the temporal dynamics of networks 8,42,43 . Temporal networks have become the playground for inferring behavior in a plethora of areas, including, but not limited to, the inference of the effects of changes on the evolution of networks 44,45 ; revealing hidden structures, such as the revelation of the structure of "co-presence" in a metropolitan or university from temporal daily encounters 46 ; the temporal, spatial diffusion of information in social media 47 , and the behavior of viral processes over it 48 . Incorporating temporal dynamics into existing viral models has proven challenging. ...
Preprint
Full-text available
Interaction-driven modeling of diseases over real-world contact data has been shown to promote the understanding of the spread of diseases in communities. This temporal modeling follows the path-preserving order and timing of the contacts, which are essential for accurate modeling. Yet, other important aspects were overlooked. Various airborne pathogens differ in the duration of exposure needed for infection. Also, from the individual perspective, Covid-19 progression differs between individuals, and its severity is statistically correlated with age. Here, we enrich an interaction-driven model of Covid-19 and similar airborne viral diseases with (a) meetings duration and (b) personal disease progression. The enriched model enables predicting outcomes at both the population and the individual levels. It further allows predicting individual risk of engaging in social interactions as a function of the virus characteristics and its prevalence in the population. We further showed that the enigmatic nature of asymptomatic transmission stems from the latent effect of the network density on this transmission and that asymptomatic transmission has a substantial impact only in sparse communities.
Preprint
Full-text available
The analysis of high-dimensional timeline data and the identification of outliers and anomalies is critical across diverse domains, including sensor readings, biological and medical data, historical records, and global statistics. However, conventional analysis techniques often struggle with challenges such as high dimensionality, complex distributions, and sparsity. These limitations hinder the ability to extract meaningful insights from complex temporal datasets, making it difficult to identify trending features, outliers, and anomalies effectively. Inspired by surprisability -- a cognitive science concept describing how humans instinctively focus on unexpected deviations - we propose Learning via Surprisability (LvS), a novel approach for transforming high-dimensional timeline data. LvS quantifies and prioritizes anomalies in time-series data by formalizing deviations from expected behavior. LvS bridges cognitive theories of attention with computational methods, enabling the detection of anomalies and shifts in a way that preserves critical context, offering a new lens for interpreting complex datasets. We demonstrate the usefulness of LvS on three high-dimensional timeline use cases: a time series of sensor data, a global dataset of mortality causes over multiple years, and a textual corpus containing over two centuries of State of the Union Addresses by U.S. presidents. Our results show that the LvS transformation enables efficient and interpretable identification of outliers, anomalies, and the most variable features along the timeline.
Article
Change point detection in dynamic networks aims to detect the points of sudden change or abnormal events within the network. It has garnered substantial interest from researchers due to its potential to enhance the stability and reliability of real-world networks. Most change point detection methods are based on statistical characteristics and phased training, and some methods are required to set the percent of change points. Meanwhile, existing methods for change point detection suffer from two limitations. On one hand, they struggle to extract snapshot features that are crucial for accurate change point detection, thereby limiting their overall effectiveness. On the other hand, they are typically tailored for specific network types and lack the versatility to adapt to networks of varying scales. To solve these issues, we propose a novel unified end-to-end framework called Variational Graph Gaussian Mixture model (VGGM) for change point detection in dynamic networks. Specifically, VGGM combines Variational Graph Auto-Encoder (VGAE) and Gaussian Mixture Model (GMM) through joint training, incorporating a Mixture-of-Gaussians prior to model dynamic networks. This approach yields highly effective snapshot embeddings via VGAE and a dedicated readout function, while automating change point detection through GMM. The experimental results, conducted on both real-world and synthetic datasets, clearly demonstrate the superiority of our model in comparison to the current state-of-the-art methods for change point detection.
Article
Dynamic graphs are rich data structures that are used to model complex relationships between entities over time. In particular, anomaly detection in temporal graphs is crucial for many real world applications such as intrusion identification in network systems, detection of ecosystem disturbances and detection of epidemic outbreaks. In this paper, we focus on change point detection in dynamic graphs and address three main challenges associated with this problem: i) how to compare graph snapshots across time, ii) how to capture temporal dependencies, and iii) how to combine different views of a temporal graph. To solve the above challenges, we first propose Laplacian Anomaly Detection (LAD) which uses the spectrum of graph Laplacian as the low dimensional embedding of the graph structure at each snapshot. LAD explicitly models short term and long term dependencies by applying two sliding windows. Next, we propose MultiLAD, a simple and effective generalization of LAD to multi-view graphs. MultiLAD provides the first change point detection method for multi-view dynamic graphs. It aggregates the singular values of the normalized graph Laplacian from different views through the scalar power mean operation. Through extensive synthetic experiments, we show that i) LAD and MultiLAD are accurate and outperforms state-of-the-art baselines and their multi-view extensions by a large margin, ii) MultiLAD’s advantage over contenders significantly increases when additional views are available, and iii) MultiLAD is highly robust to noise from individual views. In five real world dynamic graphs, we demonstrate that LAD and MultiLAD identify significant events as top anomalies such as the implementation of government COVID-19 interventions which impacted the population mobility in multi-view traffic networks.
Chapter
In this chapter, we show an application of the MDL principle to change detection. It is one of most practically important issues in data science, where the MDL principle plays a key role in designing effective change detection algorithms. We classify the change detection issues into parameter change detection and latent structure change detection. Each of both issues is further classified to abrupt change detection and gradual change detection. We introduce the notions of the MDL change statistics and dynamic model selection (DMS) with their sequential variants and hierarchical ones. We thereby present a unifying strategy for abrupt change detection. Furthermore, we address a more challenging issue, which we call change sign detection. This aims to detect the starting points of gradual changes as early as possible. To this end, we introduce the notions of differential MDL change statistics and structural entropy and give their rationale.
Article
Full-text available
Abstract Similarity measures are used extensively in machine learning and data science algorithms. The newly proposed graph Relative Hausdorff (RH) distance is a lightweight yet nuanced similarity measure for quantifying the closeness of two graphs. In this work we study the effectiveness of RH distance as a tool for detecting anomalies in time-evolving graph sequences. We apply RH to cyber data with given red team events, as well to synthetically generated sequences of graphs with planted attacks. In our experiments, the performance of RH distance is at times comparable, and sometimes superior, to graph edit distance in detecting anomalous phenomena. Our results suggest that in appropriate contexts, RH distance has advantages over more computationally intensive similarity measures.
Article
Full-text available
The concept of “Structural Diversity” of a network refers to the level of dissimilarity between the various agents acting in the system, and it is typically interpreted as the number of connected components in the network. This key property of networks has been studied in multiple settings, including diffusion of ideas in social networks and functional diversity of regions in brain networks. Here, we propose a new measure, “Structural Entropy”, as a revised interpretation to “Structural Diversity”. The proposed measure relies on the finer-grained network communities (in contrast to the network’s connected components), and takes into consideration both the number of communities and their sizes, generating a single representative value. We then propose an approach for monitoring the structure of correlation-based networks over time, which relies on the newly suggested measure. Finally, we illustrate the usefulness of the new approach, by applying it to the particular case of emergent organization of financial markets. This provides us a way to explore their underlying structural changes, revealing a remarkably high linear correlation between the new measure and the volatility of the assets’ prices over time.
Article
Full-text available
Statistical process control (SPC) requires statistical methodologies that detect changes in the pattern of data over time. The common methodologies, such as Shewhart, cumulative sum (cusum), and exponentially weighted moving average (EWMA) charting, require the in-control values of the process parameters, but these are rarely known accurately. Using estimated parameters, the run length behavior changes randomly from one realization to another, making it impossible to control the run length behavior of any particular chart. A suitable methodology for detecting and diagnosing step changes based on imperfect process knowledge is the unknown-parameter changepoint formulation. Long recognized as a Phase I analysis tool, we argue that it is also highly effective in allowing the user to progress seamlessly from the start of Phase I data gathering through Phase II SPC monitoring. Despite not requiring specification of the post-change process parameter values, its performance is never far short of that of the optimal cusum chart which requires this knowledge, and it is far superior for shifts away from the cusum shift for which the cusum chart is optimal. As another benefit, while changepoint methods are designed for step changes that persist, they are also competitive with the Shewhart chart, the chart of choice for isolated non-sustained special causes.
Article
Full-text available
A number of real world problems in many domains (e.g. sociology, biology, political science and communication networks) can be modeled as dynamic networks with nodes representing entities of interest and edges representing interactions among the entities at different points in time. A common representation for such models is the snapshot model - where a network is defined at logical time-stamps. An important problem under this model is change point detection. In this work we devise an effective and efficient three-step-approach for detecting change points in dynamic networks under the snapshot model. Our algorithm achieves up to 9X speedup over the state-of-the-art while improving quality on both synthetic and real world networks.
Chapter
In this paper I introduce a framework for modeling temporal communication networks and dynamical processes unfolding on such networks. The framework originates from the (new) observation that there is a meaningful division of temporal communication networks into six dynamic classes, where the class of a network is determined by its generating process. In particular, each class is characterized by a fundamental structure: a temporal-topological network motif, which corresponds to the network representation of communication events in that class of network. These fundamental structures constrain network configurations: only certain configurations are possible within a dynamic class. In this way the framework presented here highlights strong constraints on network structures, which simplify analyses and shape network flows. Therefore the fundamental structures hold the potential to impact how we model temporal networks overall. I argue below that networks within the same class can be meaningfully compared, and modeled using similar techniques, but that integrating statistics across networks belonging to separate classes is not meaningful in general. This paper presents a framework for how to analyze networks in general, rather than a particular result of analyzing a particular dataset. I hope, however, that readers interested in modeling temporal networks will find the ideas and discussion useful in spite of the paper’s more conceptual nature.
Article
From longitudinal biomedical studies to social networks, graphs have emerged as essential objects for describing evolving interactions between agents in complex systems. In such studies, after pre-processing, the data are encoded by a set of graphs, each representing a system’s state at a different point in time or space. The analysis of the system’s dynamics depends on the selection of the appropriate analytical tools. In particular, after specifying properties characterizing similarities between states, a critical step lies in the choice of a distance between graphs capable of reflecting such similarities. While the literature offers a number of distances to choose from, their properties have been little investigated and no guidelines regarding the choice of such a distance have yet been provided. In particular, most graph distances consider that the nodes are exchangeable—ignoring node “identities.” Alignment of the graphs according to identified nodes enables us to enhance these distances’ sensitivity to perturbations in the network and detect important changes in graph dynamics. Thus the selection of an adequate metric is a decisive—yet delicate—practical matter. In the spirit of Goldenberg et al.’s seminal 2009 review [Found. Trends Mach. Learn. 2 (2010) 129–233], this article provides an overview of commonly-used graph distances and an explicit characterization of the structural changes that they are best able to capture. We show how these choices affect real-life situations, and we use these distances to analyze both a longitudinal microbiome dataset and a brain fMRI study. One contribution of the present study is a coordinated suite of data analytic techniques, displays and statistical tests using “metagraphs”: a graph of graphs based on a chosen metric. Permutation tests can uncover the effects of covariates on the graphs’ variability. Furthermore, synthetic examples provide intuition as to the qualities and drawbacks of the different distances. Above all, we provide some guidance on choosing one distance over another in different contexts. Finally, we extend the scope of our analyses from temporal to spatial dynamics and apply these different distances to a network created from worldwide recipes.
Conference Paper
A number of real world problems in many domains (e.g. sociology, biology, political science and communication networks) can be modeled as dynamic networks with nodes representing entities of interest and edges representing interactions among the entities at different points in time. A common representation for such models is the snapshot model - where a network is defined at logical time-stamps. An important problem under this model is change point detection. In this work we devise an effective and efficient three-step-approach for detecting change points in dynamic networks under the snapshot model. Our algorithm achieves up to 9X speedup over the state-of-the-art while improving quality on both synthetic and real world networks.
Conference Paper
Networks are a fundamental tool for modeling complex systems in a variety of domains including social and communication networks as well as biology and neuroscience. The counts of small subgraph patterns in networks, called network motifs, are crucial to understanding the structure and function of these systems. However, the role of network motifs for temporal networks, which contain many timestamped links between nodes, is not well understood. Here we develop a notion of a temporal network motif as an elementary unit of temporal networks and provide a general methodology for counting such motifs. We define temporal network motifs as induced subgraphs on sequences of edges, design several fast algorithms for counting temporal network motifs, and prove their runtime complexity. We also show that our fast algorithms achieve 1.3x to 56.5x speedups compared to a baseline method. We use our algorithms to count temporal network motifs in a variety of real-world datasets. Results show that networks from different domains have significantly different motif frequencies, whereas networks from the same domain tend to have similar motif frequencies. We also find that measuring motif counts at various time scales reveals different behavior.