ArticlePDF Available

Deep-learning continuous gravitational waves

Authors:

Abstract and Figures

We present a first proof-of-principle study for using deep neural networks (DNNs) as a novel search method for continuous gravitational waves (CWs) from unknown spinning neutron stars. The sensitivity of current wide-parameter-space CW searches is limited by the available computing power, which makes neural networks an interesting alternative to investigate, as they are extremely fast once trained and have recently been shown to rival the sensitivity of matched filtering for black-hole merger signals [D. George and E. A. Huerta, Phys. Rev. D 97, 044039 (2018); H. Gabbard, M. Williams, F. Hayes, and C. Messenger, Phys. Rev. Lett. 120, 141103 (2018)]. We train a convolutional neural network with residual (shortcut) connections and compare its detection power to that of a fully coherent matched-filtering search using the Weave pipeline [K. Wette, S. Walsh, R. Prix, and M. A. Papa, Phys. Rev. D 97, 123016 (2018)]. As test benchmarks we consider two types of all-sky searches over the frequency range from 20 to 1000 Hz: an “easy” search using T=105 s of data, and a “harder” search using T=106 s. The detection probability pdet is measured on a signal population for which matched filtering achieves pdet=90% in Gaussian noise. In the easiest test case (T=105 s at 20 Hz) the DNN achieves pdet∼88%, corresponding to a loss in sensitivity depth of ∼5% versus coherent matched filtering. However, at higher frequencies and for longer observation times the DNN detection power decreases, until pdet∼13% and a loss of ∼66% in sensitivity depth in the hardest case (T=106 s at 1000 Hz). We study the DNN generalization ability by testing on signals of different frequencies, spindowns and signal strengths than they were trained on. We observe excellent generalization: only five networks, each trained at a different frequency, would be able to cover the whole frequency range of the search.
Content may be subject to copyright.
Deep-learning continuous gravitational waves
Christoph Dreissigacker,1,2,* Rahul Sharma,3,1,2 Chris Messenger,4Ruining Zhao,5,6,7 and Reinhard Prix1,2
1Max Planck Institute for Gravitational Physics (Albert-Einstein-Institute), D-30167 Hannover, Germany
2Leibniz Universität Hannover, D-30167 Hannover, Germany
3Birla Institute of Technology and Science, Pilani, Rajasthan 333031, India
4SUPA, School of Physics and Astronomy, University of Glasgow, Glasgow G12 8QQ, United Kingdom
5Key Laboratory of Optical Astronomy, National Astronomical Observatories,
Chinese Academy of Sciences, Beijing 100101, China
6University of Chinese Academy of Sciences, Beijing 100049, China
7Department of Astronomy, Beijing Normal University, Beijing 100875, China
(Received 6 May 2019; published 7 August 2019)
We present a first proof-of-principle study for using deep neural networks (DNNs) as a novel
search method for continuous gravitational waves (CWs) from unknown spinning neutron stars. The
sensitivity of current wide-parameter-space CW searches is limited by the available computing power,
which makes neural networks an interesting alternative to investigate, as they are extremely fast once
trained and have recently been shown to rival the sensitivity of matched filtering for black-hole merger
signals [D. George and E. A. Huerta, Phys.Rev.D97, 044039 (2018); H. Gabbard, M. Williams, F.
Hayes, and C. Messenger, Phys. Rev. Lett. 120, 141103 (2018)]. We train a convolutional neural network
with residual (shortcut) connections and compare its detection power to that of a fully coherent matched-
filtering search using the W
EAVE
pipeline[K.Wette,S.Walsh,R.Prix,andM.A.Papa,Phys. Rev. D 97,
123016 (2018)]. As test benchmarks we consider two types of all-sky searches over the frequency range
from20to1000Hz:aneasysearch using T¼105s of data, and a hardersearch using T¼106s.
The detection probability pdet is measured on a signal population for which matched filtering achieves
pdet ¼90% in Gaussian noise. In the easiest test case (T¼105s at 20 Hz) the DNN achieves
pdet 88%, corresponding to a loss in sensitivity depth of 5% versus coherent matched filtering.
However, at higher frequencies and for longer observation times the DNN detection power decreases,
until pdet 13% and a loss of 66% in sensitivity depth in the hardest case (T¼106sat1000Hz).
We study the DNN generalization ability by testing on signals of different frequencies, spindowns
and signal strengths than they were trained on. We observe excellent generalization: only five networks,
each trained at a different frequency, would be able to cover the whole frequency range of the
search.
DOI: 10.1103/PhysRevD.100.044009
I. INTRODUCTION
Gravitational waves from binary mergers are now being
observed routinely [14] by the Advanced LIGO [5] and
Virgo [6] detectors. In contrast, the much weaker and
longer-lasting (daysmonths) narrow-band continuous
gravitational waves (CWs) from spinning deformed neu-
tron stars are yet to be detected, despite a multitude of
searches over the past decade (see Refs. [79] for reviews)
and continuing improvements in search methods (see e.g.,
Ref. [10] for a recent overview).
A key limitation of current search methods for CWs with
unknown parameters is the exploding computing cost
problem: give that a putative signal would be very weak,
one needs to integrate as much data as possible in order to
increase the signal-to-noise ratio (SNR). However, for a
fully coherent matched-filtering search (which is close to
statistically optimal [11]), the corresponding computing
cost grows as a high power Tnof the data time span T,
with n5. This typically limits the longest coherent
duration to daysweeks before the computing cost would
become infeasible.
Therefore the class of semicoherent methods has been
developed, producing computationally cheaper searches.
They allow the analysis of more data, typically resulting in
*christoph.dreissigacker@aei.mpg.de
Published by the American Physical Society under the terms of
the Creative Commons Attribution 4.0 International license.
Further distribution of this work must maintain attribution to
the author(s) and the published articles title, journal citation,
and DOI.
PHYSICAL REVIEW D 100, 044009 (2019)
2470-0010=2019=100(4)=044009(11) 044009-1 Published by the American Physical Society
better sensitivity than a corresponding coherent search at
fixed computing cost (see e.g., Refs. [12,13]). Such
methods combined with massive amounts of computing
power, either via local computer clusters or via the
distributed public computing platform Einstein@Home
[14], currently yield the best state-of-the-art sensitivity to
CW signals (see e.g., Refs. [1517] for recent examples).
In this work we investigate deep neural networks
(DNNs) [1820] as a novel search method for CWs. The
field of DNNs, also referred to as deep learning, has
emerged as one of the most successful machine-learning
paradigms in the last decade, dominating wide-ranging
fields [20] such as image recognition, speech recognition
and language translation, as well as certain board [21] and
video games [22,23].
More recently DNNs have started to draw attention in
the field of gravitational-wave searches (i) as a classifier
for non-Gaussian detector transients (glitches)[2427],
(ii) as a search method for unmodeled burst signals
[28,29] in time-frequency images produced by coherent
WaveBurst [30], and (iii) as a direct detection method for
black-hole merger signals in gravitational-wave strain
data [3136].
This last approach (iii) is of particular interest to us, as
Refs. [31,32] illustrated for the first time that DNNs can
achieve a detection power comparable to that of (near-
optimal) matched filtering, at a fraction of the search time.
This is relevant for CW searches: while semicoherent
methods for wide-parameter-space searches are the most
sensitive approach currently known, they are by design less
sensitive than the statistical optimum achievable according
to the Neyman-Pearson-Searle lemma [37].
With DNNs the computationally expensive step is
shifted to the preparation stage of the searchthe archi-
tecture tuning and learningof optimal network weights
(i.e., the training)while the execution time on given input
vectors is very short (typically fractions of a second). The
determination of the noise distribution (for estimation of
the false-alarm level pfa) and measurement of upper limits
require many repeated searches over different input data
sets, with and without injected signals. The relative
search speed advantage of DNNs compared to traditional
search methods therefore accumulates dramatically over
these operations allowing very fast and flexible search
characterizations.
The plan of this paper is as follows. In Sec. II we define
and characterize our test benchmarks. In Sec. III we
describe our deep-learning approach to searching for
continuous gravitational waves, and explain the network
architecture and how it was trained. In Sec. IV we
characterize the performance our DNN achieves on the
test benchmarks in comparison to the matched-filtering
performance and how it generalizes beyond the bench-
markssearch parameters. And finally we discuss these
results in Sec. V.
II. COMPARISON TEST BENCHMARKS
A. Benchmark definitions
In order to characterize the detection power of the DNN
that we introduce in the next section, we define two
benchmark search setups and measure the corresponding
sensitivity achieved on them with a classical (near-optimal)
matched-filter search method described in Sec. II B.
We compare the sensitivity in the Neyman-Pearson
sense, also known as the receiver-operator characteristic
(ROC), using the upper limitconventions used in most
CW searches (cf. Ref. [10]): measure the detection prob-
ability pdet at a chosen false-alarm level pfa for a signal
population of fixed amplitude h0, with all other signal
parameters (i.e., polarization, sky position, frequency and
spindown) drawn randomly from their priors. In order to
characterize the signal strength in noise, we use the
sensitivity depth D[10,38], defined as
D
ffiffiffiffi
Sn
p
h0
;ð1Þ
where Snis the power spectral density of the (Gaussian)
noise at the signal frequency, and h0is the signal amplitude.
In particular we are interested in the sensitivity depth D90%
that corresponds to the signal amplitude h90%
0at which the
search yields a detection probability of pdet ¼90% at a
fixed false-alarm level, which here is chosen as pfa ¼1%
per 50 mHz frequency band.
We choose to use stationary white Gaussian noise, which
allows us to efficiently generate training data, and it
simplifies comparing against the sensitivity of idealized
matched filtering.
We consider two all-sky searches (parameters summa-
rized in Table I) over a range of frequency fand first-order
spindown _
f, one using T¼105s1.2days, and one
using T¼106s12 days of data assuming a single
detector (chosen as LIGO Hanford). These two searches
could realistically be performed with coherent matched
filtering. The required computing cost for the search and its
characterization (upper limits, false-alarm level) however
would still require a large cluster of, say, Oð1000Þcores for
over a month or so (see Table II). Therefore actually
performing these two full searches only for the purpose
of characterizing the matched-filtering sensitivity would be
TABLE I. Definition of the two benchmark searches.
Data span T¼105s=T ¼106s
Detectors LIGO Hanford
Noise Stationary, white, Gaussian
Sky-region All-sky
Frequency band f½20;1000Hz
Spindown range _
f½1010;0Hz=s
CHRISTOPH DREISSIGACKER et al. PHYS. REV. D 100, 044009 (2019)
044009-2
infeasible. Instead we characterize the matched-filter
search on only five narrow frequency bands of width Δf¼
50 mHz starting at frequencies f0¼20, 100, 200, 500 and
1000 Hz, yielding a total of ten representative test cases.
B. W
EAVE
matched-filtering sensitivity
For the matched-filter search we use the recently
developed W
EAVE
code [39], which implements a state-
of-the-art CW search algorithm [40] based on summing
coherent Fstatistics [41,42] over semicoherent segments
on optimal lattice-based template banks [43,44]. This
code can also perform fully coherent (i.e., single-segment)
F-statistic searches, which we use for the present proof-of-
principle study. The benchmark search definitions in
Table Iare chosen in such a way that a fully coherent
search is still computationally feasible. This yields a
simpler and cleaner comparison than using a semicoherent
search setup, as we can easily design near-optimal search
setups (by using relatively fine template banks) without the
extra complication of requiring costly sensitivity optimi-
zation at fixed computing cost [13,40,45].
The W
EAVE
template banks are characterized by a
maximal-mismatch parameter m, which controls how fine
the templates are spaced in parameter space. These are
chosen as m¼0.1and m¼0.2for the two searches with
T¼105and T¼106s, respectively. The reason for
choosing the larger mismatch value (i.e., coarser template
bank) in the T¼106s case is to keep the computing cost
of the corresponding test cases still practically manageable,
as the coherent cost scales with the mismatch parameter as
m2for a four-dimensional template bank [see e.g.,
Eq. (24) in [43]].
By repeated injections of signals in the data and recovery
of the loudest F-statistic candidate in the template bank,
one can measure the relative SNR loss μcompared to a
perfectly matched template. The resulting measured aver-
age mismatch hμiquantifies in some sense how close to
optimalthe matched-filter sensitivity will be (compared
to an infinite computing cost search with m¼0), and is
found as hμi5% and hμi11%, respectively for the two
searches.
Using the template-counting and timing models
[39,46,47] for W
EAVE
and the resampling Fstatistic, we
can estimate the total number of templates and the
corresponding total runtime for these two benchmark
searches as 78 and 45000 days on a single CPU core,
respectively. Table II provides a summary of the W
EAVE
search parameters and characteristics.
In order to estimate the sensitivity for the ten test cases
defined in the previous section (i.e., five frequency slices of
Δf¼50 mHz for each search of T¼105and T¼106s,
respectively), we first determine the corresponding detec-
tion thresholds Fth on the Fstatistic corresponding to a
false-alarm level of pfa ¼1% for each case. This is done
by repeatedly (105times for T¼105s, and 104times for
T¼106s, respectively) performing each search over
Gaussian noise and thereby recording the distribution of
the loudest candidate, which yields the relationship
between the threshold and false-alarm level. The corre-
sponding detection probability pdet for any given signal
population of fixed Dis obtained by injecting signals into
Gaussian noise data and measuring how many times the
loudest candidate exceeds the detection threshold. By
varying the injected Dwe can eventually find D90% for
the desired pdet ¼90% (see e.g., Ref. [10] for more details
and discussion of this standard upper limitprocedure).
By a final injection þrecovery Monte Carlo we can verify
that the achieved W
EAVE
detection probability for the
quoted thresholds and signal strengths D90% in Table III
is pdet 9091%, which is sufficiently accurate for our
present purposes.
The sky template resolution grows as f2as a function
of frequency f, resulting in a corresponding increase in the
number of templates at higher frequency. This increases the
number of trialsin noise at the higher-frequency slices,
which results in a corresponding increased false-alarm
threshold (chosen in order to keep the false-alarm level
at pfa ¼1%) as well as an increased computing cost, shown
in Table III.
TABLE II. W
EAVE
parameters and characteristics for the two
searches.
Name T¼105sT¼106s
Mismatch parameter m0.1 0.2
Average SNR loss hμi5% 11%
Number of templates N4×1011 3×1014
Search time [single CPU core] 6.7×106s3.9×109s
TABLE III. W
EAVE
characteristics for the ten test cases, each
covering a frequency sliceof Δf¼50 mHz, starting at f0,of
the full searches defined in Table I. The detection thresholds Fth
correspond to a false-alarm level of pfa ¼1% over the band Δf.
CPUΔfdenotes the search time in seconds for the respective Δf
band on a single CPU core.
f020 Hz 100 Hz 200 Hz 500 Hz 1000 Hz
T¼105s
NΔf5×1051×1075×1073×1081×109
CPUΔf[s] 0.1 4.9 19 2.3×1021.7×103
Fthðpfa Þ20.6 23.6 25.1 27.0 28.6
T¼106s
NΔf3×1088×1093×1010 2×1011 8×1011
CPUΔf[s] 45 3×1031.4×1041.6×1056.9×105
Fthðpfa Þ27.5 31.1 32.5 34.2 36.2
DEEP-LEARNING CONTINUOUS GRAVITATIONAL WAVES PHYS. REV. D 100, 044009 (2019)
044009-3
III. DEEP-LEARNING CWs
Our general approach is similar to that of Refs. [31,32] in
that we directly use the detector strain data as our network
input, and train a simple classifier with two output neurons
for the classes noiseand signal (in noise).However,
given that CW signals are long in duration and narrow in
frequency, instead of using the time-series input it makes
more sense in our case to use the frequency-domain
representation of that data. We therefore provide the real
and imaginary parts of the fast Fourier transform (FFT) of
the data as a two-dimensional input vector over frequency
bins, using the native FFT resolution of 1=T . We chose the
network input size to be sufficiently large to contain the
widest signal (signals get stretched in the frequency domain
by the spindown _
fand Doppler shifts) twice, so that we can
slide the network along the frequency axis in steps of half
the network input width, guaranteeing that one input
window will always contain the full signal.
A. Network architecture
We started experimenting with DNN architectures sim-
ilar to those described in Refs. [31,32], but eventually by
trial and error converged on a ResNet architecture [48],
which showed better performance for our problem cases.
We have chosen slightly different networks for the two
searches (T¼105and T¼106s) of Table I, as these
correspond to signals with rather different widths in the
frequency domain: the network in the T¼105s cases
contains six instances of a residual block, while in the
T¼106s cases the network uses 12.
The network layers can be separated into three parts: the
stem block, a block of multiple residual blocks, and an end
block; see Fig. 1. The stem block consists of a standard
convolutional layer, while each of the residual blocks is
built according to Ref. [48]. The end block contains a dense
softmax layer with two final output neurons, corresponding
to the estimated probability psignal that the input contains a
signal, and pnoise ¼1psignal for a pure noise sample. The
DNNs are implemented in the K
ERAS
framework [49] on
top of a T
ENSORFLOW
[50] backend.
B. DNN training and validation
Training the network is performed on a synthesized data
set of input vectors, where half contain pure Gaussian
noise, and half contain a signal added to the noise. One full
pass through this training set is commonly referred to as a
training epoch. Using a precomputed set of 10 000 signals,
each signal is added to 24 dynamically generated noise
realizations, which are also used as pure-noise inputs.
The number of signals in the training set is chosen by
observing that in all our cases we find that going beyond
10 000 signals yields only negligible further improvements
in detection probability, as shown in Fig. 2for the example
of T¼105s, f0¼1000 Hz.
The signals are scaled to a fixed depth D90%
training for each
test case and randomly shifted in frequency within the
network input window. These training depths were esti-
mated semianalytically using the method of Refs. [10,47],
and differ slightly from the final measured values D90% of
Table IV, which were not available at the time of training.
When testing the network on signals of different depths, the
detection probability behaves as expected; see Sec. IV D.
Furthermore, we found that using a different choice of
training depth did not significantly affect training success.
Every few epochs of training, we perform a validation
step, where the detection probability of the network is
measured on an independent data set. This validation set
contains another 20 000 input vectors, with half containing
signals in noise (of fixed depth D90% ), and half containing
noise only.
In order to compute the networks detection probability
pDNN
det , we treat the output neuron psignal as a statistic, and
follow the usual upper limitprocedure described in
FIG. 1. Illustration of the general network architecture used in
this study.
FIG. 2. Validation detection probability for T¼105s, f0¼
1000 Hz for training with training sets containing 10,100,
1000,10 000 and 50 000 signals.
CHRISTOPH DREISSIGACKER et al. PHYS. REV. D 100, 044009 (2019)
044009-4
Sec. II B: we repeatedly run the network on Gaussian noise
inputs in order to determine the pfa ¼1% detection thresh-
old. We then run the network on the signal set and measure
for what fraction of signals the statistic exceeds that
threshold.
The evolution of the detection probability as a function
of training epoch (or similarly, as a function of training
time) is presented in Fig. 3, illustrating the progress of
learning. In order to test the variability and dependence of
the learning success on the random initialization of the
network, we train a cloudof 100 differently initialized
network instances. We use the network at its point of best
validation performance from each test case for the further
test results presented in the next sections.
TABLE IV. Measured W
EAVE
upper limitsensitivity D90% at
a false-alarm level of pfa ¼1%.
D90% [Hz1=2]f0¼20 Hz 100 Hz 200 Hz 500 Hz 1000 Hz
T¼105s11.4 10.8 10.4 9.9 9.7
T¼106s29.3 28.2 27.6 26.8 26.0
(a) (b)
(c) (d)
FIG. 3. Validation detection probability pDNN
det of the DNN versus training time (or mean trained epoch) for 100 different network
instances trained for each of four test cases: (a) T¼105s;f
0¼20 Hz, pbest
det ¼85.7%;(b)T¼105s;f
0¼1000 Hz, pbest
det ¼71.9%;
(c) T¼106s;f
0¼20 Hz, pbest
det ¼68.3%; and (d) T¼106s;f
0¼1000 Hz, pbest
det ¼8.3%, all trained on an Nvidia GTX 750. The
solid horizontal line denotes the matched-filtering detection performance of pdet ¼90%. The red circles indicate the networks with the
best detection probability pbest
det . The error bars on each of the 100 curves are smaller than the widths of the lines.
DEEP-LEARNING CONTINUOUS GRAVITATIONAL WAVES PHYS. REV. D 100, 044009 (2019)
044009-5
Most of the training was performed on Nvidia GTX 750
GPUs. We see in Fig. 3that for most cases the improve-
ments in detection probability seem to level off after the
training time (about one day in the T¼105s cases, and
about 10 days in the T¼106s cases). However, in the case
of T¼106s;f
0¼1000 Hz seen in Fig. 3(d) [and also for
T¼106s;f
0¼500 Hz (not shown)], there still seems to
be a slowly increasing trend in detection probability at the
end of training time. Therefore we trained a single network
instance for these two cases again on a more powerful
Nvidia TITAN V GPU for many more epochs, until the
validation detection probability seemed to level off, which
is shown in Fig. 4.
Overall we observe a dramatic increase in the diffi-
cultythe DNN has in learning the different test cases along
the direction of increasing data span Tand frequency f,
also seen clearly in Table V. In the easiest case of
T¼105s;f
0¼20 Hz the DNN achieves a detection
probability of pDNN
det 88%, nearly rivaling matched-
filtering performance, while in the hardest case of T¼
106s;f
0¼1000 Hz it only manages pdet 13% (also see
Table V). This may not be very surprising, given that the
cases become increasingly more computationally intensive
(more templates) along the same axis for matched filtering,
as seen in Table III. In the frequency-domain input vectors
of the DNN, this would manifest by the signals being more
widely spread out due to increased frequency drift _
fT and
Doppler stretching.
IV. TESTING DNN PERFORMANCE
After the training and validation steps, we perform a
series of tests on the best DNN found for each test case (i.e.,
with the highest pDNN
det over all validation steps), in order to
more fully characterize its performance as a CW detection
method. In these tests we simulate the signals and noise
directly for any given depth using the standard CW
LALS
UITE
[51] machinery, in order to independently verify
the network performance. Hence we are not using a
traditionally fixed testing set but generate it on demand.
A. Verifying detection probabilities
As a sanity check we use an independent test pipeline to
confirm the detection probabilities pDNN
det for the ten cases.
These results, given in Table V, are seen to be between
0.52 percentage points higher than the corresponding
validation pbest
det originally observed in Figs. 3and 4. This
can be understood as follows: in order to speed up training,
in the validation step we do not slide the network window
over the search frequency Δfin the signal case, but instead
use only one network window fully containing the signal.
In the test case, on the other hand, we perform a more
accurate simulation of a real search by sliding (see Sec. III),
which can only increase the detection probability.
A second interesting question is how the detection
probability depends on the false-alarm level pfa (commonly
referred to as ROC curve) for a fixed signal population. This
isshowninFig.5in comparison to the matched-filter ROC.
B. Generalization in frequency f
If we want to perform a search over the whole frequency
range (e.g., as defined in Table I) using DNNs, we would
need to determine how many different networks we have to
train in order to cover this range with a reasonable overall
sensitivity. Alternatively we can also train a single DNN
with signals drawn from the full frequency range of the
search and compare its performance.
The results of these tests are shown in Fig. 6, which show
how the five DNNs, trained at their respective frequencies
f0, perform over the full frequency range of the search. In
addition we show the performance of another network that
has been trained directly over the full frequency range.
We see that the specificnetworks trained only on a
narrow frequency range still perform reasonably well over a
fairly broad range of frequencies, and especially that
networks trained at higher frequencies generalize well to
lower frequencies. This result shows that a small number of
networks Oð5Þwould be able to cover the whole frequency
FIG. 4. Validation detection probability pDNN
det of the DNN
versus training time for a single network trained on an Nvidia
TITAN V for the case T¼106s;f
0¼1000 Hz. The best
network, indicated by the red circle, achieves a detection
probability of pbest
det ¼11.5%. The error bars are smaller than
the widths of the lines.
TABLE V. Detection probabilities in %of the best networks for
each case at a false-alarm level pfa ¼1% and 90% matched-
filtering depth.
p90%
det f0¼20 Hz 100 Hz 200 Hz 500 Hz 1000 Hz
T¼105s87.6þ0.7
0.685.4þ0.7
0.784.1þ0.7
0.780.2þ0.8
0.873.0þ0.9
0.9
T¼106s68.8þ0.9
0.950.0þ1.0
1.038.7þ0.9
1.025.4þ0.8
0.913.1þ0.6
0.7
CHRISTOPH DREISSIGACKER et al. PHYS. REV. D 100, 044009 (2019)
044009-6
range at a similar detection performance that was obtained
on the individual training frequencies. Furthermore, for the
T¼105s search, it seems quite feasible to train a single
network over the full frequency range directly, achieving
similar (albeit lower) performance to the specialized
networks trained on narrow frequency bands. On the
contrary for the T¼106s search the detection probability
of the full-rangenetwork drops up to 20 percentage
points against the specializednetworks.
C. Generalization in spindown _
f
A further interesting aspect to consider is how far in
spindown _
fthe performance network extends beyond the
range that it was trained on, i.e., _
f½1010;0Hz=sas
given in Table I. This is shown in Fig. 7. We see that the
DNN detection probability remains high even for
spindowns that are 12 orders of magnitude larger than
the training range. In particular, networks trained at higher
frequencies seem to have a wider generalization range in
spindown, which makes sense as they would have learned
to recognize signal shapes with larger Doppler broadening,
a qualitatively similar effect to having more spindown.
D. Generalization in signal strength
Another important issue is how well the DNN general-
izes for signals of different strength D, given that we only
trained each network at one specific depth D90%
training,an
estimate of the matched-filtering depth. The results of this
test are shown in the efficiency plots of Fig. 8. We see that
generally the dependence of pdetðDÞfor the DNNs seems to
be quite similar to that of matched filtering, but shifted to its
overall (lower) performance level.
(a) (b)
FIG. 5. ROC curve: Detection probability pdet versus pfa for the 105s search (left) and the 106s search (right). The solid red lines
indicate the measured ROC curves for matched filtering.
(a) (b)
FIG. 6. Detection probability pdet versus injection frequency ffor networks trained at five different frequencies and for a network trained
with signalsdrawn from the full frequency range (solid black line). The dashed vertical lines markthe respective trainingfrequencies for the
five specializednetworks. The horizontal dashed line represents the coherent matched-filtering detection performance.
DEEP-LEARNING CONTINUOUS GRAVITATIONAL WAVES PHYS. REV. D 100, 044009 (2019)
044009-7
(a) (b)
FIG. 7. Detection probability pdet versus injected spindown _
ffor networks trained at five different frequencies. The green shade in the
middle marks the 1010 Hz=s wide spindown band the networks were trained on. The xaxis is plotted as a symmetric logarithm, i.e.,
logarithmical for the larger negative values, linear for j_
fj<1010 Hz=s and logarithmical for the larger positive values. The red shades
at the edges illustrate where we start losing SNR purely by the network input window being smaller than the widest signals.
(a) (b)
(c) (d)
FIG. 8. Detection probability pdet versus injection depth Dfor networks trained on the respective matched-filtering depth D90%
(indicated by the vertical solid line with the diamond at 90%). The second vertical line with the star at 90% gives the sensitivity depth for
the DNN at 90% detection probability.
CHRISTOPH DREISSIGACKER et al. PHYS. REV. D 100, 044009 (2019)
044009-8
Conversely we also calculated the upper limitsensi-
tivity depth D90%
DNN where the network achieves 90%
detection probability (see Table VI). These values corre-
spond to a sensitivity loss of 521% (as a function of
frequency) for the T¼105s search, and 2666% for the
T¼106s search.
E. Timing
The total amount of computational resources needed, is
another interesting point of comparison to a matched-filter
search. The total search times for using the matched-filter
W
EAVE
method on the two benchmark searches can be
found in Table II.
For the DNN the total computation time consists of two
parts: training time and prediction time (i.e., calculating
one output statistic psignal for one input data vector). The
training time for the two network architectures is 1and
10 d per network for the T¼105and T¼106s cases,
respectively. Only part of this time is actually spent on
training the network; another part is spent calculating the
detection probability of the network every few epochs in
order to monitor the progress of training.
The prediction time in comparison is almost negligible.
The smaller networks for the T¼105s cases require
3ms to process one input window. The larger networks
for the T¼106s cases need 10 ms per prediction. Each
search requires a different number of sliding input windows
to cover the whole frequency range, and the total search
time can be found in Table VII.
An important detail to note in a direct comparison
between matched filtering and a pure classifier signal
versus noiseDNN search is that matched filtering yields
far more information on outlier candidates that cross the
threshold. In particular, its signal parameters will be well
constrained already, allowing a follow-up search to be
performed in a small region of parameter space. The DNN
classifier, on the other hand, would flag input windows
(of width ΔfIW) in frequency as outliers to be followed up.
Assuming we follow up two input windows per candidate,
one can estimate the total expected follow-up cost (using
matched filtering) as a fraction 2ðΔfIW=ΔfÞpfa of the total
matched-filtering cost (see Table II), where pfa ¼1% is the
false-alarm probability per Δf¼50 mHz band.
Therefore even including all the training time and
assuming a matched-filter follow-up, the DNN search would
still seem to require less computing power. At the present
stage, however, we cannot realize this potential benefit given
that our DNN search so far is far less sensitive overall.
V. DISCUSSION
In this work we have shown that deep learning (DNNs)
can in principle be used to directly search for CW signals in
data, at substantially faster search times than matched
filtering. For the hand-optimized network architecture
studied here, the DNN detection probability (at fixed false
alarm) is found to be somewhat competitive (8873% over
the full frequency range) with matched filtering (90%) for
short data spans of T1day, while the detection perfor-
mance falls short (6913%) for a longer data span of
T12 days. On the plus side, the DNN search shows a
surprising ability to extend further in frequency and spin-
down than it was trained for, and is generally much faster in
search performance than matched filtering.
In order to make this a competitive search method, we
can identify a few necessary next steps:
(1) Extend to a multidetector search.
(2) Find better networks with a comparable detection
probability to existing methods in Gaussian noise.
(3) Train for parameter estimation in addition to pure
classification in order to reduce follow-up cost of
candidates.
(4) Test how a network trained on Gaussian noise
performs on real detector data. Given the networks
ability to generalize, one might expect problems if
non-Gaussian artifacts are identified as signals. On
the other hand, training a network on real detector
noise should alleviate that problem.
Overall we think that deep learning has the potential to
become a useful CW search tool, but there is substantial
further research and development effort required in order to
achieve this.
ACKNOWLEDGMENTS
We thank Sin´ead Walsh, Maria Alessandra Papa, Marlin
Schäfer and the AEI CW group for helpful comments. All
W
EAVE
Monte Carlo simulations and all DNN training and
testing were performed on the ATLAS computing cluster of
the Albert-Einstein Institute in Hannover. C. M. is sup-
ported by the Science and Technology Research Council
(Grant No. ST/L000946/1) and the European Cooperation
in Science and Technology (COST) action CA17137.
TABLE VI. Sensitivity depths D90%
DNN at a false-alarm level of
pfa ¼1% achieved by the network for the ten test cases. The
respective matched-filter depths can be found in Table IV.
D90%
DNN [Hz1=2]f0¼20 Hz 100 Hz 200 Hz 500 Hz 1000 Hz
T¼105s10.8 10.0 9.5 8.6 7.7
T¼106s21.6 16.5 14.3 11.1 8.9
TABLE VII. DNN computing cost (in seconds) for training,
search and follow-up (using matched filtering). The respective
matched-filtering cost can be found in Table II.
Cost [s] Training Search Follow-up Total
T¼105s4.3×10558.8 2.2×1044.5×105
T¼106s4.3×106196 6.5×1076.9×107
DEEP-LEARNING CONTINUOUS GRAVITATIONAL WAVES PHYS. REV. D 100, 044009 (2019)
044009-9
[1] B. P. Abbott, R. Abbott, T. D. Abbott, M. R. Abernathy, F.
Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso,
R. X. Adhikari et al., Observation of Gravitational Waves
from a Binary Black Hole Merger, Phys. Rev. Lett. 116,
061102 (2016).
[2] B. P. Abbott, R. Abbott, T. D. Abbott, F. Acernese, K.
Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari,
V. B. Adya et al., GW170817: Observation of Gravitational
Waves from a Binary Neutron Star Inspiral, Phys. Rev. Lett.
119, 161101 (2017).
[3] A. H. Nitz, C. Capano, A. B. Nielsen, S. Reyes, R. White,
D. A. Brown, and B. Krishnan, 1-OGC: The first open
gravitational-wave catalog of binary mergers from analysis
of public Advanced LIGO data, Astrophys. J. 872, 195
(2019).
[4] The LIGO Scientific and the Virgo Collaborations,
GWTC-1: A gravitational-wave transient catalog of com-
pact binary mergers observed by LIGO and Virgo during the
first and second observing runs, arXiv:1811.12907.
[5] J. Aasi et al. (LIGO Scientific Collaboration), Advanced
LIGO, Classical Quantum Gravity 32, 074001 (2015).
[6] F. Acernese et al. (VIRGO Collaboration), Advanced Virgo:
A second-generation interferometric gravitational wave
detector, Classical Quantum Gravity 32, 024001 (2015).
[7] R. Prix, Gravitational waves from spinning neutron stars, in
Neutron Stars and Pulsars (Springer, Berlin, 2009),
pp. 651685, https://dcc.ligo.org/LIGO-P060039/public.
[8] P. D. Lasky, Gravitational waves from neutron stars: A
review, Pub. Astron. Soc. Aust. 32, e034 (2015).
[9] K. Riles, Recent searches for continuous gravitational
waves, Mod. Phys. Lett. A 32, 1730035 (2017).
[10] C. Dreissigacker, R. Prix, and K. Wette, Fast and accurate
sensitivity estimation for continuous-gravitational-wave
searches, Phys. Rev. D 98, 084058 (2018).
[11] R. Prix and B. Krishnan, Targeted search for continuous
gravitational waves: Bayesian versus maximum-likelihood
statistics, Classical Quantum Gravity 26, 204013 (2009).
[12] P. R. Brady and T. Creighton, Searching for periodic sources
with LIGO. II. Hierarchical searches, Phys. Rev. D 61,
082001 (2000).
[13] R. Prix and M. Shaltev, Search for continuous gravitational
waves: Optimal Stackslide method at fixed computing cost,
Phys. Rev. D 85, 084010 (2012).
[14] Einstein@home project page, https://einsteinathome.org/.
[15] V. Dergachev and M. A. Papa, First Falcon all-sky search for
periodic gravitational waves, arXiv:1902.05530.
[16] J. Ming, M. A. Papa, A. Singh, H.-B. Eggenstein, S. J. Zhu,
V. Dergachev, Y.-M. Hu, R. Prix, B. Machenschalk, C. Beer,
O. Behnke, and B. Allen, Results from an Einstein@Home
search for continuous gravitational waves from Cassiopeia
A, Vela Jr. and G347.3, arXiv:1903.09119.
[17] B. P. Abbott et al. (The LIGO Scientific and the Virgo
Collaborations), All-sky search for continuous gravitational
waves from isolated neutron stars using Advanced LIGO O2
data, arXiv:1903.01901.
[18] M. A. Nielsen, Neural Networks and Deep Learning (Deter-
mination Press, 2015), http://neuralnetworksanddeeplearning
.com/.
[19] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning
(MIT Press, 2016), http://www.deeplearningbook.org/.
[20] Y. LeCun, Y. Bengio, and G. Hinton, Deep learning, Nature
(London) 521, 436 (2015).
[21] D. Silver, T. Hubert, J. Schrittwieser, I. Antonoglou, M. Lai,
A. Guez, M. Lanctot, L. Sifre, D. Kumaran, T. Graepel
et al., A general reinforcement learning algorithm that
masters chess, shogi, and go through self-play, Science
362, 1140 (2018).
[22] OpenAI, Openai five, https://blog.openai.com/openai-five/
(2018).
[23] O. Vinyals et al., AlphaStar: Mastering the real-time
strategy game StarCraft II, https://deepmind.com/blog/
alphastar-mastering-real-time-strategy-game-starcraft-ii/
(2019).
[24] N. Mukund, S. Abraham, S. Kandhasamy, S. Mitra, and
N. S. Philip, Transient classification in LIGO data using
difference boosting neural network, Phys. Rev. D 95,
104059 (2017).
[25] M. Zevin, S. Coughlin, S. Bahaadini, E. Besler, N. Rohani,
S. Allen, M. Cabero, K. Crowston, A. K. Katsaggelos, S. L.
Larson, T. K. Lee, C. Lintott, T. B. Littenberg, A. Lundgren,
C. Østerlund, J. R. Smith, L. Trouille, and V. Kalogera,
Gravity Spy: Integrating Advanced LIGO detector charac-
terization, machine learning, and citizen science, Classical
Quantum Gravity 34, 064003 (2017).
[26] M. Razzano and E. Cuoco, Image-based deep learning for
classification of noise transients in gravitational wave
detectors, Classical Quantum Gravity 35, 095016 (2018).
[27] D. George, H. Shen, and E. A. Huerta, Classification and
unsupervised clustering of LIGO data with Deep Transfer
Learning, Phys. Rev. D 97, 101501 (2018).
[28] S. Vinciguerra, M. Drago, G. A. Prodi, S. Klimenko, C.
Lazzaro, V. Necula, F. Salemi, V. Tiwari, M. C. Tringali, and
G. Vedovato, Enhancing the significance of gravitational
wave bursts through signal classification, Classical Quan-
tum Gravity 34, 094003 (2017).
[29] P. Astone, P. Cerdá-Durán, I. Di Palma, M. Drago, F.
Muciaccia, C. Palomba, and F. Ricci, New method to
observe gravitational waves emitted by core collapse super-
novae, Phys. Rev. D 98, 122002 (2018).
[30] S. Klimenko, G. Vedovato, M. Drago, F. Salemi, V.
Tiwari, G. A. Prodi, C. Lazzaro, K. Ackley, S. Tiwari,
C. F. Da Silva, and G. Mitselmakher, Method for detection
and reconstruction of gravitational wave transients with
networks of advanced detectors, Phys. Rev. D 93, 042004
(2016).
[31] D. George and E. A. Huerta, Deep neural networks to enable
real-time multimessenger astrophysics, Phys. Rev. D 97,
044039 (2018).
[32] H. Gabbard, M. Williams, F. Hayes, and C. Messenger,
Matching Matched Filtering with Deep Networks in
Gravitational-Wave Astronomy, Phys. Rev. Lett. 120,
141103 (2018).
[33] D. George and E. A. Huerta, Deep learning for real-time
gravitational wave detection and parameter estimation:
Results with Advanced LIGO Data, Phys. Lett. B 778,
64 (2018).
CHRISTOPH DREISSIGACKER et al. PHYS. REV. D 100, 044009 (2019)
044009-10
[34] T. Gebhard, N. Kilbertus, G. Parascandolo, I. Harry, and B.
Schölkopf, Convwave: Searching for gravitational waves
with fully convolutional neural nets, in Workshop on Deep
Learning for Physical Sciences (DLPS) at the 31st
Conference on Neural Information Processing Systems
(NIPS), Long Beach, CA, USA (2017), https://
dl4physicalsciences.github.io/files/nips_dlps_2017_13.pdf.
[35] X. Fan, J. Li, X. Li, Y. Zhong, and J. Cao, Applying deep
neural networks to the detection and space parameter
estimation of compact binary coalescence with a network
of gravitational wave detectors, Sci. Chin. Phys. Mech.
Astron. 62, 969512 (2019).
[36] T. D. Gebhard, N. Kilbertus, I. Harry, and B. Schölkopf,
Convolutional neural networks: A magic bullet for gravi-
tational-wave detection? arXiv:1904.08693.
[37] A. C. Searle, Monte-Carlo and Bayesian techniques in
gravitational wave burst data analysis, arXiv:0804.1161.
[38] B. Behnke, M. A. Papa, and R. Prix, Postprocessing
methods used in the search for continuous gravitational-
wave signals from the Galactic Center, Phys. Rev. D 91,
064007 (2015).
[39] K. Wette, S. Walsh, R. Prix, and M. A. Papa, Implementing
a semicoherent search for continuous gravitational waves
using optimally constructed template banks, Phys. Rev. D
97, 123016 (2018).
[40] S. Walsh, K. Wette, M. A. Papa, and R. Prix, Optimising the
choice of analysis method for all-sky searches for continu-
ous gravitational waves with Einstein@Home, Phys. Rev. D
99, 082004 (2019).
[41] P. Jaranowski, A. Krolak, and B. F. Schutz, Data analysis of
gravitational-wave signals from spinning neutron stars: The
signal and its detection, Phys. Rev. D 58, 063001 (1998).
[42] R. Prix, The F-statistic and its implementation in Compu-
teFstatistic_v2, Technical Report No. LIGO-T0900149-v5,
2015, https://dcc.ligo.org/LIGO-T0900149/public.
[43] R. Prix, Template-based searches for gravitational waves:
Efficient lattice covering of flat parameter spaces, Classical
Quantum Gravity 24, S481 (2007).
[44] K. Wette, Lattice template placement for coherent all-sky
searches for gravitational-wave pulsars, Phys. Rev. D 90,
122010 (2014).
[45] J. Ming, M. A. Papa, B. Krishnan, R. Prix, C. Beer, S. J.
Zhu, H.-B. Eggenstein, O. Bock, and B. Machenschalk,
Optimally setting up directed searches for continuous
gravitational waves in Advanced LIGO O1 data, Phys.
Rev. D 97, 024051 (2018).
[46] R. Prix, Characterizing timing and memory-requirements of
the F-statistic implementations in LALSuite, Technical
Report No. T1600531-v4+, 2017, https://dcc.ligo.org/
LIGO-T1600531/public.
[47] K. Wette, R. Prix, D. Keitel, M. Pitkin, C. Dreissigacker,
J. T. Whelan, and P. Leaci, OctApps: A library of octave
functions for continuous gravitational-wave data analysis, J.
Open Source Software 3, 707 (2018).
[48] K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning
for image recognition, arXiv:1512.03385.
[49] F. Chollet et al., Keras, https://keras.io (2015).
[50] M. Abadi et al., TensorFlow: Large-scale machine learningon
heterogeneous systems, software available from https://www
.tensorflow.org/.
[51] LIGO Scientific Collaboration, LALSuite: FreeSoftware
(GPL) tools for data-analysis, https://www.lsc-group.phys
.uwm.edu/daswg/projects/lalsuite.html.
DEEP-LEARNING CONTINUOUS GRAVITATIONAL WAVES PHYS. REV. D 100, 044009 (2019)
044009-11
... A more complex architecture involving inception modules [60] and temporal convolutional networks (TCNs) [61] was designed to further improve BNS searches with ML [1]. Other ML applications to GW science include glitch classification [62], [63], the search for continuous GW signals [63,64], the detection and classification of supernova GW signals [65,66], denoising of BBH merger signals [64] and GW parameter estimation [67,68,69]. NNs allocate most of the computational cost to the training phase, prior to the actual prediction stage [70]. ...
... A more complex architecture involving inception modules [60] and temporal convolutional networks (TCNs) [61] was designed to further improve BNS searches with ML [1]. Other ML applications to GW science include glitch classification [62], [63], the search for continuous GW signals [63,64], the detection and classification of supernova GW signals [65,66], denoising of BBH merger signals [64] and GW parameter estimation [67,68,69]. NNs allocate most of the computational cost to the training phase, prior to the actual prediction stage [70]. ...
... The approach taken in [12][13][14] and in this paper is to train a DNN directly on detector strain data to identify a CW signal. In our previous work [12], we have presented new design principles for DNNs to perform CW searches. ...
... Therefore, with a segment FFT resolution of 1=6 h and bandwidth of the widest signal ∼18.7 mHz, the DNN has a total input bandwidth of 824 frequency bins or ∼38 mHz. 1 The DNN input bandwidth is chosen to be twice the bandwidth of the widest possible signal to simplify the application of our DNN to a search over a wider frequency band. Similar to [13,14], this is done by sliding the DNN input window over the band with a half overlap between consecutive windows. Such a placement of input windows ensures that any signal in the frequency band of the search lies completely inside at least one input window. ...
Article
Full-text available
The sensitivity of wide parameter-space searches for continuous gravitational waves (CWs) is limited by their high computational cost. Deep learning is being studied as an alternative method to replace various aspects of a CW search. In previous work [], new design principles were presented for deep neural network (DNN) search of CWs and such DNNs were trained to perform a targeted search with matched-filtering sensitivity. In this paper, we adapt these design principles to build a DNN architecture for wide parameter-space searches in 10 days of data from two detectors (H1 and L1). We train a DNN for each of the benchmark cases: six all-sky searches and eight directed searches at different frequencies in the search band of 20 – 1000 Hz . We compare our results to the DNN sensitivity achieved from Dreissigacker and Prix [] and find that our trained DNNs are more sensitive in all the cases. The absolute improvement in detection probability ranges from 6.5% at 20 Hz to 38% at 1000 Hz in the all-sky cases and from 1.5% at 20 Hz to 59.4% at 500 Hz in the directed cases. An all-sky DNN trained on the entire search band of 20–1000 Hz shows a high sensitivity at all frequencies, providing a proof of concept for training a single DNN to perform the entire search. We also study the generalization of the DNN performance to signals with different signal amplitude, frequency, and the dependence of the DNN sensitivity on sky position. Published by the American Physical Society 2024
... Following the success of the first challenge, which attracted strong interest from the Kaggle community, a more difficult challenge was introduced in the second G2Net competition (Tenorio et al 2022). This focused on continuous gravitational wave detection, where recent ML efforts (Dreissigacker et al 2019;Dreissigacker and Prix 2020) have shown promise but have yet to overcome the significant computational hurdles of traditional search methods -see Sect. 7.4. ...
Preprint
Full-text available
This article provides an overview of the current state of machine learning in gravitational-wave research with interferometric detectors. Such applications are often still in their early days, but have reached sufficient popularity to warrant an assessment of their impact across various domains, including detector studies, noise and signal simulations, and the detection and interpretation of astrophysical signals. In detector studies, machine learning could be useful to optimize instruments like LIGO, Virgo, KAGRA, and future detectors. Algorithms could predict and help in mitigating environmental disturbances in real time, ensuring detectors operate at peak performance. Furthermore, machine-learning tools for characterizing and cleaning data after it is taken have already become crucial tools for achieving the best sensitivity of the LIGO--Virgo--KAGRA network. In data analysis, machine learning has already been applied as an alternative to traditional methods for signal detection, source localization, noise reduction, and parameter estimation. For some signal types, it can already yield improved efficiency and robustness, though in many other areas traditional methods remain dominant. As the field evolves, the role of machine learning in advancing gravitational-wave research is expected to become increasingly prominent. This report highlights recent advancements, challenges, and perspectives for the current detector generation, with a brief outlook to the next generation of gravitational-wave detectors.
... Another example concerns the detection method of gravitational waves from core-collapse supernovae [54][55][56][57]. The analysis of continuous waves using a NN is also under investigation [58][59][60][61][62]. The Gravity Spy project [63][64][65][66] and Sakai et al [67,68] have also used an NN to classify the transient noises. ...
Article
Full-text available
The observation of gravitational waves is expected to allow new tests of general relativity to be performed. As the gravitational wave signal is hidden by detector noise in observed data, a method to reduce noise is required to analyze the ringdown phase of gravitational wave signals. Recently, some noise reduction methods based on a neural network have been proposed; however, the results of these methods must be considered with caution because the output can contain spurious components. To overcome this limitation, in this study, we developed a neural network–based method to design optimal digital filters for extracting ringdown gravitational wave signals. In this method, no spurious components appear in the output because the digital filters reduce the noise. We conducted simulations with waveforms of gravitational waves from binary black hole coalescence and confirmed that the proposed method designs appropriate filters that reduce detector noise.
Article
Full-text available
As gravitational wave astronomy has advanced, the need for effective and quick signal processing has never been more critical. New detectors such as Laser Interferometer Gravitational-Wave Observatory (LIGO) produces huge volumes of data, which poses a significant challenge to identify genuine astrophysical events amidst transient noise. With this in view, this study proposes a new, efficient deep-learning architecture designed to process gravitational wave signals. Motivated by the WaveNet model, our method uses dilated convolutions to precisely model long-term dependencies in the data ensuring that subtle characteristics are captured. In addition, higher-order recurrent layers like Long Short-Term Memory(LSTM) networks are also used to precisely model temporal characteristics so that accuracy is preserved with enhanced anomaly detection and noise removal. Our experimental validation shows that this method has achieved a Root Mean Square Error (RMSE) of 0.00228, a Residual Signal Ratio (RSR) of 0.08798, a Peak Signal-to-Noise Ratio (PSNR) of 33.34 dB, and an average volume error of 0.03965. These performance metrics show the framework’s ability to operate in real time with minimal computational overhead is a key requirement given the massive datasets involved in gravitational wave science. Also, by an explicit consideration of initial and boundary conditions, our scheme exhibits not only stability for the current missions but also scalability for future detection missions, like Laser Interferometer Space Antenna (LISA), allowing these missions to enhance their sensitivity and reliability. Our study demonstrates the development of state-of-the-art deep learning methods to surmount the specific obstacles concerning gravitational wave detection, paving the way for real-time processing of astrophysical data and an improved understanding of the Universe.
Chapter
In this chapter, we summarize a generic approach to the analysis of gravitational wave (GW) data: we consider the GW to be the anomaly in the signal received by the detector and focus on detecting such anomalies. This approach works because detectable gravitational waves are rare (at the moment), transient signals, distinct from the slow-varying, non-stationary noise typically present in the detectors. The anomalies under investigation arise mainly from gravitational waves produced by the merger of binary black hole systems. However, our analysis extends beyond GW signals to encompass glitches identified within the real LIGO/Virgo dataset, accessible through the Gravitational Waves Open Science Center. Our anomaly detection process is based on deep learning techniques, specifically convolutional autoencoders trained on both simulated and real detector data. We demonstrate the efficacy of our method by reconstructing injected GW signals and explore how the detection of anomalies is influenced by the strength of the gravitational wave, quantified via the matched filter Signal-to-Noise Ratio (SNR). Furthermore, we apply our methodology to localize anomalies within the temporal domain of the time-series data that models the gravitational wave. The validity of our approach is confirmed by applying it to real-world data containing verified gravitational wave detections: our method is able to generalize and identify GW events not included in the training dataset.
Article
Full-text available
This article provides an overview of the current state of machine learning in gravitational-wave research with interferometric detectors. Such applications are often still in their early days, but have reached sufficient popularity to warrant an assessment of their impact across various domains, including detector studies, noise and signal simulations, and the detection and interpretation of astrophysical signals. In detector studies, machine learning could be useful to optimize instruments like LIGO, Virgo, KAGRA, and future detectors. Algorithms could predict and help in mitigating environmental disturbances in real time, ensuring detectors operate at peak performance. Furthermore, machine-learning tools for characterizing and cleaning data after it is taken have already become crucial tools for achieving the best sensitivity of the LIGO–Virgo–KAGRA network. In data analysis, machine learning has already been applied as an alternative to traditional methods for signal detection, source localization, noise reduction, and parameter estimation. For some signal types, it can already yield improved efficiency and robustness, though in many other areas traditional methods remain dominant. As the field evolves, the role of machine learning in advancing gravitational-wave research is expected to become increasingly prominent. This report highlights recent advancements, challenges, and perspectives for the current detector generation, with a brief outlook to the next generation of gravitational-wave detectors.
Article
Relic gravitational waves (RGWs) from the early universe carry crucial and fundamental cosmological information. Therefore, it is of extraordinary importance to investigate potential RGW signals in the data from observatories such as the LIGO-Virgo-KAGRA network. Here, focusing on typical RGWs from the inflation and the first-order phase transition (by sound waves and bubble collisions), effective and targeted deep learning neural networks are established to search for these RGW signals within the real LIGO data (O2, O3a and O3b). Through adjustment and adaptation processes, we develop suitable Convolutional Neural Networks (CNNs) to estimate the likelihood (characterized by quantitative values and distributions) that the focused RGW signals are present in the LIGO data. We find that if the constructed CNN properly estimates the parameters of the RGWs, it can determine with high accuracy (approximately 94% to 99%) whether the samples contain such RGW signals; otherwise, the likelihood provided by the CNN cannot be considered reliable. After testing a large amount of LIGO data, the findings show no evidence of RGWs from: 1) inflation, 2) sound waves, or 3) bubble collisions, as predicted by the focused theories. The results also provide upper limits of their GW spectral energy densities of h^2Ω_{gw} ~10^{-5}, respectively for parameter boundaries within 1) [β∈(-1.85,-1.87) × α∈(0.005,0.007)], 2) [β/H_{pt}∈(0.02,0.16)× α∈(1,10)×T_{pt}∈(5∗10^9,10^10)Gev], and 3) [β/H_{pt}∈(0.08,0.2)× α∈(1,10)×T_{pt}∈(5∗10^9,8∗10^10) Gev]. In short, null results and upper limits are obtained, and the analysis suggests that our developed methods and neural networks to search for typical RGWs in the LIGO data are effective and reliable, providing a viable scheme for exploring possible RGWs from the early universe and placing constraints on relevant cosmological theories.
Article
Full-text available
In the last few years, machine learning techniques, in particular convolutional neural networks, have been investigated as a method to replace or complement traditional matched filtering techniques that are used to detect the gravitational-wave signature of merging black holes. However, to date, these methods have not yet been successfully applied to the analysis of long stretches of data recorded by the Advanced LIGO and Virgo gravitational-wave observatories. In this work, we critically examine the use of convolutional neural networks as a tool to search for merging black holes. We identify the strengths and limitations of this approach, highlight some common pitfalls in translating between machine learning and gravitational-wave astronomy, and discuss the interdisciplinary challenges. In particular, we explain in detail why convolutional neural networks alone cannot be used to claim a statistically significant gravitational-wave detection. However, we demonstrate how they can still be used to rapidly flag the times of potential signals in the data for a more detailed follow-up. Our convolutional neural network architecture as well as the proposed performance metrics are better suited for this task than a standard binary classifications scheme. A detailed evaluation of our approach on Advanced LIGO data demonstrates the potential of such systems as trigger generators. Finally, we sound a note of caution by constructing adversarial examples, which showcase interesting “failure modes” of our model, where inputs with no visible resemblance to real gravitational-wave signals are identified as such by the network with high confidence.
Article
Full-text available
We present results of an all-sky search for continuous gravitational waves (CWs), which can be produced by fast spinning neutron stars with an asymmetry around their rotation axis, using data from the second observing run of the Advanced LIGO detectors. Three different semicoherent methods are used to search in a gravitational-wave frequency band from 20 to 1922 Hz and a first frequency derivative from −1×10−8 to 2×10−9 Hz/s. None of these searches has found clear evidence for a CW signal, so upper limits on the gravitational-wave strain amplitude are calculated, which for this broad range in parameter space are the most sensitive ever achieved.
Article
Full-text available
We present the first Open Gravitational-wave Catalog, obtained by using the public data from Advanced LIGO's first observing run to search for compact-object binary mergers. Our analysis is based on new methods that improve the separation between signals and noise in matched-filter searches for gravitational waves from the merger of compact objects. The three most significant signals in our catalog correspond to the binary black hole mergers GW150914, GW151226, and LVT151012. We assume a common population of binary black holes for these three signals by defining a region of parameter space that is consistent with these events. Under this assumption, we find that LVT151012 has a 97.6% probability of being astrophysical in origin. No other significant binary black hole candidates are found, nor did we observe any significant binary neutron star or neutron star–black hole candidates. We make available our complete catalog of events, including the subthreshold population of candidates.
Article
Full-text available
While gravitational waves have been detected from mergers of binary black holes and binary neutron stars, signals from core collapse supernovae, the most energetic explosions in the modern Universe, have not been detected yet. Here we present a new method to analyse the data of the LIGO, Virgo, and KAGRA network to enhance the detection efficiency of this category of signals. The method takes advantage of a peculiarity of the gravitational wave signal emitted in the core collapse supernova and it is based on a classification procedure of the time-frequency images of the network data performed by a convolutional neural network trained to perform the task to recognize the signal. We validate the method using phenomenological waveforms injected in Gaussian noise whose spectral properties are those of the LIGO and Virgo advanced detectors and we conclude that this method can identify the signal better than the present algorithm devoted to select gravitational wave transient signal.
Article
Full-text available
This paper presents an efficient numerical sensitivity-estimation method and implementation for continuous-gravitational-wave searches, extending and generalizing an earlier analytic approach by Wette [1]. This estimation framework applies to a broad class of F-statistic-based search methods, namely (i) semi-coherent StackSlide F-statistic (single-stage and hierarchical multistage), (ii) Hough number count on F-statistics, as well as (iii) Bayesian upper limits on F-statistic search results (coherent or semi-coherent). We test this estimate against results from Monte-Carlo simulations assuming Gaussian noise. We find the agreement to be within a few % at high detection (i.e., low false-alarm) thresholds, with increasing deviations at decreasing detection (i.e., higher false-alarm) thresholds, which can be understood in terms of the approximations used in the estimate. We also provide an extensive summary of sensitivity depths achieved in past continuous-gravitational-wave searches (derived from the published upper limits). For the F-statistic-based searches where our sensitivity estimate is applicable, we find an average relative deviation to the published upper limits of less than 10%, which in most cases includes systematic uncertainty about the noise-floor estimate used in the published upper limits.
Article
We report results of the most sensitive search to date for periodic gravitational waves from Cassiopeia A, Vela Jr., and G347.3 with frequency between 20 and 1500 Hz. The search was made possible by the computing power provided by the volunteers of the Einstein@Home project and improves on previous results by a factor of 2 across the entire frequency range for all targets. We find no significant signal candidate and set the most stringent upper limits to date on the amplitude of gravitational wave signals from the target population, corresponding to sensitivity depths between 54 [1/Hz] and 83 [1/Hz], depending on the target and the frequency range. At the frequency of best strain sensitivity, near 172 Hz, we set 90% confidence upper limits on the gravitational wave intrinsic amplitude of h090%≈10−25, probing ellipticity values for Vela Jr. as low as 3×10−8, assuming a distance of 200 pc.
Article
We present results of an all-sky search for continuous gravitational waves (CWs), which can be produced by fast-spinning neutron stars with an asymmetry around their rotation axis, using data from the second observing run of the Advanced LIGO detectors. We employ three different semi-coherent methods (FrequencyHough, SkyHough, and Time-Domain F-statistic) to search in a gravitational-wave frequency band from 20 to 1922 Hz and a first frequency derivative from −1×10^(−8) to 2×10^(−9) Hz/s. None of these searches has found clear evidence for a CW signal, so we present upper limits on the gravitational-wave strain amplitude h_0 (the lowest upper limit on h_0 is 1.7×10^(−25) in the 123-124 Hz region) and discuss the astrophysical implications of this result. This is the most sensitive search ever performed over the broad range of parameters explored in this study.
Article
Rapidly rotating neutron stars are promising sources of continuous gravitational wave radiation for the LIGO and Virgo interferometers. The majority of neutron stars in our galaxy have not been identified with electromagnetic observations. All-sky searches for isolated neutron stars offer the potential to detect gravitational waves from these unidentified sources. The parameter space of these blind all-sky searches, which also cover a large range of frequencies and frequency derivatives, presents a significant computational challenge. Various methods have been designed to perform these searches within the limits of available computing resources. Recently, a search method called Weave has been proposed to achieve template placement with a minimal number of templates. We employ a mock data challenge to assess the ability of this method to recover signals in an all-sky search for unknown neutron stars, and compare its search sensitivity with that of the global correlation transform method (GCT), which has been used for all-sky searches with the Einstein@Home volunteer computing project for a number of years. We find that the Weave method is 14% more sensitive than GCT for an all-sky search on Einstein@Home, with a sensitivity depth of 57.9±0.61Hz at 90% detection efficiency, compared to 50.8−1.1+0.71Hz for GCT. This corresponds to a 50% increase in the volume of sky where we are sensitive with the Weave search. We also find that the Weave search recovers candidates closer to the true signal position. In the all-sky search studied here the improvement in candidate localization would lead to a factor of 70 reduction in the computing cost required to follow up the same number of candidates. We assess the feasibility of deploying the search on Einstein@Home, and find that Weave requires significantly more memory than is typically available on a volunteer computer. We conclude that, while GCT remains the best choice for deployment on Einstein@Home due to its lower memory requirements, Weave presents significant advantages for the subsequent hierarchical follow-up searches of interesting candidates.
Article
In this paper, we study an application of deep learning to the advanced laser interferometer gravitational wave observatory (LIGO) and advanced Virgo coincident detection of gravitational waves (GWs) from compact binary star mergers. This deep learning method is an extension of the Deep Filtering method used by George and Huerta (2017) for multi-inputs of network detectors. Simulated coincident time series data sets in advanced LIGO and advanced Virgo detectors are analyzed for estimating source luminosity distance and sky location. As a classifier, our deep neural network (DNN) can effectively recognize the presence of GW signals when the optimal signal-to-noise ratio (SNR) of network detectors ≥ 9. As a predictor, it can also effectively estimate the corresponding source space parameters, including the luminosity distance D, right ascension α, and declination δ of the compact binary star mergers. When the SNR of the network detectors is greater than 8, their relative errors are all less than 23%. Our results demonstrate that Deep Filtering can process coincident GW time series inputs and perform effective classification and multiple space parameter estimation. Furthermore, we compare the results obtained from one, two, and three network detectors; these results reveal that a larger number of network detectors results in a better source location.
Article
One program to rule them all Computers can beat humans at increasingly complex games, including chess and Go. However, these programs are typically constructed for a particular game, exploiting its properties, such as the symmetries of the board on which it is played. Silver et al. developed a program called AlphaZero, which taught itself to play Go, chess, and shogi (a Japanese version of chess) (see the Editorial, and the Perspective by Campbell). AlphaZero managed to beat state-of-the-art programs specializing in these three games. The ability of AlphaZero to adapt to various game rules is a notable step toward achieving a general game-playing system. Science , this issue p. 1140 ; see also pp. 1087 and 1118