Spatial models of sparse data to inform cetacean conservation planning: An example from Oman

Authors:
  • Independent consultant, Megaptera Marine Conservation
  • AfriSeas Solutions Pty Ltd.

ENDANGERED SPECIES RESEARCH
Endang Species Res
Vol. 15: 39–52, 2011
doi: 10.3354/esr00367 Published online October 21
Contribution to the Theme Section: ‘Beyond marine mammal habitat modeling’
OPEN ACCESS
Peter J. Corkeron1,2,3,8,*, Gianna Minton4,5, Tim Collins4,6, Ken Findlay7,
Andrew Willson4, Robert Baldwin4
1Integrated Statistics, Woods Hole, Massachusetts 02543, USA
2Bioacoustics Research Program, Cornell Lab of Ornithology, Ithaca, New York 14850, USA
3The New England Aquarium, Central Wharf, Boston, Massachusetts 02110-3399, USA
4Environment Society of Oman, Ruwi, Sultanate of Oman
5Sarawak Dolphin Project, Institute of Biodiversity and Environmental Conservation, Universiti Malaysia Sarawak,
94300 Kota Samarahan, Sarawak, Malaysia
6Ocean Giants Program, Wildlife Conservation Society, Bronx, New York 10460-1099, USA
7MaRe, Oceanography Department, University of Cape Town, Rondebosch 7701, South Africa
8Present address: NOAA Northeast Fisheries Science Center, Woods Hole, Massachusetts 02543, USA
*Email: peter.corkeron@noaa.gov
ABSTRACT: Habitat models are tools for understanding the relationship between cetaceans and their
environment, from which patterns of the animals’ space use can be inferred and management
strategies developed. Can working with space use alone be sufficient for management, when habitat
cannot be modeled? Here, we analyzed cetacean sightings data collected from small boat surveys off the
coast of Oman between 2000 and 2003. The waters off Oman are used by the Endangered Arabian
Sea population of humpback whales. Our data were collected primarily for photo-identification, using
a haphazard sampling regime, either in areas where humpback whales were thought to be relatively
abundant, or in areas that were logistically easy to survey. This leads to spatially autocorrelated data
that are not amenable to analysis using standard approaches. We used quasi-Poisson generalized
linear models and semi-parametric spatial filtering to assess the distribution of humpback and Bryde’s
whales in 3 areas off Oman relative to 3 simple physiographic variables in a survey grid. Our analysis
focused on the spatial eigenvector filtering of models, coupled with the spatial distribution of model
residuals, rather than just on model predictions. Spatial eigenvector filtering accounts for spatial
autocorrelation in models, allowing inference to be made regarding the relative importance of
particular areas. As an exemplar of this approach, we demonstrate that the Dhofar coast of southern Oman
is important habitat for the Arabian Sea population of humpback whales. We also suggest how
conservation planning for mitigating impacts on humpback whales off the Dhofar coast could start.
KEY WORDS: Spatial eigenvector models · Spatial planning · Marine Protected Area · Generalized
linear models · Oman · Whales
© Inter-Research 2011 · www.int-res.com
Resale or republication not permitted without written consent of the publisher
INTRODUCTION
Habitat models, spatial models, and conservation
planning
Habitat models have evolved from Hutchinson’s
(1957) concept of niche as environmental hyperspace
(Basille et al. 2008). In general, habitat models are
used to inform cetacean management by attempting
to understand the relationships between cetaceans
and their environment, from which inference is then
drawn on space use (e.g. Johnston et al. 2005,
Cañadas & Hammond 2008, Redfern et al. 2008,
Stafford et al. 2009). In order to develop spatially-based
approaches to cetacean conservation, is this
the only way forward?
Science is generally used to mitigate inadvertent
anthropogenic mortality of cetaceans in a series of
steps: estimating the abundance of the population
of interest; determining population structure and
boundaries (e.g. Taylor et al. 2000); estimating
anthropogenic mortality of the population; then modeling
the likely sustainability of this mortality (e.g.
Wade 1998). With these scientific inputs, managers
and stakeholders can devise measures that should
reduce mortality to a level that will allow adequate
mitigation within the social and cultural norms of the
people using the area over which the cetacean popu-
lation ranges.
The view of oceans underpinning this process, in
which the marine environment is seen as generally
undisturbed with patches of impact, has recently
been questioned (e.g. Crowder et al. 2006). This has
led, in some nations, to the
view that marine zoning, i.e. the marine equivalent
of terrestrial conservation planning (Margules &
Pressey 2000), is a more appropriate paradigm to
adopt (e.g. Fernandes et al. 2005). When providing
data on habitat use by cetaceans to inform spa-
tially-explicit conservation planning (e.g. Parra et
al. 2006a), knowing the absolute abundance of
cetaceans becomes less crucial than in the tradi-
tional approach. Note also that knowledge of the
ecological (or social) processes driving spatial dis-
tribution of whales, although very useful, is not
necessarily more important than simply having a
well-quantified understanding of what the spatial
distribution is. At this point, the interaction between
model outputs of space use, and the management
and policy milieu of the area of interest, is what
matters (e.g. Grech & Marsh 2008).
Spatial autocorrelation
Spatial autocorrelation (i.e. the closer samples are,
the more similar they are) introduces challenges
when making inference from models, as standard
errors of fixed effects from linear models are likely
underestimated (Dormann 2009). Recently, several
modeling techniques that can account for spatial
autocorrelation have been brought to the attention of
ecologists (Dormann et al. 2007). Even when working
with data from surveys specifically designed to pro-
duce distance-sampled estimates of abundance (e.g.
Gómez de Segura et al. 2007, Redfern et al. 2008),
spatial autocorrelation arising from niche-related or
social factors must be considered. However, when
working from vessels of opportunity, or from field
data where the principal aims did not require sys-
tematic or random sampling for survey coverage,
another source of autocorrelation needs considera-
tion, viz. that introduced by the haphazard sampling
regime.
Note that here we are using ‘haphazard’ techni-
cally to refer to sampling that is not explicitly ran-
domized, nor a fixed sampling regime starting from a
randomized point. It is generally assumed in these
instances that simply accounting for effort is suffi-
cient (e.g. Macleod et al. 2004). However, papers
analyzing haphazardly-collected data that then
account for effort rarely analyze model residuals to
demonstrate that spatial autocorrelation has been
handled satisfactorily by the model used (e.g.
Macleod et al. 2004).
Arabian Sea humpback whales
Here we address this problem by modeling the dis-
tribution of the Arabian Sea population of humpback
whales Megaptera novaeangliae off the coast of the
Sultanate of Oman (hereafter, Oman). The Arabian
Sea humpback whales are the only known non-
migratory population of humpback whales, and were
designated as an Endangered subpopulation in the
2008 revision of the IUCN Red List for cetacean spe-
cies (Minton et al. 2008). The current population,
estimated to number less than 100 individuals, does
not appear to be recovering from depletion due to
whaling in the 1960s (Minton et al. in press). Data
from photo-identified individuals (Minton et al. 2010)
and genetics (Rosenbaum et al. 2009) demonstrate
that this population is isolated from the nearest
neighboring Indian Ocean populations.
The distribution of the Arabian Sea population of
humpback whales is assumed to include the waters
of other nations (Fig. 1), particularly the Islamic
Republic of Pakistan, India, the Islamic Republic of
Iran, and the Republic of Yemen (Minton et al.
2010), but dedicated survey effort in these nations’
waters is either absent or very limited (e.g. Braulik
et al. 2010). Thanks to the combination of historical
records (Mikhalev 2000), continued research efforts
in recent years, a history of attendance at Interna-
tional Whaling Commission meetings, and the
establishment of a well networked non-government
organization as a platform to support conservation-
based research, Oman has become the range state
of primary importance for the research and protec-
tion of the Arabian Sea population of humpback
whales.
Fig. 1. Study area, showing bottom topography. (A) Arabian Sea and environs, showing countries named in the text. White box
delineates area shown in detail in (B). (B) Study area in the waters off the Sultanate of Oman in detail. Black boxes delineate
the 3 study areas described in the text
The available data present several challenges for
modeling. As the Arabian Sea population of hump-
backs is small, there are few sightings of individuals,
and those sightings are clustered. The data were col-
lected by small boat, primarily for photo-identifica-
tion and genetic sampling, so surveys were haphaz-
ard, with coverage affected by logistical constraints,
and with a concentration on areas where whales
were likely to be encountered. Furthermore, as surveys
were conducted by researchers on a volunteer
basis, survey timing had to fit around researchers’
normal occupations, and surveys were spread out over 4 yr
(although they were timed to coincide with the likely
presence of humpback whales as indicated by historic
whaling records; Mikhalev 2000).
For these reasons, previous publications from these
surveys have either been descriptive (e.g. Minton et
al. 2011), or provided results from photo-identifica-
tion (e.g. Minton et al. 2010, in press) or genetic stud-
ies (e.g. Rosenbaum et al. 2009). Our aim in this
paper was to develop a spatial model from the Oman
sightings data in order to identify the areas of great-
est relative abundance of humpback whales off the
Oman coast. We could not use biological oceano-
graphic predictors for our model, given the timing
over which data were collected and the relatively few
sightings in each year (Minton et al. 2011), and
because oceanographic processes off
Oman are driven largely by mon-
soonal conditions. The timing of the
monsoon, and its strength, varies
between years (Burkill 1999). That
being so, we chose simple physio-
graphic predictor variables for our
model.
This means that although we are
developing spatial models, we are
not constructing habitat models from
our data. Instead, we show how rel-
atively simple spatial models, based
on data that violate most models’
assumptions of spatial indepen-
dence, can still provide the scientific
foundation for management action.
In doing so, we aim to demonstrate
how others with similar sightings
data and issues of spatial autocorre-
lation can extract statistically robust,
meaningful results that can be used
to inform conservation measures. As
we have survey data for several
cetacean species, we used the differences
between humpback and
Bryde’s whales Balaenoptera sp. in model results to
begin to differentiate spatial autocorrelation
caused by species-specific ecological factors
from that due to haphazard sampling.
MATERIALS AND METHODS
Study area and field techniques
Table 1 shows a summary of survey dates and dis-
tances covered on effort. Fig. 2 shows effort within
the study grid (see below). Full details of the survey
design and field methods are given by Minton et al.
(2011). Small boat surveys (most frequently a 6.5 m
rigid-hulled inflatable boat) were run between Janu-
ary 2000 and October 2003 in 3 areas: off Muscat, the
Gulf of Masirah, and the Dhofar coast (Fig. 1). Sur-
veys were generally conducted on a monthly basis in
the Muscat region through most of the study period;
the Gulf of Masirah was surveyed in October and
November, and the Dhofar coast was surveyed in
February and March. As the research focus was on
humpback whales, areas of known or suspected
humpback whale distribution were targeted, based
on historical data (e.g. Wray & Martin 1983,
Mikhalev 2000) and anecdotal reports. The exception
to this was the area around Muscat, as authors
who ran the field surveys lived there. Within each of
the 3 survey areas, tracks were designed to provide
as much coverage of the area as possible within the
logistic and safety limitations of daily excursion small
boat surveys.
Table 1. Dates and locations of small boat surveys in Oman. Effort indicates
time spent actively searching for whales and excludes time spent working
with whales, in transit, or on breaks
Survey area                    Survey dates                      Effort hours
Muscat
  Monthly surveys              15 Mar 2001 − 15 Jul 2003         104.21
Dhofar
  Hallaniyat Islands           15−24 Jan 2000, 8−21 Feb 2000     63.5
  Dhofar                       9−22 Feb 2001                     34.26
  Dhofar                       10 Feb − 2 Mar 2002               62.37
  Hasik Bay                    24−26 Jun 2002                    4.32
  Sharbitat and Hallaniyats    17−20 Nov 2002                    36.83
  Dhofar                       24 Feb − 19 Mar 2003              116.31
  Dhofar (Hasik only)          15−17 May 2003                    2.17
  Total                                                          319.76
Gulf of Masirah
  N. Gulf of Masirah           15−17 Oct 2000                    11
  Gulf of Masirah              4−27 Oct 2001                     83.15
  Gulf of Masirah              24 Oct − 16 Nov 2002              58.2
  Total                                                          152.35
Other areas
  Ras al Hadd                  30 Mar − 2 Apr 2001               8.13
Shore-based observations
  Duqm                         10−13 Jun 2001                    25
Survey tracklines generally followed an irregular
saw-tooth pattern along the coast, and were traversed
at speeds of 12 to 15 knots (22 to 28 km h⁻¹).
Search effort was suspended when cetaceans were
sighted and groups were approached to confirm spe-
cies identity and collect data (e.g. photographs to
identify individual animals, biopsy samples). Search-
ing stopped in Beaufort states of 4 or higher. Sighting
positions and other positional data were recorded
using Garmin 12 or 12XL GPS units. Tracks were
logged, with the vessel’s position recorded every 30
to 45 s. Georeferenced data were imported into
ArcView® 3.2a (ESRI: www.esri.com) and checked at
the end of each day. Sightings data were stored in an
MS Access® database.
Geoprocessing
Sightings data were overlaid onto a 0.1 × 0.1°
lat./long. grid (at these latitudes, approximately 11 ×
11 km). Grid cell size was determined as a compro-
mise between accuracy in classifying habitat charac-
teristics within grid cells and the need for sufficient
encounters within each cell to yield usable results
(e.g. Hamazaki 2002). On-effort portions of survey
tracks were imported into ArcGIS (ESRI, WGS84 pro-
jection) and converted into shape files, one for each
day’s effort. The geo-processing ‘intersect’ and ‘dis-
solve’ functions of ArcGIS were then used to calcu-
late the total distance (in decimal degrees) surveyed
on-effort in each cell. The ‘spatial join’ function of
ArcGIS was used to calculate the total number of
cetacean groups, by species, in each cell, from the
MS Access® database.
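As a rough check on the cell dimensions quoted above, the following sketch converts a 0.1° cell to kilometres on a spherical Earth. The Earth radius and the two example latitudes (roughly the Dhofar coast and Muscat) are assumptions made here for illustration, not values taken from the survey data.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean radius of a spherical Earth


def cell_size_km(lat_deg, cell_deg=0.1):
    """Return (north-south, east-west) extent in km of a cell_deg cell at lat_deg."""
    km_per_deg = math.pi * EARTH_RADIUS_KM / 180.0
    north_south = cell_deg * km_per_deg
    # A degree of longitude shrinks with the cosine of latitude.
    east_west = north_south * math.cos(math.radians(lat_deg))
    return north_south, east_west


# Latitudes below are illustrative assumptions, not survey coordinates.
for name, lat in (("Dhofar coast (assumed 17.0 N)", 17.0),
                  ("Muscat (assumed 23.6 N)", 23.6)):
    ns, ew = cell_size_km(lat)
    print(f"{name}: {ns:.1f} km north-south x {ew:.1f} km east-west")
```

Under these assumptions the north-south extent is about 11.1 km throughout, while the east-west extent is somewhat shorter (roughly 10.6 km at 17° N and 10.2 km at 23.6° N), which is consistent with the approximate 11 × 11 km figure given in the text.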
Digitized depth files were generated for each sur-
vey area using rasterized nautical charts (British
Admiralty Raster Chart Series, British Admiralty
chart nos. 2851, 2828, 2896, 3519, 3522, 3784, and
3785). Depth files were interpolated using ArcGIS
Spatial Analyst to generate depth rasters for the grid,
with a mask applied to exclude terrestrial surfaces
from grid cells overlapping the coast. Minimum and
maximum values for slope and depth were calculated
from the rasters for each grid cell. All geoprocessing
was conducted by G. Minton.
The ArcGIS shape file of the Oman coast
(WGS84 projection) was imported into R (R Development
Core Team 2010) using the maptools package
v0.7-34 (Lewin-Koh et al. 2010), and converted
into a SpatialLines object. The center point of
each grid square was also read into R as a
SpatialPoints object, using the same projection. Both
objects were then projected to UTM (zone 40Q).
In order to calculate the distance to shore for the
center of each grid square, the SpatialPoints and
SpatialLines objects were transformed into spatial
point patterns and line segment patterns, respectively,
using the spatstat package v1.21-2 (Baddeley
& Turner 2005). The ‘nncross’ command was
used to calculate distances.
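The distance-to-shore step amounts to finding, for each cell center, the nearest point on any coastline segment in projected (metre) coordinates. The sketch below illustrates that geometry in plain Python. It is not the spatstat implementation, and the function names and the toy coastline are hypothetical.

```python
import math


def point_segment_distance(px, py, ax, ay, bx, by):
    """Shortest distance from point P to the segment A-B (planar coordinates)."""
    dx, dy = bx - ax, by - ay
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0.0:
        # Degenerate segment: A and B coincide.
        return math.hypot(px - ax, py - ay)
    # Project P onto the line through A and B, then clamp to the segment.
    t = ((px - ax) * dx + (py - ay) * dy) / seg_len_sq
    t = max(0.0, min(1.0, t))
    return math.hypot(px - (ax + t * dx), py - (ay + t * dy))


def distance_to_shore(point, coastline):
    """Minimum distance from point to a polyline given as a list of (x, y) vertices."""
    return min(point_segment_distance(*point, *a, *b)
               for a, b in zip(coastline, coastline[1:]))


# Hypothetical L-shaped coastline in metres (e.g. UTM-like coordinates).
toy_coast = [(0.0, 0.0), (1000.0, 0.0), (1000.0, 1000.0)]
print(distance_to_shore((500.0, 300.0), toy_coast))    # nearest the first segment
print(distance_to_shore((1500.0, 1500.0), toy_coast))  # nearest the end vertex
```

Working in projected coordinates, as the text describes, is what makes the planar distance formula appropriate here.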
Fig. 2. On-effort survey (in decimal degrees) for cetaceans
surveyed off Oman coastal waters, 2000 to 2003. Grid is that
used for all analyses
Model construction
Our data were counts, and although the humpback
data were approximately Poisson distributed, the
Bryde’s whale data were not. As we needed a modeling
approach that would be consistent across both
species, we used quasi-Poisson generalized linear
models (GLMs) with log-link (Venables & Ripley
2002). To account for survey effort differing across
grid cells, the natural log of on-effort distance for
each cell was included as an offset. Mapping the
residuals of GLMs (see below) showed spatial patterning,
so further analysis was undertaken. We used
spatial eigenvector mapping (SEVM; Dormann et al.
2007) to account for residual spatial autocorrelation
(SAC), as SEVM builds on GLM results. Also, as Dormann
et al. (2007, p. 612) noted, it is a method that
‘could thus be very useful for data with SAC stemming
from larger scale observation bias,’ and we
know that biases due to haphazard design confound
our data.
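To make the roles of the offset and of the quasi-Poisson dispersion concrete, here is a minimal sketch in plain Python. It does not fit a model. It only shows how a log-link mean scales with effort and how a Pearson-based dispersion estimate is formed. All numbers, coefficient values, and function names are hypothetical.

```python
import math


def expected_count(effort, intercept, coefs, covariates):
    """Log-link mean with log(effort) as an offset: mu = effort * exp(eta)."""
    eta = intercept + sum(b * x for b, x in zip(coefs, covariates))
    return math.exp(math.log(effort) + eta)


def pearson_dispersion(observed, fitted, n_params):
    """Dispersion estimate: Pearson chi-square divided by residual df."""
    chi_sq = sum((y - mu) ** 2 / mu for y, mu in zip(observed, fitted))
    return chi_sq / (len(observed) - n_params)


# Hypothetical cell: doubling effort doubles the expected count,
# because the offset enters the linear predictor with coefficient 1.
low = expected_count(1.0, -1.0, [0.5], [1.0])
high = expected_count(2.0, -1.0, [0.5], [1.0])
print(high / low)

# Hypothetical counts and fitted means for four cells, one fitted parameter.
print(pearson_dispersion([0, 1, 2, 1], [1.0, 1.0, 1.0, 1.0], n_params=1))
```

A dispersion estimate near 1 is what a Poisson model assumes. Values below 1 indicate counts that are more even than Poisson, and values above 1 indicate overdispersion. The quasi-Poisson approach rescales standard errors accordingly.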
SEVM works by ‘whitening out’ residual spatial
autocorrelation in a model, rather
than incorporating it into the model.
We used the ‘ME’ command from the
spdep package v0.5-16 (Bivand et al.
2010), which takes a brute force
approach to finding the smallest pos-
sible subset of eigenvectors that
removes residual spatial autocorrela-
tion from a GLM. The residual auto-
correlation is then accounted for by
refitting the original model with the
eigenvectors included as covariates
(for further details, see Dormann et al.
2007, Bivand et al. 2008). Hereafter,
we refer to these as SEVM-GLMs. A
flow diagram outlining the process of
model construction and listing com-
mands used is provided in Fig. 3.
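Residual spatial autocorrelation of the kind being filtered here is commonly summarized with Moran's I. The sketch below computes Moran's I with binary neighbor weights in plain Python. It is an illustration of the statistic only, not the spdep eigenvector selection procedure, and the four-cell neighborhood and the values are hypothetical.

```python
def morans_i(values, neighbours):
    """Moran's I with binary weights.

    neighbours maps each cell index to the list of its neighbour indices.
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    # Sum of cross-products over all neighbour pairs (each direction counted).
    cross = sum(dev[i] * dev[j] for i, js in neighbours.items() for j in js)
    weight_total = sum(len(js) for js in neighbours.values())
    return (n / weight_total) * cross / sum(d * d for d in dev)


# Hypothetical row of 4 grid cells, each linked to its adjacent cells.
chain = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}

smooth = morans_i([1.0, 2.0, 3.0, 4.0], chain)        # similar neighbours
alternating = morans_i([1.0, -1.0, 1.0, -1.0], chain)  # dissimilar neighbours
print(smooth, alternating)
```

Positive values indicate that neighboring cells have similar residuals, and negative values indicate that neighbors tend to differ. A filtered model should leave residuals with a value that is not distinguishable from that expected under no autocorrelation.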
Analyses were run using R 2.11.1
(R Development Core Team 2010)
through rgedit 0.7.0.1 on an x86 com-
puter running Ubuntu 9.04.
RESULTS
Of the slope and depth values, max-
imum depth and minimum slope
were the least correlated and so were
selected for inclusion in the model.
Distance from shore was not strongly
correlated with any other physio-
graphic variable. Of the 3 separate
study areas, the Muscat and Dhofar
coasts are relatively similar in topography,
with deep water within 1 grid
cell of shore. The Gulf of Masirah, on
the other hand, includes one of the
largest areas of shallow waters any-
where off the Arabian Sea coast of
Oman (approximately 80 km at the
widest, see Fig. 1B), with a gently
sloping shelf extending to the outer edge of the sur-
vey area. Coefficients for the GLMs for both species
are shown in Table 2.
To calculate spatial eigenvectors, first we con-
structed a neighborhood, then calculated eigenvec-
tors (see Fig. 3 for details). The distribution of Bryde’s
whales showed little spatial autocorrelation, so we
used an alpha value of 0.25 as a stopping rule for
eigenvector calculation. All neighborhoods are, by
definition, confined within one of the 3 separate study
areas, i.e. the Dhofar coast, the Gulf of Masirah, or off
Muscat. Coefficients for the SEVM-GLMs for both
species are shown in Table 3. Tests comparing the fits
of GLMs and SEVM-GLMs are given in Table 4.
Table 2. Coefficients from the quasi-Poisson generalized linear models of
humpback whales Megaptera novaeangliae and Bryde’s whales Balaenoptera
sp. surveyed off Oman coastal waters, 2000 to 2003, with results
of significance testing. *p < 0.05, ***p < 0.001
               Estimate         SE              t         Pr(>|t|)
Megaptera
  Intercept    −2.42 × 10⁻¹     2.04 × 10⁻¹     −1.182    0.238
  DepthMax     5.83 × 10⁻⁴      2.62 × 10⁻⁴     2.228     0.0268*
  DistShore    −1.46 × 10⁻⁵     1.69 × 10⁻⁵     −0.862    0.390
  SlopeMin     −3.74 × 10⁻⁶     1.95 × 10⁻⁶     −1.915    0.0566
Balaenoptera
  Intercept    −2.59            4.13 × 10⁻¹     −6.256    < 0.001***
  DepthMax     −2.55 × 10⁻⁴     6.21 × 10⁻⁴     −0.410    0.682
  DistShore    4.04 × 10⁻⁵      2.19 × 10⁻⁵     1.842     0.0667
  SlopeMin     1.93 × 10⁻⁶      1.36 × 10⁻⁶     1.418     0.157
Fig. 3. Process by which the choice to use a generalized linear model (GLM)
or a spatial eigenvector mapping GLM (SEVM-GLM) is made. Text in italics
indicates commands used from the R packages indicated in bold
We plotted the results of GLMs and the SEVM-
GLMs side by side in order to display their differ-
ences. Plots of the predicted values generated by
GLMs and SEVM-GLMs are shown in Fig. 4, with
residuals in Fig. 5. Note that predicted values are for
counts of groups of cetaceans (the unit of sighting)
per grid square over the entire period of the study, so
the maps show spatial patterns of relative abun-
dance. For both species, the standard errors esti-
mated for the SEVM-GLMs were greater than those
for the GLMs (as expected), and so these results
are not mapped. Fig. 6 shows maps of eigenvector
values.
For humpback whales, the most important habitat
variable identified in the GLM was depth. The 3
eigenvectors extracted fell clearly into the 3 survey
areas: off Muscat, the Gulf of Masirah, and the Dho-
far coast, respectively (Fig. 6A). The SEVM-GLMs
fitted the data much better than the GLM, with slope
and depth appearing important. As expected, the
SEVM-GLM residuals are substantially smaller and
less spatially clustered than the GLM residuals
(Fig. 5A).
Bryde’s whales produced a very different result
from humpbacks. The distribution of sightings across
the study area was more even than Poisson (disper-
sion parameter for the GLM was ~0.6). There was
relatively little spatial autocorrelation in the initial
GLM, and the SEVM extracted only 1 eigenvector
(Fig. 6B), in which autocorrelation in the Muscat
study area predominated. Habitat variables identified
as being important in the SEVM-GLMs are
distance from shore and slope. Both models’ predic-
tions give relatively similar patterns, with the great-
est relative abundance of Bryde’s whales off Muscat
and, to a lesser extent, off the Dhofar coast (Fig. 4B).
Mapping residuals (Fig. 5B) suggested that the
SEVM-GLM predicts the relative abundance of
Bryde’s whales somewhat better than does the
GLM, although there is a small area
in the Gulf of Masirah where neither
model predicted their relative abun-
dance well.
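The model comparison in Table 4 can also be restated as the share of GLM residual deviance removed once the eigenvectors are added. The sketch below does this arithmetic using the residual deviances reported in Table 4. The function and variable names are hypothetical, and the proportion is offered only as a descriptive summary, not as a replacement for the tests in the table.

```python
# Residual deviances (GLM, SEVM-GLM) as reported in Table 4.
table4 = {
    "Megaptera": (163.450, 100.810),
    "Balaenoptera": (48.284, 40.535),
}


def deviance_reduction(glm_dev, sevm_dev):
    """Proportion of GLM residual deviance removed by the SEVM-GLM."""
    return (glm_dev - sevm_dev) / glm_dev


for species, (glm_dev, sevm_dev) in table4.items():
    share = deviance_reduction(glm_dev, sevm_dev)
    print(f"{species}: {100 * share:.1f}% of residual deviance removed")
```

On these figures the reduction is roughly 38% for humpback whales and 16% for Bryde’s whales, in line with the text's observation that spatial filtering mattered far more for humpbacks.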
DISCUSSION
Context
The mark−recapture estimate of
abundance for the Arabian Sea pop-
ulation of humpback whales (82;
95% CI: 60−111; Minton et al. in
press) is from data that are now
almost a decade old. Relying on a
time series of mark−recapture esti-
mates of a cetacean population of
around 100 animals to determine
trends in abundance is futile (e.g.
Thompson et al. 2000, Parra et al.
2006b). Analysis of scarring on the
caudal peduncle region of photo-
graphically identified humpback
whales in Oman in 2003 indicated
that 30 to 40% of all whales exam-
ined were likely to have been
involved in entanglements with fish-
ing gear (Minton et al. in press).
Despite this apparently high level of
interaction with fisheries, there are
no estimates of fisheries-related mortality
for this population. It is thus unrealistic to
expect that the now-traditional approach outlined
in the Introduction (estimate abundance, estimate
anthropogenic mortality and model sustainability of
anthropogenic mortality), can be implemented in a
timely manner for this population.
Table 3. Coefficients from spatial eigenvector mapping of quasi-Poisson
generalized linear models of cetaceans surveyed off Oman coastal waters,
2000 to 2003. For species information see Table 2. *p < 0.05, **p < 0.01,
***p < 0.001
                              Estimate         SE              t         Pr(>|t|)
Megaptera
  Intercept                   −7.10 × 10⁻¹     2.49 × 10⁻¹     −2.853    0.005**
  DepthMax                    4.11 × 10⁻⁴      1.91 × 10⁻⁴     2.150     0.033*
  DistShore                   −9.02 × 10⁻⁶     1.48 × 10⁻⁵     −0.611    0.542
  SlopeMin                    −3.29 × 10⁻⁶     1.24 × 10⁻⁶     −2.644    0.009**
  fitted(meg.ME.quasi)vec1    −16.6            6.06            −2.747    0.007**
  fitted(meg.ME.quasi)vec5    8.84             1.58            5.605     <0.001***
  fitted(meg.ME.quasi)vec4    8.67             2.06            4.205     <0.001***
Balaenoptera
  Intercept                   −3.17            5.50 × 10⁻¹     −5.767    <0.001***
  DepthMax                    −9.69 × 10⁻⁴     8.83 × 10⁻⁴     −1.097    0.274
  DistShore                   6.03 × 10⁻⁵      2.42 × 10⁻⁵     2.489     0.015*
  SlopeMin                    3.62 × 10⁻⁶      1.71 × 10⁻⁶     2.117     0.035*
  fitted(bal.ME.quasi)        −8.44            2.55            −3.310    0.001**
Table 4. Tests comparing fits of generalized linear models (GLMs) and spatial
eigenvector mapping of quasi-Poisson GLMs of cetaceans surveyed off Oman
coastal waters, 2000 to 2003. For species information see Table 2. Dev.:
deviance; ***p < 0.001
               Residual df    Residual dev.    Δdf    ΔDeviance    p (>|χ|)
Megaptera
  GLM          242            163.450
  SEVM-GLM     239            100.810          3      62.637       <0.001***
Balaenoptera
  GLM          242            48.284
  SEVM-GLM     241            40.535           1      7.749        <0.001***
Fig. 4. Megaptera novaeangliae and Balaenoptera sp. Predicted numbers of whale groups for each grid square from the
quasi-Poisson generalized linear models (GLM Predicted), and spatial eigenvector mapping of quasi-Poisson generalized linear
models (SEVM Predicted) of cetaceans surveyed off Oman coastal waters, 2000 to 2003. (A) Humpback whales, (B) Bryde’s whales
How then can scientific input inform plans to
manage anthropogenic activities impacting these
whales? Further, are there any results from our study
that can inform management more generally? In our
experience, these related problems, i.e. small popu-
lation, no reliable quantification of anthropogenic
mortality or population trends, and limited resources,
are not uncommon in most of the developing world.
Fig. 5. Megaptera novaeangliae and Balaenoptera sp. Model residuals for each grid square from the quasi-Poisson generalized
linear models (GLM Predicted) and spatial eigenvector mapping of quasi-Poisson GLMs (SEVM Predicted) of cetaceans
surveyed off Oman coastal waters, 2000 to 2003. (A) Humpback whales, (B) Bryde’s whales
Further, throughout the world, there is likely to be
a large body of data collected using haphazard sam-
pling methods, e.g. cetacean sighting data with asso-
ciated effort (note that effort data are essential), but
collected on platforms of opportunity, or with addi-
tional survey aims that grossly violate the assump-
tions of line- or strip-transect sampling. Haphazard
sampling for photo-identification and genetic sam-
pling is common, as it is for coastal patrols in marine
protected areas. How can cetacean biologists make
best use of such data to inform conservation plan-
ning? Here we show why SEVMs may be the most
appropriate modern tool for analyzing this type of
spatially autocorrelated data.
Fig. 6. Megaptera novaeangliae and Balaenoptera sp. Eigenvector values for
each grid square from spatial eigenvector mapping of quasi-Poisson
GLMs (SEVM eigenvectors) of cetaceans surveyed off Oman coastal
waters, 2000 to 2003. (A) Humpback whales, (B) Bryde’s whales
Other modeling approaches
Before discussing the results from our models, we
outline why we did not use other approaches to habi-
tat modeling.
Mixed models
We initially attempted to run generalized linear
mixed models (GLMMs) with spatial structure to the
random effect, as outlined by Dormann et al. (2007).
First, we note that a caveat with this technique is that
those authors refer to using a ‘random’ effect with
only 1 category as a ‘cheat’ (Dormann et al. 2007,
their supplementary material). We ran quasi-Poisson
(and Poisson) GLMMs, but they produced non-posi-
tive definite approximate variance−covariance matri-
ces, making it impossible to check the confidence
intervals of the ‘random’ effect. We therefore did not
pursue this line of analysis.
Additive models
Despite the popularity of generalized additive
models (GAMs) in modeling cetacean habitat use
(e.g. Gómez de Segura et al. 2007, Cañadas & Ham-
mond 2008, Redfern et al. 2008), we did not use them,
for 3 reasons. First, with relatively few samples for
whale species (56 sightings over 4 yr for humpback
whales, 15 for Bryde’s whales, Table 1), techniques
such as GAMs that can handle spatial autocorrela-
tion by seeking nonlinear patterns in the raw data
themselves become inherently less useful. Second,
we wanted to compare model outputs between spe-
cies in order to differentiate between autocorrelation
due to haphazard survey design, and autocorrelation
due to cetacean ecology. As GAMs, by definition, fit
nonlinear curves to the data, we judged that they
were less likely to allow us to make these distinctions.
Finally, the SEVM-GLM approach allows a
clear comparison to be made between a model that is
unlikely to account successfully for spatial autocorrelation
(the GLM) and one that does (the SEVM-GLM).
Differences between these model predictions
allow us to make some inference on the manner in
which spatial autocorrelation influences these predictions,
and so provide another form of insight into
the driver(s) of the autocorrelation. A GAM-based
approach would not allow this.
General niche-environment system factor analysis
Ecological niche factor analysis (ENFA), a form of
niche-environment factor analysis (Calenge & Basille
2008), has recently become a popular tool for devel-
oping habitat models for cetaceans (e.g. Oviedo &
Solís 2008, Praca et al. 2009). We did not use ENFA
on our data for 2 principal reasons. First, ENFA is, by
definition, based on Hutchinsonian niche hyper-
space. We did not attempt to model whale niches, as
we knew that we did not have environmental data
available that would be appropriate for niche model-
ing (see Introduction). Secondly, the mathematical
formulation for ENFA (Hirzel et al. 2002) does not
explicitly account for spatial autocorrelation, espe-
cially that due to haphazard sampling in a ‘design I’
habitat use study (as defined by Thomas & Taylor
1990), such as this one. As the main point of our mod-
eling exercise was to account for this form of spatial
autocorrelation, ENFA was inappropriate.
Details from Oman
Our model results confirm and provide a statisti-
cally robust underpinning for previous work based
on the same survey data (Minton et al. 2011). Our
results clearly demonstrate the importance of the
Dhofar coast, particularly in the region of the Hal-
laniyat Islands and Hasik, for the Arabian Sea popu-
lation of humpback whales over our study period.
Examination of the differences in outputs between
models (i.e. with and without autocorrelation
accounted for), and between species, allows us to
identify the most reliable model outputs and thus the
information that is of greatest value for management.
For humpback whales, there are substantial differ-
ences between the model predictions of the GLM
and the SEVM-GLM (Fig. 4A). The GLM predicts the
greatest relative abundance of humpbacks to be off
Muscat, while the SEVM-GLM predicts most hump-
backs off the Dhofar coast. Examination of model
residuals (Fig. 5A) demonstrates that the SEVM-
GLM prediction is more robust. The GLM prediction
appears driven by the substantial search effort off
Muscat, and the similarity in habitat characteristics
between Muscat and the Dhofar coast (where most
humpback sightings were made). Note that model
bias is introduced into the GLM by focusing survey
effort in an area known to be important for hump-
backs that had similar physiographic characteristics
(i.e. the Dhofar coast). The SEVM-GLM successfully
handles this bias. Both models poorly predicted
humpback occurrence in the shallow coastal shelf
waters of the Gulf of Masirah (Fig. 4A), although the
SEVM-GLM fit is somewhat better.
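One common way to check model residuals for remaining spatial autocorrelation is Moran's I. The sketch below is illustrative only: the text does not state which statistic was used to examine the residuals, and the function names, the one-dimensional toy grid and the residual values are all hypothetical. Values of I well above zero indicate that neighbouring cells have similar residuals (autocorrelation left unaccounted for), while values near or below zero do not.

```python
def morans_i(values, weights):
    """Moran's I for a list of values and a dense n x n spatial weights matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in weights)  # sum of all weights
    num = sum(weights[i][j] * dev[i] * dev[j]
              for i in range(n) for j in range(n))
    den = sum(d * d for d in dev)
    return (n / s0) * (num / den)


def line_neighbour_weights(n):
    """Binary weights for n cells in a row: adjacent cells are neighbours (toy layout)."""
    return [[1.0 if abs(i - j) == 1 else 0.0 for j in range(n)]
            for i in range(n)]


# Hypothetical residuals for six grid cells in a row (not data from the surveys).
w = line_neighbour_weights(6)
clustered = [2.0, 1.5, 1.0, -1.0, -1.5, -2.0]    # neighbours resemble each other
alternating = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0]  # neighbours differ in sign

i_clustered = morans_i(clustered, w)      # positive: residuals are spatially clustered
i_alternating = morans_i(alternating, w)  # negative: neighbouring residuals are dissimilar
```

In practice the weights matrix would be built from the neighbourhood structure of the survey grid, and the statistic would be computed on the residuals of each fitted model in turn.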
The way in which model outputs for Bryde’s whales
differ from humpback whales is of interest. As the
effort data are from the same survey series, one
would expect the confounding effects of haphazard
sampling to be consistent, and as such, model differ-
ences would reflect biological/ecological traits of the
species rather than sampling artefacts. The maxi-
mum cell count for Bryde’s whales is approximately
an order of magnitude less than that for humpbacks.
The pattern of spatial autocorrelation in the data is
also different. The distribution of Bryde’s whales
across the study area is more regular, and they are
more prevalent off Muscat than are humpbacks. This
suggests that the clumped distribution of humpbacks
is real, as is the importance of the Dhofar coast for
humpbacks. This has important local, and regional,
implications for management.
The Arabian Sea population of humpback whales
is the smallest population of humpback whales
known to exist, the only population known not to
undertake an extensive seasonal migration, and one
of the most endangered baleen whale populations
(Minton et al. 2008). The Dhofar coast, in particular in
the region of the Hallaniyat Islands and Hasik, was
identified previously (Minton et al. in press) as likely
to be an important habitat for this population. Our
modeling work quantifies the significance of this
area for these whales.
Caveats
We considered it inappropriate to attempt to make
inference on parts of the Oman coast not covered by
the surveys. Although it is theoretically possible for
us to project model predictions into other areas, we
consider this inadvisable, as our basic design was not
to make inference about the distribution of hump-
back whales along the entire Oman coast. Given the
constraints under which field work was undertaken,
both logistic and financial, a synoptic survey of the
entire coast was impossible. We focused our survey
effort on areas which available information sug-
gested were likely the most important areas for
humpback whales.
An extension of these caveats is that although we
make recommendations on the relative importance
for conservation of the Dhofar coast (see below), we
cannot state with certainty that other areas will not
prove equally important. Nevertheless, the informa-
tion available is sufficient to note the importance of
starting the process of mitigating inadvertent anthro-
pogenic mortality on Arabian Sea humpback whales,
and that the science available suggests that the best
place to start is off the Dhofar coast. Recent (March
2011) fieldwork off the Dhofar coast, focusing near
the village of Hasik (Fig. 1), was planned based in
part on the results of our model. This field season
resulted in regular, multiple sightings of humpback
whales, and observations of feeding and breeding
behavior, confirming the area’s continued relative
importance (A. Willson pers. obs.). Unfortunately, the
threat of pirate activity offshore prevented field work
around the Hallaniyat Islands.
Implications for humpback whale conservation
The coastal zone of Oman is experiencing rapid
transformation as the country moves beyond a wholly
petroleum-dependent economy. Oman’s population
growth rate is among the highest in the world (3.14%
per annum), and there is a continuing demographic
shift towards coastal areas (Oman Ministry of
National Economy 2009). Fishing effort off the coast
of Oman and in other parts of the Arabian Sea is
increasing dramatically (Oman Ministry of Agricul-
ture and Fisheries 2002, FAO 2007, Oman Ministry of
National Economy 2009), and drifting and set gillnets
as well as traps are already widely used (Stengel & Al
Harthy 2002).
One of the most important findings of this study is
that the clustering of humpback whales along part of
the Dhofar coast and the Hallaniyat Islands is not a
sampling artefact, but a result of the whales’ ranging
behavior. This suggests that a spatially-explicit man-
agement program should be implemented along this
section of the Dhofar coast, as a preliminary step to
larger-scale marine conservation planning in Oman.
There are instances where declaring a marine pro-
tected area for cetacean conservation has not led to
cessation of threatening processes, particularly gill-
netting (e.g. Notarbartolo di Sciara et al. 2008). There
are also examples where it has been successful: the
implementation of netting restrictions to protect
dugongs, and the general process of rezoning in the
Great Barrier Reef Marine Park (Dobbs et al. 2008,
Grech et al. 2008) provide an example of how to
achieve spatially-explicit restrictions on netting. We
suggest that this process, suitably modified for
Omani cultural norms and local capacity for manage-
ment, start as soon as possible.
Finally, we suggest that other researchers working
with spatially autocorrelated data on cetacean (and
other marine wildlife) distribution give serious con-
sideration to using spatial eigenvector models.
There will be instances in which sampling regimes are
just too haphazard, with no options to distinguish
between the likely causes of spatial autocorrelation,
and in which these models may prove ineffective.
But in those instances, it is possible that any
other spatial modeling approach will be equally
ineffective. Researchers need to ensure that they do
not make inference beyond their data with this tool,
as is the case for all spatial or habitat models. Tools
to run spatial eigenvector models on all operating
systems are available for free download as part of
the R statistical language (R Development Core
Team 2010), and example code to run eigenvector
filtering is provided as an appendix to the paper by
Dormann et al. (2007).
Acknowledgements. Logistic support and research permits
for surveys were provided by Oman's Ministry of Environ-
ment and Climate Affairs, the Oman Natural History
Museum, and Oman's Ministry of Agriculture and Fisheries.
Fieldwork was supported by: The Ford Environmental
Grants, The UK Foreign and Commonwealth Office, Shell
Marketing Oman, Petroleum Development Oman, Veritas
Geophysical, The Peter Scott Trust for Education and
Research in Conservation, and the Marina Bandar al Row-
dah. The manuscript was improved by comments from
Daniel Palacios and two anonymous reviewers.
LITERATURE CITED
Baddeley A, Turner R (2005) Spatstat: an R package for ana-
lyzing spatial point patterns. J Stat Softw 12: 1−42
Basille M, Calenge C, Marboutin É, Andersen R, Gaillard
JM (2008) Assessing habitat selection using multivariate
statistics: some refinements of the ecological-niche factor
analysis. Ecol Model 211: 233−240
Bivand R, Altman M, Anselin L, Assunção R and others
(2010) spdep: Spatial dependence: weighting schemes,
statistics and models. R package version 0.5-16. Available
at http://CRAN.R-project.org/package=spdep
Bivand RS, Pebesma EJ, Gómez-Rubio V (2008) Applied
spatial data analysis with R. Springer, New York, NY
Braulik GT, Ranjbar S, Owfi F, Aminrad T, Dakhteh SMH,
Kamrani E, Mohsenizadeh F (2010) Marine mammal
records from Iran. J Cetacean Res Manag 11: 49−64
Burkill PH (1999) ARABESQUE: an overview. Deep-Sea Res
II 46: 529−547
Calenge C, Basille M (2008) A general framework for the
statistical exploration of the ecological niche. J Theor
Biol 252: 674−685
Cañadas A, Hammond PS (2008) Abundance and habitat
preferences of the short-beaked common dolphin Del-
phinus delphis in the southwestern Mediterranean:
implications for conservation. Endang Species Res 4:
309−331
Crowder LB, Osherenko G, Young OR, Airame S and others
(2006) Resolving mismatches in U.S. ocean governance.
Science 313: 617−618
Dobbs K, Fernandes L, Slegers S, Jago B and others (2008)
Incorporating dugong habitats into the marine protected
area design for the Great Barrier Reef Marine Park,
Queensland, Australia. Ocean Coast Manag 51: 368−375
Dormann CF (2009) Response to Comment on ‘Methods to
account for spatial autocorrelation in the analysis of spe-
cies distributional data: a review’. Ecography 32: 379−381
Dormann CF, McPherson JM, Araújo MB, Bivand R and
others (2007) Methods to account for spatial autocorrela-
tion in the analysis of species distributional data: a
review. Ecography 30: 609−628
FAO (Food and Agriculture Organisation of the United
Nations) (2007) The state of world fisheries and aquacul-
ture 2006. FAO, Rome
Fernandes L, Day J, Lewis A, Slegers S and others (2005)
Establishing representative no-take areas in the Great
Barrier Reef: large scale implementation of theory on
marine protected areas. Conserv Biol 19: 1733−1744
Gómez de Segura A, Hammond PS, Cañadas A, Raga JA
(2007) Comparing cetacean abundance estimates derived
from spatial models and design-based line transect
methods. Mar Ecol Prog Ser 329: 289−299
Grech A, Marsh H (2008) Rapid assessment of risks to a
mobile marine mammal in an ecosystem-scale marine
protected area. Conserv Biol 22: 711−720
Grech A, Marsh H, Coles R (2008) A spatial assessment of
the risk to a mobile marine mammal from bycatch.
Aquatic Conserv 18: 1127−1139
Hamazaki T (2002) Spatiotemporal prediction models of
cetacean habitats in the mid-western North Atlantic
Ocean (from Cape Hatteras, North Carolina, U.S.A. to
Nova Scotia, Canada). Mar Mamm Sci 18: 920−939
Hirzel AH, Hausser J, Chessel D, Perrin N (2002) Ecological-
niche factor analysis: how to compute habitat-suitability
maps without absence data? Ecology 83: 2027−2036
Hutchinson GE (1957) Concluding remarks. Cold Spring
Harbor Symp Quant Biol 22: 415−427
Johnston DW, Westgate AJ, Read AJ (2005) Effects of fine-
scale oceanographic features on the distribution and
movements of harbour porpoises Phocoena phocoena in
the Bay of Fundy. Mar Ecol Prog Ser 295: 279−293
Lewin-Koh NJ, Bivand R, Pebesma EJ, Archer A and others
(2010) maptools: Tools for reading and handling spatial
objects. R package version 0.7-34. Available at
http://CRAN.R-project.org/package=maptools
Macleod K, Fairbairns R, Gill A, Fairbairns B, Gordon J,
Blair-Myers C, Parsons ECM (2004) Seasonal distribution
of minke whales Balaenoptera acutorostrata in relation
to physiography and prey off the Isle of Mull, Scotland.
Mar Ecol Prog Ser 277: 263−274
Margules CR, Pressey RL (2000) Systematic conservation
planning. Nature 405: 243−253
Mikhalev YA (2000) Whaling in the Arabian Sea by the
whaling fleets Slava and Sovetskaya Ukraina. In:
Yablokov AV, Zemsky VA, Tormosov DD (eds) Soviet
whaling data (1949−1979). Centre for Russian Environ-
mental Policy, Moscow, p 141−181
Minton G, Collins TJQ, Pomilla C, Findlay KP, Rosenbaum
HC, Baldwin R, Brownell RL Jr (2008) Megaptera novaeangliae,
Arabian Sea subpopulation. IUCN Red List of
Threatened Species. Available at
www.iucnredlist.org/details/132835
Minton G, Cerchio S, Collins T, Ersts P and others (2010) A
note on the comparison of humpback whale tail fluke
catalogues from the Sultanate of Oman with Madagascar
and the East African Mainland. J Cetacean Res Manag
11: 65−68
Minton G, Collins TJQ, Findlay KP, Baldwin R (2011)
Cetacean distribution in the coastal waters of the Sul-
tanate of Oman. J Cetacean Res Manag 11: 301−313
Minton G, Collins T, Findlay K, Ersts P, Rosenbaum H,
Berggren P, Baldwin R (in press) Seasonal distribution,
abundance, habitat use and population identity of hump-
back whales in Oman. J Cetacean Res Manag (Spec
Issue Southern Hemisphere Humpback Whales)
Notarbartolo di Sciara G, Agardy T, Hyrenbach D, Scovazzi
T, Van Klaveren P (2008) The Pelagos Sanctuary for
Mediterranean marine mammals. Aquatic Conserv 18:
367−391
Oman Ministry of Agriculture and Fisheries (2002) Fisheries
statistical year book 2001. Ministry of Agriculture and
Fisheries, Muscat
Oman Ministry of National Economy (2009) 2009 statistical
yearbook. Ministry of National Economy, Muscat
Oviedo L, Solís M (2008) Underwater topography deter-
mines critical breeding habitat for humpback whales
near Osa Peninsula, Costa Rica: implications for marine
protected areas. Rev Biol Trop 56: 591−602
Parra GJ, Schick R, Corkeron PJ (2006a) Spatial distribution
and environmental correlates of Australian snubfin and
Indo-Pacific humpback dolphins. Ecography 29: 396−406
Parra GJ, Corkeron PJ, Marsh H (2006b) Population sizes,
site fidelity and residence patterns of Australian snubfin
and Indo-Pacific humpback dolphins: implications for
conservation. Biol Conserv 129: 167−180
Praca E, Gannier A, Das K, Laran S (2009) Modelling the
habitat suitability of cetaceans: example of the sperm
whale in the northwestern Mediterranean Sea. Deep-
Sea Res I 56: 648−657
R Development Core Team (2010) R: a language and environment
for statistical computing. R Foundation
for Statistical Computing, Vienna. Available at
www.R-project.org
Redfern JV, Barlow J, Ballance LT, Gerrodette T, Becker EA
(2008) Absence of scale dependence in dolphin−habitat
models for the eastern tropical Pacific Ocean. Mar Ecol
Prog Ser 363: 1−14
Rosenbaum HC, Pomilla C, Mendez M, Leslie MS and oth-
ers (2009) Population structure of humpback whales from
their breeding grounds in the South Atlantic and Indian
Oceans. PLoS ONE 4: e7318
Stafford KM, Citta JJ, Moore SE, Daher MA, George JE
(2009) Environmental correlates of blue and fin whale
call detections in the North Pacific Ocean from 1997 to
2002. Mar Ecol Prog Ser 395: 37−53
Stengel H, Al Harthy A (2002) The traditional fishery of the
Sultanate of Oman (fishing gear and methods). Ministry
of Agriculture and Fisheries, Directorate General of Fish-
eries Resources, Marine Science and Fisheries Center,
Muscat
Taylor BL, Wade PR, DeMaster DP, Barlow J (2000) Incorpo-
rating uncertainty into management models for marine
mammals. Conserv Biol 14: 1243−1252
Thomas D, Taylor E (1990) Study designs and tests for com-
paring resource use and availability. J Wildl Manag 54:
322−330
Thompson PM, Wilson B, Grellier K, Hammond PS (2000)
Combining power analysis and population viability
analysis to compare traditional and precautionary
approaches to conservation of coastal cetaceans. Con-
serv Biol 14: 1253−1263
Venables WN, Ripley BD (2002) Modern applied statistics
with S, 4th edn. Springer, New York, NY
Wade PR (1998) Calculating limits to the allowable human-
caused mortality of pinnipeds and cetaceans. Mar
Mamm Sci 14: 1−37
Wray P, Martin KR (1983) Historical whaling records from
the western Indian Ocean. Rep Int Whal Comm 5 (Spec
Issue): 213−241
Editorial responsibility: Daniel Palacios,
Pacific Grove, California, USA
Submitted: January 16, 2011; Accepted: July 8, 2011
Proofs received from author(s): October 6, 2011
... Understanding the relationships between cetaceans and their environment can provide fundamental knowledge for the development of conservation strategies (Corkeron et al., 2011). Species distribution models (SDMs) are an important tool in the identification of critical habitats (Rasmussen et al., 2007;Corkeron et al., 2011;Marshall et al., 2014). ...
... Understanding the relationships between cetaceans and their environment can provide fundamental knowledge for the development of conservation strategies (Corkeron et al., 2011). Species distribution models (SDMs) are an important tool in the identification of critical habitats (Rasmussen et al., 2007;Corkeron et al., 2011;Marshall et al., 2014). Critical habitats are defined as areas that provide a key value for the sustainability of a healthy population, including those used for raising calves (Hoyt, 2005). ...
... Despite the technique's widespread use, it is important when using presence-only data to acknowledge and address potential data limitations related to the non-systematic survey, which can be spatiotemporally heterogeneous and possibly biased towards easily accessible areas and times that present better navigation conditions (Corkeron et al., 2011) as well as the commercial basis of the navigation. The potential limitations of the data can be addressed by implementing spatial filtering and a good validation approach (Smith et al., 2021) such as the use of a geographic resampling parameter, which was used in the present study. ...
Article
Full-text available
Understanding the relationships between cetaceans and their environment is crucial for conservation. This study examined humpback whales in Bahía de Banderas, Mexico, identifying key calving habitats. From 2018 to 2023, 1066 sightings were recorded, including 242 mother–calf groups, 109 mating groups, and 715 other groups. Spatial analysis revealed a non-random distribution; both the Kruskal–Wallis and Wilcoxon–Mann–Whitney tests detected significant differences ( P < 0.05) in site preferences. Calving mothers favoured habitats with a mean depth of 59 m and a distance of 2 km from the coast, while mating groups preferred locations at 126 m and 4 km, and other groups chose areas at 149 m and 4 km. All groups were found in relatively flat areas around 2° seafloor slope. A dispersion test indicated a significant relationship between the location of calving mothers and environmental factors. K -means clustering showed 83.6% of calving mothers' sightings at depths less than 40 m and 2 km from the coast. Ensemble species distribution models identified three critical calving areas: one large area (261.8 km ² ) along the north coast and two smaller areas (9.5 and 5.4 km ² ) at the southern end of the bay. This study highlights Bahía de Banderas as a vital breeding habitat for humpback whales, providing insights for conservation strategies to protect calving grounds during the breeding season.
... Both non-systematic haphazard (sensu [35]) and systematic sampling procedure. More details in [30,31]. ...
... One omnidirectional hydrophone Bruel e Kjer (Naerum, Denmark) model 8104 (sensitivity -205.6 dB re 1 V/1 μPa ± 4.0 dB), with a bandwidth < 0. Non-systematic haphazard sampling procedure (sensu [35]). More details in [26]. ...
Article
Full-text available
Acoustic sequences are commonly observed in many animal taxa. The vast vocal repertoire of common bottlenose dolphins (Tursiops truncatus) also includes sequences of multi-unit rhythmic signals called bray-call which are still poorly documented, both functionally and geographically. This study aimed to (1) describe, classify, and characterize series of bray-call recorded in two sites of the Mediterranean basin (Rome-Tyrrhenian Sea and Mazara del Vallo-Strait of Sicily) and (2) investigate for the existence of possible geographic differences. The acoustic analysis identified 13 different sequence types, only two detected in both study areas. The Sørensen-Dice index revealed a low degree of similarity between the sequence repertoire of the two common bottlenose dolphin sub-populations, with the Tyrrhenian being more diversified and complex than the Sicilian one. The acoustic parameters also showed variability between the study area. Different variants of the main acoustic elements composing the bray-call sequences were detected in the Tyrrhenian Sea only. The Markov-chain model demonstrated that the transition probability between acoustic elements is not uniform, with specific combinations of elements having a higher probability of occurrence. These new findings on common bottlenose dolphin bray-call sequences highlight the structural complexity of these vocalizations and suggest addressing future research on the context of emissions and the possible function(s) of such acoustic arrangements.
... However, data collection is costly and often difficult in remote areas (Franchini et al., 2020). Moreover, the databases that do exist for most cetaceans, including Commerson's dolphins, rarely comply with the assumptions for modeling, especially when data have been obtained from platforms of opportunity (Corkeron et al., 2011). ...
... This study also provides evidence that the same features that condition the distribution of Commerson's dolphins on a regional scale are those that rule its distribution on a small scale (Dellabianca et al., 2016;Franchini et al. 2020;Garaffo et al. 2011). From an analytical point of view, using the INLA framework to model habitat preferences also proves to be a powerful tool to analyze data originating from opportunistic or nonsystematic sampling such as haphazard (Corkeron et al., 2011). ...
... Consequently, unsystematic records in cetacean databases are signi cantly in uenced by the heterogeneous distribution of survey efforts. This distribution, in turn, is affected by the di culty in selecting locations that maximize the likelihood of animal presence and/or provide suitable observations (Corkeron et al. 2011). In this study, we analyze surveys conducted by various Gulf of California, Mexico researchers, focusing on collecting whale records. ...
Preprint
Full-text available
Data on the distribution of most species are often collected using non-standardized sampling protocols, resulting in biased data due to preferential selection of certain environmental conditions. This study aimed to assess the distribution of survey effort for whale monitoring in the Gulf of California, México and estimate its correlation with environmental variables at different resolutions. This comprehensive database compiles navigation details and species observations from 1982 to 2018. The number of navigation routes for whale monitoring in the Gulf of California was calculated, and 10% and 5% of the best-surveyed cells were located at five different resolutions. Generalized Linear Models were employed to estimate the explanatory capacity of eight environmental variables in the distribution of the survey effort. Only approximately 3%-10% of the entire area can be considered well-surveyed. Collection effort was highest in areas with cold waters, high levels of particulate organic carbon, and phytoplankton, irrespective of resolution. However, regardless of environmental conditions, the distribution of survey efforts correlated with available data on the distribution of whales. These results suggest that the knowledge and prolonged interaction between data collectors and the whale population mainly influence the heterogeneous distribution of survey effort. Understanding biases and associated factors in survey effort distribution may provide insights for future monitoring programs. This knowledge can inform effective conservation strategies for whales in the Gulf of California and beyond.
... Pardo & Palacios 2006, Weir et al. 2012, Tardin et al. 2017. Both the whales and their prey therefore occur where physiographic features, such as the continental shelf break (Corkeron et al. 2011), or oceanographic features, such as the Benguela Current along the African coast (Weir et al. 2012) and the Kuroshio Front in the western Pacific (Watanabe et al. 2012), maintain persistent upwelling and high prey density. In addition, seasonal changes in prey distribution and inter-annual variation related to the ENSO cycle may also drive variability in Bryde's-like whale ranges as the underlying distribution of prey changes (Best 2001, Salvadeo et al. 2011, Kerosky et al. 2012, Dwyer et al. 2016. ...
Article
Full-text available
The newly recognized Rice’s whale Balaenoptera ricei is among the most endangered large whale species in the world and primarily occupies a region near the continental shelf break in the northeastern Gulf of Mexico (GoMex). We analyzed visual line-transect survey data collected throughout the northern GoMex from 2003-2019 and developed spatially explicit density maps using a density surface modeling approach to examine relationships between Rice’s whale density and bathymetric and oceanographic features. We identified water depth, surface chl a concentration, bottom temperature, and bottom salinity as key parameters that define the Rice’s whale habitat. This is consistent with upwelling of cold, high-salinity water along the continental shelf break and seasonal input of high-productivity surface water originating from coastal sources. The dominant circulation patterns in the GoMex, including the presence of Loop Current eddies, lead to increased productivity and likely play a role in maintaining high densities of forage species needed to support Rice’s whales. Extrapolation of the model suggests additional regions in Mexican waters of GoMex that may be suitable for Rice’s whales. This study informs the designation of critical habitat as defined by the US Endangered Species Act and will assist in marine spatial planning activities to avoid additional anthropogenic impacts to Rice’s whales associated with the development of wind energy and aquaculture.
... Nevertheless, heterogeneous data are complex to manage as they are polymorphic in nature and affected by numerous forms of bias and limitations (Isaac and Pocock 2015). Information on species occurrence collected at sea by sea-users, for example, is characterised by a different spatiotemporal distribution of effort, which can be biased toward easily accessible habitat and times with better weather, or known areas of use (Corkeron et al. 2011, Sicacha-Parada et al. 2020). Hence, a simple data pooling (Fletcher et al. 2019) with data gathered under conventional research methodologies is not enough to reliably model the presence of a species considering different explanatory variables both environmental and anthropogenic and to define its distribution over multiple spatial and temporal scales. ...
Article
Full-text available
Presence‐only data are typical occurrence information used in species distribution modelling. Data may be originated from different sources, and their integration is a challenging exercise in spatial ecology as detection biases are rarely fully considered. We propose a new protocol for presence‐only data fusion, where information sources include social media platforms, to investigate several possible solutions to reduce uncertainty in the modelling outputs. As a case study, we use spatial data on two dolphin species with different ecological characteristics and distribution, collected in central Tyrrhenian through traditional research campaigns and derived from a careful selection of social media images and videos. We built a spatial log‐Gaussian cox process that incorporates different detection functions and thinning for each data source. To finalize the model in a Bayesian framework, we specified priors for all model parameters. We used slightly informative priors to avoid identifiability issues when estimating both the animal intensity and the observation process. We compared different types of detection function and accessibility explanations. We showed how the detection function's variation affects ecological findings on two species representatives for different habitats and with different spatial distribution. Our findings allow for a sound understanding of the species distribution in the study area, confirming the proposed approach's appropriateness. Besides, the straightforward implementation in the R software, and the provision of examples' code with simulated data, consistently facilitate broader applicability of the method and allow for further validations. The proposed approach is widely functional and can be considered with different species and ecological contexts.
... Lockyer & Brown, 1981;Valenzuela et al., 2009;Barendse et al., 2013), Bryde's whales do not undertake such long-range migrations and seem to feed regularly rather than relying on stored reserves (characteristic of income breeders; Constantine et al., 2018). These high energetic requirements may be an important driver of seasonal fluctuations in Bryde's whales (Tardin et al., 2017), although other studies have also described physiographic and oceanographic influences (Corkeron et al., 2011;Weir, MacLeod & Pierce, 2012). Marrero, pers. ...
Article
• The conservation of marine megafauna presents numerous difficulties owing to their high mobility over difficult-to-access oceanic areas that impairs the collection of basic, but essential, biological information. • The Bryde's whale (Balaenoptera edeni) is one of the most elusive species of baleen whales, and although it is known to be a seasonal visitor to several archipelagos in Macaronesia (the Azores, Madeira, and Canaries), there are no studies regarding its occurrence or geographical connectivity in this area of the Atlantic. • A 14-year photographic database was used to determine short-term (intra-seasonal) and long-term (inter-annual) Bryde's whale site fidelity and to estimate individual residency times in Madeira, whereas photographic catalogues from Madeira and the Canaries were compared in order to assess large-scale movements (i.e. on the scale of hundreds of kilometres). • In Madeira, 59 individuals were identified, 27 (45.8%) of which were recaptured. Of these, 10 individuals (37.0%) presented short-term site fidelity and 17 individuals (63.0%) presented long-term site fidelity, with a maximum recapture interval of 12 years. Lagged identification rates showed that five individuals (SE = 2) remained in the area for 32 days (SE = 108 days) before leaving and not returning during the same year. Seven individuals were seen both in Madeira and the Canaries (catalogue comprising 51 individuals), three of which were identified multiple times in both archipelagos, with a minimum of 43 days between consecutive sightings. • This information combined with the fact that this species is commonly sighted accompanied by calves and feeding in both archipelagos highlights the ecological importance of this area for Bryde's whales. This should be taken into consideration by policymakers when implementing conservation measures, where coordination of effort among countries is needed. 
This study also reinforces the value of using data from platforms of opportunity and of making photographic data open access.
... Dolphin data has been collected over 13 years (2007-2019) by three sources: a) conventional research protocols from motor and sailing boats (non-systematic 'haphazard', sensu [Corkeron et al., 2011a]) (labelled UNIRM) [Pace et al., 2019]; b) standardized monitoring protocols from platforms of opportunity within the project "FLT Mediterranean Monitoring Network" (labelled FERRY) [ISPRA, 2016., Pace et al., 2019; c) social media reports (Facebook and YouTube) by sea-users [Pace et al., 2019] (labelled SM). Data collection procedures and selection are provided in Pace et al. [2019]. ...
Preprint
Presence-only data are common in species distribution modeling. They record the locations where a species was present and carry no information on absences, and models of such data usually do not account for detection biases. In this work, we aim to merge three different sources of information to model the presence of marine mammals. The approach is fully general and is applied to two species of dolphins in the Central Tyrrhenian Sea (Italy) as a case study. Data come from the Italian Environmental Protection Agency (ISPRA) and Sapienza University of Rome research campaigns, and from a careful selection of social media (SM) images and videos. We build a Log Gaussian Cox process in which a different detection function describes each data source. For the SM data, we analyze several choices that allow accounting for detection biases. Our findings allow for a correct understanding of Stenella coeruleoalba and Tursiops truncatus distribution in the study area. The results indicate that the proposed approach is broadly applicable and easily implemented in R using INLA and inlabru. We provide example code with simulated data in the supplementary materials.
Article
The sub‐population of humpback whales inhabiting the Arabian Sea is a small and genetically distinct population that remains in low latitudes year‐round. Designated as Endangered on the IUCN Red List of Threatened Species, the sub‐population faces a number of threats throughout its range, including entanglement in fishing gear, ship strikes, disease and habitat degradation. Research conducted primarily off the coast of the Sultanate of Oman over the past 20 years has contributed to understanding the population’s distribution, abundance, and conservation status. However, information on the population’s health and specific threats is limited. This study examines all available images of Arabian Sea humpback whales obtained between 2000 and 2018 for evidence of disease, predation, epizoites, ectoparasites, and human‐induced scars and wounds. Tattoo skin disease‐like lesions were detected in 41% of 93 whales, with a roughly equal distribution between males and females. Prevalence of the disease was significantly higher in 2012–2018 (51.7%) than in 2000–2011 (27.6%). Killer whale tooth rakes were detected on the ventral surface of the tail flukes of 12% (95% CI 4.5–18%) of 77 individuals. Roughly two thirds (66.6%: 95% CI 52–80%) of the 42 individuals represented by good quality photographs of the caudal peduncle region at the fluke insertion bore scarring patterns consistent with entanglement in fishing gear. At least two individuals showed severe injuries or deformations likely caused by interactions with fishing gear. Six individuals had injuries consistent with vessel strikes. Documented entanglement events from Oman and Pakistan involved large‐mesh nylon gillnets, known to be used extensively throughout the Arabian Sea.
These findings indicate an urgent need to design effective measures for the management and mitigation of threats, and to continue monitoring Arabian Sea humpback whales, with an emphasis on methods that allow continued and expanded assessment of health, body condition, and anthropogenic interactions.
Thesis
The Upper Gulf of California (UGC) is a province of the Gulf of California (GC). The oceanography and physiography of the area make it a productive region of high biodiversity, including cetaceans: eight species of mysticetes and 23 of odontocetes have been recorded there. Cetacean movement patterns have been described as depending on the movement of their prey and on reproduction and calf rearing. Knowing the distribution of species is a basic question in ecology, and species distribution models (SDMs) have provided an alternative for the management and conservation of species or populations. However, distribution is determined by a variety of ecological, evolutionary and geographical factors, which in turn make the study of species distributions a complex subject. With the aim of estimating the presence and distribution of cetaceans in the UGC, in this study we built SDMs for six cetacean species with different ecological requirements: Balaenoptera edeni, B. physalus, B. musculus, Delphinus capensis, Tursiops truncatus and Orcinus orca. To do so, we used Generalized Linear Models at five different resolutions; using data from the temperate and warm seasons, presence/absence data and sighting frequencies were related to eight environmental variables. True absences were estimated from the navigation tracks, which were also taken as a proxy for effort. The best model for each species was the one with the highest deviance explained by the environmental variables and the least influence of effort. For all models, species and seasons analyzed, the best models were obtained at a resolution of 10 x 10 km.
In the binomial models for odontocetes, sea surface temperature (SST) and dissolved molecular oxygen, followed by phytoplankton (PHY) and primary productivity (PP), were the significant variables with the greatest influence on dolphin presence. SST, PHY and PP were the most important variables for mysticete presence. With the exception of a few small patches, the entire UGC proved to be a favorable area for the presence of cetaceans. Sea surface temperature and bathymetry (Bat) were the most significant variables in the frequency models for all cetaceans. For mysticetes, distribution was concentrated in the northernmost parts of the UGC during the temperate season and in the central and southwestern zone during the warm season. The estimates were driven by the ecological and biological needs of each species.
Article
Small boat surveys were conducted between 2000 and 2003 in three main regions of Oman's coastal waters: Muscat, the Gulf of Masirah and Dhofar. Survey data were analysed to calculate relative abundances of the seven most frequently encountered species in these areas. These include (in order of frequency) bottlenose dolphins (Tursiops sp.), long-beaked common dolphins (Delphinus capensis), humpback whales (Megaptera novaeangliae), spinner dolphins (Stenella longirostris), Indo-Pacific humpback dolphins (Sousa chinensis), Bryde's whales (Balaenoptera sp.) and Risso's dolphins (Grampus griseus). Other species observed include false killer whales (Pseudorca crassidens), blue whales (Balaenoptera musculus), rough-toothed dolphins (Steno bredanensis) and unidentified beaked whales. Encounter rates per distance searched were plotted by 0.1 × 0.1 degree grid cell, giving an indication of relative abundances and key areas of habitat used by each of the seven most frequently encountered species. These plots demonstrate that the nearshore areas of the Gulf of Masirah, as well as the coastal waters of Dhofar, are areas of concentration for the Arabian Sea's recently designated Endangered subpopulation of humpback whales, as well as Indo-Pacific humpback dolphins, which are considered Near Threatened on the IUCN Red List of Threatened Species. The results presented here provide valuable baseline data for future research and help to inform conservation management efforts that are required to address the highly vulnerable status of the humpback whale and Indo-Pacific humpback dolphin populations in question.
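The gridded encounter rate described in this abstract can be illustrated with a minimal Python sketch. It assumes effort has already been split into segments tagged with a position and a distance searched in km; the function names, the input layout and the coordinates are hypothetical and are not taken from the study.

```python
import math
from collections import defaultdict

CELL_DEG = 0.1  # grid resolution in degrees, as stated in the abstract


def cell_index(lon, lat, cell=CELL_DEG):
    """Return integer (column, row) indices of the grid cell containing a point."""
    # The small epsilon guards against floating-point error on cell edges.
    return (math.floor(lon / cell + 1e-9), math.floor(lat / cell + 1e-9))


def encounter_rates(effort, sightings, cell=CELL_DEG):
    """Sightings per km searched, keyed by grid cell.

    effort:    iterable of (lon, lat, km_searched) segments (assumed layout)
    sightings: iterable of (lon, lat) positions for one species
    Cells without effort are omitted, so sightings there are ignored.
    """
    km = defaultdict(float)
    counts = defaultdict(int)
    for lon, lat, dist in effort:
        km[cell_index(lon, lat, cell)] += dist
    for lon, lat in sightings:
        counts[cell_index(lon, lat, cell)] += 1
    return {c: counts.get(c, 0) / d for c, d in km.items() if d > 0}


# Hypothetical example: two effort segments in one cell, one in a neighbour.
effort = [(54.05, 17.02, 10.0), (54.08, 17.07, 5.0), (54.15, 17.02, 20.0)]
sightings = [(54.03, 17.04), (54.06, 17.01), (54.07, 17.09)]
rates = encounter_rates(effort, sightings)
```

Dividing by distance searched rather than plotting raw counts keeps heavily surveyed cells from appearing richer than lightly surveyed ones, which matters when effort is haphazard.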
Book
A guide to using S environments to perform statistical analyses providing both an introduction to the use of S and a course in modern statistical methods. The emphasis is on presenting practical problems and full analyses of real data sets.
Article
A simulation method was developed for identifying populations with levels of human-caused mortality that could lead to depletion, taking into account the uncertainty of available information. A mortality limit (termed the Potential Biological Removal, PBR, under the U.S. Marine Mammal Protection Act) was calculated as the product of a minimum population estimate (N(MIN)), one-half of the maximum net productivity rate (R(MAX)), and a recovery factor (F(R)). Mortality limits were evaluated based on whether at least 95% of the simulated populations met two criteria: (1) that populations starting at the maximum net productivity level (MNPL) stayed there or above after 20 yr, and (2) that populations starting at 30% of carrying-capacity (K) recovered to at least MNPL after 100 yr. Simulations of populations that experienced mortality equal to the PBR indicated that using approximately the 20th percentile (the lower 60% log-normal confidence limit) of the abundance estimate for N(MIN) met the criteria for both cetaceans (assuming R(MAX) = 0.04) and pinnipeds (assuming R(MAX) = 0.12). Additional simulations that included plausible levels of bias in the available information indicated that using a value of 0.5 for F(R) would meet both criteria during these 'bias trials.' It is concluded that any marine mammal population with an estimate of human-caused mortality that is greater than its PBR has a level of mortality that could lead to the depletion of the population. The simulation methods were also used to show how mortality limits could be calculated to meet conservation goals other than the U.S. goal of maintaining populations above MNPL.
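The mortality limit in this abstract is a simple product, which the following minimal Python sketch makes concrete. The log-normal formula for N(MIN) with z = 0.842 (the approximate 20th percentile) is the conventional form and is assumed here rather than quoted from the abstract; the function names and the example abundance and CV are hypothetical.

```python
import math


def n_min(n_best, cv, z=0.842):
    """Lower log-normal bound on abundance.

    z = 0.842 corresponds to roughly the 20th percentile, i.e. the lower
    limit of a 60% two-tailed confidence interval (assumed formula).
    """
    return n_best / math.exp(z * math.sqrt(math.log(1.0 + cv ** 2)))


def pbr(n_best, cv, r_max=0.04, f_r=0.5):
    """Potential Biological Removal = N_MIN * (R_MAX / 2) * F_R.

    r_max = 0.04 is the cetacean default named in the abstract;
    f_r = 0.5 is the recovery factor that met both criteria in the bias trials.
    """
    return n_min(n_best, cv) * 0.5 * r_max * f_r


# Hypothetical example: abundance estimate of 1000 animals with CV = 0.3.
limit = pbr(1000, 0.3)
```

A larger CV lowers N(MIN) and therefore the limit, so imprecise abundance estimates automatically yield more precautionary mortality limits.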
Article
Classified study designs for comparing resource (food, habitat) use and availability into 3 basic types. Design 1 permits investigation of resource selectivity only at the population level because individual animals are not identified. Designs 2 and 3 measure use by individuals and thus allow examination of the variation in resource selection strategies. Resource availabilities are measured for each individual in Design 3 but not in Design 2. Graphical plots illustrating individual selection are recommended for data resulting from Designs 2 and 3 to assess variability and possible sex or age differences. The authors recommend a method for determining the number of random points required to bound the probable error in estimating resource availability proportions simultaneously, rather than individually. Four problem areas in the use of statistical methods for evaluating resource selectivity are identified: dependencies among observations, misuse of the Chi-square goodness-of-fit test when availabilities are estimated, tests that do not control experimentwise error rates, and the sensitivity of tests to the subjective inclusion or exclusion of resources. -Authors
Chapter
Exploratory spatial data analysis (ESDA) as used in spatial statistics, spatial econometrics and geostatistics, developed from exploratory data analysis (EDA). In particular, two threads that are central to a-spatial EDA have carried over to ESDA – the importance of the data themselves, and the importance of analytical graphics in representing chosen characteristics of the data.
Article
Previously published data on the occurrence of humpback whales (Megaptera novaeangliae) in the Arabian Sea suggest that the region hosts a non-migratory population that adheres to a Northern Hemisphere breeding cycle. In order to investigate the distribution and abundance of this population, twelve small boat surveys were conducted in three main locations off the coast of Oman between February 2000 and November 2004. Humpback whales were observed during surveys in Dhofar and Gulf of Masirah on Oman’s Arabian Sea coast, but not during surveys in the Muscat region in the Gulf of Oman. An even ratio of males to females was observed and sampled during surveys in the Gulf of Masirah, which was surveyed in October and November (n = 38), while almost all whales sampled in Dhofar in February/March were male (n = 28). Song was detected frequently in the bay surrounding the Halaniyat Islands (formerly known as the Kuria Muria Bay) in February/March, but observations of mother-calf pairs were sparse, and competitive groups were absent. Feeding was observed in both October/November and February/March, but behavioural and environmental observations indicate that the Gulf of Masirah is primarily an important feeding ground, while the Dhofar region, particularly the Halaniyat Bay, may be a breeding area. However, limited survey effort and a lack of recent observations of mother-calf pairs or competitive groups raise the possibility that the primary mating, calving and nursing areas are yet to be identified. Sixty-four individual whales were identified using photographs of dorsal fins or tail flukes. A high rate of re-sightings between years and between survey areas at different times of the year indicates year-round residence off the coast of Oman. A Chapman’s modified Petersen estimator was applied to various data pairings to calculate abundance.
All pairings yielded estimates of less than 100 individuals, but sample sizes were small and there were various sources of possible bias. Analysis of scarring on the caudal peduncle region of identified individuals in Oman indicates that between 30 and 40% are likely to have been involved in entanglements with fishing gear. Comparison of the Oman photo-identification catalogue with those from Zanzibar, Antongil Bay (Madagascar) and Mayotte and the Geyser Atoll (Comoros Archipelago), yielded no photographic matches. These data are consistent with the hypothesis of a discrete population. The distribution of fluke pigmentation rankings from the Oman catalogue, which varied significantly from those of Madagascar and Mayotte, provides further evidence for this theory. The evidence presented here provides a strong underpinning for the recent IUCN Red List classification of the Arabian Sea sub-population of humpback whales as Endangered. In light of ongoing coastal development and other threats to this population’s habitat and future survival, urgent research and conservation measures are recommended.
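The Chapman-modified Petersen estimator named in this abstract can be sketched in a few lines of Python. The standard textbook forms of the estimator and its variance are assumed; the function names and the counts in the example are hypothetical and are not the Oman data.

```python
import math


def chapman_estimate(n1, n2, m2):
    """Chapman's modified Petersen abundance estimate.

    n1: individuals identified in the first sample
    n2: individuals identified in the second sample
    m2: individuals in the second sample already identified in the first
    """
    return (n1 + 1) * (n2 + 1) / (m2 + 1) - 1


def chapman_variance(n1, n2, m2):
    """Approximate variance of the Chapman estimate (standard form, assumed)."""
    return ((n1 + 1) * (n2 + 1) * (n1 - m2) * (n2 - m2)
            / ((m2 + 1) ** 2 * (m2 + 2)))


# Hypothetical pairing: 20 whales in sample 1, 15 in sample 2, 5 re-sighted.
n_hat = chapman_estimate(20, 15, 5)
se = math.sqrt(chapman_variance(20, 15, 5))
```

The added ones reduce the small-sample bias of the plain Petersen ratio and keep the estimate finite when no re-sightings occur, which is relevant for photo-identification catalogues as small as the one described here.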
Article
The photo-identification catalogue of humpback whale tail flukes from Oman was compared with those from Antongil Bay, Madagascar and study sites in South Africa and Mozambique collectively termed the 'East African Mainland'. No matches were found, supporting other lines of evidence that the humpback whales studied off the coast of Oman form part of a discrete Arabian Sea population, which adheres to a Northern Hemisphere breeding cycle, and has little or no ongoing exchange with the nearest neighbouring populations in the southern Indian Ocean. While the sample size from Oman is small, and low levels of ongoing exchange might not be detected in this type of catalogue comparison, the study nonetheless emphasises the need to pursue research and conservation efforts in the known and suspected range of the Endangered Arabian Sea humpback whale population.