ArticlePDF Available

Abstract and Figures

Research and development on atmospheric and topographic correction methods for multispectral satellite data such as Landsat images has far outpaced the availability of those methods in geographic information systems software. As Landsat and other data become more widely available, demand for these improved correction methods will increase. Open source R statistical software can help bridge the gap between research and implementation. Sophisticated spatial data routines are already available, and the ease of program development in R makes it straightforward to implement new correction algorithms and to assess the results. Collecting radiometric, atmospheric, and topographic correction routines into the landsat package will make them readily available for evaluation for particular applications.
Content may be subject to copyright.
JSS Journal of Statistical Software
July 2011, Volume 43, Issue 4. http://www.jstatsoft.org/
Analyzing Remote Sensing Data in R: The landsat
Package
Sarah C. Goslee
USDA-ARS Pasture Systems and Watershed Management Research Unit
Abstract
Research and development on atmospheric and topographic correction methods for
multispectral satellite data such as Landsat images has far outpaced the availability of
those methods in geographic information systems software. As Landsat and other data
become more widely available, demand for these improved correction methods will in-
crease. Open source Rstatistical software can help bridge the gap between research and
implementation. Sophisticated spatial data routines are already available, and the ease
of program development in Rmakes it straightforward to implement new correction al-
gorithms and to assess the results. Collecting radiometric, atmospheric, and topographic
correction routines into the landsat package will make them readily available for evaluation
for particular applications.
Keywords: atmospheric correction, Landsat, radiometric correction, R, remote sensing, satel-
lite, topographic correction.
1. Introduction
Satellite remote sensing data can provide a complement or even alternative to ground-based
research for large scale studies or over long periods. The Landsat platform is the premier
example: images have been collected continuously since the early 1970s. If both Landsat 5
and 7 are considered, every location within the United States is imaged every 8 days. Other
satellite platforms have become available more recently. Images from all platforms can be
used for mapping land cover, tracking land use change, and even estimating plant biomass.
Image characteristics vary from date to date. Some of this variation is due to solar elevation
angle and can be easily corrected for given date and time (radiometric calibration). Addi-
tional variation is caused by atmospheric conditions at the time of imaging: scatter at different
wavelengths due to haze. Atmospheric corrections are very complex. If one image every few
2landsat: Analyzing Remote Sensing Data in R
years is analyzed, for example to track development patterns, atmospheric corrections may
be unnecessary because of the magnitude of the change in land use during that interval. If
researchers wish to take advantage of the temporal data density offered by Landsat, careful
correction is required. Otherwise differences in apparent reflectance due to atmospheric vari-
ation can swamp differences caused by actual change on the ground. In mountainous areas,
topographic corrections are also necessary to compensate for differences in incident radiation
due to slope and aspect. Without correction, the same ground cover on opposite sides of the
mountain could return a different signal.
Many procedures to implement atmospheric and topographic corrections have been proposed,
each with strengths and weaknesses, but few are readily available in geographic information
systems (GIS) or image processing software. It is impossible to compare methods to identify
the one most suited for a given application. Indeed, it is not always clear exactly what
proprietary GIS software is doing. Roffers an alternative to GIS software for research and
testing of satellite image processing algorithms. The interpreted nature of Rmakes it possible
to implement, test and modify algorithms easily. The available graphical and statistical tools
are vastly superior to anything available in common GIS packages, making it straightforward
to compare algorithms. Tools for processing and display of spatially-referenced data are
already available in R(RDevelopment Core Team 2011). The sp (Bivand, Pebesma, and
Gomez-Rubio 2008) and rgdal (Keitt, Bivand, Pebesma, and Rowlingson 2011) packages
provide these capabilities for several commonly-used spatial formats.
The landsat package described here, and available from the Comprehensive RArchive Network
at http://CRAN.R-project.org/package=landsat, provides basic tools for working with
satellite imagery such as automated georeferencing and cloud detection. It contains functions
for radiometric normalization, and several different approaches to atmospheric correction.
Four topographic correction algorithms have been implemented. Other useful functions such
as bare soil line and tasseled cap calculations have been included. While these functions were
developed with Landsat data in mind, they are suitable for use with satellite imagery from
other platforms as long as appropriate calibration data are used.
2. Landsat platform characteristics
Landsat 5 (TM) was launched on 1984-03-01 and Landsat 7 (ETM+) on 1999-04-15. Each
revisits a location every 16 days, and the two orbits are staggered so an image is taken for
each location every 8 days. The Landsat Data Continuity Mission is planned to launch in
2011, and will record data in bands compatible with the TM and ETM+ instruments. Due
to a hardware failure on 2003-05-31, Landsat 7 scenes are now missing 22% of the pixels. The
problem is most severe near the edges of a image. Band characteristics are largely consistent,
but ETM+ added an additional band (Table 1).
The Landsat images have been converted to integer digital numbers (DN ) before distribution
to facilitate storage and display. They may require conversion to radiance or reflectance,
topographic correction or atmospheric correction. If using a single image, or images widely
separated in time to examine gross changes, minimal processing may be required. For detailed
comparison of vegetation indices from multiple images, however, careful correction is needed.
The most accurate atmospheric corrections require ground data taken during the satellite
overpass. For retrospective studies this is impossible to obtain, and less-accurate image-based
Journal of Statistical Software 3
Landsat 5
Planned wavelength Actual wavelength Resolution
Band 1 Blue 0.45–0.52 0.452–0.518 30
Band 2 Green 0.52–0.60 0.528–0.609 30
Band 3 Red 0.63–0.69 0.626–0.693 30
Band 4 Near infrared (NIR) 0.76–0.90 0.776–0.904 30
Band 5 Middle infrared (MIR) 1.55–1.75 1.567–1.784 30
Band 6 Thermal infrared (TIR) 10.40–12.50 10.45–12.42 120
Band 7 Middle infrared (SWIR) 2.08–2.35 2.097–2.349 30
Landsat 7
Planned wavelength Actual wavelength Resolution
Band 1 Blue 0.45–0.52 0.452–0.514 30
Band 2 Green 0.52–0.60 0.519–0.601 30
Band 3 Red 0.63–0.69 0.631–0.692 30
Band 4 Near infrared (NIR) 0.77–0.90 0.772–0.898 30
Band 5 Middle infrared (MIR) 1.55–1.75 1.547–1.748 30
Band 6 Thermal infrared (TIR) 10.40–12.50 10.31–12.36 60
Band 7 Middle infrared (SWIR) 2.09–2.35 2.065–2.346 30
Band 8 Panchromatic 0.52–0.90 0.515–0.896 15
Table 1: Band wavelengths (µm) and resolutions (m) for Landsat 5 Thematic Mapper (TM)
and 7 Enhanced Thematic Mapper (ETM+). Band 8 (ETM+ only) is higher-resolution visible
light data. Actual wavelengths are from Chander et al. (2009).
correction methods must be used. The following sections describe the procedures needed for
calibration of Landsat data and the Rimplementations of these algorithms in the landsat
package.
3. Sample data
The landsat package includes a 300 ×300 pixel subset of United States Geological Survey
(USGS) Landsat ETM+ images from two dates, 2002-07-20 and 2002-11-25 (US Geological
Survey 2010b). These images are in SpatialGridDataFrame format, and can be displayed
using image(). Even after clouds are removed, the July image has a higher mean DN and
greater dynamic range than the November image (Figure 1c, d). Complete metadata are
included in the help files. A USGS digital elevation model (DEM) covering the same area and
at the same resolution has been included (Figure 1f; US Geological Survey 2010a).
4. Tools
This package provides a few basic tools for working with Landsat images. The lssub()
function is an Rinterface to the image subsetting tools from the Geospatial Data Abstraction
Library (GDAL, GDAL Development Team 2011). This function is provided for convenience;
while the same effect can be obtained using the subsetting property of sp, it is considerably
faster to use the GDAL functions to remove a smaller section of a geotiff image. Landsat
images are usually distributed as geotiff files, and can be imported in SpatialGridDataFrame
4landsat: Analyzing Remote Sensing Data in R
Figure 1: Band 4 (near infrared) from Landsat images from July and November 2002. The
July image has substantial cloud cover, as identified in the cloud mask produced by clouds().
November 2002 was cloud-free, so no mask is shown.
Journal of Statistical Software 5
format using readGDAL. The lssub() function preserves the geotiff format.
Image processing requires some information not often available in the metadata. The Earth-
Sun distance for a given date can be calculated with ESdist(). A date can be conveniently
converted to decimal format with ddist().
4.1. Automated georeferencing
A simple error-minimization routine can be used to provide relative georeferencing by match-
ing one image to a reference image (georef and geoshift by means of vertical and horizontal
shifts. If the reference image has been absolutely georeferenced, then all of the subsequent
images will also be spatially referenced. This function can find local minima, so results should
always be checked visually. The matching process only needs to be done once for each date;
Band 3 or Band 4 generally provide good results, but any band may be used. The results of
this step can then be used for each band image from that date. Sufficient padding must be
added around the edges of the image to accommodate the magnitude of the shift. The larger
the image area, the more effective the matching process. The below example illustrates the
procedure, but because of the small image sizes used for the demo data the shift coefficients
are unreliable.
R> july.shift <- georef(nov3, july3, maxdist = 50)
R> july1.corr <- geoshift(july1, padx = 10, pady = 10, july.shift$shiftx,
+ july.shift$shifty)
4.2. Topographic calculations
Topographic corrections require the use of slope and aspect data calculated from a DEM of
the same resolution as the satellite data. The slopeasp() function will calculate both given a
DEM. While all GIS software will provide topographic calculations, this function was included
for the convenience of being able to do all the processing within Rand to allow exploration
of different algorithms. Most GIS software offers only a tiny subset of the methods that have
been proposed.
The most common method for slope and aspect calculations is the third-order finite difference
weighted by reciprocal of distance (Unwin 1981;Clarke and Lee 2007). This method is the
equivalent of using a 3 ×3 Sobel filter to determine the east-west slope and the north-south
slope. Given a cell Zi,j with east-west cell size EW res and north-south cell size NSres , slope
in either percent or degrees, and aspect with north as 0, east as 90and south as 180can
be calculated using Equation 1. A simple smoothing correction (dividing by a smoothing
parameter before taking the arctan) can reduce extreme slope values (Riano, Chuvieco, Salas,
and Aguado 2003).
EW = [(Zi+1,j+1 + 2Zi+1,j +Zi+1,j 1)(Zi1,j+1 + 2Zi1,j +Zi1,j1)]/(8EW res)
NS = [(Zi+1,j+1 + 2Zi,j +1 +Zi1,j+1 )(Zi+1,j1+ 2Zi,j1+Zi1,j1)]/(8NS res )
θp= arctan pEW 2+NS 2
slope% = 100 ·pEW 2+NS2
φo= 180arctan NS
EW + 90EW
|EW |
(1)
6landsat: Analyzing Remote Sensing Data in R
5. Radiometric calibration
Landsat images are distributed as digital numbers, integer values from 0–255. While less
crucial now, this correction was originally necessary to make it possible to store, distribute
and portray these images efficiently. Radiometric calibration is a two-step process. First
the DN values are converted to at-satellite radiance using parameters provided in the image
metadata. Data on solar intensity are used to convert the at-satellite radiance to at-satellite
reflectance. These parameters are also in the image metadata.
5.1. At-sensor radiance
The first step in processing is to convert DN to at-sensor spectral radiance L, also called
top-of-atmosphere radiance. The conversion coefficients are available in the metadata accom-
panying the images. Whenever possible the metadata values should be used, as coefficients
vary by platform and over time, but standard coefficients are given in Table 2.
Coefficients are provided in one of three band-specific formats: gain2and offset;Grescale (also
called gain) and Brescale (bias); or radiances associated with minimum and maximum DN
values (Lmax and Lmin). Any of the three can be used to convert from DN to at-sensor
radiance (Equation 2).
L=DN offset
gain2
L=Grescale DN +Brescale
L= ( Lmax Lmin
DN max DN min
)·(DN DN min ) + Lmin
(2)
It is not always clear which form of coefficients the metadata contain because “gain” has been
used to refer to both gain2and Grescale. Most recent image metadata provide Grescale and
Brescale , but these formulations are interconvertible (Equation 3). The magnitudes of Grescale
and gain2are similar for most bands, but the former is paired with Brescale , which will have
Landsat 5 Landsat 7 low gain Landsat 7 high gain
Grescale Brescale Grescale Brescale Grescale Brescale
Band 1 0.765827 2.29 1.180709 7.38 0.778740 6.98
0.671339 2.19 for TM images taken before 1991-12-31
Band 2 1.448189 4.29 1.209843 7.61 0.798819 7.20
1.322205 4.16 for TM images taken before 1991-12-31
Band 3 1.043976 2.21 0.942520 5.94 0.621654 5.62
Band 4 0.876024 2.39 0.969291 6.07 0.639764 5.74
Band 5 0.120354 0.49 0.191220 1.19 0.126220 1.13
Band 6 0.055376 1.18 0.067087 0.07 0.037205 3.16
Band 7 0.065551 0.22 0.066496 0.42 0.043898 0.39
Band 8 NA NA 0.975597 5.68 0.641732 5.34
Table 2: Default gain (Grescale;W m2sr1µm1DN 1) and bias (Brescale ;W m2sr1µm1)
for Landsat 5 (TM) and Landsat 7 (ETM) from Chander et al. (2009). The metadata will
state whether Landsat 7 images were taken at low gain or high gain.
Journal of Statistical Software 7
Landsat 5 Landsat 7
Band 1 1983 1997
Band 2 1796 1812
Band 3 1536 1533
Band 4 1031 1039
Band 5 220.0 230.8
Band 7 83.44 84.90
Band 8 NA 1362
Table 3: Extra-solar atmospheric constants (Esun ;W m2µm1) for Landsat 5 (TM) and 7
(ETM+) from Chander et al. (2009). (Note: The Landsat 5 column contained errors in
previous versions of this manuscript, corrected on 2012-01-08.)
negative values for non-thermal bands, while gain 2is paired with offset , which is positive for
non-thermal bands.
Grescale =Lmax Lmin
DN max DN min
Grescale =1
gain2
Brescale =Lmin Grescale DN min
Brescale =offset
gain2
gain2=1
Grescale
offset =Brescale
Grescale
(3)
All radiometric functions in landsat accept either gain and offset or Grescale and Brescale .
5.2. At-sensor reflectance
The at-sensor radiance values calculated using Equation 2must be corrected for solar vari-
ability caused by annual changes in the Earth-Sun distance d, producing unitless at-sensor
(or top-of-atmosphere) reflectance ρAS (Equation 4).
ρAS =πd2L
Esun cos θz
(4)
Esun is the band-specific exoatmospheric solar constant (Table 3;W m2µm1). The solar
zenith angle θzcan be derived from the image metadata, where the solar elevation angle θsis
usually included; θz= 90θs. At-sensor reflectance can be calculated using the radiocorr()
function with method = "apparentreflectance".
R> july4.ar <- radiocorr(july4, Grescale = 0.63725, Brescale = -5.1,
+ sunelev = 61.4, edist = ESdist("2002-07-20"), Esun = 1039,
+ method = "apparentreflectance")
8landsat: Analyzing Remote Sensing Data in R
5.3. Thermal bands
Band 6 contains thermal infrared data. Landsat 7 offers two thermal bands, while Land-
sat 5 provides one. Instead of calculating top-of-atmosphere reflectance, these data can be
converted to temperature (K) using thermalband(). This function provides default coeffi-
cients, and requires only the DN data and the band number (6 for Landsat 5; 61 or 62 for
Landsat 7).
R> july61.thermal <- thermalband(july61, band = 61)
6. Cloud identification
Clouds are reflective (high) in Band 1 and cold (low) in Band 6, so the ratio of the two
bands is high over clouds (Martinuzzi, Gould, and Gonz´alez 2007). The absolute value of this
ratio must be adjusted for data type, whether reflectance, radiance, or DN . The clouds()
function will create a cloud mask (1where clouds are present; NA where they are not) given
Band 1 and Band 6. The default parameters for the ratio level (level) and for adding a buffer
around the cloud edge (buffer) were adequate for the test data once converted to at-sensor
reflectance and temperature. This function can be used with DN data if the level argument
is adjusted appropriately. The mask does not demarcate areas of cloud shadow, but only the
clouds themselves (Figure 1e).
R> july.cloud <- clouds(july1.ar, july61.thermal)
R> nov.cloud <- clouds(nov1.ar, nov61.thermal)
7. Atmospheric correction
For most applications, ground reflectance is of greater interest than at-sensor reflectance so
atmospheric correction is required. Variation in atmospheric conditions at the time of over-
pass can overwhelm any changes in surface reflectance, so it is crucial to correct for these
differences. If measured atmospheric data such as optical depth are available an accurate
correction can be applied, but these are rarely available for retrospective studies. Most com-
monly, an image-based method is used instead. Two categories of corrections are available.
Relative normalization methods match the spectral characteristics of each image to a refer-
ence image in such a way that each transformed image appears to have been taken using the
same sensor and with the same atmospheric conditions as the reference image. Functions are
available for relative atmospheric correction methods using the entire image or unchanging
subsets.
Absolute atmospheric correction methods rely on a mechanistic understanding of atmospheric
effects to adjust each image individually. Instead of correcting to a reference image, informa-
tion contained in part of an image, for instance the darkest areas, is extracted and used to
correct the rest of the image. This extracted information substitutes for measured parameters.
Three such methods have been included here.
7.1. Relative atmospheric correction using the entire image
Statistical correction methods are entirely empirical and do not consider physical principles
Journal of Statistical Software 9
or atmospheric conditions in any way. These algorithms force the distribution of values in
one image to match that in another. Unless otherwise indicated, these methods can operate
on DN or reflectance.
Relative normalization
The most aggressive correction method is to regress all of the pixels in the image to be
corrected onto the corresponding pixels in the reference image. This method requires georef-
erenced images covering the same area and at the same resolution. Since both variables are
random, model II regression method such as Major Axis regression implemented in lmodel2()
from the lmodel2 package is recommended (Legendre 2008). The relnorm() function imple-
ments this method for spatial data, and returns both a corrected image and the coefficients
used. In the example given here, Band 4 of the July image is corrected to match the November
image.
R> july4.rncoef <- relnorm(nov4, july4, mask = july.cloud, nperm = 0)
R> july4.rn <- july4.rncoef[["newimage"]]
Histogram matching
Histogram matching forces the distribution of the DN values in one image to match another.
This algorithm is commonly used in other areas of image processing, and the images do not
need to cover the same area, or to match in any way. The histmatch() function should be
used only with the integer DN values. In the example given here, Band 4 of the July image
is corrected to match the November image.
R> july4.hmcoef <- histmatch(nov4, july4, mask = july.cloud)
R> july4.hm <- july4.hmcoef[["newimage"]]
For this image pair, histogram matching performs better than relative normalization (Fig-
ure 2). The former method forces the histogram for July to acquire the shape of the histogram
for November, while keeping the range of values (compare Figure 1c to Figure 2c). Relative
normalization created a similar histogram profile, but greatly compressed the dynamic range,
thereby losing much of the information contained in the data. This method also inverted the
values of the original image: the slope of the regression line was negative. This inversion
is a drawback of relative normalization methods, especially for image pairs where seasonal
vegetation differences are pronounced.
7.2. Relative atmospheric correction using a subset of the image
Using the entire image for statistical correction includes areas of the image that are likely
to change between dates, particularly vegetation, so the correction factors incorporate non-
atmospheric effects. As shown above this may have unwanted side effects. These corrections
may thus conceal actual changes between dates. Identifying pixels that cover developed areas
or other land uses that would be expected to possess constant reflectance properties from
date to date could produce more accurate statistical corrections. Two methods for doing so
are included in landsat: pseudo-invariant features and radiometric control sets.
10 landsat: Analyzing Remote Sensing Data in R
Figure 2: Whole-image relative atmospheric corrections for Band 4 of the July data.
Pseudo-invariant features
The reflectance of a specific developed area such as a large rooftop or parking lot should
not change seasonally, so differences in apparent reflectance in these areas between dates are
assumed to be due to atmospheric differences. Such areas, termed pseudo-invariant features
(PIF) by Schott, Salvaggio, and Volchok (1988), can be identified using Band 7 and the ratio
of Band 4 to Band 3. The Band 4 to Band 3 ratio is low where there is no vegetation,
including water and developed areas. Band 7 is low over water areas, so it can be used to
distinguish between the unvegetated areas identified using the Band 4 to Band 3 ratio. As
implemented here, the images must cover the same area at the same resolution, but that is not
a requirement of the method. The PIF() function can be used to identify invariant features
within an image using the above criteria. Visual inspection may be needed to ensure that
the threshold value is correct for the particular images being analyzed. Having an insufficient
number of PIF points is a concern with this method, especially with the small images in these
examples.
Journal of Statistical Software 11
Using the data included in the landsat package, November is the clearer image, as would be
expected because of the higher humidity in the summer months, so it is used to identify PIF
pixels. These PIFs are then used in major axis regression to correct the July image (only
Band 4 is shown).
R> nov.PIF <- PIF(nov3, nov4, nov7, level = 0.9)
R> july4.pifcorr <- lmodel2(nov4@data[nov.PIF@data[, 1] == 1, 1] ~
+ july4@data[nov.PIF@data[, 1] == 1, 1])
R> july4.pifcorr <- unlist(july4.pifcorr[["regression.results"]][2, 2:3])
The final steps in the correction are to use the original image as a template for putting
the corrected data into the SpatialGridDataFrame format, and to block out the previously-
identified cloud areas.
R> july4.pif <- july4
R> july4.pif@data[, 1] <- july4@data[, 1] * july4.pifcorr[2] +
+ july4.pifcorr[1]
R> july4.pif@data[!is.na(july.cloud@data[, 1]), 1] <- NA
Radiometric control sets
The radiometric control set (RCS) procedure implemented in RCS() takes a different approach
to choosing invariant features Hall, Strebel, Nickeson, and Goetz (1991). The tasseled cap
procedure (tasscap()) is used to identify the greenness and brightness components of the
image (Crist 1985;Crist and Kauth 1986;Huang, Wylie, Yang, Homer, and Zylstra 2002).
These are then used to identify dark sets and bright sets with low greenness: unvegetated
radiometric control sets. As with PIF(), the threshold value may need to be adjusted for
a particular set of images. The tasselled-cap procedure included in landsat contains default
values for at-sensor reflectance for both Landsat 5 (TM) and 7 (ETM+), although other
formulations are available in the literature. In the example given, reflectance is used for RCS
identification, but the adjustment is done on the DN values to maintain consistency with
previous correction examples.
The clearer November image is used to identify the radiometric control set, and major axis
regression is used to correct the July image based on that RCS. Finally the corrected data
are put in SpatialGridDataFrame format, and cloud areas removed.
R> nov.tasscap <- tasscap("nov", 7)
R> nov.RCS <- RCS(nov.tasscap)
R> july4.rcscorr <- lmodel2(nov4@data[nov.RCS@data[, 1] == 1, 1] ~
+ july4@data[nov.RCS@data[, 1] == 1, 1])
R> july4.rcscorr <- unlist(july4.rcscorr[["regression.results"]][2, 2:3])
R> july4.rcs <- july4
R> july4.rcs@data[, 1] <- july4@data[, 1] * july4.rcscorr[2] +
+ july4.rcscorr[1]
R> july4.rcs@data[!is.na(july.cloud@data[, 1]), 1] <- NA
The PIF correction compressed the range of DN values considerably (Figure 3c), losing much
of the detail in the original image (Figure 1a). Using only points identified by PIF() for both
12 landsat: Analyzing Remote Sensing Data in R
Figure 3: Subset-based relative atmospheric corrections for Band 4 of the July data.
dates, rather than just in the clearer November image set, did not improve the results. The
RCS correction reduced the dynamic range somewhat, and inverted the data range, as did
relative normalization (Figure 3d; Figure 2d). Of the four whole-image relative atmospheric
correction methods implemented here, only the histogram matching method produced accept-
able results.
The root mean square error (RM SE) between the target and reference image can be used
to assess the effectiveness of a relative correction. The greater the reduction in RMSE,
the more effective the transformation was at matching that image pair. The RMSE for
the untransformed Band 4 data was 0.2, and after histogram matching it was reduced to
0.067. Despite their poor overall performance in other ways, the other three methods reduced
RMSE values as much or more: 0.043 for relative normalization, 0.043 for PIF, and 0.065
for RCS. The correspondence between image pairs can increase even when other properties
such as range that are crucial for further analysis and interpretation are not preserved.
Journal of Statistical Software 13
7.3. Absolute atmospheric correction
This class of methods attempts to deduce values for atmospheric parameters from information
contained within the image itself rather than using externally-measured data. Each image is
treated on its own.
ρ=πd2(LLhaze )
Tv(Esun cos θzTz+Edown )(5)
The conversion of at-sensor radiance to atmospherically-corrected surface reflectance is de-
scribed in Equation 5. Absolute atmospheric correction methods use measurements or at-
mospheric simulation models to determine the parameters Tz,Tv,Edown , and Lhaze (Chavez
1989). The major difference between the relative atmospheric correction methods is the pro-
cedure for estimating these values. With default parameter choices, this simplifies to the
equation for at-sensor reflectance (Table 4; Equation 4).
Dark object subtraction
The dark object subtraction (DOS) method assumes that if there are areas in an image with
very low actual reflectance values, any apparent reflectance should be due to atmospheric
scattering effects, and this information can be used to calibrate the rest of the image (Chavez
1988,1989). The darkest pixels can be selected by examining the histogram of the DN values
in an image, or by setting a threshold such as ‘lowest DN value found in at least npixels,’ or
some other criterion appropriate for the size of image being analyzed. The chosen DN value,
the Starting Haze Value (SHV ) is then converted to radiance (e.g., Equation 2or similar
conversion). It is unlikely that most images contain entire pixels that are true black, so a
correction is applied that assumes a 1% actual reflectance of these areas.
L1% = 0.01Esun cos θz
πd2
Lhaze =SHV rad L1% (6)
The simplest form of DOS simply converts the calculated Lhaze value to at-sensor reflectances
(Equation 4) and subtracts it from the entire image (also converted to at-sensor reflectances).
A new SHV value must be calculated for each band (Chavez 1988).
The improved method developed by Chavez (1989) uses information from a single band to
calculate Lhaze values for the remaining bands of an image. This method produces correlated
haze values, and may be more accurate if there are few shadows or dark areas. Scattering
is band-specific, and the band effects are correlated with atmospheric conditions. The DOS
method implemented here uses a realistic relative atmospheric scattering model, and maintains
the spectral relationship between bands (Chavez 1989). This may be important for vegetation
Model TzTvEdown Lhaze
Apparent reflectance 1 1 0 0
DOS 1 1 0 SH V
COSTZ cos θzcos θv0SH V
DOS4 Iterative Iterative Iterative Iterative
Table 4: Parameters values used in four different atmospheric correction methods.
14 landsat: Analyzing Remote Sensing Data in R
indices whose values depend on the ratios between bands. The DOS method works poorly
for bands 5 and 7, hugely overestimating the Lhaze value. Only small components of the total
atmospheric scattering occurs in these bands, so the DN corrections for very clear atmosphere
were used for bands 5 and 7. The DOS() function will take the SHV value for the starting
band (usually Band 1, though bands 2 or 3 may be used), and calculate the related SHV
value for each other band.
DOS correction as implemented here is a two-step process. In this example, the July image
is corrected. Band 1 is used to find the SHV , here the lowest DN with at least 1000 pixels.
R> SHV <- table(july1@data[, 1])
R> SHV <- min(as.numeric(names(SHV)[SHV > 1000]))
R> SHV
[1] 69
That SHV (69) is then used in DOS() to find the corrected SHV values for the remaining
bands (Chavez 1989).
R> july.DOS <- DOS(sat = 7, SHV = SHV, SHV.band = 1, Grescale = 0.77569,
+ Brescale = -6.2, sunelev = 61.4, edist = ESdist("2002-07-20"))
R> july.DOS <- july.DOS[["DNfinal.mean"]]
R> july.DOS
coef-4 coef-2 coef-1 coef-0.7 coef-0.5
band1 68.308152 68.30815 68.30815 68.30815 68.30815
band2 41.927435 53.23414 60.23021 62.53285 64.12406
band3 25.580966 40.56327 52.31547 56.60737 59.69712
band4 14.858007 28.34164 43.02630 49.22788 53.96081
band5 8.443139 13.20415 25.72192 33.59098 40.69352
band7 8.130282 10.87164 21.16987 28.78982 36.18461
The above Routput shows the possible SHV values calculated for five scattering coefficients,
with the smallest coefficient (-4) describing the clearest conditions and -0.5 representing
extreme haze. For this example, Band 1 was used to calculate the SHV from the actual
image, so it is not adjusted for scattering coefficient. The user must select the appropriate
set of band-specific SHV values. Chavez (1989) suggested using the original SHV as a guide:
SHV 55 very clear (-4); SHV 56–75 clear (-2); SHV 76–95 moderate (-1); SHV 96–115
hazy (-0.7); SHV >115 very hazy (-0.5). The SHV for Band 1 from July was 69, so the
atmosphere was clear, and the corresponding column coef-2 was selected for use in DOS().
R> july.DOS <- july.DOS[, 2]
R> july2.DOSrefl <- radiocorr(july2, Grescale = 0.79569, Brescale = -6.4,
+ sunelev = 61.4, edist = ESdist("2002-07-20"), Esun = 1812,
+ Lhaze = july.DOS[2], method = "DOS")
R> july4.DOSrefl <- radiocorr(july4, Grescale = 0.63725, Brescale = -5.1,
+ sunelev = 61.4, edist = ESdist("2002-07-20"), Esun = 1039,
+ Lhaze = july.DOS[4], method = "DOS")
R> july4.DOSrefl@data[!is.na(july.cloud@data[, 1]), 1] <- NA
Journal of Statistical Software 15
COSTZ
Chavez (1996) improved on his earlier dark object subtraction methods by adding a correction
for the multiplicative transmittance component of the atmospheric scatter (cos θz, abbreviated
as COSTZ). In this revised procedure, cos θzis used as an approximation of Tz, and cos θvis
used as an approximation of Tv. For Landsat, the latter parameter is one because the satellite
sensor has a nadir view (θv= 0). The value for Lhaze is determined just as for the DOS
method. This method is not appropriate for use with bands 5 or 7; the original DOS method
should be used with these data (Song, Woodcock, Seto, Lenney, and Macomber 2001).
R> july4.COSTZrefl <- radiocorr(july4, Grescale = 0.63725, Brescale = -5.1,
+ sunelev = 61.4, edist = ESdist("2002-07-20"), Esun = 1039,
+ Lhaze = july.DOS[4], method = "COSTZ")
R> july4.COSTZrefl@data[!is.na(july.cloud@data[, 1]), 1] <- NA
Modified dark object subtraction
The modified dark object subtraction method (DOS4) was developed to incorporate the effect
of atmospheric aerosols into atmospheric correction (Song et al. 2001). The value for Lhaze
calculated in the DOS method is used. Corrected values for Tzand Tvare determined through
an iterative process. Both are set to initial values of one, and Equation 7is solved for τ, the
Rayleigh atmospheric optical depth. New values are calculated for Tzand Tv(Equation 8),
and the process is repeated until τstabilizes. This generally requires fewer than ten iterations.
τ=cos θzln 1Grescale DN min +Brescale 0.01(Eocos θzTz+Edown )Tv
Eocos θz(7)
Tv= exp (τ / cos θv)
Tz= exp (τ / cos θz)(8)
R> july4.DOS4refl <- radiocorr(july4, Grescale = 0.63725, Brescale = -5.1,
+ sunelev = 61.4, edist = ESdist("2002-07-20"), Esun = 1039,
+ Lhaze = july.DOS[4], method = "DOS4")
R> july4.DOS4refl@data[!is.na(july.cloud@data[, 1]), 1] <- NA
When compared to at-sensor reflectance, all three absolute atmospheric correction methods
produced similar results for the July test image (Figure 4). Each reduced the dynamic range,
but only slightly, and the overall appearance was similar to the original, unlike the results of
most of the relative correction methods.
Other research has found that COSTZ was effective in the visible-light bands (1-3), but less
accurate in the near-infrared, particularly in humid conditions (Wu, Wang, and Bauer 2005).
Song et al. (2001) found that DOS4 outperformed COSTZ or DOS.
8. Topographic correction
The interaction between sun angle, surface slope and satellite position produces variations in
surface reflectance unrelated to true reflectance. The same land use type can return a different
16 landsat: Analyzing Remote Sensing Data in R
Figure 4: Absolute atmospheric corrections for Band 4 of the July data.
signal if located on opposite sides of a hill. The intent of topographic correction is to remove
this source of variation, and leave only the portion of the reflectance signal actually due to
ground cover, thus converting the reflectance from an inclined surface (ρT) to that from an
equivalent horizontal surface (ρH).
The sun angle at the time of the November image was lower, so differential shading of the
north and south faces of the ridge in the middle of the image can be clearly seen (Figure 1b,
and see also Figure 1f). A successful topographic correction would eliminate this shading
difference; corrected images should appear flat.
8.1. Illumination
Slope and aspect are used in conjunction with solar and satellite parameters to model illu-
mination (IL) conditions. Information derived from the DEM is required to compute the
incident angle (γi), defined as the angle between the normal to the ground and the sun rays.
Journal of Statistical Software 17
The IL parameter varies from 1 (minimum illumination) to 1 (maximum illumination) and
is calculated from the slope angle θp, solar zenith angle θz, solar azimuth angle φaand aspect
angle φo(Equation 9).
IL = cos γi= cos θpcos θz+ sin θpsin θzcos (φaφo) (9)
The illumination is then used in one of several topographic correction methods. Seven such
algorithms are currently implemented in topocorr(). Many of these algorithms are described
and compared in Riano et al. (2003).
8.2. Lambertian methods
Lambertian methods assume that the reflectance for all wavelengths is constant regardless of
viewing angle. The correction factor is identical across all bands.
Cosine correction
The cosine method is a trigonometric approach that assumes that irradiance is proportional
to the cosine of the incidence angle (Teillet, Guindon, and Goodenough 1982). The cosine
correction discounts indirect illumination and saturates in dark areas.
ρH=ρTcos θz
IL (10)
R> dem.slopeasp <- slopeasp(dem)
R> nov4.cosine <- topocorr(nov4.ar, dem.slopeasp$slope, dem.slopeasp$aspect,
+ sunelev = 26.2, sunazimuth = 159.5, method = "cosine")
Improved cosine correction
The improved cosine method attempts to compensate for the overcorrection often seen in the
cosine method by including average illumination (IL) in the calculation (Civco 1989).
ρH=ρT+ρTIL IL
IL  (11)
R> nov4.cosimp <- topocorr(nov4.ar, dem.slopeasp$slope, dem.slopeasp$aspect,
+ sunelev = 26.2, sunazimuth = 159.5, method = "improvedcosine")
Gamma correction
The Gamma method is an extension of the cosine correction that adds terms for the sensor
view angle on flat terrain (θv) and on inclined terrain (βv). This is another attempt to reduce
overcorrection in areas with low illumination (Richter, Kellenberger, and Kaufmann 2009).
ρH=ρTγ=ρTcos θz+ cos θv
IL + cos βv
βv= 90(θv+θp)
(12)
18 landsat: Analyzing Remote Sensing Data in R
R> nov4.gamma <- topocorr(nov4.ar, dem.slopeasp$slope, dem.slopeasp$aspect,
+ sunelev = 26.2, sunazimuth = 159.5, method = "gamma")
Sun-canopy-sensor method
The sun-canopy-sensor method (SCS) was developed by Gu and Gillespie (1998) for use in
forested areas where the canopy geometry contributes to reflectance on a sloped surface (Gao
and Zhang 2009).
ρH=ρTcos θzcos θp
IL (13)
R> nov4.SCS <- topocorr(nov4.ar, dem.slopeasp$slope, dem.slopeasp$aspect,
+ sunelev = 26.2, sunazimuth = 159.5, method = "SCS")
8.3. Non-Lambertian methods
This group of methods recognizes that the combination of angles of incidence and observation
can affect reflectance: surface roughness matters.
Minnaert method
The Minnaert method adds a band-specific constant Kto the cosine method (Minnaert 1941,
in Riano et al. 2003; Equation 14). If K= 1, the Minnaert and cosine methods are equivalent
(Lambertian behavior is assumed).
ρH=ρTcos θz
IL K
(14)
The Kparameter is the slope of the ordinary linear regression with Kand ln ρHas regression
coefficients, and is constant across the entire image for each band (Equation 15).
ln(ρT) = ln(ρH) + Kln IL
cos θz
(15)
The topocorr() function with the argument method = "minnaert" will take care of both
steps, returning the corrected image.
R> nov4.minnaert <- topocorr(nov4.ar, dem.slopeasp$slope, dem.slopeasp$aspect,
+ sunelev = 26.2, sunazimuth = 159.5, method = "minnaert")
A further modification of the Minnaert method includes the slope explicitly as well as through
the illumination calculation (Colby 1991; Equation 16).
ρH=ρTcos θpcos θz
IL cos θpK
(16)
R> nov4.min2 <- topocorr(nov4.ar, dem.slopeasp$slope, dem.slopeasp$aspect,
+ sunelev = 26.2, sunazimuth = 159.5, method = "minslope")
Journal of Statistical Software 19
Figure 5: Topographic corrections for Band 4 of the November data.
20 landsat: Analyzing Remote Sensing Data in R
Figure 6: Topographic corrections for Band 4 of the November data (continued).
C-correction
The C-correction method is a statistical approach based on the ratio of the band-specific
regression coefficients band m, as given in Equation 17 (Teillet et al. 1982).
ρH=ρTcos θz+c
IL +c
ρT=b+mIL
c=b
m
(17)
R> nov4.ccor <- topocorr(nov4.ar, dem.slopeasp$slope, dem.slopeasp$aspect,
+ sunelev = 26.2, sunazimuth = 159.5, method = "ccorrection")
Riano et al. (2003) recommend smoothing the slope to reduce the overcorrection imposed
when the illumination is low. A smoothing option was added to slopeasp() to make this
Journal of Statistical Software 21
variant possible. They found the most improvement with a slope correction factor of 5, as
illustrated in the example.
R> dem.smooth5 <- slopeasp(dem, smoothing = 5)
R> nov4.ccorsmooth <- topocorr(nov4.ar, dem.smooth5$slope, dem.smooth5$aspect,
+ sunelev = 26.2, sunazimuth = 159.5, method = "ccorrection")
None of the topographic correction methods completely eliminated the shading effects due to
the central ridge (Figures 5,6). The gamma correction was particularly poor (Figure 5c), as
was the C-correction (Figure 6a). The cosine and Minnaert methods performed the best, but
in all the images the north-south topographic features along the main ridge can be clearly
seen.
The cosine and SCS methods also resulted in reflectances greater than one. The Minnaert
methods increased the dynamic range, though not the median, and the gamma correction
reduced both. Meyer, Itten, Kellenberger, Sandmeier, and Sandmeier (1993) found that the
Minnaert and C-correction methods gave similar results, but that the cosine method results
were very different. Other work found that Minnaert was superior under most conditions
(Richter et al. 2009).
9. Conclusions
Implementing atmospheric and topographic correction methods in Rwas straightforward,
given R’s good preexisting spatial capabilities. While large-scale production work is probably
best done in a GIS environment, the scripting features available in Rmake it a feasible solution
for small- and medium-sized tasks involving satellite remote sensing imagery.
The landsat package will be useful for researchers investigating the utility of these algorithms
for their own work, and for facilitating development of new methods. The topographic algo-
rithms especially appear to need further research. The wide range of sophisticated statistical
tools available in Rmake it suitable for research and development on these methods. The
open source nature of both Rand of the landsat package allow for examination and alteration
of the code base, facilitating further work on processing satellite imagery.
Acknowledgments
This work contributes to the Conservation Effects Assessment Project (CEAP), jointly funded,
coordinated and administered by the United States Department of Agriculture’s Natural Re-
sources Conservation Service, Agricultural Research Service, and Cooperative State Research,
Education, and Extension Service. Funding for this project was provided by the USDA-NRCS.
Forrest R. Stevens provided considerable assistance in locating a problem with an earlier ver-
sion of landsat.
Mention of trade names or commercial products in this publication is solely for the purpose
of providing specific information and does not imply recommendation or endorsement by the
US Department of Agriculture.
USDA is an equal opportunity provider and employer.
22 landsat: Analyzing Remote Sensing Data in R
References
Bivand RS, Pebesma EJ, Gomez-Rubio V (2008). Applied Spatial Data Analysis with R.
Springer-Verlag, New York.
Chander G, Markham BL, Helder DL (2009). “Summary of Current Radiometric Calibration
Coefficients for Landsat MSS, TM, ETM+, and EO-1 ALI Sensors.” Remote Sensing of
Environment,113, 893–903.
Chavez Jr PS (1988). “An Improved Dark-Object Subtraction Technique for Atmospheric
Scattering Correction of Multispectral Data.” Remote Sensing of Environment,24, 459–
479.
Chavez Jr PS (1989). “Radiometric Calibration of Landsat Thematic Mapper Multispectral
Images.” Photogrammetric Engineering and Remote Sensing,55, 1285–1294.
Chavez Jr PS (1996). “Image-Based Atmospheric Corrections Revisited and Improved.” Pho-
togrammetric Engineering and Remote Sensing,62, 1025–1036.
Civco DL (1989). “Topographic Normalization of Landsat Thematic Mapper Digital Imagery.”
Photogrammetric Engineering and Remote Sensing,55, 1303–1309.
Clarke KC, Lee SJ (2007). “Spatial Resolution and Algorithm Choice as Modifiers of Downs-
lope Flow Computed from Digital Elevation Models.” Cartography and Geographic Infor-
mation Science,34, 215–230.
Colby JD (1991). “Topographic Normalization in Rugged Terrain.” Photogrammetric Engi-
neering and Remote Sensing,57, 531–537.
Crist EP (1985). “A TM Tasseled Cap Equivalent Transformation for Reflectance Factor
Data.” Remote Sensing of Environment,17, 301–306.
Crist EP, Kauth RJ (1986). “The Tasseled Cap De-Mystified.” Photogrammetric Engineering
and Remote Sensing,52, 81–86.
Gao Y, Zhang W (2009). “LULC Classification and Topographic Correction of Landsat-
7 ETM+ Imagery in the Yangjia River Watershed: The Influence of DEM Resolution.”
Sensors,9, 1980–1995.
GDAL Development Team (2011). GDAL Geospatial Data Abstraction Library, Ver-
sion 1.7.3. Open Source Geospatial Foundation. URL http://www.gdal.org/.
Gu D, Gillespie A (1998). “Topographic Normalization of Landsat TM Images of Forest Based
on Subpixel Sun-Canopy-Sensor Geometry.” Remote Sensing of Environment,64, 166–175.
Hall FG, Strebel DE, Nickeson JE, Goetz SJ (1991). “Radiometric Rectification: Toward a
Common Radiometric Response Among Multidate, Multisensor Images.” Remote Sensing
of Environment,35, 11–27.
Huang C, Wylie B, Yang L, Homer C, Zylstra G (2002). “Derivation of a Tasseled Cap
Transformation Based on Landsat 7 At-Satellite Reflectance.” International Journal of
Remote Sensing,23, 1741–1748.
Journal of Statistical Software 23
Keitt TH, Bivand R, Pebesma E, Rowlingson B (2011). rgdal: Bindings for the Geospatial
Data Abstraction Library.Rpackage version 0.7-1, URL http://CRAN.R-project.org/
package=rgdal.
Legendre P (2008). lmodel2: Model II Regression.Rpackage version 1.6-3, URL http:
//CRAN.R-project.org/package=lmodel2.
Martinuzzi S, Gould WA, Gonz´alez OMR (2007). “Creating Cloud-Free Landsat ETM+
Data Sets in Tropical Landscapes: Cloud and Cloud-Shadow Removal.” General Technical
Report IITF-GTR-32, United States Department of Agriculture Forest Service International
Institute of Tropical Forestry.
Meyer P, Itten KI, Kellenberger T, Sandmeier S, Sandmeier R (1993). “Radiometric Correc-
tions of Topographically Induced Effects on Landsat TM Data in an Alpine Environment.”
ISPRS Journal of Photogrammetry and Remote Sensing,48, 17–28.
Minnaert M (1941). “The Reciprocity Principle in Lunar Photometry.” Astrophysics Journal,
37, 978–986.
RDevelopment Core Team (2011). R: A Language and Environment for Statistical Computing.
RFoundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL http:
//www.R-project.org/.
Riano D, Chuvieco E, Salas J, Aguado I (2003). “Assessment of Different Topographic Cor-
rections in Landsat-TM Data for Mapping Vegetation Types.” IEEE Transactions on Geo-
science and Remote Sensing,41, 1056–1061.
Richter R, Kellenberger T, Kaufmann H (2009). “Comparison of Topographic Correction
Methods.” Remote Sensing,1, 184–196.
Schott JR, Salvaggio C, Volchok WJ (1988). “Radiometric Scene Normalization Using Pseu-
doinvariant Features.” Remote Sensing of Environment,26, 1–16.
Song C, Woodcock CE, Seto KC, Lenney MP, Macomber SA (2001). “Classification and
Change Detection Using Landsat TM Data: When and How to Correct Atmospheric Ef-
fects?” Remote Sensing of Environment,75, 230–244.
Teillet PM, Guindon B, Goodenough DG (1982). “On the Slope-Aspect Correction of Multi-
spectral Scanner Data.” Canadian Journal of Remote Sensing,8, 84–106.
Unwin D (1981). Introductory Spatial Analysis. Methuen, London.
US Geological Survey (2010a). “Earth Resources Observation and Science Center.” Accessed
2010-09-20, URL http://eros.usgs.gov/.
US Geological Survey (2010b). “Landsat Missions.” Accessed 2010-09-20, URL http://
landsat.usgs.gov/.
Wu J, Wang D, Bauer ME (2005). “Image-Based Atmospheric Correction of QuickBird Im-
agery of Minnesota Cropland.” Remote Sensing of Environment,99, 315–325.
24 landsat: Analyzing Remote Sensing Data in R
A. Mathematical quantities
DN Digital number (integer, 0–255 for Landsat).
λ Wavelength (µm).
L At-sensor radiance (W m2sr1µm1).
ρAS At-sensor reflectance (unitless).
ρ Surface reflectance (unitless).
Grescale Band-specific gain (W m2sr1µm1DN 1).
Brescale Band-specific bias (W m2sr1µm1).
gain2 Band-specific gain (alternate version).
offset Band-specific (alternate version).
Lmax Radiance associated with maximum DN value (W m2sr1µm1).
Lmin Radiance associated with minimum DN value (W m2sr1µm1).
DN max Maximum DN value, usually 255.
DN min Minimum DN value, usually 0.
Esun Band-specific exoatmospheric solar constant (W m2µm1).
d Earth-Sun distance (AU).
Eo Distance-adjusted exoatmospheric solar constant d2/Esun.
θz Solar zenith angle (degrees); 90θs.
θs Solar elevation angle (degrees).
DOY Day of year; Julian date (days).
Lhaze Path radiance, upwelling atmospheric scattering (W m2sr1µm1).
Tv Atmospheric transmittance from surface to the sensor (W m2µm1).
Tz Atmospheric transmittance from the Sun to the surface (W m2µm1).
Edown Downwelling irradiance due to atmospheric scattering (W m2µm1).
SHV Starting haze value (DN ).
Z Elevation at pixel (i, j).
EW East-west component of slope.
NS North-south component of slope.
EW res East-west pixel resolution.
NSres North-south pixel resolution.
θp Slope angle (degrees).
θz Solar zenith angle (degrees).
θv Sensor zenith angle (degrees; 0for Landsat).
βv Sensor view angle on inclined terrain (degrees).
φa Solar azimuth angle (degrees)
Journal of Statistical Software 25
φo Aspect angle (degrees, with north = 0and moving clockwise).
ρH Reflectance of a horizontal surface (unitless).
ρT Reflectance of an inclined surface (unitless).
IL Illumination (1 to +1).
IL Average illumination across the image (1 to +1).
γi Incident angle between the normal to the ground and the sun rays (degrees).
τ Rayleigh atmospheric optical depth.
Affiliation:
Sarah C. Goslee
USDA-ARS Pasture Systems and Watershed Management Research Unit
Bldg. 3702 Curtin Rd.
University Park PA, 16802, United States of America
E-mail: Sarah.Goslee@ars.usda.gov
Journal of Statistical Software http://www.jstatsoft.org/
published by the American Statistical Association http://www.amstat.org/
Volume 43, Issue 4 Submitted: 2010-09-29
July 2011 Accepted: 2011-06-16
... The year-round Landsat 5/7/8 surface reflectance images, centered on the farming lands, were downloaded from the U.S. Geological Survey (USGS) Earth Explorer on 5 July 2022. These images underwent radiometric and atmospheric corrections using the radCor function of the RStoolbox package in the R language (R Core Team, 2021) [54]. Since Landsat 5 and 7 exhibit a good and consistent match in surface reflectance [55], no further adjustments were necessary regarding sensor geometry. ...
... Reflectance data in the blue (ρblue, 450-520 nm), red (ρred, 630-690 nm), and near-infrared (ρnir, 760-900 nm) bands were used to calculate the EVI [38]. It should be noted that the radCor processing applied to remove cloud and shadow pixels for atmospheric corrections was not perfect [54]. As a result, residual clouds and shadows may appear as distinct outliers in the EVI time series. ...
Article
Full-text available
Although the Landsat 30 m Enhanced Vegetation Index (EVI) products are important input variables in land surface models, recurring Landsat 5/7 EVI time series over cloud-prone, fragmented, and mosaic agricultural landscapes is still a great challenge. In this study, we put forward a simple, but effective "Light and Temperature-Driven Growth model and Double Logistic function fusion algorithm" (LTDG_DL). The empirical basis of the LTDG_DL algorithm was traced from the de Wit crop growth simulation model and the commonly observed nonlinear correlation between the EVI and the Leaf Area Index (LAI). It assimilates the ground daily solar radiation and air temperature to generate seasonal profiles of the empirical LAI and EVI and conducts the within season calibration of the empirical EVI by adjusting crop growth using cloud-free Landsat EVI observations. The initial date of seedling emergence (DOYini) and the accumulated Growing Degree Days for completion of the vegetative and Flowering stage (FGDDs) were variables to which the algorithm's accuracy was most sensitive. The variable-constrained optimization of the LTDG_DL algorithm was performed by loading the seedling emergence calendar of local prevailing crops and establishing an FGDD lookup table with an exhaustive sampling without replication method. Compared to temporal interpolation functions and Landsat-MODIS spatiotemporal fusion algorithms , the LTDG_DL algorithm had superior performance in the predictions of the EVI increment slope at the vegetative growth stage, the timing of the peak EVI, and the protection of key Landsat EVI observations over cloud-contaminated and complex landscape agricultural systems. Finally, the advantages and limitations of the LTDG_DL algorithm are discussed.
... The soil line corresponds to a linear relationship on the 2D plane of the soil spectral reflectance values between the near-infrared and red band values [85,86]. As there is no universal soil line for all soil types, as the spectral signatures vary with soil color, mineralogy, grain size, and moisture [87][88][89], we computed the soil line for our study area (by preliminarily removing the large waterbodies, such as lakes, bays, inlets, and ocean) with the BSL function from the Landsat package version 3.2.5 [90] using the R software version 4.2.1 [91] (hereafter referred to as "R"). MSAVI2 has a simpler algorithm and does not require a soil line plot to be generated or to specify the soil brightness correction factor as in SAVI [86]. ...
... Table S1: List of 38 layers used as potential predictors for the land cover classification of a polar desert area near Alert, NU, Canada. The specificity, source, or method of acquisition and the range of values (mean and sometimes standard deviation and minimum) are given for each layer [77][78][79][80][81]85,86,90,[93][94][95][96][156][157][158][159][160]; Table S2: Mean values and standard deviations of the 38 potential predictors per land cover class; Figure S1: Top view of illuminated canyon slopes (light areas, a and b) and saline soils along the coast (pale gray areas, c and d). Reference points were added in these types of areas as they were confused with snow in preliminary classifications.; Figure S2: Color-coded correlogram outlining the Pearson correlation coefficients (r) between all 38 predictors used in the study.; Figure S3: False color infrared (nearinfrared, red, and green bands) satellite imagery (top panel) and classified subareas (9 lower panels) within Alert to illustrate that the shadow produced by the rocks (examples pointed to by arrows in the top panel) classified as water, grass-dominated wetland, snow, or sedge-dominated wetland by RFs (random forests), ANNs (artificial neural networks), NB (naive Bayes), the EC (ensemble classifier), SVMs (support vector machines), LDA (linear discriminant analysis), CARTs (classification and regression trees), ML (maximum likelihood), and KNNs (K-nearest neighbors). ...
Article
Full-text available
The use of remote sensing for developing land cover maps in the Arctic has grown considerably in the last two decades, especially for monitoring the effects of climate change. The main challenge is to link information extracted from satellite imagery to ground covers due to the fine-scale spatial heterogeneity of Arctic ecosystems. There is currently no commonly accepted methodological scheme for high-latitude land cover mapping, but the use of remote sensing in Arctic ecosystem mapping would benefit from a coordinated sharing of lessons learned and best practices. Here, we aimed to produce a highly accurate land cover map of the surroundings of the Canadian Forces Station Alert, a polar desert on the northeastern tip of Ellesmere Island (Nunavut, Canada) by testing different predictors and classifiers. To account for the effect of the bare soil background and water limitations that are omnipresent at these latitudes, we included as predictors soil-adjusted vegetation indices and several hydrological predictors related to waterbodies and snowbanks. We compared the results obtained from an ensemble classifier based on a majority voting algorithm to eight commonly used classifiers. The distance to the nearest snowbank and soil-adjusted indices were the top predictors allowing the discrimination of land cover classes in our study area. The overall accuracy of the classifiers ranged between 75 and 88%, with the ensemble classifier also yielding a high accuracy (85%) and producing less bias than the individual classifiers. Some challenges remained, such as shadows created by boulders and snow covered by soil material. We provide recommendations for further improving classification methodology in the High Arctic, which is important for the monitoring of Arctic ecosystems exposed to ongoing polar amplification.
... The correction to TOA accounts for power, distance and elevation angle of the sun which varies with time and by location. These preprocessing steps were completed manually in R following Goslee (2011) with calibration coefficients (e.g., mean exoatmospheric solar irradiance) provided in Chander et al. (2009) andJanuar et al. (2020). To improve the temporal continuity between Landsat acquisitions, we developed per band functions for relative calibration with ordinary least squares regression (Roy et al., 2016;Young et al., 2017). ...
Preprint
Full-text available
Remote sensing data capture ecologically important information that can be used to characterize, model and predict bird habitat. This study implements fusion techniques using Random Forests (RF) with spectral Landsat data and structural airborne laser scanning (ALS) data to scale habitat attributes through time and to characterize habitat for four bird species in dynamic young forest environments in the United Kingdom. We use multi-temporal (2000, 2005, 2012/13, 2015) multi-sensor (Landsat and ALS) data to (i) predict structural attributes via pixel-level fusion at 30 metre spatial resolution, (ii) model bird habitat via object-level fusion and compare with models based on ALS, Landsat and predicted structural attributes, and (iii) predict bird habitat through time (i.e., predict 2015 habitat based on 2000-2012 data). First, we found that models predicting mean height from spectral information had the highest accuracy, whilst maximum height, standard deviation of heights, foliage height diversity, canopy cover and canopy relief ratio had good accuracy, and entropy had low accuracy. The green band and the normalized burn ratio (NBR) were consistently important for prediction, with the red and shortwave infrared (SWIR) 1 bands also important. For all structural variables, high values were underpredicted and low values were overpredicted. Second, for Blue Tit ( Cyanistes caeruleus ) and Chaffinch ( Fringilla coelebs ), the most accurate model employed Landsat data, while object-level fusion performed best for Chiffchaff ( Phylloscopus collybita ) and Willow Warbler ( Phylloscopus trochilus ). ALS mean, maximum and standard deviation of heights and Landsat tasseled cap transformations (TCT) (i.e., wetness, greenness and brightness) were ranked as important to all species across various models. Third, we used our models to predict presence in 2015 and implemented a spatial intersection approach to assess the predictive accuracy for each species. Blue Tit and Willow Warbler presences were well predicted with the Landsat, ALS, and objectlevel fusion models. Chaffinch and Chiffchaff presences were best predicted with the ALS model. Predictions based on pixel-level predicted structure surfaces had low accuracy but were acceptable for Chaffinch and Willow Warbler. This study is significant as it provides guidance for Landsat and ALS data application and fusion in habitat modelling. Our results highlight the need to use appropriate remote sensing data for each study species based on their ecology. Object-level data fusion improved habitat characterization for all species relative to ALS, but not to Landsat for Blue Tit and Chaffinch. Pixel-level fusion for predicting structural attributes in years where ALS data are note available is increasingly being used in modelling but may not adequately represent within-patch wildlife habitat. Finally, incorporating predicted surfaces generated through pixel-level fusion in our habitat models yielded low accuracy. Highlights We used object- and pixel-level fusion with ALS and Landsat to examine bird habitat Pixel-level fusion predicted surfaces yielded low accuracy in habitat models Best models: Landsat (Blue Tit, Chaffinch); fusion (Chiffchaff, Willow Warbler) Best prediction: ALS (Chaffinch, Chiffchaff) Best prediction: ALS, Landsat, object-level fusion (Blue Tit, Willow Warbler) Graphical abstract
... The first step consisted of calibrating the range of error where the error was randomly sampled and assigned to M SAV I2 values at each date. To perform this stage, we used an approach based on the darkest pixel principle applied in the context of atmospheric corrections [102] [103]. It consists of calculating an error value by examining the variability of pixels that are not expected to change over time. ...
Preprint
Full-text available
While satellite time series are essential tools to derive phenometrics at unprecedented spatial and temporal scales, non-systematic acquisition or medium spatial resolution of available missions are potentially problematic. At the same time, low-cost observation networks bridge the gap between satellite and in-situ observations which considerably increases ground-based data and associated possibilities. Here we provide robust statistics about the reliability of satellite derived phenometrics of urban trees across phenophases. Environmental and acquisition factors influencing the quality of phenometrics estimates were analyzed. First, a multi-facet regression-based analysis was conducted to measure discrepancies between PlanetScope (and Sentinel-2) and ground-based measurements across phenophases. Then, we performed hierarchical partitioning to tackle the effects of biological parameters (canopy closure and colour leaf) for assessing phenometrics with satellite time series. Third, we ran Monte-Carlo simulations to propagate errors according to viewing angles in PlanetScope acquisition. Our results show that: (i) PlanetScope provide consistent phenometric estimates for different tree layouts belonging to the same species (average R2R^{2} = 0.50 ±\pm 0.18). Performances are higher than Sentinel-2 but duration-based phenometrics estimates were poorly reconstructed with both satellite missions, (ii) contributions of biological parameters in the vegetation signal above trees strongly vary between growth periods. While canopy closure drives the growing season signal (independent contribution >> 40%), colour leaf plays a major role in the senescence season, (iii) variable viewing angles in PlanetScope acquisitions showed only significant effects on duration-based metrics estimates. Our research opens new perspectives for monitoring urban trees which improves the measurement of ecosystem services for local inhabitants.
... The atmospheric correction consists of transforming the numerical number DN into top-of-atmosphere (TOA) radiance and subsequently converting TOA reflectance into surface reflectance values. This process was carried out to generate a Level-2A (bottom -ofatmosphere, BOA) reflectance image using the dark object subtraction algorithm (DSO1) (Goslee, 2011). Moreover, ten spectral bands were exploited in this research, comprising four 10 m bands. ...
Article
Full-text available
Crop type identification is critical for agricultural sustainability policy development and environmental assessments. Therefore, it is important to obtain their spatial distribution via different approaches. Medium-, high- and very high-resolution optical satellite sensors are efficient tools for acquiring this information, particularly for challenging studies such as those conducted in heterogeneous agricultural fields. This research examined the ability of four multitemporal datasets (Sentinel-1-SAR (S1), Sentinel-2-MSI (S2), RapidEye (RE), and PlanetScope (PS)) to identify land cover and crop types (LCCT) in a Mediterranean irrigated area. To map LCCT distribution, a supervised pixel-based classification is adopted using Support Vector Machine with a radial basis function kernel (SVMRB) and Random Forest (RF). Thus, LCCT maps were generated into three levels, including six (Level I), ten (Level II), and fourteen (Level III) classes. Overall, the findings revealed high overall accuracies of >92%, >83%, and > 81% for Level I, Level II, and Level III, respectively, except for Sentinel-1. It was found that accuracy improves considerably when the number of classes decreases, especially when cropland or non-cropland classes are grouped into one. Furthermore, there was a similarity in performance between S2 alone and S1S2. PlanetScope LCCT classifications outperform other sensors. In addition, the present study demonstrated that SVM achieved better performances against RF and can thereby effectively extract LCCT information from high-resolution imagery as PlanetScope.
... only provide tools for processing individual Landsat scenes. For instance, 'landsat' includes functions for radiometric and topographic correction of Landsat scenes (Goslee 2011), while 'landsat8' includes functions for computing top of atmosphere reflectance, radiance, and/or brightness temperature on Landsat scenes (dos Santos 2017). Thus, the 'rgee' package makes it easier for ecologists to use the GEE platform and work with Landsat data. ...
Article
Full-text available
The Landsat satellites provide decades of near‐global surface reflectance measurements that are increasingly used to assess interannual changes in terrestrial ecosystem function. These assessments often rely on spectral indices related to vegetation greenness and productivity (e.g. Normalized Difference Vegetation Index, NDVI). Nevertheless, multiple factors impede multi‐decadal assessments of spectral indices using Landsat satellite data, including ease of data access and cleaning, as well as lingering issues with cross‐sensor calibration and challenges with irregular timing of cloud‐free acquisitions. To help address these problems, we developed the ‘LandsatTS' package for R. This software package facilitates sample‐based time series analysis of surface reflectance and spectral indices derived from Landsat sensors. The package includes functions that enable the extraction of the full Landsat 5, 7, and 8 records from Collection 2 for point sample locations or small study regions using Google Earth Engine accessed directly from R. Moreover, the package includes functions for 1) rigorous data cleaning, 2) cross‐sensor calibration, 3) phenological modeling, and 4) time series analysis. For an example application, we show how ‘LandsatTS' can be used to assess changes in annual maximum vegetation greenness from 2000 to 2022 across the Noatak National Preserve in northern Alaska, USA. Overall, this software provides a suite of functions to enable broader use of Landsat satellite data for assessing and monitoring terrestrial ecosystem function during recent decades across local to global geographic extents.
Article
Agriculture is increasingly putting pressure on biodiversity resources in the fragile environment of the Semi-Arid Argentine Pampas, specifically in the Pampa Medanosa region. This region, affected by wind erosion and deforestation, includes the Caldén Biogeographic Corridor. The Caldén (Neltuma caldenia B.), an endemic and threatened tree species, gives the corridor its name. Recent studies have mentioned the reactivation of wind erosion processes in response to non-sustainable soil management and the advance of the agricultural frontier. Thus, land cover analysis could help monitor land use change. The aim of this research was to analyze the temporal and spatial evolution of land use change in the Pampa Medanosa Region, from 2006/07 to 2018/19, through observation of land cover. For the analysis a supervised classification by mean of Random Forest (RF) algorithm was conducted. It achieves class predictions through the majority voting of decision tree groups, and it allows the combination of spectral and topographic information. The results showed an increase in agricultural surface area, mainly represented by soybeans and corn, over the cattle grazing areas. The agricultural expansion in the study area occurred fundamentally since the 2007/08 cropping season. Random Forest, in conjunction with R and Landsat satellite images, has shown great potential as a diagnostic tool for monitoring changes in land cover, ecosystem modification, and soil degradation problems and also as a tool for applying public policies and land planning.
Article
Full-text available
Texas Rio Grande Valley Red–crowned Parrots (Psittaciformes: Amazona viridigenalis [Cassin, 1853]) primarily occupy vegetated urban rather than natural areas. We investigated the utility of raw vegetation indices and their derivatives as well as elevation in modelling the Red–crowned parrot’s general use, nest site, and roost site habitat distributions. A feature selection algorithm was employed to create and select an ensemble of fine–scale, top–ranked MaxEnt models from optimally–sized, decorrelated subsets of four to seven of 199 potential variables. Variables were ranked post hoc by frequency of appearance and mean permutation importance in top–ranked models. Our ensemble models accurately predicted the three distributions of interest (x¯ Area Under the Curve [AUC] = 0.904–0.969). Top–ranked variables for different habitat distribution models included: (a) general use–percent cover of preferred ranges of entropy texture of Normalized Difference Vegetation Index (NDVI) values, entropy and contrast textures of NDVI, and elevation; (b) nest site–entropy textures of NDVI and Green–Blue NDVI, and percent cover of preferred range of entropy texture of NDVI values; (c) roost site–percent cover of preferred ranges of entropy texture of NDVI values, contrast texture of NDVI, and entropy texture of Green–Red Normalized Difference Index. Texas Rio Grande Valley Red–crowned Parrot presence was associated with urban areas with high heterogeneity and randomness in the distribution of vegetation and/or its characteristics (e.g., arrangement, type, structure). Maintaining existing preferred vegetation types and incorporating them into new developments should support the persistence of Red–crowned Parrots in southern Texas.
Book
Nowadays, dealing with data in all sciences is very critical. Data science and data engineering are exciting disciplines to turn data into bright understanding. Finding statistical characteristics and communicating data by some techniques such as visualization. R is a rapidly growing, statistical, open-source software with many libraries and packages. A core team of R developers and the R Foundation for Statistical Computing support it. Recently, many scholars in different fields have used R daily and produced publication-quality graphics. Hydrologists use this software to tidy, transform, visualize, and model their hydrometric data. There are many packages available in hydrology science. Hydroinformatics is a branch of informatics that deals with water purposes. They are starting from downloading hydrological data in special packages, cleaning up hydrological and climate data, managing data such as aggregating, dealing with missing data, extracting indicators, analyzing extreme events such as floods, droughts, bushfires, dealing with data scales, spatial and temporal dataset tools, dealing with surface and groundwater, hydrographs, rainfall, snowfall water quality, reservoirs packages, watershed modelling, soil water systems, evaporation, and multiple water-related packages. This chapter aims to analyze these packages by explaining the resources and their use in water and hydrology science, finding the gaps in existing packages, and suggesting researchers develop new packages.
Article
Full-text available
DEM-based topographic corrections on Landsat-7 ETM+ imagery from rugged terrain, as an effective processing techniques to improve the accuracy of Land Use/Land Cover (LULC) classification as well as land surface parameter retrievals with remotely sensed data, has been frequently reported in the literature. However, few studies have investigated the exact effects of DEM with different resolutions on the correction of imagery. Taking the topographic corrections on the Landsat-7 ETM+ images acquired from the rugged terrain of the Yangjiahe river basin (P.R. China) as an example, the present work systematically investigates such issues by means of two commonly used topographic correction algorithms with the support of different spatial resolution DEMs. After the pre-processing procedures, i.e. atmospheric correction and geo-registration, were applied to the ETM+ images, two topographic correction algorithms, namely SCS correction and Minnaert correction, were applied to assess the effects of different spatial resolution DEMs obtained from two sources in the removal of topographic effects and LULC classifications. The results suggested that the topographic effects were tremendously reduced with these two algorithms under the support of different spatial resolution DEMs, and the performance of the topographic correction with the 1:50,000-topographic-map DEM was similar to that achieved using SRTM DEM. Moreover, when the same topographic correction algorithm was applied the accuracy of LULC OPEN ACCESS Sensors 2009, 9 1981 classification after topographic correction based on 1:50,000-topographic-map DEM was similar as that based on SRTM DEM, which implies that the 90 m SRTM DEM can be used as an alternative for the topographic correction of ETM+ imagery when high resolution DEM is unavailable.
Article
Full-text available
A main problem encountered in radiometric calibration of satellite image data is correcting for atmospheric effects. Without this correction, an image digital number (DN) cannot be converted to a surface reflectance value. In this paper the accuracy of a calibration procedure, which includes a correction for atmospheric scattering, is tested. Two simple methods, a stand-alone and an in situ sky radiance measurement technique, were used to derive the HAZE DN values for each of the six reflectance Thematic Mapper (TM) bands. The DNs of two Landsat TM images of Phoenix, Arizona were converted to surface reflectances. -from Author
Article
Full-text available
This research investigated the dependence of computing downslope flow from digital elevation models (DEMs) on two basic elements of the flow calculation: (1) the spatial resolution of the DEM and (2) the algorithm used to compute the downslope flow, specifically how it defines topographic slope and aspect. Six algorithms for downslope flow and downslope accumulation were implemented in MATLAB using different assumptions about components of the computation from the terrain analysis literature. We analyzed the results of downslope flow statistically, and thresholded the values to yield streams, and so compared the different results with the actual surface streams as shown in the National Hydrographic Database. We repeated the computations using DEMs of 3m, 30m, and 90m resolutions covering a test area near Santa Barbara, CA. We conclude that downslope flow computations are overestimates, and are fraught with critical algorithmic assumptions and scale effects. They should be approached with great caution in GIS-based analyses.
Article
Full-text available
Four radiometric correction methods for the reduction of slope-aspect effects in a Landsat TM data set are tested in a mountainous test site with regard to their physical soundness, their influence on a forest classification as well as on the visual appearance of the scene. Excellent ground reference information and a fine resolution DEM allowed a precise assessment of the applicability of the methods under investigation. The results of the study presented here demonstrate the weakness of the classical cosine correction method for the radiometric correction in rugged terrain. The statistical, Minnaert and C-correction approaches however yielded an improvement of the forest classification and an impressive reduction of the visual topography effect.
Article
The electromagnetic radiation (EMR) signals collected by satellites in the solar spectrum are modified by scattering and absorption by gases and aerosols while traveling through the atmosphere from the Earth's surface to the sensor. When and how to correct the atmospheric effects depend on the remote sensing and atmospheric data available, the information desired, and the analytical methods used to extract the information. In many applications involving classification and change detection, atmospheric correction is unnecessary as long as the training data and the data to be classified are in the same relative scale. In other circumstances, corrections are mandatory to put multitemporal data on the same radiometric scale in order to monitor terrestrial surfaces over time. A multitemporal dataset consisting of seven Landsat 5 Thematic Mapper (TM) images from 1988 to 1996 of the Pearl River Delta, Guangdong Province, China was used to compare seven absolute and one relative atmospheric correction algorithms with uncorrected raw data. Based on classification and change detection results, all corrections improved the data analysis. The best overall results are achieved using a new method which adds the effect of Rayleigh scattering to conventional dark object subtraction. Though this method may not lead to accurate surface reflectance, it best minimizes the difference in reflectances within a land cover class through time as measured with the Jeffries–Matusita distance. Contrary to expectations, the more complicated algorithms do not necessarily lead to improved performance of classification and change detection. Simple dark object subtraction, with or without the Rayleigh atmosphere correction, or relative atmospheric correction are recommended for classification and change detection applications.
Article
Because trees are geotropic (perpendicular to the geoid), topography has no control over the Sun–crown geometry. What topography does influence is the relative positioning of trees and thus the amount of shadowing cast by them within the canopy. As satellite sensors in general measure the collective radiance of many trees inside their instantaneous field of view, the overall canopy brightness at the pixel scale is strongly controlled by canopy shadowing and hence by the topography. The removal of or compensation for topographic effects on forest images should be based on the normalization of mutual shadowing at the subpixel scale, rather than on the normalization of Sun–terrain–sensor geometry at the pixel scale. The Sun–canopy–sensor (SCS) topographic correction model was developed to characterize and hence correct the topographic effects on forest images. Testing with simulated image data showed the SCS model to be accurate (root-mean-squared residual error