
Automated Continuous Fields Prediction From Landsat Time Series: Application to Fractional Impervious Cover

Abstract

The characterization of fine temporal-resolution land surface dynamics from broadband optical satellite sensors is constrained by sparse acquisitions of high-quality imagery; interscene variation in radiometric, phenological, atmospheric, and illumination conditions; and subpixel variability in heterogeneous environments. In this letter, we address these concerns by developing and testing the automatic adaptive signature generalization and regression (AASGr) algorithm. Provided a robust reference map corresponding to the date of one image, AASGr automates the prediction of continuous fields maps from imagery time series that is adaptive to the spectral and radiometric characteristics of each target image and thereby requires neither atmospheric correction nor data normalization. We tested AASGr on a 22-year Landsat time series to quantify subannual impervious fractional cover dynamics in Houston, TX--an area characterized by a high degree of spatial heterogeneity in surface cover and high frequency in land cover change. The map time series achieved high accuracy in a three-part validation procedure and reveals spatio-temporal dynamics of urban intensification and extensification at a level of detail previously elusive in discrete classifications or coarse temporal-resolution map products. The automation of continuous fields time series is enabling a new generation of land surface products capable of characterizing precise morphologies along a continuum of spatio-temporal change. While AASGr was applied here to predict subpixel impervious fractional cover from Landsat imagery, the method is generalizable to a range of imagery and applications requiring dense continuous fields time series with uncertainty estimates of geophysical and biochemical characteristics, such as leaf area index, biomass, and albedo.
SUBMITTED VERSION. PUBLISHED VERSION AVAILABLE AT: https://ieeexplore.ieee.org/document/8727430
Index Terms: continuous fields, impervious cover, land cover
change, Landsat, machine learning, random forests, signature
generalization, time series, urbanization.
I. INTRODUCTION
The steady deployment of satellite remote sensing platforms
in recent decades has provided scientists with prodigious
data streams of medium spatial resolution, broadband imagery
for observing change on the Earth's surface [1]. Bi-temporal
change detection using image pairs has been used effectively to
quantify state change (e.g., land cover class) or relative change
in surface characteristics between two dates, but is unable to
capture higher-order temporal dynamics, including gradual
change, periodicity, and change rates [2]. Reflecting the
demand for more temporally frequent land surface data
products for disparate applications from land cover change to
biophysical land surface models [3], [4], the use of
multi-temporal image time series has increased rapidly [5].
Among all medium spatial resolution satellite sensors, the
Landsat program stands out for providing consistent,
multi-decadal, high-quality land surface imagery [6]. However, even
with Landsat sensor calibration and product quality assurance
[7], the consistent characterization of multi-temporal land
surface dynamics is impeded by inter-scene and inter-image
variation in radiometric, phenological, atmospheric, and
illumination/BRDF conditions [8]. The general scarcity of
high-quality, cloud-free image pairs at or near inter-annual
anniversary dates only exacerbates the challenge of ensuring
inter-date consistency [9]. A number of approaches have been
used to circumvent issues associated with sparsely acquired
imagery, including input data enhancements like
best-available-pixel composites, data blending, and multi-sensor
data fusion techniques [10], [11], as well as compromises in model
output such as the utilization of multi-year imagery for the
characterization of a single, nominal year [12]. Despite this,
inter-image discrepancies may still require onerous, and
potentially confounding, data correction and normalization
procedures that run the risk of exacerbating confusion between
radiometric differences among image dates (noise) and land
cover change (signal) [8].
Alongside the added value of fine temporal resolution time
series products, land surface models at medium spatial
resolution can benefit from more precise information on land
surface characteristics than simple discrete class designations.
This is especially so in spatially heterogeneous environments,
where critical information may be lost by classifying complex,
intergrading land surfaces as discrete classes, which can be
converted from one class to another but cannot register subtle
changes in intensity [13], [14].
Continuous fields pixel values offer several advantages over
discrete classifications by retaining maximum information
content and more precisely characterizing subpixel
heterogeneity [15].
Due to the demand for automated workflows for producing
temporally dense, continuous fields land surface time series, we
developed the automatic adaptive signature generalization and
regression (AASGr) algorithm. AASGr builds upon AASG
classification, a training data selection algorithm that adapts to
image noise and inter-scene variation [16], [17], to automate
the prediction of continuous fields land surface characteristics
based on a single reference map and time series imagery.
AASGr thus circumvents the resource-intensive and error-prone
process of manual training data selection, while ensuring
that all models in the series are individually tuned to unique
image characteristics and optimized for predictive accuracy.
This letter consists of four components: a description of the
AASGr algorithm (II.A), an experimental implementation to
quantify subannual impervious fractional cover dynamics over
a 22-year time series in Houston, TX (II.B), a three-part
validation of the map time series (II.C and III), and a short
discussion of applications and implications for the novel class
of products enabled by AASGr (IV).

Christopher R. Hakkenberg, Matthew P. Dannenberg, Conghe Song, and Giuseppe Vinci
Manuscript submitted December 3, 2018. (Corresponding author:
Christopher R. Hakkenberg.)
C. R. Hakkenberg is with the Department of Statistics and the Kinder Institute
for Urban Research, Rice University, Houston, TX 77251 USA (email:
ch55@rice.edu, chrishakkenberg@gmail.com).
M. P. Dannenberg is with the Department of Geographical and Sustainability
Sciences, University of Iowa, Iowa City, IA 52242 USA (email:
matthew-dannenberg@uiowa.edu).
C. Song is with the Department of Geography, University of North Carolina
at Chapel Hill, Chapel Hill, NC 27599 USA (email: csong@email.unc.edu).
G. Vinci is with the Department of Statistics, Rice University, Houston, TX
77251 USA (email: gv9@rice.edu).

II. METHODS
A. AASGr training data selection and predictive modeling
Provided a reference image (IR) paired with a reference map
(MR) from the same date, AASG automates training data
selection for prediction in a spatially coincident stack of target
imagery (IT) by first delineating ‘stable sites’ (locations
ostensibly not experiencing land cover change between the
dates of the IR and IT [17]), which are used as the basis for
signature extension from the reference date to the target date(s).
To do this, a series of image differences (∆I_i) is created, where:
$$\Delta I_i = I_R[x, y, z_i] - I_T[x, y, z_i]; \quad i = 1, \ldots, k \tag{1}$$
in the xy coordinate plane for the ith among k spectral bands (z).
Under the assumption that the majority of a sufficiently large
landscape did not undergo land cover change between the dates
of the IR and IT, pixels with stable land cover will tend to have
∆I_i values located at or near the mode of the image difference
histogram. By extension, unstable sites (pixels having
experienced significant land cover change) possess ∆I_i values
significantly dislocated from the histogram mode. Because
modal values are relative to the two images in question, stable
sites reflect relative stability between dates rather than absolute
spectral differences between images [17]. Thereafter,
band-specific difference images are combined into a multi-band
difference image (∆I), defined as:


where a pixel value of 0 in the ∆I would be expected for a
maximally stable pixel that exhibits the minimum possible
(relative) spectral difference between dates among all k bands.
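The stable-site logic above can be illustrated with a minimal, standard-library sketch. Eq. (1) is implemented as written. The way bands are combined is an assumption made only for illustration, because the combining formula is not reproduced here: each band difference is centred on its histogram mode, scaled by its standard deviation, and the bands are merged with a Euclidean norm, so that a score of 0 marks a maximally stable pixel. The function names and the `bin_width` parameter are hypothetical.

```python
import math
import statistics
from collections import Counter


def band_difference(reference_band, target_band):
    """Eq. (1): per-pixel difference between reference and target for one band."""
    return [r - t for r, t in zip(reference_band, target_band)]


def histogram_mode(values, bin_width=1.0):
    """Centre of the most populated histogram bin (bin width is an assumption)."""
    counts = Counter(round(v / bin_width) for v in values)
    fullest_bin = max(counts, key=counts.get)
    return fullest_bin * bin_width


def multiband_instability(reference, target, bin_width=1.0):
    """One instability score per pixel; 0 means 'at the mode in every band'.

    ASSUMED combination rule (not taken from the letter): Euclidean norm of
    mode-centred, standard-deviation-scaled band differences.
    `reference` and `target` are lists of bands, each a flat list of pixels.
    """
    n_pixels = len(reference[0])
    squared_sum = [0.0] * n_pixels
    for ref_band, tgt_band in zip(reference, target):
        diff = band_difference(ref_band, tgt_band)
        mode = histogram_mode(diff, bin_width)
        spread = statistics.pstdev(diff) or 1.0  # guard against zero spread
        for p, d in enumerate(diff):
            squared_sum[p] += ((d - mode) / spread) ** 2
    return [math.sqrt(s) for s in squared_sum]


# Two bands, six pixels; only the last pixel changes between dates.
reference = [[10, 10, 10, 10, 10, 50], [20, 20, 20, 20, 20, 80]]
target = [[12, 12, 12, 12, 12, 20], [23, 23, 23, 23, 23, 30]]
scores = multiband_instability(reference, target)
```

Note that the five unchanged pixels differ from the reference by a constant offset in each band, yet still score 0: stability is judged relative to the mode of the difference histogram, not by absolute spectral agreement.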
Concurrently, a spatio-temporally coincident reference map
(MR) consisting of continuous or consecutive integer values of
the response variable in question is stratified into m bins
spanning the range of pixel values in the MR. Then,
for each bin, we select pixels according to:

(3)
(4)
(5)
where the stability threshold equals σ × c, σ is the standard
deviation of the values in the ∆I, c is a user-defined threshold
parameter regulating the maximum allowed total sample size N, T is
the total number of pixels in the MR, sample counts are rounded to
the nearest integer, and (h,l) denotes a pixel location. That is,
pixels are randomly sampled without replacement from each of the m
sets of pixels whose corresponding value in the ∆I is less than the
threshold defining the set of stable sites. To ensure a maximally
representative training dataset optimized for prediction on
independent data, the number of stable site pixels sampled
from each set is proportional to the number of pixels in the
full MR whose values fall in the corresponding bin. For example,
for a Landsat image, the distribution of subsampled pixels
(n = 1 × 10^6) will closely resemble (R2 = 0.99) that of the full
dataset (n = 6 × 10^10), though at a fraction of the size and
consisting of only the most stable site pixels for model training
(Fig. 1). Provided that m is large enough to capture the full
distribution of values in the MR, the number of bins for
stratification is user-defined.
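The sampling rule described above (threshold on the ∆I, stratification into m bins, proportional allotment, random sampling without replacement) can be sketched as follows. This is an illustrative reading of the prose, not the authors' implementation: equal-width bins over [0, 1], the function name `sample_stable_sites`, and the cap on each quota when a bin has too few stable pixels are all assumptions.

```python
import random


def sample_stable_sites(ref_values, instability, threshold, n_total, n_bins, seed=0):
    """Return indices of sampled stable pixels.

    ref_values  : reference-map values in [0, 1], one per pixel
    instability : multi-band difference score per pixel (0 = most stable)
    threshold   : pixels scoring below this count as stable (sigma * c in the text)
    n_total     : maximum total sample size (N in the text)
    n_bins      : number of bins (m in the text); equal width is an assumption
    """
    rng = random.Random(seed)
    total_pixels = len(ref_values)  # T in the text

    def bin_of(value):
        # The top edge (value == 1.0) is folded into the last bin.
        return min(int(value * n_bins), n_bins - 1)

    bin_counts = [0] * n_bins                 # all pixels per bin
    candidates = [[] for _ in range(n_bins)]  # stable pixels per bin
    for idx, (value, score) in enumerate(zip(ref_values, instability)):
        b = bin_of(value)
        bin_counts[b] += 1
        if score < threshold:
            candidates[b].append(idx)

    sampled = []
    for b in range(n_bins):
        # Proportional allotment, rounded to the nearest integer.
        quota = round(n_total * bin_counts[b] / total_pixels)
        # Assumed safeguard: never request more pixels than are stable.
        quota = min(quota, len(candidates[b]))
        sampled.extend(rng.sample(candidates[b], quota))
    return sampled


# 100 pixels in three value groups; the first five pixels are unstable.
ref_values = [0.05] * 60 + [0.55] * 30 + [0.95] * 10
instability = [5.0] * 5 + [0.1] * 95
picked = sample_stable_sites(ref_values, instability, threshold=1.0,
                             n_total=20, n_bins=10)
```

Because the quotas follow the bin populations of the full map rather than those of the stable subset, the sample keeps the 60/30/10 split of the reference values even though some pixels were excluded as unstable.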
Fig. 1. Reference map subsampling. (a) Density histogram for reference pixel
values from NLCD 2001 impervious and subsampled pixels, (b) QQ plot for all
reference pixel values versus those in the subsample.
While a priori training data stratification and proportional
allotment provide an efficient method for sampling stable site
pixels among bins, sampled pixels in the MR retain
their original (unbinned) continuous or integer values. Once the
location of all stable site pixels in the MR is determined, a full
training dataset is compiled from the stable site values in the MR
and spatially corresponding pixels in the IT to predict a continuous
fields target map (MT) and associated uncertainties from the full IT.
To summarize, the algorithmically-generated training data set
exhibits three desirable properties:
(1) multi-band stable sites: sampled pixels exhibit the minimum
relative spectral difference across multiple spectral bands
between dates in reference and target imagery;
(2) proportional allotment: the distribution of sample values is
proportional to that of the full reference map, thereby
ensuring representation across the range of parameter
space for optimized prediction on an independent dataset;
(3) random stratified sampling: within stratified bins of
candidate stable sites, sample selection is randomized.
AASGr is not beholden to any one sensor or regression
model, and in this experiment predictive regression was
performed on Landsat imagery using random forests (RF), an
ensemble of regression trees based on votes across bootstrap
replicates [18]. As an ensemble algorithm in which a random
subset of predictors is considered at each tree node, RF is able
to efficiently handle data noise, and is noted for its record of
high predictive accuracy and generalizability [19]. These
properties make it attractive for Landsat image time series, as RF
has been shown to effectively handle collinearity among spectral
bands, noise due to atmospheric and radiometric contamination, and
georegistration issues arising from image misalignment [19].
B. Experimental implementation
AASGr was tested on a 22-year Landsat image stack covering
a 2720 km² portion of central Houston, TX (see Fig. 2a).
Houston’s spatially heterogeneous cover and its rapid growth
from 1997-2018 make it a compelling test case for assessing
the performance of AASGr in detecting fractional impervious
cover as a continuous field. Imagery consisted of all available
radiometrically calibrated and orthorectified Landsat
Collection 1 Level-1 imagery (WRS2 path/row 25/39)
possessing <10% cloud cover. In total, 66 images fulfilled these
criteria. Imagery spanned a range of phenological states
(days of year, DOYs) and atmospheric conditions across three
satellites/sensors: Landsat 5 TM for 1997-2011, Landsat
7 ETM+ for 1999-2012, and Landsat 8 OLI for 2013-2018.
[Fig. 1 panels: (a) density of reference pixel values (0.00-1.00) for all pixels and for the subsample; (b) QQ plot of all pixels versus the subsample, R2 = 0.9998, shaded by frequency.]
Reference maps (MR) consist of wall-to-wall subpixel
impervious fraction maps possessing continuous values
between 0.00-1.00 from the U.S. Geological Survey (USGS)
National Land Cover Database (NLCD) Percent Developed
Imperviousness product from 2001, 2006, and 2011 [12]. The
three reference maps were paired with Landsat reference
imagery for each respective year, and applied to the most
temporally proximate target imagery (i.e., IR-2001 and MR-2001 were
paired with IT-1997 - IT-2003, IR-2006 and MR-2006 with IT-2004 - IT-2008,
and IR-2011 and MR-2011 with IT-2009 - IT-2018).
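The pairing of target years with reference years listed above amounts to a small lookup. The sketch below only encodes the three year ranges stated in the text; the function name `reference_year_for` is hypothetical.

```python
def reference_year_for(target_year):
    """Return the NLCD reference year paired with a target image year."""
    # (first target year, last target year, reference year), as stated in the text
    pairings = [
        (1997, 2003, 2001),
        (2004, 2008, 2006),
        (2009, 2018, 2011),
    ]
    for first, last, reference in pairings:
        if first <= target_year <= last:
            return reference
    raise ValueError(f"no reference map paired with target year {target_year}")
```

Raising an error for years outside 1997-2018 keeps an out-of-range image from being silently matched to the nearest reference map.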
Fig. 2. Houston, TX study area. (a) predicted impervious fractional cover for
August 28, 2011 where insets refer to the extent of validation images, and (b)
the standard deviation of the random forest’s predictive posterior distribution.
Before prediction, all clouds, cloud shadows, ETM+ SLC-off
gaps, and radiometrically saturated or contaminated pixels
were algorithmically masked based on quality assessment
bands [20]. Except for these masked areas, which are treated as
data gaps in all predictive modeling, RF regression models
yield posterior predictive distributions based on the votes of all
trees in the RF model [21]. Lacunae are interpolated post hoc,
based on temporally adjacent pixels in the prediction time
series. For this interpolation procedure, a low-pass filter using a
Gaussian kernel in a five-year window was applied to each pixel
in the temporal dimension of the full time series.
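A minimal sketch of this kind of temporal gap filling for a single pixel is given below. Several details are assumptions, since the text specifies only a Gaussian kernel and a five-year window: the kernel width `sigma_years`, the choice to centre the window on the gap (2.5 years either side), and the choice to fill only masked dates while leaving valid predictions untouched. The function name is hypothetical.

```python
import math


def gaussian_fill(times, values, window_years=5.0, sigma_years=1.0):
    """Fill None entries with a Gaussian-weighted mean of valid neighbours in time.

    times  : acquisition dates as decimal years, one per image
    values : predicted fractions for one pixel, None where masked
    """
    half_window = window_years / 2.0
    filled = []
    for t_i, v_i in zip(times, values):
        if v_i is not None:
            filled.append(v_i)  # valid predictions are kept as they are
            continue
        weight_sum = 0.0
        weighted_total = 0.0
        for t_j, v_j in zip(times, values):
            if v_j is None or abs(t_j - t_i) > half_window:
                continue  # skip other gaps and dates outside the window
            weight = math.exp(-0.5 * ((t_j - t_i) / sigma_years) ** 2)
            weight_sum += weight
            weighted_total += weight * v_j
        # A gap with no valid neighbour in the window stays a gap.
        filled.append(weighted_total / weight_sum if weight_sum > 0 else None)
    return filled


series = gaussian_fill([2000.0, 2001.0, 2002.0], [0.2, None, 0.4])
```

With equally distant neighbours on either side, the filled value is their plain average; closer dates dominate when the spacing is uneven.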
Parameter optimization, performed to balance efficiency (run
time) and predictive accuracy based on an external comparison
with independent NLCD maps, resulted in the selection of
140,000 pixels as the total sample size (N) and 10 bins (m), with
the following RF hyperparameters: 300 trees per model and 1
predictor sampled at each split. Sensitivity analysis confirmed that model
performance was largely robust to parameter values. Among all
Landsat band combinations and derived indices, the difference
of the blue and near-infrared bands yielded the highest
prediction accuracies in model testing, and was thereby adopted
for all model runs. To prevent the mischaracterization of
temporarily docked waterborne vessels as terrestrial impervious
surface, a mask based on unchanged water pixels in NLCD
2001 and 2011 maps was applied to the full time series.
C. Accuracy assessment
Predictive maps were validated via a three-part accuracy
assessment. First, an in-sample out-of-bag (OOB) estimate of model
performance (pseudo-R2) was derived for every model run.
Second, all pixels in predictive maps were compared with
NLCD impervious maps for coincident years (i.e. 2001, 2006,
and 2011) and the strength of their agreement was assessed via
adjusted R2. To maintain independence of training and
validation data, training data was constrained to the two NLCD-
Landsat sets not corresponding to the year of prediction, and
their results averaged to produce a single metric of agreement.
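The agreement statistics reported in Table 1 (adj-R2, RMSE, MAE, and bias) can be computed for a pair of maps as in the sketch below. The letter does not spell out its formulas, so the definitions here are assumptions: R2 is taken as the squared Pearson correlation between predicted and observed values, adjusted for a single predictor, and bias is the mean of predicted minus observed. The function name is hypothetical.

```python
import math


def agreement_metrics(predicted, observed):
    """Return adj-R2, RMSE, MAE, and bias for two equal-length value lists."""
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    errors = [p - o for p, o in zip(predicted, observed)]

    rmse = math.sqrt(sum(e * e for e in errors) / n)
    mae = sum(abs(e) for e in errors) / n
    bias = sum(errors) / n  # positive means overprediction on average

    # Squared Pearson correlation, i.e. R2 of a one-predictor linear fit.
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    r2 = cov * cov / (var_p * var_o)

    # Adjustment for one predictor: (n - 1) / (n - p - 1) with p = 1.
    adj_r2 = 1.0 - (1.0 - r2) * (n - 1) / (n - 2)
    return {"adj_r2": adj_r2, "rmse": rmse, "mae": mae, "bias": bias}


metrics = agreement_metrics([0.1, 0.3, 0.5, 0.7], [0.0, 0.2, 0.4, 0.6])
```

The toy input is a constant overprediction of 0.1, which shows why the metrics are reported together: the correlation-based adj-R2 is perfect while RMSE, MAE, and bias all register the offset.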
Fig. 3. Validation map (MV) classification. (a) 3m resolution validation imagery
(IV); (b) validation classification (CV) at 3m IV native resolution; (c) 3m binary
pervious/impervious CV; (d) resampled impervious cover validation map (MV)
at 30m Landsat spatial resolution.
As a third test of map accuracy, three 3m resolution Quickbird
validation images (IV) from 2005, 2007, and 2013 were used for
independent validation with spatio-temporally coincident map
subsets (Fig. 2a). Cloud masks and regions of interest for five
primary land cover types (forest, grassland, urban, water, and
barren) in the IV were manually delineated, and used to train a
random forest classifier. The resulting validation maps possess
five-class overall accuracies of 0.83, 0.84, and 0.85 for the three
dates, respectively, with the largest confusion occurring
between Barren and Urban classes, a not uncommon result in
urban classification [22]. Thereafter, the five-class validation
classification (CV) was converted to a binary urban-nonurban
classification and resampled to a 30m resolution validation map
(MV) based on the aggregate of all urban (impervious)
subpixels. Aggregated, binary urban-nonurban validation maps
possess overall accuracies of 0.91, 0.91, and 0.92, respectively.
Subpixel impervious fraction in the resampled MV is calculated
as:
$$M_V[h,l] = \frac{1}{N} \sum_{i=1}^{N} C_{V,i} \tag{6}$$
where N is the total number of pixels in CV corresponding to
pixel (h,l) in the MV, and C_{V,i} is the binary value of the ith
such pixel. For example, for a 3m binary CV, a single
30m aggregate pixel consists of 100 subpixels, each possessing
a value of 0 (pervious) or 1 (impervious). Summing the
subpixels and dividing by 100 renders an estimated subpixel
impervious fraction in the MV at the coarser 30m resolution
(Fig. 3). Thus devised, AASGr-generated maps (MT) can be
directly compared with corresponding pixels in the validation
subsets in the MV, and assessed for accuracy based on adj-R2.
[Fig. 3 legends: land cover classes (forest, grass, water, barren, impervious); binary classes (impervious, pervious); fractional impervious cover from 0% to 100%. Scale bars span 0-2 km.]
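The aggregation from a 3m binary classification to a 30m impervious fraction is a block average, sketched below with nested lists. The function name `aggregate_fraction` and the `factor` parameter are hypothetical; the default of 10 reflects the 3m-to-30m example in the text (10 × 10 = 100 subpixels per output pixel). Rows or columns left over when the grid size is not a multiple of `factor` are simply ignored in this sketch.

```python
def aggregate_fraction(binary_grid, factor=10):
    """Average factor x factor blocks of 0/1 values into fractional cover."""
    rows = len(binary_grid) // factor
    cols = len(binary_grid[0]) // factor
    fractions = []
    for r in range(rows):
        out_row = []
        for c in range(cols):
            # Sum the impervious (1) subpixels inside this coarse pixel.
            block_sum = sum(
                binary_grid[r * factor + i][c * factor + j]
                for i in range(factor)
                for j in range(factor)
            )
            out_row.append(block_sum / (factor * factor))
        fractions.append(out_row)
    return fractions


# A 10 x 20 grid: the left block has a 5 x 5 impervious corner,
# the right block is fully impervious.
grid = [[1 if (r < 5 and c < 5) or c >= 10 else 0 for c in range(20)]
        for r in range(10)]
coarse = aggregate_fraction(grid)  # [[0.25, 1.0]]
```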
III. RESULTS
In experiments, internal OOB pseudo-R2 for all 66 image
predictions ranged from 0.76 to 0.90 (μ = 0.83, σ = 0.03) [23].
Visual observation confirms that AASGr-generated maps
accurately reproduce known patterns in impervious surface
cover (Fig. 2a). The standard deviation of posterior votes serves
as a measure of uncertainty (Fig. 2b). When compared to
corresponding, but independent, NLCD maps from dates not
used for model training, AASGr predictions showed a high
degree of agreement based on adj-R2 (Table 1). As no one map
is authoritative, agreement does not directly correspond with
accuracy and could reflect or obscure errors in any one image
or errors in both [4].
TABLE 1
AGREEMENT & VALIDATION
Validation Data   Year   adj-R2   RMSE   MAE    bias
NLCD              2001   0.82     0.14   0.09    0.01
NLCD              2006   0.77     0.16   0.11    0.02
NLCD              2011   0.77     0.16   0.11   -0.01
Quickbird         2005   0.72     0.15   0.11   -0.05
Quickbird         2007   0.80     0.14   0.11    0.03
Quickbird         2013   0.79     0.14   0.11   -0.01
Independent accuracy assessment using three classified and
resampled fine-resolution images indicates accuracies in line
with NLCD agreement metrics, and comparable to those
observed in other studies [13], [15], though with the added
benefit of a continuous fields output at a subannual resolution
(Table 1). Scatter charts of agreement show a slight deviation
from the 1:1 line, indicative of some boundary bias (Fig. 4;
Table 1). This compression of the posterior distribution reflects
empirical limitations of ensemble classifiers that, while
optimizing total accuracy, extract predictions from the mean of
the vote posterior and thereby tend to underestimate extreme
values at the poles of the range [24].
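The compression at the ends of the range can be reproduced with a toy simulation. This is not a random forest: as a stated simplification, each "tree" votes the true fraction plus Gaussian noise, and votes are clipped to [0, 1] because no tree can predict outside the range of its training responses. The function names, the number of trees used as a default, and the noise level are illustrative assumptions. The mean of the votes serves as the prediction and their standard deviation as the uncertainty.

```python
import random
import statistics


def simulate_votes(true_fraction, n_trees=300, noise_sd=0.1, seed=0):
    """Toy per-tree votes: truth plus noise, bounded to the valid range."""
    rng = random.Random(seed)
    votes = []
    for _ in range(n_trees):
        noisy = true_fraction + rng.gauss(0.0, noise_sd)
        votes.append(min(1.0, max(0.0, noisy)))  # votes cannot leave [0, 1]
    return votes


def summarize_votes(votes):
    """Prediction and uncertainty as mean and standard deviation of the votes."""
    return statistics.mean(votes), statistics.pstdev(votes)


pred_zero, sd_zero = summarize_votes(simulate_votes(0.0))
pred_mid, sd_mid = summarize_votes(simulate_votes(0.5))
pred_one, sd_one = summarize_votes(simulate_votes(1.0))
```

Because errors at the bounds can only point inward, the averaged prediction for a fully impervious pixel falls below 1 and that for a fully pervious pixel rises above 0, while mid-range values are recovered with little distortion.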
Fig. 4. Validation results. AASGr prediction versus NLCD (left
column) and versus independently classified, 3m resolution
Quickbird images (right column).
IV. DISCUSSION
AASGr fully automates model parameterization and
prediction of a continuous fields response variable from time
series imagery, achieving high predictive accuracy in
experiments as it efficiently adapts to inconsistencies in
multi-temporal imagery. Automated signature generalization
algorithms are noteworthy because streamlined workflows enable the
estimation of land surface characteristics at previously elusive
spatio-temporal resolutions. Unlike discrete classifications,
continuous fields regression is well suited for producing
confidence intervals critical to secondary applications that
require uncertainty estimates of state values, such as
deriving unbiased areal estimates of land cover change or
supplying input maps for process-based land surface models
[3], [4].
In this implementation, AASGr was tested on a Landsat
imagery time series to estimate subannual subpixel impervious
fractional cover over a 22-year period in the rapidly urbanizing
city of Houston, TX. The resulting map time series is significant
in that it showcases the utility of AASGr to quantify subannual
land cover dynamics and subpixel heterogeneity (Fig. 5).
AASGr-enabled time series are therefore capable of
simultaneously characterizing both urban extensification
(conversion of pervious to impervious surface) and
intensification (changing intensity of fractional impervious
cover in any one pixel), a feat otherwise unattainable with hard
classifiers. This level of precision is noteworthy for
next-generation urban land cover change applications requiring
precise characterizations of land cover change morphologies
along a continuum of spatio-temporal change. Land cover time
series that are near-continuous in space and time offer several
advantages over coarse spatial resolution, temporally sparse time
series by more precisely capturing heterogeneity in spatially
complex areas, thereby better lending themselves to the
derivation of indices of continuous surface metrology [25].
[Fig. 4 panels: AASGr versus NLCD for 2001, 2006, and 2011, and AASGr versus Quickbird for 2005, 2007, and 2013; both axes span 0.00-1.00 and points are shaded by frequency.]
Fig. 5. Continuous change in impervious cover for three dates in Houston, TX.
Intermediate colors represent overlap between dates.
V. CONCLUSION
In this letter, we present the automatic adaptive signature
generalization and regression (AASGr) algorithm. AASGr is
fully automated for time series prediction in that it is adaptive
to the spectral and radiometric characteristics of target imagery,
and thereby requires neither atmospheric correction nor data
normalization. Provided a robust reference map paired with an
image from the same date, AASGr can predict highly accurate
continuous response values and associated uncertainties for
time series imagery before and after the reference date. This
quality makes it attractive for diverse applications requiring
multi-date land surface information where reference data are
otherwise limited. In this implementation, we tested AASGr for
estimating fractional impervious cover in a rapidly urbanizing
city, demonstrating its capacity to characterize heterogeneity
and intensity in spatially complex areas, as well as higher-order
temporal dynamics and change rates. AASGr is not limited to
estimating subpixel land cover fractions, and is amenable to a
range of applications requiring dense, continuous fields raster
map time series with uncertainty estimates, including land
surface characteristics like leaf area index, biomass, and albedo.
REFERENCES
[1] A. S. Belward and J. O. Skøien, “Who launched what, when and why;
trends in global land-cover observation capacity from civilian earth
observation satellites,” ISPRS J. Photogramm. Remote Sens., vol. 103,
pp. 115–128, 2015.
[2] C. R. Hakkenberg, M. P. Dannenberg, C. Song, and K. B. Ensor,
“Characterizing multi-decadal, annual land cover change dynamics in
Houston, TX based on automated classification of Landsat imagery,”
Int. J. Remote Sens., vol. 40, no. 2, pp. 693–718, 2019.
[3] A. M. Fox, T. J. Hoar, J. L. Anderson, A. F. Arellano, W. K. Smith, M.
E. Litvak, N. MacBean, D. S. Schimel, and D. J. P. Moore, “Evaluation
of a data assimilation system for land surface models using
CLM4.5,” J. Adv. Model. Earth Syst., pp. 1–24, 2018.
[4] P. Olofsson, G. M. Foody, M. Herold, S. V. Stehman, C. E. Woodcock,
and M. A. Wulder, “Good practices for estimating area and assessing
accuracy of land change,” Remote Sens. Environ., vol. 148,
pp. 42–57, 2014.
[5] Z. Zhu, “Change detection using Landsat time series: A review of
frequencies, preprocessing, algorithms, and applications,” ISPRS J.
Photogramm. Remote Sens., vol. 130, pp. 370–384, 2017.
[6] B. L. Markham and D. L. Helder, “Forty-year calibrated record of
earth-reflected radiance from Landsat: A review,” Remote Sens.
Environ., vol. 122, pp. 30–40, 2012.
[7] J. E. Vogelmann, A. L. Gallant, H. Shi, and Z. Zhu, “Perspectives on
monitoring gradual change across the continuity of Landsat sensors
using time-series data,” Remote Sens. Environ., vol. 185, pp. 258–270,
2016.
[8] C. Song, C. E. Woodcock, K. C. Seto, M. P. Lenney, and S. A.
Macomber, “Classification and change detection using Landsat TM data:
When and how to correct atmospheric effects?,” Remote Sens. Environ.,
vol. 75, no. 2, pp. 230–244, 2001.
[9] C. Gómez, J. C. White, and M. A. Wulder, “Optical remotely sensed
time series data for land cover classification: A review,” ISPRS J.
Photogramm. Remote Sens., vol. 116, pp. 55–72, 2016.
[10] Z. Zhu, C. E. Woodcock, C. Holden, and Z. Yang, “Generating synthetic
Landsat images based on all available Landsat data: Predicting Landsat
surface reflectance at any given time,” Remote Sens. Environ., vol. 162,
pp. 67–83, 2015.
[11] G. Yin, G. Mariethoz, Y. Sun, and M. F. McCabe, “A comparison of
gap-filling approaches for Landsat-7 satellite data,” Int. J. Remote Sens.,
vol. 38, no. 23, pp. 6653–6679, 2017.
[12] G. Xian, C. Homer, J. Dewitz, J. Fry, N. Hossain, and J. Wickham, “The
change of impervious surface area between 2001 and 2006 in the
conterminous United States,” Photogramm. Eng. Remote Sens., vol. 77,
no. 8, pp. 758–762, 2011.
[13] C. Deng and Z. Zhu, “Continuous subpixel monitoring of urban
impervious surface using Landsat time series,” Remote Sens.
Environ., pp. 1–21, 2018.
[14] D. Lu and Q. Weng, “Use of impervious surface in urban land-use
classification,” Remote Sens. Environ., vol. 102, no. 1, pp. 146–160, 2006.
[15] P. Wang, C. Huang, and E. C. B. de Colstoun, “Mapping 2000-2010
impervious surface change in India using global land survey Landsat
data,” Remote Sens., vol. 9, no. 4, p. 366, 2017.
[16] M. P. Dannenberg, C. R. Hakkenberg, and C. Song, “Consistent
classification of Landsat time series with an improved automatic adaptive
signature generalization algorithm,” Remote Sens., vol. 8, no. 8, 2016.
[17] J. Gray and C. Song, “Consistent classification of image time series with
automatic adaptive signature generalization,” Remote Sens. Environ.,
vol. 134, pp. 333–341, Jul. 2013.
[18] L. Breiman, “Random forests,” Mach. Learn., vol. 45, pp. 5–32, 2001.
[19] P. Gislason, J. Benediktsson, and J. Sveinsson, “Random forests for
land cover classification,” Pattern Recognit. Lett., vol. 27, no. 4, pp.
294–300, 2006.
[20] Z. Zhu, S. Wang, and C. E. Woodcock, “Improvement and expansion of
the Fmask algorithm: Cloud, cloud shadow, and snow detection for
Landsats 4-7, 8, and Sentinel 2 images,” Remote Sens. Environ., vol.
159, pp. 269–277, 2015.
[21] A. Liaw and M. Wiener, “Classification and regression by
randomForest,” R News, vol. 2, no. 3, pp. 18–22, 2002.
[22] P. S. Kaspersen, R. Fensholt, and M. Drews, “Using Landsat vegetation
indices to estimate impervious surface fractions for European cities,”
Remote Sens., vol. 7, no. 6, pp. 8224–8249, 2015.
[23] C. R. Hakkenberg, “Houston Subannual Percent Impervious (SPI) Land
Cover Dataset: 1997-2018,” Kinder Institute Urban Data Platform,
2019. doi.org/10.25612/837.d8nxbzwj01ad
[24] E. Grossmann, J. Ohmann, J. Kagan, H. May, and M. Gregory,
“Mapping ecological systems with a random forest model: Tradeoffs
between errors and bias,” Gap Anal. Bull., vol. 17, pp. 16–22, 2010.
[25] K. McGarigal, S. Tagil, and S. A. Cushman, “Surface metrics: An
alternative to patch metrics for the quantification of landscape
structure,” Landsc. Ecol., vol. 24, no. 3, pp. 433–450, 2009.
... These approaches, however, are laborious or even unrealistic in remote and inaccessible areas, such as the Arctic. Alternatively, several studies demonstrated the potential of deriving training samples from preexisting knowledge (Gray and Song, 2013;Hakkenberg et al., 2020). Building on this premise, our study developed a ready-for-use training sample by leveraging the FAST library and pre-existing land cover datasets for supervised classification model development. ...
Article
Full-text available
The entire Arctic is rapidly warming, which brings in a multitude of environmental consequences far beyond the northern high-latitude limits. Land cover maps offer biophysical insights into the terrestrial environment and are therefore essential for understanding the transforming Arctic in the context of anthropogenic activity and climate change. Satellite remote sensing has revolutionized our ability to capture land cover information over large areas. However, circumpolar Arctic-scale fine-resolution land cover mapping has so far been lacking. Here, we utilize a combination of multimode satellite observations and topographic data at 10 m resolution to provide a new baseline land cover product (CALC-2020) across the entire terrestrial Arctic for circa 2020. Accuracy assessments suggest that the CALC-2020 product exhibits satisfactory performances, with overall accuracies of 79.3 % and 67.3 %, respectively, at validation sample locations and field/flux tower sites. The derived land cover map displays reasonable agreement with pre-existing products, meanwhile depicting more subtle polar biome patterns. Based on the CALC-2020 dataset, we show that nearly half of the Arctic landmass is covered by graminoid tundra or lichen/moss. Spatially, the land cover composition exhibits regional dominance, reflecting the complex suite of both biotic and abiotic processes that jointly determine the Arctic landscape. The CALC-2020 product we developed can be used to improve Earth system modelling and benefit the ongoing efforts on sustainable Arctic land management by public and non-governmental sectors. The CALC-2020 land cover product is freely available on Science Data Bank: 10.57760/sciencedb.01869 (Xu et al., 2022a).
... These approaches, however, are laborious or even unrealistic in remote and inaccessible areas, such as the Arctic. Alternatively, several studies demonstrated the potential of deriving reference samples from pre-existing knowledge (Gray and Song, 2013; Hakkenberg et al., 2020). Building on this premise, our study takes a step forward by developing a "ready for use" sample set migrated from the FAST library for supervised classification model calibration and evaluation. ...
Preprint
The entire Arctic is rapidly warming, which brings in a multitude of environmental consequences far beyond the northern high-latitude limits. Land cover maps offer biophysical insights into the terrestrial environment and are therefore essential for understanding the transforming Arctic in the context of anthropogenic activity and climate change. Satellite remote sensing has revolutionized our ability to capture land cover information over large areas. However, circumpolar Arctic-scale fine resolution land cover mapping has been so far lacking. Here, we utilize a combination of multimode satellite observations and topographic data at 10 m resolution to provide a new baseline land cover product (CALC-2020) across the entire terrestrial Arctic for circa 2020. Accuracy assessments suggest that the CALC-2020 product exhibits satisfactory performances, with overall accuracies of 79.63 % and 67.27 %, respectively, at validation sample locations and field/flux tower sites. The derived land cover map also displays reasonable agreement with three pre-existing global products, meanwhile depicting much more subtle polar biome patterns. Based on the CALC-2020 dataset, we show that over half of the Arctic landmass is covered by graminoid tundra or lichen/moss. Spatially, the land cover distribution exhibits regional dominance, reflecting the complex suite of both biotic and abiotic processes that jointly determine the Arctic landscape. The CALC-2020 product we developed can be used to improve earth system modelling, and benefit the ongoing efforts on sustainable Arctic land management by public and non-governmental sectors. The CALC-2020 land cover product is freely available on Science Data Bank: http://cstr.cn/31253.11.sciencedb.01869 (Xu et al., 2022a).
... The change detection of impervious surfaces is important in monitoring and understanding urban development and has been extensively studied in the remote sensing literature. However, most of the existing studies monitor the change of impervious surfaces based on coarse- and medium-spatial-resolution satellite imagery, such as MODIS and Landsat [209], [210], which have difficulty dealing with areas that have low impervious-surface intensities and mixed pixels [211]. During recent decades, images with high spatial resolution have provided new opportunities for subtle impervious-surface monitoring at very fine scales. ...
Article
Change detection is a vibrant area of research in remote sensing. Thanks to increases in the spatial resolution of remote sensing images, subtle changes at a finer geometrical scale can now be effectively detected. However, change detection from very-high-spatial-resolution (VHR) (≤5 m) remote sensing images is challenging due to limited spectral information, spectral variability, geometric distortion, and information loss. To address these challenges, many change detection algorithms have been developed. However, a comprehensive review of change detection in VHR images is lacking in the existing literature. This review aims to fill the gap and mainly includes three aspects: methods, applications, and future directions.
... Driven by a demand for spatio-temporal accuracy and consistency across the multidecadal imagery time series (where individual images may vary due to seasonal illumination angles and atmospheric conditions), the land cover change dataset was generated using a three-part algorithmic procedure: (1) automatic adaptive signature generalization (Dannenberg et al. 2016) for automated training data selection from NLCD classifications from 2001, 2006, and 2011 (Homer et al. 2015), (2) machine learning image classification using random forests to classify atmospherically-corrected image spectra to one of the four aforementioned developed classes (Hakkenberg et al. 2020), and (3) spatio-temporal filtering to reduce erroneous classifications due to clouds, atmospheric contamination, and other sources of data noise and model errors among the 153 billion pixels classified (Hakkenberg et al. 2019). All classifications were validated using independent, multi-temporal fine-resolution imagery from the Ikonos, Quickbird, and Worldview sensors (Hakkenberg et al. 2019). ...
Article
Urbanization results in increasing impervious surfaces with the potential to threaten fragile environments and heighten flood risks. In the United States, research on the social processes driving urbanization has tended to focus on the twenty-first century, but less is known about how temporal trends arose from the spatial layout of the urban land upon which this growth was founded. To address this gap, we present a novel interdisciplinary synthesis using neighborhood-level census data in tandem with a satellite-derived annual land cover change time series to assess the role of race, affluence, and socioeconomic status in shaping spatio-temporal urbanization in the Houston metropolitan area from 1997–2016. Results from cross-sectional and temporal regression models indicate that while social dynamics associated with historical versus recent urbanization are related, they are not identical. Thus, while temporal change in Houston's urbanization is driven primarily by socioeconomic status, the social dynamics associated with spatial disparities in urbanization relate primarily to race, regardless of socioeconomic status. These results are noteworthy as urbanization in Houston does not fully comport with existing theoretical perspectives or with empirical findings nationally. Instead, we suggest these findings reflect the city’s politics and culture surrounding land use. Thus, beyond its important social and environmental implications, this study affirms the utility of fusing socio-demographic data with satellite remote sensing of urban growth, and highlights the value of the socioenvironmental succession framework for characterizing urbanization as a recursive process in space and time.
Article
Addicks and Barker reservoirs were built in the 1940s to protect downtown Houston from flooding and have generally worked very well until 2017 when Hurricane Harvey devastated much of Houston and surroundings with up to 40 inches (102 cm) of rainfall causing flooding of 154,000 homes in over 22 watersheds in Houston/Harris County alone. However, the story of how Addicks and Barker flooded upstream residential areas from a hydrologic standpoint is a harsh lesson in flood infrastructure policy and funding. This failure to protect both downstream properties in Buffalo Bayou and upstream areas behind the dams ended up with tens of thousands of flooded homes and properties, with many having flood waters for over 10 days. This paper explores the main causes for the flooding and addresses the hydrologic issues upstream in both reservoirs. The main causes of flooding were not just related to a massive rainfall event, but also explosive urban expansion of land use upstream of reservoirs, altered and updated reservoir design issues, and lack of governmental action in the years leading up to the disaster. Potential long‐term solutions to the flooding and design problems are addressed in this article as well.
Article
In 2017, Hurricane Harvey caused substantial loss of life and property in the swiftly urbanizing region of Houston, TX. Now in its wake, researchers are tasked with investigating how to plan for and mitigate the impact of similar events in the future, despite expectations of increased storm intensity and frequency as well as accelerating urbanization trends. Critical to this task is the development of automated workflows for producing accurate and consistent land cover maps of sufficiently fine spatio-temporal resolution over large areas and long timespans. In this study, we developed an innovative automated classification algorithm that overcomes some of the traditional trade-offs between fine spatio-temporal resolution and extent – to produce a multi-scene, 30 m annual land cover time series characterizing 21 years of land cover dynamics in the 35,000 km² Greater Houston area. The ensemble algorithm takes advantage of the synergistic value of employing all acceptable Landsat imagery in a given year, using aggregate votes from the posterior predictive distributions of multiple image composites to mitigate against misclassifications in any one image, and fill gaps due to missing and contaminated data, such as those from clouds and cloud shadows. The procedure is fully automated, combining adaptive signature generalization and spatio-temporal stabilization for consistency across sensors and scenes. The land cover time series is validated using independent, multi-temporal fine-resolution imagery, achieving crisp overall accuracies between 78–86% and fuzzy overall accuracies between 91–94%. Validated maps and corresponding areal cover estimates corroborate what census and economic data from the Greater Houston area likewise indicate: rapid growth from 1997–2017, demonstrated by the conversion of 2,040 km² (± 400 km²) to developed land cover, 14% of which resulted from the conversion of wetlands.
Beyond its implications for urbanization trends in Greater Houston, this study demonstrates the potential for automated approaches to quantifying large extent, fine resolution land cover change, as well as the added value of temporally-dense time series for characterizing higher-order spatio-temporal dynamics of land cover, including periodicity, abrupt transitions, and time lags from underlying demographic and socio-economic trends.
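The vote aggregation described in this abstract can be illustrated with a minimal sketch. It assumes that each image composite supplies a vector of class probabilities for a pixel, or `None` where the pixel is masked (for example by cloud), and that votes are combined by summing probabilities. The function name `aggregate_votes` and the summation rule are illustrative assumptions, not the authors' implementation.

```python
from typing import Optional, Sequence


def aggregate_votes(
    per_image_probs: Sequence[Optional[Sequence[float]]],
) -> Optional[int]:
    """Combine per-image class probabilities for one pixel.

    Each entry is a list of class probabilities from one image composite,
    or None when the pixel is missing or contaminated in that composite.
    Returns the index of the class with the largest summed probability,
    or None when no composite has a valid observation.
    """
    # Masked composites are skipped, so the remaining ones fill the gap.
    valid = [p for p in per_image_probs if p is not None]
    if not valid:
        return None
    n_classes = len(valid[0])
    # Sum the probability assigned to each class across valid composites.
    totals = [sum(p[k] for p in valid) for k in range(n_classes)]
    return max(range(n_classes), key=totals.__getitem__)


# Hypothetical pixel: three usable composites and one cloud-masked composite.
pixel = [[0.6, 0.3, 0.1], None, [0.2, 0.7, 0.1], [0.3, 0.6, 0.1]]
label = aggregate_votes(pixel)
```

In this hypothetical example a single composite favours class 0, but the summed votes favour class 1, which is how pooling across composites can dampen a misclassification in any one image.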
Article
Understanding and monitoring the environmental impacts of global urbanization requires better urban datasets. Continuous field impervious surface change (ISC) mapping using Landsat data is an effective way to quantify spatiotemporal dynamics of urbanization. It is well acknowledged that Landsat-based estimation of impervious surface is subject to seasonal and phenological variations. The overall goal of this paper is to map 2000-2010 ISC for India using Global Land Survey datasets and training data only available for 2010. To this end, a method was developed that could transfer the regression tree model developed for mapping 2010 impervious surface to 2000 using an iterative training and prediction (ITP) approach. An independent validation dataset was also developed using Google Earth™ imagery. Based on the reference ISC from the validation dataset, the RMSE of predicted ISC was estimated to be 18.4%. At 95% confidence, the total estimated ISC for India between 2000 and 2010 is 2274.62 ± 7.84 km².
Article
The free and open access to all archived Landsat images in 2008 has completely changed the way of using Landsat data. Many novel change detection algorithms based on Landsat time series have been developed. We present a comprehensive review of four important aspects of change detection studies based on Landsat time series, including frequencies, preprocessing, algorithms, and applications. We observed the trend that the more recent the study, the higher the frequency of Landsat time series used. We reviewed a series of image preprocessing steps, including atmospheric correction, cloud and cloud shadow detection, and composite/fusion/metrics techniques. We divided all change detection algorithms into six categories, including thresholding, differencing, segmentation, trajectory classification, statistical boundary, and regression. Within each category, six major characteristics of different algorithms, such as frequency, change index, univariate/multivariate, online/offline, abrupt/gradual change, and sub-pixel/pixel/spatial were analyzed. Moreover, some of the widely-used change detection algorithms were also discussed. Finally, we reviewed different change detection applications by dividing these applications into two categories, change target and change agent detection.
Article
Classifying land cover is perhaps the most common application of remote sensing, yet classification at frequent temporal intervals remains a challenging task due to radiometric differences among scenes, time and budget constraints, and semantic differences among class definitions from different dates. The automatic adaptive signature generalization (AASG) algorithm overcomes many of these limitations by locating stable sites between two images and using them to adapt class spectral signatures from a high-quality reference classification to a new image, which mitigates the impacts of radiometric and phenological differences between images and ensures that class definitions remain consistent between the two classifications. We refined AASG to adapt stable site identification parameters to each individual land cover class, while also incorporating improved input data and a random forest classifier. In the Research Triangle region of North Carolina, our new version of AASG demonstrated an improved ability to update existing land cover classifications compared to the initial version of AASG, particularly for low intensity developed, mixed forest, and woody wetland classes. Topographic indices were particularly important for distinguishing woody wetlands from other forest types, while multi-seasonal imagery contributed to improved classification of water, developed, forest, and hay/pasture classes. These results demonstrate both the flexibility of the AASG algorithm and the potential for using it to produce high-quality land cover classifications that can utilize the entire temporal range of the Landsat archive in an automated fashion while maintaining consistent class definitions through time.
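The idea of locating stable sites with a class-specific parameter can be sketched as follows. This is a simplified illustration under stated assumptions: it uses a single-band difference between the two images and treats a pixel as stable when its difference lies within `c` standard deviations of the mean difference, with `c` chosen per class. The thresholding rule, the function name `stable_sites`, and all values are hypothetical and are not taken from the abstract.

```python
from statistics import mean, pstdev
from typing import Dict, List, Sequence


def stable_sites(
    ref: Sequence[float],
    tgt: Sequence[float],
    labels: Sequence[str],
    c_by_class: Dict[str, float],
) -> Dict[str, List[int]]:
    """Return, per reference class, the indices of pixels judged stable.

    ref and tgt hold one band value per pixel for the reference and target
    images; labels holds the class of each pixel in the reference map.
    """
    # Difference image between the two dates.
    diffs = [t - r for r, t in zip(ref, tgt)]
    mu, sigma = mean(diffs), pstdev(diffs)
    stable: Dict[str, List[int]] = {}
    for i, (d, lab) in enumerate(zip(diffs, labels)):
        # Assumed rule: stable if within c * sigma of the mean difference,
        # where c is set separately for each land cover class.
        if abs(d - mu) <= c_by_class[lab] * sigma:
            stable.setdefault(lab, []).append(i)
    return stable


# Hypothetical data: pixel 3 changes sharply between dates, the rest do not.
ref = [0.10, 0.12, 0.30, 0.32, 0.11, 0.31]
tgt = [0.12, 0.14, 0.32, 0.10, 0.13, 0.33]
labels = ["forest", "forest", "developed", "developed", "forest", "developed"]
sites = stable_sites(ref, tgt, labels, {"forest": 1.0, "developed": 1.0})
```

The target-image values at the stable indices could then serve as training samples that carry the reference class labels, so that class signatures are re-derived from the target image itself.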
Article
Accurate land cover information is required for science, monitoring, and reporting. Land cover changes naturally over time, as well as a result of anthropogenic activities. Monitoring and mapping of land cover and land cover change in a consistent and robust manner over large areas is made possible with Earth Observation (EO) data. Land cover products satisfying a range of science and policy information needs are currently produced periodically at different spatial and temporal scales. The increased availability of EO data—particularly from the Landsat archive (and soon to be augmented with Sentinel-2 data)—coupled with improved computing and storage capacity with novel image compositing approaches, have resulted in the availability of annual, large-area, gap-free, surface reflectance data products. In turn, these data products support the development of annual land cover products that can be both informed and constrained by change detection outputs. The inclusion of time series change in the land cover mapping process provides information on class stability and informs on logical class transitions (both temporally and categorically). In this review, we present the issues and opportunities associated with generating and validating time-series informed annual, large-area, land cover products, and identify methods suited to incorporating time series information and other novel inputs for land cover characterization.
Article
The magnitude and persistence of land carbon (C) pools influence long-term climate feedbacks. Interactive ecological processes influence land C pools and our understanding of these processes is imperfect so land surface models have errors and biases when compared to each other and to real observations. Here we implement an Ensemble Adjustment Kalman Filter (EAKF), a sequential state data assimilation technique to reduce these errors and biases. We implement the EAKF using the Data Assimilation Research Testbed coupled with the Community Land Model (CLM 4.5 in CESM 1.2). We assimilated simulated and real satellite observations for a site in central New Mexico, United States. A series of observing system simulation experiments allowed assessment of the data assimilation system without model error. This showed that assimilating biomass and leaf area index observations decreased model error in C dynamics forecasts (29% using biomass observations and 40% using leaf area index observations) and that assimilation in combination shows greater improvement (51% using both observation streams). Assimilating real observations highlighted likely model structural errors and we implemented an adaptive model-variance-inflation technique to allow the model to track the observations. Monthly and longer model forecasts using real observations were improved relative to forecasts without data assimilation. The reliable forecast lead-time varied by model pool and is dependent on how tightly the C pool is coupled to meteorologically driven processes. The EAKF and similar state data assimilation techniques could reduce errors in projections of the land C sink and provide more robust forecasts of C pools and land-atmosphere exchanges.
Article
The purpose of this study is to assess the relative performance of four different gap-filling approaches across a range of land-surface conditions, including both homogeneous and heterogeneous areas as well as in scenes with abrupt changes in landscape elements. The techniques considered in this study include: (1) Kriging and co-Kriging; (2) geostatistical neighbourhood similar pixel interpolator (GNSPI); (3) a weighted linear regression (WLR) algorithm; and (4) the direct sampling (DS) method. To examine the impact of image availability and the influence of temporal distance on the selection of input training data (i.e. time separating the training data from the gap-filled target image), input images acquired within the same season (temporally close) as well as in different seasons (temporally far) to the target image were examined, as was the case of using information only within the target image itself. Root mean square error (RMSE), mean spectral angle (MSA), and coefficient of determination (R²) were used as the evaluation metrics to assess the prediction results. In addition, the overall accuracy (OA) and kappa coefficient (kappa) were used to assess a land-cover classification based on the gap-filled images. Results show that all of the gap-filling approaches provide satisfactory results for the homogeneous case, with R² > 0.93 for bands 1 and 2 in all cases and R² > 0.80 for bands 3 and 4 in most cases. For the heterogeneous example, GNSPI performs the best, with R² > 0.85 for all tested cases. WLR and GNSPI exhibit equivalent accuracy when a temporally close input image is used (i.e. WLR and GNSPI both have an R² equal to 0.89 for band 1). For the case of abrupt changes in scene elements or in the absence of ancillary data, the DS approach outperforms the other tested methods.
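The three evaluation metrics named in this abstract can be written out as a short sketch. It assumes that R² is computed as one minus the ratio of residual to total sum of squares (the study may instead use the squared correlation), and that the mean spectral angle is the average angle, in radians, between observed and predicted per-pixel spectra. The function names are illustrative.

```python
import math
from typing import Sequence


def rmse(obs: Sequence[float], pred: Sequence[float]) -> float:
    """Root mean square error between observed and predicted values."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))


def r_squared(obs: Sequence[float], pred: Sequence[float]) -> float:
    """Coefficient of determination, assumed here to be 1 - SS_res / SS_tot."""
    m = sum(obs) / len(obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(obs, pred))
    ss_tot = sum((o - m) ** 2 for o in obs)
    return 1.0 - ss_res / ss_tot


def spectral_angle(a: Sequence[float], b: Sequence[float]) -> float:
    """Angle in radians between two spectra treated as vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    # Clamp to [-1, 1] to guard against floating-point round-off.
    return math.acos(max(-1.0, min(1.0, dot / norm)))


def mean_spectral_angle(
    obs: Sequence[Sequence[float]], pred: Sequence[Sequence[float]]
) -> float:
    """Average spectral angle over all pixels."""
    return sum(spectral_angle(o, p) for o, p in zip(obs, pred)) / len(obs)
```

RMSE and R² are computed per band over the gap-filled pixels, whereas the spectral angle compares all bands of a pixel at once and is insensitive to a uniform scaling of brightness.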