ArticlePDF Available

Circa 2010 Land Cover of Canada: Local Optimization Methodology and Product Development

MDPI
Remote Sensing
Authors:
  • Environment and Climate Change Canada, Ottawa

Abstract and Figures

Land cover information is necessary for a large range of environmental applications related to climate impacts and adaption, emergency response, wildlife habitat, etc. In Canada, a 2008 user survey indicated that the most practical land cover data is provided in a nationwide 30 m spatial resolution format, with an update frequency of five years. In response to this need, the Canada Centre for Remote Sensing (CCRS) has generated a 30 m land cover map of Canada for the base year 2010, as the first of a planned series of maps to be updated every five years, or more frequently. This land cover dataset is also the Canadian contribution to the 30 m spatial resolution 2010 Land Cover Map of North America, which is produced by Mexican, American and Canadian government institutions under a collaboration called the North American Land Change Monitoring System (NALCMS). This paper describes the mapping approach used for generating this land cover dataset for Canada from Thematic Mapper (TM) and Enhanced Thematic Mapper (ETM+) Landsat sensor observations. The innovative part of the mapping approach is the local optimization of the land cover classifier, which has resulted in increased spatial consistency and accuracy. Training and classifying with locally confined reference samples over a large number of partially overlapping areas (i.e., moving windows) ensures the optimization of the classifier to a local land cover distribution, and decreases the negative effect of signature extension. A weighted combination of labels, which is determined by the classifier in overlapping windows, defines the final label for each pixel. Since the approach requires extensive computation, it has been developed and deployed using the Government of Canada’s High-Performance Computing Center (HPC). An accuracy assessment based on 2811 randomly distributed samples shows that land cover data produced with this new approach has achieved 76.60% accuracy with no marked spatial disparities.
Content may be subject to copyright.
remote sensing
Article
Circa 2010 Land Cover of Canada: Local Optimization
Methodology and Product Development
Rasim Latifovic 1, *, Darren Pouliot 2and Ian Olthof 1
1Natural Resources Canada, Canadian Centre for Remote Sensing, 560 Rochester, Ottawa, ON K1A 0E4,
Canada; Ian.Olthof@canada.ca
2Environment and Climate Change Canada, Landscape Science and Technology, Ontario,
1125 Colonel By Drive, Ottawa, ON K1S 5B6, Canada; Darren.Pouliot@canada.ca
*Correspondence: Rasim.Latifovic@canada.ca; Tel.: +1-613-759-7002
Received: 1 September 2017; Accepted: 25 October 2017; Published: 27 October 2017
Abstract:
Land cover information is necessary for a large range of environmental applications related
to climate impacts and adaption, emergency response, wildlife habitat, etc. In Canada, a 2008 user
survey indicated that the most practical land cover data is provided in a nationwide 30 m spatial
resolution format, with an update frequency of five years. In response to this need, the Canada Centre
for Remote Sensing (CCRS) has generated a 30 m land cover map of Canada for the base year 2010,
as the first of a planned series of maps to be updated every five years, or more frequently. This land
cover dataset is also the Canadian contribution to the 30 m spatial resolution 2010 Land Cover Map
of North America, which is produced by Mexican, American and Canadian government institutions
under a collaboration called the North American Land Change Monitoring System (NALCMS).
This paper describes the mapping approach used for generating this land cover dataset for Canada
from Thematic Mapper (TM) and Enhanced Thematic Mapper (ETM+) Landsat sensor observations.
The innovative part of the mapping approach is the local optimization of the land cover classifier,
which has resulted in increased spatial consistency and accuracy. Training and classifying with
locally confined reference samples over a large number of partially overlapping areas (i.e., moving
windows) ensures the optimization of the classifier to a local land cover distribution, and decreases the
negative effect of signature extension. A weighted combination of labels, which is determined by the
classifier in overlapping windows, defines the final label for each pixel. Since the approach requires
extensive computation, it has been developed and deployed using the Government of Canada’s
High-Performance Computing Center (HPC). An accuracy assessment based on 2811 randomly
distributed samples shows that land cover data produced with this new approach has achieved
76.60% accuracy with no marked spatial disparities.
Keywords:
land cover; land cover change; landsat; Canada; 2010; random forest algorithm; classification
1. Introduction
National-scale land cover and land cover change information are required for studying
land-surface processes that characterize environmental, social, and economic aspects of sustainability.
The International Geosphere Biosphere Programme (IGBP), the World Climate Research Programme
(WCRP), and global monitoring have long articulated the science requirement for land cover
information, in particular at the global level. Land cover is a fundamental earth surface attribute
shaped by geologic, hydrologic, climatic, atmospheric, and land-use processes occurring at a range
of space–time scales. Land cover, in turn, affects these processes through feedback mechanisms such
as plant respiration, which both absorbs and releases carbon, water, oxygen, and other biochemical
elements from or to the environment. Therefore, knowledge of land cover is essential for understanding
Remote Sens. 2017,9, 1098; doi:10.3390/rs9111098 www.mdpi.com/journal/remotesensing
Remote Sens. 2017,9, 1098 2 of 18
earth surface processes that are relevant for land management and the preservation of natural
environments that may influence ecosystem and human health [1].
Global and continental land cover information is needed to implement various United Nations’
initiatives: the Millennium Development Goals (MDG), the Framework Convention on Climate Change
(FCCC), the Convention on Biological Diversity (CBD), the Convention to Combat Desertification
(CCD) and the Forum on Forest (UNFF). A list of Essential Climate Variables (ECV) endorsed by the
Global Climate Observing System (GCOS) and Committee on Earth Observation Satellites (CEOS)
science community includes land cover as an essential terrestrial variable [
2
4
]. The GCOS-107 [
2
]
report, which defines key requirements for ECV-land cover, recommends annual updates at 0.25–1 km,
and five-year updates at 10–30 m spatial resolution.
In Canada, a number of national land cover products have been generated using medium
and low-resolution (250 m–1 km) optical satellite data, including data acquired by the Moderate
Resolution Imagining Spectroradiometer (MODIS), the Satellite Pour l’Observation de la Terre (SPOT
VEGETATION), and the Advance Very High-Resolution Radiometer (AVHRR) sensors [
5
7
]. More
recent land cover information over Canada has been generated by the Canada Centre for Remote
Sensing (CCRS) under the North American Land Change Monitoring System (NALCMS) collaboration.
NALCMS is a collaborative framework involving CCRS’ parent organization, Natural Resources
Canada (NRCan); the United States Geological Survey (USGS); and Mexican institutions including
the National Institute of Statistics and Geography (INEGI), the National Forestry Commission
(CONAFOR), and the National Commission for Knowledge and Use of Biodiversity (CONABIO).
The collective need for a harmonized land cover monitoring system across North America’s political
boundaries was a motivating factor for establishing the NALCMS collaboration. The NALCMS group
has used national land cover mapping efforts to assemble continental land cover and change maps
for North America from MODIS observations at 250 m spatial resolution for 2005 and 2010 [
8
,
9
].
The current NALCMS effort is to generate cross-border synchronized 30 m land cover of the North
America continent for the year 2010. The 2010 baseline map will be used for generating a land
cover time series following the updating approach described in Latifovic and Pouliot [
10
]. To ensure
continuity with the existing 2005–2010 time series at 250 m, the new time series at 30 m will start from
2010, ensuring overlap that meets minimal requirements for data continuity. As with the 250 m maps,
this 30 m map is being compiled from national land cover mapping efforts.
In support of creation of this circa 2010 North American Land Cover, the objective of the study
was to develop a semi-automated classification approach to improve the accuracy and consistency of
the new land cover map of Canada generated from Landsat Thematic Mapper (TM) and Enhanced
Thematic Mapper (ETM+) observations at 30 m. Thus, this paper describes both the mapping approach
with local classifier optimization, and the characteristics of the produced map.
2. Materials and Methods
2.1. Landsat Data and Processing
The initial data coverage of Canada includes 13,350 Landsat TM/ETM+ scenes all obtained
from the United States (US) Geological Survey. Scenes spanned the 2009–2011 period, and were all
acquired in July and August. The procedure to create a full coverage of Canada includes: re-projection,
calibration, cloud/cloud shadow detection, compositing, and additional corrections as described
by Latifovic et al. [
11
]. The Landsat mosaic of Canada July–August circa 2010 is shown in Figure 1.
This mosaic was used to generate the land cover dataset described in this article. In order to reduce
significant phenological differences due to strong vegetation gradients, particularly south–north,
a narrow acquisition window of July–August 2010 was selected. However, such a short period
does not provide clear-sky coverage over Canada. A compromise was made by including two more
July–August periods from adjacent years, 2009 and 2011. Selecting adjacent pixels from different years
could create inconsistencies over areas where land cover changes occurred in the 2009–2011 period
Remote Sens. 2017,9, 1098 3 of 18
and adjacent pixels were selected from different years (i.e., before and after changes). To minimize this,
selection criteria were adjusted to preferentially select observations from 2010 over no change areas,
and from 2011 over changed areas. Overall, 10% of observations were selected from 2009, 70% were
from 2010, and 20% were from 2011. For each pixel, a scene ID with the time of acquisition is provided
as a separate layer. When this product is used for studies involving temporal aspects, the provided
time information may need to be considered. Processing was performed using a tile system to facilitate
parallel processing and enable handling and loading data into viewing tools for performing visual
quality control. The tile arrangement size and naming are shown in Figure 1. Naming follows a
common row–column convention.
Remote Sens. 2017, 9, 1098 3 of 18
change areas, and from 2011 over changed areas. Overall, 10% of observations were selected from
2009, 70% were from 2010, and 20% were from 2011. For each pixel, a scene ID with the time of
acquisition is provided as a separate layer. When this product is used for studies involving temporal
aspects, the provided time information may need to be considered. Processing was performed using
a tile system to facilitate parallel processing and enable handling and loading data into viewing tools
for performing visual quality control. The tile arrangement size and naming are shown in Figure 1.
Naming follows a common row–column convention.
Figure 1. Landsat Mosaic of Canada 2010 false color composite Top of Atmosphere reflectance and
tile system used for Landsat processing and land cover mapping.
The Lambert Conformal Conic (LCC) projection is commonly used in Canada for large area
spatial datasets because it does not require separate zones and keeps distortion to an acceptable level.
Through selecting LCC projection, consistency was maintained with CCRS’ other Long Time Satellite
Data Records (LTSDR) and national vector products. Table 1 provides the parameters of the LCC
projection.
Table 1. The parameters of the Lambert Conformal Conic (LCC) projection and earth ellipsoid model
used for output imagery over Canada.
Parameter Value
Earth ellipsoid GRS 1980
Major semi-axis, a 6,378,137 (m)
First eccentricity 0.00669438002290
Ellipsoid flattening, f 0.00335281068118
Projection LCC
1st parallel 49.00 (degree)
2nd parallel 77.00 (degree)
Central meridian 95.00 (degree)
Upper left corner (2,600,000.0 E (m); 10,500,000.0 N (m))
Lower right corner (3,100,000.0 E (m); 5,700,000.0 N (m))
Easting 0
Northing 0
Figure 1.
Landsat Mosaic of Canada 2010 false color composite Top of Atmosphere reflectance and tile
system used for Landsat processing and land cover mapping.
The Lambert Conformal Conic (LCC) projection is commonly used in Canada for large area spatial
datasets because it does not require separate zones and keeps distortion to an acceptable level. Through
selecting LCC projection, consistency was maintained with CCRS’ other Long Time Satellite Data
Records (LTSDR) and national vector products. Table 1provides the parameters of the LCC projection.
Table 1.
The parameters of the Lambert Conformal Conic (LCC) projection and earth ellipsoid model
used for output imagery over Canada.
Parameter Value
Earth ellipsoid GRS 1980
Major semi-axis, a 6,378,137 (m)
First eccentricity 0.00669438002290
Ellipsoid flattening, f 0.00335281068118
Projection LCC
1st parallel 49.00 (degree)
2nd parallel 77.00 (degree)
Central meridian 95.00 (degree)
Upper left corner (2,600,000.0 E (m); 10,500,000.0 N (m))
Lower right corner (3,100,000.0 E (m); 5,700,000.0 N (m))
Easting 0
Northing 0
Remote Sens. 2017,9, 1098 4 of 18
2.2. Ancillary Data
In addition to satellite datasets, a number of other information sources were used to train the
classification algorithm, aid the interpretation of specific land cover classes, and improve the mapping
of specific classes. The main reference data sources were existing land cover datasets derived from
medium resolution satellite observations (25–60 m) including: Satellite Information for Land Cover
(SILC), Northern Land Cover of Canada (NLCC), Agricultural Crop Cover Classification (ACCC) and
Earth Observation for Sustainable Development (EOSD), in addition to the other ancillary data listed
in Table 2.
Table 2. Ancillary data used as aid for generating classifier-training data.
Title Source
National Hydro Network, 1:50,000 scale
Canada Centre for Mapping and Earth Observation
(2004) http://geogratis.gc.ca [12]
Canadian Digital Elevation Data, 1:50,000 scale
Canada Centre for Mapping and Earth Observation
(2000) http://geogratis.gc.ca [12]
National Road Network, 1:50,000 scale
Canada Centre for Mapping and Earth Observation
(2012) [12]http://geogratis.gc.ca
SILC: Satellite Information for Land Cover of Canada—a sample
of LANDSAT Thematic Mapper/Enhanced Thematic Mapper
(TM/ETM+) scenes (30 m resolution)
Canada Centre for Mapping and Earth
Observation [13]
EOSD: Earth Observation for Sustainable Development of
Forests Land Cover Classification, circa 2000 at 30 m resolution
Canadian Forest Service
http://cfs.nrcan.gc.ca/publications?id=29220 [14]
NLLC: Circa 2000 Northern Land Cover of Canada at 30 m
spatial resolution
Canada Centre for Mapping and Earth Observation
http://geogratis.cgdi.gc.ca [15]
ACCC: Agricultural Crop Cover Classification annual crop
inventory, 2013, 30 m spatial resolution
AAFC Science and Technology Branch, Earth
Observation Team. http://open.canada.ca [16]
Northern treeline http://data.arcticatlas.org, [17]
National Burned Area Composites 2004–2013 Canadian Forest Service [18]
Version 1 Visible Infrared Imaging Radiometer Suite (VIIRS)
Day/Night Band Nighttime Lights (2012)
Earth Observation Group, NOAA National
Geophysical Data Center [19]
Ground truth datasets NRCan, CCRS, unpublished
2.3. Land Cover Mapping
Large area land cover mapping providing accurate and spatially consistent results is not a trivial
task. It requires careful consideration of the effective use of available training data combined with
strategic classifier implementation, and followed by robust procedures for quality control. To facilitate
land cover monitoring requirements, more sophisticated algorithms based on advances in the fields
of pattern recognition and machine learning have emerged. Decision tree classifiers have often
been used for land cover classification at continental to global scales. Early applications of decision
trees [
20
] for remote sensing-based land cover classification focused on mapping using coarse resolution
imagery [
21
25
]. Applications of the random forest (RF) algorithm to land cover mapping using
medium and fine resolution remote sensing data are described and evaluated in [
26
28
]. This study
used the random forest (RF) algorithm [
29
] as it is fast to train, can handle data of different types, and
is widely proven to achieve good results with a multitude of classification problems. The conceived
mapping approach involves the following steps: (1) generation of training and testing data; (2) training
initial RF models and generation of a primary classification for each tile; (3) local optimization within
a tile and blending between tiles; (4) mapping of urban and agriculture areas; (5) post-classification
corrections; and (6) quality control. The classification legend is designed for the North American
scale in two hierarchical levels using the Food and Agriculture Organization (FAO) Land Cover
Classification System (LCCS). Table 3provides the NALCMS level I legend with 12 classes and level
II legend with 19 classes. The legend for the Land Cover Map of Canada 2010 has 15 classes of the
NALCMS legend level II; tropical vegetation classes 3, 4, 7, and 9 were not used.
Remote Sens. 2017,9, 1098 5 of 18
Table 3. North American Land Change Monitoring System (NALCMS) land cover classification legend level I and II.
Level I Level II Land Cover Classification System (LCCS)
Basic Classifier
Primarily
vegetated areas
Natural and
semi-natural terrestrial
and aquatic
1. Needleleaf forest 1. Temperate or sub-polar needleleaf forest A3.A10.B2.XX.D2.E1
2. Sub-polar taiga needleleaf forest A3.A10.B2.XX.D1.E2
2. Broadleaf forest
3. Tropical or sub-tropical broadleaf evergreen forest A3.A10.B2.XX.D1.E1
4. Tropical or sub-tropical broadleaf deciduous forest A3.A11.B2.XX.D1.E1
5. Temperate or sub-polar broadleaf deciduous forest A3.A14.B2.XX.D1.E1
3. Mixed forest 6. Mixed forest A3.A10.B2.XX.D2.E1/A3.A10.B2.XX.D1.E2
4. Shrubland 7. Tropical or sub-tropical shrubland A4.A20.B3–B9
8. Temperate or sub-polar shrubland A4.A20.B3–B10
5. Grassland 9. Tropical or sub-tropical grassland A6.A20.B4
10. Temperate or sub-polar shrubland A2.A20.B4.XX.E5
6. Lichen/moss
11. Sub-polar or polar shrubland–lichen–moss A4.A11.B3–B10/A2.A20.B4–B12/A8.A11–A13
12. Sub-polar or polar grassland–lichen–moss A4.A20.B4–B12/A4.A11.B3–B10/A8.A11–A13
13. Sub-polar or polar barren–lichen–moss A8.A20–A.13/A4.A11.B3–B10/A2.A20.B4–B12
7. Wetland 14. Wetland A2.A20.B4.C3
Cultivated/managed
terrestrial/aquatic 8. Cropland 15. Cropland A4–S1
Primarily
non-vegetated
areas
Terrestrial 9. Barren land 16. Barren land A1/A2
10. Urban and built-up
17. Urban and built-up A4
Aquatic 11. Water 18. Water A1
12. Snow and ice 19. Snow and ice A2/A3
Remote Sens. 2017,9, 1098 6 of 18
2.3.1. Generation of Training and Testing Data
Division of the total mapping area into a system of tiles proceeded with consideration for the
trade-off between processing efficiency and localization. The size of each tile is 900 km
×
900 km.
Another possible option, using a mapping zone approach, was not employed due to the large size and
irregular shape of the desired mapping zones. The generation of robust non-parametric models such
as RF requires a large quantity of training data. In this mapping effort, a large training dataset was
generated following two sequential steps.
In the first step, initial training data was generated for a selected subset of tiles that capture the
spatial distribution of land cover across all of the ecozones in Canada. Tiles 2-2, 2-4, 3-2, 3-4, 4-2, 4-3,
4-6, and 5-4 (Figure 1) were classified using the unsupervised classification approach described in
Beaubien et al. [
5
], Latifovic et al. [
6
]. This involves unsupervised clustering and labeling, and requires
considerable time, expertise in image interpretation, and knowledge of the land cover in the area of
interest. In addition to the classified tiles, a number of other information sources (Table 2) were used to
aid the selection of training and testing samples.
In the second step for each of the tiles, a RF land cover model was trained using initial tile-specific
training data. RF models for tiles without specific training data were trained using samples selected
from the adjacent or closest tiles. Model input included the green, red, near-infrared (NIR) and
short-wave infrared (SWIR) (1.5 um) Landsat bands. This study used the Open Source Computer
Vision Library (http://opencv.org) implementation of the random forest algorithm with the following
values of the hyperparameters: the maximum possible depth of the tree (maxDepth = 30), the number
of samples in a node to be split (MinSampleCount = 0.1%), the size of randomly selected features at each
tree node that are used to find the best split(s) (ActiveVarCount = 3), and the number of trees (TC = 100).
The values for maxDepth, TC, and MinSampleCount were defined by varying one parameter at a time
while keeping other two fixed, and comparing the model performance with accuracy.
Once the base land cover was generated, significant quality control was carried out to ensure viable
land cover sample selection. A number of additional layers were developed, such as: a north–south
tree density layer based on the tree line, a growing condition layer to more consistently isolate classes 2,
11, 12, and 13 to logical geographic extents, a water body layer, and a wetland probability layer. Urban
and forest fire burned area layers were also generated from auxiliary data and, used for selection of
the final training dataset.
2.3.2. Random Forest Local Optimization and Blending
Large-scale land cover mapping is associated with a number of challenges related to limitations in
signature extension i.e., increasing the spatial–temporal range over which a set of training statistics can
be used to map certain land cover types without significant loss of accuracy [
30
]. This is particularly
significant for the boreal forest, where high spatial variability in composition and structure limits the
extension of spectral signatures to a few hundred kilometers [
31
]. A large number of samples over a
limited area is required to achieve higher land cover accuracy. Furthermore, it is desirable to merge
individual scenes into regional image mosaics, and then to treat these mosaics as image entities, rather
than separately classify each scene [
32
]. For example, the USGS National Land Cover Database has
delineated 66 mapping zones with respect to landform, soil, vegetation, spectral characteristic, and
image footprints [
33
]. Separately mapping each mapping zone often results in distinct boundaries
due to differences in the local optimization of the land cover classifier within the zone. Several
approaches have been developed to address this problem, such as object-based edge matching [
8
] or
membership blending [
33
]. To address the problems of limited signature extension, blending, and
the local optimization of the land cover classifier, we develop an approach based on a large number
of rectangular overlapping areas (i.e., moving windows). For each window, a RF model was trained
using local reference samples. The label for each pixel was defined as a weighted combination of labels
determined in overlapping windows that contain the given pixel. Decision tree classifiers are sensitive
to the sample distribution; they attempt to optimize agreement with large classes. In Pouliot et al. [
34
],
Remote Sens. 2017,9, 1098 7 of 18
the effect of sample distribution was found to be as high as 30% with the RF algorithm. Millard
and Richardson [
35
] also showed RF’s strong sensitivity to sample distribution. Thus, combining
results from several possible sample distributions can improve accuracy, and has the added benefit of
blending across discontinuities.
The window size we used for each local classifier was 10,000 by 10,000 Landsat pixels, with a
horizontal and vertical window offset of 30%. In this configuration, each pixel was contained in nine
different windows. For each window, a RF classifier was trained and used to classify pixels in the
window. The class with the highest weight was selected as the final land cover label for the pixel.
The overall weight for a class was computed as a sum of sigmoid function of the pixel’s distance from
the window center. The sigmoid function was used to scale the output between 0 and 1. The distance
from the center was used because the sample distribution for the window is most representative at
the center, and less representative at the window edges. Overlapping areas of the moving windows
ensure smooth transitions between tiles (i.e., blending). Table 4provides an example of the weight
calculation used, where conifer would be the class selected.
Table 4. Example of class weighting and label selection from windows.
Window 1 2 3 4 5 6 7 8 9
Class
Conifer Conifer Conifer
Water Mixed Mixed Shrub Water Mixed
Distance (pixels) 250 3500 5000 13,000 5000 9500 9400 7500 1000
Weight 0.94 0.819 0.70 0.074 0.706 0.263 0.271 0.454 0.929
Conifer count = 3, summed distance = 2.47
Mixed count = 3, summed distance = 1.90
Figure 2shows the windows configuration used for local classifier optimization and blending.
Overall, 1472 windows and corresponding RF models were generated and applied to produce the
land cover of Canada. Implementation of the above-described approach might not be feasible on a
regular workstation, for there is very high processing demand. Therefore, this study’s methods were
developed and deployed on the Government of Canada’s High-Performance Computing Centre (HPC).
In parallel processing mode on 1300 cores, processing time was about 2 h.
Remote Sens. 2017, 9, x FOR PEER REVIEW 7 of 18
classes. In Pouliot et al. [34], the effect of sample distribution was found to be as high as 30% with the
RF algorithm. Millard and Richardson [35] also showed RF’s strong sensitivity to sample distribution.
Thus, combining results from several possible sample distributions can improve accuracy, and has
the added benefit of blending across discontinuities.
The window size we used for each local classifier was 10,000 by 10,000 Landsat pixels, with a
horizontal and vertical window offset of 30%. In this configuration, each pixel was contained in nine
different windows. For each window, a RF classifier was trained and used to classify pixels in the
window. The class with the highest weight was selected as the final land cover label for the pixel. The
overall weight for a class was computed as a sum of sigmoid function of the pixel’s distance from the
window center. The sigmoid function was used to scale the output between 0 and 1. The distance
from the center was used because the sample distribution for the window is most representative at
the center, and less representative at the window edges. Overlapping areas of the moving windows
ensure smooth transitions between tiles (i.e., blending). Table 4 provides an example of the weight
calculation used, where conifer would be the class selected.
Table 4. Example of class weighting and label selection from windows.
Window 1 2 3 4 5 6 7 8 9
Class Conifer Conifer Conifer Water Mixed Mixed Shrub Water Mixed
Distance (pixels) 250 3500 5000 13,000 5000 9500 9400 7500 1000
Weight 0.94 0.819 0.70 0.074 0.706 0.263 0.271 0.454 0.929
Conifer count = 3, summed distance = 2.47
Mixed count = 3, summed distance = 1.90
Figure 2 shows the windows configuration used for local classifier optimization and blending.
Overall, 1472 windows and corresponding RF models were generated and applied to produce the
land cover of Canada. Implementation of the above-described approach might not be feasible on a
regular workstation, for there is very high processing demand. Therefore, this study’s methods were
developed and deployed on the Government of Canada’s High-Performance Computing Centre
(HPC). In parallel processing mode on 1300 cores, processing time was about 2 h.
Figure 2. Configuration of the local optimization windows.
2.3.3. Mapping of Urban and Agriculture Areas
Figure 2. Configuration of the local optimization windows.
Remote Sens. 2017,9, 1098 8 of 18
2.3.3. Mapping of Urban and Agriculture Areas
Urban mapping was developed separately. Initially, two density-based classes of industrial and
built-up lands were deciphered from road density data at 1 km and 5 km scales, Landsat reflectance
and the Nightly Mosaic of the Visible Infrared Imaging Radiometer Suite (VIIRS) data. These were
merged to a final urban and built-up class (class 17 in Table 3). For each tile, the two urban classes
were sampled and used to train RF with these features as inputs. Figure 3provides a schematic of the
processing applied. Results were generally very good, except in agriculture regions where the Landsat
data quality was poor. In these regions, considerable manual correction was undertaken. Roads were
also rasterized to 30 m, and added to the map.
Remote Sens. 2017, 9, x FOR PEER REVIEW 8 of 18
Urban mapping was developed separately. Initially, two density-based classes of industrial and
built-up lands were deciphered from road density data at 1 km and 5 km scales, Landsat reflectance
and the Nightly Mosaic of the Visible Infrared Imaging Radiometer Suite (VIIRS) data. These were
merged to a final urban and built-up class (class 17 in Table 3). For each tile, the two urban classes
were sampled and used to train RF with these features as inputs. Figure 3 provides a schematic of the
processing applied. Results were generally very good, except in agriculture regions where the
Landsat data quality was poor. In these regions, considerable manual correction was undertaken.
Roads were also rasterized to 30 m, and added to the map.
Figure 3. Approach used for mapping urban and built-up areas. True color image is from Google
Earth for 2015.
Highly dynamic areas such as agricultural regions present a particular challenge, since inter-
scene radiometric consistency can be affected by crop rotation and growing practices. The cropland
class was also mapped separately, and is based on the existing agriculture crop cover classification
dataset (Table 2), which was used to extract samples for training RF models.
2.3.4. Additional Corrections and Quality Control
Several additional processing steps were required to create the final product. In the far north,
above 83 degrees latitude, Landsat data is not collected. For this area, land cover from MODIS was
used [8]. In addition, in some regions of the north, missing values resulted from a low sun angle and
topography. For these areas, a majority filter within a 5 × 5 window was used to estimate missing
values. Finally, in mountain regions, strong topographic effects occurred due to shadows. To correct
this, a water mask derived from http://geogratis.cgdi.gc.ca and digital elevation data was used to
identify shadow-affected areas within the mountain regions. MODIS observations were then used to
estimate reflectance by averaging the spectrally closest 50 Landsat values to MODIS. As MODIS uses
different sun azimuth angles, it can obtain a non-shadow or reduced shadow observation. If no
suitable observations were identified, then the shadow-affected areas were filled with the majority
label of unaffected spatial neighbors (Figure 4).
Figure 3.
Approach used for mapping urban and built-up areas. True color image is from Google Earth
for 2015.
Highly dynamic areas such as agricultural regions present a particular challenge, since inter-scene
radiometric consistency can be affected by crop rotation and growing practices. The cropland class
was also mapped separately, and is based on the existing agriculture crop cover classification dataset
(Table 2), which was used to extract samples for training RF models.
2.3.4. Additional Corrections and Quality Control
Several additional processing steps were required to create the final product. In the far north,
above 83 degrees latitude, Landsat data is not collected. For this area, land cover from MODIS was
used [
8
]. In addition, in some regions of the north, missing values resulted from a low sun angle and
topography. For these areas, a majority filter within a 5
×
5 window was used to estimate missing
values. Finally, in mountain regions, strong topographic effects occurred due to shadows. To correct
this, a water mask derived from http://geogratis.cgdi.gc.ca and digital elevation data was used to
identify shadow-affected areas within the mountain regions. MODIS observations were then used
to estimate reflectance by averaging the spectrally closest 50 Landsat values to MODIS. As MODIS
Remote Sens. 2017,9, 1098 9 of 18
uses different sun azimuth angles, it can obtain a non-shadow or reduced shadow observation. If no
suitable observations were identified, then the shadow-affected areas were filled with the majority
label of unaffected spatial neighbors (Figure 4).
Remote Sens. 2017, 9, x FOR PEER REVIEW 9 of 18
Figure 4. Results of shadow correction in mountain regions, before correction (a) and after (b).
Post classification operations also included additional image processing performed in cases of
known confusion. For example, spectrally similar classes such as low biomass, cropland, and
grassland were confused with each other, or with tundra. Therefore, a tundra mask was generated
for the area north of the tree line, delimited by Timoney et al. [17], and an agriculture mask was
produced from a minimum red reflectance winter composite and an integrated summer Normalized
Difference Vegetation Index (NDVI) image. These masks were used to separate spectrally similar
pixels in the northern treeless and agriculture regions by changing the class labels for the appropriate
clusters beneath these masks. The water body areas were corrected using the water mask generated
from the National Hydro Network’s 1:50,000 scale data.
3. Results
Figure 5 shows the land cover classification generated using algorithms described in Section 2.3.
To achieve the required quality (i.e., accuracy and spatial consistency) over a large area is not a
simple, straightforward procedure. Any large-area land cover map constructed from remotely sensed
data is limited in the accuracies it can achieve. Challenges include satellite-sensor data limitations,
and issues associated with the generalization and abstraction of the real land surface. Most of the
land cover types at a large scale comprise a wide range of variation in vegetation species, structure,
and understory. In such a high-variance environment, mapping approaches need considerable
tuning to achieve consistent accuracy. The mapping approach used in this study includes systematic
qualitative examinations of the following map quality characteristics: (1) measure of classification
algorithm confidence, (2) land cover spatial distribution consistency, (3) assessment of known
regional land cover type confusions, and (4) representation of known local change drivers. Factors 1
and 2 are described in the next sections in further detail. Factors 3 and 4 are listed, as they are
generally important considerations for large-area land cover mapping.
Figure 4. Results of shadow correction in mountain regions, before correction (a) and after (b).
Post classification operations also included additional image processing performed in cases of
known confusion. For example, spectrally similar classes such as low biomass, cropland, and grassland
were confused with each other, or with tundra. Therefore, a tundra mask was generated for the
area north of the tree line, delimited by Timoney et al. [
17
], and an agriculture mask was produced
from a minimum red reflectance winter composite and an integrated summer Normalized Difference
Vegetation Index (NDVI) image. These masks were used to separate spectrally similar pixels in the
northern treeless and agriculture regions by changing the class labels for the appropriate clusters
beneath these masks. The water body areas were corrected using the water mask generated from the
National Hydro Network’s 1:50,000 scale data.
3. Results
Figure 5shows the land cover classification generated using algorithms described in Section 2.3.
To achieve the required quality (i.e., accuracy and spatial consistency) over a large area is not a simple,
straightforward procedure. Any large-area land cover map constructed from remotely sensed data
is limited in the accuracies it can achieve. Challenges include satellite-sensor data limitations, and
issues associated with the generalization and abstraction of the real land surface. Most of the land
cover types at a large scale comprise a wide range of variation in vegetation species, structure, and
understory. In such a high-variance environment, mapping approaches need considerable tuning to
achieve consistent accuracy. The mapping approach used in this study includes systematic qualitative
examinations of the following map quality characteristics: (1) measure of classification algorithm
confidence; (2) land cover spatial distribution consistency; (3) assessment of known regional land cover
type confusions; and (4) representation of known local change drivers. Factors 1 and 2 are described
in the next sections in further detail. Factors 3 and 4 are listed, as they are generally important
considerations for large-area land cover mapping.
Remote Sens. 2017,9, 1098 10 of 18
Remote Sens. 2017, 9, x FOR PEER REVIEW 10 of 18
Figure 5. Land cover of Canada at 30 m spatial resolution for 2010.
4. Discussion
4.1. Classification Algorithm Confidence
The posterior probability of the assigned label to each pixel is useful information for map
confidence-based quality and accuracy assessment [36]. The concept of confidence-based quality
assessment was first described in detail in the remote sensing literature by Strahler et al. [37]. In our
study, it is based on voting the random forest trees, and it is computed as the proportion of trees that
predict a certain label. The posterior probabilities can be used to quantify classification map quality
in a spatially explicit fashion (Figure 6a). This measure tends to follow accuracy; however, it is not
true accuracy, because if a given pixel is not similar to any of the training data, the classifier will still
assign a label and compute the posterior probability. The graph in Figure 6b shows the confidence
for each land cover type computed as the average confidence of all the pixels labeled with a given
class. Classes 17 and 15, urban (34%) and cropland (57%), showed the lowest confidence and
necessitated imposing additional revision of the mapping approach to achieve better classifier
confidence. A classifier for urban and cropland mapping was designed and trained separately with
additional features, as described in Section 2.3.4. The snow (17), water (18), and sub-polar barren–
lichenmoss (13) classes showed the highest mapping confidence (9397%), which was anticipated
considering their easily distinguishable signature. Forest classes (1, 2, 5, and 6) revealed good
mapping confidence (85–91%). Other classes were between 75% and 85%, with a somewhat lower
mapping confidence of 70% for the sub-polar shrubland–lichen–moss class. Overall, average
classification confidence weighted by class area was 87%. The results of the map confidence-based
quality analysis did not identify any additional issues.
Figure 5. Land cover of Canada at 30 m spatial resolution for 2010.
4. Discussion
4.1. Classification Algorithm Confidence
The posterior probability of the assigned label to each pixel is useful information for map
confidence-based quality and accuracy assessment [
36
]. The concept of confidence-based quality
assessment was first described in detail in the remote sensing literature by Strahler et al. [
37
]. In our
study, it is based on voting the random forest trees, and it is computed as the proportion of trees that
predict a certain label. The posterior probabilities can be used to quantify classification map quality in
a spatially explicit fashion (Figure 6a). This measure tends to follow accuracy; however, it is not true
accuracy, because if a given pixel is not similar to any of the training data, the classifier will still assign a
label and compute the posterior probability. The graph in Figure 6b shows the confidence for each land
cover type computed as the average confidence of all the pixels labeled with a given class. Classes 17
and 15, urban (34%) and cropland (57%), showed the lowest confidence and necessitated imposing
additional revision of the mapping approach to achieve better classifier confidence. A classifier
for urban and cropland mapping was designed and trained separately with additional features,
as described in Section 2.3.4. The snow (17), water (18), and sub-polar barren–lichen–moss (13) classes
showed the highest mapping confidence (93–97%), which was anticipated considering their easily
distinguishable signature. Forest classes (1, 2, 5, and 6) revealed good mapping confidence (85–91%).
Other classes were between 75% and 85%, with a somewhat lower mapping confidence of 70% for
the sub-polar shrubland–lichen–moss class. Overall, average classification confidence weighted by
class area was 87%. The results of the map confidence-based quality analysis did not identify any
additional issues.
Remote Sens. 2017,9, 1098 11 of 18
Figure 6.
(
a
) Example of classifier confidence i.e., posterior probability based on voting the random
forest trees; (b) average confidence by land cover type.
4.2. Land Cover Spatial Distribution Consistency
To assess spatial consistency between tiles and the expected cover distribution for a class,
the fraction of each class was depicted across Canada in a separate map layer at 300 m spatial resolution,
where the pixels’ values represented the percent occupied by the specific land cover class. As an
example of these maps, the temperate or sub-polar needle leaf forest fraction is shown in Figure 7a,
and the sub-polar or polar grassland–lichen–moss fraction is shown in Figure 7b. Such maps were
generated and closely examined for each class. The examination of the fractional maps derived from
final land cover data did not reveal any discontinuity between mapping tiles, which confirmed the
effectiveness of the local optimization and blending procedures described in Section 2.3.3.
Remote Sens. 2017, 9, x FOR PEER REVIEW 11 of 18
Figure 6. (a) Example of classifier confidence i.e., posterior probability based on voting the random
forest trees; (b) average confidence by land cover type.
4.2. Land Cover Spatial Distribution Consistency
To assess spatial consistency between tiles and the expected cover distribution for a class, the
fraction of each class was depicted across Canada in a separate map layer at 300 m spatial resolution,
where the pixels’ values represented the percent occupied by the specific land cover class. As an
example of these maps, the temperate or sub-polar needle leaf forest fraction is shown in Figure 7a,
and the sub-polar or polar grasslandlichen–moss fraction is shown in Figure 7b. Such maps were
generated and closely examined for each class. The examination of the fractional maps derived from
final land cover data did not reveal any discontinuity between mapping tiles, which confirmed the
effectiveness of the local optimization and blending procedures described in Section 2.3.3.
Figure 7. Fractional maps: (a) Temperate or sub-polar needleleaf forest; (b) Sub-polar or polar
grassland–lichen–moss.
The fractional maps were also used to assess the correctness of the spatial distribution of each
land cover class using class-particular characteristics. For example, Figure 6a illustrates the extent of
needleleaf forest that generally agrees with the expected class distribution (i.e., absence of the forest
Figure 7.
Fractional maps: (
a
) Temperate or sub-polar needleleaf forest; (
b
) Sub-polar or
polar grassland–lichen–moss.
Remote Sens. 2017,9, 1098 12 of 18
The fractional maps were also used to assess the correctness of the spatial distribution of each
land cover class using class-particular characteristics. For example, Figure 6a illustrates the extent of
needleleaf forest that generally agrees with the expected class distribution (i.e., absence of the forest
above the treeless line, and a very small amount of needleleaf forest in the prairie region). Moreover,
such analysis of fractional maps helped in ensuring separation between temporal, sub-polar, and
polar classes. Some corrections were required to spatially separate temporal or sub-polar shrubland
from sub-polar or polar shrubland–lichen–moss, and temporal grassland from sub-polar or polar
grassland classes.
The fraction maps were also useful in highlighting discontinuities between tiles and showing the
effect of the window-based local optimization and blending methodology. Figure 8shows the effect of
blending for the mixed forest class. Often, these discontinuities can be missed when examining land
cover, as less frequent classes are drowned-out relative to the dominant class in a region.
Remote Sens. 2017, 9, x FOR PEER REVIEW 12 of 18
above the treeless line, and a very small amount of needleleaf forest in the prairie region). Moreover,
such analysis of fractional maps helped in ensuring separation between temporal, sub-polar, and
polar classes. Some corrections were required to spatially separate temporal or sub-polar shrubland
from sub-polar or polar shrubland–lichen–moss, and temporal grassland from sub-polar or polar
grassland classes.
The fraction maps were also useful in highlighting discontinuities between tiles and showing
the effect of the window-based local optimization and blending methodology. Figure 8 shows the
effect of blending for the mixed forest class. Often, these discontinuities can be missed when
examining land cover, as less frequent classes are drowned-out relative to the dominant class in a
region.
Figure 8. The effect of the window-based local optimization on the mixed forest class across tile
boundary 4-2 to 4-3, before (a) and after (b).
4.3. Local Classifier Characteristics
To investigate the effect of the described mapping approach with classifier local optimization on
the land cover accuracy, we carried out a detailed analysis over a region in northeast Alberta. The
region is 200 km × 350 km in extent, and represents boreal forest with very diverse land cover
distribution and a significant amount of disturbance caused by intensive industrial development. An
additional reason for selecting this region was the availability of sufficient reference data collected
through intensive fieldwork and the availability of very fine resolution images. The training dataset
contained 18,500 reference samples (i.e., 0.024% of the mapping area). Independent validation data
contained 800 ground truth samples compiled from a number of fieldwork campaigns performed
over the 2009–2011 period.
To assess the influence of local sample selection (i.e., signature extension), the diameter of the
sample-searching circle was varied from 30 km to 200 km, in 30 km increments. The study area was
divided into 112 classification windows of 24 km × 24 km (Figure 9). A local RF model was trained
for each classification window using all of the samples inside the corresponding sample-searching
circle. Overall, six land cover maps were produced and compared against the validation data set.
Figure 9b shows an increase of the overall land cover mapping accuracy with localization of the
training sample. The overall mapping accuracy changed from 67% to 81% as a function of the
diameter of the sample-searching circle. Figure 9c shows the comparison between the accuracy of
land cover maps produced with the model trained with all reference data (i.e., traditional random
forest approach) versus that of land cover maps produced with the models trained using only local
subsamples. Table 5 presents an evaluation of the statistical significance of the difference [38] between
classifications accuracies produced by the traditional random forest approach and the approach
using local optimization. The largest gain in accuracy was achieved when mapping land cover classes
Figure 8.
The effect of the window-based local optimization on the mixed forest class across tile
boundary 4-2 to 4-3, before (a) and after (b).
4.3. Local Classifier Characteristics
To investigate the effect of the described mapping approach with classifier local optimization
on the land cover accuracy, we carried out a detailed analysis over a region in northeast Alberta.
The region is 200 km
×
350 km in extent, and represents boreal forest with very diverse land cover
distribution and a significant amount of disturbance caused by intensive industrial development.
An additional reason for selecting this region was the availability of sufficient reference data collected
through intensive fieldwork and the availability of very fine resolution images. The training dataset
contained 18,500 reference samples (i.e., 0.024% of the mapping area). Independent validation data
contained 800 ground truth samples compiled from a number of fieldwork campaigns performed over
the 2009–2011 period.
To assess the influence of local sample selection (i.e., signature extension), the diameter of the
sample-searching circle was varied from 30 km to 200 km, in 30 km increments. The study area was
divided into 112 classification windows of 24 km
×
24 km (Figure 9). A local RF model was trained
for each classification window using all of the samples inside the corresponding sample-searching
circle. Overall, six land cover maps were produced and compared against the validation data set.
Figure 9b shows an increase of the overall land cover mapping accuracy with localization of the
training sample. The overall mapping accuracy changed from 67% to 81% as a function of the
diameter of the sample-searching circle. Figure 9c shows the comparison between the accuracy of
land cover maps produced with the model trained with all reference data (i.e., traditional random
forest approach) versus that of land cover maps produced with the models trained using only local
Remote Sens. 2017,9, 1098 13 of 18
subsamples. Table 5presents an evaluation of the statistical significance of the difference [
38
] between
classifications accuracies produced by the traditional random forest approach and the approach using
local optimization. The largest gain in accuracy was achieved when mapping land cover classes had
smaller areal extents. When land cover classes had larger areal extents, which were mostly uniformly
distributed, the gain in accuracy using locally optimized models was smaller. The increase of accuracy
for localized classes is understandable, because when the training sample is also localized, there is less
probability that classes occurring with greater frequency (i.e., extent) can overwhelm the model, and be
assigned to given pixels at the expense of more infrequent but legitimate land cover classes. Another
advantage is that the locally optimized RF model is less likely to result in the spectral confusion of
classes with similar spectral signatures, but different spatial distribution. Shortcomings of the approach
include the need for a large number of samples that are uniformly distributed spatially, and increased
processing time.
Remote Sens. 2017, 9, x FOR PEER REVIEW 13 of 18
had smaller areal extents. When land cover classes had larger areal extents, which were mostly
uniformly distributed, the gain in accuracy using locally optimized models was smaller. The increase
of accuracy for localized classes is understandable, because when the training sample is also
localized, there is less probability that classes occurring with greater frequency (i.e., extent) can
overwhelm the model, and be assigned to given pixels at the expense of more infrequent but
legitimate land cover classes. Another advantage is that the locally optimized RF model is less likely
to result in the spectral confusion of classes with similar spectral signatures, but different spatial
distribution. Shortcomings of the approach include the need for a large number of samples that are
uniformly distributed spatially, and increased processing time.
Figure 9. Results of the land cover classification accuracy produced by the local classifier
optimization. The red squares in image (a) are the areas classified using a local model. Each local
model is trained using the sample found inside the sample-search circle. Circles of different
circumference are shown around each area being classified. Land cover mapping accuracy as a
function of the size of the searching-samples circle is presented in graph (b). Land cover mapping
accuracy for two boundary cases are presented in graph (c).
Table 5. Summary of evaluation of the statistical difference between the accuracies of classifications
produced by traditional random forest (RF) and local optimization approach.
Comparison of Kappa Coefficients Comparison of Proportions
Class.
1 Class. 2 k1 k2 k1k2 Significant? x1/n1 x2/n2 x1/n1x2/n2 |z| Significant?
RF Local RF 0.63 0.78 0.155 Yes, 0.1% 0.67 0.82 0.141 10.9 Yes, 0.1%
4.4. Land Cover Map Accuracy Assessment
The accuracy of the Land Cover Map of Canada 2010 was assessed against 2811 reference
samples (Figure 10) For 839 randomly selected samples (Figure 10a), labels were determined through
land cover expert interpretation using Google Earth fine resolution images and a false color
composite (R–near-infrared (NIR), G–short-wave infrared (SWIR), Bred band) from Landsat ETM+
acquired in 2010. The rest of the data are ground truth data from six different regions (Figure 10b)
Figure 9.
Results of the land cover classification accuracy produced by the local classifier optimization.
The red squares in image (
a
) are the areas classified using a local model. Each local model is trained
using the sample found inside the sample-search circle. Circles of different circumference are shown
around each area being classified. Land cover mapping accuracy as a function of the size of the
searching-samples circle is presented in graph (
b
). Land cover mapping accuracy for two boundary
cases are presented in graph (c).
Table 5.
Summary of evaluation of the statistical difference between the accuracies of classifications
produced by traditional random forest (RF) and local optimization approach.
Comparison of Kappa Coefficients Comparison of Proportions
Class. 1 Class. 2 k1 k2
k1
k2
Significant? x1/n1 x2/n2
x1/n1
x2/n2 |z|
Significant?
RF Local RF
0.63 0.78
0.155
Yes, 0.1% 0.67 0.82 0.141
10.9
Yes, 0.1%
Remote Sens. 2017,9, 1098 14 of 18
4.4. Land Cover Map Accuracy Assessment
The accuracy of the Land Cover Map of Canada 2010 was assessed against 2811 reference samples
(Figure 10) For 839 randomly selected samples (Figure 10a), labels were determined through land
cover expert interpretation using Google Earth fine resolution images and a false color composite
(R–near-infrared (NIR), G–short-wave infrared (SWIR), B–red band) from Landsat ETM+ acquired in
2010. The rest of the data are ground truth data from six different regions (Figure 10b) acquired during
field campaigns linked to various CCRS projects between 2009 and 2011. For each sample, the land
cover class was defined based on photos and fieldwork records.
Remote Sens. 2017, 9, x FOR PEER REVIEW 14 of 18
acquired during field campaigns linked to various CCRS projects between 2009 and 2011. For each
sample, the land cover class was defined based on photos and fieldwork records.
Figure 10. Spatial distribution of the reference samples used in accuracy assessment (a) obtained from
fine resolution data interpretation, and (b) acquired during field work ground truth.
Primary and alternate land cover labels for each reference sample were determined for
minimum mapping units (MMU) of both 1 × 1 pixel and 12 × 12 pixels. For any pixel with a mixed
land cover composition, an interpreter assigned a label based on the most abundant land cover type
present. The assessment includes both land cover maps with 1 × 1 and 12 × 12 MMU. The error matrix
in Table 6 summarizes the accuracy assessment results using only primary labels, which were
tabulated by combining all of the reference samples from both datasets. It shows that 2180 out of 2811
(77.60%) samples are in agreement with land cover classifications. Commission and omission errors
are specified in columns and rows. In order to report accuracy that reflects the overall land cover
map, the users and producer’s accuracy were weighted by the class area for 1 × 1 and 12 × 12 MMU
maps. Table 7 shows the overall average accuracy for two definitions of agreement: (1) an overall
accuracy when agreement is defined as a match between the map class and either the primary or
alternate reference, and (2) when agreement is defined using only the primary reference label.
Figure 10.
Spatial distribution of the reference samples used in accuracy assessment (
a
) obtained from
fine resolution data interpretation; and (b) acquired during field work ground truth.
Primary and alternate land cover labels for each reference sample were determined for minimum
mapping units (MMU) of both 1
×
1 pixel and 12
×
12 pixels. For any pixel with a mixed land cover
composition, an interpreter assigned a label based on the most abundant land cover type present.
The assessment includes both land cover maps with 1
×
1 and 12
×
12 MMU. The error matrix in
Table 6summarizes the accuracy assessment results using only primary labels, which were tabulated
by combining all of the reference samples from both datasets. It shows that 2180 out of 2811 (77.60%)
samples are in agreement with land cover classifications. Commission and omission errors are specified
in columns and rows. In order to report accuracy that reflects the overall land cover map, the user’s
and producer’s accuracy were weighted by the class area for 1
×
1 and 12
×
12 MMU maps. Table 7
shows the overall average accuracy for two definitions of agreement: (1) an overall accuracy when
agreement is defined as a match between the map class and either the primary or alternate reference;
and (2) when agreement is defined using only the primary reference label.
The error matrix analysis reveals anticipated sources of spectral confusion among land cover
classes. Adjacent forest classes tend to be confused, for example, coniferous with mixed forest,
as well as deciduous forest, shrubland, shrub-covered wetlands, and certain croplands. These classes
are difficult to separate with spectral data alone, since all are primarily broadleaved deciduous.
Radiometric saturation occurs at the leaf area index levels in the range of 3–5 [
39
], which is typical
of all three classes. Confusion also arose with the herbaceous class, which was misidentified as
either low biomass croplands, or sparse conifer along the tree line consisting of open treed areas with
herbaceous understory. Confusion also occurred between the herbaceous and sub-polar shrub classes
due to the relatively small spectral differences between them. Finally, reference data indicated that
the lichen–moss class was confused with either the herbaceous or the wetland classes, due to the
prevalence of both lichen and moss in certain wetlands, and the low biomass, which characterizes both
the lichen–moss and herbaceous classes.
Remote Sens. 2017,9, 1098 15 of 18
Table 6.
Error matrix for the Land Cover Map of Canada 2010 with agreement defined as a match between the map label and the primary reference label. Rows of the
error matrix are the map classes, and columns are the reference classes.
Predicted vs. Reference Reference
Class 1 2 5 6 8 10 11 12 13 14 15 16 17 18 19 User’s Accuracy
1. Temperate or sub-polar needleleaf forest 381 1 10 27 21 4 0 0 0 18 0 5 0 1 0 381 468 81.4
2. Sub-polar taiga needleleaf forest 7 13 0 0 1 2 1 0 0 4 0 0 0 0 0 13 28 46.4
5. Temperate or sub-polar broadleaf forest 14 2 117 13 24 6 3 1 0 2 4 2 2 0 0 117 190 61.6
6. Mixed forest 33 0 15 88 15 1 0 0 0 0 0 1 1 1 0 88 155 56.8
8. Temperate or sub-polar shrubland 15 3 5 6 154 13 22 0 0 7 5 6 0 0 0 154 236 65.3
10. Temperate or sub-polar grassland 5 2 1 1 17 92 0 2 1 1 6 23 0 0 0 92 151 60.9
11. Sub-polar or polar shrubland-lichen-moss 0 0 0 0 8 0 29 0 0 0 0 0 0 0 0 29 37 78.4
12. Sub-polar or polar grassland-lichen-mod 2 0 0 1 1 1 1 33 0 0 0 4 0 1 0 33 44 75.0
13. Sub-polar or polar barren-lichen-moss 0 0 0 0 0 0 0 0 13 6 0 6 0 0 1 13 26 50.0
14. Wetland 6 0 2 3 8 5 1 2 1 67 0 2 0 1 0 67 98 68.4
15. Cropland 5 0 18 2 23 49 0 0 0 4 735 0 3 0 0 735 839 87.6
16. Barren land 2 1 0 2 1 5 1 0 1 0 5 86 1 1 6 86 112 76.8
17. Urban 2 0 6 0 5 6 0 0 0 0 8 5 124 0 0 124 156 79.5
18. Water 1 0 1 0 3 0 2 0 0 5 1 1 0 210 0 210 224 93.8
19. Snow and ice 0 0 0 0 0 0 0 0 0 0 0 9 0 0 38 38 47 80.9
number of samples 473 22 175 143 281 184 60 38 16 114 764 150 131 215 45 2180
Producer’s Accuracy 80.5 59.1 66.9 61.5 54.8 50.0 48.3 86.8 81.3 58.8 96.2 57.3 94.7 97.7 84.4
Overall
2811 77.6
Remote Sens. 2017,9, 1098 16 of 18
Table 7.
Average accuracy for the land cover maps with 1
×
1 and 12
×
12 minimum mapping
units (MMU).
First Call Only User’s Producer ’s
1×1 MMU pixel count 74.84 73.88
12 ×12 MMU pixel count 76.12 74.70
First and Second Call User’s Producer’s
1×1 MMU pixel count 85.96 84.15
12 ×12 MMU pixel count 86.05 84.33
5. Conclusions
A nationally consistent map depicting the distribution of land cover with a high spatial resolution
(~30 m) is an urgent requirement for various scientific, policy, and reporting purposes, and has only
recently become fully feasible. Based on research carried out at the Canada Centre for Remote Sensing,
this paper describes a new methodology that makes optimum use of satellite data to generate a very
accurate product while minimizing the costs of a mapping program at the national scale. It relies
on expert knowledge for generating initial reference data, which are used to train machine-learning
algorithms operating in a high-performance computing environment that essentially enables intensive,
fully automated mapping. The innovative features of the new methodology are: classifier local
optimization and blending; quantitative confidence assessment based on overlapping areas and
posterior probability of the assigned labels; and thoughtful involvement of the analyst at key stages of
the quality assessment. Results show that the approach achieved the objective of spatial consistency and
reasonable accuracy at approximately 75–80%, depending on the nature of the assessment. However,
for some classes, accuracy was considered too low, and needs to be improved.
Acknowledgments:
This research was financially supported by the Canadian Space Agency under Government
Related Initiatives Program (Grant No. 17MOA41001). The authors would like to thank Calvin Poff for assisting
in much of the quality control efforts undertaken in this work.
Author Contributions:
Rasim Latifovic wrote the manuscript with contributions from all authors. Rasim Latifovic
developed and implemented the software, performed the processing and analyzed the results; Rasim Latifovic
and Darren Pouliot conceived and designed the mapping approach; Ian Olthof contributed to reference data
processing and analysis.
Conflicts of Interest: The authors declare no conflict of interest.
References
1.
DB Geoservices Inc. GeoBase Land Cover Products User Needs Assessment; Agriculture and Agri-Food Canada:
Ottawa, ON, Canada, 2008.
2.
GCOS. Systematic Observation Requirements for Satellite-Based Products for Climate Supplemental Details to the
Satellite-Based Component of the Implementation Plan for the Global Observing System for Climate in Support of
the UNFCCC; World Meteorological Organization (WMO): Geneva, Switzerland, 2006. Available online:
https://library.wmo.int/opac/doc_num.php?explnum_id=3710 (accessed on 26 October 2017).
3.
CEOS. Satellite Observation of the Climate System: The Committee on Earth Observation Satellites (CEOS)
Response to the Implementation Plan for the Global Observing System for Climate in Support of the
UNFCCC. 2006; p. 54. Available online: ftp://ftp.ncdc.noaa.gov/pub/data/sds/CEOS-Response-to-the-
GCOS-IP.pdf (accessed on 26 October 2017).
4.
Global Terrestrial Observing System (GTOS). Terrestrial Essential Climate Variables. Biennial Report
Supplement for Climate Change Assessment, Mitigation and Adaptation. 2008, p. 44. Available online:
ftp://ftp.fao.org/docrep/fao/011/i0197e/i0197e.pdf (accessed on 26 October 2017).
5.
Beaubien, J.; Cihlar, J.; Simard, G.; Latifovic, R. Land Cover from Multiple Thematic Mapper Scenes Using a
New Enhancement-Classification Methodology. J. Geophys. Res. Atmos. 1999,104, 27909–27920. [CrossRef]
Remote Sens. 2017,9, 1098 17 of 18
6.
Latifovic, R.; Zhu, Z.L.; Josef, C.; Chandra, G.; Ian, O. Land Cover Mapping of North and Central
America—Global Land Cover 2000. Remote Sens. Environ. 2004,89, 116–127. [CrossRef]
7.
Pouliot, D.; Latifovic, R.; Zabcic, N.; Guindon, L.; Olthof, I. Development and Assessment of a 250 m Spatial
Resolution MODIS Annual Land Cover Time Series (2000–2011) for the Forest Region of Canada Derived
from Change-Based Updating. Remote Sens. Environ. 2014,140, 731–743. [CrossRef]
8.
Latifovic, R.; Homer, C.; Ressl, R.; Pouliot, D.; Hossain, S.N.; Colditz, R.R.; Olthof, I.; Giri, C.P.; Victoria, A.
North American Land Change Monitoring System. Remote Sens. Land Use Land Cover Princ. Appl.
2012
,
303–324.
9.
Colditz, R.R.; Pouliot, D.; Llamas, R.M.; Homer, C.; Latifovic, R.; Ressl, R.A.; Tovar, C.M.; Hernandez, A.V.;
Richardson, K. Detection of North American Land Cover Change between 2005 and 2010 with 250 m MODIS
Data. Photogramm. Eng. Remote Sens. 2014,7, 3358–3372.
10.
Latifovic, R.; Pouliot, D.A. Multi-Temporal Land Cover Mapping for Canada: Methodology and Products.
Can. J. Remote Sens. 2005,31, 347–363. [CrossRef]
11.
Latifovic, R.; Pouliot, D.; Sun, L.; Schwarz, J.; Parkinson, W. Moderate Resolution Time Series Data
Management and Analysis: Automated Large Area Mosaicking and Quality Control; Geomatics Canada.
Open File 2015,6, 25. Available online: https://doi.org/10.4095/296204 (accessed on 26 October 2017).
12.
National Road Network, 1:50,000 Scale Geogratis. Available online: http://www.nrcan.gc.ca/earth-sciences/
geography/topographic-information/free-data-geogratis/whats-new/17213 (accessed on 26 October 2017).
13.
SILC: Satellite Database for the Land cover of Canada Sample of LANDSAT TM/ETM+ Scenes (30 m
Resolution). Available online: ftp://ftp.ccrs.nrcan.gc.ca/ad/silc (accessed on 26 October 2017).
14.
Wulder, M.A.; Cranny, M.M.; Hall, R.J.; Luther, J.E.; Beaudoin, A.; White, J.C.; Goodenough, D.G.; Dechka, J.A.
Satellite Land Cover Mapping of Canada’s Forests: The EOSD Land Cover Project; Campbell, J.C., Jones, K.B.,
Smith, J.H., Koeppe, M.T., Eds.; North America Land Cover Summit; American Association of Geographers:
Washington, DC, USA, 2008; pp. 21–30. Available online: https://cfs.nrcan.gc.ca/publications?id=29220
(accessed on 26 October 2017).
15.
Olthof, I.; Latifovic, R.; Pouliot, D. Development of a Circa 2000 Land Cover Map of Northern Canada at
30 m Resolution from Landsat. Can. J. Remote Sens. 2009,35, 152–165. [CrossRef]
16.
Annual Crop Inventory. Available online: http://open.canada.ca/data/en/dataset/ba2645d5-4458-414d-
b196-6303ac06c1c9 (accessed on 26 October 2017).
17.
Timoney, K.P.; Roi, G.H.L.; Zoltai, S.C.; Robinson, A.L. The High Subarctic Forest-Tundra of Northwestern
Canada: Position, Width, and Vegetation Gradients in Relation to Climate. ARCTIC
1992
,45, 1–9. [CrossRef]
18.
De Groot, W.J.; Landry, R.; Kurz, W.A.; Anderson, K.R.; Englefield, P.; Fraser, R.H.; Hall, R.J.; Banfield, G.E.;
Raymond, D.A.; Decker, V.; et al. Estimating Direct Carbon Emissions from Canadian Wildland Fires.
Int. J. Wildland Fire 2007,16, 593–606. [CrossRef]
19.
National Centers for Environmental Information. Available online: https://www.ncei.noaa.gov/ (accessed
on 28 August 2017).
20.
Breimann, L.; Friedman, J.; Ohlsen, R.; Stone, C. Classification and Regression Trees; Taylor & Francis Group,
LLC: Abingdon, UK, 1984.
21.
DeFries, R.; Hansen, M.; Townshend, J.R.G.; Sohlberg, R. Global Land Cover Classifications at 8 km Spatial
Resolution: The Use of Training Data Derived from Landsat Imagery in Decision Tree Classifiers. Int. J.
Remote Sens. 1998,19, 3141–3168. [CrossRef]
22.
Hansen, M.; Dubayah, R.; DeFries, R. Classification Trees: An Alternative to Traditional Land Cover
Classifiers. Int. J. Remote Sens. 1996,17, 1075–1081. [CrossRef]
23.
Friedl, M.A.; Brodley, C.E.; Strahler, A. Land Cover Classification Accuracies Produced by Decision Trees at
Continental to Global Scales. IEEE Trans. Geosci. Remote Sens. 1999,37, 969–977. [CrossRef]
24.
Friedl, M.A.; McIver, D.K.; Hodges, J.C.F.; Zhang, X.Y.; Muchoney, D.; Strahler, A.H.; Woodcock, C.E.;
Gopal, S.; Schneider, A.; Cooper, A.; et al. Global Land Cover Mapping from MODIS: Algorithms and Early
Results. Remote Sens. Environ. 2002,83, 287–302. [CrossRef]
25.
Friedl, M.A.; Sulla-Menashe, D.; Tan, B.; Schneider, A.; Ramankutty, N.; Sibley, A.; Xiaoman, H.
MODIS Collection 5 Global Land Cover: Algorithm Refinements and Characterization of New Datasets.
Remote Sens. Environ. 2010,114, 168–182. [CrossRef]
26.
Gislason, P.O.; Benediktsson, J.A.; Sveinsson, J.R. Pattern Recognition Letters—Special Issue: Pattern Recognit.
Remote Sens. Arch. 2006,27, 294–300. [CrossRef]
Remote Sens. 2017,9, 1098 18 of 18
27.
Pal, M. Random Forest Classifier for Remote Sensing Classification. Int. J. Remote Sens.
2005
,26, 217–222.
[CrossRef]
28.
Rodriguez-Galiano, V.F.; Ghimire, B.; Rogan, J.; Chica-Olmo, M.; Rigol-Sanchez, J.P. An Assessment of the
Effectiveness of a Random Forest Classifier for Land-Cover Classification. ISPRS J. Photogramm. Remote Sens.
2012,67, 93–104. [CrossRef]
29. Breimann, L. Random Forests. Mach. Learn. 2001,45, 5–32. [CrossRef]
30. Chittineni, C.B. Signature extension in remote sensing. Pattern Recognit. 1980,12, 243–249. [CrossRef]
31.
Fernandes, R.; Fraser, R.; Latifovic, R.; Cihlar, J.; Beaubien, J.; Du, Y. Approaches to Fractional Land Cover
and Continuous Field Mapping: A Comparative Assessment over the BOREAS Study Region. Remote Sens.
Environ. 2004,89, 234–251. [CrossRef]
32.
Cihlar, J.; Guindon, B.; Beaubien, J.; Latifovic, R.; Peddle, D.R.; Wulder, M.A.; Fernandes, R.; Kerr, J.
From Need to Product: A Methodology for Completing a Land Cover Map of Canada with Landsat Data.
Can. J. Remote Sens. 2003,29, 171–186. [CrossRef]
33.
Homer, C.; Dewitz, J.; Fry, J.; Coan, M.; Hossain, N.; Larson, C.; Herold, N.; McKerrow, A.; VanDriel, J.N.;
Wickham, J. Completion of the 2001 National Land Cover Database for the Conterminous United States.
Photogramm. Eng. Remote Sens. 2007,73, 337–341.
34. Pouliot, D.; Latifovic, R.; Parkinson, W. Influence of Sample Distribution and Prior Probability Adjustment
on Land Cover Classification. Geomatics Canada, Open File 23, 2016, 13 pages. Available online:
http://ftp.maps.canada.ca/pub/nrcan_rncan/publications/ess_sst/297/297517/of_0023_gc.pdf (accessed
on 26 October 2017).
35. Millard, K.; Richardson, M. On the Importance of Training Data Sample Selection in Random Forest Image
Classification: A Case Study in Peatland Ecosystem Mapping. Remote Sens. 2015,7, 8489–8515. [CrossRef]
36.
Strahler, A.H.; Boschetti, L.; Foody, G.M.; Friedl, M.A.; Hansen, M.C.; Herold, M.; Mayaux, P.; Morisette, J.T.;
Stehman, S.V.; Woodcock, C.E. Global Land Cover Validation: Recommendations for Evaluation and Accuracy
Assessment of Global Land Cover Maps; Report of Institute of Environmental Sustainability Global Land
Analysis & Discovery (GLAD): College Park, MD, USA, 2006. Available online: http://nofc.cfs.nrcan.gc.ca/
gofc-gold/Report%20Series/GOLD_25.pdf (accessed on 26 October 2017).
37.
Foody, G.; Campbell, N.; Trodd, N.; Wood, T. Derivation and Applications of Probabilistic Measures of
Class Membership from the Maximum-Likelihood Classification. Photogramm. Eng. Remote Sens.
1992
,58,
1335–1341.
38.
Foody, G.M. Thematic Map Comparison: Evaluating the Statistical Significance of Differences in
Classification Accuracy. Photogramm. Eng. Remote Sens. 2004,70, 627–633. [CrossRef]
39.
Turner, D.P.; Cohen, W.B.; Kennedy, R.E.; Fassnacht, K.S.; Briggs, J.M. Relationships between Leaf Area Index
and Landsat TM Spectral Vegetation Indices across Three Temperate Zone Sites. Remote Sens. Environ.
1999
,
70, 52–68. [CrossRef]
©
2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
... Land cover classes, mostly vegetation communities, can be mapped at various scales. Natural Resources Canada maps land cover across the entire country (and in conjunction with other North American countries) at 30 m resolution (Land Cover of Canada: 1: 50,000 to 1: 250,000) in 19 classes, 14 of which occur in the IPCA (Latifovic et al. 2017, Natural Resources Canada 2020. The British Columbia (BC) biogeoclimatic mapping (BC Forest Service 2021) program pioneered detailed mapping (c. ...
... At this scale, high elevations (alpine zones) are dominated by barren land, temperate or sub-polar shrubland, and temperate or sub-polar grassland. In the northwest portions of the region, there are higher proportions of temperate or sub-polar grassland, temperate or sub-polar shrubland, and mixed wood, at the expense of barren land, and temperate or sub-polar needleleaf forest (Latifovic et al. 2017, Natural Resources Canada 2020. At this scale and detail in mapping, the IPCA well represents the broad vegetation patterns found through most of the region, while the former Beringian glacial refuge would require its own representation in protection. ...
... First, through the Boreal High and Boreal Subalpine Climate Zones as well as some bog (muskeg) wetlands, mature conifer tree density is highly variable and conifers can be widely dispersed. Pixels may have been classified differently by the two models depending on the chosen spectral threshold for discriminating between needleleaf and shrub dominance in the satellite imagery (as suggested by Latifovic et al. 2017). Second, the PEM classified land cover in all forest fire burns from 2000-2010 as shrub (Grods et al. 2013b), perhaps missing some stands dominated by conifer regeneration. ...
Technical Report
Full-text available
The purpose of this document is to synthesize and present biophysical information, from the scientific and technical literature, that addresses the suitability of a National Park within all or a portion of the Ross River Dena Council’s Indigenous Protected and Conserved Area (IPCA). We intend this synthesis to inform and support negotiations between the Ross River Dena Council (RRDC) and the Governments of Yukon and Canada regarding the feasibility of all or a portion of the IPCA being co-designated as a National Park, perhaps in conjunction with other conservation designations, in support of the Ross River Dena’s vision for the IPCA. In this Summary, we present an overview of the key themes and findings of this investigation and synthesis.
... The United States Geological Survey archive of Landsat imagery has provided open and free access to georeferenced and spectrally corrected analysis-ready imagery (Wulder et al., 2012), which makes it possible to generate time series of LC maps to study LC change. Recently two of these products based on Landsat imagery were generated over Canada, including the North America Land Change Monitoring System (NALCMS) LC dataset (Latifovic et al., 2017) and the Virtual Land Cover Engine (VLCE) framework-generated LC dataset (Hermosilla et al., 2018). ...
... Based on the random forest algorithm and local optimization method, the Canada Centre for Remote Sensing has generated the NALCMS LC maps of Canada for the years 2010 and 2015 at 30 m resolution using Landsat imagery (Latifovic et al., 2017). These LC products are the Canadian contribution to the 30 m resolution 2010/2015 LC map of North America to the joint collaborative effort by the Mexican, American, and Canadian government institutions under the NALCMS umbrella. ...
... The NALCMS LC map has 19 classes based on the United Nations Land Cover Classification System (LCCS; Di Gregorio, 2005). Assessment based on reference samples showed an overall accuracy of 76.6 % for the year 2010 data (Latifovic et al., 2017), which is used in this study. ...
Article
Full-text available
Plant functional types (PFTs) are used to represent vegetation distribution in land surface models (LSMs). Previous studies have shown large differences in the geographical distribution of PFTs currently used in various LSMs, which may arise from the differences in the underlying land cover products but also the methods used to map or reclassify land cover data to the PFTs that a given LSM represents. There are large uncertainties associated with existing PFT mapping methods since they are largely based on expert judgement and therefore are subjective. In this study, we propose a new approach to inform the mapping or the cross-walking process using analyses from sub-pixel fractional error matrices, which allows for a quantitative assessment of the fractional composition of the land cover categories in a dataset. We use the Climate Change Initiative (CCI) land cover product produced by the European Space Agency (ESA). Previous work has shown that compared to fine-resolution maps over Canada, the ESA-CCI product provides an improved land cover distribution compared to that from the GLC2000 dataset currently used in the CLASSIC (Canadian Land Surface Scheme Including Biogeochemical Cycles) model. A tree cover fraction dataset and a fine-resolution land cover map over Canada are used to compute the sub-pixel fractional composition of the land cover classes in ESA-CCI, which is then used to create a cross-walking table for mapping the ESA-CCI land cover categories to nine PFTs represented in the CLASSIC model. There are large differences between the new PFT distributions and those currently used in the model. Offline simulations performed with the CLASSIC model using the ESA-CCI-based PFTs show improved winter albedo compared to that based on the GLC2000 dataset. This emphasizes the importance of accurate representation of vegetation distribution for realistic simulation of surface albedo in LSMs. Results in this study suggest that the sub-pixel fractional composition analyses are an effective way to reduce uncertainties in the PFT mapping process and therefore, to some extent, objectify the otherwise subjective process.
... [29]. (b) The land cover in the study areas, where detailed labeling information can be found in [30] for Alaska and [31] for Canada. ...
... Beneath this surface layer [29]. (b) The land cover in the study areas, where detailed labeling information can be found in [30] for Alaska and [31] for Canada. ...
... Figure 4 depicts the time series of the interferometric coherence for eight typical landscape features in continuous permafrost terrain across consecutive thawing seasons: (1) rock, (2) exposed land, (3) sedge, (4) wetland low vegetation, (5) lichen, (6) grass, (7) shrub, and (8) needleleaf forest. The areas containing these landscapes were identified in the study area based on the National Land Cover Database (NLCD) land cover data for Alaska [30] and Canadian land cover data [31]. Additionally, 1 km × 1 km polygons were extracted to investigate the temporal changes in the interferometric coherence. ...
Article
Full-text available
In the context of global warming, the accelerated degradation of circum-Arctic permafrost is releasing a significant amount of carbon. InSAR can indirectly reflect the degradation of permafrost by monitoring its deformation. This study selected three typical permafrost regions in North America: Alaskan North Slope, Northern Great Bear Lake, and Southern Angikuni Lake. These regions encompass a range of permafrost landscapes, from tundra to needleleaf forests and lichen-moss, and we used Sentinel-1 SAR data from 2018 to 2021 to determine their deformation. In the InSAR process, due to the prolonged snow cover in the circum-Arctic permafrost, we used only SAR data collected during the summer and applied a two-stage interferogram selection strategy to mitigate the resulting temporal decorrelation. The Alaskan North Slope showed pronounced subsidence along the coastal alluvial plains and uplift in areas with drained thermokarst lake basins. Northern Great Bear Lake, which was impacted by wildfires, exhibited accelerated subsidence rates, revealing the profound and lasting impact of wildfires on permafrost degradation. Southern Angikuni Lake’s lichen and moss terrains displayed mild subsidence. Our InSAR results indicate that more than one-third of the permafrost in the North American study area is degrading and that permafrost in diverse landscapes has different deformation patterns. When monitoring the degradation of large-scale permafrost, it is crucial to consider the unique characteristics of each landscape.
... Fuel data can range in complexity from a simple classification of dominant land cover (e.g. Forestry Canada Fire Danger Group 1992; Nadeau et al. 2005;Latifovic et al. 2017) to detailed inventories of fuel characteristics within a stand (e.g. or from advanced remote sensing techniques like LiDAR (light detection and ranging) to map the vertical structure of a forest (Chuvieco et al. 2020;Abdollahi and Yebra 2023). Fuel data are frequently integrated with weather information to predict rates of spread and intensity (e.g. ...
... For example, the 2020 product was not available to publicly download until 2022, and the full continental extent was not available until 2023. The methods used to produce these data are extensively documented in Latifovic et al. (2017), and Latifovic and Pouliot (2005). We obtained the most recent available version (2020), which is freely available from the Government of Canada's open data portal. ...
Article
Full-text available
Background Land cover information is routinely used to represent fuel conditions in wildfire hazard, risk and exposure assessments. Readily available land cover data options that vary in resolution, extent, cost and purpose of collection have become increasingly accessible in recent years. Aim This study investigates the sensitivity of community-scale wildfire exposure assessments to different land cover information products used to identify hazardous fuel. Methods Ten versions of a community wildfire exposure assessment were conducted for each of five case study locations in Alberta, Canada, by varying the input land cover data. Proportional and spatial distribution of hazardous fuels and classified exposure are compared across datasets and communities. Key results We found proportional and spatial variation of exposure values between datasets within each community, but the nature of this variation differed between communities. Land cover classification definitions and scale were important factors that led to inconsistencies in assessment results. Conclusions Readily available land cover information products may not be suitable for exposure assessments at a localised scale without consideration of unique context and local knowledge of the assessment area. Implications Results may inform fuel data selection considerations for improved results in various wildfire applications at localised scales.
... This land cover corresponds to the year 2010. This land cover product combines information from the North American Land Change Monitoring System land cover (Latifovic et al., 2017), the National Terrestrial Ecosystem Monitoring System (NTEMS) (Hermosilla et al., 2018(Hermosilla et al., , 2016, satellite-derived maps of the National Forest Inventory attributes (Beaudoin et al., 2018), and British Columbia's biogeoclimatic ecosystem classification map (MacKenzie and Meidinger, 2018;Salkfield et al., 2016). Using a land cover that does not vary in time (i.e., static land cover as opposed to dynamic or prescribed land cover changes) allows us to focus on the influences of fire, harvest, and dynamic tiling on the model outputs. ...
Article
Full-text available
Canada's forests play a critical role in the global carbon (C) cycle and are responding to unprecedented climate change as well as ongoing natural and anthropogenic disturbances. However, the representation of disturbance in boreal regions is limited in pre-existing land surface models (LSMs). Moreover, many LSMs do not explicitly represent subgrid-scale heterogeneity resulting from disturbance. To address these limitations, we implement harvest and wildfire forcings in the Canadian Land Surface Scheme Including Biogeochemical Cycles (CLASSIC) land surface model alongside dynamic tiling that represents subgrid-scale heterogeneity due to disturbance. The disturbances are captured using 30 m spatial resolution satellite data (Landsat) on an annual basis for 33 years. Using the pan-Canadian domain (i.e., all of Canada south of 76° N) as our study area for demonstration, we determine the model setup that optimally balances a detailed process representation and computational efficiency. We then demonstrate the impacts of subgrid-scale heterogeneity relative to standard average individual-based representations of disturbance and explore the resultant differences between the simulations. Our results indicate that the modeling approach implemented can balance model complexity and computational cost to represent the impacts of subgrid-scale heterogeneity resulting from disturbance. Subgrid-scale heterogeneity is shown to have impacts 1.5 to 4 times the impact of disturbance alone on gross primary productivity, autotrophic respiration, and surface energy balance processes in our simulations. These impacts are a result of subgrid-scale heterogeneity slowing vegetation re-growth and affecting surface energy balance in recently disturbed, sparsely vegetated, and often snow-covered fractions of the land surface. Representing subgrid-scale heterogeneity is key to more accurately representing timber harvest, which preferentially impacts larger trees on higher quality and more accessible sites. Our results show how different discretization schemes can impact model biases resulting from the representation of disturbance. These insights, along with our implementation of dynamic tiling, may apply to other tile-based LSMs. Ultimately, our results enhance our understanding of, and ability to represent, disturbance within Canada, facilitating a comprehensive process-based assessment of Canada's terrestrial C cycle.
... The province of Alberta in western Canada contains many highly fire-prone landscapes ( Fig. 1) including extensive boreal forests where large, high-intensity crown fires are commonplace and function as a natural agent of forest health and renewal (Tymstra et al. 2005). Most forested land in Alberta (86%, Latifovic et al. 2017Latifovic et al. , circa 2020 is contained within the Forest Protection Area (FPA) where the provincial government manages fires. Despite the evolution of increasingly intensive fire management policies and practices aimed at limiting fire within the FPA over roughly the past century, wildfires have remained an active and dominant force that has shaped the province's landscape with over 76 800 fires burning more than 18.1 million hectares between 1923 and 2023 (Fig. 2;Murphy 1985;Alberta Forestry and Parks 2024), which is equivalent to 82.5% of the current forested area within the FPA. ...
Article
Full-text available
Wildfires burned an estimated 2.2 million hectares in Alberta in 2023. We describe key attributes of the fires relative to historical fires and fire seasons and offer a perspective on potentially influential factors. Thirty-six large fires ≥10 000 ha generated 95% of annual area burned. Individually, these fires exhibited sizes, fire weather, and behaviour consistent with historical fires; there were simply far more of them in 2023. Thirteen fires reported in early May were ignited by lightning and reached final sizes ≥10 000 ha, revealing a previously unrecognized threat. Historically, large lightning-ignited fires reported before mid-May occur just once per decade on average. Collectively, 18 large fires reported in early May coincided with drier conditions compared with 18 large fires reported after mid-May. Early May fire weather was also warmer and drier than historical weather. The early May fire group was a temporally concentrated outbreak in west-central Alberta and coincided with extreme potential rate of fire spread. Large fires reported after mid-May were intermittent through to September, concentrated in northern regions and coincided with extreme potential for fuel consumption. Individually, these two spatiotemporal modes of fire season severity (outbreak, intermittent) produced annual burned areas on par with historical extremes. Together, the 2023 multi-modal pattern of fire season severity amplified area burned far above anything previously recorded. Potential contributing factors include climate warming, hemispheric teleconnections, phenology and exhaustion of suppression resources. Implications for future fire seasons, research and management are discussed.
... These data included a set of 19 bioclimatic variables (Supplemental Table S1) as well as solar radiation, wind speed, water vapor pressure, and elevation at a spatial resolution of 30 arc-seconds (ca. 1 km at the equator). Land cover data were obtained from the North American land cover map at a spatial resolution of 30 m (Latifovic et al., 2017). These data were developed by the North American Land Change Monitoring System and provide a harmonized view of the physical cover of Earth's surface across North America based on 2015 Landsat satellite imagery for Canada and the United States, and Rap-idEye imagery for Mexico. ...