ArticlePDF Available

Abstract and Figures

Airborne laser scanner (ALS) data are used to map a range of forest inventory attributes at operational scales. However, when wall-to-wall ALS coverage is cost prohibitive or logistically challenging, alternative approaches are needed for forest mapping. We evaluated an indirect approach for extending ALS-based maps of forest attributes using medium resolution satellite and environmental data. First, we developed ALS-based models and predicted a suite of forest attributes for a 950 km2 study area covered by wall-to-wall ALS data. Then, we used samples extracted from the ALS-based predictions to model and map these attributes with satellite and environmental data for an extended 5600 km2 area with similar forest and ecological conditions. All attributes were predicted well with the ALS data (R2 ≥ 0.83; RMSD% < 26). The satellite and environmental models developed using the ALS-based predictions resulted in increased correspondence between observed and predicted values by 13–49% and decreased prediction errors by 8–28% compared with models developed directly with the ground plots. Improvements were observed for both multiple regression and random forest models, and for the suite of forest attributes assessed. We concluded that the use of ALS-based predictions in this study improved the estimation of forest attributes beyond an approach linking ground plots directly to the satellite and environmental data.
Content may be subject to copyright.
remote sensing
Article
Extending ALS-Based Mapping of Forest Attributes
with Medium Resolution Satellite and
Environmental Data
Joan E. Luther 1, * , Richard A. Fournier 2, Olivier R. van Lier 3and Mélodie Bujold 2
1Natural Resources Canada, Canadian Forest Service–Atlantic Forestry Centre,
Corner Brook, NL A2H 5G4, Canada
2Department of Applied Geomatics, Centre d’Applications et de Recherches en Télédétection (CARTEL),
Universitéde Sherbrooke, Sherbrooke, QC J1K 2R1, Canada; Richard.Fournier@USherbrooke.ca (R.A.F.);
Melodie.Bujold@USherbrooke.ca (M.B.)
3Natural Resources Canada, Canadian Forest Service–Canadian Wood Fibre Centre,
Corner Brook, NL A2H 5G4, Canada; Olivier.vanLier@Canada.ca
*Correspondence: JoanE.Luther@Canada.ca; Tel.: +1-709-637-4917
Received: 29 March 2019; Accepted: 2 May 2019; Published: 8 May 2019


Abstract:
Airborne laser scanner (ALS) data are used to map a range of forest inventory attributes
at operational scales. However, when wall-to-wall ALS coverage is cost prohibitive or logistically
challenging, alternative approaches are needed for forest mapping. We evaluated an indirect
approach for extending ALS-based maps of forest attributes using medium resolution satellite and
environmental data. First, we developed ALS-based models and predicted a suite of forest attributes
for a 950 km
2
study area covered by wall-to-wall ALS data. Then, we used samples extracted from
the ALS-based predictions to model and map these attributes with satellite and environmental data
for an extended 5600 km
2
area with similar forest and ecological conditions. All attributes were
predicted well with the ALS data (R
2
0.83; RMSD% <26). The satellite and environmental models
developed using the ALS-based predictions resulted in increased correspondence between observed
and predicted values by 13–49% and decreased prediction errors by 8–28% compared with models
developed directly with the ground plots. Improvements were observed for both multiple regression
and random forest models, and for the suite of forest attributes assessed. We concluded that the
use of ALS-based predictions in this study improved the estimation of forest attributes beyond an
approach linking ground plots directly to the satellite and environmental data.
Keywords:
boreal forest; forest attributes; imagery; inventory; LiDAR; modeling; random forest;
regression; Sentinel-2; PALSAR
1. Introduction
The increasing availability and decreasing cost of commercial airborne laser scanning (ALS)
systems have resulted in widespread application of ALS data for enhancing forest inventories [
1
4
].
Forest attributes commonly predicted and mapped using ALS data include average tree height,
density, mean diameter at breast height (DBH), gross total volume, gross merchantable volume,
and biomass [
5
7
]. Most operational applications employ an area-based approach [
8
12
], whereby
regression and other statistical techniques are used to relate metrics derived from the ALS data to
attributes measured at ground sample plot locations [
13
15
]. The statistical models are then used to
predict the attributes of interest at unsampled locations. The success of ALS data for predicting forest
attributes has resulted in ALS data becoming a key data source for enhancing forest management
inventories [1618].
Remote Sens. 2019,11, 1092; doi:10.3390/rs11091092 www.mdpi.com/journal/remotesensing
Remote Sens. 2019,11, 1092 2 of 22
However, ALS data acquisition is not always possible for an entire area of interest due to limited
resources or diculties associated with covering remote or large areas. In such cases, alternative
remote sensing data have been used to generate attribute predictions. For example, forest attributes
have been linked directly to relatively inexpensive satellite data [
19
21
] using a variety of modeling
methods (reviewed by Brofoske et al. [
14
]). Among the most common methods are regression [
22
24
],
k-nearest neighbors (kNN) [
25
27
], and random forests [
28
]. Multispectral imagery such as Landsat
Thematic Mapper (TM) [
29
,
30
] or Sentinel-2 (S2) [
31
] enhance the eciency and cost eectiveness in
mapping forest attributes over large forest landscapes. However, the use of multispectral imagery
alone does not normally reach the level of accuracy possible with wall-to-wall ALS data [32].
Several studies have demonstrated the combined use of ALS data and satellite imagery to map
forest attributes. Not surprisingly, models combining ALS with satellite variables generally provide
better estimates than models using satellite imagery alone [
32
,
33
]. However, models using both ALS
and satellite metrics can only be applied to areas that are sampled by ALS. One of the earliest studies
to explore integration for the purpose of mapping locations unsampled by ALS was conducted by
Hudak et al. [
19
]. These authors tested five aspatial and spatial methods for extending canopy heights
and concluded that integration of ALS and Landsat data improved the utility of both datasets. Other
studies have used ALS-derived estimates as a substitute for field plots and extended the ALS estimates
with satellite imagery. For example, McInerney et al. [
34
] extended ALS-derived canopy heights
using moderate resolution imagery from the Indian Remote Sensing satellite. Also focused on canopy
height, Pascual et al. [
35
] developed multiple regression models between ALS-based height predictions
and Landsat Enhanced Thematic Mapper (ETM+) band transformations and spectral indices for the
purpose of extrapolating ALS height measurements across a broad landscape. Maselli et al. [
36
]
evaluated both regression and kNN for extending stem volumes derived from mean stand heights
obtained from ALS using Landsat ETM+images. Subsequently, the concepts of “lidar plots” [
37
,
38
]
and “lidar sampling” (reviewed by Wulder et al. [
39
]) were proposed to mitigate the need for extensive
ground plot data for large-area forest characterization and mapping. In that context, Matasci et al. [
40
]
used ALS plots and multitemporal Landsat composites to model cover, height, biomass, and other
structural attributes for the entire boreal forest area of Canada. Although the spatial extent of that
study is remarkable, the authors acknowledge that accuracy metrics reported in that study are to be
interpreted with caution, as the ALS-predicted attribute values with variable accuracies were used as
response variables.
The conceptual basis for a multilevel approach combining ground plots, ALS data, and satellite
data for forest inventory is described in detail by Andersen et al. [
41
] and further demonstrated by
others [
42
50
]. In a multilevel approach, relationships are first established between forest attribute
values of ground plots and ALS data covering a portion of the area of interest. Then, relationships are
established between samples extracted from the ALS data and spatial layers covering the entire area of
interest. The multilevel approach thus allows estimation or map extension beyond the area covered by
ALS using the spatially comprehensive layers. In Andersen et al. [
41
], implementation of the multilevel
approach combining ground plots, ALS strip sampling, and satellite imagery provided total biomass
estimates for the boreal forests of interior Alaska. Strunk et al. [
42
] demonstrated that the multilevel
approach—referred to as an indirect approach—resulted in reduced estimates of residual variability
for biomass by up to 36% relative to using Landsat imagery alone.
Saarela et al. [43]
reported relative
standard errors in the range of 1–4% for model-assisted estimation of growing stock volume using
multilevel sampling. Others have combined field, airborne, and spaceborne LiDAR (i.e., Geoscience
Laser Altimeter System (GLAS) data) to produce regional maps of stand attributes [
44
] and to estimate
aboveground biomass and carbon of boreal forests using multilevel sampling strategies [
45
50
].
Although these studies have demonstrated improved mapping and estimation capabilities using
indirect or multilevel approaches, further evaluation is warranted with dierent data sets, at dierent
scales, and for dierent forest conditions.
Remote Sens. 2019,11, 1092 3 of 22
In this study, we hypothesized that the indirect approach including ALS samples would capture
an extensive range of forest structure and environmental conditions, and thus improve prediction over
a direct modeling approach that links a more limited set of ground plots to spatially comprehensive
satellite and environmental data. We tested this hypothesis using parametric and nonparametric
modeling methods commonly used for predicting forest attributes with ALS or satellite data, including
multiple regression (e.g., [
1
,
20
,
42
]) and random forest (e.g., [
51
53
]). We developed ALS-based
prediction models for a suite of forest attributes using ground plots and we predicted these attributes
for a study area covered by wall-to-wall ALS data. Then, we used samples extracted from the
ALS-based predictions to model forest attributes with multispectral, radar, and topographic data.
We evaluated predictions from models developed using the ALS samples with those from models
developed with only the ground plots using independent validation data. Finally, we produced maps
of forest attributes for an extended area of boreal forest of western Newfoundland, Canada.
2. Materials and Methods
2.1. Geographic Area
Our geographic area of interest was Forest Management District 15, which covers an area of
~5600 km
2
centered around 49.04
N and 57.93
W, and corresponding closely with the Corner Brook
Subregion of the Western Newfoundland Ecoregion (Figure 1). This ecoregion is located within the most
eastern Boreal Forest Region of North America [
54
] and contains some of the most favorable sites for
forest growth on the Island of Newfoundland [
55
]. Balsam fir (Abies balsamea (L.) Mill.)) is the dominant
tree species of the region, and black spruce (Picea mariana (Mill.) Britton, Sterns & Poggenb.), white birch
(Betula papyrifera Marsh.), yellow birch (Betula alleghaniensis Britton), and white spruce (Picea glauca
(Moench) Voss) are also present. This region includes high productivity mountainous slopes underlain
by orthic or gleyed podzols where seepage waters ensure good forest growth and profuse regeneration
after cutting. In contrast, the region also includes the lower productivity coastal plain, dominated by
coarse-textured deposits such as glacio-fluvial deposits, eskers, drumlins, and kames.
2.2. Ground Plots
We used two independent data sets of ground plots for calibrating and validating models of
forest attributes. Both datasets were measured following ground sampling guidelines established for
Canada’s National Forest Inventory [
56
]. The sample design aimed to capture the range of variability
in the forest structure of the study area. For calibration, we selected potential sample locations using
ALS data (described in Section 2.3) as a basis for stratification—following others who found improved
accuracies using ALS-assisted plot selection for area-based inventory [
57
59
]. Instead of using specific
ALS metrics (e.g., mean and standard deviation [
58
]), we performed a principal component analysis
(PCA) [
60
] and represented forest structure by the first two components, which accounted for ~83%
of the variance in ALS metrics. We used photo-interpreted forest inventory stand polygons to create
stand-level dominant species masks and divided the range of values for each PCA component into
10 equal strata under each mask. We then selected a random sample location for ground-based
assessment from each combination of species mask and PCA strata that existed within our study area.
For validation, we selected potential sample locations using a stratified random sampling design
using total volume predictions for conifer-dominated stands (result from Section 2.6) as the basis
for stratification. We stratified the range of values of volume into 20 strata and randomly selected a
minimum of two sample locations per strata.
We established circular plots with 11.28 m radius and recorded species, DBH (measured at 1.3 m),
height, and status (live or dead) for all merchantable trees (trees
9 cm DBH). We recorded these same
attributes for all trees with a minimum height of 1.3 m for a subplot of radius 3.99 m located in the
center of each plot. We recorded the center location of each plot with a Trimble GeoExplorer, 6000
series XH global positioning system (GPS) with floodlight technology and used base station data to
Remote Sens. 2019,11, 1092 4 of 22
dierentially correct the roving receiver data to within submeter accuracy. In total, we measured 89
calibration plots in 2016 and 2017, and 43 validations plots in 2018.
Remote Sens. 2019, 11, x FOR PEER REVIEW 4 of 24
data to differentially correct the roving receiver data to within submeter accuracy. In total, we
measured 89 calibration plots in 2016 and 2017, and 43 validations plots in 2018.
Figure 1. Spatial extent of the airborne laser scanner (ALS) study area with the extended area of
Forest Management District 15 and the Corner Brook Subregion of the Western Newfoundland
Ecoregion of Newfoundland, Canada.
From the standard measurements of species, height, and DBH, we derived a suite of structural
attributes for all plots. We calculated basal area of each tree from DBH. We estimated total and gross
merchantable volumes as a function of DBH and height according to region- and species-specific
equations from Warren and Meades [61,62]. We estimated above ground biomass (organic dry mass
per unit area) with equations and coefficients published by Lambert et al. [63]. We used the
individual attributes of live trees to estimate plot-level attributes including basal area weighted
height (i.e., Lorey’s mean height) (HGT), basal area (BA), gross merchantable volume (GMV), total
volume (TVOL), and aboveground biomass (B). We calculated attributes based on live trees (i.e.,
growing stock) for consistency with volume tables used by provincial and industrial agencies and
Figure 1.
Spatial extent of the airborne laser scanner (ALS) study area with the extended area of Forest
Management District 15 and the Corner Brook Subregion of the Western Newfoundland Ecoregion of
Newfoundland, Canada.
From the standard measurements of species, height, and DBH, we derived a suite of structural
attributes for all plots. We calculated basal area of each tree from DBH. We estimated total and gross
merchantable volumes as a function of DBH and height according to region- and species-specific
equations from Warren and Meades [
61
,
62
]. We estimated above ground biomass (organic dry
mass per unit area) with equations and coecients published by Lambert et al. [
63
]. We used the
individual attributes of live trees to estimate plot-level attributes including basal area weighted height
(i.e., Lorey’s mean height) (HGT), basal area (BA), gross merchantable volume (GMV), total volume
Remote Sens. 2019,11, 1092 5 of 22
(TVOL), and aboveground biomass (B). We calculated attributes based on live trees (i.e., growing
stock) for consistency with volume tables used by provincial and industrial agencies and because
live-tree attributes are of interest for most operational and planning applications in the region. For the
calculation of BA and GMV, we used only merchantable trees. With the exception of HGT, we converted
values to per-hectare estimates. We also calculated species-specific basal areas as a percentage of the
total BA of each plot and used these percentages to determine coniferous, broadleaf and mixed wood
forest types.
We stratified the ground plots according to dominant forest type as is common practice for
area-based modeling [
60
]. We labeled plots with
75% coniferous or broadleaf species as coniferous or
broadleaf, respectively, and we labeled all other plots as mixed forest. For the current study, we focused
on the coniferous forest, which represents ~90% of the productive forest of our study area and is the
forest type of commercial importance to the region. Of the 89 calibration plots, 58 plots representing
coniferous forest were used for calibration. Of the 43 validation plots, we removed four plots with
GMV or TVOL that exceeded the range of the calibration data and were, therefore, not representative
of the conditions modeled. This left 39 coniferous plots for validation and evaluation of our hypothesis
(Table 1).
Table 1. Descriptive statistics of forest attributes for ground plots.
Variable 1Units Calibration (n=58) Validation (n=39)
Min Max Mean Std. Dev. Min Max Mean Std. Dev.
HGT m 3.4 17.1 10.1 3.6 3.7 17.0 10.5 3.6
BA
m
2
ha
10.4 63.2 30.1 18.1 0.9 62.7 33.8 16.5
GMV
m
3
ha
10.8 401.5 156.9 118.7 1.7 376.8 175.2 109.5
TVOL
m
3
ha
13.6 439.9 190.1 128.6 25.1 401.0 211.7 109.5
Bt ha16.7 260.2 121.9 67.1 33.7 214.5 132.2 52.3
1
HGT is Lorey’s height; BA is basal area; GMV is gross merchantable volume; TVOL is total volume; B is
aboveground biomass.
2.3. ALS Data
Wall-to-wall ALS data were acquired between 15 August and 24 September 2016 representing
an area of ~950 km
2
centered at 48.77
N and 58.19
W. The ALS data were acquired with a Riegl
LMS-Q680i. The approximate flight altitude was 1000 m above ground level, with an approximate
speed of 100 knots. The scan angle was
±
30
with 50% minimum overlap between flight lines.
The aggregate pulse density was specified to be a minimum of six pulses m
2
. We calculated the
resulting average point density to be 7.3 points m
2
with a standard deviation of 2.4 points m
2
. Points
were classified according to standard LAS specification classes [64] by the service provider.
We processed the ALS data to generate a digital terrain model (DTM) and a canopy height model
(CHM) with spatial resolutions of 1 m
×
1 m using the LTK
TM
extension for ArcGIS [
65
]. For the DTM,
we generated a triangular irregular network (TIN) using returns classified as ground and interpolated
a raster surface from the TIN using natural neighbor interpolation. We used first returns classified as
vegetation to generate the CHM, also using natural neighbor interpolation. We set the binning cell
assignment to the maximum value, and replaced negative values with zeros.
We calculated metrics commonly used in forestry studies from normalized LiDAR point cloud
data. The metrics consisted of height, structural, density, and cover statistics of each plot computed
using LTK
TM
[
65
]. We calculated all height, structural, and density metrics using vegetated returns
2 m representing treed vegetation [
57
] and
30 m to avoid erroneous points that exceeded that
maximum tree height of the dominant trees species in the region [
66
]. Height metrics represented the
basic statistics of mean (MEAN) and max (MAX) height and the heights of various percentiles of first
returns (i.e., P10,
. . .
, P90, P95) [
6
]. Structural metrics were statistical measures of skewness (SKEW),
coecient of variation (COVAR), vertical distribution ratio (VDR) [
67
] and vertical complexity index
Remote Sens. 2019,11, 1092 6 of 22
(VCI) [
68
]. To compute density metrics, we divided the range of LiDAR heights into 10 equal intervals
and calculated the cumulative proportion of LiDAR returns found in the first nine intervals (i.e., D1,
. . .
, D9) [
6
]. We calculated cover metrics at 2 m height intervals (i.e., CC2,
. . .
, CC14) from the CHM as
the number of 1 m
×
1 m cells with a height value >2 m divided by the number of nonvoid 1 m
×
1 m
cells [
53
]. Finally, we selected a reduced set of metrics for this study avoiding very highly correlated
predictor variables within each group (Pearson correlation coecient, r>0.95) (Table 2).
Table 2.
Description of airborne laser scanner (ALS), satellite and environment data used as
predictor variables.
Name Units Description
ALS Variables
Height Metrics 1
MAX m Maximum height of first returns
P95 m Height of the 95th percentile of first returns
MEAN m Mean height of first returns
Structural Metrics
1
SKEW Skewness of first returns
COVAR
Standard deviation of first returns/mean of first returns
VDR Vertical distribution ratio [67]
VCI Vertical complexity index [68]
Density Metrics 1
D2 %
Percentage of all returns found in bins 1 through 2 of 10 where bin
2 represents the 20th percentile height of all returns
D5 %
Percentage of all returns found in bins 1 through 5 of 10 where bin
5 represents the 50th percentile height of all returns
D8 %
Percentage of all returns found in bins 1 through 8 of 10 where bin
8 represents the 80th percentile height of all returns
CHM Metrics
CC2 %
Number of 1 m
×
1 m canopy height model cells that have a height
value >2 m divided by the number of nonvoid 1 m ×1 m cells
CC6 %
Number of 1 m
×
1 m canopy height model cells that have a height
value >6m divided by the number of nonvoid 1 m ×1 m cells
CC14 %
Number of 1 m
×
1 m canopy height model cells that have a height
value >14m divided by the number of nonvoid 1 m ×1 m cells
Satellite Variables
Sentinel 2 2
S2_B2 % Blue band; original resolution 10 m
S2_B4 % Red band; original resolution 10 m
S2_B5 % Vegetation red edge; resolution 20 m
S2_B8 % NIR; original resolution 10 m
S2_B11 % SWIR; resolution 20 m
PALSAR HH DN Radar backscatter — HH polarization; original resolution 25 m
HV DN Radar backscatter — HV polarization; original resolution 25 m
Environmental Variables
Topographic and
Solar Radiation
Elevation
m Elevation above mean sea level from 0.75 arc second CDEM [69]
CosAspect
1 to 1 cos(Aspect) transformation representing northness
SCOSA Slope ×cos(Aspect) transformation [70]
SINA Slope ×sin(Aspect) transformation [70]
1
Metrics computed from returns classified as vegetation with heights
2 m and
30 m.
2
Pixel radiometric
measurements are Level1C Top of Atmosphere (TOA) reflectances.
2.4. Spatially Comprehensive Data
We used spectral, structural, and topographic data that were readily available and spatially
comprehensive for our area of interest. For the spectral data, we mosaicked four Level-1C orthorectified
Sentinel-2 (S2) images [
71
] acquired on 27 June 2017 under virtually clear skies. The S2 images consisted
of 13 spectral bands; however, bands 1, 9, and 10 were not used in this study due to their coarse
resolution (60 m
×
60 m), and bands 3, 7, 8A, and 12 were omitted to minimize redundancy with
other bands (Pearson correlation coecient, r>0.95). We resampled bands with an original spatial
resolution of 10 m
×
10 m to 20 m
×
20 m prior to mosaicking. Similarly, we resampled Advanced Land
Remote Sens. 2019,11, 1092 7 of 22
Observing Satellite (ALOS) Phased Arrayed L-band Synthetic Aperture Radar (PALSAR) HH and HV
polarization mosaic data available at 25 m
×
25 m spatial resolution [
72
] and Canadian Digital Elevation
Model (CDEM) raster data [
69
] with a base resolution of 0.75 arc seconds (~23 m) in the north–south
direction to 20 m
×
20 m. We generated solar radiation transformations including CosAspect, SCOSA,
and SINA [70] by combining slope and aspect information derived from the CDEM.
2.5. Overview of the Approach
Our implementation of the indirect mapping approach consisted of two phases with an
independent evaluation. In the first phase, we built predictive models between the calibration plot
attributes and ALS plot metrics. For the second phase, we built predictive models between ALS-based
predictions and variables derived from the spatially comprehensive satellite and environmental data.
For comparative purposes, we also built predictive models directly between the calibration plot
attributes and the satellite and environmental variables. In all cases, we developed both random forest
and multiple linear regression models and we evaluated model goodness-of-fit by regressing observed
vs. predicted values for independently measured validation plots [
73
,
74
]. We produced maps by
applying the predictive models to 20 m ×20 m raster grids.
2.6. Development of ALS-Based Inventory (Phase 1)
In phase 1, we built random forest models [
75
]inR[
76
] with ModelMap [
77
]. ModelMap
automates the process of model building and map construction by providing an interface with several
R packages, including randomForest [
78
]. Each random forest model consisted of 500 trees by default
whereby each tree was developed with a random subset of the reference data and a random selection
of predictors at each node of the tree to determine the split. We assessed variable importance during
initial model development and retained only those predictors with a %IncMSE greater than 3% for
the final models. We used tuneRF to determine the optimal number of predictor variables to retain at
each node (mtry) for each model. The tuneRF algorithm starts with a default value of mtry (i.e., the
number of predictors divided by three) and searches for the optimal value according to out-of-bag
error estimates. The optimal values of mtry varied depending on the attribute modeled. During model
development, we evaluated the random forest models according to out-of-bag errors.
We developed regression models using a subset regression technique available in the R package
Leaps [
79
]. Leaps exhaustively searches all pathways to choose the best variable subsets for a
given number of predictors. For each response variable, we used Leaps to identify the 10 best
models for each combination of predictors according to the method of Mallows’ Cp [
80
], where
the Cp statistic is an indicator of balance between models that are too simple and may suer from
biased coecients and biased prediction, or too complicated, resulting in large variances in both the
coecients and the prediction [
81
]. We used statistical tests to determine whether each model met the
assumptions of linear regression, including the Shapiro–Wilk Test for normality of residuals [
82
] and
the Breusch–Pagan Test for homogeneity of variance [
83
]. Variance inflation factors (VIF) identified
problems with collinearity [
81
]. We removed models from the candidate set where model tests indicated
heteroscedasticity, non-normality of residuals (p<0.05) or collinearity (VIF
7). Of the remaining
models, we selected the model with the lowest Akaike’s information criterion (AIC) [
84
] score. During
model development, we assessed the regression models using leave-one-out cross validation.
We generated raster grids for each predictor variable at 20 m
×
20 m spatial resolution for the ALS
site, and we used the models to predict the response variable for each raster cell. We implemented
the random forest models using the R package ModelMap [
77
] and the regression models using the R
package raster [85].
2.7. Development of Extended Inventory (Phase 2)
The ALS-based raster products from phase 1 provided more than two million potential samples
(i.e., raster cells) for calibrating the satellite and environmental models. We converted the raster cells
Remote Sens. 2019,11, 1092 8 of 22
to a points shape file using ESRI ArcGIS [
86
] and we input the points shape file in R [
76
], where
we filtered the potential samples to provide calibration data that were representative of the relevant
forest conditions as follows. First, we assumed—due to our sampling design—that our ground plots
represented the range of forest conditions of interest to this study. Therefore, we removed potential
samples that had ALS metric values outside the range of those observed at the ground plots (e.g.,
MAX <3.44 m & >21.54 m). We also removed potential samples located in areas of cloud or cloud
shadow and areas harvested on the satellite imagery since the time of ALS acquisition. To do this, we
established thresholds for band 2 (>0.7 & <0.09), 3 (>0.6 & <0.11) and 4 (>0.03 & <0.08) by visually
assessing the imagery. Finally, we removed samples from areas with
75% coniferous forest according
to a recent forest management inventory of the area as models only represented the coniferous forest.
After filtering, 828,853 potential samples remained. We subsampled the remaining samples by dividing
the range of the predicted response variable into 10 equal strata and randomly selected 500 samples
within each strata for a total of 5000 samples for each attribute of interest. We used the R package
BalancedSampling [
87
] and the Spatially Balanced Local Pivotal Method [
88
,
89
] to select samples
that were spatially, spectrally, structurally, and topographically representative of the forest conditions
within each strata. We balanced the samples spatially on longitude and latitude, spectrally on S2 band 4
reflectance values, structurally on the PALSAR HV polarization radar backscatter, and topographically
on elevation. Using the resulting samples as calibration data sets, we developed random forest and
regression models using the same methods as applied in phase 1. We assessed out-of-bag errors
(random forest) and leave-one-out cross validation results (regression) to evaluate the quality of the
phase 2 models for capturing the relationship between the ALS predictions of forest attributes and the
satellite and environmental predictor variables.
2.8. Independent Plot Evaluation
We evaluated the indirect modeling approach relative to the direct modeling approach using
the independent validation plots and common measures used to evaluate area-based model
performance [
60
,
74
]. Evaluation statistics included: correspondences between observed and predicted
values as indicated by the coecient of determination (R
2
), the root mean square deviation (RMSD
Equation (1)), relative root mean square deviation expressed as a percentage of the mean (RMSD%,
Equation (2)), absolute measure of model bias, (Bias, Equation (3)), and relative bias expressed as a
percentage of the mean (Bias%, Equation (4)):
RMSD =sPn
i=1(yiˆ
yi)2
n1(1)
RMSD% =RMSD
y×100 (2)
Bias =Pn
i=1(yiˆ
yi)
n(3)
Bias% =Bias
y×100 (4)
where y
i
is the observed value,
ˆ
y
i
is the predicted value for case i, and nis the number of observations.
The absolute RMSD and Bias provide error estimates in the units of the respective forest attributes,
whereas the relative RMSD and Bias allow for comparisons between dierent forest attributes.
We repeated the direct and indirect approaches with five random seeds whereby the seeds
influenced the selection of samples and predictors for the random forest models and the selection
of spatially balanced samples in phase 2 for both the random forest and regression models. We
implemented the Vuong test [
90
] with the R package nonnest2 [
91
] to determine significant dierences
Remote Sens. 2019,11, 1092 9 of 22
between the results of the direct and indirect approaches and between the random forest and
regression methods.
3. Results
3.1. ALS Models of Forest Attributes (Phase 1)
The phase 1 models developed with ALS predictors resulted in high correspondence between
predicted and observed values for both the calibration (
R2>0.90
) and validation data sets (
R2>0.83
)
(Table 3). Prediction errors were low for the calibration set (RMSD% <20; Bias% <0.9) but slightly
higher for the validation set (RMSD% <26; Bias% <13), as we would expect. Observed versus
predicted scatter plots showed most plots near the 1:1 line (e.g., TVOL, Figure 2a; other attributes not
shown). Overall, the evaluation statistics indicated strong prediction models for all attributes reported.
Moreover, dierences in the error statistics for the random forest and regression models were minimal
(R
2
<0.03; RMSD% <3; Bias% <3). Therefore, we deemed the ALS-based predictions from both the
random forest and regression models as suitable for use as calibration data for phase 2.
Table 3. Evaluation statistics for ALS-based prediction of forest attributes 1.
Random Forest Regression
R2RMSD RMSD% Bias Bias% R2RMSD RMSD% Bias Bias%
Calibration 2(n=58)
HGT 0.93 0.95 9.37 0.09 0.84 0.95 0.84 8.27 0.00 0.00
BA 0.90 5.67 18.86 0.08 0.28 0.92 5.22 17.37 0.00 0.00
GMV 0.94 29.11 18.56 1.35 0.86 0.94 30.31 19.33 0.00 0.00
TVOL 0.94 31.29 16.46 0.04 0.02 0.96 27.72 14.58 0.00 0.00
B 0.90 21.49 17.63 0.43 0.35 0.93 18.74 15.37 0.00 0.00
Validation (n=39)
HGT 0.94 1.14 10.86 0.53 5.09 0.95 1.03 9.79 0.54 5.17
BA 0.83 7.56 22.40 3.03 8.97 0.86 6.71 19.87 2.44 7.23
GMV 0.88 44.76 25.55 22.19 12.67 0.90 40.07 22.87
18.40
10.50
TVOL 0.91 37.84 17.87 17.03 8.04 0.91 37.21 17.57
13.32
6.29
B 0.85 22.14 16.75 5.86 4.43 0.83 23.71 17.93 4.28 3.24
1
Random forest values represent average results of five random seeds.
2
Calibration results are calculated from
out-of-bag samples for random forest and leave-one-out cross validation for regression.
3.2. Satellite and Environmental Models (Phase 2)
The phase 2 models with satellite and environmental predictors had moderate correspondence
between predicted and observed values for the calibration (R
2
=0.58–0.74) and validation (R
2
=0.56–0.84)
data sets (Table 4). However, in this case, the random forest models consistently outperformed the
regression models, with R
2
values increasing by 4–9% based on the calibration data and by 5–12%
based on the independent validation data. Similarly, RMSD% values decreased for all attributes by
1–7%. These results suggest that the random forest models may better capture the complex relationship
between the forest attributes and the satellite and environment variables, compared with the more
simple regression models. Bias was low in all cases (Bias% <8%). Surprisingly, the error statistics
of the independent validation results were not markedly poorer than those of the calibration data
sets assessed via out-of-bag samples for random forest and using leave-one-out cross validation for
regression. These results further suggest that the ALS predictions provided suitable calibration data
for developing the satellite and environmental models.
Remote Sens. 2019,11, 1092 10 of 22
Remote Sens. 2019, 11, x FOR PEER REVIEW 10 of 24
Random Forest Regression
a) ALS
b) DIRECT
c) INDIRECT
Figure 2. Observed versus predicted scatter plots of total volume (m3 ha1) modeled with (a) ALS
metrics, (b) direct approach, and (c) indirect approach. Graphs represent independent validation
results of a single seed shown for illustrative purposes. Black lines are 1:1 reference lines; gray lines
are regression lines.
Figure 2.
Observed versus predicted scatter plots of total volume (m
3
ha
1
) modeled with (
a
) ALS
metrics, (
b
) direct approach, and (
c
) indirect approach. Graphs represent independent validation
results of a single seed shown for illustrative purposes. Black lines are 1:1 reference lines; gray lines are
regression lines.
Remote Sens. 2019,11, 1092 11 of 22
Table 4.
Evaluation statistics for prediction of forest attributes with satellite and environmental variables
1.
Random Forest Regression
R2RMSD RMSD% Bias Bias% R2RMSD RMSD% Bias Bias%
Calibration 2(n=5000)
HGT 0.71 1.81 17.85 0.01 0.09 0.68 2.17 21.16 0.00 0.00
BA 0.70 8.14 27.11 0.04 0.14 0.66 9.80 32.20 0.00 0.00
GMV 0.67 59.40 31.66 0.50 0.26 0.58 73.06 36.49 0.00 0.00
TVOL 0.70 61.64 28.30 0.14 0.06 0.61 76.73 34.73 0.00 0.00
B 0.74 30.43 23.31 0.21 0.16 0.67 39.43 30.63 0.00 0.00
Validation (n=39)
HGT 0.84 1.66 15.88 0.09 0.91 0.79 1.74 16.59 0.31 2.94
BA 0.76 8.87 26.27 2.39 7.08 0.64 9.93 29.43 0.75 2.24
GMV 0.79 55.14 31.47 3.04 1.74 0.67 63.41 36.20 10.13 5.78
TVOL 0.74 56.55 26.71 6.60 3.11 0.64 66.66 31.48 7.64 3.61
B 0.65 31.36 23.72 3.07 2.32 0.56 37.15 28.10 4.47 3.38
1
Random forest values represent average results of five random seeds.
2
Calibration statistics are calculated from
out-of-bag samples for random forest and leave-one-out cross validation for regression.
3.3. Direct vs. Indirect Approach
For all attributes studied, the indirect approach resulted in higher correspondence between
observed and predicted values than the direct approach (e.g., TVOL, Figure 2b and c; Figure 3).
R
2
values increased by 13–49% for random forest and 13–36% for regression depending on the attribute.
Similarly, mean deviation of predicted values with respect to the observed ones was consistently
lower for the indirect approach than the direct approach by 8–28% for random forest and 14–20% for
regression. These error statistics favor the indirect approach over the direct approach—when using
the satellite and environmental models to map the extended area. Not surprising, the ALS-based
predictions had the most favorable error statistics overall, with the exception of bias. Bias was negative
for all attributes for the ALS-based predictions, but more variable for the direct and indirect approaches.
However, bias was less than 13% for all attributes regardless of the approach used.
Although the ALS models predicted all attributes well (R
2
>0.83, Table 3), HGT had the most
favorable error statistics for the satellite and environmental models (Figure 3). Better results were
achieved for the indirect approach (R
2
>0.79; RMSD% <17
;
Bias% <3), whereas the results of the
direct approach were poorer (R
2
<0.70; RMSD% >20; Bias% <4) for both random forest and regression
models. The attribute with the least favorable results based on the satellite and environmental variables
was GMV. Again, the results of the indirect approach (R
2
>0.67; RMSD% <37; Bias% <6) were more
favorable than those of the direct approach (R2<0.66; RMSD% >38; Bias% <6).
For illustration, we present observed versus predicted scatter plots for TVOL for each approach
based on the independent validation data (Figure 2). The results represent a single seed, however,
the scatter plots and associated statistics were similar regardless of the seed selected (results not
shown). The correspondence between the observed and ALS-based predictions was strong (R
2
>0.90;
RMSD% <18
) for both random forest and regression (Figure 2a). Not surprisingly, the predictions from
the satellite and environmental variables showed weaker correspondence with higher prediction errors
than those achieved with ALS. However, the indirect approach (Figure 2c) provided more favorable
results than the direct approach (Figure 2b) for random forest (R
2
=0.73 vs. 52; RMSD =58 vs. 76.3 m
3
ha
1
) with the exception of bias, which increased and switched from positive to negative (Bias =
8.23
vs. 6.06 m
3
ha
1
). For regression, the indirect approach provided more favorable results for all three
error statistics (R2=0.65 vs. 0.54; RMSD =66.8 vs. 82 m3ha1; Bias =10.1 vs. 15.8 m3ha1).
Vuong tests indicated that the dierences in the relationship between observed and predicted
values between the direct and indirect approaches were statistically significant (P
DIFF
<0.05) for all
Remote Sens. 2019,11, 1092 12 of 22
attributes. Moreover, there was sucient evidence that the indirect approach performed better than
the direct approach (P
INDvsDIR
<0.05) for all attributes except BA for random forest (P
INDvsDIR
=0.05)
and B for regression (P
INDvsDIR
=0.10) (Table 5a). Vuong tests comparing the relationships between
observed and predicted values for random forest and regression suggest sucient evidence that the
random forest models performed better than the regression results for the indirect approach (P
RFvsREG
<0.05) and for the direct approach for GMV (P
RFvsREG
=0.02), but not so for the direct approach and
the remaining forest attributes (PRFvsREG >0.05)(Table 5b).
Table 5.
Vuong test comparing observed versus predicted values from (a) the direct and indirect
approaches and (b) random forest and regression models.
(a) Direct vs. Indirect
Random Forest Regression
R2DIR R2IND pDIFF pDIRvsIND pINDvsDIR R2DIR R2IND pDIFF pDIRvsIND pIND vsDIR
HGT 0.67 0.86 0.000 0.999 0.001 0.70 0.79 0.001 0.996 0.004
BA 0.68 0.77 0.000 0.947 0.053 0.54 0.65 0.000 0.982 0.018
GMV
0.67 0.80 0.000 0.977 0.023 0.50 0.68 0.000 1.000 0.000
TVOL
0.54 0.76 0.000 0.993 0.007 0.54 0.65 0.000 0.960 0.040
B 0.44 0.66 0.000 0.991 0.009 0.47 0.56 0.000 0.895 0.105
(b) Random Forest vs. Regression
Direct Indirect
R2RF R2REG pDIFF pRFvsREG pREGvsRF R2RF R2REG pDIFF pRFvsREG pREGvsRF
HGT 0.67 0.70 0.000 0.626 0.374 0.86 0.79 0.000 0.025 0.975
BA 0.68 0.54 0.000 0.060 0.940 0.77 0.65 0.000 0.000 1.000
GMV
0.67 0.50 0.000 0.018 0.982 0.80 0.68 0.000 0.004 0.996
TVOL
0.54 0.54 0.000 0.491 0.509 0.76 0.65 0.000 0.002 0.998
B 0.44 0.47 0.000 0.617 0.383 0.66 0.56 0.000 0.008 0.992
p
DIFF
: variance test if the models are significantly dierent; p
DIRvsIND
: likelihood ratio test that the direct (DIR)
model fits better than the indirect (IND) model; p
INDvsDIR
: likelihood ratio test that the indirect (IND) model fits
better than the direct (DIR) model; p
RFvsREG:
likelihood ratio test that the random forest (RF) model fits better than
the regression (REG) model; p
REGvsRF:
likelihood ratio test that the regression (REG) model fits better than the
random forest (RF) model.
3.4. Landscape Patterns
Maps illustrate landscape patterns of the attribute predictions (e.g., TVOL, Figure 4). The phase
1 maps of the study site generated with ALS metrics are visually similar regardless of the modeling
method used. This observation is consistent with the fact that both the random forest and regression
models were strong with low prediction errors. The maps generated with the satellite and environmental
variables exhibit greater variation between the direct and indirect approaches and between the random
forest and regression models. Based on the independent evaluation, the optimal results were achieved
for the indirect approach over the direct approach and for the random forest models over the regression
models. The map of TVOL produced with the indirect approach and the random forest models showed
the greatest similarity with the ALS predictions. There appears to be a slight overestimation of the low
volume areas and a slight underestimation of high volume areas, as is also evident from the scatter
plots (Figure 2c) and in the more limited range of values mapped with the random forest models.
The maps from the direct approach and from the regression models had more low values (TVOL <
50 m
3
ha
1
) and more high values (TVOL >400 m
3
ha
1
) throughout the ALS area and the extended
district. These map observations are consistent with the slight underestimation of the low volume plots
and overestimation of the high volume plots observed in the corresponding scatter plots of Figure 2.
Remote Sens. 2019,11, 1092 13 of 22
Remote Sens. 2019, 11, x FOR PEER REVIEW 13 of 24
Figure 3. Independent evaluation of direct and indirect and modeling approaches. ALS model
evaluation is shown for reference. Values presented are the average results of each approach
implemented with five random seeds. Error bars represent ± 1 standard deviation. See Table 5 for
evaluation of significant differences.
Random Forest Regression
0.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
HGT BA GMV TVOL B
R
2
DIRECT R2 INDIRECT R2 ALS R2
0
5
10
15
20
25
30
35
40
45
50
HGT BA GMV TVOL B
RMSD%
DIRECT RMSD% INDIRECT RMSD% ALS RMSD%
-14
-12
-10
-8
-6
-4
-2
0
2
4
6
8
10
HGT BA GMV TVOL B
Bias%
DIRECT INDIRECT ALS
0.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
HGT BA GMV TVOL B
R
2
DIRECT R2 INDIRECT R2 ALS R2
0
5
10
15
20
25
30
35
40
45
50
HGT BA GMV TVOL B
RMSD%
DIRECT RMSD% REG_INDIRE CT RMSD% ALS RMSD%
-14
-12
-10
-8
-6
-4
-2
0
2
4
6
8
10
HGT BA GMV TVOL B
Bias%
DIRECT INDIRECT ALS
Figure 3.
Independent evaluation of direct and indirect and modeling approaches. ALS model
evaluation is shown for reference. Values presented are the average results of each approach
implemented with five random seeds. Error bars represent
±
1 standard deviation. See Table 5
for evaluation of significant dierences.
Remote Sens. 2019,11, 1092 14 of 22
Remote Sens. 2019, 11, x FOR PEER REVIEW 14 of 24
Figure 4. Illustrative maps of total volume (m3 ha1) predicted with (a) ALS, (b) direct and (c) indirect
modeling approaches. The corresponding ALS area is outlined in black.
Figure 4.
Illustrative maps of total volume (m
3
ha
1
) predicted with (
a
) ALS, (
b
) direct and (
c
) indirect
modeling approaches. The corresponding ALS area is outlined in black.
Remote Sens. 2019,11, 1092 15 of 22
4. Discussion
4.1. ALS-Based Inventory
We generated a suite of ALS-based models for predicting attributes of conifer-dominated forests
of western Newfoundland, Canada. All models had R
2
values of 0.83 or greater and relative RMSD
values <26% of mean values based on independent validation data. The model prediction errors and
mapped products were similar using both regression and random forest models. These results are
consistent with many other studies demonstrating the use of ALS for area-based estimation of forest
attributes in boreal forest conditions [
1
,
6
,
7
,
53
]. However, in many instances, wall-to-wall ALS coverage
is not available. Therefore, we evaluated the use of a subarea of ALS coverage to calibrate satellite and
environmental data that oers spatially comprehensive coverage over an extended area.
4.2. Extension of ALS-Based Inventory
The approach used in this study combines information measured at ground plots with a subarea
of wall-to-wall ALS data and spatially comprehensive satellite and environmental data to map forest
attributes for an extended area of interest. More specifically, we used ALS-based predictions to provide
thematic and spatial balance, a highly desirable property of area-based inventory design. We used S2
spectral bands and PALSAR L-band SAR backscatter combined with topographic and solar radiation
variables to provide complementary information for extending the ALS-based inventory over an area
with similar ecological conditions. The ALS-based predictions improved representation of the range of
forest and environmental conditions available for calibrating the spatially comprehensive data in an
indirect estimation and mapping approach.
The practical justification for such an approach requires demonstration that the prediction of
inventory attributes exceeds that achieved by linking ground plots directly to spatially comprehensive
satellite and environmental data [
42
]. Accordingly, we tested the hypothesis that the indirect modeling
approach using ALS-based predictions as calibration data improves prediction over a direct modeling
approach using a more limited set of ground plots. In evaluating the hypothesis, we used an
independent set of ground plots measured within two years of the ALS and satellite data acquisitions.
Improvements in R
2
and RMSD were observed for the indirect approach over the direct approach
for all attributes analysed in this study. The satellite and environmental models developed using
the ALS-based predictions resulted in increased correspondence between observed and predicted
values by 13–49% and decreased prediction errors by 8–28% compared with models developed directly
with the ground plots. Significance tests suggested that there was sucient evidence that the indirect
approach was better than the direct approach for four of the five attributes regardless of whether
random forest or regression models were used. Thus, the evaluation with independent validation plots
confirmed our hypothesis for several key forest attributes of western Newfoundland boreal forests.
This finding is important because the added cost of ALS data acquisition for a portion of an area of
interest can be justified if the prediction accuracy exceeds an approach using only the ground plots.
4.3. Parametric vs. Nonparametric Models
Random forest (nonparametric) and regression (parametric) models have advantages and
disadvantages when it comes to operational prediction of forest inventory attributes [
53
]. Random
forest models take full advantage of the data set and do not require prior specification. As random forest
models never predict outside the range of the data, there are no concerns with extreme predictions,
but regression allows for model fitting with more limited data. However, there are additional
considerations when developing regression models as regression models must satisfy conditions of
normality, heterogeneity, significance of predictors, and collinearity. Furthermore, the number of
possible predictors is limited by the sample size to avoid overfitting.
Our results suggest that both random forest and regression models were suitable for predicting
forest inventory attributes from ALS data. In our study, the dierences between the random forest
Remote Sens. 2019,11, 1092 16 of 22
and regression model predictions with ALS metrics were not significant. Similarly, for the direct
approach, we did not find sucient evidence that the random forest models fit better than the regression
models, with the exception of GMV. On the contrary, the indirect approach implemented with random
forest models resulted in better correspondence between predicted and observed values and reduced
prediction errors relative to the direct approach. This may be in part due to the ability of random forest
models to capture the more complicated relationships between forest attributes and the satellite and
environmental data, whereas the regression models are based on the linear relation hypothesis.
4.4. Data and Technical Considerations
The results of this study are likely dependent upon the quality of the remote sensing data sets used.
For phase 1, we acquired ALS data with parameters optimized for forest applications [
92
] including
50% overlap between flight lines with minimum aggregate pulse densities of six points per square
meter. Single-pass ALS data with lower pulse densities may result in lower quality ALS models, the
errors of which are propagated through the indirect approach. For phase 2, we used medium resolution
optical and radar satellite data combined with topographic and solar radiation indices for building the
models. In particular, we used high quality cloud free S2 imagery all acquired on the same day, which
reduced the requirement for normalization. Any normalization of imagery across scenes could reduce
the radiometric quality of the imagery. Additionally, the use of imagery from dierent dates and times
of year in order to expand across large areas could further introduce noise and limit the predictability
of the attributes from the satellite imagery. On the other hand, other types of remote sensing data
may oer complementary information with the potential for improved results. For example, many
researchers have shown that GLAS data are useful for extension purposes [
44
50
] and that metrics
derived from Landsat time-series data have improved the estimation of forest attributes beyond the use
of single-date imagery [
93
,
94
]. Spaceborne LiDAR from the Advanced Topographic Laser Altimeter
System (ATLAS) sensor on board ICESat-2 or future Global Ecosystem Dynamics Investigation (GEDI)
mission could also facilitate estimation across large regions when used in combination with ground
and airborne LiDAR data.
In addition, several important technical aspects may have influenced the results achieved [
41
]. For
phase 1, it was important to ensure that the range of forest structural conditions was well-represented
by the calibration plots. Herein, we used principle components extracted from the ALS data [
13
] prior
to the selection of field plots to ensure that the range of structural conditions captured by the ALS data
was sampled. In theory, any pixels with values outside the range of the calibration data should be
nonforested or not the forest type of interest. Furthermore, the ground plots established for calibrating
the ALS data were designed specifically for ALS-based modeling [
51
]. We used precisely located
circular plots representing an area of 400 m
2
to correspond with the desired mapping resolution from
the ALS data. We dierentially corrected GPS measurements to minimize positional errors to the extent
possible. As coniferous forest was the primary commercial interest in this area, our study was limited
to areas with coniferous trees representing
75% of the total basal area. For these conditions, both the
ALS random forest and regression models were strong, allowing us to generate accurate predictions
for the area covered by ALS. However, we expect that any factors that aect the quality of the ALS
predictions (e.g., geolocation and representativity of the ground plots, ALS acquisition parameters,
and forest conditions) will aect the results achieved by the indirect approach.
For phase 2, the advantage of using ALS predictions of the response variables over ground
plots alone was that the ALS predictions represented the broader range of conditions of the spatially
comprehensive predictor variables. In order to extract representative calibration samples from the
ALS predictions, we applied “balanced” sampling [
88
,
89
], whereby the samples were more or less
evenly distributed over the extent of the population of interest. We sampled equally across 10 strata
of the response variable, and within each strata, we balanced the sample across the auxiliary space
to ensure that the samples were distributed across the range of conditions in the predictor variables.
Remote Sens. 2019,11, 1092 17 of 22
A less representative sample of ALS calibration samples may have resulted in poorer phase 2 models
as thematic and spatial balance are highly desirable aspects of inventory sample designs [41].
4.5. Implications for Forest Inventory
Forest inventories require information on a broad suite of forest attributes. In this study, we focused
on key attributes commonly mapped with ALS that characterize the structure of vegetation. However,
forest type and tree species information is a key information requirement of forest management [
17
].
Although substantive research has been conducted on species characterization with ALS data (reviewed
by Vauhkonen et al. [95]), methods for mapping species have not yet reached the same level of maturity
as those for mapping structural attributes such as height and volume. As a result, we did not address
individual species. Rather, we stratified our plot database into coniferous, broadleaf, and mixed forest
types as is common practice for area-based modeling, and we based our study on the coniferous forest
strata. We did not produce models for broadleaf and mixed forest because of the limited ground sample
data representing these forest types. Furthermore, we did not include the broadleaf and mixed forest plots
in the models because preliminary analysis showed that doing so resulted in substantive underestimation
of the high volume coniferous stands. Given that coniferous forest represents ~90% of the productive
forest of this region and is the primary forest of industrial interest, we decided to sacrifice coverage for
better estimation of attributes of the commercial forest. This limited application of the models to the area
of coniferous forest and required a spatial layer representing forest type for mapping purposes, which
we obtained from a conventional photo-based inventory of the area. Further work (and additional field
sampling) is required to develop models for the broadleaf and mixed forest types.
The indirect approach used in this study does not preclude the requirement for ground plots.
On the contrary, spatially precise and well-distributed ground plots are essential for building high
quality ALS models. However, the indirect approach has the potential to optimize the efficiency of
ground plot acquisition. For example, in this study, we used a priori ALS data to characterize the range
of forest structural conditions across the study area prior to field sampling. There is general consensus
that the use of a priori ALS data can maximize efficiency and reduce costs of ground plot acquisitions
for area-based ALS modeling and mapping [
59
]. Recent research on the transferability of ALS-attribute
models suggests potential for cost savings of some attributes by applying models to data with different
point cloud characteristics or different areas [
96
]. Furthermore, in this study, we demonstrated that
supplementing ground plots with ALS samples improved prediction over a direct modeling approach
that uses the more limited set of ground plots. Other studies have also shown improved estimation of
volume and biomass by combining field plots, ALS data and satellite data [
41
43
]. Supplementation of
ground plots with ALS samples could significantly reduce inventory costs for remote and less accessible
areas. Further research is needed in the design of ground and ALS sampling systems for multilevel
mapping and estimation and to make inventories most cost-efficient (e.g., [97]).
Finally, our study was carried out in the boreal forest conditions of western Newfoundland,
Canada. The practical objective was to extend ALS-based mapping of key forest structural attributes
from an area covered by wall-to-wall ALS data to an area of similar ecological conditions representing
a full forest management district. Extending the mapping beyond these ecological conditions would
result in predictions with unknown and likely increased errors. Additionally, the performance of the
approach under dierent conditions (i.e., forest types, stand structures, dicult terrain) and with
dierent datasets requires further research.
5. Conclusions
In this study, we evaluated an indirect approach for extending ALS-based mapping of forest
attributes with medium resolution satellite and environmental data for coniferous-dominated forests
of western Newfoundland. In the first phase, we used ALS data to map a suite of forest attributes with
high accuracy. In the second phase, we used ALS-based predictions from phase 1 to calibrate spatially
comprehensive satellite and environmental data for an extended area. The indirect approach improved
Remote Sens. 2019,11, 1092 18 of 22
estimation beyond an approach linking ground plots directly to spatially comprehensive data for a
suite of forest inventory attributes. The results were consistent using both regression and random
forest models. Therefore, we concluded that the indirect mapping approach resulted in improved
prediction over a direct approach for mapping coniferous forest attributes of an extended area with
similar ecological conditions. In this study, we believe that suitability of calibration plots at both phases
of the modeling were critical to the implementation. Moreover, the quality of the ALS data and of
the satellite and environmental data likely aected the model accuracy of each phase. Therefore, we
suggest that further research is needed to evaluate improvements oered by the indirect approach for
dierent data and environmental conditions.
Author Contributions:
Conceptualization, J.E.L., R.A.F. and M.B.; Formal analysis, J.E.L.; Methodology, J.E.L.,
R.A.F., O.R.v.L. and M.B.; Validation, J.E.L. and O.R.v.L.; Writing—original draft, J.E.L.; Writing—review & editing,
R.A.F., O.R.v.L. and M.B.
Funding:
This research was funded by the Canadian Forest Service–Canadian Wood Fibre Centre (CWFC) and
the Assessment of Wood Attributes using Remote Sensing (AWARE) Project (NSERC CRDPJ-462973-14, grantee
N.C. Coops, UBC), in collaboration with Corner Brook Pulp and Paper Limited (CBPPL) and the Newfoundland
and Labrador Department of Fisheries and Land Resources (NLFLA).
Acknowledgments:
The authors thank Darrell Harris and Ken Parsons for helpful discussions on inventory of
Newfoundland forests, and Caroline Simpson for editorial review.
Conflicts of Interest: The authors declare no conflict of interest.
References
1.
Naesset, E. Airborne laser scanning as a method in operational forest inventory: Status of accuracy
assessments accomplished in Scandinavia. Scand. J. For. Res. 2007,22, 433–442. [CrossRef]
2.
Wulder, M.A.; Bater, C.W.; Coops, N.C.; Hilker, T.; White, J.C. The role of LiDAR in sustainable forest
management. For. Chron. 2008,84, 807–826. [CrossRef]
3.
Montaghi, A.; Corona, P.; Dalponte, M.; Gianelle, D.; Chirici, G.; Olsson, H. Airborne laser scanning of forest
resources: An overview of research in Italy as a commentary case study. Int. J. Appl. Earth Obs. Geoinf.
2013
,
23, 288–300. [CrossRef]
4.
Maltamo, M.; Næsset, E.; Vauhkonen, J. Forestry Applications of Airborne Laser Scanning: Concepts and Case
Studies; Springer: Dordrecht, The Netherlands, 2014; Volume 27, ISBN 9789401786621.
5.
Næsset, E.; Gobakken, T. Estimation of above- and below-ground biomass across regions of the boreal forest
zone using airborne laser. Remote Sens. Environ. 2008,112, 3079–3090. [CrossRef]
6.
Woods, M.; Pitt, D.; Penner, M.; Lim, K.; Nesbitt, D.; Etheridge, D.; Treitz, P. Operational Implementation of a
LiDAR Inventory in Boreal Ontario. For. Chron. 2011,87, 512–528. [CrossRef]
7.
Luther, J.E.; Skinner, R.; Fournier, R.A.; van Lier, O.R.; Bowers, W.W.; Cot
é
, J.F.; Hopkinson, C.; Moulton, T.
Predicting wood quantity and quality attributes of balsam fir and black spruce using airborne laser scanner
data. Forestry 2014,87, 313–326. [CrossRef]
8.
Næsset, E. Predicting forest stand characteristics with airborne scanning laser using a practical two-stage
procedure and field data. Remote Sens. Environ. 2002,80, 88–99. [CrossRef]
9.
Holmgren, J. Prediction of tree height, basal area and stem volume in forest stands using airborne laser
scanning. Scand. J. For. Res. 2004,19, 543–553. [CrossRef]
10.
Hyyppä, J.; Hyyppä, H.; Leckie, D.; Gougeon, F.; Yu, X.; Maltamo, M. Review of methods of small-footprint
airborne laser scanning for extracting forest inventory data in boreal forests. Int. J. Remote Sens.
2008
,29,
1339–1366. [CrossRef]
11.
White, J.C.; Wulder, M.A.; Varhola, A.; Vastaranta, M.; Coops, N.C.; Cook, B.D.; Pitt, D.; Woods, M. A Best
Practices Guide for Generating Forest Inventory Attributes from Airborne Laser Scanning Data Using an Area-Based
Approach; Information Report FI-X-010; Natural Resources Canada, Canadian Forest Service, Canadian Wood
Fibre Centre: Victoria, BC, Canada, 2013; p. 41.
12.
Bouvier, M.; Durrieu, S.; Fournier, R.A.; Renaud, J.P. Generalizing predictive models of forest inventory
attributes using an area-based approach with airborne LiDAR data. Remote Sens. Environ.
2015
,156, 322–334.
[CrossRef]
Remote Sens. 2019,11, 1092 19 of 22
13.
Li, Y.; Andersen, H.-E.; McGaughey, R. A comparison of statistical methods for estimating forest biomass
from light detection and ranging data. J. Appl. For. 2008,23, 223–231.
14.
Brosofske, K.D.; Froese, R.E.; Falkowski, M.J.; Banskota, A. A review of methods for mapping and prediction
of inventory attributes for operational forest management. For. Sci. 2014,60, 733–756. [CrossRef]
15.
Hansen, E.H.; Ene, L.T.; Mauya, E.W.; Patoˇcka, Z.; Mikita, T.; Gobakken, T.; Næsset, E. Comparing empirical
and semi-empirical approaches to forest biomass modelling in dierent biomes using airborne laser scanner
data. Forests 2017,8, 1–17. [CrossRef]
16.
Hollaus, M.; Dorigo, W.; Wagner, W.; Schadauer, K.; Höfle, B.; Maier, B. Operational wide-area stem volume
estimation based on airborne laser scanning and national forest inventory data. Int. J. Remote Sens.
2009
,30,
5159–5175. [CrossRef]
17.
White, J.C.; Coops, N.C.; Wulder, M.A.; Vastaranta, M.; Hilker, T.; Tompalski, P. Remote Sensing Technologies
for Enhancing Forest Inventories: A Review. Can. J. Remote Sens. 2016,42, 619–641. [CrossRef]
18.
Kangas, A.; Astrup, R.; Breidenbach, J.; Fridman, J.; Gobakken, T.; Korhonen, K.T.; Maltamo, M.; Nilsson, M.;
Nord-Larsen, T.; Næsset, E.; et al. Remote sensing and forest inventories in Nordic countries–roadmap for
the future. Scand. J. For. Res. 2018,33, 397–412. [CrossRef]
19.
Hudak, A.T.; Lefsky, M.A.; Cohen, W.B.; Berterretche, M. Integration of lidar and Landsat ETM+data for
estimating and mapping forest canopy height. Remote Sens. Environ. 2002,82, 397–416. [CrossRef]
20.
Hudak, A.T.; Crookston, N.L.; Evans, J.S.; Falkowski, M.J.; Smith, A.M.S.; Gessler, P.E.; Morgan, P. Regression
modeling and mapping of coniferous forest basal area and tree density from discrete-return lidar and
multispectral satellite data. Can. J. Remote Sens. 2006,32, 126–138. [CrossRef]
21.
Huang, S.; Ramirez, C.; Kennedy, K.; Mallory, J. A New Approach to Extrapolate Forest Attributes from Field
Inventory with Satellite and Auxiliary Data Sets. For. Sci. 2017,63, 232–240. [CrossRef]
22.
Zheng, D.; Rademacher, J.; Chen, J.; Crow, T.; Bresee, M.; Le Moine, J.; Ryu, S.R. Estimating aboveground
biomass using Landsat 7 ETM+data across a managed landscape in northern Wisconsin, USA. Remote Sens.
Environ. 2004,93, 402–411. [CrossRef]
23.
Labrecque, S.; Fournier, R.A.; Luther, J.E.; Piercey, D. A comparison of four methods to map biomass from
Landsat-TM and inventory data in western Newfoundland. For. Ecol. Manage.
2006
,226, 129–144. [CrossRef]
24.
Lu, D.; Chen, Q.; Wang, G.; Moran, E.; Batistella, M.; Zhang, M.; Vaglio Laurin, G.; Saah, D. Aboveground
forest biomass estimation with Landsat and LiDAR data and uncertainty analysis of the estimates. Int. J.
For. Res. 2012,2012, 1–16. [CrossRef]
25.
McRoberts, R.E. Estimating forest attribute parameters for small areas using nearest neighbors techniques.
For. Ecol. Manage. 2012,272, 3–12. [CrossRef]
26.
Beaudoin, A.; Bernier, P.Y.; Guindon, L.; Villemaire, P.; Guo, X.J.; Stinson, G.; Bergeron, T.; Magnussen, S.;
Hall, R.J. Mapping attributes of Canada’s forests at moderate resolution through kNN and MODIS imagery.
Can. J. For. Res. 2014,44, 521–532. [CrossRef]
27.
Chirici, G.; Mura, M.; McInerney, D.; Py, N.; Tomppo, E.O.; Waser, L.T.; Travaglini, D.; McRoberts, R.E.
A meta-analysis and review of the literature on the k-Nearest Neighbors technique for forestry applications
that use remotely sensed data. Remote Sens. Environ. 2016,176, 282–294. [CrossRef]
28.
Latifi, H.; Nothdurft, A.; Koch, B. Non-parametric prediction and mapping of standing timber volume and
biomass in a temperate forest: Application of multiple optical/LiDAR-derived predictors. Forestry
2010
,83,
395–407. [CrossRef]
29.
Mäkelä, H.; Pekkarinen, A. Estimation of forest stand volumes by Landsat TM imagery and stand-level
field-inventory data. For. Ecol. Manage. 2004,196, 245–255. [CrossRef]
30.
Tomppo, E.; Olsson, H.; Ståhl, G.; Nilsson, M.; Hagner, O.; Katila, M. Combining national forest inventory
field plots and remote sensing data for forest databases. Remote Sens. Environ.
2008
,112, 1982–1999.
[CrossRef]
31.
Mura, M.; Bottalico, F.; Giannetti, F.; Bertani, R.; Giannini, R.; Mancini, M.; Orlandini, S.; Travaglini, D.;
Chirici, G. Exploiting the capabilities of the Sentinel-2 multi spectral instrument for predicting growing stock
volume in forest ecosystems. Int. J. Appl. Earth Obs. Geoinf. 2018,66, 126–134. [CrossRef]
32.
Deo, R.K.; Russell, M.B.; Domke, G.M.; Woodall, C.W.; Falkowski, M.J.; Cohen, W.B. Using Landsat time-series
and LiDAR to inform aboveground forest biomass baselines in northern Minnesota, USA. Can. J. Remote
Sens. 2017,43, 28–47. [CrossRef]
Remote Sens. 2019,11, 1092 20 of 22
33.
Deo, R.; Russell, M.; Domke, G.; Andersen, H.-E.; Cohen, W.; Woodall, C.; Deo, R.K.; Russell, M.B.;
Domke, G.M.; Andersen, H.-E.; et al. Evaluating site-specific and generic spatial models of aboveground
forest biomass based on Landsat time-series and LiDAR strip samples in the Eastern USA. Remote Sens.
2017
,
9, 598. [CrossRef]
34.
McInerney, D.O.; Suarez-Minguez, J.; Valbuena, R.; Nieuwenhuis, M. Forest canopy height retrieval using
LiDAR data, medium-resolution satellite imagery and kNN estimation in Aberfoyle, Scotland. Forestry
2010
,
83, 195–206. [CrossRef]
35.
Pascual, C.; Garc
í
a-Abril, A.; Cohen, W.B.; Mart
í
n-Fern
á
ndez, S. Relationship between LiDAR-derived forest
canopy height and Landsat images. Int. J. Remote Sens. 2010,31, 1261–1280. [CrossRef]
36.
Maselli, F.; Chiesi, M.; Montaghi, A.; Pranzini, E. Use of ETM+images to extend stem volume estimates
obtained from lidar data. ISPRS J. Photogramm. Remote Sens. 2011,66, 662–671. [CrossRef]
37.
Wulder, M.A.; White, J.C.; Bater, C.W.; Coops, N.C.; Hopkinson, C.; Chen, G. Lidar plots — a new large-area
data collection option: context, concepts, and case study. Can. J. Remote Sens. 2012,38, 600–618. [CrossRef]
38.
Zald, H.S.J.; Wulder, M.A.; White, J.C.; Hilker, T.; Hermosilla, T.; Hobart, G.W.; Coops, N.C. Integrating
Landsat pixel composites and change metrics with lidar plots to predictively map forest structure and
aboveground biomass in Saskatchewan, Canada. Remote Sens. Environ. 2016,176, 188–201. [CrossRef]
39.
Wulder, M.A.; White, J.C.; Nelson, R.F.; Næsset, E.; Ørka, H.O.; Coops, N.C.; Hilker, T.; Bater, C.W.;
Gobakken, T. Lidar sampling for large-area forest characterization: A review. Remote Sens. Environ.
2012
,
121, 196–209. [CrossRef]
40.
Matasci, G.; Hermosilla, T.; Wulder, M.A.; White, J.C.; Coops, N.C.; Hobart, G.W.; Zald, H.S.J. Large-area
mapping of Canadian boreal forest cover, height, biomass and other structural attributes using Landsat
composites and lidar plots. Remote Sens. Environ. 2018,209, 90–106. [CrossRef]
41.
Andersen, H.E.; Strunk, J.; Temesgen, H.; Atwood, D.; Winterberger, K. Using multilevel remote sensing and
ground data to estimate forest biomass resources in remote regions: A case study in the boreal forests of
interior Alaska. Can. J. Remote Sens. 2012,37, 596–611. [CrossRef]
42.
Strunk, J.L.; Temesgen, H.; Andersen, H.; Packalen, P. Prediction of Forest Attributes with Field Plots, Landsat,
and a Sample of Lidar Strips: A Case Study on the Kenai Peninsula, Alaska. Photogramm. Eng. Remote Sens.
2014, 143–150. [CrossRef]
43.
Saarela, S.; Grafström, A.; Ståhl, G.; Kangas, A.; Holopainen, M.; Tuominen, S.; Nordkvist, K.; Hyyppä, J.
Model-assisted estimation of growing stock volume using dierent combinations of LiDAR and Landsat
data as auxiliary information. Remote Sens. Environ. 2015,158, 431–440. [CrossRef]
44.
Mahoney, C.; Hall, R.; Hopkinson, C.; Filiatrault, M.; Beaudoin, A.; Chen, Q.; Mahoney, C.; Hall, R.J.;
Hopkinson, C.; Filiatrault, M.; et al. A forest attribute mapping framework: A pilot study in a Northern
boreal forest, Northwest Territories, Canada. Remote Sens. 2018,10, 1338. [CrossRef]
45.
Boudreau, J.; Nelson, R.F.; Margolis, H.A.; Beaudoin, A.; Guindon, L.; Kimes, D.S. Regional aboveground
forest biomass using airborne and spaceborne LiDAR in Qu
é
bec. Remote Sens. Environ.
2008
,112, 3876–3890.
[CrossRef]
46.
Nelson, R.; Boudreau, J.; Gregoire, T.G.; Margolis, H.; Næsset, E.; Gobakken, T.; Ståhl, G. Estimating Quebec
provincial forest resources using ICESat/GLAS. Can. J. For. Res. 2009,39, 862–881. [CrossRef]
47.
Margolis, H.A.; Nelson, R.F.; Montesano, P.M.; Beaudoin, A.; Sun, G.; Andersen, H.-E.; Wulder, M.A.
Combining satellite lidar, airborne lidar, and ground plots to estimate the amount and distribution of
aboveground biomass in the boreal forest of North America. Can. J. For. Res. 2015,45, 838–855. [CrossRef]
48.
Nelson, R.; Margolis, H.; Montesano, P.; Sun, G.; Cook, B.; Corp, L.; Andersen, H.-E.; de Jong, B.; Pellat, F.P.;
Fickel, T.; et al. Lidar-based estimates of aboveground biomass in the continental US and Mexico using
ground, airborne, and satellite observations. Remote Sens. Environ. 2017,188, 127–140. [CrossRef]
49.
Neigh, C.S.R.; Nelson, R.F.; Ranson, K.J.; Margolis, H.A.; Montesano, P.M.; Sun, G.; Kharuk, V.; Næsset, E.;
Wulder, M.A.; Andersen, H.-E. Taking stock of circumboreal forest carbon with ground measurements,
airborne and spaceborne LiDAR. Remote Sens. Environ. 2013,137, 274–287. [CrossRef]
50.
Holm, S.; Nelson, R.; Ståhl, G. Hybrid three-phase estimators for large-area forest inventory using ground
plots, airborne lidar, and space lidar. Remote Sens. Environ. 2017,197, 85–97. [CrossRef]
51.
Hudak, A.T.; Crookston, N.L.; Evans, J.S.; Hall, D.E.; Falkowski, M.J. Nearest neighbor imputation of
species-level, plot-scale forest structure attributes from LiDAR data. Remote Sens. Environ.
2008
,112,
2232–2245. [CrossRef]
Remote Sens. 2019,11, 1092 21 of 22
52.
Hudak, A.T.; Crookston, N.L.; Evans, J.S.; Hall, D.E.; Falkowski, M.J. Corrigendum to “Nearest neighbor
imputation of species-level, plot-scale forest structure attributes from LiDAR data”[Remote Sensing of
Environment, 112: 2232-2245]. Remote Sens. Environ. 2009,113, 289–290. [CrossRef]
53.
Penner, M.; Pitt, D.G.; Woods, M.E. Parametric vs. nonparametric LiDAR models for operational forest
inventory in boreal Ontario. Can. J. Remote Sens. 2013,39, 426–443. [CrossRef]
54.
Rowe, J.S. Forest Regions of Canada. Based on W. E. D. Halliday’s “A forest classification for Canada”; Publication
No 1300; Department of the Environment, Canadian Forestry Service: Ottawa, ON, Canada, 1972; p. 177.
55.
Government of Newfoundland and Labrador Western Newfoundland Subregions, Forestry and Agrifoods
Agency. Available online: https://www.faa.gov.nl.ca/forestry/maps/west_eco.html (accessed on 13 February
2019).
56.
Canadian Forest Service Canada’s National Forest Inventory Ground Sampling Guidelines. Available online:
https://cfs.nrcan.gc.ca/publications?id=29402 (accessed on 23 April 2019).
57.
Maltamo, M.; Bollandsås, O.M.; Næsset, E.; Gobakken, T.; Packal
é
n, P. Dierent plot selection strategies for
field training data in ALS-assisted forest inventory. Forestry 2011,84, 23–31. [CrossRef]
58.
Hawbaker, T.J.; Keuler, N.S.; Lesak, A.A.; Gobakken, T.; Contrucci, K.; Radelo, V.C. Improved estimates of
forest vegetation structure and biomass with a LiDAR-optimized sampling design. J. Geophys. Res. Biogeosci.
2009,114. [CrossRef]
59.
Melville, G.; Stone, C.; Turner, R. Application of LiDAR data to maximise the eciency of inventory plots in
softwood plantations. New Zeal. J. For. Sci. 2015,45. [CrossRef]
60.
White, J.C.; Wulder, M.A.; Varhola, A.; Vastaranta, M.; Coops, N.C.; Cook, B.D.; Pitt, D.; Woods, M. A Model
Development and Application Guide for Generating an Enhanced Forest Inventory Using Airborne Laser Scanning
Data and an Area-Based Approach; Information Report FI-X-018; Natural Resources Canada, Canadian Forest
Service, Canadian Wood Fibre Centre: Victoria, BC, Canada, 2017; p. 50.
61.
Warren, G.R.; Meades, J.P. Wood Defect and Density Studies II Total and Net Volume Equations for Newfoundland
Forest Management Units; Information Report N-X-242; Canadian Forestry Service, Newfoundland Forestry
Centre: St. John’s, NL, Canada, 1986; p. 14.
62.
Ker, M.F. Metric Yield Tables for the Major Forest Cover Types of Newfoundland; Information Report M-X-141;
Natural Resources Canada, Canadian Forest Service, Atlantic Forestry Centre: St. John’s, NL, Canada, 1974;
p. 79.
63.
Lambert, M.-C.; Ung, C.-H.; Raulier, F. Canadian national tree aboveground biomass equations. Can. J.
For. Res. 2005,35, 1996–2018. [CrossRef]
64.
ASPRS. Las Specification 1.4 R13; The American Society for Photogrammetry & Remote Sensing: Methesda,
Maryland, 2013; p. 28.
65. Lim Geomatics. LTK LAS Toolkit Version 1.2; Lim Geomatics Inc.: Ottawa, ON, Canada, 2016; p. 32.
66.
Government of Newfoundland and Labrador Forest Types |Forestry and Agrifoods Agency. Available
online: https://www.faa.gov.nl.ca/forestry/our_forest/forest_types.html (accessed on 23 April 2019).
67.
Goetz, S.; Steinberg, D.; Dubayah, R.; Blair, B. Laser remote sensing of canopy habitat heterogeneity as
a predictor of bird species richness in an eastern temperate forest, USA. Remote Sens. Environ.
2007
,108,
254–263. [CrossRef]
68.
Van Ewijk, K.Y.; Treitz, P.M.; Scott, N.A. Characterizing forest succession in central Ontario using Lidar-derived
indices. Photogramm. Eng. Remote Sens. 2011,77, 261–269. [CrossRef]
69.
Natural Resources Canada. Canadian Digital Elevation Model: Product Specifications - Edition 1.1; Government
of Canada: Sherbrooke, QC, Canada, 2013; p. 11.
70.
Stage, A.R. An expression for the eect of aspect, slope and habitat type on tree growth. For. Sci.
1976
,22,
457–460.
71.
Gatti, A.; Gollapo, A. Sentinel-2 Products Specification Document; Thales Alenia Space: Cannes, France, 2018;
p. 487.
72.
Shimada, M.; Itoh, T.; Motooka, T.; Watanabe, M.; Shiraishi, T.; Thapa, R.; Lucas, R. New global
forest/non-forest maps from ALOS PALSAR data (2007-2010). Remote Sens. Environ.
2014
,155, 13–31.
[CrossRef]
73.
Smith, E.P.; Rose, K.A. Model goodness-of-fit analysis using regression and related techniques. Ecol. Modell.
1995,77, 49–64. [CrossRef]
Remote Sens. 2019,11, 1092 22 of 22
74.
Piñeiro, G.; Perelman, S.; Guerschman, J.P.; Paruelo, J.M. How to evaluate models: Observed vs. predicted or
predicted vs. observed? Ecol. Modell. 2008,216, 316–322. [CrossRef]
75. Breiman, L. Random Forests. Mach. Learn. 2001,45, 5–32. [CrossRef]
76.
R Core Team R: A Language and Environment for Statistical Computing. Available online: http://www.r-
project.org (accessed on 15 January 2019).
77.
Freeman, E.A.; Frescino, T.S. Modeling and Map Production using Random Forest and Stochastic Gradient Boosting;
USDA Forest Service, Rocky Mountain Research Station: Ogden, UT, USA, 2009; p. 65.
78. Liaw, A.; Wiener, M. Classification and regression by randomForest. R News 2002,2, 18–22.
79. Lumley, T. Based on F. Code by A.M. Leaps: Regression Subset Selection. R Package Version 3.0. Available
online: https://cran.r-project.org/package=leaps (accessed on 13 February 2019).
80. Mallows, C.L. Some comments on C P. Technometrics 1973,15, 661–675. [CrossRef]
81.
Myers, R.H. Classical and Modern Regression with Applications, 2nd ed.; PWS-KENT Publishing Company:
Boston, MA, USA, 1990; ISBN 0534921787.
82. Royston, J.P. Algorithm AS 181. Appl. Stat. 2007,31, 176–180. [CrossRef]
83.
Breusch, T.S.; Pagan, A.R. A simple test for heteroscedasticity and random coecient variation. Econometrica
1979,47, 1287–1294. [CrossRef]
84.
Akaike, H. Information Theory and an Extension of the Maximum Likelihood Principle; Petrov, B.N., Cs
á
ki, F.C.,
Eds.; Akademiai Kiado: Budapest, Hungary, 1973.
85.
Hijmans, R.J. Raster: Geographic Data Analysis and Modeling. R Package Version 2.6-7. Available online:
https://cran.r-project.org/package=raster (accessed on 13 February 2019).
86. ESRI. ArcGIS Desktop: Release 10.4; Environmental Systems Research Institute: Redlands, CA, USA, 2016.
87.
Grafström, A.A.; Lisic, J. BalancedSampling: Balanced and Spatially Balanced Sampling. R Package Version
1.5.2. Available online: https://cran.r-project.org/package=BalancedSampling (accessed on 13 February 2019).
88. Grafström, A.; Tillé, Y. Doubly balanced spatial sampling with spreading and restitution of auxiliary totals.
Environmetrics 2013,24, 120–131. [CrossRef]
89.
Grafström, A.; Ringvall, A.H. Improving forest field inventories by using remote sensing data in novel
sampling designs. Can. J. For. Res. 2013,43, 1015–1022. [CrossRef]
90.
Vuong, Q.H. Likelihood ratio tests for model selection and non-nested hypotheses. Econometrica
1989
,57,
307. [CrossRef]
91.
Merkle, E.C.; You, D. Nonnest2: Tests of Non-Nested Models. R Package Version 0.5-2. Available online:
https://cran.r-project.org/package=nonnest2 (accessed on 13 February 2019).
92.
Natural Resources Canada; Public Safety Canada. Federal Airborne LiDAR data Acquisition Guideline
Version 1.0. Available online: https://doi.org/10.4095/308382 (accessed on 15 February 2019).
93.
Bolton, D.K.; White, J.C.; Wulder, M.A.; Coops, N.C.; Hermosilla, T.; Yuan, X. Updating stand-level forest
inventories using airborne laser scanning and Landsat time series data. Int. J. Appl. Earth Obs. Geoinf.
2018
,
66, 174–183. [CrossRef]
94.
Pflugmacher, D.; Cohen, W.B.; Kennedy, R.E.; Yang, Z. Using Landsat-derived disturbance and recovery
history and lidar to map forest biomass dynamics. Remote Sens. Environ. 2014,151, 124–137. [CrossRef]
95.
Vauhkonen, J.; Ørka, H.O.; Holmgren, J.; Dalponte, M.; Heinzel, J.; Koch, B. Tree Species Recognition Based
on Airborne Laser Scanning and Complementary Data Sources. In Forestry Applications of Airborne Laser
Scanning Concepts and Case Studies; Maltamo, M., Naesset, E., Vauhkonen, J., Eds.; Springer: Dordrecht,
The Netherlands, 2014; pp. 135–156.
96.
Tompalski, P.; White, J.C.; Coops, N.C.; Wulder, M.A. Demonstrating the transferability of forest inventory
attribute models derived using airborne laser scanning data. Remote Sens. Environ.
2019
,227, 110–124.
[CrossRef]
97.
Saarela, S.; Holm, S.; Healey, S.; Andersen, H.-E.; Petersson, H.; Prentius, W.; Patterson, P.; Næsset, E.;
Gregoire, T.; Ståhl, G.; et al. Generalized Hierarchical Model-Based Estimation for Aboveground Biomass
Assessment Using GEDI and Landsat Data. Remote Sens. 2018,10, 1832. [CrossRef]
©
2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
... In the last decade, much effort has been devoted to modelling and mapping forest inventory attributes from airborne laser scanning data (ALS) to the point where these data are being used operationally over large, continuous areas internationally (e.g., [1][2][3]). ALS can provide precise and reliable predictions of many stand-mean values of biophysical attributes (e.g., biomass, volume, height, and DBH [4][5][6]), as well as distributions thereof (e.g., stem diameter, height, and volume distributions [7][8][9]). Stem diameter is the most frequently modelled distribution found in the literature (e.g., [10][11][12][13][14][15][16][17]) as it provides insights on stand structure, the basis for understanding the stand's ecological and economic value. ...
Article
Full-text available
This study aimed to determine the optimal approach for estimating stem diameter distributions (SDD) from airborne laser scanning (ALS) data using point cloud metrics (M als), a canopy height model (CHM) texture metrics (M tex), and a combination thereof (M comb). We developed area-based models (i) to classify SDD modality and (ii) predict SDD function parameters, which we tested for 5 modelling techniques. Our results demonstrated little variability in the performance of SDD modality classification models (mean overall accuracy: 72%; SD: 2%). Our best SDD function parameter models were generally fitted with M comb , with R 2 improvements up to 0.25. We found the variable Correlation, originating from M tex , to be the most important predictor within M comb. Trends in the performance of the predictor groups were mostly consistent across the modelling techniques within each parameter. Using an Error Index (EI), we determined that differentiating modality prior to estimating SDD improved the accuracy of estimates for bimodal plots (~12% decrease in EI), which was trivially not the case for unimodal plots (<1% increase in EI). We concluded that (i) CHM texture metrics can be used to improve the estimate of SDD parameters and that (ii) differentiating for modality prior to estimating SSD is especially beneficial in stands with bimodal SDD.
... Table 5 and Fig. 6). These are in line with the studies of Luther et al. (2019) and Bhattarai et al. (2022), which emphasized the importance of topographic data for improved LAI prediction performance and accuracy. Another study carried out by Moradi et al. (2021) demonstrated that the highest accuracy (R 2 = 0.781, RMSE = 20%) in estimating LAI was obtained using the combination with reflectance values, vegetation indices, and topographic data. ...
Article
We aimed to map and analyze LAI by using Landsat 8 and Sentinel-2 time series and the corresponding ground measurements collected in pure Anatolian black pine [Pinus nigra J.F. Arnold ssp. pallasiana (Lamb.) Holmboe] stands within seven-month (from June to December) period. A total of 30 sample plots were selected and seven-month changes of LAI values were determined through hemispherical photography for each sample plot. Remote sensing (reflectance values and vegetation indices obtained from Landsat-8 and Sentinel-2) and topographic (elevation, aspect, and slope) data were used to model the LAI for each month using multiple linear regression (MLR) method. Additionally, the data for all months were combined and modeled. In this case, autoregressive modeling techniques were used to solve the temporal autocorrelation problem. Our study indicated that the models developed from Sentinel-2 give more successful results than Landsat 8 on monthly LAI models. The most successful models were obtained in June by using the reflectance values (R2adj = 0.39, RMSE = 0.3138 m2 m−2), reflectance values–topographic data (R2adj = 0.59, RMSE = 0.3174 m2 m−2), vegetation indices–topographic data (R2adj = 0.82, RMSE = 0.2126 m2 m−2), and reflectance values–vegetation indices–topographic data (R2adj = 0.93, RMSE = 0.1060 m2 m−2). Among the autoregressive modeling techniques, the highest success was obtained with the Landsat 8 OLI using the moving average (2) procedure (R2 = 0.56). This study is significant that it is the first to analyze the monthly effect on LAI modeling and mapping in pure Anatolian black pine stands using both reflectance values, vegetation indices, and topographic data.
... As stated in the literature, collecting multiple ALS data for an entire area of interest is not always possible due to limited resources or restrictions associated with covering remote or large areas (Luther et al. 2019). As a result, the primary advantage of using a chronosequence to predict tree height growth is that it enables a low-cost method of forecasting maximum tree height by the end of the rotation using a single ALS data set. ...
Article
Full-text available
Representing the spatial distribution of trees and competition interactions in growth models improves growth prediction and provides insights into spatially explicit forecasts for precise silvicultural interventions. However, this information is rarely taken into account over large areas because obtaining the spatial distribution of individual trees and estimating their competition is both expensive and time consuming. Airborne laser scanning enables rapid estimation of tree height and other attributes over large areas. In this study, we implemented an individual tree detection approach to first extract tree attributes of Pinus radiata D. Don plantations, and second to use this spatially explicit information on tree location and competition to forecast potential tree height, defined as a maximum projected tree height at rotation age. To do so, using a chronosequence of tree heights, we developed a tree height growth model using a Chapman–Richards function, utilizing the effect of inter-tree competition and stand-level top height (TH) on the tree height growth. The results showed that using chronosequence of heights, competition, and TH resulted in accurate predictions of potential tree height (root mean square error = 2.9 m; mean absolute percentage error = 0.154%). We concluded that individual tree height growth is significantly influenced by competition, with increased competition values associated with reductions in potential height growth by 22.2% at 30 years.
... Her notable works include contributions to mapping the land cover of Canada [122], modelling forest attributes [123][124][125] and monitoring forest change [126]. Her recent work includes techniques for enhancing forest inventories using optical and LiDAR (Light Detection and Ranging) technologies [127]. ...
Article
Full-text available
Forest research and professional workforces continue to be dominated by men, particularly at senior and management levels. In this review, we identify some of the historical and ongoing barriers to improved gender inclusion and suggest some solutions. We showcase a selection of women in forestry from different disciplines and parts of the globe to highlight a range of research being conducted by women in forests. Boosting gender equity in forest disciplines requires a variety of approaches across local, regional and global scales. It is also important to include intersectional analyses when identifying barriers for women in forestry, but enhanced equity, diversity and inclusion will improve outcomes for forest ecosystems and social values of forests, with potential additional economic benefits.
... These forest attributes were estimated for the reference dataset of surrogate FI plots using ALS and GLAS LiDAR models as previously reported in [5,7] and as summarized in Methods (Section 2.3.1). [7,8,16,43], we considered a number of feature variables with the following criteria: (i) known correlation to forest attributes, (ii) publicly available as wall-to-wall geospatial layers, (iii) native spatial resolution deemed sufficient (25-100 m) for mapping attributes with a 30 m pixel size, and (iv) for satellite imagery, orthorectified and calibrated radiometry with acquisition within plus or minus 1 year relative to the reference mapping year of 2007. By this standard, we gathered across the study area, satellite imagery from both Landsat 5 Thematic Mapper (TM) and ALOS-1/PALSAR, and environmental datasets, both biotic and abiotic, obtained from several sources ( Table 1). . ...
Article
Full-text available
Satellite forest inventories are the only feasible way to map Canada’s vast, remote forest regions, such as those in the Northwest Territories (NWT). A method used to create such inventories is the k-nearest neighbour (k-NN) algorithm, which spatially extends information from forest inventory (FI) plots to the entire forest land base using wall-to-wall features typically derived from Landsat data. However, the benefits of integrating L-band synthetic aperture radar (SAR) data, strongly correlated to forest biomass, have not been assessed for Canadian northern boreal forests. Here we describe an optimized multivariate k-NN implementation of a 151,700 km2 area in southern NWT that included ca. 2007 Landsat and dual-polarized Phased Array type L-band SAR (PALSAR) data on board the Advanced Land Observing Satellite (ALOS). Five forest attributes were mapped at 30 m cells: stand height, crown closure, stand/total volume and above-ground biomass (AGB). We assessed accuracy gains compared to Landsat-based maps. To circumvent the scarcity of FI plots, we used 3600 footprints from the Geoscience Laser Altimeter System (GLAS) as surrogate FI plots, where forest attributes were estimated using Light Detection and Ranging (LiDAR) metrics as predictors. After optimization, k-NN predicted forest attribute values for each pixel as the average of the 4 nearest (k = 4) surrogate FI plots within the Euclidian space of 9 best features (selected among 6 PALSAR, 10 Landsat, and 6 environmental features). Accuracy comparisons were based on 31 National Forest Inventory ground plots and over 1 million airborne LiDAR plots. Maps that included PALSAR HV backscatter resulted in forest attribute predictions with higher goodness of fit (adj. R2), lower percent mean error (ME%), and percent root mean square error (RMSE%), and lower underestimation for larger attribute values. Predictions were most accurate for conifer stand height (RMSE% = 32.1%, adj. R2 = 0.58) and AGB (RMSE% = 47.8%, adj. R2 = 0.74), which is much more abundant in the area than mixedwood or broadleaf. Our study demonstrates that optimizing k-NN parameters and feature space, including PALSAR, Landsat, and environmental variables, is a viable approach for inventory mapping of the northern boreal forest regions of Canada.
... Random forest regression is a flexible, non-parametric machine learning algorithm that has previously been employed to fuse LiDAR and satellite data and produce maps of AGB and other structural parameters (e.g. Wulder et al. 2012;Mascaro et al. 2014;Urbazaev et al. 2018;Luther et al. 2019). Random forest models were fitted using the implementation of scikit-learn in Python (Pedregosa et al. 2011). ...
Article
Information on the spatial distribution of forest aboveground biomass (AGB) and its uncertainty is important to evaluate management and conservation policies in tropical forests. However, the scarcity of field data and robust protocols to propagate uncertainty prevent a robust estimation through remote sensing. We upscaled AGB from field data to LiDAR, and to landscape scale using Sentinel-2 and ALOS-PALSAR through machine learning, propagated uncertainty using a Monte Carlo framework and explored the relative contributions of each sensor. Sentinel-2 outperformed ALOS-PALSAR (R² = 0.66, vs 0.50), however, the combination provided the best fit (R² = 0.70). The combined model explained 49% of the variation comparing against plots within the calibration area, and 17% outside, however, 94% of observations outside calibration area fell within the 95% confidence intervals. Finally, we partitioned the distribution of AGB in different management and conservation categories for evaluating the potential of different strategies for conserving carbon stock.
Article
Full-text available
Airborne laser scanning (ALS) technology is fully implemented in forest resource assessment processes, providing highly accurate and spatially continuous results throughout the area of interest, thus reducing inventory costs when compared with traditional sampling inventories. Several approaches have been employed to estimate forest parameters using ALS data, such as the Area-Based Approach (ABA) and Individual Tree Crown (ITC). These two methodologies use different information processing and field data collection approaches; thus, it is important to have a selection criterion for the method to be used based on the expected results and admissible errors. The objective of this study was to compare the prediction errors of forest inventory attributes in the functioning of ABA and ITC approaches. A plantation of 500 ha of Pinus radiata (400–600 trees ha−1) in Chile was selected; a forest inventory was conducted using the ABA and ITC methods and the accuracy of both methods was analyzed. The ITC models performed better than the ABA models at low tree densities for all forest inventory attributes (15% MAPE in tree density—N—and 11% in volume—V). There was no significant difference in precision regarding the volume and basal area (G) estimations at medium densities, although ITC obtained better results for density and dominant height (Ho). At high densities, ABA performed better for all the attributes except for height (6.5% MAPE in N, 8.7% in G, and 8.9% in V). Our results showed that the precision of forest inventories based on ALS data can be adjusted depending on tree density to optimize the selected approach (ABA and ITC), thus reducing the inventory costs. Hence, field efforts can be greatly decreased while achieving better prediction accuracies.
Article
Modeling forest attributes using lidar data has been a useful tool for forest management but the need to correlate lidar to ground-based measurements creates challenges to modeling in diverse forest landscapes. Many lidar models have been based on metrics derived from summarizations of individual lidar returns over sample plot areas, but more recently, metrics based on summarization by volumetric pixel (voxel) have shown promise to better characterize forest structure and distinguish between diverse forest types. Voxel-based metrics may improve characterization of leaf area distribution and horizontal forest structure, which could help create general models of forest attributes applicable in complex landscapes composed of many distinct forest types. We modeled wood volume in longleaf pine woodlands and associated forests to compare how area- and voxel- based lidar metrics predicted wood volume in forest type specific and general predictive models. We created four area-based and six voxel-based metrics to fit models of wood volume using a multiplicative power function. We selected models and compared metric importance using AIC and evaluated model performance using cross-validated mean prediction error. We found that one area-based metric and four voxel-based metrics consistently improved model predictions We suggest that area-based metrics alone may have limitations for characterizing complex forest structure. Area-based summarizes of lidar returns are more heavily influenced by upper canopy returns because lidar returns attenuate below the canopy. By contrast, summarizing lidar returns into a single value per voxel prior to summarization over plots homogenizes point density, giving added weight to sub-canopy returns. Thus voxel-based metrics may be more sensitive to structural variation that may not be adequately captured by area-based metrics alone. This study highlights the potential of voxel-based metrics for characterizing complex forest structure and model generalization capable of accurate forest attribute prediction across diverse forest types.
Article
Full-text available
Sustainable forest management requires information on the spatial distribution, composition, and structure of forests. However, jurisdictions with large tracts of noncommercial forest, such as the Northwest Territories (NWT) of Canada, often lack detailed forest information across their land base. The goal of the Multisource Vegetation Inventory (MVI) project was to create a large area forest inventory (FI) map that could support strategic forest management in the NWT using optical, radar, and light detection and ranging (LiDAR) satellite remote sensing anchored on limited field plots and airborne LiDAR data. A new landcover map based on Landsat imagery was the first step to stratify forestland into broad forest types. A modelling chain linking FI plots to airborne and spaceborne LiDAR was then developed to circumvent the scarcity of field data in the region. The developed models allowed the estimation of forest attributes in thousands of surrogate FI plots corresponding to spaceborne LiDAR footprints distributed across the project area. The surrogate plots were used as a reference dataset for estimating each forest attribute in each 30 m forest cell within the project area. The estimation was based on the k-nearest neighbour (k-NN) algorithm, where the selection of the four most similar surrogate FI plots to each cell was based on satellite, topographic, and climatic data. Wall-to-wall 30 m raster maps of broad forest type, stand height, crown closure, stand volume, total volume, aboveground biomass, and stand age were created for a ~400,000 km2 area, validated with independent data, and generalized into a polygon GIS layer resembling a traditional FI map. The MVI project showed that a reasonably accurate FI map for large, remote, predominantly non-inventoried boreal regions can be obtained at a low cost by combining limited field data with remote sensing data from multiple sources.
Article
Assessing trade-offs among ecosystem services (ESs) that are provided by forests is necessary to support decision-making and to minimize negative effects of timber harvesting. In this study, we examined how spatial data, forest operational rules, ESs, and probabilistic statistics can be combined into a practical tool for trade-off analysis that could guide decision-making towards sustainable forestry. Our main goal was to analyze trade-offs among the wood provisioning ES and other forest ESs at the landscape level using a Bayesian belief network (BBN). We used LiDAR data to derive four ES layers as inputs to a spatial BBN: (i) wood provisioning; (ii) erosion regulating; (iii) climate regulating; and (iv) habitat supporting. We quantified operational constraints with four forest operational rules (FOR) that were defined in terms of: (i) potential harvest block size; (ii) distance between a small potential harvest block and a larger harvest block; (iii) gross merchantable volume (GMV); and (iv) distance to an existing resource road. Maps of the most probable trade-off classes between the wood provisioning ES and other ESs enabled us to identify areas where timber harvesting should be avoided or where timber harvesting should have a very low negative effect on other ESs. Even with our most restrictive management scenario, the total GMV that could be harvested met the annual allowable cut (AAC) volume required to meet sustainable forestry objectives. Through our study, we demonstrated that high-resolution spatial data could be used to quantify trade-offs among wood provisioning ES and other forest-related ESs and to simulate small changes in ES indicators within the BBN. We also demonstrated the potential to evaluate management scenarios to reduce trade-offs by considering FOR as inputs to the BBN. Maps of the most probable trade-off classes among two or three ESs under operational constraints provide key information to guide forest management decision-making towards sustainable forestry.
Article
Full-text available
Airborne laser scanning (ALS) is a reliable source of accurate information for forest stand inventory attributes including height, cover, basal area, and volume. The commonly applied area-based approach (ABA) allows the derivation of wall-to-wall geospatial coverages representing each of the modeled attributes at a grid-cell level, with spatial resolutions typically between 20 and 30 m. The ABA predictive models are developed using stratified inventory data from field plots, the requirement for which can increase the overall cost of the ALS-based inventory. Parsimonious use of ground plots is a key means to control variable costs in the operational implementation of the ABA. In this paper, we demonstrate how the prediction accuracy of Lorey's height (HL, m), quadratic mean diameter (QMD, cm), and gross volume (V, m 3) vary when existing ABA models are transferred to different areas or are applied to point cloud data with different characteristics than those on which the original model was developed. Specifically, we consider three scenarios of model transferability: (i) same point cloud characteristics, different areas; (ii) different point cloud characteristics, same areas; and (iii) different point cloud characteristics, different areas. We generated area-based models using three modeling approaches: linear regression (OLS), random forests (RF), and k-nearest neighbour (kNN) imputation. Results indicated that the prediction accuracy of area-based models varied by attribute and by modeling approach. We found that when the models were transferred their prediction accuracy decreased, with an average increase in relative bias up to 22.04%, and increase in relative RMSE up to 29.31%. Prediction accuracies for HL were higher than those of QMD or V when models were transferred, and had the lowest average increase in relative bias and relative RMSE of < 5% in the majority of cases. Likewise OLS models for HL had greater prediction accuracies when models were transferred compared to RF and kNN models, especially when the point cloud characteristics were similar. Conversely, we found that for QMD and V, RF models were found to be the most transferable in cases when models were applied to different areas with similar and different point cloud characteristics. While there is potential for cost savings by transferring models and reducing data acquisition costs, our results show the degree of transferability depends more on the attribute being modelled or the modeling approach applied, and less on the characteristics of the point cloud data.
Article
Full-text available
Recent developments in remote sensing (RS) technology have made several sources of auxiliary data available to support forest inventories. Thus, a pertinent question is how different sources of RS data should be combined with field data to make inventories cost-efficient. Hierarchical model-based estimation has been proposed as a promising way of combining: (i) wall-to-wall optical data that are only weakly correlated with forest structure; (ii) a discontinuous sample of active RS data that are more strongly correlated with structure; and (iii) a sparse sample of field data. Model predictions based on the strongly correlated RS data source are used for estimating a model linking the target quantity with weakly correlated wall-to-wall RS data. Basing the inference on the latter model, uncertainties due to both modeling steps must be accounted for to obtain reliable variance estimates of estimated population parameters, such as totals or means. Here, we generalize previously existing estimators for hierarchical model-based estimation to cases with non-homogeneous error variance and cases with correlated errors, for example due to clustered sample data. This is an important generalization to take into account data from practical surveys. We apply the new estimation framework to case studies that mimic the data that will be available from the Global Ecosystem Dynamics Investigation (GEDI) mission and compare the proposed estimation framework with alternative methods. Aboveground biomass was the variable of interest, Landsat data were available wall-to-wall, and sample RS data were obtained from an airborne LiDAR campaign that produced simulated GEDI waveforms. The results show that generalized hierarchical model-based estimation has potential to yield more precise estimates than approaches utilizing only one source of RS data, such as conventional model-based and hybrid inferential approaches.
Article
Full-text available
A methods framework is presented that utilizes field plots, airborne light detection and ranging (LiDAR), and spaceborne Geoscience Laser Altimeter System (GLAS) data to estimate forest attributes over a 20 Mha area in Northern Canada. The framework was implemented to scale up forest attribute models from field data to intersecting airborne LiDAR data, and then to GLAS footprints. GLAS data were sequentially filtered and submitted to the k-nearest neighbour (k-NN) imputation algorithm to yield regional estimates of stand height and crown closure at a 30 m resolution. Resulting outputs were assessed against independent airborne LiDAR data to evaluate regional estimates of stand height (mean difference = −1 m, RMSE = 5 m) and crown closure (mean difference = −5%, RMSE = 9%). Additional assessments were performed as a function of dominant vegetation type and ecoregion to further evaluate regional products. These attributes form the primary descriptive structure attributes that are typical of forest inventory mapping programs, and provide insight into how they can be derived in northern boreal regions where field information and physical access is often limited.
Article
Full-text available
Passive optical remotely sensed images such as those from the Landsat satellites enable the development of spatially comprehensive, well-calibrated reflectance measures that support large-area mapping. In recent years, as an alternative to field plot data, the use of Light Detection and Ranging (lidar) acquisitions for calibration and validation purposes in combination with such satellite reflectance data to model a range of forest structural response variables has become well established. In this research, we use a predictive modeling approach to map forest structural attributes over the ~552 million ha boreal forest of Canada. For model calibration and independent validation we utilize airborne lidar-derived measurements of forest vertical structure (known as lidar plots) obtained in 2010 via a > 25,000 km transect-based national survey. Models were developed linking the lidar plot structural variables to wall-to-wall 30-m spatial resolution surface reflectance composites derived from Landsat Thematic Mapper and Enhanced Thematic Mapper Plus imagery. Spectral indices extracted from the composites, disturbance information (years since disturbance and type), as well as geographic position and topographic variables (i.e., elevation, slope, radiation, etc.) were considered as predictor variables. A nearest neighbor imputation approach based on the Random Forest framework was used to predict a total of 10 forest structural attributes. The model was developed and validated on> 80,000 lidar plots, with R2 values ranging from 0.49 to 0.61 for key response variables such as canopy cover, stand height, basal area, stem volume, and aboveground biomass. Additionally, a predictor variable importance analysis confirmed that spectral indices, elevation, and geographic coordinates were key sources of information, ultimately offering an improved understanding of the driving variables for large-area forest structure modeling. This study demonstrates the integration of airborne lidar and Landsat-derived reflectance products to generate detailed and spatially extensive maps of forest structure. The methods are portable to map other attributes of interest (based upon calibration data) through access to Landsat or other appropriate optical remotely-sensed data sources, thereby offering unique opportunities for science, monitoring, and reporting programs.
Article
Full-text available
The Nordic countries have long traditions in forest inventory and using remote sensing (RS) in connection with forest inventories. In sample-based national forest inventories (NFI), utilization of aerial photographs started during 1960’s, satellite images during 1980’s, laser scanning during 2000’s, and photogrammetric point clouds during 2010’s. In forest management inventories (FMI), utilization of aerial photos started during 1940’s and laser scanning during 2000’s. However, so far remote sensing has mostly been used for map production and research rather than for estimation of regional parameters or inference on their accuracy. In recent years, the RS technology has been developing very fast. At the same time, the needs for information are constantly increasing. New technologies have created possibilities for cost efficient production of accurate, large area forest data sets, which also will change the way forest inventories are done in the future. In this study, we analyse the state-of-the-art both in the NFIs and FMIs in the Nordic countries. We identify the benefits and drawbacks of different RS materials and data acquisition approaches with different user perspectives. Based on the analysis, we identify needs for further development and emerging research questions. We also discuss alternatives for ownership of the data and cost-sharing between different actors in the field.
Article
Full-text available
Vertical forest structure can be mapped over large areas by combining samples of airborne laser scanning (ALS) data with wall-to-wall spatial data, such as Landsat imagery. Here, we use samples of ALS data and Landsat time-series metrics to produce estimates of top height, basal area, and net stem volume for two timber supply areas near Kamloops, British Columbia, Canada, using an imputation approach. Both single-year and time series metrics were calculated from annual, gap-free Landsat reflectance composites representing 1984-2014. Metrics included long-term means of vegetation indices, as well as measures of the variance and slope of the indices through time. Terrain metrics, generated from a 30 m digital elevation model, were also included as predictors. We found that imputation models improved with the inclusion of Landsat time series metrics when compared to single-year Landsat metrics (relative RMSE decreased from 22.8% to 16.5% for top height, from 32.1% to 23.3% for basal area, and from 45.6% to 34.1% for net stem volume). Landsat metrics that characterized 30-years of stand history resulted in more accurate models (for all three structural attributes) than Landsat metrics that characterized only the most recent 10 or 20 years of stand history. To test model transferability, we compared imputed attributes against ALS-based estimates in nearby forest blocks (> 150,000 ha) that were not included in model training or testing. Landsat-imputed attributes correlated strongly to ALS-based estimates in these blocks (R 2 = 0.62 and relative RMSE = 13.1% for top height, R 2 = 0.75 and relative RMSE = 17.8% for basal area, and R 2 = 0.67 and relative RMSE = 26.5% for net stem volume), indicating model transferability. These findings suggest that in areas containing spatially-limited ALS data acquisitions, imputation models, and Landsat time series and terrain metrics can be effectively used to produce wall-to-wall estimates of key inventory attributes , providing an opportunity to update estimates of forest attributes in areas where inventory information is either out of date or non-existent.
Article
Full-text available
Large-area assessment of aboveground tree biomass (AGB) to inform regional or national forest monitoring programs can be efficiently carried out by combining remotely sensed data and field sample measurements through a generic statistical model, in contrast to site-specific models. We integrated forest inventory plot data with spatial predictors from Landsat time-series imagery and LiDAR strip samples at four sites across the eastern USA-Minnesota (MN), Maine (ME), Pennsylvania-New Jersey (PANJ) and South Carolina (SC)-in statistical modeling frameworks to analyze the performance of generic (all sites combined) versus site-specific models. The major objective was to evaluate the prediction accuracy of generic and site-specific models when applied to particular sites. Pixel-level polynomial model fitting was applied to the time-series of near-anniversary date Landsat variables to obtain projected metrics in the target year 2014 for which LiDAR strip samples were available. Two forms of models based on ordinary least-squares multiple linear regressions (MLR) and the random forest (RF) machine learning approach were developed for each site and for the pooled (i.e., generic) reference data frame. The models were evaluated using national forest inventory (NFI) data for the USA. We observed stronger fit statistics with the MLR than with RF for both the site-specific and the generic models. The proportions of variances explained (adjusted R²) with the site-specific models were 0.86, 0.78, 0.82 and 0.92 for ME, MN, PANJ and SC, respectively while the generic model had adjusted R² = 0.85. A test of statistical equivalence of observed and predicted AGB for the NFI locations did not reveal equivalence with any of the models, possibly due to the different resolutions of the observed and predicted data. In contrast, predictions by the generic and site-specific models were equivalent. We conclude that a generic model provides accuracies comparable to the site-specific models for large-area AGB assessment across our study sites in the eastern USA.
Book
Airborne laser scanning (ALS) has emerged as one of the most promising remote sensing technologies to provide data for research and operational applications in a wide range of disciplines related to management of forest ecosystems. This book provides a comprehensive, state-of-the-art review of the research and application of ALS in a broad range of forest-related disciplines. However, this book is more than just a collection of individual contributions – it consists of a well-composed blend of chapters dealing with fundamental methodological issues and contributions reviewing and illustrating the use of ALS within various domains of application. The main aim of this book is to provide the scientific and technical background of ALS with a particular focus on applicability in operational forestry. Most of the chapters are devoted to applications in forest inventory and forest ecology such as forest management inventory and assessments of canopy cover, habitats and organism-habitat relationships. Many of the chapters focus on boreal forests simply because methods were initially developed for boreal conditions. However, examples show the most common applications of ALS at various geographical scales; from individual trees, to forest stands, regions and nations. The reviews provide a comprehensive and unique overview of recent research and applications that researchers, students and practitioners of forest remote sensing and forest ecosystem assessment should consider as a useful reference text.
Article
The spatial prediction of growing stock volume is one of the most frequent application of remote sensing for supporting the sustainable management of forest ecosystems. For such a purpose data from active or passive sensors are used as predictor variables in combination with measures taken in the field in sampling plots. The Sentinel-2 (S2) satellites are equipped with a Multi Spectral Instrument (MSI) capable of acquiring 13 bands in the visible and infrared domains with a spatial resolution varying between 10 and 60 m. The present study aimed at evaluating the performance of the S2-MSI imagery for estimating the growing stock volume of forest ecosystems. To do so we used 240 plots measured in two study areas in Italy. The imputation was carried out with eight k-Nearest Neighbours (k-NN) methods available in the open source YaImpute R package. In order to evaluate the S2-MSI performance we repeated the experimental protocol also with two other sets of images acquired by two well-known satellites equipped with multi spectral instruments: Landsat 8 OLI and RapidEye scanner. We found that S2 worked better than Landsat in 37.5% of the cases and in 62.5% of the cases better than RapidEye. In one study area the best performance was obtained with Landsat OLI (RMSD =6.84%) and in the other with S2 (RMSD =22.94%), both with the k-NN system based on a distance matrix calculated with the Random Forest algorithm. The results confirmed that S2 images are suitable for predicting growing stock volume obtaining good performances (average RMSD for both the test areas of less than 19%).
Article
Previous studies have utilized ground plots, airborne lidar scanning or profiling data, and space lidar profiling data to estimate biomass across large regions, but these studies have failed to take into account the variance components associated with multiple models because the proper variance equations were not available. Previous large-domain studies estimated the variances of their biomass density estimates as the sum of the GLAS sampling variability plus the model variability associated with the models that predict airborne lidar estimates of biomass density (Y) as a function of satellite lidar measurements (X). This approach ignores the additional variability associated with the predictive models used to estimate ground biomass density as a function of airborne lidar measurements. This paper addresses that shortcoming. Analytic variance expressions are provided that include sampling variability and model variability in situations where multiple models are employed to generate estimates of biomass. As an example, the forest biomass of the continental US is estimated, by forest stratum within state, using a space lidar system (ICESat/GLAS). An airborne laser system (ALS) is used as an intermediary to tie the GLAS measurements of forest height to a small subset of US Forest Service (USFS) ground plots by flying the ALS over the ground plots and, independently, over individual GLAS footprints. Two sets of models are employed to relate satellite measurements to the ground plots. The first set of equations relates USFS ground plot estimates of total aboveground dry biomass density (Y1) to spatially coincident ALS forest canopy measurements (X1). The second set of models predicts those ALS canopy height measurements (X1) used in the first set of models to GLAS waveform measurements (X2). The following important conclusions are noted. (1) The variability associated with estimation of the plot-ALS model coefficients is significant and should be included in the overall estimate of biomass density variance. In the continental US, the total variance of mean forest biomass density (98.06 t/ha) increases by a factor of 3.6 ×, i.e., from 1.91 to 6.94 t²/ha², when plot-ALS model variance is included in the calculation of total variance. (2) State-level results are more variable, but on average, the percent model variance at the state level, i.e., (model variance / total variance) ∗ 100, increases from 16% to 59% when plot-ALS model variance is included. (3) The overall model variance is driven in large part by the number of plots overflown by the ALS and the number of GLAS pulses overflown by the ALS. Given a choice of improving precision by either increasing the number of plot-ALS observations or increasing ALS-GLAS observations, there is no obvious benefit to selecting one over the other. However, typically the number of ground plots overflown is the limiting factor. (4) If heteroskedasticity is evident in either the ground-air or air-satellite models, it can modeled using weighted regression techniques and incorporated into these model variance formulas in straightforward fashion. The results are unambiguous; in a hybrid three-phase sampling framework, both the ground-air and air-satellite model variance components are significant and should be taken into account.