Fig 1 - uploaded by Ross F. Nelson
Content may be subject to copyright.
Source publication
Abstract: We report estimates of the amount, distribution, and uncertainty of aboveground biomass (AGB) of the different ecoregions and forest land cover classes within the North American boreal forest, analyze the factors driving the error estimates, and compare our
estimates with other reported values. A three-phase sampling strategy was used (i)...
Contexts in source publication
Context 1
... as forest. We conducted the ecoregion-level calculations using three different threshold values, i.e., 25%, 50%, and 75%. We selected the 75% threshold for estimating k NN-based mean AGB density because this threshold provided us with a large number of mostly pure forest pixels and resulted in the closest agreement with GLAS estimates of AGB density. For the GLAS analysis, ecoregion areas were based simply on the number of 25 m resolution (i.e., 0.0625 ha) EOSD forest classes falling within an ecoregion. The 1000 forest inventory plots that were sampled by PALS in this study were distributed across 11 of the 22 (50%) WWF boreal forest ecoregions of North America (Table 1; Figs. 1 and 2). Although the plots were largely concentrated in the managed com- mercial forests of Canada and Alaska, there were also plots in the boreal taiga of northern Quebec and in some fairly remote areas of Alaska (Fig. 2). The plots spanned a large range of AGB densities (range = 0 to 361 Mg·ha −1 , mean = 76.3 Mg·ha −1 , SD = 58.8; n = 1000) and fell into all four of the major boreal land cover classes (32% conifer, 25% hardwood, 38% mixedwood, and 6% treed wetland). Because this study used a lidar-assisted, model-based sampling approach, we make the assumption that these 1000 plots are representative of our area of interest and therefore meet an important statistical criterion needed to produce an accurate forest inventory. In Alaska, the most robust regressions for relating PALS metrics to the AGB density of the ground plots were found when we developed equations for specific ecoregions across all forest cover types, with R 2 values ranging from 0.58 to 0.80 (Table 2). In all cases, more than one height metric was selected by the best possible subset regression procedure, typically one variable related to the height of the upper canopy and one or more variables related to the height distribution of laser returns from the lower canopy (Table 2). Throughout the analyses, variance inflation factors (VIF) were monitored and variable selection adjustments were made to keep VIFs well below 10. In Canada, the most robust ground plot – PALS regressions were found when we developed generic equations for each forest land cover class with all ecoregions pooled, yielding R 2 values ranging from 0.50 to 0.84 (Table 3 a ). For the conifer and treed wetland equations, measurements of both upper and lower canopy heights were selected as independent variables, whereas for the hardwood and mixedwood forests, mid- to upper-canopy height measurements paired with canopy density metrics were selected. We were also able to develop robust equations for the major regions of Canada (eastern versus western) and for specific land cover types within some regions and ecoregions (Table 3 b ). These equations had a similar range of R 2 values as the generic equations but slightly lower root mean squared error (RMSE). Therefore, when available, we applied the more specific equations to gener- ate PALS biomass values along the PALS–GLAS flight lines; otherwise, we applied the generic equations for each land cover type. Our decision to use ecoregion-specific equations across all cover types in Alaska and generic equations for each land cover class across all ecoregions in Canada allowed us to use the equations that yielded the highest R 2 and (or) lowest RMSE values in the different regions and thus should have increased the precision of our estimates. We compared the estimates from these two different stratification approaches across the three main geographic regions and found that the impacts on mean biomass density were on the order of 3% to 5% with no consistent trends, i.e., stratified equations yielded both higher and lower mean AGB density than the nonstratified equations depending on the region. As well, standard errors were similar. The GLAS 3c and 3f lines that we flew with PALS spanned a large portion of the North American boreal forest in the western and central portions of the biome but did not include much sampling in the eastern portion of eastern Canada (Fig. 2). See the Materials and methods section for reasons why GLAS 2a data from Quebec were not included. For Alaska, the R 2 values for the regressions relating GLAS metrics to PALS-calculated AGB for each of the five cover types across all ecoregions were very similar and ranged from 0.53 to 0.57 (Table 2). RMSE values were lower than for the ground plot – PALS equations. The conifer and hardwood equations both included a GLAS-derived measurement of upper and lower canopy heights, as well as the GLAS acquisition as a dummy variable (Table 2). For the treed wetlands and burned areas, the GLAS-derived fslope measurement that is related to canopy closure was selected as an independent variable in addition to the height metrics. The fslope variable describes the angle formed between a vertical base line and the leading edge of the waveform; larger angles connote a denser upper canopy layer (see fig. 1 in Boudreau et al. 2008). The mixedwood and burned area equations had higher co-linearity but were still within acceptable ranges (Table 2). We had no coincident measurements of PALS and GLAS for burned areas in Alaska, but we did have 179 data points where PALS flew over GLAS pulses classified as shrubland or grassland (Table 2). Consequently, we applied these shrub–grassland equations to the burned areas in Alaska and used the resulting equation for scaling GLAS to the region for the burned area land cover type (see next section). This is not the ideal method for estimating burned area biomass as GLAS has low sensitivity for detecting vegetation heights less than 5 m (Nelson 2010); however, it does at least provide a quantitative estimate for comparison with future estimates derived using other methods. For Canada, we developed another set of generic regressions between GLAS metrics and PALS-derived AGB for each forest cover type by region, i.e., western versus eastern Canada (Table 4). However, we did not have a sufficient number of coincident PALS– GLAS observations in burned areas in eastern Canada, so we used the 31 observations available in western Canada to develop an equation that could be applied to burned areas across the country. RMSE values were generally similar to those for the Alaska equations for the three most common forest cover types (conifer, hardwood, mixedwood), ranging from 21.3 to 27.6 Mg·ha −1 (Tables 2 and 4). Both R 2 values and sample sizes were higher for the Canadian equations than for Alaska equations, with the exception of treed wetlands in western Canada (Tables 2 and 4). The highest R 2 (0.79) was obtained for hardwood forests in eastern Canada. The GLAS-14 standard height product was selected as an independent variable for six of the nine equations. As found for Alaska, GLAS- derived measurements of both upper and lower canopy heights were included in the equations. The fslope variable describing the density of the upper canopy was selected as an independent variable for treed wetlands in eastern Canada, as it was in Alaska. Co-linearity between independent variables was again at acceptable levels for all PALS–GLAS equations in Canada. We present our AGB estimates for each of the three main boreal regions, i.e., Alaska, western Canada, and eastern Canada, and then report our totals for all of North America. Note that we report average biomass densities for the forest areas of the different ecoregions. On the other hand, Neigh et al. (2013) reported biomass densities averaged across forested and nonforested areas of an ecoregion assuming zero biomass for nonforested areas. An individual GLAS orbit is the sampling unit used for estimating AGB at the ecoregion level. In Alaska, there were 57 GLAS 3c and 3f orbits available, which included 22 682 GLAS pulses (Table 5). The number of GLAS orbits available per ecoregion varied from 7 to 55, and the number of GLAS pulses per ecoregion ranged from 29 to 17 025 (Table 5). The Interior Alaska–Yukon Lowland Taiga ecoregion was intercepted by the largest number of orbits (i.e., 55), due in large measure to the very large east–west extent of the ecoregion. The three ecoregions for Alaska with the lowest sampling intensity had only 7 to 8 GLAS orbits each. These three ecoregions also tended to have only small areas occupied by forest. The Alaska–St. Elias Range Tundra ecoregion also supports limited forested areas, but because it is oriented largely in an east–west direction, it was sampled more intensively (29 orbits). Total AGB for Alaska forests was estimated at 2110 ± 45 Mt (Table 5). The forests of the Interior Alaska–Yukon Lowland Taiga ecoregion contained nearly two-thirds (1386 ± 28 Mt) of the AGB. However, this was due to its large surface area and not to a particularly high AGB density. The forests within the Interior Yukon– Alaska Alpine Tundra ecoregion had the second greatest amount of AGB (15.2%; 320.5 ± 7.4 Mt). These two ecoregions, which accounted for >80% of the total AGB in Alaska, had similar AGB densities (56.3 and 56.9 Mg·ha −1 ), and these densities were very close to the average for all ecoregions (57.0 ± 1.2 Mg·ha −1 ). Model error was a large component of the overall error for all ecoregions (Table 5). The two ecoregions with the greatest number of GLAS orbits also had the highest percent model error (54.4% and 51.0%). The general empirical relationship between the number of orbits, percent model error, and percent sampling error is addressed later. Conifers were the most intensively sampled stratum (12 547 GLAS pulses), had the largest forest area, and were estimated to contain 54.9% (1159 ± 27.4 Mt) of Alaska's boreal forest AGB. The high sampling intensity for conifers was associated with low sampling error and higher percent model error (80.8%; Table 6). Hardwoods and mixedwoods accounted for similar percentages of the AGB in Alaska (20.5% and 16.4%, respectively), whereas treed wetlands and ...
Context 2
... measured geolocated ground plots and were also collected along all of the GLAS 3c and 3f orbital ground tracks used in this study (Fig. 2). Following quality filtering of the GLAS pulses for implausible (e.g., >50 m) height metrics, we had 10 421 coincident PALS–GLAS observations that we used to relate PALS-based estimates of biomass to GLAS measurements. Once these AGB models were in place, we could use GLAS as our regional sampling tool to inventory the North American boreal forest. The Canadian Forest Service (CFS) worked with various provincial and territorial natural resource management agencies to provide us access to geolocated ground plots within Canada's boreal ecoregions. The inventory plot measurements used in this study were collected in the Northwest Territories (2006–2008), Saskatchewan (2004–2006), Ontario (2006–2007), and Quebec (2001–2004) (Table 1). Consequently, these plot data originated from a number of different jurisdictions (Table 1). The CFS has developed species-specific, national-level equations (Lambert et al. 2005) that we used in this study to convert ground plot measurements of tree height and diameter to AGB. In Alaska, ground plot information was provided by the Forest Inventory and Assessment (FIA) Program of the USDA Forest Service (Woudenberg et al. 2010). We successfully sampled precisely 1000 geolocated forest inventory sample plots across eastern Canada, western Canada, and Alaska (Table 1) of which 303 were located in Quebec, 196 in Ontario, 80 in Saskatchewan, 57 in the Northwest Territories, and 364 in Alaska (Table 1). Land cover classifications derived from Landsat data were used for stratification and reporting of AGB by forest stratum. In Canada, we used the 25 m resolution, 23-class EOSD (Earth Observation for Sustainable Development) land cover map of Canada (Wulder et al. 2003, 2013 b ). For Alaska, we used the NLCD-2001 version 1.0 land cover product developed for Alaska at 30 m spatial resolution ( This NLCD product uses 20 land cover classes, but only four of these are forest classes (deciduous forest, conifer forest, mixed forest, treed wetland). We condensed and harmonized the two land cover classifications into the four NLCD forest classes (trees > 5 m) to obtain a uniform land cover stratification for all of boreal North America. More specifically, dense, open, and sparse covers in EOSD were harmonized with NLCD covers, respectively, for conifers, hardwoods, mixedwoods, and treed – woody wetland forest covers. Recently burned areas were overlaid onto the land cover map to provide a unique stratification for this category. Fire polygons from the Canadian Forest Service and the Alaskan Department of Natural Resources were used to define the areas burned between 2001 and 2006, the final year of GLAS acquisitions. Information from the MODIS MCD45 burned area product (Roy et al. 2006) was also used to supplement the fire polygon information when required. Fire is a very common disturbance in the boreal forests of North America, with historical annual burn rates varying from 0.2% to 2.0% across different ecoregions (Bergeron et al. 2004). The burned area biomass values reported in the current article refer only to a subset of the total burned area, i.e., surviving or regen- erating forests on areas burned between 2000 and 2006. Furthermore, harvested areas are only included in our analysis if they fell into one of our EOSD or NLCD forest cover classes, e.g., conifer, mixedwood, hardwood, or treed – woody wetland. For topographic information at the scale of individual GLAS pulses, we used version 1 of the ASTER Global Digital Elevation Model (GDEM) ( At a spatial resolution of 30 m, the ASTER GDEM has a finer resolution and more northern coverage (83°N) than the Shuttle Radar Topography Mission (SRTM) topographic product that was used for the previous PALS–GLAS analysis in Quebec (60°N, 90 m) (Boudreau et al 2008). We mosaicked the ASTER tiles for each of our three regions (Alaska, western Canada, eastern Canada), masking out data less than 3 m and more than 6195 m in elevation, the elevation of the highest mountain peak in North America. We then calculated 3 × 3 pixel slopes using the topographic modeling feature in the ENVI 4.1 image process software. For obtaining a general quantification of slopes for different ecoregions to char- acterize the overall ecoregion topography (e.g., mountainous versus flat, average slope, percentage of forested area per ecoregion above 20° slope), we used a SRTM DEM version at 250 m resolution available from Beaudoin et al. (2014). To provide a pan-biome ecosystem-oriented context, we used the WWF map of global terrestrial ecoregions (Olson et al. 2001), which we rasterized to a 1 km resolution. We used this classi- fication to stratify the North American boreal forest into bio- geographic reporting units for our AGB estimates. Of the 867 ecoregions identified globally, 22 were located in the North American boreal forest (Fig. 1). We used a model-based procedure for estimating AGB density from the ground-plot, PALS, and GLAS data (Nelson et al. 2012; Ståhl et al. 2011). In this procedure, we do not rely on a probability- based forest inventory, rather we use the ground plots that were available and that were as representative as possible of our area of interest, i.e., the plots spanned a wide east to west swath across the North American boreal forest and included plots from both southern and northern portions of the biome (Table 1). In the current case, we applied a three-phase sampling design that linked the three sampling phases (ground plots, airborne profiling lidar, and satellite lidar) via two sets of equations, i.e., a two- phase estimator. The first set of equations estimates ground plot biomass density as a function of the PALS lidar metrics. The second set of predictive equations links the PALS-derived estimates of biomass density, calculated from the first set of equations, to the GLAS metrics. We then used all of the 311 981 quality-filtered GLAS 3c and 3f pulses available in the entire study region (Fig. 3) to obtain AGB estimates for only the forest strata, i.e., conifers, hardwoods, mixedwood, and treed wetlands, where the strata types were determined by the land cover maps. Of the 311 981 quality- filtered GLAS shots, 73.4% of the pulses (229 096 pulses) were identified as measuring one of the four forest strata. Strata identified as nonforest strata, e.g., shrubs, grassland, barren, and urban, were consequently assumed to have zero AGB regardless of the GLAS measurement. Finally, these strata were condensed into the three major regions, i.e., Alaska, western Canada, and eastern Canada. The estimates of the AGB for a given stratum (land cover, ecoregion, region) from all available GLAS pulses were obtained using eqs. 1 through 4 presented in table 4 of Neigh et al. (2013). Here, we provide a brief summary of the approach. (1) An estimated mean stratum AGB density (in Mg·ha −1 ) for a single ascending or descending ICESat orbit was calculated by averaging the GLAS-based biomass estimates for all pulses intercepting that stratum along an orbit. If a particular stratum was not intercepted along a given orbit, then the stratum mean was taken to be zero. (2) The mean AGB density for a given stratum can then be multiplied by a weighting factor proportional to the area of that stratum within a region or an ecoregion or using eq. 2 from Neigh et al. (2013). (3) The variance of this mean AGB estimate across all strata is provided by eq. 3 in Neigh et al. (2013), and the variance of the individual stratum for AGB density is given in eq. 4 of Neigh et al. (2013). These variance calculations include both sampling error and model error, with the latter term quantifing the variance of the coefficient estimates of the predictive model, i.e., how much the predictive model would change with repeated samples. In the current context, there is no assumption of randomness in the selection of ground plots. However, there are three other basic assumptions for this model approach. (1) The coincident PALS– GLAS observations are characteristic of the area of interest (AOI, e.g., stratum, ecoregion, or region) and they represent the full range of conditions within the AOI. We note that our decision to exclude GLAS data on >20° slopes violates this assumption to some extent, but we explain why this should not be a major con- straint in the Discussion. (2) There is an assumption that the GLAS orbits are randomly acquired. (3) The models to predict biomass for different strata are developed independently. We compiled data from two other sources of North American boreal forest AGB to compare with the estimates obtained in our GLAS study. First, we calculated mean AGB density from Canada's National Forest Inventory (NFI) database across the Canadian terrestrial ecozones (Ecological Stratification Working Group 1996) used by NFI for reporting purposes (https://nfi.nfis.org/standardreports. php?lang=en). These calculations use NFI total AGB estimates by forest type, age class, and terrestrial ecozone. The wood volume estimates from the NFI photo plots were then converted by NFI into AGB density using allometric equations and expansion factors (Boudewyn et al. 2007; Stinson et al. 2011). Unfortunately, standard errors were not tractable at the required forest strata level, so we only had access to mean values. We weighted the respective areas of the WWF ecoregions that fell into each Canadian NFI ecozone. Next, we applied this weighting to the GLAS estimates for each WWF ecoregion to obtain an equiv- alent GLAS estimate of AGB density for a given Canadian NFI ecozone. In three of the seven cases, the two systems aligned very closely; for the other four cases, there were significant spatial differences between the two systems. In the second approach, we used 2001 AGB density maps developed at 250 m resolution (i.e., 6.25 ha) using the k ...
Context 3
... and carbon stocks in the boreal forest of North America involves combining forest inventory ground plots with growth and yield curves for different land cover types and disturbance frequencies (Stinson et al. 2011). Laser-ranging airborne lidar has the potential to provide supplemental information on aboveground biomass (AGB) density of forests and has been used in- creasingly as a sampling tool at local to regional scales (Wulder et al. 2012 a , 2012 b ). This airborne technology is particularly useful in extending AGB measurements into areas where few, if any, ground plots exist, e.g., unmanaged forest regions. Airborne lidars provide direct measurements of distances between the aircraft and various ground targets. Data processing algorithms calculate both the distance to the ground and the height above the ground of different components of the forest. From these measurements, a number of lidar metrics related to forest height and canopy structure can be derived, which can then be converted into AGB (Nelson et al. 1988). Both profiling and scanning lidars have been used as sampling tools (e.g., Nelson et al. 2012; Gobakken et al. 2012), and various statistical approaches have been developed to estimate both AGB and carbon stocks, as well as their statistical uncertainties (Gregoire et al. 2011; Ståhl et al. 2011). For example, in lidar-assisted model-based sampling, ground plots representative of the entire area of interest are selected. Models are constructed that predict ground- measured biomass as a function of lidar height and canopy density metrics, and then these models are applied to the entire area (Nelson et al. 2012). The error calculations account for the nonrandom, spatially dependent transect data obtained by the lidar (Ene et al. 2013), which is accomplished partially by using the transect as the sampling unit. Reliable measurements of forest AGB from a spaceborne lidar would be useful because such an instrument could provide repeated global-scale sampling of variables related to forest height from which regional and global biomass could be derived. The first opportunity to explore the utility of a global sample of satellite-based lidar measurements occurred in 2003 when ICESat-1 (Ice Cloud and land Elevation SATellite) was launched with the Geosciences Lidar Altimetry System (GLAS) aboard. Using the GLAS waveform lidar, the ICESat mission's main objective was to measure changes in the mass balance of the polar ice sheets, and it collected ice elevation data from 2003 to 2009. However, GLAS also offered the scientific community the possibility of estimating forest height and AGB, although the sensor was not ex- plicitly designed for this objective. Global tree height maps have been developed (Simard et al. 2011) and Bolton et al. (2013) compared this information with tree height data in Canada and found that discarding GLAS waveforms from steep terrain reduced errors in height estimates. Boudreau et al. (2008) and Nelson et al. (2009 a ) applied a sampling approach for 1.3 million km 2 of forest in the province of Quebec that involved three different sampling phases, i.e., ground, airborne, and satellite. The basic approach involved ( i ) building an initial statistical model to link PALS (Portable Airborne Laser System) height measurements to ground plot biomass, ( ii ) building a second model to relate the estimated biomass from the airborne lidar to the height metrics obtained by GLAS for the 1325 GLAS pulses that were flown by the aircraft, and ( iii ) using the GLAS height metrics, slope, and land cover for the ϳ 104 000 quality-filtered GLAS pulses available across the province of Quebec to calculate the AGB and carbon stocks for the province by land cover type. In 2008 and 2009, we extended the PALS lidar measurements of ground plots and GLAS ground tracks to the rest of Canada and Alaska. Neigh et al. (2013) used these data in an analysis of the aboveground carbon stocks of the entire circumpolar boreal forest by combining the North American data with similar data in Scandinavia (Næsset et al. 2011) and only ground plot and GLAS data in Siberia (Nelson et al. 2009 b ). They were thus able to compare aboveground carbon stocks between the different regions of the circumpolar boreal. The goal of the current study is to report for the first time on the detailed results for the three-phase sampling of the 3.7 million km 2 of the North American boreal forest. We extend the work of Neigh et al. (2013) by reporting detailed estimates of the amount, distribution, and uncertainty of the AGB of the different ecoregions and land cover classes within the boreal forest of North America, analyze some of the factors driving the error estimates, compare our AGB estimates with other available estimates in Canada, and offer our perspectives on the future of lidar sample-based approaches for forest biomass estimation. The study area encompasses the boreal forest biome within Canada and Alaska (Fig. 1) and ranges from a minimum latitude of 44.4°N to a maximum of 69.0°N and from longitudes 52.6°W to 165.0°W. Various definitions and maps of the North American boreal forest have appeared in the literature (e.g., Brandt 2009; Olson et al. 2001). The boundaries of the boreal forest used in the current study were based on those proposed by Brandt (2009, see his fig. 22) for Canada. The Brandt (2009) boundaries were estab- lished by digitizing and then harmonizing several existing maps. The northern extent of the boreal forest was defined as the northern tree limit, i.e., the taiga–tundra boundary. Consistent with the traditional approach used in North America for defining the boreal forest, we do not include eastern hardwood forests (e.g., areas in which cold-intolerant hardwoods begin to intermingle with cold-tolerant tree species). On the other hand, we decided to include the Canadian Aspen Forests and Parklands ecoregion in our boreal forest area as it is such an important transition zone that is highly vulnerable to climate change (Michaelian et al. 2011) and is generally considered to be in the boreal zone. For Alaska, we applied additional adjustments to the Brandt (2009) map to retain consistency with the World Wildlife Fund (WWF) ecoregions used as the spatial reporting units (see below). We truncated our boreal biome at the northern US – southern Canadian border, which allowed us to more readily report biomass for the boreal forest in Canada. Areal extents of the boreal forests of Canada and Alaska for our study were 3 326 658 km 2 (90%) and 370 074 km 2 (10%), respectively. Launched on 12 January 2003, the ICESat satellite carried three lasers packaged within the Geoscience Laser Altimetry System (GLAS). GLAS was the first spaceborne lidar instrument developed for continuous global observations of the Earth. The three lasers were deployed sequentially over the life of the satellite (January 2003 to October 2009). For the current study, we used GLAS data acquisitions 3c and 3f, both of which relied on the last of the three lasers. These collections were used because ( i ) they were the most recent acquisitions available when laser power was still high enough for forest applications and ( ii ) these acquisitions were collected during the growing season in June 2005 and 2006, respectively, and therefore avoided possible contamination of the data from either snow cover or leaf-off conditions. Previous forest inventory studies that used GLAS as a regional sampling tool in Quebec (Nelson et al. 2009 a ; Boudreau et al. 2008) and Siberia (Nelson et al. 2009 b ) used acquisition 2a. This acquisition, collected between 24 September and 17 November 2003, was used in these previous studies because, at the time that the results were compiled, it was the only acquisition available that was acquired under close to leaf-on conditions. In the current study, the more temporally suitable acquisitions 3c and 3f, collected from 8 to 13 June 2005 and from 8 to 26 June 2006, respectively, were used. However, the differences in the GLAS acquisitions used, combined with the differences in the spatial reporting units, prevent us from making useful quantitative comparisons of AGB between the two studies. In the current study, we only used GLAS pulses that were obtained from footprints that had slopes of <20 degrees as determined by the ASTER DEM (Advanced Spaceborne Thermal Emission and Reflection Radiometer Digital Elevation Model). This was based partially on our analyses of slope effects on bio- mass regressions from our earlier study (Boudreau et al. 2008). Below, we provide more information on the ASTER DEM. GLAS collected altimetry data using a 1064 nm laser sending pulses at 40 Hz with a ground footprint that varied in size and shape with laser power, although laser power decreased markedly over time. During acquisitions 3c and 3f, the size of the elliptical footprint of the GLAS pulse was nominally ϳ 60 m × ϳ 40 m, and the distance between pulses was 172 m (Neigh et al. 2013). The GLAS sensors collected waveform data on the intensity of the energy returned from a laser pulse, thus providing information on the vertical distribution of canopy structure (Boudreau et al. 2008). In this study, we made use of both a standard GLAS data product (GLA14) provided by the National Snow and Ice Data Center (NSIDC) and extracted our own structural metrics from the raw waveforms (GLA01) data product. GLA14 is the standard laser altimetry product that uses information from the waveforms recorded over vegetated land by fitting up to six Gaussian distributions to the waveform to describe different features of the vertical structure of the vegetation. The raw GLA01 waveforms were used to calculate additional forest canopy metrics related to forest height and canopy density such as median, mean, and quadratic-mean canopy height, height of different levels of energy return, descriptors of the waveform such as total waveform ...
Citations
... The RMSE, AIC, and Mallow's Cp metrics are model selection techniques often used in regression models for estimating forest attributes with lidar data [36][37][38]. Multicollinearity was also addressed using variance inflation factors (VIFs), where values over 10 indicated presence of multicollinearity [39,40]. ...
NASA’s Ice, Cloud, and land Elevation Satellite-2 (ICESat-2) provides exceptional opportunities for characterizing the structure of ecosystems through the acquisition of along-track, three-dimensional observations. Focusing on canopy cover as a fundamental parameter for assessing forest conditions, the overall goal of this study was to establish a framework for generating a gridded 30 m canopy cover product with ICESat-2. Specifically, our objectives were to (1) Determine and compare ICESat-2-derived canopy cover with airborne lidar-derived and the 2016 National Land Cover Database (NLCD) cover product estimates, and (2) Evaluate a methodology for wall-to-wall mapping of canopy cover. Using two Southern US sites, the Sam Houston National Forest (SHNF) in south-east Texas and the Solon Dixon Forestry Education Center (SDFEC) in southern Alabama, four measures of canopy cover estimated with ICESat-2′s Land-Vegetation Along-Track Product, or ATL08, were evaluated at the 30 m pixel scale. Comparisons were made using spatially coinciding NLCD pixels and airborne lidar-derived reference canopy cover. A suite of Landsat and Landsat-derived parameters were then used as predictors to model and map each measure of canopy cover with Random Forests (RF), and their accuracies were assessed and compared. Correlations (r) between ICESat-2-derived and airborne lidar canopy cover at the pixel scale ranged from 0.57 to 0.78, and R2 up to 0.81 was produced between NLCD and ICESat-2-derived canopy cover. RF models developed for extrapolating ICESat-2-derived canopy cover estimate yielded R2 values between 0.50 and 0.61 (RMSEs between 16% and 20%) when evaluated with airborne lidar-derived canopy cover. With a demonstrated capability of ICESat-2 to estimate vegetation biophysical parameters, the findings serve to support the spatially comprehensive mapping of other vegetation attributes, especially forest aboveground biomass, and contribute to the development of an up-to-date gridded canopy cover product.
... Their SE estimates varied from 2% to 4%, although SE estimates were not reported for specific forest species. It is well known that uncertainty in the estimation of different vegetation strata mapsaffects the determination of the volume stocks, but in practice its effect has been mostly ignored (Margolis et al. 2015;Neigh et al. 2013;Yanjun et al. 2016). Previous studies assessing this source of error have shown very significant accuracy losses for forest stock variables at different scales (Xue et al. 2017;Esteban etal. ...
Remotely sensed data are increasingly used together with National Forest Inventory (NFI) data to improve the spatial precision of forest variable estimates. In this study, we combined data from the 4th Spanish National Forest Inventory (SNFI-4) and from the 2nd nationwide Airborne Laser Scanning (ALS) survey to develop predictive forest inventory variables (total over bark volume (V), basal area (G), and annual increase in total volume (IAVC)) and aboveground biomass (AGB) models for the eight major forest strata in the region of Extremadura that are included in the Spanish Forest Map (SFM). We generated maps at 25 m resolution by applying an area‐based approach (ABA) and 758 sample plots measured with good positional accuracy within the SNFI-4 in Extremadura (Spain). Inventory performance is mainly influenced by spatial scale and vegetation structure. Therefore, in this study, we conducted a comparative analysis of statistical inference methods that can characterize forest inventory variables and AGB uncertainty across multiple spatial scales and types of vegetation structure. Predictions at pixel level were used to produce county, provincial, and regional model-based estimates, which were then compared with design-based estimates at different scales for different types of forest. We developed and tested both methods for forested area (cover, 19,744.15 km²), one province (9126.78 km²), and two counties (1594.42 km² and 2076.76 km², respectively) in Extremadura. The resulting relative standard error (SE) for regional level forest type-specific model-based estimates of V, G, IAVC, and AGB ranged from 3.34%–14.46%, 3.22%–12.50%, 4.46%–16.67%, and 3.63%–12.58%, respectively. The performance of the model-based approach, as assessed by the relative SE, was similar to that of the design-based approach at regional and provincial levels. However, the precision of SNFI model-based estimates was higher than that of estimates based on only the plot observations in small areas (e.g. at county level). The standard errors (SE) for model-based inferences were stable across the different scales, while SNFI design-based errors were higher due to the small sample sizes available for small areas. The findings indicate that SNFI-model based maps could be used directly to estimate forest inventory variables and AGB in the major forest strata included in the Spanish Forest Map, leading to potentially large economic savings. © 2022 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
... The selection of studies included three main criteria: (a) free availability of the AGB map in raster format including uncertainty estimation, (b) continental or either global coverage, and (c) up to four years of temporal delay from the year 2000. The initial search identified 1128 published studies, seven of which fulfilled the stipulated requisites (Avitabile et al., 2016;Baccini et al., 2017;Blackard et al., 2008;Hu et al., 2016;Margolis et al., 2015;Neigh et al., 2013;Ruesch & Holly, 2008, see Figure S1 in Supporting Information S2 and Table S1 in Supporting Information S3). The projection system of the different studies was homogenized to World Geodetic System 1984 World Mercator projection using a Geographic Information System (GIS). ...
Here we present a global time-series of global forest above ground biomass from 2000 to 2019,analyzing spatiotemporal patterns of carbon balance, accounting for losses and gains. We generated a globalAbove-Ground Biomass (AGB) map for the year 2000 and assessed its correlation with different satelliteproducts. We generated a multi-year analysis of AGB changes at the pixel level was generated, estimatingcarbon (C) loss and gain. Finally, we estimated the C losses due to forest clearing and wildfires analyzingtheir trends across biomes and countries. Our results show that the global mean annual loss was 2.88 ± 0.33PgC yr−1, while global mean C gain was 2.95 ± 0.43 PgC yr−1, resulting in a neutral to sink behavior of−0.06 ± 0.58 PgC yr−1. The mean annual C loss by forest clearing was 1.04 ± 0.03 PgC yr−1, with anincreasing trend of +0.03 ± 0.01 PgC yr−1. Eight biomes and 54 countries showed a significant increasing trendof C loss by forest clearing. Wildfires C losses reached 0.351 ± 0.02 Pg C yr−1, representing the 33.71% offorest clearing C losses. Boreal forest presented the highest C losses from wildfires, while significant increasingtrends were evidenced in five biomes. We also find increasing trends of wildfire C loss in 20 countries whiledecreasing trends were identified in 10 countries. Our findings highlight the importance of designing strongpolicies to halt deforestation as agreed in the recent COP26 and provide information to identify priority areas tostart implementing these policies in the short term.
... Stepwise regression was used to fit possible models and Corrected (AIC c ) and Mallow's C p criteria were applied to select the best linear regression model. Variance inflation factors (VIFs) were examined to address multicollinearity and variables with VIFs greater than or equal to 10 were removed (Margolis et al., 2015;Narine et al., 2019a;Nelson et al., 2017). Models were evaluated with test data using reported R 2 and RMSE values. ...
The acquisition of elevation measurements from NASA's Ice, Cloud, and land Elevation Satellite-2 (ICESat-2) and availability of airborne lidar data over the US present an exceptional opportunity to understand vegetation structural estimates provided by ICESat-2. Although a critical forest biophysical attribute, canopy cover is not yet reported from ICESat-2. Thus, there is a need to better understand the application of ICESat-2 for providing canopy cover information. The overall goal of this study was to investigate methods to derive canopy cover and characterize the predictive capability, with ICESat-2. Given availability of dedicated vegetation products (ATL08) and custom noise filtering algorithms developed for ICESat-2 photon data, multiple datasets were evaluated in this study. With focus on two study sites, the Sam Houston National Forest (SHNF) in south-east Texas and the Solon Dixon Forestry Education Center (SDFEC) in southern Alabama, specific objectives were to: (1) Evaluate equations for estimating canopy cover using custom-processed geolocated photon data, ATL03, and ICESat-2's vegetation product data, ATL08, (2) Compare ICESat-2-derived estimates of canopy cover with airborne lidar canopy cover, and (3) Evaluate a modeling-based approach to improve predictions of canopy cover by leveraging available canopy metrics from ATL08 and custom-processed datasets. A total of six (6) potential measures of canopy cover were investigated in this study, based on 100-m ATL08 segments: (i) percentage of photons above 2 m, (ii) percentage of photons above 4.6 m, (iii) percent of canopy (inside canopy) and top-of-canopy photons of total canopy and ground photons, (iv) percent of canopy of total ground and canopy photons, (v) percent of top-of-canopy of total top-of-canopy and ground photons, and (vi) mean of (iii), (iv) and (v). While strongest correlations were produced from canopy cover computed with (ii) (r between 0.72 and 0.84), similar relationships were indicated from using (iii) and (v) (r values between 0.70 and 0.79). Considering a suite of available canopy parameters, the best R2 and RMSE values from canopy cover estimation models were produced from the use of custom-processed ICESat-2-derived metrics (R2 and RMSE values of 0.77 and 13%). Findings from this study highlight methods for estimating canopy cover with ICESat-2 that may be suitable to a range of cover types, in addition to vegetation settings where cut-offs based on tree heights (e.g., 2 m and 4.6 m) are not applicable. With multiple approaches to characterizing canopy cover, results serve to better understand the applicability of ICESat-2 as a data source for canopy cover information and ultimately inform ongoing efforts for deriving an updated gridded product.
... Efforts to use GLAS have fallen into three general categories, each limited in a specific way. Some efforts have knowingly treated GLAS overpasses as if they were randomly allocated, allowing use of analytically derived hybrid methods of variance estimation but potentially underestimating variance due to the discrepancy between the hypothetical and actual sample designs [17,18]. One study alternatively subset available GLAS data to what could be presumed to be a spatially balanced random sample, but suffered a substantial drop in statistical precision because of the large quantity of data that was eliminated [19]. ...
... GEDI's 25 m footprint biomass predictions are used with hybrid modelbased estimators [25] to infer biomass within each 1 km grid cell across the mission's range of observation. The parametric models mentioned above are used to predict biomass for all footprints within a given grid cell, and the hybrid estimates of variance of the mean account for both modeling uncertainty and uncertainty related to how the cell is sampled by GEDI's observations [18,26]. Furthermore, hybrid inference directly enables estimates at any aggregation scale coarser than 1 km (e.g. a country) without resorting to the ad hoc and approximate methods used in other remote sensing biomass products. ...
... Note that although many of the reported SEs at the country level are small, for example 1.1% for the United States, these are in line with those reported by other studies [17,18] that used methods related to our own. However, our approach, as with these other studies and almost all national forest inventories, does not consider model uncertainty from the allometric tree-level biomass models. ...
Accurate estimation of aboveground forest biomass stocks is required to assess the impacts of land use changes such as deforestation and subsequent regrowth on concentrations of atmospheric CO2. The Global Ecosystem Dynamics Investigation (GEDI) is a lidar mission launched by NASA to the International Space Station in 2018. GEDI was specifically designed to retrieve vegetation structure within a novel, theoretical sampling design that explicitly quantifies biomass and its uncertainty across a variety of spatial scales. In this paper we provide the estimates of pan-tropical and temperate biomass derived from two years of GEDI observations. We present estimates of mean biomass densities at 1 km resolution, as well as estimates aggregated to the national level for every country GEDI observes, and at the sub-national level for the United States. For all estimates we provide the standard error of the mean biomass. These data serve as a baseline for current biomass stocks and their future changes, and the mission’s integrated use of formal statistical inference points the way towards the possibility of a new generation of powerful monitoring tools from space.
... Este hecho ha impulsado la aplicación y el desarrollo de nuevos métodos de inferencia que ajustan modelos predictivos integrando información auxiliar proveniente de sensores remotos . La aplicación de modelos para la estimación de atributos forestales en grandes superficies es una práctica muy extendida en la comunidad científica (Chen et al., 2016;Chirici et al., 2020;Margolis et al., 2015;Saarela et al., 2015). Sin embargo, hay que tener en cuenta que la eficacia de los métodos de inferencia basados en modelos depende de la bondad de ajuste del modelo aplicado, por lo que los estimadores poblacionales pueden ser sesgados e imprecisos (McRoberts, 2010). ...
... El uso de técnicas bootstrapping ha permitido enriquecer todavía más la cuantificación de recursos forestales propagando distintas fuentes de incertidumbre (Andersen et al., 2012;Condés and McRoberts, 2017;Fortin et al., 2018;Hou et al., 2017;Sexton and Laake, 2009). A pesar de que numerosos autores han demostrado que la clasificación de imágenes de satélite multiespectral permite generar cartografía temática añadiendo valor a los inventarios (Blázquez-Casado et al., 2019;Dalponte et al., 2012;Fernández-Landa et al., 2018;Fragoso-Campón et al., 2020;Shendryk, 2013), hasta ahora la mayoría de trabajos no ha considerado esta componente de incertidumbre (Margolis et al., 2015;Neigh et al., 2013;Su et al., 2016). Los resultados obtenidos confirman que no se puede ignorar la incertidumbre de la cartografía temática, en los inventarios LiDAR por métodos de masa, siendo estos consistentes con los de otros autores que analizaron la influencia de la incertidumbre 7/12 de cartografías de especies o mapas de composición específica. ...
En el proceso de estimación de variables forestales se suelen emplear cartografías base sobre la distribución de especies forestales. Sin embargo, y a pesar de su uso generalizado, no se suele cuantificar la incertidumbre asociada con estos mapas pudiendo suponer fuentes de error no controladas. El objetivo de este estudio era estimar el efecto que tiene la incertidumbre de un mapa de distribución de especies utilizado en el proceso de estimación del volumen de madera (V) de las principales especies forestales de La Rioja en un inventario LiDAR con métodos de masa. Para la estimación del V se ajustaron modelos Random Forest relacionando los datos de las parcelas del cuarto Inventario Forestal Español con datos LiDAR de 2010. Se generó un mapa de tipos de especies forestales a partir de imágenes espectrales de Landsat-5. Para cuantificar la incertidumbre
de los modelos de predicción de V y del mapa Landsat se implementaron distintas técnicas de remuestreo bootstrapping. La incertidumbre del volumen total aumentó entre 1,6 y 3,1 veces en el caso de considerar la incertidumbre del mapa Landsat, el cual es más acusado para las masas forestales más discontinuas y con una superficie menor. Los resultados revelan que los efectos de la incertidumbre del mapa sobre la incertidumbre de las estimaciones de volumen son significativos, y el hecho de ignorar dichos efectos podría poner en peligro la fiabilidad de las estimaciones de volumen forestal.
... Research has shown that ecosystem functions and species richness depend on the vertical arrangement of plant material [4,5]. Furthermore, information on vertical forest structure enables the modelling of biomass and related variables [6][7][8][9], which is essential for monitoring carbon cycles in the interest of climate change mitigation [10]. ...
... The ICESat mission carried the first spaceborne lidar system, operating from 2003 to 2009. Despite its focus on measuring ice shields and cloud heights, its data have been successfully used to derive measurements of vertical forest structure [8,21,22]. Since 2019, the Global Ecosystem Dynamics Investigation (GEDI) mission onboard the International Space Station (ISS) has provided data on forest structure over the Earth's tropical and temperate forests [20]. ...
Forest structure is an important variable in ecology, fire behaviour, and carbon management. New spaceborne lidar sensors, such as the Global Ecosystem Dynamics Investigation (GEDI), enable forest structure to be mapped at a global scale. Virtual GEDI-like observations can be derived from airborne laser scanning (ALS) data for given locations using the GEDI simulator, which was a tool initially developed for GEDI’s pre-launch calibration. This study compares the relative height (RH) and ground elevation metrics of real and simulated GEDI observations against ALS-derived benchmarks in southeast Australia. A total of 15,616 footprint locations were examined, covering a large range of forest types and topographic conditions. The impacts of canopy cover and height, terrain slope, and ALS point cloud density were assessed. The results indicate that the simulator produces more accurate canopy height (RH95) metrics (RMSE: 4.2 m, Bias: −1.3 m) than the actual GEDI sensor (RMSE: 9.6 m, Bias: −1.6 m). Similarly, the simulator outperforms GEDI in ground detection accuracy. In contrast to other studies, which favour the Gaussian algorithm for ground detection, we found that the Maximum algorithm performed better in most settings. Despite the determined differences between real and simulated GEDI observations, this study indicates the compatibility of both data sources, which may enable their combined use in multitemporal forest structure monitoring.
... To provide context, 14 separate GNWT FVIs were available, occupying just 38% of Phase 1 and 4% of Phase 2. This problem is not unique, resulting in considerable interest in spaceborne LiDAR (ICESat satellites) as a data source, particularly when combined with datasets that have complete spatial coverage [20,[53][54][55]. The integration of airborne/spaceborne LiDAR with other multisource data for large area mapping, including the NWT, has seen increasing traction in the recent literature [22,24,53,56,57]. Although all of these studies generated raster-based estimates of various attributes such as aboveground carbon, AGB, mean canopy height, and canopy cover, none employed locally collected field data specific to this region for calibrating their models, and neither did they generate polygonal maps of specific interest to NWT applications. ...
Sustainable forest management requires information on the spatial distribution, composition, and structure of forests. However, jurisdictions with large tracts of noncommercial forest, such as the Northwest Territories (NWT) of Canada, often lack detailed forest information across their land base. The goal of the Multisource Vegetation Inventory (MVI) project was to create a large area forest inventory (FI) map that could support strategic forest management in the NWT using optical, radar, and light detection and ranging (LiDAR) satellite remote sensing anchored on limited field plots and airborne LiDAR data. A new landcover map based on Landsat imagery was the first step to stratify forestland into broad forest types. A modelling chain linking FI plots to airborne and spaceborne LiDAR was then developed to circumvent the scarcity of field data in the region. The developed models allowed the estimation of forest attributes in thousands of surrogate FI plots corresponding to spaceborne LiDAR footprints distributed across the project area. The surrogate plots were used as a reference dataset for estimating each forest attribute in each 30 m forest cell within the project area. The estimation was based on the k-nearest neighbour (k-NN) algorithm, where the selection of the four most similar surrogate FI plots to each cell was based on satellite, topographic, and climatic data. Wall-to-wall 30 m raster maps of broad forest type, stand height, crown closure, stand volume, total volume, aboveground biomass, and stand age were created for a ~400,000 km2 area, validated with independent data, and generalized into a polygon GIS layer resembling a traditional FI map. The MVI project showed that a reasonably accurate FI map for large, remote, predominantly non-inventoried boreal regions can be obtained at a low cost by combining limited field data with remote sensing data from multiple sources.
... Both profiling airborne lidar (e.g. Boudreau et al., 2008;Nelson et al., 2009a;Margolis et al., 2015) and small footprint ALS data (e.g. Nelson et al., 2017;Holm et al., 2017) have been utilized. ...
Spaceborne lidar sensors have potential to improve the accuracy of forest above-ground biomass (AGB) estimates by providing direct measurements of 3D structure of forests over large spatial scales. The ICESat-2 (Ice, Cloud and land Elevation Satellite 2), launched in 2018, provides a good coverage of the boreal forest zone and has been previously shown to provide good estimates of forest canopy height and AGB. However, spaceborne lidar data are affected by various conditions, such as presence of snow, solar noise, and in the case of ICESat-2, the power difference between the so-called strong and weak beams. The aim of this study was to explore the effects of these conditions on the performance of AGB modeling using ICESat-2 photon data in a boreal forest area. The framework of the study is multiphase modeling, where AGB field data and wall-to-wall airborne laser scanning (ALS) data are used to produce proxy ALS plots on ICESat-2 track positions. Models between the ALS-predicted AGB and the ICESat-2 photon data are then formulated and evaluated by subsets, such as only strong beam data captured in snowy conditions.Our results indicate that, if possible, strong beam night data from snowless conditions should be used in AGB estimation, because our models showed clearly smallest RMSE (27.0%) for this data subset. If more data are needed, we recommend using only strong beam data and constructing separate models for the different data subsets. In the order of increasing RMSE\%, the next best options were snow/night/strong (30.5%), snow/day/strong (33.6%), and snowless/day/strong (34.2%). Weak beam data from snowy night conditions could also be used if necessary (31.1%).
... While reviewing earlier research that applies two sequential regression models in their modeling strategy, we noted a variety of terms describing the same concept in the literature. While we choose to refer to this as a sequential regression approach, we additionally found the following use of terminology for similar, but not necessary identical approaches: two-step modeling strategy [40], [57], [65], two-stage regression [41], [62], two-stage up-scaling method [23], [42], two-phase estimator [59], two-phase (or three-phase) sampling design [56], [61], hybrid and hierarchical model-based inference [60], [64], and three-phase design [36]. Additionally, [37]- [39], [55], [58], [63] also apply a modeling approach with two sequential regression models without labeling it by any particular term. ...
... Additionally, [37]- [39], [55], [58], [63] also apply a modeling approach with two sequential regression models without labeling it by any particular term. Most of the previous research that we identified focuses on relating ground reference data to ALS, and then relates ALS-derived AGB estimates to spaceborne LiDAR data [36], [55], [56], [58], [59], [61] or a combination of different sensors [23], [38], [42], [60], [63]- [65]. Some others relate the ALS-derived AGB estimates to a single sensor, such as Sentinel-2 [39], [41], Landsat [40], [62], GEDI Lidar [65], PALSAR, [57], or SRTM X-band radar [37]. ...
... In previous research that adopts a modeling strategy with two sequential regression models, we found traditional regression models to be most common [36]- [38], [55]- [57], [59]- [61], [64], [65], such as, e.g., [38], which focuses on multiple linear regression for upscaling biomass estimates to large areas in the tropical forest of Indonesia. Although Englhart et al. [38] included neither ML nor DL, their overall idea has similarities with our modeling strategy. ...
This study derives regression models for aboveground biomass (AGB) estimation in miombo woodlands of Tanzania that utilize the high availability and low cost of Sentinel-1 data. The limited forest canopy penetration of C-band SAR sensors along with the sparseness of available ground truth restricts their usefulness in traditional AGB regression models. Therefore, we propose to use AGB predictions based on airborne laser scanning (ALS) data as a surrogate response variable for SAR data. This dramatically increases the available training data and opens for flexible regression models that capture fine-scale AGB dynamics. This becomes a sequential modeling approach, where the first regression stage has linked
in situ
data to ALS data and produced the AGB prediction map; we perform the subsequent stage, where this map is related to Sentinel-1 data. We develop a traditional, parametric regression model and alternative nonparametric models for this stage. The latter uses a conditional generative adversarial network (cGAN) to translate Sentinel-1 images into ALS-based AGB prediction maps. The convolution filters in the neural networks make them contextual. We compare the sequential models to traditional, nonsequential regression models, all trained on limited AGB ground reference data. Results show that our newly proposed nonsequential Sentinel-1-based regression model performs better quantitatively than the sequential models, but achieves less sensitivity to fine-scale AGB dynamics. The contextual cGAN-based sequential models best reproduce the distribution of ALS-based AGB predictions. They also reach a lower RMSE against
in situ
AGB data than the parametric sequential model, indicating a potential for further development.