REFINEMENT OF CROPLAND DATA LAYER USING MACHINE LEARNING
Chen Zhang 1,2, Zhengwei Yang 3, Liping Di 1,2,*, Li Lin 1,2, Pengyu Hao 1
* Corresponding author
1 Center for Spatial Information Science and Systems, George Mason University, Fairfax, VA 22030, USA -
(czhang11, ldi, llin2, phao)@gmu.edu
2Department of Geography and Geoinformation Science, George Mason University, Fairfax, VA 22030, USA
3Research and Development Division, U.S. Department of Agriculture National Agricultural Statistics Service,
Washington, DC 20250, USA - Zhengwei.Yang@usda.gov
KEY WORDS: Cropland Data Layer, Machine Learning, Misclassification Correction, Crop Sequence Modeling, Raster Map
Refinement
ABSTRACT:
As the most widely used crop-specific land use data, the Cropland Data Layer (CDL) product covers the entire Contiguous United States (CONUS) at 30-meter spatial resolution with an accuracy of up to 95% for major crop types (e.g., corn and soybean) in major crop areas. However, the quality of the early-year CDL products is not as good as that of the recent ones. The early-year CDL products contain many erroneous pixels due to cloud cover in the original Landsat images, which affects many follow-on research efforts and applications. To address this issue, this study explores the feasibility of using machine learning technology to refine and correct misclassified pixels in the historical CDLs. An end-to-end deep learning-based framework for the restoration of misclassified pixels in CDL images is developed and tested. By feeding the CDL time series into an artificial neural network, a crop sequence model is trained and the misclassified pixels in an original CDL map can be restored. In the experiment with the 2005 CDL data of the State of Illinois, the misclassified pixels over Agricultural Statistics District (ASD) #1760 were corrected with a reasonable accuracy (>85%). The findings suggest that the proposed method provides a low-cost and reliable way to refine the historical CDL data and can potentially be scaled up to the entire CONUS.
1. INTRODUCTION
Since the first release of a full statewide data product in 1997, the Cropland Data Layer (CDL) product of the U.S. Department of Agriculture (USDA) National Agricultural Statistics Service (NASS) has been widely used by growers, the agricultural industry, governments, educators and students, and researchers worldwide for crop production, agricultural production planning and management, government policy formulation and decision making, teaching, and various research activities (Liknes et al., 2009; Thompson and Prokopy, 2009; Hao et al., 2015; Lark et al., 2015; Di et al., 2017). Currently, the CDL data cover the entire conterminous United States (CONUS) at 30-meter spatial resolution with an accuracy of up to 95% for major crop types (e.g., corn, soybean, and wheat). However, the quality of the early-year CDL products is not as good as that of recent years. The early-year CDL products contain many misclassified pixels because of cloud cover and a lack of satellite images. Moreover, CDL data were produced for only a few states before 2008. For example, the year 2000 CDL covers only Illinois, Indiana, Mississippi, North Dakota, and parts of Arkansas and Iowa. The limited availability and low quality of the early-year CDLs affect many follow-on Land Use and Land Cover (LULC) related research efforts and applications. Therefore, an effective method for refining and correcting the old CDL data is needed to improve the quality and accuracy of the historical CDL data.
It is well known that monocropping results in soil degradation, a build-up of diseases and pests, and a decline in productivity. Crop rotation has therefore become a common farming practice in the U.S. Corn Belt. Crop rotation can significantly improve soil condition, such as fertility and soil physical and chemical properties (Pikul et al., 2001; Karlen et al., 2006; Govaerts et al., 2007; Karlen et al., 2013; Van Eerd et al., 2014). Meanwhile, the crop sequence and cropping decisions also have a significant impact on crop yields and profitability (Temperly and Borges, 2006; Parajuli et al., 2013; Farmaha et al., 2016). Based on this common cropping practice, many crop mapping and yield estimation models and approaches have been developed. Secchi et al. (2011) constructed a prediction model of future land use scenarios in the state of Iowa based on the corn-soybean rotation and production costs. Schönhart et al. (2011) developed a crop sequence model to generate crop rotations based on agronomic criteria and observed data. Sahajpal et al. (2014) detected pronounced shifts from grassland to cultivated area by modelling crop rotation in the U.S. Western Corn Belt. Hao et al. (2016) explored crop classification based on knowledge of the previous year's crop. Zhang et al. (2019a) produced a crop cover map of the state of Nebraska based on the common crop rotation patterns of corn, soybeans, winter wheat, and alfalfa. They further implemented a crop sequence-based machine learning framework for the prediction of crop cover maps (Zhang et al., 2019b).
In this paper, we present a machine learning-based crop sequence model to refine the historical CDL data. The proposed model uses an artificial neural network (ANN) to automatically learn crop sequence information from the CDL time series. The misclassified pixels in a historical CDL crop cover map can then be automatically identified and corrected using the trained model.
The rest of the paper is organized as follows. Section 2 introduces the CDL data, the study area, and an end-to-end machine learning framework for the historical CDL data refinement. Section 3 demonstrates the experiment results and assesses the refinement performance. Section 4 discusses the limitation of the current implementation and gives the conclusion.
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XLII-3/W11, 2020
PECORA 21/ISRSE 38 Joint Meeting, 6–11 October 2019, Baltimore, Maryland, USA
This contribution has been peer-reviewed.
https://doi.org/10.5194/isprs-archives-XLII-3-W11-161-2020 | © Authors 2020. CC BY 4.0 License.
2. METHODS
2.1 Cropland Data Layer
CDL is a raster-formatted, geo-referenced, crop-specific land cover map produced by USDA NASS. It is an annual product covering the entire CONUS at 30-meter spatial resolution from 2008 to present and some states from 1997 to 2007. The production of CDL is mainly based on moderate-resolution satellite imagery and extensive agricultural ground truth (Boryan et al., 2011). In this paper, misclassified pixels are the pixels labelled as “clouds” or “no data” in the CDL. These pixels mainly occur in the CDL products before 2006 because of the lack of high-quality satellite data and the limitations of the classification algorithm at that time. Examples of the misclassified pixels in the early-year CDL are shown in Figure 1.
The CDL data products can be freely downloaded from CropScape (https://nassgeodata.gmu.edu/CropScape/), which is developed and maintained in cooperation with the Center for Spatial Information Science and Systems of George Mason University (Han et al., 2012; Zhang et al., 2019c). It provides an easy-to-use Web GIS application to visualize, analyse, and download CDL data. All data hosted on CropScape are disseminated via OGC standards-compliant geospatial Web services, such as Web Map Service (WMS), Web Coverage Service (WCS), Web Feature Service (WFS), and Web Processing Service (WPS).
2.2 Study Area
Agricultural Statistics District (ASD) #1760 of the state of Illinois is selected as the study area. The study area lies in the Central Corn Belt Plains Ecoregion and is mainly covered by corn, soybeans, grassland, and forest, as shown in Figure 2. The 2005 CDL contains a considerable number of pixels labelled as “clouds or no data” over the study area. The purpose of this study is to restore those misclassified pixels in the 2005 CDL of the study area using the machine-learned crop sequence model.
2.3 Machine Learning Framework
To automatically correct the misclassified pixels in CDL, an
end-to-end machine learning framework is proposed in this pa-
per. The proposed framework is composed of four major com-
ponents: data preparation, model training, classification, and
evaluation.
2.3.1 Data Preparation: In data preparation, the CDLs from 2006 to 2018 are stacked sequentially to form a CDL time series. All pixels of the CDL time series are arranged into a 2-D array of samples. Each row of the array represents one pixel and consists of the sequence of crop type values of that pixel in the different years. Training and validation data sets are randomly sampled from the “good pixels” in the study area, that is, the pixels not labelled as clouds or no data in the 2005 CDL, and are labelled with their 2005 CDL crop type. The experiment data set consists of the time series of all misclassified pixels in the study area, which have no labels.
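The data preparation step can be sketched with NumPy as follows. This is a minimal illustration rather than the authors' implementation: the function names, the in-memory array layout `(years, rows, cols)`, the validation fraction, and the class code 81 for "clouds/no data" are all assumptions made for the example.

```python
import numpy as np

CLOUD_OR_NODATA = 81  # assumed class code for "clouds/no data"; check the CDL legend


def build_samples(cdl_stack, target, bad_value=CLOUD_OR_NODATA):
    """Arrange a CDL time series into a 2-D sample array.

    cdl_stack: (n_years, rows, cols) array, e.g. the 2006-2018 CDLs.
    target:    (rows, cols) array, e.g. the 2005 CDL to be refined.
    Returns features and labels of the good pixels, features of the
    misclassified pixels, and the boolean mask of misclassified pixels.
    """
    n_years = cdl_stack.shape[0]
    # One row per pixel, one column per year.
    features = cdl_stack.reshape(n_years, -1).T
    labels = target.reshape(-1)
    bad = labels == bad_value
    return features[~bad], labels[~bad], features[bad], bad.reshape(target.shape)


def split_train_val(features, labels, val_fraction=0.2, seed=0):
    """Randomly split the good pixels into training and validation sets."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(labels))
    n_val = int(round(val_fraction * len(labels)))
    val_idx, train_idx = order[:n_val], order[n_val:]
    return (features[train_idx], labels[train_idx],
            features[val_idx], labels[val_idx])
```

Keeping the boolean mask makes it possible to write predicted labels back into the correct raster positions later.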
Figure 1. Examples of misclassified pixels in the early-year
CDL data.
Figure 2. Study area with 2005 CDL as the experiment data
(data available from CropScape).
Figure 3. Applications of machine learning-based crop sequence model.
2.3.2 Model Training: The crop sequence model is trained by feeding the training set into the artificial neural network, which contains one input layer, multiple hidden layers, and one output layer. The input layer contains a group of neurons corresponding to the same pixel of the CDL time series, and each input value represents the crop type of that pixel in one year. The output layer uses SoftMax to estimate the probability of each crop type.
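A network of this kind can be sketched in plain NumPy. The example below is an illustration under stated assumptions, not the network used in the paper: it uses a single ReLU hidden layer instead of multiple hidden layers, encodes each year's crop code as a one-hot vector (the paper does not state the input encoding), and trains with full-batch gradient descent on the cross-entropy loss. All function names and hyperparameters are hypothetical.

```python
import numpy as np


def one_hot_sequences(seqs, classes):
    """Encode (n_pixels, n_years) crop codes as concatenated one-hot vectors."""
    classes = np.asarray(classes)
    encoded = seqs[:, :, None] == classes[None, None, :]
    return encoded.reshape(len(seqs), -1).astype(float)


def softmax(z):
    """Row-wise softmax: turns output scores into class probabilities."""
    z = z - z.max(axis=1, keepdims=True)  # stabilise the exponentials
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def init_params(n_in, n_hidden, n_out, seed=0):
    rng = np.random.default_rng(seed)
    return {
        "W1": rng.normal(0.0, 0.1, (n_in, n_hidden)), "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.1, (n_hidden, n_out)), "b2": np.zeros(n_out),
    }


def forward(params, x):
    """Return hidden activations and per-class probabilities."""
    h = np.maximum(0.0, x @ params["W1"] + params["b1"])  # ReLU hidden layer
    return h, softmax(h @ params["W2"] + params["b2"])


def train(params, x, y_idx, lr=0.1, epochs=200):
    """Full-batch gradient descent; y_idx holds class indices, not CDL codes."""
    n = len(x)
    y = np.eye(params["b2"].size)[y_idx]
    losses = []
    for _ in range(epochs):
        h, p = forward(params, x)
        losses.append(-np.log(p[np.arange(n), y_idx] + 1e-12).mean())
        d_out = (p - y) / n                        # gradient at the softmax input
        d_h = (d_out @ params["W2"].T) * (h > 0)   # back-propagate through ReLU
        params["W2"] -= lr * h.T @ d_out
        params["b2"] -= lr * d_out.sum(axis=0)
        params["W1"] -= lr * x.T @ d_h
        params["b1"] -= lr * d_h.sum(axis=0)
    return losses
```

The predicted crop type of a pixel is the class with the highest probability, for example `classes[p.argmax(axis=1)]` after calling `forward`.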
2.3.3 Classification and Validation: By feeding the experiment data set to the trained crop sequence model, the misclassified pixels in the original CDL can be refined. To validate the refinement performance, we applied the same crop sequence model to the validation set and measured the agreement between the classified labels and the original labels of the validation set.
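The two operations in this step, writing predicted labels back into the map and measuring agreement on the validation set, can be sketched as below. The function names are hypothetical, and the example assumes that the predicted labels are ordered row by row in the same way as the boolean mask of misclassified pixels.

```python
import numpy as np


def overall_agreement(predicted, reference):
    """Fraction of validation pixels whose predicted label equals the original label."""
    predicted = np.asarray(predicted)
    reference = np.asarray(reference)
    if predicted.shape != reference.shape:
        raise ValueError("predicted and reference must have the same shape")
    return float((predicted == reference).mean())


def fill_misclassified(cdl, bad_mask, predicted_labels):
    """Return a copy of the CDL raster with the masked pixels replaced by predictions."""
    refined = cdl.copy()
    refined[bad_mask] = predicted_labels  # assigned in row-major order of the mask
    return refined
```

For instance, `overall_agreement([1, 5, 5, 1], [1, 5, 1, 1])` gives 0.75, because three of the four labels match.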
The applications of the proposed machine learning-based crop sequence model are illustrated in Figure 3. In this study, the crop sequence model is used to restore the historical crop cover map. The model can also be applied to predict future crop cover maps and provide high-confidence training samples for early-season and in-season crop mapping.
3. RESULTS
The refined 2005 CDL data of ASD #1760 is illustrated in Fig-
ure 4. Comparing the refined result with the original 2005 CDL
data, we observed that the misclassified pixels had been correc-
ted with the crop sequence information learned from the histor-
ical CDL time series.
Figure 4. Refined 2005 CDL of ASD #1760.
The overall accuracy of the refined pixels cannot be assessed directly because of the lack of ground reference data. Instead, we used the validation data set, derived from the “good pixels” in the study area of the 2005 CDL, to measure the performance of the model indirectly. The overall accuracy on the validation sample set is over 85%. Because this figure is measured on good pixels rather than on the refined pixels themselves, the actual overall accuracy of the refined pixels may differ. Ground reference data are required to further validate the refinement performance.
4. CONCLUSION
This study investigated the feasibility of using machine learning technology to refine CDL data. An end-to-end ANN-based framework was proposed and tested to correct the misclassified pixels in the historical CDL data. The preliminary experiment result indicates that the misclassified pixels over ASD #1760 could be corrected with reasonable accuracy (>85%). The findings suggest that the proposed machine learning approach is an effective and low-cost way to correct the misclassified pixels and has great potential for refining the historical CDL over large geographic areas. More experiments and validation will be conducted in the future.
ACKNOWLEDGEMENTS
This work is supported by the U.S. Department of Agriculture
National Agricultural Statistics Service.
REFERENCES
Boryan, C., Yang, Z., Mueller, R., Craig, M., 2011. Mon-
itoring US Agriculture: The US Department of Agricul-
ture, National Agricultural Statistics Service, Cropland Data
Layer Program. Geocarto International, 26(5), 341–358.
doi:10.1080/10106049.2011.562309.
Di, L., Yu, E. G., Kang, L., Shrestha, R., Bai, Y., 2017. RF-
CLASS: A Remote-Sensing-Based Flood Crop Loss Assess-
ment Cyber-Service System for Supporting Crop Statistics
and Insurance Decision-Making. Journal of Integrative Agriculture, 16(2), 408-423. doi:10.1016/S2095-3119(16)61499-5.
Farmaha, B. S., Eskridge, K. M., Cassman, K. G., Specht,
J. E., Yang, H., Grassini, P., 2016. Rotation Impact on On-
Farm Yield and Input-Use Efficiency in High-Yield Irrigated
Maize–Soybean Systems. Agronomy Journal, 108(6), 2313-
2321. doi:10.2134/agronj2016.01.0046.
Govaerts, B., Mezzalama, M., Unno, Y., Sayre, K. D.,
Luna-Guido, M., Vanherck, K., Dendooven, L., Deck-
ers, J., 2007. Influence of Tillage, Residue Manage-
ment, and Crop Rotation on Soil Microbial Biomass and
Catabolic Diversity. Applied Soil Ecology, 37(1), 18-30.
doi:10.1016/j.apsoil.2007.03.006.
Han, W., Yang, Z., Di, L., Mueller, R., 2012. CropScape: A
Web Service Based Application for Exploring and Dissemin-
ating US Conterminous Geospatial Cropland Data Products
for Decision Support. Computers and Electronics in Agricul-
ture, 84, 111–123. doi:10.1016/j.compag.2012.03.005.
Hao, P., Wang, L., Zhan, Y., Wang, C., Niu, Z., Wu, M.,
2016. Crop Classification Using Crop Knowledge of the
Previous-Year: Case Study in Southwest Kansas, USA.
European Journal of Remote Sensing, 49(1), 1061-1077.
doi:10.5721/EuJRS20164954.
Hao, P., Zhan, Y., Wang, L., Niu, Z., Shakir, M.,
2015. Feature Selection of Time Series MODIS Data for
Early Crop Classification Using Random Forest: A Case
Study in Kansas, USA. Remote Sensing, 7(5), 5347-5369.
doi:10.3390/rs70505347.
Karlen, D. L., Cambardella, C. A., Kovar, J. L., Colvin, T. S.,
2013. Soil Quality Response to Long-Term Tillage and Crop
Rotation Practices. Soil and Tillage Research, 133, 54-64.
doi:10.1016/j.still.2013.05.013.
Karlen, D. L., Hurley, E. G., Andrews, S. S., Cambardella,
C. A., Meek, D. W., Duffy, M. D., Mallarino, A. P., 2006.
Crop Rotation Effects on Soil Quality at Three Northern
Corn/Soybean Belt Locations. Agronomy Journal, 98(3),
484-495. doi:10.2134/agronj2005.0098.
Lark, T. J., Salmon, J. M., Gibbs, H. K., 2015. Cropland
Expansion Outpaces Agricultural and Biofuel Policies in
the United States. Environmental Research Letters, 10(4),
044003. doi:10.1088/1748-9326/10/4/044003.
Liknes, G. C., Nelson, M. D., Gormanson, D. D., Hansen,
M., 2009. The Utility of the Cropland Data Layer for
Forest Inventory and Analysis. Proceedings of the Eighth
Annual Forest Inventory and Analysis Symposium, 259–264.
doi:10.2737/WO-GTR-79.
Parajuli, P. B., Jayakody, P., Sassenrath, G. F., Ouyang, Y., Pote,
J. W., 2013. Assessing the Impacts of Crop-Rotation and
Tillage on Crop Yields and Sediment Yield Using a Model-
ing Approach. Agricultural Water Management, 119, 32-42.
doi:10.1016/j.agwat.2012.12.010.
Pikul, J. L., Carpenter-Boggs, L., Vigil, M., Schumacher, T. E.,
Lindstrom, M. J., Riedell, W. E., 2001. Crop Yield and Soil
Condition under Ridge and Chisel-Plow Tillage in the North-
ern Corn Belt, USA. Soil and Tillage Research, 60(1), 21-33.
doi:10.1016/S0167-1987(01)00174-X.
Sahajpal, R., Zhang, X., Izaurralde, R. C., Gelfand, I., Hurtt,
G. C., 2014. Identifying Representative Crop Rotation Pat-
terns and Grassland Loss in the US Western Corn Belt.
Computers and Electronics in Agriculture, 108, 173-182.
doi:10.1016/j.compag.2014.08.005.
Schönhart, M., Schmid, E., Schneider, U. A., 2011. CropRota –
A Crop Rotation Model to Support Integrated Land Use As-
sessments. European Journal of Agronomy, 34(4), 263-277.
doi:10.1016/j.eja.2011.02.004.
Secchi, S., Kurkalova, L., Gassman, P. W., Hart, C., 2011.
Land Use Change in a Biofuels Hotspot: The Case of
Iowa, USA. Biomass and Bioenergy, 35(6), 2391-2400.
doi:10.1016/j.biombioe.2010.08.047.
Temperly, R. J., Borges, R., 2006. Tillage and Crop Rotation
Impact on Soybean Grain Yield and Composition. Agronomy
Journal, 98(4), 999-1004. doi:10.2134/agronj2005.0215.
Thompson, A. W., Prokopy, L. S., 2009. Tracking Urban
Sprawl: Using Spatial Data to Inform Farmland Pre-
servation Policy. Land Use Policy, 26(2), 194-202.
doi:10.1016/j.landusepol.2008.02.005.
Van Eerd, L. L., Congreves, K. A., Hayes, A., Verhallen, A.,
Hooker, D. C., 2014. Long-Term Tillage and Crop Rota-
tion Effects on Soil Quality, Organic Carbon, and Total Ni-
trogen. Canadian Journal of Soil Science, 94(3), 303-315.
doi:10.4141/cjss2013-093.
Zhang, C., Di, L., Lin, L., Guo, L., 2019a. Ex-
tracting Trusted Pixels from Historical Cropland Data
Layer Using Crop Rotation Patterns: A Case Study
in Nebraska, USA. 2019 8th International Confer-
ence on Agro-Geoinformatics (Agro-Geoinformatics), 1–6.
doi:10.1109/Agro-Geoinformatics.2019.8820236.
Zhang, C., Di, L., Lin, L., Guo, L., 2019b. Machine-
Learned Prediction of Annual Crop Planting in the U.S.
Corn Belt Based on Historical Crop Planting Maps.
Computers and Electronics in Agriculture, 166, 104989.
doi:10.1016/j.compag.2019.104989.
Zhang, C., Di, L., Yang, Z., Lin, L., Yu, E. G., Yu, Z., Rahman,
M. S., Zhao, H., 2019c. Cloud Environment for Dissemin-
ating NASS Cropland Data Layer. 2019 8th International
Conference on Agro-Geoinformatics (Agro-Geoinformatics),
1–5. doi:10.1109/Agro-Geoinformatics.2019.8820465.
... Additionally, any uncertainties associated with the gridded data products used as input within our study are also present in our data product. Remotely sensed data products, relying on spectral signatures to distinguish crops, exhibit varying accuracy based on factors such as crop type, geographic location, and quality and quantity of satellite imagery available [33]. Furthermore, since crop-specific gridded data before 2008 were unavailable, we assumed that the distribution of crops prior to 2008 resembled the average crop distribution post-2008. ...
Article
Full-text available
The United States is a major producer and exporter of agricultural goods, fulfilling global demands for food, fiber, and fuel while generating substantial economic benefits. Agriculture in the U.S. not only dominates land use but also ranks as the largest water-consuming sector. High-resolution cropland mapping and insights into cultivation trends are essential to enhance sustainable management of land and water resources. Existing data sources present a trade-off between temporal breadth and spatial resolution, leading to gaps in detailed geographic crop distribution. To bridge this gap, we adopted a data-fusion methodology that leverages the advantages of various data sources, including county-level data from the U.S. Department of Agriculture, along with several gridded land use datasets. This approach enabled us to create annual maps, termed HarvestGRID, of irrigated and harvested areas for 30 key crops across the U.S. from 1981 to 2019 at a resolution of 2.5 arc minutes. Over the past four decades, irrigated harvested area has remained relatively stable nationally; however, several western states exhibit a declining trend, while some eastern states show an upward trend. Notably, more than 50% of the irrigated land in the U.S. lies above three major aquifers: the High Plains, Central Valley, and Mississippi Embayment Aquifers. We assessed the accuracy of HarvestGRID by comparing it with other large-scale gridded cropland databases, identifying both consistencies and discrepancies across different years, regions, and crops. This dataset is pivotal for analyzing long-term cropland use patterns and supports the advancement of more sustainable agricultural practices.
... Currently, the CDL represents the most comprehensive national crop type map available for free use and download, offering exceptional overall accuracy for major crops, particularly corn and soybeans, with accuracy rates exceeding 95% [5]. However, several studies have highlighted defects within the CDL, primarily due to its reliance on remote sensing datasets, which raises concerns regarding the quality of CDL products [9,[14][15][16][17][18]. To address these concerns, the USDA National Agricultural Statistics Service (NASS) has provided an accuracy rating since 2008 and introduced the confidence layer in 2017 to enhance data reliability [5]. ...
Article
Full-text available
Various systems have been developed to process agricultural land data for better management of crop production. One such system is Cropland Data Layer (CDL), produced by the National Agricultural Statistics Service of the United States Department of Agriculture (USDA). The CDL has been widely used for training deep learning (DL) segmentation models. However, it contains various errors, such as salt-and-pepper noise, and must be refined before being used in DL training. In this study, we used two approaches to refine the CDL for DL segmentation of major crops from a time series of Sentinel-2 monthly composite images. Firstly, different confidence intervals of the confidence layer were used to refine the CDL. Secondly, several image filters were employed to improve data quality. The refined CDLs were then used as the ground-truth in DL segmentation training and evaluation. The results demonstrate that the CDL with +45% and +55% confidence intervals produced the best results, improving the accuracy of DL segmentation by approximately 1% compared to non-refined data. Additionally, filtering the CDL using the majority and expand–shrink filters yielded the best performance, enhancing the evaluation metrics by about 1.5%. The findings suggest that pre-filtering the CDL and selecting an effective confidence interval can significantly improve DL segmentation performance, contributing to more accurate and reliable agricultural monitoring.
... The water is shallow (5-15 cm) and contaminated with soil in the sowing period (brown color in the Sentinel-2 FBR images), while it is deeper and clarified in the flood period (dark blue color in the Sentinel-2 FBR images). Moreover, machine-learning-based (support vector machine, decision tree, and random forest) or deep-learning-based (deep neural networks, convolutional neural networks, and recurrent neural network) supervised classification methods are widely used for LULC classification and water detection [62,63,[109][110][111][112]. One of the most important advantages of these methods is the ability to deal with uncertainties in the input data and cope with multiple input data sources in order to improve the accuracy of the classification or regression results [76,113]. ...
Article
Full-text available
The annual flood and the alteration in hydrological regimes are the most vital concerns in the Vietnamese Mekong Delta (VMD). Although synthetic aperture radar (SAR) Sentinel-1 imagery is widely used for water management, only a few studies have used Sentinel-1 data for mapping surface water and monitoring flood events in the VMD. This study developed an algorithm to implement (i) automatic Otsu threshold on a series of Sentinel-1 images to extract surface water and (ii) time series analyses on the derived surface water maps to detect flood water extent in near-real-time (NRT). Specifically, only cross-polarized VH was selected after an assessment of different Sentinel-1 polarizations. The dynamic Otsu thresholding algorithm was applied to identify an optimal threshold for each pre-processed Sentinel-1 VH image to separate water from non-water pixels for producing a time series of surface water maps. The derived Sentinel-1 surface water maps were visually compared with the Sentinel-2 Full Resolution Browse (FRB) and statistically examined with the Sentinel-2 Multispectral Instrument (MSI) surface water maps, which were generated by applying the Otsu threshold on the normalized difference water index (NDWI) and modified normalized difference water index (MNDWI) images. The visual comparison showed a strong correspondence between the Sentinel-1 surface water maps and Sentinel-2 FRB images in three periods, including rice’s sowing season, flood period, and rice’s maturation stage. A good statistical agreement suggested that the performance of the dynamic Otsu thresholding algorithm on Sentinel-1 image time series to map surface water is effective in river areas (R2 = 0.97 and RMSE = 1.18%), while it is somewhat lower in paddy field areas (R2 = 0.88 and RMSE = 3.88%). Afterward, a flood mapping algorithm in NRT was developed by applying the change-detection-based time series analyses on the derived Sentinel-1 surface water maps. 
Every single pixel at the time is respectively referred to its state in the water/non-water and flooded/non-flooded maps at the previous time to be classified into a flooded or non-flooded pixel. The flood mapping algorithm enables updates at each time step to generate temporal flood maps in NRT for monitoring flood water extent in large-scale areas. This study provides a tool to rapidly generate surface water and flood maps to support water management and risk reduction in the VMD. The future improvement of the current algorithm is discussed.
... The transfer learning technique is extremely useful when there are fewer labels [134]. Research has indicated that the combination of machine learning, remote sensing, and GIS has promising potential to outperform many traditional field surveys and standalone remote sensing classification methods [100,151,152]. ...
Article
Full-text available
In the past few decades, most urban areas in the world have been facing the pressure of an increasing population living in poverty. A recent study has shown that up to 80% of the population of some cities in Africa fall under the poverty line. Other studies have shown that poverty is one of the main contributors to residents’ poor health and social conflict. Reducing the number of people living in poverty and improving their living conditions have become some of the main tasks for many nations and international organizations. On the other hand, urban gentrification has been taking place in the poor neighborhoods of all major cities in the world. Although gentrification can reduce the poverty rate and increase the GDP and tax revenue of cities and potentially bring opportunities for poor communities, it displaces the original residents of the neighborhoods, negatively impacting their living and access to social services. In order to support the sustainable development of cities and communities and improve residents’ welfare, it is essential to identify the location, scale, and dynamics of urban poverty and gentrification, and remote sensing can play a key role in this. This paper reviews, summarizes, and evaluates state-of-the-art approaches for identifying and mapping urban poverty and gentrification with remote sensing, GIS, and machine learning techniques. It also discusses the pros and cons of remote sensing approaches in comparison with traditional approaches. With remote sensing approaches, both spatial and temporal resolutions for the identification of poverty and gentrification have been dramatically increased, while the economic cost is significantly reduced.
... To address this issue, we used the proposed machine learning model to refine and correct misclassified pixels in the historical CDLs. Our study showed that the proposed machine learning model can automatically correct most of misclassified pixels in an original CDL map [14]. Figure 4 illustrates the comparison of the original CDL data with the refined CDL data. ...
... On the one hand, the coverage of CDL for the study area was incomplete before 2007. On the other hand, there were many misclassified pixels in the early-year CDLs due to the cloud or lack of satellite images (Zhang et al., 2020b). This study used CDL and field data as reference data to evaluate mapping results. ...
Article
Full-text available
A timely and detailed crop-specific land cover map can support many agricultural applications and decision makings. However, in-season crop mapping over a large area is still challenging due to the insufficiency of ground truth in the early stage of a growing season. To address this issue, this paper presents an efficient machine-learning workflow for the rapid in-season mapping of corn and soybeans fields without ground truth data for the current year. We use trusted pixels, a set of pixels that are predicted from the historical Cropland Data Layer (CDL) data with high confidence in the current year’s crop type, to label training samples on multi-temporal satellite images for crop type classification. The entire mapping process only involves a limited number of satellite images acquired within the growing season (normally 3–4 images per scene) and no field data needs to be collected. According to the investigation on 12 states of the U.S. Corn Belt, it is found that a considerable number of trusted pixels can be identified from the historical CDL data by the trusted pixel prediction model based on artificial neural network. According to the experiment on 49 Landsat-8 scenes and 31 Sentinel-2 tiles, the in-season maps of corn and soybeans are expected to reach 85%–95% agreement with CDL as well as field data by mid-July. Once the in-season satellite imagery becomes available, the crop cover map can be rapidly created even with limited computational resources. This study provides a new perspective and detailed guidance for rapid in-season mapping of corn and soybeans, which can be potentially applied to identify more diverse crop types and scaled up to the entire United States.
... Each layer contains a series of neurons. Studies have shown that the artificial neural network is an effective and efficient approach for the prediction of crop mapping (Zhang et al. 2019a) and refinement of historical crop cover maps (Zhang et al. 2020b). ...
Chapter
Image processing is an essential part of the agricultural observation system. This chapter is the first attempt to provide an overview of the image processing methods, technologies, and tools from the perspective of agro-geoinformatics. First, we introduce the origins, definitions, and basic steps of digital image processing. Along with the traditional image processing hardware and software, the state-of-the-art technologies for agricultural image processing, such as mobile device-based image processing and cloud computing-based image processing, are covered. Image data could be acquired by different sensors in different ways. We discuss three common approaches to collect agricultural image data, in situ, airborne-based, and space-borne-based data collection, as well as the big data challenge in agro-geoinformatics. As the core image processing operation in the agricultural observation system, information extraction aims to understand agro-geoinformation from the raw image data. This chapter also illustrates several image information extraction methods that are widely employed in agro-geoinformatics, such as knowledge-based expert system, machine learning-based decision tree, and artificial neural network. Furthermore, a case study of the production of Cropland Data Layer (CDL) data, a comprehensive, raster-formatted, geo-referenced, annual crop-specific land cover map produced by the U.S. Department of Agriculture (USDA) National Agricultural Statistics Service (NASS), is demonstrated.
... Only a few states were fully covered before 2008. These quality and coverage issues of the early-year CDL data can potentially affect the follow-on studies (Zhang et al., 2020). ...
Article
Google Earth Engine (GEE) is an ideal platform for large-scale geospatial agricultural and environmental modeling based on its diverse geospatial datasets, easy-to-use APIs, rich reusable library, and high-performance computational capacity. However, using GEE to prepare agricultural land use data for geospatial agricultural and environmental modeling requires not only programming skills with the GEE APIs but also knowledge of the data. This paper presents a toolkit, AgKit4EE, to facilitate the use of the Cropland Data Layer (CDL) products on the GEE platform. The toolkit contains a variety of frequently used functions for working with CDL products, including crop sequence modeling, crop frequency modeling, confidence layer modeling, and land use change analysis. The experimental results suggest that the toolkit can significantly reduce the workload for modelers who perform geospatial agricultural and environmental modeling with CDL data as well as developers who build the GEE-enabled cyberinfrastructure for agricultural land use modeling of the conterminous United States.
Article
Using the case study of birds and food crops, we investigate whether diversifying crop production can enhance preservation of local biodiversity. To this end we combine annual bird survey data, high-resolution land use data, and phylogenetic trees to create a landscape-level panel data set covering the conterminous United States for over a decade. Our econometric analysis shows that greater local food crop heterogeneity increases local avian diversity, although this effect is spatially limited. Supplementary county-level data provide evidence that more food crop diversity is unlikely to come at the cost of lower revenues.
Article
An accurate crop planting map can provide essential information for decision support in agriculture. Methods of post-season and in-season crop mapping have been widely studied in the land use and land cover community. However, it remains a challenge to predict the spatial distribution of crop planting before the growing season. This paper is the first attempt to use a machine learning approach for the prediction of field-level annual crop planting from historical crop planting maps. We present an end-to-end machine learning framework for crop planting prediction using Cropland Data Layer (CDL) time series as reference data and a multi-layer artificial neural network as the prediction model. The proposed framework was first tested in Lancaster County, Nebraska, then scaled up to the U.S. Corn Belt. According to the experiment results from 53 Agricultural Statistics Districts, we found the machine-learned crop planting map was expected to reach 88% agreement with the future CDL. Meanwhile, the crop acreage estimates derived from the machine-learned prediction were highly correlated (R² > 0.9) with the crop acreage estimates of CDL and official statistics by the U.S. Department of Agriculture National Agricultural Statistics Service. This study provides a low-cost and efficient way to predict the annual crop planting map, which can be used to support many agricultural applications and decision-making before the beginning of a growing season.
Conference Paper
It is still a challenge to generate a timely crop cover map over a large geographic area due to the lack of reliable ground truth early in the growing season. This paper introduces an efficient method to extract “trusted pixels” from the historical Cropland Data Layer (CDL) data using crop rotation patterns, which can be used to replace the actual ground truth in crop mapping and other agricultural applications. A case study in the state of Nebraska, USA, is demonstrated. The common crop rotation patterns of four major crop types, corn, soybeans, winter wheat, and alfalfa, are compared and analyzed. The experiment results show that a considerable number of pixels in CDL followed a certain crop sequence during the past decade. Each observed crop type has at least one reliable crop rotation pattern. Based on the reliable crop rotation patterns, a great proportion of pixels can be correctly mapped a year ahead of the release of the current-year CDL product. These trusted pixels can be potentially used to label training samples for crop type classification early in the growing season.
Conference Paper
Cropland Data Layer (CDL) is an annual crop-specific land use map produced by the U.S. Department of Agriculture (USDA) National Agricultural Statistics Service (NASS). The CDL products are officially hosted on the CropScape website, which provides capabilities of geospatial data visualization, retrieval, processing, and statistics based on open geospatial Web services. This study utilizes cloud computing technology to improve the performance of the CropScape application and Web services. A cloud-based prototype of CropScape is implemented and tested. The experiment results show the performance of CropScape is significantly improved in the cloud environment. Compared with the original system architecture of CropScape, the cloud-based architecture provides a more flexible and effective environment for the dissemination of CDL data.
Article
Floods often cause significant crop loss in the United States. Timely and objective information on flood-related crop loss, such as flooded acreage and degree of crop damage, is very important for crop monitoring and risk management in agricultural and disaster-related decision-making at many concerned agencies. Currently, the concerned agencies mostly rely on field surveys to obtain crop loss information and compensate farmers' loss claims. Such methods are expensive, labor intensive, and time-consuming, especially for a large flood that affects a large geographic area. The results from such methods suffer from inaccuracy, subjectivity, untimeliness, and lack of reproducibility. Recent studies have demonstrated that Earth observation (EO) data could be used in post-flood crop loss assessment for a large geographic area in an objective, timely, accurate, and cost-effective manner. However, there is no operational decision support system that employs such EO-based data and algorithms for operational flood-related crop decision-making. This paper describes the development of an EO-based flood crop loss assessment cyber-service system, RF-CLASS, for supporting flood-related crop statistics and insurance decision-making. Based on the service-oriented architecture, RF-CLASS has been implemented with open interoperability specifications to facilitate the interoperability with EO data systems, particularly the National Aeronautics and Space Administration (NASA) Earth Observing System Data and Information System (EOSDIS), for automatically fetching the input data from the data systems. Validated EO algorithms have been implemented as web services in the system to operationally produce a set of flood-related products from EO data, such as flood frequency, flooded acreage, and degree of crop damage, for supporting decision-making in flood statistics and flood crop insurance policy.
The system leverages recent advances in remote sensing-based flood monitoring and assessment, the near-real-time availability of EO data, the service-oriented architecture, geospatial interoperability standards, and standards-based geospatial web service technology. The prototype system has automatically generated the flood crop loss products and demonstrated the feasibility of using such products to improve agricultural decision-making. Evaluation of the system by the end-user agencies indicates that significant improvement in flood-related crop decision-making has been achieved with the system.
Article
Crop-type distribution products from previous years were used to generate training samples in the classification year. For each pixel, if the frequency of one crop was higher than 50%, the pixel was assumed to be a “possible training sample” of the high-frequency crop. Next, features of the “possible samples” were compared with reference crop features, and matching “possible samples” were confirmed as training samples. The Cropland Data Layer (CDL) in Southwest Kansas during 2006–2013 was used as the crop products and MODIS EVI time series were used as crop features; training samples in 2014 were then acquired. Most of these training samples had the same crop label as the 2014 CDL data, and the training samples achieved good classification accuracies. © 2016 by the authors; licensee Italian Society of Remote Sensing (AIT).
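The first step described in this abstract, selecting "possible training samples" by crop frequency, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `possible_training_label`, the pixel identifiers, and the example crop histories are hypothetical and do not come from the paper, and the second step (confirming candidates against reference MODIS EVI features) is not shown.

```python
from collections import Counter


def possible_training_label(history, threshold=0.5):
    """Return the dominant crop if its frequency exceeds `threshold`, else None.

    `history` is a list of crop labels for one pixel, one entry per year.
    The 50% threshold follows the abstract; everything else is assumed.
    """
    if not history:
        return None
    label, count = Counter(history).most_common(1)[0]
    # Strictly "higher than 50%", so an even split does not qualify.
    return label if count / len(history) > threshold else None


# Hypothetical eight-year histories (e.g. 2006-2013) for three pixels.
pixels = {
    "a": ["corn"] * 6 + ["soybeans"] * 2,      # corn in 6 of 8 years
    "b": ["corn", "soybeans"] * 4,             # even split, no dominant crop
    "c": ["winter wheat"] * 5 + ["fallow"] * 3,
}

candidates = {pid: possible_training_label(h) for pid, h in pixels.items()}
print(candidates)
```

Under this rule a pixel in a strict two-year rotation, such as pixel "b", never becomes a candidate, because neither crop exceeds half of the years.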
Article
Cereal yields tend to be higher in cereal–legume rotations relative to cereal monoculture yields. We investigated the influence of crop rotation on yield and input-use efficiency in high-yield irrigated maize (Zea mays L.)-based cropping systems using producer-reported data from western U.S. Corn Belt (about 11,000 observations). Across regions, average yield of maize grown after soybean [Glycine max (L.) Merr.] (S–M) was 0.2 to 0.6 Mg ha⁻¹ (2–5%) higher, relative to yield of maize grown after maize (M–M). Soybean yield was 5% greater after two consecutive maize crops (M–M–S) than after only 1 yr of maize (S–M–S). Nitrogen fertilizer rate in maize fields was 13 kg N ha⁻¹ (6%) lower in S–M than M–M fields, which, together with higher maize yields in S–M fields, resulted in 11% higher nitrogen partial factor productivity (PFP_N). Difference in PFP_N was unrelated with residual soil N–NO₃⁻ from prior crop. Analysis of rotation data indicated that rotation effect persists across a wide range of maize yields, from 6 to 15 Mg ha⁻¹, though magnitude of rotation effect decreases with increasing yield level. Trends toward greater proportion of total maize area in S–M, rather than M–M, accounts for 8% of maize yield gain in U.S. Corn Belt since 1970. Similarity between our findings and previous research highlights the opportunity to quantify impact of management on yield and efficiencies by using producer data as a complement to high-cost multi-year, multi-site field experiments.
Core Ideas: We assessed rotation effect on on-farm yield and input-use efficiency. Analysis was based on a large producer-reported database collected from high-yield irrigated maize–soybean systems. There was a consistent positive rotation effect on yield and partial factor productivity for N fertilizer. Number of previous maize crops did not affect maize yield in monoculture but soybean yields were higher following multiple maize crops. Increasing maize area in rotation relative to monoculture accounts for 8% of the maize yield gain in the U.S. Corn Belt since 1970.
Article
Currently, accurate information on crop area coverage is vital for food security and industry, and there is strong demand for timely crop mapping. In this study, we used MODIS time series data to investigate the effect of the time series length on crop mapping. Eight time series with different lengths (ranging from one month to eight months) were tested. For each time series, we first used the Random Forest (RF) algorithm to calculate the importance score for all features (including multi-spectral data, Normalized Difference Vegetation Index (NDVI), Normalized Difference Water Index (NDWI), and phenological metrics). Subsequently, an extension of the Jeffries-Matusita (JM) distance was used to measure class separability for each time series. Finally, the RF algorithm was used to classify crop types, and the classification accuracy and certainty were used to analyze the influence of the time series length and the number of features on classification performance; the features were added one by one based on their importance scores. Results indicated that when the time series was longer than five months, the top ten features remained stable. These features were mainly in July and August. In addition, the NDVI features contributed the majority of the most significant features for crop mapping. The NDWI and data from multi-spectral bands also contributed to improving crop mapping. On the other hand, separability, classification accuracy, and certainty increased with the number of features used and the time series length, although these values quickly reached saturation. Five months was the optimal time series length, as longer time series provided no further improvement in the classification performance. This result shows that relatively short time series have the potential to identify crops accurately, which allows for early crop mapping over large areas.
Article
Cultivation of corn and soybeans in the United States reached record high levels following the biofuels boom of the late 2000s. Debate exists about whether the expansion of these crops caused conversion of grasslands and other carbon-rich ecosystems to cropland or instead replaced other crops on existing agricultural land. We tracked crop-specific expansion pathways across the conterminous US and identified the types, amount, and locations of all land converted to and from cropland, 2008–2012. We found that crop expansion resulted in substantial transformation of the landscape, including conversion of long-term unimproved grasslands and land that had not been previously used for agriculture (cropland or pasture) dating back to at least the early 1970s. Corn was the most common crop planted directly on new land, as well as the largest indirect contributor to change through its displacement of other crops. Cropland expansion occurred most rapidly on land that is less suitable for cultivation, raising concerns about adverse environmental and economic costs of conversion. Our results reveal opportunities to increase the efficacy of current federal policy conservation measures by modifying coverage of the 2014 US Farm Bill Sodsaver provision and improving enforcement of the US Renewable Fuels Standard.
Article
Soil quality is influenced by inherent and anthropogenic factors. This study was conducted to provide multiple groups guidance on how to achieve and maintain improved soil quality/health. Our hypothesis was that tillage intensity was the primary anthropogenic factor degrading soil quality, and our objective was to prove that hypothesis through an intensive 2005 sampling of a central Iowa, USA field study. Chisel plow, disk tillage, moldboard plow, ridge-till and no-till treatments, used for 31 years in a two-year, corn (Zea mays L.)/soybean [Glycine max (L.) Merr.] (C/S) rotation or for 26 years of continuous corn (CC) production, were evaluated by measuring 23 potential soil quality indicators. Soil samples from 0 to 5- and 5 to 15-cm depth increments were collected from 158 loam or clay loam sampling sites throughout the 10-ha study site. Nine of the indicators were evaluated by depth increment using the Soil Management Assessment Framework (SMAF) which has scoring functions for 13 soil biological, chemical, and physical measurements and can be used to compute individual indicator indices and an overall soil quality index (SQI). Water-stable aggregation (WSA), total organic carbon (TOC), microbial biomass carbon (MBC), and potentially mineralizable nitrogen (PMN) were all significantly lower for the 0 to 5-cm and generally lower for 5 to 15-cm increments after long-term moldboard plowing and its associated secondary tillage operations. This presumably reflected greater physical breakup and oxidation of above- and below-ground plant residues. Bray-P concentrations in moldboard plow plots were also significantly lower at both depth increments. Between soil texture groups, significant differences were found for WSA, Bray-P, TOC and MBC at both depth increments and for both cropping systems. When combined into an overall SQI, both soil texture groups were functioning at 82-85% of their potential at 0-5-cm and at 75% of their potential at the 5-15-cm depth. 
Our hypothesis that moldboard plowing would have the greatest negative effect on soil quality indicators was verified. Based on this assessment, we recommend that to achieve and maintain good soil health, producers should strive to adopt less aggressive tillage practices.