Chapter

A Review of Managing Water Resources in Malaysia with Big Data Approaches

Authors:
To read the full-text of this research, you can request a copy directly from the authors.

Abstract

Big data have rapidly developed as a viable solution to many problems faced in engineering industries. Specifically, in the industry of water resource engineering, where there is a tremendous amount of data, various big data techniques could be applied to achieve innovative and efficient solutions for the industry. This study reviewed the proposal of big data as potential approaches to solve various difficulties encountered in managing water resources and related applications in Malaysia. The advantages and disadvantages of big data applications have also been discussed along with a brief literature review and some examples of case studies. © 2021 by Emerald Publishing Limited All rights of reproduction in any form reserved.

No full-text available

Request Full-text Paper PDF

To read the full-text of this research,
you can request a copy directly from the authors.

ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
Suspended sediment load (SSL) estimation is a required exercise in water resource management. This article proposes the use of hybrid artificial neural network (ANN) models, for the prediction of SSL, based on previous SSL values. Different input scenarios of daily SSL were used to evaluate the capacity of the ANN-ant lion optimization (ALO), ANN-bat algorithm (BA) and ANN-particle swarm optimization (PSO). The Goorganrood basin in Iran was selected for this study. First, the lagged SSL data were used as the inputs to the models. Next, the rainfall and temperature data were used. Optimization algorithms were used to fine-tune the parameters of the ANN model. Three statistical indexes were used to evaluate the accuracy of the models: the root-mean-square error (RMSE), mean absolute error (MAE) and Nash-Sutcliffe efficiency (NSE). An uncertainty analysis of the predicting models was performed to evaluate the capability of the hybrid ANN models. A comparison of models indicated that the ANN-ALO improved the RMSE accuracy of the ANN-BA and ANN-PSO models by 18% and 26%, respectively. Based on the uncertainty analysis, it can be surmised that the ANN-ALO has an acceptable degree of uncertainty in predicting daily SSL. Generally, the results indicate that the ANN-ALO is applicable for a variety of water resource management operations.
Article
Full-text available
Water Quality Index (WQI) is the most common determinant of the quality of the stream-flow. According to the Department of Environment (DOE, Malaysia), WQI is chiefly affected by six factors, which are, chemical oxygen demand (COD), biochemical oxygen demand (BOD), dissolved oxygen (DO), suspended solids (SS), -potential for hydrogen (pH), and ammoniacal nitrogen (AN). In fact, understanding the inter-relationships between these variables and WQI can improve predicting the WQI for better water resources management. The aim of this study is to create an input approach using ANNs (Artificial Neural Networks) to compute the WQI from input parameters instead of using the indices of the parameters when one of the parameters is absent. The data are collected from the nine water quality monitoring stations at the Klang River basin, Malaysia. In addition, comprehensive sensitivity analysis has been carried out to identify the most influential input parameters. The model is based on the frequency distribution of the significant factors showed exceptional ability to replicate the WQI and attained very high correlation (98.78%). Furthermore, the sensitivity analysis showed that the most influential parameter that affects WQI is DO, while pH is the least one. Additionally, the performance of models shows that the missing DO values caused deterioration in the accuracy.
Article
Full-text available
Earth observation technology has provided highly useful information in global climate change research over the past few decades and greatly promoted its development, especially through providing biological, physical, and chemical parameters on a global scale. Earth observation data has the 4V features (volume, variety, veracity, and velocity) of big data that are suitable for climate change research. Moreover, the large amount of data available from scientific satellites plays an important role. This study reviews the advances of climate change studies based on Earth observation big data and provides examples of case studies that utilize Earth observation big data in climate change research, such as synchronous satellite-aerial-ground observation experiments, which provide extremely large and abundant datasets; Earth observational sensitive factors (e.g., glaciers, lakes, vegetation, radiation, and urbanization); and global environmental change information and simulation systems. With the era of global environment change dawning, Earth observation big data will underpin the Future Earth program with a huge volume of various types of data and will play an important role in academia and decision-making. Inevitably, Earth observation big data will encounter opportunities and challenges brought about by global climate change.
Article
With the advent of big data, such data regarding the harmonious relationship of “people-land-time” have also gradually entered into the fields of natural resource management. Ecological land is one of the important natural resources and is fundamental to maintaining ecological security. Ecological land change can lead to a series of eco-environmental problems, including water shortages, soil erosion, increased drought intensity, ecosystem damage, and biodiversity loss. Based on relevant sets of big data, including spatial land data, soil data, DEM, climatic data, and socio-economic data, this study explores the factors influencing ecological land change during the period of 2000-2005 in China’s Beijing-Tianjin-Hebei Region. The results show that the factors influencing different types of ecological land change have substantial differences. For forest land coverage change, Slope type, Soil organic matter (SOM) content, Farmer’s population percentage, and Landform type are the most important independent variables. However, for grassland change, altitude, distance to the primary road and GDP per capita are the most important spatial determinants. Regarding the wetland change, farmer’s population percentage, GDP per capita and altitude are the most important factors influencing wetland changes. This study indicates that natural and social-economic factors can affect ecological land change in China’s Beijing-Tianjin-Hebei Region.
Article
Precipitation is a key control on watershed hydrologic modelling output, with errors in rainfall propagating through subsequent stages of water quantity and quality analysis. Most watershed models incorporate precipitation data from rain gauges; higher-resolution data sources are available, but they are associated with greater computational requirements and expertise. Here, we investigate whether the Multisensor Precipitation Estimator (MPE or Stage IV Next-Generation Radar) data improve the accuracy of streamflow simulations using the Soil and Water Assessment Tool (SWAT), compared with rain gauge data. Simulated flows from 2002 to 2010 at five timesteps were compared with observed flows for four nested subwatersheds of the Neuse River basin in North Carolina (21-, 203-, 2979-, and 10 100-km2 watershed area), using a multi-objective function, informal likelihood-weighted calibration approach. Across watersheds and timesteps, total gauge precipitation was greater than radar precipitation, but radar data showed a conditional bias of higher rainfall estimates during large events (>25–50 mm/day). Model parameterization differed between calibrations with the two datasets, despite the fact that all watershed characteristics were the same across simulation scenarios. This underscores the importance of linking calibration parameters to realistic processes. SWAT simulations with both datasets underestimated median and low flows, whereas radar-based simulations were more accurate than gauge-based simulations for high flows. At coarser timesteps, differences were less pronounced. Our results suggest that modelling efforts in watersheds with poor rain gauge coverage can be improved with MPE radar data, especially at short timesteps. Published 2013. This article is a U.S. Government work and is in the public domain in the USA.
Article
Big Data concern large-volume, complex, growing data sets with multiple, autonomous sources. With the fast development of networking, data storage, and the data collection capacity, Big Data are now rapidly expanding in all science and engineering domains, including physical, biological and biomedical sciences. This paper presents a HACE theorem that characterizes the features of the Big Data revolution, and proposes a Big Data processing model, from the data mining perspective. This data-driven model involves demand-driven aggregation of information sources, mining and analysis, user interest modeling, and security and privacy considerations. We analyze the challenging issues in the data-driven model and also in the Big Data revolution.
Article
Climate science is a Big Data domain that is experiencing unprecedented growth. In our efforts to address the Big Data challenges of climate science, we are moving toward a notion of Climate Analytics-as-a-Service (CAaaS). We focus on analytics, because it is the knowledge gained from our interactions with Big Data that ultimately produce societal benefits. We focus on CAaaS because we believe it provides a useful way of thinking about the problem: a specialization of the concept of business process-as-a-service, which is an evolving extension of IaaS, PaaS, and SaaS enabled by Cloud Computing. Within this framework, Cloud Computing plays an important role; however, we see it as only one element in a constellation of capabilities that are essential to delivering climate analytics as a service. These elements are essential because in the aggregate they lead to generativity, a capacity for self-assembly that we feel is the key to solving many of the Big Data challenges in this domain. MERRA Analytic Services (MERRA/AS) is an example of cloud-enabled CAaaS built on this principle. MERRA/AS enables MapReduce analytics over NASA’s Modern-Era Retrospective Analysis for Research and Applications (MERRA) data collection. The MERRA reanalysis integrates observational data with numerical models to produce a global temporally and spatially consistent synthesis of 26 key climate variables. It represents a type of data product that is of growing importance to scientists doing climate change research and a wide range of decision support applications. MERRA/AS brings together the following generative elements in a full, end-to-end demonstration of CAaaS capabilities: (1) high-performance, data proximal analytics, (2) scalable data management, (3) software appliance virtualization, (4) adaptive analytics, and (5) a domain-harmonized API. The effectiveness of MERRA/AS has been demonstrated in several applications. In our experience, Cloud Computing lowers the barriers and risk to organizational change, fosters innovation and experimentation, facilitates technology transfer, and provides the agility required to meet our customers’ increasing and changing needs. Cloud Computing is providing a new tier in the data services stack that helps connect earthbound, enterprise-level data and computational resources to new customers and new mobility-driven applications and modes of work. For climate science, Cloud Computing’s capacity to engage communities in the construction of new capabilities is perhaps the most important link between Cloud Computing and Big Data.