Article

Bauwerksüberwachung 4.0 an der Ennepetalsperre mittels Big-Data-Analysen

Authors:
To read the full-text of this research, you can request a copy directly from the authors.

No full-text available

Request Full-text Paper PDF

To read the full-text of this research,
you can request a copy directly from the authors.

Article
Measurement data management systems are a key factor in modern urban water resource management, that allow to judge data quality assessments and are thus crucial for further applications. The joint analysis of all sensor data is often not possible, due to the enormous amount of data incoming. In this study, a novel data analysis tool is presented that is able to analyse a water resource management system as a whole and to detect erroneous measurements.
Article
Full-text available
Several factors have an impact on the generation of floods, for example, antecedent moisture conditions and the shape of the catchment. A very important factor is the event rainfall, especially its temporal distribution. However, the categorization of temporal distributions is riddled with uncertainty, due to a priori assumptions on distribution types. Here, we propose a new clustering approach based on unsupervised learning, using dimensionless mass curves to describe the temporal distributions. The purpose of the proposed method is the identification of reoccurring temporal distributions of precipitation events. Additionally, the correlation of the resulting clusters of temporal distribution with rainfall‐induced flood types is investigated. The application to several catchments in Germany showed the existence of spatial patterns of six different clusters for the temporal distribution and a significant coherence with the flood types. It was found that the temporal distribution of rainfall intensities shifts from early peaked to more uniformly distributed when shifting the flood type from short floods with high peaks to long‐duration floods, often with several peaks.
Article
Full-text available
Karstic groundwater systems are often investigated by a combination of environmental or artificial tracers. One of the major downsides of tracer‐based methods is the limited availability of tracer measurements, especially in data sparse regions. This study presents an approach to systematically evaluate the information content of the available data, to interpret predictions of tracer concentration from machine learning algorithms, and to compare different machine learning algorithms to obtain an objective assessment of their applicability for predicting environmental tracers. There is a large variety of machine learning approaches, but no clear rules exist on which of them to use for this specific problem. In this study, we formulated a framework to choose the appropriate algorithm for this purpose. We compared four different well‐established machine learning algorithms (Support Vector Machines, Extreme Learning Machines, Decision Trees, and Artificial Neural Networks) in seven different karst springs in France for their capability to predict tracer concentrations, in this case SO42− and NO3−, from discharge. Our study reveals that the machine learning algorithms are able to predict some characteristics of the tracer concentration, but not the whole variance, which is caused by the limited information content in the discharge data. Nevertheless, discharge is often the only information available for a catchment, so the ability to predict at least some characteristics of the tracer concentrations from discharge time series to fill, for example, gaps or increase the database for consecutive analyses is a helpful application of machine learning in data sparse regions or for historic databases.
Article
There has been a recent resurgence of interest in the application of Information Theory to problems of system identification in the Earth and Environmental Sciences. While the concept of entropy has found increased application, little attention has yet been given to the practical problems of estimating entropy when dealing with the unique characteristics of two commonly used kinds of hydrologic data: rainfall and runoff. In this paper, we discuss four important issues of practical relevance that can bias the computation of entropy if not properly handled. The first (zero effect) arises when precipitation and ephemeral streamflow data must be viewed as arising from a discrete-continuous hybrid distribution due to the occurrence of many zero values (e.g., days with no rain/no runoff). Second, in the widely used bin-counting method for estimation of PDF’s, significant error can be introduced if the bin width is not carefully selected. The third (measurement effect) arises due to the fact that continuously varying hydrologic variables can typically only be observed discretely to some degree of precision. The Fourth (skewness effect) arises when the distribution of a variable is significantly skewed. Here, we present an approach that can deal with all four of these issues, and test them with artificially generated and real hydrological data. The results indicate that the method is accurate and robust.
Article
With regional socio-economic development, the gap between water use demand and available water resources in arid and humid semi-arid areas becomes increasingly serious. In this study, the size of regional water use in the Guanzhong region and Shiyang river basin in northwest China are analyzed to identify important factors affecting it, with the aim of providing better and optimal development planning for the region. Information entropy is used to measure and characterize the diversity of regional water use. Agricultural development and meteorological factors are found to be the main issues affecting regional water use in both regions. A multiple-linear regression (MLR) model was built by applying correlation coefficient (R) and mutual information (MI) scores in the process. Results show that the low value of information entropy of water use in the Shiyang river basin is due to the high proportion of agriculture water use. Using input factors chosen by MI score was found to be the best model to simulate the change of regional water use in both regions. A method using an MLR model together with MI is shown to be able to quantify the relationship between the influencing factors and water use diversity with limited available data.
Article
Bell System Technical Journal, also pp. 623-656 (October)