Wastewater Effluent Prediction Based on Decision Tree

Digital Proceeding of ICOEST2013, Cappadocia
C. Ozdemir, S. Şahinkaya, E. Kalıpcı, M.K. Oden (editors)
Nevsehir, Turkey, June 18-21, 2013
U.Çelik*, Assist.Prof. N.Yurtay, C. Sertkaya
Computer Engineering Department, Faculty of Computer and Information Sciences, Institute of Arts and Sciences, Sakarya
University, TURKEY.
Esentepe Campus 54187, Sakarya, Turkey
(E-mails: ufuk.celik1@ogr.sakarya.edu.tr, nyurtay@sakarya.edu.tr, d085012051@sakarya.edu.tr)
ABSTRACT
Wastewater treatment systems speed up the natural cleansing process to achieve the desired treatment objectives. Predicting
the characteristics of the treated wastewater helps in configuring the existing process steps and is important for achieving
maximum process efficiency. In this study, a computer-aided decision tree based on the Gini algorithm is developed for
estimating important output parameters of wastewater such as pH, DBO, DQO, and SS. The dataset used in this study was
obtained from the University of California Irvine (UCI) Machine Learning library.
Keywords: wastewater, decision tree, classification.
1. INTRODUCTION
Wastewater treatment plants (WWTPs) need to be controlled properly to meet the required
discharge standards. Mismanagement of a WWTP causes significant problems for water, the
environment, and human health. Several complex processes take place in a WWTP, and these
processes are affected by many physical, biological, and chemical factors. The best treatment
performance can be achieved by monitoring these continuous process parameters and
controlling these factors. However, wastewater parameters behave non-linearly, and this
nonlinearity requires complex mathematical functions to estimate the parameters.
Predicting any of these parameters helps the operator to control the system and to take the
necessary precautions before a problem arises, so that the required treatment performance can be
achieved.
Many recent studies have applied intelligent methods to wastewater
treatment. This research addresses the prediction of WWTP parameters and the process control of
WWTPs. Simple prediction models based on neural networks were developed for Total
Suspended Solids (TSS) prediction in [1] and [2]. An NW multi-layer forward artificial neural
network soft sensing model was proposed for predicting the performance of, and controlling,
WWTP processes in [3]. An integrated neural-fuzzy process controller was developed to
control aeration in an Aerated Submerged Biofilm Wastewater Treatment Process in [4].
Self-organizing networks were designed for one-step-ahead prediction of the outputs of the
pre-precipitation stage of a wastewater treatment plant in [5]. An approach based on an agent with
learning capabilities was proposed for the N-ammonia removal process in [6].
A common characteristic of the proposed models is that they use more than one input parameter to
estimate a single output parameter. The success rate of such multiple-input, single-output models is
higher than that of others.
In this study, predictive decision tree models based on the Gini algorithm are presented
for the estimation of the effluent parameters PH, DBO, DQO, and SS. According to the test results,
the performance of the developed models is at a desirable level, with prediction accuracies between
57.81% and 81.25%.
2. METHOD
Data collection and analysis are essential for the successful development of a model. The
first step in the development process is collecting the data and storing it on a computer
in a regular format. The next step is data cleansing, the
process of detecting and correcting (or removing) corrupt or inaccurate records from a dataset
[7]. After this step, Principal Component Analysis (PCA) is performed to find the best input
parameters for each output parameter. The proposed decision tree models are then
developed and simulated, and the results obtained are discussed. These model
development steps are shown schematically in Figure 1.
Data Collection -> Data Cleansing -> PCA Analysis -> Decision Tree Model Development -> Model simulation and discussion
Figure 1. The steps of model development
2.1 Data Collection and Analysis
The data were obtained from the University of California Irvine (UCI) Machine Learning
library. The dataset was collected from the daily sensor measurements of an urban wastewater
treatment plant and contains 527 daily records [8]. 12 features (9 inputs and 3
outputs) were selected. The UCI dataset features used are shown in Table 1.
Table 1. Dataset attributes
No   Parameter   Description
1    Q-E         input flow to plant
2    ZN-E        input zinc to plant
3    PH-E        input pH to plant
4    DBO-E       input biological demand of oxygen to plant
5    DQO-E       input chemical demand of oxygen to plant
6    SS-E        input suspended solids to plant
7    SSV-E       input volatile suspended solids to plant
8    SED-E       input sediments to plant
9    COND-E      input conductivity to plant
10   DBO-S       output biological demand of oxygen
11   DQO-S       output chemical demand of oxygen
12   SS-S        output suspended solids
Examination of the dataset revealed missing and inaccurate values
in some records. These records were removed from the dataset, and the
study continued with the remaining 323 records. The ranges of the output parameters are shown in Figure 2.
Figure 2. Ranges of the output parameters
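The cleansing step described above can be sketched as follows. This is only a minimal illustration, not the authors' procedure: the sample rows are made up, the `?` missing-value marker and the `clean_records` helper are assumptions, and the sketch only drops records with unparseable or non-finite fields, since the paper does not state how "inaccurate" values were identified.

```python
import math

MISSING = "?"  # assumed marker for a missing value; adjust to the actual file


def clean_records(rows):
    """Keep only rows in which every field parses as a finite number."""
    cleaned = []
    for row in rows:
        try:
            values = [float(field) for field in row]
        except ValueError:
            # A field such as "?" or an empty string cannot be parsed:
            # treat the whole record as incomplete and drop it.
            continue
        if all(math.isfinite(v) for v in values):
            cleaned.append(values)
    return cleaned


# Hypothetical records (Q-E, PH-E, DBO-E); not taken from the real dataset.
raw = [
    ["44101", "7.8", "205"],
    ["39024", MISSING, "242"],
    ["32229", "7.6", "nan"],
    ["35023", "7.9", "208"],
]

kept = clean_records(raw)
print(f"{len(kept)} of {len(raw)} records kept")
```

Dropping whole records, as the paper does, is the simplest option; it trades dataset size (527 records down to 323 here) for not having to impute values.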
After the data cleansing process, principal component analysis (PCA) was performed
for each output parameter. PCA is known as a variable reduction procedure [9]. The
goal is to select the input parameters that are highly correlated with the output parameter. This
frees the system from unnecessary input parameters and improves its performance
[10, 11].
In this study, a linear correlation method is used for the PCA analysis [12]. The correlation
equation is as follows:
$$\rho_{X,Y} = \frac{E[(X - \mu_X)(Y - \mu_Y)]}{\sigma_X \sigma_Y}$$
where E is the expected value operator, and X and Y are two random variables with expected
values $\mu_X$ and $\mu_Y$ and standard deviations $\sigma_X$ and $\sigma_Y$.
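As a rough illustration of the linear (Pearson) correlation coefficient referred to above, the sketch below estimates it from two equally long series by replacing expected values with sample means. The `pearson` function name and the pH readings are assumptions made for this example, not part of the study.

```python
import math


def pearson(xs, ys):
    """Sample estimate of rho = E[(X - mu_X)(Y - mu_Y)] / (sigma_X * sigma_Y)."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equally long sequences with at least 2 values")
    n = len(xs)
    mu_x = sum(xs) / n
    mu_y = sum(ys) / n
    cov = sum((x - mu_x) * (y - mu_y) for x, y in zip(xs, ys)) / n
    sigma_x = math.sqrt(sum((x - mu_x) ** 2 for x in xs) / n)
    sigma_y = math.sqrt(sum((y - mu_y) ** 2 for y in ys) / n)
    if sigma_x == 0 or sigma_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sigma_x * sigma_y)


# Made-up influent/effluent pH readings, for illustration only.
ph_in = [7.6, 7.8, 7.7, 7.9, 8.0]
ph_out = [7.5, 7.6, 7.6, 7.8, 7.7]
print(round(pearson(ph_in, ph_out), 2))
```

The result always lies between -1 and 1; values near 0, like most entries reported for this dataset, indicate a weak linear relationship.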
After the correlation process, the relationships between the input and output parameters were
found as given in Table 2.
Table 2. Correlation results
Input \ Output   PH-S    DBO-S   DQO-S   SS-S
Q-E               0.07    0.01   -0.06   -0.01
ZN-E             -0.14   -0.02    0.03   -0.04
PH-E              0.35    0.01    0.01   -0.06
DBO-E             0.00    0.15    0.28    0.15
DQO-E            -0.01    0.09    0.30    0.07
SS-E              0.10    0.02   -0.01    0.02
SSV-E            -0.11   -0.01    0.14   -0.02
SED-E             0.02    0.03    0.06    0.00
COND-E            0.10    0.02    0.17   -0.01
The correlation results summarize the relation between the input and output parameters:
The output parameter PH-S has a negative correlation with the input parameters ZN-E, DQO-E,
and SSV-E, and a non-negative correlation with the others.
The output parameter DBO-S has a negative correlation with the input parameters ZN-E and
SSV-E, and a positive correlation with the others.
The output parameter DQO-S has a negative correlation with the input parameters Q-E and
SS-E, and a positive correlation with the others.
The output parameter SS-S has a negative correlation with the input parameters Q-E, ZN-E,
PH-E, SSV-E, and COND-E, and a non-negative correlation with the others.
Parameters with a correlation above 0.05 were chosen for use in the models. Based on
these results, the decision tree models for each output parameter were determined as shown in
Figure 3.
Figure 3. Developed decision tree models
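The input selection rule (correlation above 0.05) can be sketched using the PH-S column of Table 2. The paper does not say whether the threshold applies to the signed value or to its magnitude, so the hypothetical `select_inputs` helper below shows both readings; neither should be taken as the authors' exact rule.

```python
# Correlations between each input and the PH-S output, copied from Table 2.
PH_S_CORRELATION = {
    "Q-E": 0.07, "ZN-E": -0.14, "PH-E": 0.35, "DBO-E": 0.00, "DQO-E": -0.01,
    "SS-E": 0.10, "SSV-E": -0.11, "SED-E": 0.02, "COND-E": 0.10,
}

THRESHOLD = 0.05  # selection threshold stated in the text


def select_inputs(correlations, threshold, use_magnitude):
    """Return the input names whose correlation exceeds the threshold."""
    selected = []
    for name, r in correlations.items():
        # Assumption: either compare the signed value or its absolute value.
        score = abs(r) if use_magnitude else r
        if score > threshold:
            selected.append(name)
    return selected


signed = select_inputs(PH_S_CORRELATION, THRESHOLD, use_magnitude=False)
by_magnitude = select_inputs(PH_S_CORRELATION, THRESHOLD, use_magnitude=True)
print("signed:   ", signed)
print("magnitude:", by_magnitude)
```

Under the signed reading four inputs are kept for PH-S; under the magnitude reading ZN-E and SSV-E are kept as well, since a strong negative correlation is as informative for a tree as a positive one.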
2.2 Gini Model
In this study, the decision tree was built with the Gini algorithm using RapidMiner
software. The Gini algorithm is a method based on binary splitting of the dataset. For each
attribute, the attribute values are divided into two groups, and each grouping defines a
candidate split with a left branch and a right branch. The number of samples of each class
that falls into each branch is then used in the Gini formulation [13]. The first step is to
calculate, for each attribute, the Gini values of the left and right branches. The expressions
Gini_left and Gini_right are calculated as follows:
L_i : number of samples of class i in the left branch
R_i : number of samples of class i in the right branch
k : the number of classes
T : the samples at the node
|T_left| : number of samples in the left branch
|T_right| : number of samples in the right branch
With these definitions, the following relations are calculated:
$$Gini_{left} = 1 - \sum_{i=1}^{k} \left( \frac{L_i}{|T_{left}|} \right)^2$$
$$Gini_{right} = 1 - \sum_{i=1}^{k} \left( \frac{R_i}{|T_{right}|} \right)^2$$
For each attribute j, with n the number of samples in the learning set at the node, the
following relation is then calculated [10]:
$$Gini_j = \frac{1}{n} \left( |T_{left}| \, Gini_{left} + |T_{right}| \, Gini_{right} \right)$$
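The Gini calculation for one candidate binary split can be sketched as below. The function names and the class counts are made up for illustration, and the division by n = |T_left| + |T_right| follows the usual size-weighted form, which is an assumption here because the normalizing factor is not legible in the source. This is not RapidMiner's implementation.

```python
def gini(class_counts):
    """Gini impurity 1 - sum_i (count_i / total)^2 of one branch."""
    total = sum(class_counts)
    if total == 0:
        return 0.0  # convention chosen here for an empty branch
    return 1.0 - sum((c / total) ** 2 for c in class_counts)


def gini_split(left_counts, right_counts):
    """Size-weighted Gini of a binary split.

    Computes (|T_left| * Gini_left + |T_right| * Gini_right) / n,
    where n = |T_left| + |T_right| (assumed normalization).
    """
    n_left = sum(left_counts)
    n_right = sum(right_counts)
    n = n_left + n_right
    if n == 0:
        raise ValueError("split contains no samples")
    return (n_left * gini(left_counts) + n_right * gini(right_counts)) / n


# Hypothetical node with k = 2 classes and 10 samples.
# left_counts[i] plays the role of L_i, right_counts[i] of R_i.
candidate_a = gini_split([4, 1], [1, 4])  # branches are mostly one class each
candidate_b = gini_split([3, 2], [2, 3])  # branches stay mixed
print(round(candidate_a, 2), round(candidate_b, 2))
```

A tree builder would evaluate this value for every candidate split and choose the one with the lowest result, so `candidate_a` would be preferred over `candidate_b` in this toy case.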
2.3 Experimental Results
Figure 4. Decision Tree for PH analysis
Table 3. ROC analysis for PH results
accuracy: 64.62%
                  true (7.3-7.5)   true (7.5-7.7)   true (7.7-7.9)   true (>7.9)   class precision
pred. (7.3-7.5)   0                0                0                0             0.00%
pred. (7.5-7.7)   2                10               4                4             50.00%
pred. (7.7-7.9)   1                5                32               7             71.11%
pred. (>7.9)      0                0                0                0             0.00%
class recall      0.00%            66.67%           88.89%           0.00%
For the decision tree analysis, the PH output values were grouped into classes spanning the range 7.3
to 7.9 in increments of 0.2. The overall prediction accuracy is 64.62%.
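The accuracy, class precision, and class recall figures can be recomputed from the counts in Table 3, which is laid out as a confusion matrix with predicted classes in rows and true classes in columns. The sketch below uses hypothetical helper names (`percent`, `summarize`) and reports 0% where a row or column is empty, which matches how the table shows classes that were never predicted.

```python
# Rows are predicted classes, columns are true classes, as in Table 3.
LABELS = ["7.3-7.5", "7.5-7.7", "7.7-7.9", ">7.9"]
MATRIX = [
    [0, 0, 0, 0],
    [2, 10, 4, 4],
    [1, 5, 32, 7],
    [0, 0, 0, 0],
]


def percent(numerator, denominator):
    """Ratio as a percentage; 0.0 when the denominator is zero."""
    return 100.0 * numerator / denominator if denominator else 0.0


def summarize(matrix):
    """Return (accuracy, per-class precision, per-class recall) in percent."""
    k = len(matrix)
    correct = sum(matrix[i][i] for i in range(k))
    total = sum(sum(row) for row in matrix)
    # Precision: correct predictions divided by the row (predicted) total.
    precision = [percent(matrix[i][i], sum(matrix[i])) for i in range(k)]
    # Recall: correct predictions divided by the column (true) total.
    recall = [percent(matrix[i][i], sum(row[i] for row in matrix)) for i in range(k)]
    return percent(correct, total), precision, recall


accuracy, precision, recall = summarize(MATRIX)
print(f"accuracy: {accuracy:.2f}%")
for label, p, r in zip(LABELS, precision, recall):
    print(f"{label:>8}  precision {p:6.2f}%  recall {r:6.2f}%")
```

The recomputed values agree with the table: 42 of 65 test samples lie on the diagonal, and the two outer classes are never predicted, which is why their precision and recall are zero.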
Figure 5. Decision Tree for DBO analysis
Table 4. ROC analysis for DBO results
accuracy: 81.25%
                true (0-15)   true (15-30)   true (30-45)   true (>45)   class precision
pred. (0-15)    11            2              0              0            84.62%
pred. (15-30)   8             41             2              0            80.39%
pred. (30-45)   0             0              0              0            0.00%
pred. (>45)     0             0              0              0            0.00%
class recall    57.89%        95.35%         0.00%          0.00%
For the decision tree analysis, the DBO output values were grouped into classes spanning the range 0
to 45 in increments of 15. The overall prediction accuracy is 81.25%.
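Turning a continuous output into the class labels used by the tree can be sketched as follows for DBO. The `to_class` helper and the readings are hypothetical, and the handling of values that fall exactly on a class edge is an assumption, since the paper does not describe it.

```python
import bisect

# Upper edges of the first three DBO classes; values above the last edge
# fall into the open-ended ">45" class.
EDGES = [15, 30, 45]
LABELS = ["0-15", "15-30", "30-45", ">45"]


def to_class(value, edges=EDGES, labels=LABELS):
    """Map a measurement to its class label.

    Assumption: a value equal to an edge belongs to the lower class
    (e.g. 15 -> "0-15"); the paper does not state how edges are handled.
    """
    if value < 0:
        raise ValueError("measurement must be non-negative")
    return labels[bisect.bisect_left(edges, value)]


# Made-up DBO-S readings, for illustration only.
for reading in [9.0, 15.0, 22.5, 44.9, 61.0]:
    print(reading, "->", to_class(reading))
```

The same helper would cover the other outputs by passing different `edges` and `labels`, for example edges 50, 75, 100 for DQO.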
Figure 6. Decision Tree for DQO analysis
Table 5. ROC analysis for DQO results
accuracy: 57.81%
                 true (0-50)   true (50-75)   true (75-100)   true (>100)   class precision
pred. (0-50)     2             1              1               0             50.00%
pred. (50-75)    3             16             6               1             61.54%
pred. (75-100)   0             2              16              11            55.17%
pred. (>100)     0             0              2               3             60.00%
class recall     40.00%        84.21%         64.00%          20.00%
For the decision tree analysis, the DQO output values were grouped into classes spanning the range 0
to 100, with class boundaries at 50, 75, and 100. The overall prediction accuracy is 57.81%.
Figure 7. Decision Tree for SS analysis
Table 6. ROC analysis for SS results
accuracy: 65.62%
                true (0-10)   true (11-20)   true (21-30)   true (>30)   class precision
pred. (0-10)    1             0              1              0            50.00%
pred. (11-20)   1             35             12             6            64.81%
pred. (21-30)   0             2              6              0            75.00%
pred. (>30)     0             0              0              0            0.00%
class recall    50.00%        94.59%         31.58%         0.00%
For the decision tree analysis, the SS output values were grouped into classes spanning the range 0
to 30 in increments of 10. The overall prediction accuracy is 65.62%.
3. CONCLUSIONS
Developing a diagnosis tool for controlling a wastewater treatment plant is an
interesting contribution to the field of intelligent systems applied to industrial processes.
It can be especially helpful to the plant manager.
The effluent quality of every wastewater treatment plant is expected to fulfill the
wastewater quality standard specified in government regulations. It is therefore important
to predict plant output values and to take preventive measures and make improvements for those outputs.
For this reason, the use of expert systems for wastewater treatment plant monitoring,
control, diagnosis, and assessment is a point of interest for researchers in the domain.
The decision tree accuracy results are moderate (between 57.81% and 81.25%) rather than ideal
for a wastewater treatment plant. The decision tree method used in this study is most useful
when the task is to assign samples to classes, and it needs well-represented classes in
order to give better results. Although wastewater plant datasets are not well suited to decision tree
algorithms, they can be adapted by dividing the output values into groups, at the cost of
some loss of accuracy. This problem might be addressed by combining the decision tree with other
algorithms into a hybrid approach that improves the accuracy.
Future work will consist of adding fuzzy logic, neural networks, or an artificial immune
system algorithm to the developed expert system, in order to obtain a better expert system for
wastewater treatment plant effluent assessment. Furthermore, the developed system can be
extended with a control component for wastewater treatment plant effluent quality control.
REFERENCES
[1] Belanche, L., Valdés, J. J., Comas, J., Roda, I. R., & Poch, M. (2000). Prediction of the
bulking phenomenon in wastewater treatment plants. Artificial Intelligence in
Engineering, 14(4), 307-317.
[2] Hanbay, D., Turkoglu, I., Demir, Y.(2008), Prediction of wastewater treatment plant
performance based on wavelet packet decomposition and neural networks, Expert
Systems with Applications, vol 43:2, pp:1038-1043.
[3] Zhang, R., Hu, X., Effluent Quality Prediction of Wastewater Treatment System Based on
Small-world ANN, JOURNAL OF COMPUTERS, VOL. 7, NO. 9, SEPTEMBER
2012
[4] Mingzhi, H., Jinquan, W., Yongwen, M., Yan,W., Weijiang, L., Xiaofei, S., Control rules
of aeration in a submerged biofilm wastewater treatment process using fuzzy neural
networks, Expert Systems with Applications, vol 36:7, 2009.
[5] Nilsson, S., Stathaki, A., King, R.E., Prediction of Wastewater Pre-Precipitation Variables
Using Self-Organizing Networks, IEEE International Symposium 2005, pp. 932- 937.
[6] Olmo, F.H., Llanes, F.H., Gaudioso, E., An emergent approach for the control of
wastewater treatment plants by means of reinforcement learning techniques, Expert
Systems with Applications, Volume 39, Issue 3, 2012, pp. 2355-2360.
[7] Prasad, K.H., Faruquie, T.A., Joshi, S., Chaturvedi, S., Subramaniam, L.V., Mohania, M.,
(2011), Data Cleansing Techniques for Large Enterprise Datasets, SRII Global
Conference (SRII), pp. 135-144.
[8] Machine Learning Repository (UCI), (2013). Available at
http://archive.ics.uci.edu/ml/datasets/Water+Treatment+Plant
[9] Smith, L., (2002).A tutorial on Principal Components Analysis, Available at
http://www.sccg.sk/~haladova/principal_components.pdf
[10] Oliveira-Esquerre, K.P., Mori, M., Bruns, R.E., (2002). Simulation of an industrial
wastewater treatment using artificial neural networks and principal components
analysis, Brazilian Journal
[11] Civelekoglu, G., (2006). The Modeling of Treatment Processes with Artificial
Intelligence and Multistatistical Methods, Doctorate Thesis, Suleyman Demirel
University, Turkey.
[12] Wikipedia_2 (2013). Correlation and dependence, Available at
http://en.wikipedia.org/wiki/Correlation_and_dependence.
[13] Y. Özkan, Data Mining Methods, Papatya Publishing, Turkey, 2008