Conference PaperPDF Available

Electricity Consumption Pattern Detection

  • VoltronData Inc

Abstract and Figures

Increase of the usage of domestic electrical devices causes higher electricity bills. Detecting power consumption patterns can be used to minimize the electricity bill. This paper discusses how to detect electricity consumption patterns of domestic power usage via data mining and clustering using machine learning techniques and detect the device. A prototype data acquisition device was developed to collect power consumption data and transmit them to a central database. The analysis is done using neural networks, support vector machines, K-Means, Mean-Shift and Silhouette classifiers to select a classifier with better performance in real time to detect devices. and record the power consumption of each device to get a detailed power consumption data set. By analyzing this data set, the device is indentified and then the classifications are done to detect patterns of device usage, which provide means to optimize the device handling to minimize the power consumption.
Content may be subject to copyright.
Annual Research Symposium (ARS)-2016
Faculty of Engineering, University of Ruhuna.
Electricity Consumption Pattern Detection
V.L. Abeykoon a , K. D. Nishadi a , R. G. A. Senevirathna a , R.G.A P.S. Ranaveera a ,
M.R. Udawalpola a,*
a Department of Electrical and Information Engineering, University of Ruhuna
* Corresponding Author:
Increase of the usage of domestic electrical devices causes higher electricity bills. Detecting power consumption
patterns can be used to minimize the electricity bill. This paper discusses how to detect electricity consumption
patterns of domestic power usage via data mining and clustering using machine learning techniques and detect the
A prototype data acquisition device was developed to collect power consumption data and transmit them to a central
database. The analysis is done using neural networks, support vector machines, K-Means, Mean-Shift and Silhouette
classifiers to select a classifier with better performance in real time to detect devices. and record the power
consumption of each device to get a detailed power consumption data set. By analyzing this data set, the device is
indentified and then the classifications are done to detect patterns of device usage, which provide means to optimize
the device handling to minimize the power consumption.
support vector machines, neural networks, K-Means, machine learning, optimization
1. Introduction
Management of energy consumption in domestic environment is very important due to high electrical energy prices
and impact of electricity production on the environment. Current records of power consumption reporting in Sri
Lanka [1] only provides an overall idea about the power consumption of electrical devices. A detailed survey and
analysis of power consumption of households in Sri Lanka haven't being conducted yet. The customers don't get a
clear idea on how the power consumption was carried out by each device and how each device contributes to the
overall power consumption. And also there are no data analysis methods to provide customers a forecasting [9][10]
of power consumption using history of power consumption of a consumer.
With the introduction of digital energy meters different electricity tariff will be applied to different times.
Automatic detecting and profiling electric appliances is necessary for optimizing energy usage. A data acquisition
device is used to extract power composition data for each device. As an intelligent approach, machine learning
techniques can be used to understand the meaning of a data set in a logical way and provide useful outputs from raw
data for different purposes. In this project, a few supervised and unsupervised leaning methods are compared and a
better classifier is chosen for the data clustering and prediction. In considering power consumption patterns, neural
networks and support vector machines are used as supervised learning methods to classify data and predict patterns.
And the advanced clustering algorithms like K-Means, Silhouette were used as unsupervised learning methods for
classifying data. A model will be created to classify electrical devices from each other in order to collect data from
each and every electrical device via the power consumption pattern.
2. Background
A device called smart plug is created to detect the power consumption from each device. And the smart plug
identifies each device using machine learning techniques [14] and classifies data for further analysis [2][3]. There
are many researches done to detect electrical devices in real time and some different researches were done to
optimize and predict the power consumption [4][5][8][9][10]. Here both these scenarios are addressed in order to
provide an advanced overview on power consumption at domestic level in Sri Lanka. There are researches done to
detect electrical devices in real time to collect data with better classification in the earlier stages of data acquisition
[5] to provide a solid foundation for data analysis purposes. And later on neural networks and classification
algorithms are used to detect consumption patterns and forecast power consumption based on the history of power
Support Vector Machines (SVM) can be used as a fast classifier [17] which is a supervised learning technique and it
can converge faster and provide classification to a given data set. Artificial Neural Networks or ANN [18] can be
used for classification of data sets based on supervision. Data can be input to the neural network and the network can
be trained. There are hidden layers in the network which do the processing after providing data from an input layer.
The output layer can provide necessary predictions based on a testing data set.
K-Means Classification uses the mean [15] distance between clustering centers of a particular data set and the
algorithm itself adapts and changes the centers based on the new input data and dynamic classification can be done
by this algorithm. This algorithm is useful in unsupervised pattern detection. Silhouette Classification is similar to
the K-Means and the difference is this algorithm provides a graphical output [14] of classification and it learns itself
and provides classification in an unsupervised manner. Mean shift classification [15] provides a classification output
based on calculating mean between the centers of classification and this algorithm is also an unsupervised way of
classifying and predicting.
3. Experimental Analysis
A current transformer is used to measure the current and AC-AC adapter for the measurement of voltage. These
readings are directly fed to the micro-controller and there the real power, apparent power, vrms, irms and power
factor is calculated.
In analyzing the power consumption in domestic level the present method in Sri Lanka only provides an overall
view of the power consumption and cost. On the other hand the separate data extraction from devices provides more
detailed view of consumption. There are main factors that have to be considered, before creating a mathematical
model to analyze the data.
Devices like fans, irons, washing machines, refrigerators possess different modes of action. The complex behaviour
in these devices is being sorted out by recording data for each and every mode of action. So additional data
acquisition is done here. In the usage many devices are used in the same time, so there has to be a way of sorting out
power consumption from each device. This challenge is solved by the data acquisition device which collects data
from devices connected to it. The constraint is there has to be at least one data acquisition device for each and every
device. The data acquisition should not depend on the external factors like temperature, electromagnetic
interference, and communication failures. In some cases the voltage drops can occur due to many reasons and the
effect on the data collected must be verified based on the given scenario.
Electrical devices doesn't consume power in a constant way, the consumption fluctuates around an average value. In
the first few milliseconds of acquiring power it is not in the stabilized condition it takes a short time to get stabilized.
This stabilized data is the useful data to detect devices.
The smart data acquisition system extracts data and by using
clustering and classification algorithms these data is
being categorized. In clustering and classification process a unique signature from each device is being identified. In
classification process, main parameters; active power, apparent power, vrms, irms and phase shift are taken in to
consideration. All the data sets were collected from stabilized mode of operation. So the detection of devices starts
from stabilized mode. And also different mode of actions shows different profiles for the same device. So complex
and accurate analysis methods are important to differentiate these conditions.
Table 1 Data Acquisition Sample
Active Power
Reactive Power
Phase Shift
Device Name
Fan(Mode I)
Blender (Mode I)
Fig. 1. Accuracy of Data Acquisition Fig. 2. Multiple mode of a device
Fig. 3. Stabilizing Power Acquisition
Supervised Learning
In this analysis support vector machines (SVM), artificial neural networks are mainly used. In the usage of SVM and
ANN, a data set obtained from the smart data acquisition device was used for the training of these algorithms. In
ANN, the model was designed with 100 hidden layers and single input and output layers were used. And initially
provided a learning rate of 0.001 by testing data for different learning rates and hidden layers. These two values
provided a promising accuracy to the dataset.
Unsupervised Learning
In unsupervised learning the output group is decided by the algorithm itself. The only input to the algorithm is a
training data set and it clusters the inputs by itself. In this K-Means, Mean-Shift and Silhouette algorithms were used
to test the data set and classify data. The K-Means algorithm provides a higher accurate results than other two
algorithms. The mean-shift algorithm deviates from higher accuracy when the number of devices get increased in
the data set. But the Silhouette classifier works with higher accuracy and it provides graphically the nature of the
classification very clearly than other methods. Performance Analysis of supervised and unsupervised learning
methods showed that K-Means provide a far better performance than any other classifier and it doesn't need any
supervision. The following analysis was done for a data set of 100 household electrical devices.
3. Results and Discussion
Performance Analysis
Considering the efficiency of the algorithm and the accuracy of the prediction of SVM, K-Means and Silhouette
classifiers provide acceptable results as shown in Table 02. Considering these algorithms the K-Means classifier
shows an advantage over other classifiers.
Table 2 Data Acquisition and Performance
Device Usage Pattern Detection
Different devices have its own ways of usage depending on the user's behavior and other external factors. Basically
when understanding patterns, the Silhouette and K-Means clustering algorithms were used to detect patterns. The
pattern detection is done under three categories hour based, weekly based and month based pattern detection.
Hour based consumption pattern detects the patterns regarding the 24 hours of a particular day. By using the
clustering algorithms; Silhouette and K-Means these groups can be detected with an accuracy of 99% for a data set
of 50 different devices functioning in within a period of 3 months. Week based patterns demonstrates the week by
usage patterns of devices. Week based pattern detection provides the usage of a particular device in the seven days
of the week. There can be same pattern for couple of devices and there can be unique patterns for devices. These
patterns are detected by K-Means, Silhouette and Support Vector Machines algorithms. The accuracy is over 98% as
an average of the pattern detection and clustering. Month based pattern detection would be significant if a device is
Accuracy (%)
Execution (s)
Mean Shift
used in some special days of the month. It can be a periodic pattern or sometimes it can be one or two occurrences
per month. So there are instances this analysis is useful.
Fig. 4. Hour Based Pattern Recognition Fig. 5. Week Based Pattern Recognition
In pattern detection, the Silhouette and K-Means algorithms provide a better way of clustering patterns. The device
usage pattern is important to get an idea on how the consumers use the devices in order to find a way to minimize
the cost on power consumption.
Fig. 3. Classification Diagram of 15 devices
4. Conclusions
By the data collected and patterns detected, it is clear that there is a unique signature for each electrical device and
these signatures can be used to identify each device uniquely with a promising accuracy. There is no significant
difference in accuracy but performance of some algorithms are better.
Ability to predict the device provides the opportunity to filter power consumption data from overall consumption
data. In addition to that each device has its way of functioning based on human involvement. It gives a pattern to
every device when considering the activation period of all devices. To detect these patterns accurately we need to
train our algorithms for long period of time. Detecting these patterns leads to understand power consumption
patterns and provide better patterns to the consumer to minimize high frequency of usage and provide better
performance with less frequency of usage and less electricity usage.
Algorithms can be developed to suggest tips for consumers for what will improve usage and minimize cost. The
optimization techniques can be used based on user preferences or by detecting patterns of devices and suggesting
optimum devices out of devices performing same function in different ways. In addition to that the research data can
provide users an overview of power consumption of each device in an organized way to get a better understanding
about the way the power is being consumed by each device in domestic level.
5. References
[1] Public Utilities Commision Of Sri Lanka, ‘electricity consumption patterns of consumers in Sri Lanka-2011’,
2013. [Online]. Available:
[2] Shekarabi S.M et al. Smart plugs for building energy
ems in
and Machines Res.
Lab., Univ. of Tehran, Tehran, Iran ,(2015, June, 1),Available:
[3] Mihai Sanduleac,(2015, April,12), Smart
Meter, Av
[4] Jin Xiao et al (2012, September, 25-27) in Network Operations and Managemen
(APNOMS), 2012 14th
Wi-Fi energy
on smart
[5] Search Data Center: Smart Power and Data Center. [Online] Available:
[Accessed: 01-Jun-2015
[6] Maryam Jamela Ismail, Rosdiazil Ibrahim, Idris Ismail, (2011, January IEEE), Available:
[7 ] Hyojoo Son, Changwan Kim, (2015) , Forecasting Short-Term Electricity Deman in Residential Sector
Based on Support Vector Regression and Fuzzy-rough Feature Selection with Particle Swarm Optimization
Available :
[8] Jacky Kelly and William Knottenblet, UK-DALE: A dataset recording UK Domestic Appliance-Level
Electricity demand and whole-house demand, Available:
[9] F.T Romero, J.C.J. Hernandez and W.G Lopez, Predicting Electricity Consumption Using Neural
[10] Sunil Mallya, Entracker: Energy Tracker for Homes, (Spring 2011),
[11] Damien Zufferey,Christophe Gisler, Omar Abou Khaled, Jean Hennebert, Machine Learning Approaches
For Electric Appliance Classification,(2012,IEEE), Available :
[12] Bogdan Oancea, Stefan Cristian Ciucu, Time Series Forecasting Using Neural Networks, Available:
[13] R.J Frank, N.Davey, S.P Hunt, Time Series Prediction Neural Networks,
[14] Peter J. Rousseeuw, (1986,November,27), Silhouette: a graphical aid to interpretation and validation of
cluster analysis, Available:
[15] Tapas Kanungo, David M. Mount, Nathan S. NetanyahuChristine D. Piatko, Ruth Silverman, and Angela Y.
Wu. An Efficient k-Means Clustering Algorithm: Analysis and Implementation, Available :
[16]Dorin Comaniciu , and Peter Meer, Mean Shift: A Robust Approach Toward Feature Space
Analysis,(2002,May), Available :
[17] Christopher J.C Burgers, A Tutorial On Support Vector Machines for Pattern Recognition, Available:
[18] Sonali B. Maind, Priyanka Wankar, Research Paper On Basic of Artificial Neural Network,
... where T w is the length of weekly segment along time axis. The reason for considering the long-term correlations is that the work of [36] and [37] have found that electricity usage shows weekly patterns periodically in one certain period . The framework of the proposed MG-ASTGCN is illustrated in Fig. 2, including graph transformation operation, multigrained vector constructor, and ASTGCN. ...
Full-text available
Renewable energy resources (RERs) have been increasingly integrated into modern power systems, especially in large-scale distribution networks (DNs). In this paper, we propose a deep reinforcement learning (DRL)-based approach to dynamically search for the optimal operation point, i.e., optimal power flow (OPF), in DNs with a high uptake of RERs. Considering uncertainties and voltage fluctuation issues caused by RERs, we formulate OPF into a multi-objective optimization (MOO) problem. To solve the MOO problem, we develop a novel DRL algorithm leveraging the graphical information of the distribution network. Specifically, we employ the state-of-the-art DRL algorithm, i.e., deep deterministic policy gradient (DDPG), to learn an optimal strategy for OPF. Since power flow reallocation in the DN is a consecutive process, where nodes are self-correlated and interrelated in temporal and spatial views, to make full use of DNs' graphical information, we develop a multi-grained attention-based spatial-temporal graph convolution network (MG-ASTGCN) for spatial-temporal graph information extraction, preparing for its sequential DDPG. We validate our proposed DRL-based approach in modified IEEE 33, 69, and 118-bus radial distribution systems (RDSs) and show that our DRL-based approach outperforms other benchmark algorithms. Our experimental results also reveal that MG-ASTGCN can significantly accelerate the DDPG training process and improve DDPG's capability in reallocating power flow for OPF. The proposed DRL-based approach also promotes DNs' stability in the presence of node faults, especially for large-scale DNs.
This research coveys a comparative analysis between Urban Building energy model (UBEM) generated by scholar, researchers, and professional in academia and industry while highlighting the reliable main components to manifest a successful and reliable UBEM technologies. Nevertheless, it consolidates distributed generation on building blocks rather than a whole district relying on renewable energy sources. It guides engineers through energy system model simulation on Openmodelica platform to feed green sustained communities. Moreover, energy use-pattern is mapped and analyzed by internet of things (IOT) technologies to fine-tune energy uses and refine use-pattern. Demonstrating artificial Intelligence (AI) algorithmto predict energy consumption can reflect on the amount of energy required for storage to cover energy needs. AI shapes a robust positive energy district (PED) through storinggenerated renewable solar or bio-energy to cover predicted energy use-pattern.Distributed -power-plant stations capacity to cover clusters using AI in predicting energy consumption consolidates on-site energy generation recommended by multiple International rating systems. AI-based Energy management plan guide engineers and planners to design distributed-power-plants of energy generation capacity lies between the actual energy need and a predicted scenario.
Full-text available
The aim of this study is to provide a precise model for one-month-ahead forecast of electricity demand in residential sector. In this study, a total of 20 influential variables are taken into account including monthly electricity consumption, 14 weather variables, and 5 social variables. Based on support vector regression and fuzzy-rough feature selection with particle swarm optimization algorithms, the proposed method established a model with variables that relate to the forecast without ignoring some of these variables one may inevitably lead to forecasting errors. The proposed forecasting model was validated using historical data from South Korea. Its time period was from January 1991 to December 2012. The first 240 months were used for training and the remaining 24 for testing. The performance was evaluated using MAPE, MAE, RMSE, MBE, and UPA values. Furthermore, it was compared with that obtained from the artificial neural network, auto-regressive integrated moving average, multiple linear regression models, and the methods proposed in the previous studies, and found superior for every performance measure considered in this study. The proposed method has an advantage over the previous methods because it automatically determines appropriate and necessary variables for a reliable forecast. It is expected that the proposed model can contribute to more accurate forecasting of short-term electricity demand in residential sector. The ability to accurately forecast short-term electricity demand can assist power system operators and market participants in ensuring sustainable electricity planning decisions and secure electricity supply to the consumers.
Full-text available
Many countries are rolling out smart electricity meters. These measure a home's total power demand. However, research into consumer behaviour suggests that consumers are best able to improve their energy efficiency when provided with itemised, appliance-by-appliance consumption information. Energy disaggregation is a computational technique for estimating appliance-by-appliance energy consumption from a whole-house meter signal. To conduct research on disaggregation algorithms, researchers require data describing not just the aggregate demand per building but also the `ground truth' demand of individual appliances. In this context, we present UK-DALE: an open-access dataset from the UK recording Domestic Appliance-Level Electricity at a sample rate of 16 kHz for the whole-house and at 1/6 Hz for individual appliances. This is the first open access UK dataset at this temporal resolution. We recorded from five houses, one of which was recorded for 655 days, the longest duration we are aware of for any energy dataset at this sample rate. We also describe the low-cost, open-source, wireless system we built for collecting our dataset.
Conference Paper
Full-text available
Due to rising value of energy supplies, the need for managing electrical energy sources becomes prominent. In this paper, a smart plug device is presented; which provides features to measure energy consumption and recognizes the type of attached electrical devices. Furthermore, the proposed framework enables home controller systems to monitor and control home energy consumption efficiently. Subsequently an analysis of energy consumption of a pilot house, prior and after the installation of the smart plug, based on a hypothetical scenario, is provided. The results present positive impacts of using the smart plug on energy consumption rate; especially during peak hours.
Full-text available
Recent studies have shown the classification and prediction power of the Neural Networks. It has been demonstrated that a NN can approximate any continuous function. Neural networks have been successfully used for forecasting of financial data series. The classical methods used for time series prediction like Box-Jenkins or ARIMA assumes that there is a linear relationship between inputs and outputs. Neural Networks have the advantage that can approximate nonlinear functions. In this paper we compared the performances of different feed forward and recurrent neural networks and training algorithms for predicting the exchange rate EUR/RON and USD/RON. We used data series with daily exchange rates starting from 2005 until 2013.
Full-text available
We report on the development of an innovative system which can automatically recognize home appliances based on their electric consumption profiles. The purpose of our system is to apply adequate rules to control electric appliance in order to save energy and money. The novelty of our approach is in the use of plug-based low-end sensors that measure the electric consumption at low frequency, typically every 10 seconds. Another novelty is the use of machine learning approaches to perform the classification of the appliances. In this paper, we present the system architecture, the data acquisition protocol and the evaluation framework. More details are also given on the feature extraction and classification models being used. The evaluation showed promising results with a correct rate of identification of 85\%.
Conference Paper
Full-text available
Vast majority of the services running on the smart phone today are networked in that significant amount of com-munication is required. Smart phone energy expenditure due to Wi-Fi communications constitutes significant portion of the battery discharge. In this paper, we first investigate the key factors influencing Wi-Fi energy consumption, and propose three energy management schemes: 1) Dynamic control of Wi-Fi on/off interface; 2) improve communication efficiency via application packing; and 3) elongation of Wi-Fi Power Save Mode (PSM) via application alignment under mixed application workload. We also design and test our solution as a device-side application utilizing general system process scheduling and network firewall techniques. As the result, our solution is easy to deploy, applicable to most mobile devices, and we explicitly tackle the challenging case of download management. Through extensive experimenta-tion and solution prototyping, we show the effectiveness of device side Wi-Fi energy management and the importance of considering application characteristics. Index Terms—smart phone, energy management, design and experimentation
Full-text available
Prediction of some events affects both (1) decisions of a company and (2) the planning of resources for a greater and more efficient production. Moreover, knowledge of the future events allows taking preventive measures. Therefore in this work the primary goal is to realize prediction for a set of data, which correspond to the monthly maximum demand for electricity in distribution substation supplied by the Mexican Federal Commission of Electricity (Comisión Federal de Electricidad). This prediction is performed by artificial neural networks using backpropagation as learning algorithm, further compared these predictions with those obtained by time series methodology of Box and Jenkins.
In k\hbox{-}{\rm{means}} clustering, we are given a set of n data points in d\hbox{-}{\rm{dimensional}} space {\bf{R}}^d and an integer k and the problem is to determine a set of k points in {\bf{R}}^d, called centers, so as to minimize the mean squared distance from each data point to its nearest center. A popular heuristic for k\hbox{-}{\rm{means}} clustering is Lloyd's algorithm. In this paper, we present a simple and efficient implementation of Lloyd's k\hbox{-}{\rm{means}} clustering algorithm, which we call the filtering algorithm. This algorithm is easy to implement, requiring a kd-tree as the only major data structure. We establish the practical efficiency of the filtering algorithm in two ways. First, we present a data-sensitive analysis of the algorithm's running time, which shows that the algorithm runs faster as the separation between clusters increases. Second, we present a number of empirical studies both on synthetically generated data and on real data sets from applications in color quantization, data compression, and image segmentation.