A Combined Deep Learning and Ensemble Learning Methodology to Avoid Electricity Theft in Smart Grids

Zeeshan Aslam; Nadeem Javaid; Ashfaq Ahmad; Abrar Ahmed; Sardar Muhammad Gulfam

doi:10.3390/en13215599

A Combined Deep Learning and Ensemble Learning Methodology to Avoid Electricity Theft in Smart Grids

Energies ◽

10.3390/en13215599 ◽

2020 ◽

Vol 13 (21) ◽

pp. 5599

Author(s):

Zeeshan Aslam ◽

Nadeem Javaid ◽

Ashfaq Ahmad ◽

Abrar Ahmed ◽

Sardar Muhammad Gulfam

Keyword(s):

Deep Learning ◽

Real Time ◽

Ensemble Learning ◽

Smart Grids ◽

Power Efficiency ◽

Short Term Memory ◽

Support Vector ◽

Smart Meters ◽

Adaptive Boosting ◽

Electricity Theft

Electricity is widely used around 80% of the world. Electricity theft has dangerous effects on utilities in terms of power efficiency and costs billions of dollars per annum. The enhancement of the traditional grids gave rise to smart grids that enable one to resolve the dilemma of electricity theft detection (ETD) using an extensive amount of data formulated by smart meters. This data are used by power utilities to examine the consumption behaviors of consumers and to decide whether the consumer is an electricity thief or benign. However, the traditional data-driven methods for ETD have poor detection performances due to the high-dimensional imbalanced data and their limited ETD capability. In this paper, we present a new class balancing mechanism based on the interquartile minority oversampling technique and a combined ETD model to overcome the shortcomings of conventional approaches. The combined ETD model is composed of long short-term memory (LSTM), UNet and adaptive boosting (Adaboost), and termed LSTM–UNet–Adaboost. In this regard, LSTM–UNet–Adaboost combines the advantages of deep learning (LSTM-UNet) along with ensemble learning (Adaboost) for ETD. Moreover, the performance of the proposed LSTM–UNet–Adaboost scheme was simulated and evaluated over the real-time smart meter dataset given by the State Grid Corporation of China. The simulations were conducted using the most appropriate performance indicators, such as area under the curve, precision, recall and F1 measure. The proposed solution obtained the highest results as compared to the existing benchmark schemes in terms of selected performance measures. More specifically, it achieved the detection rate of 0.92, which was the highest among existing benchmark schemes, such as logistic regression, support vector machine and random under-sampling boosting technique. Therefore, the simulation outcomes validate that the proposed LSTM–UNet–Adaboost model surpasses other traditional methods in terms of ETD and is more acceptable for real-time practices.

Download Full-text

Malicious Node Detection in Smart Grid Networks

10.5121/csit.2021.110716 ◽

2021 ◽

Author(s):

Faisal Y Al Yahmadi ◽

Muhammad R Ahmed

Keyword(s):

Smart Grid ◽

Smart Grids ◽

Electricity Consumption ◽

Cyber Attacks ◽

Support Vector ◽

Smart Meters ◽

Malicious Nodes ◽

High Detection Rate ◽

Detection Model ◽

Electricity Theft

Many countries around the world are implementing smart grids and smart meters. Malicious users that have moderate level of computer knowledge can manipulate smart meters and launch cyber-attacks. This poses cyber threats to network operators and government security. In order to reduce the number of electricity theft cases, companies need to develop preventive and protective methods to minimize the losses from this issue. In this paper, we propose a model based on software that detects malicious nodes in a smart grid network. The model collects data (electricity consumption/electric bill) from the nodes and compares it with previously obtained data. Support Vector Machine (SVM) model is implemented to classify nodes into good or malicious nodes by (high dimensional) giving the statues of 1 for good nodes and status of -1 for malicious (abnormal) nodes. The detection model also displays the network graphically as well as the data table. Moreover, this model displays the detection error in each cycle. It has a very low false alarm rate (2%) and a high detection rate as high as (98%). Future developments can trace the attack origin to eliminate or block the attack source minimizing losses before human control arrives.

Download Full-text

Deep Learning Application to Ensemble Learning—The Simple, but Effective, Approach to Sentiment Classifying

Applied Sciences ◽

10.3390/app9132760 ◽

2019 ◽

Vol 9 (13) ◽

pp. 2760 ◽

Cited By ~ 4

Author(s):

Khai Tran ◽

Thi Phan

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Ensemble Learning ◽

Language Processing ◽

Short Term Memory ◽

Learning Model ◽

Sentiment Classification ◽

Machine Learning Techniques ◽

Support Vector ◽

Deep Learning Model

Sentiment analysis is an active research area in natural language processing. The task aims at identifying, extracting, and classifying sentiments from user texts in post blogs, product reviews, or social networks. In this paper, the ensemble learning model of sentiment classification is presented, also known as CEM (classifier ensemble model). The model contains various data feature types, including language features, sentiment shifting, and statistical techniques. A deep learning model is adopted with word embedding representation to address explicit, implicit, and abstract sentiment factors in textual data. The experiments conducted based on different real datasets found that our sentiment classification system is better than traditional machine learning techniques, such as Support Vector Machines and other ensemble learning systems, as well as the deep learning model, Long Short-Term Memory network, which has shown state-of-the-art results for sentiment analysis in almost corpuses. Our model’s distinguishing point consists in its effective application to different languages and different domains.

Download Full-text

A Novel Ensemble Learning Approach of Deep Learning Techniques to Monitor Distracted Driver Behaviour in Real Time

2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA) ◽

10.1109/caida51941.2021.9425243 ◽

2021 ◽

Author(s):

Hafiz Umer Draz ◽

Muhammad Zeeshan Khan ◽

Muhammad Usman Ghani Khan ◽

Amjad Rehman ◽

Ibrahim Abunadi

Keyword(s):

Deep Learning ◽

Real Time ◽

Ensemble Learning ◽

Learning Approach ◽

Driver Behaviour ◽

Learning Techniques

Download Full-text

PlncRNA-HDeep: plant long noncoding RNA prediction using hybrid deep learning based on two encoding styles

BMC Bioinformatics ◽

10.1186/s12859-020-03870-2 ◽

2021 ◽

Vol 22 (S3) ◽

Author(s):

Jun Meng ◽

Qiang Kang ◽

Zheng Chang ◽

Yushi Luan

Keyword(s):

Deep Learning ◽

Noncoding Rna ◽

Nearest Neighbor ◽

Short Term Memory ◽

Biological Activities ◽

Support Vector ◽

Multiple Perspectives ◽

K Nearest Neighbor ◽

Rna Sequences ◽

Deep Learning Model

Abstract Background Long noncoding RNAs (lncRNAs) play an important role in regulating biological activities and their prediction is significant for exploring biological processes. Long short-term memory (LSTM) and convolutional neural network (CNN) can automatically extract and learn the abstract information from the encoded RNA sequences to avoid complex feature engineering. An ensemble model learns the information from multiple perspectives and shows better performance than a single model. It is feasible and interesting that the RNA sequence is considered as sentence and image to train LSTM and CNN respectively, and then the trained models are hybridized to predict lncRNAs. Up to present, there are various predictors for lncRNAs, but few of them are proposed for plant. A reliable and powerful predictor for plant lncRNAs is necessary. Results To boost the performance of predicting lncRNAs, this paper proposes a hybrid deep learning model based on two encoding styles (PlncRNA-HDeep), which does not require prior knowledge and only uses RNA sequences to train the models for predicting plant lncRNAs. It not only learns the diversified information from RNA sequences encoded by p-nucleotide and one-hot encodings, but also takes advantages of lncRNA-LSTM proposed in our previous study and CNN. The parameters are adjusted and three hybrid strategies are tested to maximize its performance. Experiment results show that PlncRNA-HDeep is more effective than lncRNA-LSTM and CNN and obtains 97.9% sensitivity, 95.1% precision, 96.5% accuracy and 96.5% F1 score on Zea mays dataset which are better than those of several shallow machine learning methods (support vector machine, random forest, k-nearest neighbor, decision tree, naive Bayes and logistic regression) and some existing tools (CNCI, PLEK, CPC2, LncADeep and lncRNAnet). Conclusions PlncRNA-HDeep is feasible and obtains the credible predictive results. It may also provide valuable references for other related research.

Download Full-text

Deep Learning Methods for Classification of Certain Abnormalities in Echocardiography

Electronics ◽

10.3390/electronics10040495 ◽

2021 ◽

Vol 10 (4) ◽

pp. 495

Author(s):

Imayanmosha Wahlang ◽

Arnab Kumar Maji ◽

Goutam Saha ◽

Prasun Chakrabarti ◽

Michal Jasinski ◽

...

Keyword(s):

Deep Learning ◽

Short Term Memory ◽

Support Vector ◽

Variational Autoencoder ◽

Different Types ◽

Static Images ◽

Long Short Term Memory ◽

2D And 3D ◽

Better Than

This article experiments with deep learning methodologies in echocardiogram (echo), a promising and vigorously researched technique in the preponderance field. This paper involves two different kinds of classification in the echo. Firstly, classification into normal (absence of abnormalities) or abnormal (presence of abnormalities) has been done, using 2D echo images, 3D Doppler images, and videographic images. Secondly, based on different types of regurgitation, namely, Mitral Regurgitation (MR), Aortic Regurgitation (AR), Tricuspid Regurgitation (TR), and a combination of the three types of regurgitation are classified using videographic echo images. Two deep-learning methodologies are used for these purposes, a Recurrent Neural Network (RNN) based methodology (Long Short Term Memory (LSTM)) and an Autoencoder based methodology (Variational AutoEncoder (VAE)). The use of videographic images distinguished this work from the existing work using SVM (Support Vector Machine) and also application of deep-learning methodologies is the first of many in this particular field. It was found that deep-learning methodologies perform better than SVM methodology in normal or abnormal classification. Overall, VAE performs better in 2D and 3D Doppler images (static images) while LSTM performs better in the case of videographic images.

Download Full-text

Real-Time Detection of Dictionary DGA Network Traffic Using Deep Learning

SN Computer Science ◽

10.1007/s42979-021-00507-w ◽

2021 ◽

Vol 2 (2) ◽

Author(s):

Kate Highnam ◽

Domenic Puzio ◽

Song Luo ◽

Nicholas R. Jennings

Keyword(s):

Neural Network ◽

Deep Learning ◽

Real Time ◽

Network Traffic ◽

Short Term Memory ◽

Domain Names ◽

Control Networks ◽

Detection Techniques ◽

Lstm Network ◽

And Control

AbstractBotnets and malware continue to avoid detection by static rule engines when using domain generation algorithms (DGAs) for callouts to unique, dynamically generated web addresses. Common DGA detection techniques fail to reliably detect DGA variants that combine random dictionary words to create domain names that closely mirror legitimate domains. To combat this, we created a novel hybrid neural network, Bilbo the “bagging” model, that analyses domains and scores the likelihood they are generated by such algorithms and therefore are potentially malicious. Bilbo is the first parallel usage of a convolutional neural network (CNN) and a long short-term memory (LSTM) network for DGA detection. Our unique architecture is found to be the most consistent in performance in terms of AUC, $$F_1$$ F 1 score, and accuracy when generalising across different dictionary DGA classification tasks compared to current state-of-the-art deep learning architectures. We validate using reverse-engineered dictionary DGA domains and detail our real-time implementation strategy for scoring real-world network logs within a large enterprise. In 4 h of actual network traffic, the model discovered at least five potential command-and-control networks that commercial vendor tools did not flag.

Download Full-text

A semiautomatic annotation approach for sentiment analysis

Journal of Information Science ◽

10.1177/01655515211006594 ◽

2021 ◽

pp. 016555152110065

Author(s):

Rahma Alahmary ◽

Hmood Al-Dossari

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Short Term Memory ◽

Machine Learning Algorithms ◽

Support Vector ◽

Short Term ◽

Term Memory ◽

Annotation Process ◽

Learning Classifiers ◽

Long Short Term Memory

Sentiment analysis (SA) aims to extract users’ opinions automatically from their posts and comments. Almost all prior works have used machine learning algorithms. Recently, SA research has shown promising performance in using the deep learning approach. However, deep learning is greedy and requires large datasets to learn, so it takes more time for data annotation. In this research, we proposed a semiautomatic approach using Naïve Bayes (NB) to annotate a new dataset in order to reduce the human effort and time spent on the annotation process. We created a dataset for the purpose of training and testing the classifier by collecting Saudi dialect tweets. The dataset produced from the semiautomatic model was then used to train and test deep learning classifiers to perform Saudi dialect SA. The accuracy achieved by the NB classifier was 83%. The trained semiautomatic model was used to annotate the new dataset before it was fed into the deep learning classifiers. The three deep learning classifiers tested in this research were convolutional neural network (CNN), long short-term memory (LSTM) and bidirectional long short-term memory (Bi-LSTM). Support vector machine (SVM) was used as the baseline for comparison. Overall, the performance of the deep learning classifiers exceeded that of SVM. The results showed that CNN reported the highest performance. On one hand, the performance of Bi-LSTM was higher than that of LSTM and SVM, and, on the other hand, the performance of LSTM was higher than that of SVM. The proposed semiautomatic annotation approach is usable and promising to increase speed and save time and effort in the annotation process.

Download Full-text

A hybrid deep learning and ensemble learning mechanism for damaged power line detection in smart grids

Soft Computing ◽

10.1007/s00500-021-06482-x ◽

2021 ◽

Author(s):

Yangyang Tian ◽

Qi Wang ◽

Zhimin Guo ◽

Huitong Zhao ◽

Sulaiman Khan ◽

...

Keyword(s):

Deep Learning ◽

Ensemble Learning ◽

Smart Grids ◽

Power Line ◽

Line Detection ◽

Learning Mechanism

Download Full-text

Comparative analysis of Kernel-based versus BFGS-ANN and deep learning methods in monthly reference evaporation estimation

10.5194/hess-2020-224 ◽

2020 ◽

Author(s):

Mohammad Taghi Sattari ◽

Halit Apaydin ◽

Shahab Shamshirband ◽

Amir Mosavi

Keyword(s):

Deep Learning ◽

Minimum Temperature ◽

Meteorological Parameters ◽

Short Term Memory ◽

Sunshine Duration ◽

Gaussian Process Regression ◽

Support Vector ◽

Ann Model ◽

Learning Methods ◽

Average Maximum

Abstract. Proper estimation of the reference evapotranspiration (ET0) amount is an indispensable matter for agricultural water management in the efficient use of water. The aim of study is to estimate the amount of ET0 with a different machine and deep learning methods by using minimum meteorological parameters in the Corum region which is an arid and semi-arid climate with an important agricultural center of Turkey. In this context, meteorological variables of average, maximum and minimum temperature, sunshine duration, wind speed, average, maximum, and minimum relative humidity are used as input data monthly. Two different kernel-based (Gaussian Process Regression (GPR) and Support Vector Regression (SVR)) methods, BFGS-ANN and Long short-term memory models were used to estimate ET0 amounts in 10 different combinations. According to the results obtained, all four methods used predicted ET0 amounts in acceptable accuracy and error levels. BFGS-ANN model showed higher success than the others. In kernel-based GPR and SVR methods, Pearson VII function-based universal kernel was the most successful kernel function. Besides, the scenario that is related to temperature in all scenarios used, including average temperature, maximum and minimum temperature, and sunshine duration gave the best results. The second-best scenario was the one that covers only the sunshine duration. In this case, the ANN (BFGS-ANN) model, which is optimized with the BFGS method that uses only the sunshine duration, can be estimated with the 0.971 correlation coefficient of ET0 without the need for other meteorological parameters.

Download Full-text

Real time monitoring of water Quality using IoT and Deep learning

E3S Web of Conferences ◽

10.1051/e3sconf/202129701059 ◽

2021 ◽

Vol 297 ◽

pp. 01059

Author(s):

Saloua Senhaji ◽

Mohamed Hamlich ◽

Mohammed Ouazzani Jamil

Keyword(s):

Water Quality ◽

Deep Learning ◽

Real Time ◽

Environmental Protection Agency ◽

Short Term Memory ◽

Chemical Parameters ◽

Water Parameters ◽

Term Memory ◽

Single Board Computer ◽

Physico Chemical

Access to safe drinking water is one of the most pressing issues facing many developing countries. Water must meet Environmental Protection Agency (E.P.A.) requirements. The normal method of measuring physico-chemical parameters is to take samples manually and send them to the laboratory to check the water quality. In this paper, we proposed a new intelligent design of a real-time water quality monitoring system using Deep Learning technology. This system is composed of several sensors that allow us to measure water parameters (physico-chemical parameters), bacteriological parameters and organoleptic parameters) and to detect the presence of certain substances (undesirable substances, toxic substances) and of a single-board/mobile computer module, Internet and other accessories. Water parameters are automatically detected by the single-board computer. Raspberry Pi3 model B. The single board computer receives the data from the sensors and this data is sent to the web server using the Internet module. It is able to detect the water quality situation worldwide. The data will be analysed in real time. The application of deep learning to these areas has been an important research topic. The Long-Short Term Memory (LSTM) network has been shown to be well suited for processing and predicting large events with long intervals and delays in the time series. LSTM networks have the ability to retain long-term memory.

Download Full-text