A Two-Stage Big Data Analytics Framework with Real World Applications Using Spark Machine Learning and Long Short-Term Memory Network

Muhammad Ashfaq Khan; Md. Rezaul Karim; Yangwoo Kim

doi:10.3390/sym10100485

Symmetry ◽ 10.3390/sym10100485 ◽ 2018 ◽ Vol 10 (10) ◽ pp. 485 ◽ Cited By ~ 6

Author(s): Muhammad Ashfaq Khan ◽ Md. Rezaul Karim ◽ Yangwoo Kim

Keyword(s): Machine Learning ◽ Big Data ◽ Real World ◽ Data Analytics ◽ Short Term Memory ◽ Big Data Analytics ◽ Short Term ◽ Two Stage ◽ Term Memory ◽ Long Short Term Memory

Every day we experience unprecedented data growth from numerous sources, which contributes to big data in terms of volume, velocity, and variability. These datasets impose great challenges on analytics frameworks and computational resources, making it difficult to extract meaningful information in a timely manner. Developing an efficient big data analytics framework to meet these challenges is therefore an important research topic. To exploit non-linear relationships in very large, high-dimensional datasets, machine learning (ML) and deep learning (DL) algorithms are being used in analytics frameworks. Apache Spark, one of the fastest big data processing engines in use, helps to solve iterative ML tasks through its distributed ML library, Spark MLlib. For real-world research problems, DL architectures such as Long Short-Term Memory (LSTM) are an effective approach to overcoming practical issues such as reduced accuracy, long-term sequence dependency, and vanishing and exploding gradients in conventional deep architectures. In this paper, we propose an efficient analytics framework, technically a progressive machine learning technique that merges Spark-based linear models, Multilayer Perceptron (MLP), and LSTM in a two-stage cascade structure to enhance predictive accuracy. The proposed architecture enables us to organize big data analytics in a scalable and efficient way. To show the effectiveness of our framework, we applied the cascading structure to two different real-life datasets to solve a multiclass and a binary classification problem, respectively. Experimental results show that our analytical framework outperforms state-of-the-art approaches with a high level of classification accuracy.

Big Data Analytics for Energy Consumption Prediction in Smart Grid Using Genetic Algorithm and Long Short Term Memory

Computing and Informatics ◽ 10.31577/cai_2021_1_29 ◽ 2021 ◽ Vol 40 (1) ◽ pp. 29-56

Author(s): Sanju Kumari ◽ Neeraj Kumar ◽ Prashant Singh Rana

Keyword(s): Genetic Algorithm ◽ Big Data ◽ Data Analytics ◽ Short Term Memory ◽ Big Data Analytics ◽ Short Term ◽ Term Memory ◽ Long Short Term Memory ◽ Energy Consumption Prediction ◽ Consumption Prediction

Big Data Analytics Using Swarm-Based Long Short-Term Memory for Temperature Forecasting

Computers Materials & Continua ◽ 10.32604/cmc.2022.021447 ◽ 2022 ◽ Vol 71 (2) ◽ pp. 2347-2361

Author(s): Malini M. Patil ◽ P. M. Rekha ◽ Arun Solanki ◽ Anand Nayyar ◽ Basit Qureshi

Keyword(s): Big Data ◽ Data Analytics ◽ Short Term Memory ◽ Big Data Analytics ◽ Short Term ◽ Term Memory ◽ Temperature Forecasting ◽ Long Short Term Memory

Prediction of Head Movement in 360-Degree Videos Using Attention Model

Sensors ◽ 10.3390/s21113678 ◽ 2021 ◽ Vol 21 (11) ◽ pp. 3678

Author(s): Dongwon Lee ◽ Minji Choi ◽ Joohyun Lee

Keyword(s): Machine Learning ◽ Short Term Memory ◽ Moving Average ◽ The Other ◽ Learning Models ◽ Short Term ◽ Term Memory ◽ Attention Model ◽ Long Short Term Memory ◽ Machine Learning Models

In this paper, we propose a prediction algorithm that combines Long Short-Term Memory (LSTM) with an attention model to predict vision coordinates when watching 360-degree videos in a Virtual Reality (VR) or Augmented Reality (AR) system. Predicting the vision coordinates during video streaming is important when the network condition is degraded. However, traditional prediction models such as Moving Average (MA) and Autoregressive Moving Average (ARMA) are linear, so they cannot capture nonlinear relationships. Therefore, machine learning models based on deep learning have recently been used for nonlinear prediction. We use LSTM and Gated Recurrent Unit (GRU) neural networks, both derived from Recurrent Neural Networks (RNN), to predict the head position in 360-degree videos. We then add the attention model to the LSTM to obtain more accurate results. We also compare the performance of the proposed model with other machine learning models such as Multi-Layer Perceptron (MLP) and RNN, using the root mean squared error (RMSE) between predicted and real coordinates. We demonstrate that our model predicts the vision coordinates more accurately than the other models across various videos.
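To make the evaluation setup in this abstract concrete, the following is a minimal sketch of a moving-average baseline scored with RMSE. It is not the authors' code: the function names, the window size, and the sample yaw angles are hypothetical values chosen only for illustration.

```python
import math


def moving_average_forecast(series, window):
    """Predict each value as the mean of the `window` values before it."""
    if window < 1 or window >= len(series):
        raise ValueError("window must be at least 1 and shorter than the series")
    return [
        sum(series[t - window:t]) / window
        for t in range(window, len(series))
    ]


def rmse(predicted, actual):
    """Root mean squared error between two equal-length sequences."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical head yaw angles in degrees (assumed data, not from the paper).
yaw = [0.0, 2.0, 4.0, 6.0, 8.0, 10.0]
window = 2  # assumed window size

preds = moving_average_forecast(yaw, window)
error = rmse(preds, yaw[window:])
print(preds, error)
```

On a steadily drifting signal such as this one, the moving average always lags behind the true value, which is the kind of baseline error that a nonlinear sequence model is expected to reduce.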

Random forest and long short-term memory based machine learning models for classification of ion mobility spectrometry spectra

Chemical, Biological, Radiological, Nuclear, and Explosives (CBRNE) Sensing XXII ◽ 10.1117/12.2585829 ◽ 2021

Author(s): Patrick C. Riley ◽ Samir V. Deshpande ◽ Brian S. Ince ◽ Brian C. Hauck ◽ Kyle P. O'Donnell ◽ ...

Keyword(s): Machine Learning ◽ Random Forest ◽ Ion Mobility ◽ Short Term Memory ◽ Learning Models ◽ Short Term ◽ Term Memory ◽ Long Short Term Memory ◽ Machine Learning Models

Data-driven predictions of a multiscale Lorenz 96 chaotic system using machine-learning methods: reservoir computing, artificial neural network, and long short-term memory network

Nonlinear Processes in Geophysics ◽ 10.5194/npg-27-373-2020 ◽ 2020 ◽ Vol 27 (3) ◽ pp. 373-389 ◽ Cited By ~ 7

Author(s): Ashesh Chattopadhyay ◽ Pedram Hassanzadeh ◽ Devika Subramanian

Keyword(s): Neural Network ◽ Machine Learning ◽ Short Term Memory ◽ Data Driven ◽ Reservoir Computing ◽ Short Term ◽ Term Memory ◽ Machine Learning Methods ◽ Long Short Term Memory ◽ Lorenz 96

Abstract. In this paper, the performance of three machine-learning methods for predicting short-term evolution and for reproducing the long-term statistics of a multiscale spatiotemporal Lorenz 96 system is examined. The methods are an echo state network (ESN, which is a type of reservoir computing; hereafter RC–ESN), a deep feed-forward artificial neural network (ANN), and a recurrent neural network (RNN) with long short-term memory (LSTM; hereafter RNN–LSTM). This Lorenz 96 system has three tiers of nonlinearly interacting variables representing slow/large-scale (X), intermediate (Y), and fast/small-scale (Z) processes. For training or testing, only X is available; Y and Z are never known or used. We show that RC–ESN substantially outperforms ANN and RNN–LSTM for short-term predictions, e.g., accurately forecasting the chaotic trajectories for hundreds of the numerical solver's time steps, equivalent to several Lyapunov timescales. The RNN–LSTM outperforms ANN, and both methods show some prediction skill too. Furthermore, even after losing the trajectory, data predicted by RC–ESN and RNN–LSTM have probability density functions (pdfs) that closely match the true pdf, even at the tails. The pdf of the data predicted using ANN, however, deviates from the true pdf. Implications, caveats, and applications to data-driven and data-assisted surrogate modeling of complex nonlinear dynamical systems, such as weather and climate, are discussed.
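For readers unfamiliar with the test system, the sketch below integrates the classic single-tier Lorenz 96 equations, dX_k/dt = (X_{k+1} - X_{k-2}) X_{k-1} - X_k + F, with a fourth-order Runge-Kutta step. This is a simplification and not the paper's setup: the three-tier coupling to Y and Z described in the abstract is omitted, and the number of variables, forcing, and time step used here are assumed illustrative values.

```python
def lorenz96_rhs(x, forcing):
    """Single-tier Lorenz 96 tendencies with periodic (cyclic) indexing."""
    n = len(x)
    # Negative indices wrap around in Python, giving the cyclic boundary.
    return [
        (x[(k + 1) % n] - x[k - 2]) * x[k - 1] - x[k] + forcing
        for k in range(n)
    ]


def rk4_step(x, dt, forcing):
    """Advance the state by one classical fourth-order Runge-Kutta step."""
    k1 = lorenz96_rhs(x, forcing)
    k2 = lorenz96_rhs([xi + 0.5 * dt * ki for xi, ki in zip(x, k1)], forcing)
    k3 = lorenz96_rhs([xi + 0.5 * dt * ki for xi, ki in zip(x, k2)], forcing)
    k4 = lorenz96_rhs([xi + dt * ki for xi, ki in zip(x, k3)], forcing)
    return [
        xi + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
        for xi, a, b, c, d in zip(x, k1, k2, k3, k4)
    ]


# Assumed illustrative settings: 8 variables, forcing F = 8, dt = 0.01.
FORCING = 8.0
state = [FORCING] * 8
state[0] += 0.01  # small perturbation away from the X_k = F equilibrium

for _ in range(100):
    state = rk4_step(state, 0.01, FORCING)
print(state)
```

A trajectory generated this way is the kind of time series on which a data-driven forecaster would be trained; in the setting described above, only the slow X variables would be exposed to the learner.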

Two-Stage Genetic Algorithm for Designing Long Short Term Memory (LSTM) Ensembles

2021 IEEE Congress on Evolutionary Computation (CEC) ◽ 10.1109/cec45853.2021.9504788 ◽ 2021

Author(s): Ramya Anasseriyil Viswambaran ◽ Gang Chen ◽ Bing Xue ◽ Mohammad Nekooei

Keyword(s): Genetic Algorithm ◽ Short Term Memory ◽ Short Term ◽ Two Stage ◽ Term Memory ◽ Long Short Term Memory

A two-stage approach for predicting the remaining useful life of tools using bidirectional long short-term memory

Measurement ◽ 10.1016/j.measurement.2020.108029 ◽ 2020 ◽ Vol 164 ◽ pp. 108029 ◽ Cited By ~ 1

Author(s): Changfu Liu ◽ Lida Zhu

Keyword(s): Short Term Memory ◽ Remaining Useful Life ◽ Short Term ◽ Two Stage ◽ Term Memory ◽ Useful Life ◽ Long Short Term Memory

Efficient and data-driven prediction of water breakthrough in subsurface systems using deep long short-term memory machine learning

Computational Geosciences ◽ 10.1007/s10596-020-10005-2 ◽ 2020

Author(s): Tao Bai ◽ Pejman Tahmasebi

Keyword(s): Machine Learning ◽ Short Term Memory ◽ Data Driven ◽ Short Term ◽ Term Memory ◽ Water Breakthrough ◽ Long Short Term Memory

Intelligent Data Analytics for Wind Speed Forecasting for Wind Power Production Using Long Short-Term Memory (LSTM) Network

Intelligent Data-Analytics for Condition Monitoring ◽ 10.1016/b978-0-323-85510-5.00008-9 ◽ 2021 ◽ pp. 165-192

Author(s): Hasmat Malik ◽ Nuzhat Fatema ◽ Atif Iqbal

Keyword(s): Wind Speed ◽ Wind Power ◽ Data Analytics ◽ Short Term Memory ◽ Power Production ◽ Short Term ◽ Term Memory ◽ Wind Speed Forecasting ◽ Long Short Term Memory ◽ Lstm Network

Monaural Speech Enhancement Based On Two Stage Long Short-Term Memory Networks

2019 13th International Conference on Signal Processing and Communication Systems (ICSPCS) ◽ 10.1109/icspcs47537.2019.9008709 ◽ 2019

Author(s): Yang Xian ◽ Yang Sun ◽ Wenwu Wang ◽ Syed Mohsen Naqvi

Keyword(s): Speech Enhancement ◽ Short Term Memory ◽ Short Term ◽ Two Stage ◽ Term Memory ◽ Long Short Term Memory
