PhD Forum: Deep Learning and Probabilistic Models Applied to Sequential Data

Probabilistic Deep Learning to Quantify Uncertainty in Air Quality Forecasting

Sensors ◽

10.3390/s21238009 ◽

2021 ◽

Vol 21 (23) ◽

pp. 8009

Author(s):

Abdulmajid Murad ◽

Frank Alexander Kraemer ◽

Kerstin Bach ◽

Gavin Taylor

Keyword(s):

Deep Learning ◽

Air Quality ◽

Probabilistic Models ◽

Performance Metrics ◽

Current Data ◽

Data Driven ◽

Quality Data ◽

Empirical Performance ◽

Adversarial Training ◽

Scalable Methods

Data-driven forecasts of air quality have recently achieved more accurate short-term predictions. However, despite their success, most of the current data-driven solutions lack proper quantifications of model uncertainty that communicate how much to trust the forecasts. Recently, several practical tools to estimate uncertainty have been developed in probabilistic deep learning. However, there have not been empirical applications and extensive comparisons of these tools in the domain of air quality forecasts. Therefore, this work applies state-of-the-art techniques of uncertainty quantification in a real-world setting of air quality forecasts. Through extensive experiments, we describe training probabilistic models and evaluate their predictive uncertainties based on empirical performance, reliability of confidence estimate, and practical applicability. We also propose improving these models using “free” adversarial training and exploiting temporal and spatial correlation inherent in air quality data. Our experiments demonstrate that the proposed models perform better than previous works in quantifying uncertainty in data-driven air quality forecasts. Overall, Bayesian neural networks provide a more reliable uncertainty estimate but can be challenging to implement and scale. Other scalable methods, such as deep ensemble, Monte Carlo (MC) dropout, and stochastic weight averaging-Gaussian (SWAG), can perform well if applied correctly but with different tradeoffs and slight variations in performance metrics. Finally, our results show the practical impact of uncertainty estimation and demonstrate that, indeed, probabilistic models are more suitable for making informed decisions.

Download Full-text

SENTIMENT ANALYSIS OF ELECTRIC CARS USING RECURRENT NEURAL NETWORK METHOD IN INDONESIAN TWEETS

Kursor ◽

10.21107/kursor.v10i4.233 ◽

2020 ◽

Vol 10 (4) ◽

Author(s):

Felisia Handayani ◽

Metty Mustikasari

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Sentiment Analysis ◽

Recurrent Neural Network ◽

Confusion Matrix ◽

Data Representation ◽

Sequential Data ◽

Communication Tool ◽

Electric Cars

Sentiment analysis is computational research of the opinions of many people who are textually expressed against a particular topic. Twitter is the most popular communication tool among Internet users today to express their opinions. Deep Learning is a solution to allow computers to learn from experience and understand the world in terms of the hierarchy concept. Deep Learning objectives replace manual assignments with learning. The development of deep learning has a set of algorithms that focus on learning data representation. The recurrent Neural Network is one of the machine learning methods included in Deep learning because the data is processed through multi-players. RNN is also an algorithm that can recall the input with internal memory, therefore it is suitable for machine learning problems involving sequential data. The study aims to test models that have been created from tweets that are positive, negative, and neutral sentiment to determine the accuracy of the models. The models have been created using the Recurrent Neural Network when applied to tweet classifications to mark the individual classes of Indonesian-language tweet data sentiment. From the experiments conducted, results on the built system showed that the best test results in the tweet data with the RNN method using Confusion Matrix are with Precision 0.618, Recall 0.507 and Accuracy 0.722 on the data amounted to 3000 data and comparative data training and data testing of ratio data 80:20

Download Full-text

Development of Deep Learning Based Human-Centered Threat Assessment for Application to Automated Driving Vehicle

Applied Sciences ◽

10.3390/app10010253 ◽

2019 ◽

Vol 10 (1) ◽

pp. 253 ◽

Cited By ~ 2

Author(s):

Donghoon Shin ◽

Hyun-geun Kim ◽

Kang-moon Park ◽

Kyongsu Yi

Keyword(s):

Neural Network ◽

Deep Learning ◽

Threat Assessment ◽

Collision Probability ◽

Driver Model ◽

Sequential Data ◽

Automated Driving ◽

Human Driver ◽

Acceleration And Deceleration ◽

The Individual

This paper describes the development of deep learning based human-centered threat assessment for application to automated driving vehicle. To achieve naturalistic driver model that would feel natural while safe to a human driver, manual driving characteristics are investigated through real-world driving test data. A probabilistic threat assessment with predicted collision time and collision probability is conducted to evaluate driving situations. On the basis of collision risk analysis, two kinds of deep learning have been implemented to reflect human driving characteristics for automated driving. A deep neural network (DNN) and recurrent neural network (RNN) are designed by neural architecture search (NAS), and by learning from the sequential data, respectively. The NAS is used to automatically design the individual driver’s neural network for efficient and effortless design process while ensuring training performance. Sequential trends in the host vehicle’s state can be incorporated through hand-made RNN. It has been shown from human-centered risk assessment simulations that two successfully designed deep learning driver models can provide conservative and progressive driving behavior similar to a manual human driver in both acceleration and deceleration situations by preventing collision.

Download Full-text

LSTM Neural Network for Textual Ngrams

10.20944/preprints201811.0579.v1 ◽

2018 ◽

Author(s):

Shaun C. D'Souza

Keyword(s):

Neural Network ◽

Deep Learning ◽

Human Brain ◽

Cognitive Neuroscience ◽

Probabilistic Models ◽

Prediction Models ◽

Machine Learning Algorithms ◽

Language Models ◽

Brain Functions ◽

The Web

Cognitive neuroscience is the study of how the human brain functions on tasks like decision making, language, perception and reasoning. Deep learning is a class of machine learning algorithms that use neural networks. They are designed to model the responses of neurons in the human brain. Learning can be supervised or unsupervised. Ngram token models are used extensively in language prediction. Ngrams are probabilistic models that are used in predicting the next word or token. They are a statistical model of word sequences or tokens and are called Language Models or Lms. Ngrams are essential in creating language prediction models. We are exploring a broader sandbox ecosystems enabling for AI. Specifically, around Deep learning applications on unstructured content form on the web.

Download Full-text

Chess as Sequential Data in a Chess Match Outcome Prediction Using Deep Learning with Various Chessboard Representations

Procedia Computer Science ◽

10.1016/j.procs.2021.08.180 ◽

2021 ◽

Vol 192 ◽

pp. 1760-1769

Author(s):

Rafał Dreżewski ◽

Grzegorz Wątor

Keyword(s):

Deep Learning ◽

Outcome Prediction ◽

Sequential Data

Download Full-text

Remaining Useful Life Estimation from Event Data

Annual Conference of the PHM Society ◽

10.36001/phmconf.2021.v13i1.3055 ◽

2021 ◽

Vol 13 (1) ◽

Author(s):

Mahbubul Alam ◽

Laleh Jalali ◽

Mahbubul Alam ◽

Ahmed Farahat ◽

Chetan Gupta

Keyword(s):

Time Series ◽

Deep Learning ◽

Time Series Data ◽

Remaining Useful Life ◽

Sensor Data ◽

Series Data ◽

Sequential Data ◽

Event Data ◽

Useful Life ◽

Event Based

Abstract—Prognostics aims to predict the degradation of equipment by estimating their remaining useful life (RUL) and/or the failure probability within a specific time horizon. The high demand of equipment prognostics in the industry have propelled researchers to develop robust and efficient prognostics techniques. Among data driven techniques for prognostics, machine learning and deep learning (DL) based techniques, particularly Recurrent Neural Networks (RNNs) have gained significant attention due to their ability of effectively representing the degradation progress by employing dynamic temporal behaviors. RNNs are well known for handling sequential data, especially continuous time series sequential data where the data follows certain pattern. Such data is usually obtained from sensors attached to the equipment. However, in many scenarios sensor data is not readily available and often very tedious to acquire. Conversely, event data is more common and can easily be obtained from the error logs saved by the equipment and transmitted to a backend for further processing. Nevertheless, performing prognostics using event data is substantially more difficult than that of the sensor data due to the unique nature of event data. Though event data is sequential, it differs from other seminal sequential data such as time series and natural language in the following manner, i) unlike time series data, events may appear at any time, i.e., the appearance of events lacks periodicity; ii) unlike natural languages, event data do not follow any specific linguistic rule. Additionally, there may be a significant variability in the event types appearing within the same sequence. Therefore, this paper proposes an RUL estimation framework to effectively handle the intricate and novel event data. The proposed framework takes discrete events generated by an equipment (e.g., type, time, etc.) as input, and generates for each new event an estimate of the remaining operating cycles in the life of a given component. To evaluate the efficacy of our proposed method, we conduct extensive experiments using benchmark datasets such as the CMAPSS data after converting the time-series data in these datasets to sequential event data. The event data conversion is carried out by careful exploration and application of appropriate transformation techniques to the time series. To the best of our knowledge this is the first time such event-based RUL estimation problem is introduced to the community. Furthermore, we propose several deep learning and machine learning based solution for the event-based RUL estimation problem. Our results suggest that the deep learning models, 1D-CNN, LSTM, and multi-head attention show similar RMSE, MAE and Score performance. Foreseeably, the XGBoost model achieve lower performance compared to the deep learning models since the XGBoost model fails to capture ordering information from the sequence of events.

Download Full-text

LSTM neural network for textual ngrams

10.7287/peerj.preprints.27377 ◽

2018 ◽

Author(s):

Shaun C. D'Souza

Keyword(s):

Neural Network ◽

Deep Learning ◽

Human Brain ◽

Cognitive Neuroscience ◽

Probabilistic Models ◽

Prediction Models ◽

Machine Learning Algorithms ◽

Language Models ◽

Brain Functions ◽

The Web

Cognitive neuroscience is the study of how the human brain functions on tasks like decision making, language, perception and reasoning. Deep learning is a class of machine learning algorithms that use neural networks. They are designed to model the responses of neurons in the human brain. Learning can be supervised or unsupervised. Ngram token models are used extensively in language prediction. Ngrams are probabilistic models that are used in predicting the next word or token. They are a statistical model of word sequences or tokens and are called Language Models or Lms. Ngrams are essential in creating language prediction models. We are exploring a broader sandbox ecosystems enabling for AI. Specifically, around Deep learning applications on unstructured content form on the web.

Download Full-text

Investigation on Deep Learning Approach for Big Data

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Handbook of Research on Pattern Engineering System Development for Big Data Analytics ◽

10.4018/978-1-5225-3870-7.ch002 ◽

2018 ◽

pp. 25-38

Author(s):

Dharmendra Singh Rajput ◽

T. Sunil Kumar Reddy ◽

Dasari Naga Raju

Keyword(s):

Big Data ◽

Deep Learning ◽

Data Analytics ◽

Probabilistic Models ◽

Big Data Analytics ◽

Data Sets ◽

Semantic Indexing ◽

Learning Approaches ◽

Data Sampling ◽

Learning Mechanisms

In recent years, big data analytics is the major research area where the researchers are focused. Complex structures are trained at each level to simplify the data abstractions. Deep learning algorithms are one of the promising researches for automation of complex data extraction from large data sets. Deep learning mechanisms produce better results in machine learning, such as computer vision, improved classification modelling, probabilistic models of data samples, and invariant data sets. The challenges handled by the big data are fast information retrieval, semantic indexing, extracting complex patterns, and data tagging. Some investigations are concentrated on integration of deep learning approaches with big data analytics which pose some severe challenges like scalability, high dimensionality, data streaming, and distributed computing. Finally, the chapter concludes by posing some questions to develop the future work in semantic indexing, active learning, semi-supervised learning, domain adaptation modelling, data sampling, and data abstractions.

Download Full-text

Deep Learning vs. Traditional Probabilistic Models: Case Study on Short Inputs for Password Guessing

Algorithms and Architectures for Parallel Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-030-38991-8_31 ◽

2020 ◽

pp. 468-483

Author(s):

Yuan Linghu ◽

Xiangxue Li ◽

Zhenlong Zhang

Keyword(s):

Deep Learning ◽

Probabilistic Models

Download Full-text

INCO-GAN: Variable-Length Music Generation Method Based on Inception Model-Based Conditional GAN

Mathematics ◽

10.3390/math9040387 ◽

2021 ◽

Vol 9 (4) ◽

pp. 387

Author(s):

Shuyu Li ◽

Yunsick Sung

Keyword(s):

Neural Network ◽

Deep Learning ◽

Generative Models ◽

Variable Length ◽

Sequential Data ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Music Generation ◽

Automatic Music Generation ◽

Frequency Vectors

Deep learning has made significant progress in the field of automatic music generation. At present, the research on music generation via deep learning can be divided into two categories: predictive models and generative models. However, both categories have the same problems that need to be resolved. First, the length of the music must be determined artificially prior to generation. Second, although the convolutional neural network (CNN) is unexpectedly superior to the recurrent neural network (RNN), CNN still has several disadvantages. This paper proposes a conditional generative adversarial network approach using an inception model (INCO-GAN), which enables the generation of complete variable-length music automatically. By adding a time distribution layer that considers sequential data, CNN considers the time relationship in a manner similar to RNN. In addition, the inception model obtains richer features, which improves the quality of the generated music. In experiments conducted, the music generated by the proposed method and that by human composers were compared. High cosine similarity of up to 0.987 was achieved between the frequency vectors, indicating that the music generated by the proposed method is very similar to that created by a human composer.

Download Full-text