Genetic Algorithm for the Mutual Information-Based Feature Selection in Univariate Time Series Data

IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 9597-9609
Author(s):  
Umair F. Siddiqi ◽  
Sadiq M. Sait ◽  
Okyay Kaynak


Author(s):  
T. Warren Liao

In this chapter, we present genetic algorithm (GA)-based methods developed for clustering univariate time series of equal or unequal length as an exploratory step of data mining. These methods essentially implement the k-medoids algorithm: each chromosome encodes, in binary, the data objects serving as the k medoids. To compare their performance, both fixed-parameter and adaptive GAs were used. We first employed the synthetic control chart data set to investigate the performance of three fitness functions, two distance measures, and other GA parameters such as population size, crossover rate, and mutation rate. Experiments were also run on two more sets of time series, with and without a known number of clusters: the cylinder-bell-funnel data and the novel battle simulation data. The clustering results are presented and discussed.
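The chromosome encoding described above can be sketched as follows. This is a minimal illustration of a binary k-medoids chromosome and its fitness (the total distance of every series to its nearest medoid), not the chapter's implementation; the function names and toy data are hypothetical, and only random search over chromosomes stands in for the full GA:

```python
import math
import random

def euclidean(a, b):
    # Distance between two equal-length univariate series.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def fitness(chromosome, data):
    # chromosome: binary list; a 1 marks a series chosen as a medoid.
    # Fitness is the k-medoids objective: total distance of every
    # series to its nearest medoid (lower is better).
    medoids = [data[i] for i, bit in enumerate(chromosome) if bit == 1]
    if not medoids:
        return float("inf")
    return sum(min(euclidean(s, m) for m in medoids) for s in data)

def random_chromosome(n, k, rng):
    # Exactly k of n bits set, selecting k medoids.
    bits = [0] * n
    for i in rng.sample(range(n), k):
        bits[i] = 1
    return bits

# Toy run: four short series forming two obvious clusters.
rng = random.Random(0)
data = [[0, 0, 0], [0.1, 0, 0], [5, 5, 5], [5, 5.1, 5]]
best = min((random_chromosome(len(data), 2, rng) for _ in range(50)),
           key=lambda c: fitness(c, data))
```

A real GA would evolve such chromosomes with crossover and mutation and rank them by this fitness; the abstract's adaptive variant would additionally tune the crossover and mutation rates during the run.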


Information ◽  
2020 ◽  
Vol 11 (6) ◽  
pp. 288
Author(s):  
Kuiyong Song ◽  
Nianbin Wang ◽  
Hongbin Wang

High-dimensional time series classification is a challenging problem. Distance-based similarity measures are one family of methods for time series classification. This paper proposes a metric learning-based univariate time series classification method (ML-UTSC), which uses a Mahalanobis matrix learned by metric learning to compute the local distance between multivariate time series, and combines Dynamic Time Warping (DTW) with nearest-neighbor classification to obtain the final result. In this method, the univariate time series is represented as multivariate time series data with mean, variance, and slope features. Next, a three-dimensional Mahalanobis matrix is learned from these data by metric learning. The time series is divided into segments of equal length so that the Mahalanobis matrix describes the features of the time series data more accurately. Compared with the most effective existing measures, the experimental results show that the proposed algorithm achieves a lower classification error rate on most of the test datasets.
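The representation described above can be illustrated with a short sketch: extract (mean, variance, slope) features from equal-length segments, then compute a Mahalanobis local distance between feature triples. The identity matrix below merely stands in for the learned matrix, and all names are illustrative, not the paper's code; the DTW alignment over these local distances is omitted:

```python
import math

def segment_features(series, seg_len):
    # Represent a univariate series as a multivariate sequence of
    # (mean, variance, slope) feature triples over equal-length segments.
    feats = []
    for i in range(0, len(series) - seg_len + 1, seg_len):
        seg = series[i:i + seg_len]
        n = len(seg)
        mean = sum(seg) / n
        var = sum((x - mean) ** 2 for x in seg) / n
        # Least-squares slope against time indices 0..n-1.
        t_mean = (n - 1) / 2
        denom = sum((t - t_mean) ** 2 for t in range(n))
        slope = sum((t - t_mean) * (x - mean) for t, x in enumerate(seg)) / denom
        feats.append((mean, var, slope))
    return feats

def mahalanobis(u, v, m):
    # Local distance between two feature triples under a 3x3 matrix M:
    # sqrt((u - v)^T M (u - v)).  With the identity this reduces to
    # Euclidean distance; ML-UTSC would learn M by metric learning.
    d = [a - b for a, b in zip(u, v)]
    return math.sqrt(sum(d[i] * m[i][j] * d[j]
                         for i in range(3) for j in range(3)))

identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
feats = segment_features([0, 1, 2, 3, 4, 5], seg_len=3)  # two segments
```

A DTW pass would then align the two feature sequences using `mahalanobis` as the per-step cost, and a nearest-neighbor rule over the resulting distances would assign the class.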


2016 ◽  
Vol 26 (4) ◽  
pp. 043102 ◽  
Author(s):  
E. Bianco-Martinez ◽  
N. Rubido ◽  
Ch. G. Antonopoulos ◽  
M. S. Baptista

Author(s):  
Angeliki Papana

In this chapter, tools for univariate time series analysis and forecasting are presented and applied. Time series components, such as trend and seasonality, are introduced and discussed, and the forecasting methods are organized according to which components a series contains. In the literature, linear methods are the most commonly used; however, real time series often include nonlinear components, so linear forecasting may not be the optimal choice, and a basic nonlinear forecasting method is therefore also presented. The value of these methods to logistics service providers and 3PL companies is demonstrated through case studies showing how operational and management costs can be reduced while a given service level is maintained. Short-term forecasts are useful in all activity areas of 3PL companies, i.e., supply, production, distribution, storage, transportation, and customer service.
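The role of seasonality in such forecasts can be illustrated with a seasonal-naive sketch. This is a generic textbook baseline (repeat the last observed seasonal cycle), not the chapter's specific method, and the demand figures below are invented:

```python
def seasonal_naive_forecast(series, season, horizon):
    # Baseline forecast: repeat the last observed seasonal cycle.
    # A standard benchmark for demand series with strong seasonality,
    # e.g. weekly patterns in 3PL shipment volumes.
    last_cycle = series[-season:]
    return [last_cycle[h % season] for h in range(horizon)]

# Two invented weekly cycles of daily demand, season length 4.
demand = [10, 12, 15, 11, 10, 13, 16, 12]
forecast = seasonal_naive_forecast(demand, season=4, horizon=4)
```

Any linear or nonlinear method discussed in the chapter should beat this baseline to justify its extra complexity, which is why such naive forecasts are useful yardsticks in practice.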


2020 ◽  
Author(s):  
İsmail Sezen ◽  
Alper Unal ◽  
Ali Deniz

Atmospheric pollution is one of the primary environmental problems, and high concentration levels are critical for human health and the environment. This calls for studying the causes of unusually high concentration levels that do not conform to the pollutant's expected behavior; however, it is not always easy to decide which levels are unusual, especially when the data are large and have a complex structure. Visual inspection is subjective in most cases, so a proper anomaly detection method should be used. Anomaly detection has been widely applied in diverse research areas, but most methods have been developed for specific application domains. Because of the spatio-temporal complexity of a pollutant, it is also not always advisable to identify anomalies using data from nearby measurement sites. A method is therefore needed that estimates anomalies from univariate time series data.

This work suggests a framework based on STL decomposition and the extended isolation forest (EIF), a machine learning algorithm, to identify anomalies in univariate time series with trend, multi-seasonality, and seasonal variation. The main advantage of the EIF method is that it assigns each observation an anomaly score.

In this study, a multi-seasonal STL decomposition is applied to a univariate PM10 time series to remove the trend and seasonal parts, but STL cannot remove the seasonal variation from the data: the remainder still shows 24-hour and yearly variation. To remove this variation, hourly and annual inter-quartile ranges (IQR) are calculated and the data are standardized by dividing each value by the corresponding IQR. This removes the seasonality in the variance, and the resulting data are processed by EIF, which decides which values are anomalous by an objective criterion.
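The hourly IQR standardization step can be sketched as follows. This is an illustrative reconstruction, not the authors' code: the function name and residual values are hypothetical, only the hourly (not annual) IQR is shown, and the final EIF scoring of the standardized series is omitted:

```python
from statistics import quantiles

def hourly_iqr_standardize(values, hours):
    # Remove the diurnal variance pattern left in the STL remainder:
    # divide each residual by the inter-quartile range (IQR) of its
    # hour of day.  (The study applies the same idea annually as well.)
    by_hour = {}
    for v, h in zip(values, hours):
        by_hour.setdefault(h, []).append(v)
    iqr = {}
    for h, vs in by_hour.items():
        q1, _, q3 = quantiles(vs, n=4)   # quartiles of this hour's residuals
        iqr[h] = (q3 - q1) or 1.0        # guard against a zero IQR
    return [v / iqr[h] for v, h in zip(values, hours)]

# Invented residuals: hour 0 is ten times more spread out than hour 1.
vals = [-10.0, 0.0, 10.0, 20.0, -1.0, 0.0, 1.0, 2.0]
hrs = [0, 0, 0, 0, 1, 1, 1, 1]
z = hourly_iqr_standardize(vals, hrs)
```

After this step the two hours have identical spread, so an anomaly detector such as EIF scores values on a common scale instead of flagging every observation from the noisier hour.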

