Interactive Visualization Adopting Dimensionality Reduction Techniques for Pattern Recognition in Large Temporal Datasets

Mapping Intimacies ◽

10.23889/suthesis.56896 ◽

2021 ◽

Author(s):

◽

Mohammed Ali

Keyword(s):

Machine Learning ◽

Time Series ◽

Dimensionality Reduction ◽

Visual Analytics ◽

Large Time ◽

Time Series Data ◽

Machine Learning Algorithms ◽

Series Data ◽

Reduction Techniques ◽

Dimensionality Reduction Techniques

In this thesis, we focus on time-series data, which is commonly used by domain experts in different domains to explore and understand phenomena or behaviors under consideration, as-sisting them in making decisions, predicting the future or solving problems. Utilizing sensor devices is one of the common ways of collecting time-series data. These devices collect large volumes of raw data, including multi-dimensional time-series data, and each value is associated with the time-stamp corresponding to when it was recorded. However, finding interesting pat-terns or behaviors in a large amount of data is not simple due to the nature of the data and other challenges related to its size and scalability, high dimensionality, complexity, representation, and unique structure.Researchers tend to use time-series chart visualization, which is usually unsuitable because of the small screen resolution which cannot accommodate the large size of the data. Hence, occlusion and overplotting issues occur, limiting or complicating the exploration and analysis tasks. Another challenge concerns the labeling of patterns in large time-series data, which is time-consuming and requires a great deal of expert knowledge.These issues are addressed in this thesis to improve the exploration, analysis and presen-tation of time-series data and enable users to gain insights into large and multi-dimensional time-series datasets using a combination of dimensionality reduction techniques and interac-tive visual methods. The provided solutions will help researchers from various domains who deal with large and multi-dimensional time-series data to efficiently explore and analyze such data with little effort and in record time.Initially, we explore the area of integration between machine learning algorithms and inter-active visualization techniques for exploring and understanding time-series data, specifically looking at clustering and classification for time-series data in visual analytics. The survey is considered to be a valuable guide for both new researchers and experts in the emerging field of integrating machine learning algorithms into visual analytics.Next, we present a novel approach that aims to explore, analyze, and present large temporal datasets through one image. The proposed approach uses a sliding window and dimensionality reduction techniques to depict a large time-series data as points into a 2D scatter plot. The approach provides novel solutions to many pattern discovery issues and can deal with both univariate and multivariate time-series data.Following this, our proposed approach is combined with both visualization and interaction techniques into one system called TimeCluster, which is a visual analytics tool allowing users to visualize, explore and interact with large time-series data. The system addresses different issues such as anomaly detection, the discovery of frequent patterns, and the labeling of in-teresting patterns in large time-series data all in a single system. We deploy our system with different time-series datasets and report real-world case studies of its utility.Later, the linkage between the 1D view (time-series chart) to the 2D view of the 2D embed-ding of time-series data, and parallel interactions such as selection and labeling, are employed to explore and examine the effectiveness of recent developments in machine learning and di-mension reduction in the context of time-series data exploration. We design a user study to evaluate and validate the effectiveness of the linkage between both a 1D and 2D visualization, and how their fitness in the context of projecting time-series data is, where different dimen-sionality reduction techniques are examined, evaluated and compared within our experimental setting.Lastly, we conclude our findings and outline possible areas for future work.

Download Full-text

Mining Massive Time Series Data: With Dimensionality Reduction Techniques

Communications in Computer and Information Science - Advances in Computing and Data Sciences ◽

10.1007/978-981-15-6634-9_45 ◽

2020 ◽

pp. 496-506

Author(s):

Justin Borg ◽

Joseph G. Vella

Keyword(s):

Time Series ◽

Dimensionality Reduction ◽

Time Series Data ◽

Series Data ◽

Reduction Techniques ◽

Dimensionality Reduction Techniques

Download Full-text

Time Series Data Representation and Dimensionality Reduction Techniques

Algorithms for Intelligent Systems - Applications of Machine Learning ◽

10.1007/978-981-15-3357-0_18 ◽

2020 ◽

pp. 267-284

Author(s):

Anshul Sharma ◽

Abhinav Kumar ◽

Anil Kumar Pandey ◽

Rishav Singh

Keyword(s):

Time Series ◽

Dimensionality Reduction ◽

Time Series Data ◽

Data Representation ◽

Series Data ◽

Reduction Techniques ◽

Dimensionality Reduction Techniques

Download Full-text

Similarity Measures and Dimensionality Reduction Techniques for Time Series Data Mining

Advances in Data Mining Knowledge Discovery and Applications ◽

10.5772/49941 ◽

2012 ◽

Cited By ~ 21

Author(s):

Carmelo Cassisi ◽

Placido Montalto ◽

Marco Aliotta ◽

Andrea Cannata ◽

Alfredo Pulvirenti

Keyword(s):

Data Mining ◽

Time Series ◽

Dimensionality Reduction ◽

Time Series Data ◽

Similarity Measures ◽

Series Data ◽

Time Series Data Mining ◽

Reduction Techniques ◽

Dimensionality Reduction Techniques

Download Full-text

Implementation of IoT Framework with Data Analysis Using Deep Learning Methods for Occupancy Prediction in a Building

Future Internet ◽

10.3390/fi13030067 ◽

2021 ◽

Vol 13 (3) ◽

pp. 67

Author(s):

Eric Hitimana ◽

Gaurav Bajpai ◽

Richard Musabe ◽

Louis Sibomana ◽

Jayavel Kayalvizhi

Keyword(s):

Machine Learning ◽

Time Series ◽

Deep Learning ◽

Time Series Data ◽

Multivariate Time Series ◽

Machine Learning Algorithms ◽

Series Data ◽

Support Vector ◽

Human Beings ◽

Feed Forward Network

Many countries worldwide face challenges in controlling building incidence prevention measures for fire disasters. The most critical issues are the localization, identification, detection of the room occupant. Internet of Things (IoT) along with machine learning proved the increase of the smartness of the building by providing real-time data acquisition using sensors and actuators for prediction mechanisms. This paper proposes the implementation of an IoT framework to capture indoor environmental parameters for occupancy multivariate time-series data. The application of the Long Short Term Memory (LSTM) Deep Learning algorithm is used to infer the knowledge of the presence of human beings. An experiment is conducted in an office room using multivariate time-series as predictors in the regression forecasting problem. The results obtained demonstrate that with the developed system it is possible to obtain, process, and store environmental information. The information collected was applied to the LSTM algorithm and compared with other machine learning algorithms. The compared algorithms are Support Vector Machine, Naïve Bayes Network, and Multilayer Perceptron Feed-Forward Network. The outcomes based on the parametric calibrations demonstrate that LSTM performs better in the context of the proposed application.

Download Full-text

Prediction and Analysis of Gold Prices using Ensemble Machine Learning Algorithms

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.36028 ◽

2021 ◽

Vol 9 (VI) ◽

pp. 4367-4374

Author(s):

Gudipally Chandrashakar

Keyword(s):

Machine Learning ◽

Time Series ◽

Time Series Data ◽

Gold Price ◽

Machine Learning Algorithms ◽

Series Data ◽

Gradient Boosting ◽

Support Vector ◽

Average Value ◽

Ensemble Machine Learning

In this article, we used historical time series data up to the current day gold price. In this study of predicting gold price, we consider few correlating factors like silver price, copper price, standard, and poor’s 500 value, dollar-rupee exchange rate, Dow Jones Industrial Average Value. Considering the prices of every correlating factor and gold price data where dates ranging from 2008 January to 2021 February. Few algorithms of machine learning are used to analyze the time-series data are Random Forest Regression, Support Vector Regressor, Linear Regressor, ExtraTrees Regressor and Gradient boosting Regression. While seeing the results the Extra Tree Regressor algorithm gives the predicted value of gold prices more accurately.

Download Full-text

A Novel Fractal Representation for Dimensionality Reduction of Large Time Series Data

Advances in Knowledge Discovery and Data Mining - Lecture Notes in Computer Science ◽

10.1007/978-3-642-01307-2_105 ◽

2009 ◽

pp. 989-996 ◽

Cited By ~ 3

Author(s):

Poat Sajjipanon ◽

Chotirat Ann Ratanamahatana

Keyword(s):

Time Series ◽

Dimensionality Reduction ◽

Large Time ◽

Time Series Data ◽

Series Data ◽

Fractal Representation

Download Full-text

Time-Series Data Classification and Analysis Associated With Machine Learning Algorithms for Cognitive Perception and Phenomenon

IEEE Access ◽

10.1109/access.2020.3018477 ◽

2020 ◽

Vol 8 ◽

pp. 222417-222428

Author(s):

Taikyeong Jeong

Keyword(s):

Machine Learning ◽

Time Series ◽

Time Series Data ◽

Learning Algorithms ◽

Data Classification ◽

Machine Learning Algorithms ◽

Series Data

Download Full-text

Eagle View: An Abstract Evaluation of Machine Learning Algorithms based on Data Properties

10.36227/techrxiv.14459361.v1 ◽

2021 ◽

Author(s):

Dhairya Vyas

Keyword(s):

Machine Learning ◽

Time Series ◽

Time Series Data ◽

Learning Algorithms ◽

Numerical Data ◽

Machine Learning Algorithms ◽

Series Data ◽

Learning Methods ◽

Machine Learning Methods ◽

Almost All

In terms of Machine Learning, the majority of the data can be grouped into four categories: numerical data, category data, time-series data, and text. We use different classifiers for different data properties, such as the Supervised; Unsupervised; and Reinforcement. Each Categorises has classifier we have tested almost all machine learning methods and make analysis among them.

Download Full-text

Advanced Hybrid Machine Learning Algorithms for Multistep Lake Water Level Forecasting

10.21203/rs.3.rs-912383/v1 ◽

2021 ◽

Author(s):

Elham Fijani ◽

Khabat Khosravi ◽

Rahim Barzegar ◽

John Quilty ◽

Jan Adamowski ◽

...

Keyword(s):

Machine Learning ◽

Time Series ◽

Water Level ◽

Lake Michigan ◽

Model Building ◽

Time Series Data ◽

Machine Learning Algorithms ◽

Series Data ◽

Lake Water Level ◽

Visually Based

Abstract Random Tree (RT) and Iterative Classifier Optimizer (ICO) based on Alternating Model Tree (AMT) regressor machine learning (ML) algorithms coupled with Bagging (BA) or Additive Regression (AR) hybrid algorithms were applied to forecasting multistep ahead (up to three months) Lake Superior and Lake Michigan water level (WL). Partial autocorrelation (PACF) of each lake’s WL time series estimated the most important lag times — up to five months in both lakes — as potential inputs. The WL time series data was partitioned into training (from 1918 to 1988) and testing (from 1989 to 2018) for model building and evaluation, respectively. Developed algorithms were validated through statistically and visually based metric using testing data. Although both hybrid ensemble algorithms improved individual ML algorithms’ performance, the BA algorithm outperformed the AR algorithm. As a novel model in forecasting problems, the ICO algorithm was shown to have great potential in generating robust multistep lake WL forecasts.

Download Full-text

A Visual Analytics Framework for Reviewing Multivariate Time-Series Data with Dimensionality Reduction

IEEE Transactions on Visualization and Computer Graphics ◽

10.1109/tvcg.2020.3028889 ◽

2020 ◽

pp. 1-1

Author(s):

Takanori Fujiwara ◽

Shilpika ◽

Naohisa Sakamoto ◽

Jorji Nonaka ◽

Keiji Yamamoto ◽

...

Keyword(s):

Time Series ◽

Dimensionality Reduction ◽

Visual Analytics ◽

Time Series Data ◽

Multivariate Time Series ◽

Series Data

Download Full-text