Energy Efficient Data Mining Scheme for High Dimensional Data

Recently, anomaly detection has acquired a realistic response from data mining scientists as a graph of its reputation has increased smoothly in various practical domains like product marketing, fraud detection, medical diagnosis, fault detection and so many other fields. High dimensional data subjected to outlier detection poses exceptional challenges for data mining experts and it is because of natural problems of the curse of dimensionality and resemblance of distant and adjoining points. Traditional algorithms and techniques were experimented on full feature space regarding outlier detection. Customary methodologies concentrate largely on low dimensional data and hence show ineffectiveness while discovering anomalies in a data set comprised of a high number of dimensions. It becomes a very difficult and tiresome job to dig out anomalies present in high dimensional data set when all subsets of projections need to be explored. All data points in high dimensional data behave like similar observations because of its intrinsic feature i.e., the distance between observations approaches to zero as the number of dimensions extends towards infinity. This research work proposes a novel technique that explores deviation among all data points and embeds its findings inside well established density-based techniques. This is a state of art technique as it gives a new breadth of research towards resolving inherent problems of high dimensional data where outliers reside within clusters having different densities. A high dimensional dataset from UCI Machine Learning Repository is chosen to test the proposed technique and then its results are compared with that of density-based techniques to evaluate its efficiency.

Download Full-text

Fuzzy comprehensive evaluation of physical education based on high dimensional data mining

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-169661 ◽

2018 ◽

Vol 35 (3) ◽

pp. 3065-3076 ◽

Cited By ~ 3

Author(s):

Zhihui Wang

Keyword(s):

Data Mining ◽

Physical Education ◽

Comprehensive Evaluation ◽

High Dimensional Data ◽

Fuzzy Comprehensive Evaluation ◽

High Dimensional

Download Full-text

Analysis of energy efficient data mining techniques in wireless sensor networks: A review

2017 2nd International Conference for Convergence in Technology (I2CT) ◽

10.1109/i2ct.2017.8226167 ◽

2017 ◽

Author(s):

Roshani Talmale ◽

N. Ramaraj ◽

Nita Thakare

Keyword(s):

Data Mining ◽

Wireless Sensor Networks ◽

Sensor Networks ◽

Energy Efficient ◽

Wireless Sensor ◽

Data Mining Techniques ◽

Efficient Data

Download Full-text

Generalizing rules by random forest-based learning classifier systems for high-dimensional data mining

Proceedings of the Genetic and Evolutionary Computation Conference Companion on - GECCO '18 ◽

10.1145/3205651.3208298 ◽

2018 ◽

Author(s):

Fumito Uwano ◽

Koji Dobashi ◽

Keiki Takadama ◽

Tim Kovacs

Keyword(s):

Data Mining ◽

Random Forest ◽

High Dimensional Data ◽

Learning Classifier Systems ◽

High Dimensional ◽

Classifier Systems ◽

Learning Classifier

Download Full-text

Exploiting the anomaly detection for high dimensional data using descriptive approach of data mining

2013 4th International Conference on Computer and Communication Technology (ICCCT) ◽

10.1109/iccct.2013.6749614 ◽

2013 ◽

Cited By ~ 1

Author(s):

Bharat Singh ◽

Nidhi Kushwaha ◽

O P Vyas

Keyword(s):

Data Mining ◽

Anomaly Detection ◽

High Dimensional Data ◽

High Dimensional ◽

Descriptive Approach

Download Full-text

Automatic subspace clustering of high dimensional data for data mining applications

ACM SIGMOD Record ◽

10.1145/276305.276314 ◽

1998 ◽

Vol 27 (2) ◽

pp. 94-105 ◽

Cited By ~ 378

Author(s):

Rakesh Agrawal ◽

Johannes Gehrke ◽

Dimitrios Gunopulos ◽

Prabhakar Raghavan

Keyword(s):

Data Mining ◽

High Dimensional Data ◽

Subspace Clustering ◽

High Dimensional

Download Full-text

Energy Efficient Data Mining in Multi-Feature Sensor Networks Using Improved Leach Communication Protocol

IOSR Journal of Computer Engineering ◽

10.9790/0661-0330811 ◽

2012 ◽

Vol 3 (3) ◽

pp. 08-11

Author(s):

Shivanna K

Keyword(s):

Data Mining ◽

Sensor Networks ◽

Energy Efficient ◽

Communication Protocol ◽

Efficient Data

Download Full-text

A New Cell-Based Clustering Method for High-Dimensional Data Mining Applications

Lecture Notes in Computer Science - Knowledge-Based Intelligent Information and Engineering Systems ◽

10.1007/11552413_56 ◽

2005 ◽

pp. 391-397 ◽

Cited By ~ 1

Author(s):

Jae-Woo Chang

Keyword(s):

Data Mining ◽

High Dimensional Data ◽

High Dimensional ◽

Clustering Method

Download Full-text

Aerodynamic Optimization of the Low-Pressure Turbine Module: Exploiting Surrogate Models in a High-Dimensional Design Space

Journal of Turbomachinery ◽

10.1115/1.4046232 ◽

2020 ◽

Vol 142 (3) ◽

Cited By ~ 1

Author(s):

Lieven Baert ◽

Emmanuel Chérière ◽

Caroline Sainvitu ◽

Ingrid Lepot ◽

Arnaud Nouvellon ◽

...

Keyword(s):

Data Mining ◽

Design Space ◽

Low Pressure ◽

Surrogate Models ◽

High Dimensional ◽

Efficiency Gain ◽

Aerodynamic Optimization ◽

Component Design ◽

Efficient Data ◽

Lp Turbine

Abstract Further improvement of state-of-the-art low-pressure (LP) turbines (LPTs) has become progressively more challenging. LP design is more than ever confronted to the need to further integrate complex models and to shift from single-component design to the design of the complete LPT module at once. This leads to high-dimensional design spaces and automatically challenges their applicability within an industrial context, where computing resources are limited and the cycle time is crucial. The aerodynamic design of a multistage LP turbine is discussed for a design space defined by 350 parameters. Using an online surrogate-based optimization (SBO) approach, a significant efficiency gain of almost 0.5pt has been achieved. By discussing the sampling of the design space, the quality of the surrogate models, and the application of adequate data mining capabilities to steer the optimization, it is shown that despite the high-dimensional nature of the design space, the followed approach allows to obtain performance gains beyond target. The ability to control both global as well as local characteristics of the flow throughout the full LP turbine, in combination with an agile reaction of the search process after dynamically strengthening and/or enforcing new constraints in order to adapt to the review feedback, not only illustrates the feasibility but also the potential of a global design space for the LP module. It is demonstrated that intertwining the capabilities of dynamic SBO and efficient data mining allows to incorporate high-fidelity simulations in design cycle practices of certified engines or novel engine concepts to jointly optimize the multiple stages of the LPT.

Download Full-text