Performance enhancement of classification scheme in data mining using hybrid algorithm

Data Mining Application in Classification Scheme of Human Subjects According to Ayurvedic Prakruti – Temperament

Indian Journal of Science and Technology ◽

10.17485/ijst/2016/v9i13/84658 ◽

2016 ◽

Vol 9 (13) ◽

Cited By ~ 1

Author(s):

Murtaza M. Junaid Farooque ◽

Mohammed Aref ◽

Mohammed Imran Khan ◽

Shareque Mohammed

Keyword(s):

Data Mining ◽

Classification Scheme ◽

Human Subjects ◽

Data Mining Application

Download Full-text

A Hybrid Algorithm of Mining Closed Itemsets for Large Databases

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.145.292 ◽

2011 ◽

Vol 145 ◽

pp. 292-296

Author(s):

Lee Wen Huang

Keyword(s):

Data Mining ◽

Association Rules ◽

Execution Time ◽

Hybrid Algorithm ◽

Hybrid Approach ◽

Market Basket Analysis ◽

Market Basket ◽

Large Databases ◽

Closed Itemsets ◽

Simulation Results

Data Mining means a process of nontrivial extraction of implicit, previously and potentially useful information from data in databases. Mining closed large itemsets is a further work of mining association rules, which aims to find the set of necessary subsets of large itemsets that could be representative of all large itemsets. In this paper, we design a hybrid approach, considering the character of data, to mine the closed large itemsets efficiently. Two features of market basket analysis are considered – the number of items is large; the number of associated items for each item is small. Combining the cut-point method and the hash concept, the new algorithm can find the closed large itemsets efficiently. The simulation results show that the new algorithm outperforms the FP-CLOSE algorithm in the execution time and the space of storage.

Download Full-text

Beyond Classification

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch111 ◽

2008 ◽

pp. 1855-1876

Author(s):

Anna Olecka

Keyword(s):

Data Mining ◽

Credit Risk ◽

Credit Card ◽

Classification Scheme ◽

Credit Scoring ◽

Complex Nature ◽

Acquisition Process ◽

Business Objective ◽

Simplifying Assumptions ◽

History Of

This chapter will focus on challenges in modeling credit risk for new accounts acquisition process in the credit card industry. First section provides an overview and a brief history of credit scoring. The second section looks at some of the challenges specific to the credit industry. In many of these applications business objective is tied only indirectly to the classification scheme. Opposing objectives, such as response, profit and risk, often play a tug of war with each other. Solving a business problem of such complex nature often requires a multiple of models working jointly. Challenges to data mining lie in exploring solutions that go beyond traditional, well-documented methodology and need for simplifying assumptions; often necessitated by the reality of dataset sizes and/or implementation issues. Examples of such challenges form an illustrative example of a compromise between data mining theory and applications.

Download Full-text

Beyond Classification

Knowledge Discovery and Data Mining ◽

10.4018/978-1-59904-252-7.ch008 ◽

2011 ◽

pp. 139-161

Author(s):

Anna Olecka

Keyword(s):

Data Mining ◽

Credit Risk ◽

Credit Card ◽

Classification Scheme ◽

Credit Scoring ◽

Complex Nature ◽

Acquisition Process ◽

Business Objective ◽

Simplifying Assumptions ◽

History Of

This chapter will focus on challenges in modeling credit risk for new accounts acquisition process in the credit card industry. First section provides an overview and a brief history of credit scoring. The second section looks at some of the challenges specific to the credit industry. In many of these applications business objective is tied only indirectly to the classification scheme. Opposing objectives, such as response, profit and risk, often play a tug of war with each other. Solving a business problem of such complex nature often requires a multiple of models working jointly. Challenges to data mining lie in exploring solutions that go beyond traditional, well-documented methodology and need for simplifying assumptions; often necessitated by the reality of dataset sizes and/or implementation issues. Examples of such challenges form an illustrative example of a compromise between data mining theory and applications.

Download Full-text

Intrusion Detection System by Using Hybrid Algorithm of Data Mining Technique

Proceedings of the 2018 7th International Conference on Software and Computer Applications - ICSCA 2018 ◽

10.1145/3185089.3185114 ◽

2018 ◽

Cited By ~ 2

Author(s):

Zohreh Abtahi Foroushani ◽

Yue Li

Keyword(s):

Data Mining ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Hybrid Algorithm ◽

Detection System ◽

Data Mining Technique ◽

Mining Technique

Download Full-text

Performance enhancement of DBSCAN density based clustering algorithm in data mining

2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS) ◽

10.1109/icecds.2017.8389708 ◽

2017 ◽

Author(s):

Deepak Jain ◽

Manoj Singh ◽

Arvind K Sharma

Keyword(s):

Data Mining ◽

Clustering Algorithm ◽

Performance Enhancement ◽

Density Based Clustering

Download Full-text

Hybrid Algorithm for Anomaly Removal in Time Series Data Mining

10.20944/preprints202111.0440.v1 ◽

2021 ◽

Author(s):

Abdul Razaque ◽

Marzhan Abenova ◽

Munif Alotaibi ◽

Bandar Alotaibi ◽

Hamoud Alshammari ◽

...

Keyword(s):

Data Mining ◽

Time Series ◽

Hybrid Algorithm ◽

Time Series Data ◽

State Of The Art ◽

Large Data ◽

Series Data ◽

Multidimensional Data ◽

Search Problem ◽

Short Text

Time series data are significant and are derived from temporal data, which involve real numbers representing values collected regularly over time. Time series have a great impact on many types of data. However, time series have anomalies. We introduce hybrid algorithm named novel matrix profile (NMP) to solve the all-pairs similarity search problem for time series data. The proposed NMP inherits the features from two state-of-the art algorithms: similarity time-series automatic multivariate prediction (STAMP), and short text online microblogging protocol (STOMP). The proposed algorithm caches the output in an easy-to-access fashion for single- and multidimensional data. The proposed NMP algorithm can be used on large data sets and generates approximate solutions of high quality in a reasonable time. The proposed NMP can also handle several data mining tasks. It is implemented on a Python platform. To determine its effectiveness, it is compared with the state-of-the-art matrix profile algorithms i.e., STAMP and STOMP. The results confirm that the proposed NMP provides higher accuracy than the compared algorithms.

Download Full-text

A Hybrid Algorithm for Privacy Preserving in Data Mining

International Journal of Intelligent Systems and Applications ◽

10.5815/ijisa.2013.08.06 ◽

2013 ◽

Vol 5 (8) ◽

pp. 47-53 ◽

Cited By ~ 1

Author(s):

Sridhar Mandapati ◽

Raveendra Babu Bhogapathi ◽

Ratna Babu Chekka

Keyword(s):

Data Mining ◽

Hybrid Algorithm ◽

Privacy Preserving

Download Full-text

Architecture of Proposed Secured Crypto-Hybrid Algorithm (SCHA) for Security and Privacy Issues in Data Mining

Advances in Intelligent Systems and Computing - Progress in Advanced Computing and Intelligent Engineering ◽

10.1007/978-981-15-6353-9_28 ◽

2020 ◽

pp. 315-321

Author(s):

Pasupuleti Nagendra Babu ◽

S. Ramakrishna

Keyword(s):

Data Mining ◽

Hybrid Algorithm ◽

Security And Privacy ◽

Privacy Issues

Download Full-text

A survey of open source data science tools

International Journal of Intelligent Computing and Cybernetics ◽

10.1108/ijicc-07-2014-0031 ◽

2015 ◽

Vol 8 (3) ◽

pp. 232-261 ◽

Cited By ~ 10

Author(s):

Panagiotis Barlas ◽

Ivor Lanning ◽

Cathal Heavey

Keyword(s):

Data Mining ◽

Open Source ◽

Data Science ◽

Classification Scheme ◽

Current Status ◽

Probability Models ◽

Content Type ◽

Project Activity ◽

Operational Characteristics ◽

Open Source Data

Purpose – Data science is the study of the generalizable extraction of knowledge from data. It includes a variety of components and develops on methods and concepts from many domains, containing mathematics, probability models, machine learning, statistical learning, computer programming, data engineering, pattern recognition and learning, visualization and data warehousing aiming to extract value from data. The purpose of this paper is to provide an overview of open source (OS) data science tools, proposing a classification scheme that can be used to study OS data science software. Design/methodology/approach – The proposed classification scheme is based on general characteristics, project activity, operational characteristics and data mining characteristics. The authors then use the proposed scheme to examine 70 identified Open Source Software. From this the authors provide insight about the current status of OS data science tools and reveal the state-of-the-art tools. Findings – The features of 70 OS tools are recorded based on the criteria of the four group characteristics, general characteristics, project activity, operational characteristics and data mining characteristics. Interesting results came from the analysis of these features and are recorded here. Originality/value – The contribution of this survey is development of a new classification scheme for examination and study of OS data science tools. In parallel, this study provides an overview of existing OS data science tools.

Download Full-text