A survey - data mining frameworks in credit card processing

Author(s):  
Pornwatthana Wongchinsri ◽  
Werasak Kuratach
Author(s):  
S. K. Saravanan ◽  
G. N. K. Suresh Babu

Today, most secure data transfers take place over the internet, and the risk involved in transferring data securely rises at the same time. With the rise and rapid progress of e-commerce, the use of credit cards (CC) for online transactions has also grown dramatically, and safe credit card balance transfers have become a requirement of the times. Credit card fraud detection is correspondingly important, as the number of fraudsters grows every day. The aim of this survey is to examine the issues associated with credit card fraud using data-mining methodologies. Data mining is a well-defined procedure that takes data as input and produces output in the form of models or patterns. This survey is useful both for credit card providers choosing a suitable solution to their problem and for researchers seeking a comprehensive assessment of the literature in this field.


2011 ◽  
Vol 50 (3) ◽  
pp. 602-613 ◽  
Author(s):  
Siddhartha Bhattacharyya ◽  
Sanjeev Jha ◽  
Kurian Tharakunnel ◽  
J. Christopher Westland

2022 ◽  
Vol 13 (1) ◽  
pp. 1-17
Author(s):  
Ankit Kumar ◽  
Abhishek Kumar ◽  
Ali Kashif Bashir ◽  
Mamoon Rashid ◽  
V. D. Ambeth Kumar ◽  
...  

Detection of outliers, or anomalies, is one of the vital problems in pattern-driven data mining. Outlier detection identifies the inconsistent behavior of individual objects. It is an important area of the data mining field with several applications, such as detecting credit card fraud, discovering hacking attempts, and uncovering criminal activity. Tools are needed to uncover the critical information buried in extensive data. This paper investigates a novel method for detecting cluster outliers in a multidimensional dataset, capable of identifying both the clusters and the outliers in datasets containing noise. The proposed method can detect the groups and outliers left by the clustering process, such as irregular sets of clusters (C) and outliers (O), to improve the results. After applying the algorithm to the dataset, the results improved on several parameters. For the comparative analysis, average accuracy and average recall were computed: average accuracy is 74.05% for the existing COID algorithm versus 77.21% for the proposed algorithm, and average recall is 81.19% versus 89.51%, showing that the proposed method is more effective than the existing COID algorithm.
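The paper's own algorithm is not reproduced in the abstract. As a minimal sketch of the general idea of cluster-based outlier detection, the example below assigns each point to the nearest of a set of centroids and flags points whose distance to their centroid exceeds a threshold. All data, names, and the threshold are illustrative assumptions, not the paper's method.

```python
import math

def assign_and_flag(points, centroids, threshold):
    """Assign each 2-D point to its nearest centroid; flag points
    farther than `threshold` from that centroid as outliers."""
    clusters = {i: [] for i in range(len(centroids))}
    outliers = []
    for p in points:
        dists = [math.dist(p, c) for c in centroids]
        i = dists.index(min(dists))
        if dists[i] > threshold:
            outliers.append(p)
        else:
            clusters[i].append(p)
    return clusters, outliers

# Two tight groups plus one point far from both centroids.
points = [(0, 0), (0.5, 0.2), (10, 10), (10.3, 9.8), (50, 50)]
centroids = [(0, 0), (10, 10)]
clusters, outliers = assign_and_flag(points, centroids, threshold=5.0)
```

Here (50, 50) is left unassigned by the clustering step and reported as an outlier, mirroring the paper's framing of outliers as what the clustering process leaves behind.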


2020 ◽  
pp. 1-14 ◽  
Author(s):  
Adam Karg ◽  
Ali Tamaddoni ◽  
Heath McDonald ◽  
Michael Ewing

Season ticket holders are a vital source of revenue for professional teams, but retention remains a perennial issue. Prior research has focused on broad variables, such as relationship tenure, game attendance frequency, and renewal intention, and has generally been limited to survey data with its attendant problems. To advance this important research agenda, the present study analyzes team-supplied behavioral data to investigate and predict retention as a loyalty outcome for a single professional team over a 3-year period. Specifically, the authors embrace a broad range of loyalty measures and team performance to predict retention and employ novel data mining techniques to improve predictive accuracy.
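The abstract does not name the specific techniques used. As a toy sketch of predicting renewal from behavioral data, the example below uses a nearest-neighbor rule over two invented features (tenure in years and games attended); the records, features, and function names are all hypothetical and not from the study.

```python
import math

def predict_renewal(history, holder):
    """Predict renewal (True/False) for a season ticket holder by
    copying the label of the nearest historical holder.
    Each record: (tenure_years, games_attended, renewed)."""
    nearest = min(history, key=lambda r: math.dist(r[:2], holder))
    return nearest[2]

# Invented behavioral records: (tenure, attendance, renewed?)
history = [
    (1, 3, False), (2, 5, False),
    (8, 20, True), (10, 22, True),
]
```

A holder whose behavior resembles past renewers is predicted to renew; in practice such features would be team-supplied and the model far richer.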


Author(s):  
Fabrizio Angiulli

Data mining techniques can be grouped in four main categories: clustering, classification, dependency detection, and outlier detection. Clustering is the process of partitioning a set of objects into homogeneous groups, or clusters. Classification is the task of assigning objects to one of several predefined categories. Dependency detection searches for pairs of attribute sets which exhibit some degree of correlation in the data set at hand. The outlier detection task can be defined as follows: “Given a set of data points or objects, find the objects that are considerably dissimilar, exceptional or inconsistent with respect to the remaining data”. These exceptional objects are also referred to as outliers. Most of the early methods for outlier identification were developed in the field of statistics (Hawkins, 1980; Barnett & Lewis, 1994). Hawkins’ definition of outlier clarifies the approach: “An outlier is an observation that deviates so much from other observations as to arouse suspicions that it was generated by a different mechanism”. Indeed, statistical techniques assume that the given data set has a distribution model. Outliers are those points that satisfy a discordancy test, that is, that are significantly far from what would be their expected position given the hypothesized distribution. Many clustering, classification and dependency detection methods produce outliers as a by-product of their main task. For example, in classification, mislabeled objects are considered outliers and thus they are removed from the training set to improve the accuracy of the resulting classifier, while in clustering, objects that do not strongly belong to any cluster are considered outliers. Nevertheless, it must be said that searching for outliers through techniques specifically designed for tasks other than outlier detection may not be advantageous.
As an example, clusters can be distorted by outliers, and thus the quality of the outliers returned is affected by their presence. Moreover, besides returning a solution of higher quality, outlier detection algorithms can be vastly more efficient than non-ad-hoc algorithms. While in many contexts outliers are treated as noise to be eliminated, as pointed out elsewhere, “one person’s noise could be another person’s signal”, and thus outliers themselves can be of great interest. Outlier mining is used in telecom and credit card fraud detection to spot atypical usage of telecom services or credit cards, in intrusion detection for detecting unauthorized accesses, in medical analysis to test abnormal reactions to new medical therapies, in marketing and customer segmentation to identify customers spending much more or much less than the average customer, in surveillance systems, in data cleaning, and in many other fields.
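The statistical approach Hawkins describes can be sketched very simply: assume a normal model and flag observations whose z-score exceeds a cutoff as failing the discordancy test. The cutoff value and data below are illustrative assumptions.

```python
import statistics

def discordant(data, threshold=2.5):
    """Flag observations whose z-score under a normal model
    exceeds `threshold` -- a simple statistical discordancy test."""
    mu = statistics.mean(data)
    sigma = statistics.stdev(data)
    return [x for x in data if abs(x - mu) / sigma > threshold]

readings = [10, 12, 11, 13, 12, 11, 10, 13, 12, 100]
outliers = discordant(readings)
```

Note the weakness mentioned in the text: the extreme value itself inflates the estimated mean and spread, which is one reason dedicated outlier detection algorithms can outperform such by-product approaches.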


Author(s):  
Carlotta Domeniconi ◽  
Dimitrios Gunopulos

Pattern classification is a very general concept with numerous applications ranging from science, engineering, target marketing, medical diagnosis and electronic commerce to weather forecasting based on satellite imagery. A typical application of pattern classification is mass mailing for marketing. For example, credit card companies often mail solicitations to consumers. Naturally, they would like to target those consumers who are most likely to respond. Often, demographic information is available for those who have responded previously to such solicitations, and this information may be used in order to target the most likely respondents. Another application is electronic commerce of the new economy. E-commerce provides a rich environment to advance the state-of-the-art in classification because it demands effective means for text classification in order to make rapid product and market recommendations. Recent developments in data mining have posed new challenges to pattern classification. Data mining is a knowledge discovery process whose aim is to discover unknown relationships and/or patterns from a large set of data, from which it is possible to predict future outcomes. As such, pattern classification becomes one of the key steps in an attempt to uncover the hidden knowledge within the data. The primary goal is usually predictive accuracy, with secondary goals being speed, ease of use, and interpretability of the resulting predictive model. While pattern classification has shown promise in many areas of practical significance, it faces difficult challenges posed by real world problems, of which the most pronounced is Bellman’s curse of dimensionality, which states that the sample size required to perform accurate prediction on problems with high dimensionality is beyond feasibility. This is because in high dimensional spaces data become extremely sparse and lie far apart from each other.
As a result, severe bias that affects any estimation process can be introduced in a high dimensional feature space with finite samples. Learning tasks with data represented as a collection of a very large number of features abound. For example, microarrays contain an overwhelming number of genes relative to the number of samples. The Internet is a vast repository of disparate information growing at an exponential rate. Efficient and effective document retrieval and classification systems are required to turn the ocean of bits around us into useful information, and eventually into knowledge. This is a challenging task, since a word-level representation of documents easily leads to 30,000 or more dimensions. This chapter discusses classification techniques to mitigate the curse of dimensionality and reduce bias, by estimating feature relevance and selecting features accordingly. This issue has both theoretical and practical relevance, since many applications can benefit from improvement in prediction performance.
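The chapter's own relevance-estimation techniques are not spelled out in this summary. As a minimal sketch of the general idea, the example below scores each feature by how far apart the two class means lie relative to the pooled spread, then keeps the top-ranked features; the scoring rule and data are illustrative assumptions, not the chapter's method.

```python
import statistics

def relevance(xs_pos, xs_neg):
    """Score one feature: separation of the class means relative to
    the pooled spread (larger = more relevant)."""
    spread = statistics.stdev(xs_pos + xs_neg)
    return abs(statistics.mean(xs_pos) - statistics.mean(xs_neg)) / spread

def select_features(pos, neg, k):
    """Rank feature columns by relevance and keep the top k indices."""
    n_feats = len(pos[0])
    scores = [
        relevance([row[j] for row in pos], [row[j] for row in neg])
        for j in range(n_feats)
    ]
    return sorted(range(n_feats), key=lambda j: scores[j], reverse=True)[:k]

# Invented toy data: feature 0 separates the classes, feature 1 is noise.
pos = [(5.0, 0.1), (5.2, 0.9), (4.9, 0.5)]
neg = [(1.0, 0.4), (1.1, 0.8), (0.9, 0.2)]
top = select_features(pos, neg, 1)
```

Discarding low-relevance features shrinks the dimensionality of the space in which distances are computed, which is the route to reduced bias that the chapter pursues.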


Author(s):  
Roberto Marmo

As a consequence of the expansion of modern technology, the number and variety of frauds are increasing dramatically. The reputational damage and losses caused are therefore primary motivations for the technologies and methodologies for fraud detection that have been applied successfully in some economic activities. Detection involves monitoring the behavior of users based on huge data sets, such as logged data and user behavior. The aim of this contribution is to show some data mining techniques for fraud detection and prevention, with applications in credit cards and telecommunications, within a business context of mining the data to achieve higher cost savings and of gathering potential legal evidence. The problem is very difficult because fraud takes many different forms and fraudsters are adaptive, usually looking for ways to evade every security measure.
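The "monitoring the behavior of users" idea can be illustrated with a deliberately simple per-user profile: flag a credit card transaction when it far exceeds the cardholder's historical average spend. The rule, factor, and data below are hypothetical illustrations, not techniques from the chapter.

```python
def flags_transaction(user_history, amount, factor=3.0):
    """Flag a transaction as suspicious when it exceeds `factor`
    times the user's average historical purchase amount."""
    avg = sum(user_history) / len(user_history)
    return amount > factor * avg

history = [20.0, 35.0, 25.0, 30.0]  # past purchase amounts for one user
```

Real systems combine many such behavioral signals precisely because, as the text notes, adaptive fraudsters quickly learn to stay under any single fixed threshold.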


2008 ◽  
pp. 1855-1876
Author(s):  
Anna Olecka

This chapter focuses on challenges in modeling credit risk for the new-account acquisition process in the credit card industry. The first section provides an overview and a brief history of credit scoring. The second section looks at some of the challenges specific to the credit industry. In many of these applications, the business objective is tied only indirectly to the classification scheme. Opposing objectives, such as response, profit and risk, often play a tug of war with each other. Solving a business problem of such complex nature often requires multiple models working jointly. The challenges for data mining lie in exploring solutions that go beyond traditional, well-documented methodology and in the need for simplifying assumptions, often necessitated by the realities of dataset sizes and/or implementation issues. Examples of such challenges form an illustrative example of a compromise between data mining theory and applications.

