Two-Stage Automobile Insurance Fraud Detection by Using Optimized Fuzzy C-Means Clustering and Supervised Learning

Sharmila Subudhi; Suvasini Panigrahi

doi:10.4018/ijisp.2020070102

Two-Stage Automobile Insurance Fraud Detection by Using Optimized Fuzzy C-Means Clustering and Supervised Learning

International Journal of Information Security and Privacy ◽

10.4018/ijisp.2020070102 ◽

2020 ◽

Vol 14 (3) ◽

pp. 18-37 ◽

Cited By ~ 1

Author(s):

Sharmila Subudhi ◽

Suvasini Panigrahi

Keyword(s):

Fraud Detection ◽

Support Vector ◽

Group Method ◽

Final Decision ◽

Automobile Insurance ◽

Insurance Fraud ◽

Two Stage ◽

Data Set ◽

Fuzzy C Means ◽

Fuzzy C Means Clustering

A novel two-stage automobile insurance fraud detection system is proposed that initially extracts a test set from the original imbalanced insurance dataset. A genetic algorithm based optimized fuzzy c-means clustering is then applied on the remaining data set for undersampling the majority samples by eliminating the outliers among them. Thereafter, the detection of the fraudulent claims occurs in two stages. In the first stage, each insurance record is passed to the clustering module that identifies the claim as genuine, malicious, or suspicious. The genuine and malicious samples are removed and only the suspicious instances are further scrutinized in the second stage by four trained supervised classifiers − Decision Tree, Support Vector Machine, Group Method for Data Handling and Multi-Layer Perceptron individually for final decision making. Extensive experiments and comparative analysis with another recent approach using a real-world automobile insurance dataset justifies the effectiveness of the proposed system.

Download Full-text

Use of optimized Fuzzy C-Means clustering and supervised classifiers for automobile insurance fraud detection

Journal of King Saud University - Computer and Information Sciences ◽

10.1016/j.jksuci.2017.09.010 ◽

2020 ◽

Vol 32 (5) ◽

pp. 568-575 ◽

Cited By ~ 11

Author(s):

Sharmila Subudhi ◽

Suvasini Panigrahi

Keyword(s):

Fraud Detection ◽

Automobile Insurance ◽

Insurance Fraud ◽

Fuzzy C Means ◽

Supervised Classifiers ◽

Fuzzy C Means Clustering

Download Full-text

A hybrid mobile call fraud detection model using optimized fuzzy C-means clustering and group method of data handling-based network

Vietnam Journal of Computer Science ◽

10.1007/s40595-018-0116-x ◽

2018 ◽

Vol 5 (3-4) ◽

pp. 205-217 ◽

Cited By ~ 4

Author(s):

Sharmila Subudhi ◽

Suvasini Panigrahi

Keyword(s):

Fraud Detection ◽

Group Method ◽

Data Handling ◽

Fuzzy C Means ◽

Detection Model ◽

Fuzzy C Means Clustering

Download Full-text

Automobile Insurance Fraud Detection Using Social Network Analysis

Applications of Data Management and Analysis - Lecture Notes in Social Networks ◽

10.1007/978-3-319-95810-1_2 ◽

2018 ◽

pp. 11-16 ◽

Cited By ~ 3

Author(s):

Arezo Bodaghi ◽

Babak Teimourpour

Keyword(s):

Social Network ◽

Social Network Analysis ◽

Network Analysis ◽

Fraud Detection ◽

Automobile Insurance ◽

Insurance Fraud

Download Full-text

Fuzzy Clustering

Advances in Business Information Systems and Analytics - Handbook of Research on Intelligent Techniques and Modeling Applications in Marketing Analytics ◽

10.4018/978-1-5225-0997-4.ch003 ◽

2017 ◽

pp. 40-61 ◽

Cited By ~ 1

Author(s):

Mashhour H. Baeshen ◽

Malcolm J. Beynon ◽

Kate L. Daunt

Keyword(s):

Data Analysis ◽

Service Quality ◽

Mobile Phone ◽

Fuzzy Clustering ◽

Data Set ◽

Clustering Techniques ◽

Fuzzy C Means ◽

Fuzzy Environment ◽

External Variables ◽

Fuzzy C Means Clustering

This chapter presents a study of the development of the clustering methodology to data analysis, with particular attention to the analysis from a crisp environment to a fuzzy environment. An applied problem concerning service quality (using SERVQUAL) of mobile phone users, and subsequent loyalty and satisfaction forms the data set to demonstrate the clustering issue. Following details on both the crisp k-means and fuzzy c-means clustering techniques, comparable results from their analysis are shown, on a subset of data, to enable both graphical and statistical elucidation. Fuzzy c-means is then employed on the full SERVQUAL dimensions, and the established results interpreted before tested on external variables, namely the level of loyalty and satisfaction across the different clusters established.

Download Full-text

Fuzzy clustering using salp swarm algorithm for automobile insurance fraud detection

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-169944 ◽

2019 ◽

Vol 36 (3) ◽

pp. 2333-2344 ◽

Cited By ~ 12

Author(s):

Santosh Kumar Majhi ◽

Subho Bhatachharya ◽

Rosy Pradhan ◽

Shubhra Biswal

Keyword(s):

Fuzzy Clustering ◽

Fraud Detection ◽

Automobile Insurance ◽

Insurance Fraud ◽

Salp Swarm Algorithm ◽

Swarm Algorithm

Download Full-text

Prognosis of Diabetes Using Data mining Approach-Fuzzy C Means Clustering and Support Vector Machine

International Journal of Computer Trends and Technology ◽

10.14445/22312803/ijctt-v11p120 ◽

2014 ◽

Vol 11 (2) ◽

pp. 94-98 ◽

Cited By ~ 14

Author(s):

Ravi Sanakal ◽

◽

Smt. T Jayakumari

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Support Vector ◽

Fuzzy C Means ◽

Data Mining Approach ◽

Fuzzy C Means Clustering ◽

Using Data

Download Full-text

Enhanced Fuzzy C-Means Clustering with Optimization of Support Vector Regression for Imputation of Medical Database

Journal of Medical Imaging and Health Informatics ◽

10.1166/jmihi.2016.1859 ◽

2016 ◽

Vol 6 (7) ◽

pp. 1612-1616

Author(s):

S. Thirukumaran ◽

A. Sumathi

Keyword(s):

Support Vector Regression ◽

Support Vector ◽

Medical Database ◽

Fuzzy C Means ◽

Fuzzy C Means Clustering

Download Full-text

Fast Unsupervised Automobile Insurance Fraud Detection Based on Spectral Ranking of Anomalies

International Journal of Engineering ◽

10.5829/ije.2020.33.07a.10 ◽

2020 ◽

Vol 33 (7) ◽

Keyword(s):

Fraud Detection ◽

Automobile Insurance ◽

Insurance Fraud

Download Full-text

Electricity Load Prediction using Fuzzy c-means Clustering EMD based Support Vector Regression for University Building

2019 International Conference on Fuzzy Theory and Its Applications (iFUZZY) ◽

10.1109/ifuzzy46984.2019.9066226 ◽

2019 ◽

Author(s):

Irene Karijadi ◽

Shuo Yan Chou ◽

Anindhita Dewabharata ◽

Ray Guang Cheng

Keyword(s):

Support Vector Regression ◽

Support Vector ◽

Load Prediction ◽

Fuzzy C Means ◽

Fuzzy C Means Clustering ◽

Electricity Load

Download Full-text

Big-data driven building retrofitting: An integrated Support Vector Machines and Fuzzy C-means clustering method

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/588/4/042013 ◽

2020 ◽

Vol 588 ◽

pp. 042013

Author(s):

Weizhuo Lu ◽

Kailun Feng

Keyword(s):

Big Data ◽

Support Vector Machines ◽

Data Driven ◽

Support Vector ◽

Clustering Method ◽

Fuzzy C Means ◽

Vector Machines ◽

Fuzzy C Means Clustering

Download Full-text