Fuzzy Rule Extraction Using Recombined RecBF for Very-Imbalanced Datasets

Author(s):  
Vicenç Soler ◽  
Jordi Roig ◽  
Marta Prim
Author(s):  
M. A.H. Farquad ◽  
V. Ravi ◽  
Raju S. Bapi

Support vector machines (SVMs) have proved to be a good alternative compared to other machine learning techniques specifically for classification problems. However just like artificial neural networks (ANN), SVMs are also black box in nature because of its inability to explain the knowledge learnt in the process of training, which is very crucial in some applications like medical diagnosis, security and bankruptcy prediction etc. In this chapter a novel hybrid approach for fuzzy rule extraction based on SVM is proposed. This approach handles rule-extraction as a learning task, which proceeds in two major steps. In the first step the authors use labeled training patterns to build an SVM model, which in turn yields the support vectors. In the second step extracted support vectors are used as input patterns to fuzzy rule based systems (FRBS) to generate fuzzy “if-then” rules. To study the effectiveness and validity of the extracted fuzzy rules, the hybrid SVM+FRBS is compared with other classification techniques like decision tree (DT), radial basis function network (RBF) and adaptive network based fuzzy inference system. To illustrate the effectiveness of the hybrid developed, the authors applied it to solve a bank bankruptcy prediction problem. The dataset used pertain to Spanish, Turkish and US banks. The quality of the extracted fuzzy rules is evaluated in terms of fidelity, coverage and comprehensibility.


Algorithms ◽  
2021 ◽  
Vol 14 (2) ◽  
pp. 54
Author(s):  
Chen Fu ◽  
Jianhua Yang

The problem of classification for imbalanced datasets is frequently encountered in practical applications. The data to be classified in this problem are skewed, i.e., the samples of one class (the minority class) are much less than those of other classes (the majority class). When dealing with imbalanced datasets, most classifiers encounter a common limitation, that is, they often obtain better classification performances on the majority classes than those on the minority class. To alleviate the limitation, in this study, a fuzzy rule-based modeling approach using information granules is proposed. Information granules, as some entities derived and abstracted from data, can be used to describe and capture the characteristics (distribution and structure) of data from both majority and minority classes. Since the geometric characteristics of information granules depend on the distance measures used in the granulation process, the main idea of this study is to construct information granules on each class of imbalanced data using Minkowski distance measures and then to establish the classification models by using “If-Then” rules. The experimental results involving synthetic and publicly available datasets reflect that the proposed Minkowski distance-based method can produce information granules with a series of geometric shapes and construct granular models with satisfying classification performance for imbalanced datasets.


Sign in / Sign up

Export Citation Format

Share Document