REPMAC: A New Hybrid Approach to Highly Imbalanced Classification Problems

A new instance density-based synthetic minority oversampling method for imbalanced classification problems

Engineering Optimization ◽

10.1080/0305215x.2021.1982929 ◽

2021 ◽

pp. 1-15

Author(s):

Chung-Kang Ma ◽

You-Jin Park

Keyword(s):

Classification Problems ◽

Imbalanced Classification

Download Full-text

IA-SUWO: An Improving Adaptive semi-unsupervised weighted oversampling for imbalanced classification problems

Knowledge-Based Systems ◽

10.1016/j.knosys.2020.106116 ◽

2020 ◽

Vol 203 ◽

pp. 106116

Author(s):

Jianan Wei ◽

Haisong Huang ◽

Liguo Yao ◽

Yao Hu ◽

Qingsong Fan ◽

...

Keyword(s):

Classification Problems ◽

Imbalanced Classification

Download Full-text

Imbalanced Classification Problems: A Comparative Study of Non-ensemble and Ensemble-Based Approaches

10.1007/978-981-16-2709-5_36 ◽

2021 ◽

pp. 469-484

Author(s):

Satyam Maheshwari ◽

R. C. Jain ◽

R. S. Jadon

Keyword(s):

Comparative Study ◽

Classification Problems ◽

Imbalanced Classification

Download Full-text

Neuro-evolutionary models for imbalanced classification problems

Journal of King Saud University - Computer and Information Sciences ◽

10.1016/j.jksuci.2020.11.005 ◽

2020 ◽

Author(s):

Israa Al-Badarneh ◽

Maria Habib ◽

Ibrahim Aljarah ◽

Hossam Faris

Keyword(s):

Evolutionary Models ◽

Classification Problems ◽

Imbalanced Classification

Download Full-text

Constraint relaxation, cost-sensitive learning and bagging for imbalanced classification problems with outliers

Optimization Letters ◽

10.1007/s11590-015-0934-z ◽

2015 ◽

Vol 11 (5) ◽

pp. 915-928 ◽

Cited By ~ 5

Author(s):

Talayeh Razzaghi ◽

Petros Xanthopoulos ◽

Onur Şeref

Keyword(s):

Classification Problems ◽

Cost Sensitive Learning ◽

Imbalanced Classification ◽

Constraint Relaxation

Download Full-text

Evaluation of a new hybrid algorithm for highly imbalanced classification problems

International Journal of Hybrid Intelligent Systems ◽

10.3233/his-2011-0140 ◽

2011 ◽

Vol 8 (4) ◽

pp. 199-211

Author(s):

Hernán Ahumada ◽

Guillermo L. Grinblat ◽

Lucas C. Uzal ◽

Alejandro Ceccatto ◽

Pablo M. Granitto

Keyword(s):

Hybrid Algorithm ◽

Classification Problems ◽

Imbalanced Classification

Download Full-text

A New Diversity Technique for Imbalance Learning Ensembles

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.11251 ◽

2018 ◽

Vol 7 (2.14) ◽

pp. 478 ◽

Cited By ~ 2

Author(s):

Hartono . ◽

Opim Salim Sitompul ◽

Erna Budhiarti Nababan ◽

Tulus . ◽

Dahlan Abdullah ◽

...

Keyword(s):

Hybrid Approach ◽

Class Imbalance ◽

Machine Learning Techniques ◽

Classifier Ensembles ◽

Classification Problems ◽

Class Imbalance Problem ◽

Weighting Method ◽

Imbalance Problem ◽

Learning Ensembles ◽

Imbalance Learning

Data mining and machine learning techniques designed to solve classification problems require balanced class distribution. However, in reality sometimes the classification of datasets indicates the existence of a class represented by a large number of instances whereas there are classes with far fewer instances. This problem is known as the class imbalance problem. Classifier Ensembles is a method often used in overcoming class imbalance problems. Data Diversity is one of the cornerstones of ensembles. An ideal ensemble system should have accurrate individual classifiers and if there is an error it is expected to occur on different objects or instances. This research will present the results of overview and experimental study using Hybrid Approach Redefinition (HAR) Method in handling class imbalance and at the same time expected to get better data diversity. This research will be conducted using 6 datasets with different imbalanced ratios and will be compared with SMOTEBoost which is one of the Re-Weighting method which is often used in handling class imbalance. This study shows that the data diversity is related to performance in the imbalance learning ensembles and the proposed methods can obtain better data diversity.

Download Full-text

Self-Configuring Hybrid Evolutionary Algorithm for Fuzzy Imbalanced Classification with Adaptive Instance Selection

Journal of Artificial Intelligence and Soft Computing Research ◽

10.1515/jaiscr-2016-0013 ◽

2016 ◽

Vol 6 (3) ◽

pp. 173-188 ◽

Cited By ~ 13

Author(s):

Vladimir Stanovov ◽

Eugene Semenkin ◽

Olga Semenkina

Keyword(s):

Evolutionary Process ◽

Fuzzy Classification ◽

Instance Selection ◽

Problem Solver ◽

Data Sets ◽

Classification Problems ◽

Imbalanced Classification ◽

Novel Approach ◽

Training Samples ◽

Classification Quality

Abstract A novel approach for instance selection in classification problems is presented. This adaptive instance selection is designed to simultaneously decrease the amount of computation resources required and increase the classification quality achieved. The approach generates new training samples during the evolutionary process and changes the training set for the algorithm. The instance selection is guided by means of changing probabilities, so that the algorithm concentrates on problematic examples which are difficult to classify. The hybrid fuzzy classification algorithm with a self-configuration procedure is used as a problem solver. The classification quality is tested upon 9 problem data sets from the KEEL repository. A special balancing strategy is used in the instance selection approach to improve the classification quality on imbalanced datasets. The results prove the usefulness of the proposed approach as compared with other classification methods.

Download Full-text

A Distributed Methodology for Imbalanced Classification Problems

2012 11th International Symposium on Parallel and Distributed Computing ◽

10.1109/ispdc.2012.30 ◽

2012 ◽

Cited By ~ 3

Author(s):

Camelia Lemnaru ◽

Mihai Cuibus ◽

Adrian Bona ◽

Andrei Alic ◽

Rodica Potolea

Keyword(s):

Classification Problems ◽

Imbalanced Classification

Download Full-text

Application of an Interpretable Classification Model on Early Folding Residues during Protein Folding

10.1101/381483 ◽

2018 ◽

Author(s):

Sebastian Bittrich ◽

Marika Kaden ◽

Christoph Leberecht ◽

Florian Kaiser ◽

Thomas Villmann ◽

...

Keyword(s):

Machine Learning ◽

Protein Folding ◽

Learning Strategies ◽

Life Sciences ◽

Classification Model ◽

Classification Problems ◽

Hydrophobic Residues ◽

Imbalanced Classification ◽

Fine Grained ◽

Generalized Matrix

AbstractBackgroundMachine learning strategies are prominent tools for data analysis. Especially in life sciences, they have become increasingly important to handle the growing datasets collected by the scientific community. Meanwhile, algorithms improve in performance, but also gain complexity, and tend to neglect interpretability and comprehensiveness of the resulting models.ResultsGeneralized Matrix Learning Vector Quantization (GMLVQ) is a supervised, prototype-based machine learning method and provides comprehensive visualization capabilities not present in other classifiers which allow for a fine-grained interpretation of the data. In contrast to commonly used machine learning strategies, GMLVQ is well-suited for imbalanced classification problems which are frequent in life sciences. We present a Weka plug-in implementing GMLVQ. The feasibility of GMLVQ is demonstrated on a dataset of Early Folding Residues (EFR) that have been shown to initiate and guide the protein folding process. Using 27 features, an area under the receiver operating characteristic of 76.6% was achieved which is comparable to other state-of-the-art classifiers.ConclusionsThe application on EFR prediction demonstrates how an easy interpretation of classification models can promote the comprehension of biological mechanisms. The results shed light on the special features of EFR which were reported as most influential for the classification: EFR are embedded in ordered secondary structure elements and they participate in networks of hydrophobic residues. Visualization capabilities of GMLVQ are presented as we demonstrate how to interpret the results.

Download Full-text