Extreme Multiclass Classification Criteria

Anna Choromanska; Ish Kumar Jain

doi:10.3390/computation7010016

Extreme Multiclass Classification Criteria

Computation ◽

10.3390/computation7010016 ◽

2019 ◽

Vol 7 (1) ◽

pp. 16

Author(s):

Anna Choromanska ◽

Ish Kumar Jain

Keyword(s):

Decision Tree ◽

Objective Function ◽

Multiclass Classification ◽

Closed Form Expression ◽

Classification Error ◽

Complete Proof ◽

Surrogate Function ◽

Weak Hypothesis ◽

Multi Class Classification ◽

And Training

We analyze the theoretical properties of the recently proposed objective function for efficient online construction and training of multiclass classification trees in the settings where the label space is very large. We show the important properties of this objective and provide a complete proof that maximizing it simultaneously encourages balanced trees and improves the purity of the class distributions at subsequent levels in the tree. We further explore its connection to the three well-known entropy-based decision tree criteria, i.e., Shannon entropy, Gini-entropy and its modified variant, for which efficient optimization strategies are largely unknown in the extreme multiclass setting. We show theoretically that this objective can be viewed as a surrogate function for all of these entropy criteria and that maximizing it indirectly optimizes them as well. We derive boosting guarantees and obtain a closed-form expression for the number of iterations needed to reduce the considered entropy criteria below an arbitrary threshold. The obtained theorem relies on a weak hypothesis assumption that directly depends on the considered objective function. Finally, we prove that optimizing the objective directly reduces the multi-class classification error of the decision tree.

Download Full-text

The Application Based on Decision Tree SVM for Multi-class Classification

Proceedings of the 2015 International Conference on Electrical, Computer Engineering and Electronics ◽

10.2991/icecee-15.2015.312 ◽

2015 ◽

Author(s):

Huifang Hou ◽

Ping Han ◽

Dan Cao

Keyword(s):

Decision Tree ◽

Multi Class Classification

Download Full-text

A novel hybrid intelligent method based on C4.5 decision tree classifier and one-against-all approach for multi-class classification problems

Expert Systems with Applications ◽

10.1016/j.eswa.2007.11.051 ◽

2009 ◽

Vol 36 (2) ◽

pp. 1587-1592 ◽

Cited By ~ 137

Author(s):

Kemal Polat ◽

Salih Güneş

Keyword(s):

Decision Tree ◽

Classification Problems ◽

Decision Tree Classifier ◽

Tree Classifier ◽

C4.5 Decision Tree ◽

Multi Class Classification ◽

Hybrid Intelligent Method

Download Full-text

A novel Bagged Naïve Bayes-Decision Tree approach for multi-class classification problems

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-169937 ◽

2019 ◽

Vol 36 (3) ◽

pp. 2261-2271 ◽

Cited By ~ 6

Author(s):

Namrata Singh ◽

Pradeep Singh

Keyword(s):

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes ◽

Classification Problems ◽

Multi Class Classification ◽

Tree Approach

Download Full-text

An Optimized Multi-class Classification Algorithm Based on SVM Decision Tree

2010 Second International Conference on Information Technology and Computer Science ◽

10.1109/itcs.2010.17 ◽

2010 ◽

Author(s):

Chen Donghui ◽

Liu Zhijing

Keyword(s):

Decision Tree ◽

Classification Algorithm ◽

Multi Class Classification

Download Full-text

Weighted Oblique Decision Trees

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33015621 ◽

2019 ◽

Vol 33 ◽

pp. 5621-5627

Author(s):

Bin-Bin Yang ◽

Song-Qing Shen ◽

Wei Gao

Keyword(s):

Decision Tree ◽

Objective Function ◽

Decision Trees ◽

Information Entropy ◽

Heuristic Algorithms ◽

Continuous Optimization ◽

Tree Structure ◽

The Past ◽

Axis Parallel ◽

Random Initialization

Decision trees have attracted much attention during the past decades. Previous decision trees include axis-parallel and oblique decision trees; both of them try to find the best splits via exhaustive search or heuristic algorithms in each iteration. Oblique decision trees generally simplify tree structure and take better performance, but are always accompanied with higher computation, as well as the initialization with the best axis-parallel splits. This work presents the Weighted Oblique Decision Tree (WODT) based on continuous optimization with random initialization. We consider different weights of each instance for child nodes at all internal nodes, and then obtain a split by optimizing the continuous and differentiable objective function of weighted information entropy. Extensive experiments show the effectiveness of the proposed algorithm.

Download Full-text

Conceptual Approach to Predict Loan Defaults Using Decision Trees

Advances in Business Information Systems and Analytics - Sentiment Analysis and Knowledge Discovery in Contemporary Business ◽

10.4018/978-1-5225-4999-4.ch009 ◽

2019 ◽

pp. 148-161 ◽

Cited By ~ 1

Author(s):

Syed Muzamil Basha ◽

Dharmendra Singh Rajput ◽

N. Ch. S. N. Iyengar

Keyword(s):

Decision Tree ◽

Decision Trees ◽

Prediction Algorithm ◽

Classification Error ◽

Selection Algorithm ◽

Decision Tree Classifier ◽

Time Data ◽

Conceptual Approach ◽

Tree Classifier ◽

Loan Defaults

In this chapter, the authors show how to build a decision tree from given real-time data. They interpret the output of decision tree by learning decision tree classifier using really recursive greedy algorithm. Feature selection is made based on classification error using the algorithm called feature split selection algorithm (FSSA), with all different possible stopping conditions for splitting. The authors perform prediction with decision trees using decision tree prediction algorithm (DTPA), followed by multiclass predictions and their probabilities. Finally, they perform splitting procedure on real continuous value input using threshold split selection algorithm (TSSA).

Download Full-text

Evaluation of Distance Measures for Multi-class Classification in Binary SVM Decision Tree

Artificial Intelligence and Soft Computing - Lecture Notes in Computer Science ◽

10.1007/978-3-642-13208-7_55 ◽

2010 ◽

pp. 437-444 ◽

Cited By ~ 5

Author(s):

Gjorgji Madzarov ◽

Dejan Gjorgjevikj

Keyword(s):

Decision Tree ◽

Distance Measures ◽

Multi Class Classification

Download Full-text

Least squares twin SVM decision tree for multi-class classification

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) ◽

10.1109/cisp-bmei.2016.7853033 ◽

2016 ◽

Cited By ~ 1

Author(s):

Qing Yu ◽

Lihui Wang

Keyword(s):

Decision Tree ◽

Least Squares ◽

Multi Class Classification

Download Full-text

Decision tree support vector machine based on genetic algorithm for multi-class classification

Journal of Systems Engineering and Electronics ◽

10.3969/j.issn.1004-4132.2011.02.020 ◽

2011 ◽

Vol 22 (2) ◽

pp. 322-326 ◽

Cited By ~ 7

Author(s):

Huanhuan Chen ◽

Qiang Wang ◽

Yi Shen

Keyword(s):

Genetic Algorithm ◽

Support Vector Machine ◽

Decision Tree ◽

Support Vector ◽

Multi Class Classification

Download Full-text

Training Feedforward Neural Networks with Gain Constraints

Neural Computation ◽

10.1162/089976600300015600 ◽

2000 ◽

Vol 12 (4) ◽

pp. 811-829 ◽

Cited By ~ 9

Author(s):

Eric Hartman

Keyword(s):

Neural Networks ◽

Objective Function ◽

Network Models ◽

Feedforward Neural Networks ◽

Bound Constraints ◽

Neural Network Models ◽

Optimization Control ◽

Input Variables ◽

Derivatives Of ◽

And Training

Inaccurate input-output gains (partial derivatives of outputs with respect to inputs) are common in neural network models when input variables are correlated or when data are incomplete or inaccurate. Accurate gains are essential for optimization, control, and other purposes. We develop and explore a method for training feedforward neural networks subject to inequality or equality-bound constraints on the gains of the learned mapping. Gain constraints are implemented as penalty terms added to the objective function, and training is done using gradient descent. Adaptive and robust procedures are devised for balancing the relative strengths of the various terms in the objective function, which is essential when the constraints are inconsistent with the data. The approach has the virtue that the model domain of validity can be extended via extrapolation training, which can dramatically improve generalization. The algorithm is demonstrated here on artificial and real-world problems with very good results and has been advantageously applied to dozens of models currently in commercial use.

Download Full-text