Depth-Based Classification for Distributions with Nonconvex Support

Journal of Probability and Statistics ◽

10.1155/2013/629184 ◽

2013 ◽

Vol 2013 ◽

pp. 1-7 ◽

Cited By ~ 1

Author(s):

Daniel Hlubinka ◽

Ondrej Vencalek

Keyword(s):

Statistical Analysis ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Classification Problem ◽

Data Depth ◽

Misclassification Rate ◽

Skewed Data ◽

Weighted Version ◽

Linear Discriminant ◽

Normal Populations

Halfspace depth became a popular nonparametric tool for statistical analysis of multivariate data during the last two decades. One of applications of data depth considered recently in literature is the classification problem. The data depth approach is used instead of the linear discriminant analysis mostly to avoid the parametric assumptions and to get better classifier for data whose distribution is not elliptically symmetric, for example, skewed data. In our paper, we suggest to use weighted version of halfspace depth rather than the halfspace depth itself in order to obtain lower misclassification rate in the case of “nonconvex” distributions. Simulations show that the results of depth-based classifiers are comparable with linear discriminant analysis for two normal populations, while for nonelliptic distributions the classifier based on weighted halfspace depth outperforms both linear discriminant analysis and classifier based on the usual (nonweighted) halfspace depth.

Download Full-text

Effect Size Estimation and Misclassification Rate Based Variable Selection in Linear Discriminant Analysis

Journal of Data Science ◽

10.6339/jds.2013.11(3).1185 ◽

2021 ◽

Vol 11 (3) ◽

pp. 537-558

Author(s):

Bernd Klaus

Keyword(s):

Discriminant Analysis ◽

Variable Selection ◽

Linear Discriminant Analysis ◽

Effect Size ◽

Misclassification Rate ◽

Size Estimation ◽

Linear Discriminant ◽

Effect Size Estimation

Download Full-text

Appropriateness of Linear Discriminant and Multinomial Classification Analysis in Marketing Research

Journal of Marketing Research ◽

10.1177/002224377801500112 ◽

1978 ◽

Vol 15 (1) ◽

pp. 103-112 ◽

Cited By ~ 14

Author(s):

William R. Dillon ◽

Matthew Goldstein ◽

Leon G. Schiffman

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Marketing Research ◽

Classification Problem ◽

Relative Performance ◽

New Method ◽

Classification Methods ◽

Classification Analysis ◽

Linear Discriminant ◽

Usage Behavior

Buyer usage behavior data are used to compare the relative performance of a linear discriminant analysis and several multinomial classification methods. The potential shortcomings of each of the procedures investigated are cited, and a new method for determining the contribution of a variable to discrimination in the context of the multinomial classification problem also is presented.

Download Full-text

Bayesian estimation for misclassification rate in linear discriminant analysis

Japanese Journal of Statistics and Data Science ◽

10.1007/s42081-021-00139-7 ◽

2021 ◽

Author(s):

Koshiro Yonenaga ◽

Akio Suzukawa

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Bayesian Estimation ◽

Misclassification Rate ◽

Linear Discriminant

Download Full-text

Asymptotic approximation of misclassification probabilities in linear discriminant analysis with repeated measurements

Acta et Commentationes Universitatis Tartuensis de Mathematica ◽

10.12697/acutm.2021.25.05 ◽

2021 ◽

Vol 25 (1) ◽

pp. 67-85

Author(s):

Edward K. Ngailo ◽

Dietrich Von Rosen ◽

Martin Singull

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Asymptotic Approximation ◽

Repeated Measurements ◽

Asymptotic Approximations ◽

Multivariate Normal ◽

Linear Discriminant ◽

Normal Populations ◽

Misclassification Errors ◽

Observation Vector

We propose asymptotic approximations for the probabilities of misclassification in linear discriminant analysis when the group means follow a growth curve structure. The discriminant function can classify a new observation vector of p repeated measurements into one of several multivariate normal populations with equal covariance matrix. We derive certain relations of the statistics under consideration in order to obtain asymptotic approximation of misclassification errors for the two group case. Finally, we perform Monte Carlo simulations to evaluate the reliability of the proposed results.

Download Full-text

Sequential Data Fusion Techniques for the Authentication of the P.G.I. Senise (“Crusco”) Bell Pepper

Applied Sciences ◽

10.3390/app11041709 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1709

Author(s):

Alessandra Biancolillo ◽

Francesca Di Donato ◽

Francesco Merola ◽

Federico Marini ◽

Angelo Antonio D’Archivio

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Near Infrared ◽

External Validation ◽

Classification Problem ◽

Sweet Pepper ◽

Bell Pepper ◽

Spectroscopic Techniques ◽

Sequential Data ◽

Linear Discriminant

Bell pepper is the common name of the berry obtained from some varieties of the Capsicum annuum species. This agro-food is appreciated all over the world and represents one of the key ingredients of several traditional dishes. It is used as a fresh product, or dried and ground as a seasoning (e.g., paprika). Specific varieties of sweet pepper present organoleptic peculiarities and they have been awarded by quality marks as a further confirmation of their unicity (e.g., Piment d’Espelette, Pimiento de Herbón, Peperone di Senise). Due to the market value of this aliment, it can be subjected to frauds, such as adulterations and sophistication. The present study lays on these considerations and aims at developing a spectroscopy-based approach for authenticating Senise bell pepper and for detecting its adulteration with common paprika. In order to achieve this goal, 60 pure samples of bell pepper from Senise were analyzed by mid- and near-infrared spectroscopies. Then, in order to mimic the adulteration, 40 mixtures of Senise bell pepper and paprika were prepared and analyzed (by the same spectroscopic techniques). Eventually, two different multi-block classification approaches (sequential and orthogonalized partial least squares linear discriminant analysis and sequential and orthogonalized covariance selection linear discriminant analysis) were used to discriminate between pure and adulterated Senise bell pepper samples. Both proposed procedures achieved extremely successful results in external validation, correctly classifying all the (thirty-five) test samples, indicating that both approaches represent a winning solution for the investigated classification problem.

Download Full-text

Linear Discriminant Analysis of Normal Populations with Coinciding Covariance Matrices

Multivariate Statistical Analysis ◽

10.1007/978-94-015-9468-4_10 ◽

2000 ◽

pp. 156-168

Author(s):

V. Serdobolskii

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Covariance Matrices ◽

Linear Discriminant ◽

Normal Populations

Download Full-text

Performance of Linear Discriminant Analysis Using Different Robust Methods

European Journal of Pure and Applied Mathematics ◽

10.29020/nybg.ejpam.v11i1.3176 ◽

2018 ◽

Vol 11 (1) ◽

pp. 284 ◽

Cited By ~ 2

Author(s):

Mufda Jameel Alrawashdeh ◽

Taha Radwan Radwan ◽

Kalid Abunawas Abunawas

Keyword(s):

Statistical Analysis ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Multivariate Statistical Analysis ◽

Robust Methods ◽

Minimum Covariance Determinant ◽

Multivariate Statistical ◽

Linear Discriminant ◽

Misclassification Probability ◽

Conventional Banks

This study aims to combine the new deterministic minimum covariance determinant (DetMCD) algorithm with linear discriminant analysis (LDA) and compare it with the fast minimum covariance determinant (FastMCD), fast consistent high breakdown (FCH), and robust FCH (RFCH) algorithms. LDA classifies new observations into one of the unknown groups and it is widely used in multivariate statistical analysis. The LDA mean and covariance matrix parameters are highly influenced by outliers. The DetMCD algorithm is highly robust and resistant to outliers and it is constructed to overcome the outlier problem. Moreover, the DetMCD algorithm is used to estimate location and scatter matrices. The DetMCD, FastMCD, FCH, and RFCH algorithms are applied to estimate misclassification probability using robust LDA. All the algorithms are expected to improve the LDA model for classification purposes in banks, such as bankruptcy and failures, and to distinguish between Islamic and conventional banks. The performances of the estimators are investigated through simulation and actual data.

Download Full-text

Classification of Diesel Engine Health Using Sparse Linear Discriminant Analysis (SLDA)

ASME 2009 Dynamic Systems and Control Conference, Volume 1 ◽

10.1115/dscc2009-2790 ◽

2009 ◽

Author(s):

Neha Chandrachud ◽

Ravindra Kakade ◽

Peter H. Meckl ◽

Galen B. King ◽

Kristofer Jennings

Keyword(s):

Steady State ◽

Diesel Engine ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Optimal Number ◽

Classification Model ◽

Misclassification Rate ◽

Linear Discriminant ◽

State Data ◽

Input Variables

With requirements for on-board diagnostics on diesel engines becoming more stringent for the coming model years, diesel engine manufacturers must improve their ability to identify fault conditions that lead to increased exhaust emissions. This paper proposes a statistical classifier model to identify the state of the engine, i.e. healthy or faulty, using an optimal number of sensors based on the data acquired from the engine. The classification model proposed in this paper is based on Sparse Linear Discriminant Analysis. This technique performs Linear Discriminant Analysis with a sparseness criterion imposed such that classification, dimension reduction and feature selection are merged into one step. It was concluded that the analysis technique could produce 0% misclassification rate for the steady-state data acquired from the diesel engine using five input variables. The classifier model was also extended to transient operation of the engine. The misclassification rate in the case of transient data was reduced from 31% to 26% by using the steady-state data trained classifier using thirteen variables.

Download Full-text

Effect Size Estimation and Misclassification Rate Based Variable Selection in Linear Discriminant Analysis

Journal of Data Science ◽

10.6339/jds.201307_11(3).0008 ◽

2021 ◽

Vol 11 (3) ◽

pp. 537-558

Author(s):

Bernd Klaus

Keyword(s):

Discriminant Analysis ◽

Variable Selection ◽

Linear Discriminant Analysis ◽

Effect Size ◽

Misclassification Rate ◽

Size Estimation ◽

Linear Discriminant ◽

Effect Size Estimation

Download Full-text

$\mathcal{DBSDA}$ : Lowering the Bound of Misclassification Rate for Sparse Linear Discriminant Analysis via Model Debiasing

IEEE Transactions on Neural Networks and Learning Systems ◽

10.1109/tnnls.2018.2846783 ◽

2019 ◽

Vol 30 (3) ◽

pp. 707-717 ◽

Cited By ~ 4

Author(s):

Haoyi Xiong ◽

Wei Cheng ◽

Jiang Bian ◽

Wenqing Hu ◽

Zeyi Sun ◽

...

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Misclassification Rate ◽

Linear Discriminant

Download Full-text