On-line gradient learning algorithms for K-nearest neighbor classifiers

Performance Evaluation of Different Machine Learning Classification Algorithms for Diseases Diagnosis

International Journal of E-Health and Medical Communications ◽

10.4018/ijehmc.20211101oa09 ◽

2021 ◽

Vol 12 (6) ◽

pp. 0-0

Keyword(s):

Machine Learning ◽

Nearest Neighbor ◽

Performance Metrics ◽

Confusion Matrix ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Classification Algorithms ◽

K Nearest Neighbor ◽

Machine Learning Classification ◽

Nearest Neighbor Classifiers

Knowledge extraction within a healthcare field is a very challenging task since we are having many problems such as noise and imbalanced datasets. They are obtained from clinical studies where uncertainty and variability are popular. Lately, a wide number of machine learning algorithms are considered and evaluated to check their validity of being used in the medical field. Usually, the classification algorithms are compared against medical experts who are specialized in certain disease diagnoses and provide an effective methodological evaluation of classifiers by applying performance metrics. The performance metrics contain four criteria: accuracy, sensitivity, and specificity forming the confusion matrix of each used algorithm. We have utilized eight different well-known machine learning algorithms to evaluate their performances in six different medical datasets. Based on the experimental results we conclude that the XGBoost and K-Nearest Neighbor classifiers were the best overall among the used datasets and signs can be used for diagnosing various diseases.

Download Full-text

Ensembling evidential k-nearest neighbor classifiers through multi-modal perturbation

Applied Soft Computing ◽

10.1016/j.asoc.2006.10.002 ◽

2007 ◽

Vol 7 (3) ◽

pp. 1072-1083 ◽

Cited By ~ 29

Author(s):

Hakan Altınçay

Keyword(s):

Nearest Neighbor ◽

K Nearest Neighbor ◽

Nearest Neighbor Classifiers

Download Full-text

A Comparative Analysis of Machine Learning Algorithms Modeled from Machine Vision-Based Lettuce Growth Stage Classification in Smart Aquaponics

International Journal of Environmental Science and Development ◽

10.18178/ijesd.2020.11.9.1288 ◽

2020 ◽

Vol 11 (9) ◽

pp. 442-449 ◽

Cited By ~ 1

Author(s):

Sandy C. Lauguico ◽

◽

Ronnie S. Concepcion II ◽

Jonnel D. Alejandrino ◽

Rogelio Ruzcko Tobias ◽

...

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Machine Vision ◽

Nearest Neighbor ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Support Vector ◽

Urban Farming ◽

K Nearest Neighbor ◽

Lettuce Growth

The arising problem on food scarcity drives the innovation of urban farming. One of the methods in urban farming is the smart aquaponics. However, for a smart aquaponics to yield crops successfully, it needs intensive monitoring, control, and automation. An efficient way of implementing this is the utilization of vision systems and machine learning algorithms to optimize the capabilities of the farming technique. To realize this, a comparative analysis of three machine learning estimators: Logistic Regression (LR), K-Nearest Neighbor (KNN), and Linear Support Vector Machine (L-SVM) was conducted. This was done by modeling each algorithm from the machine vision-feature extracted images of lettuce which were raised in a smart aquaponics setup. Each of the model was optimized to increase cross and hold-out validations. The results showed that KNN having the tuned hyperparameters of n_neighbors=24, weights='distance', algorithm='auto', leaf_size = 10 was the most effective model for the given dataset, yielding a cross-validation mean accuracy of 87.06% and a classification accuracy of 91.67%.

Download Full-text

Book Genre Categorization Using Machine Learning Algorithms (K-Nearest Neighbor, Support Vector Machine and Logistic Regression) using Customized Dataset

International Journal of Computer Science and Mobile Computing ◽

10.47760/ijcsmc.2021.v10i03.002 ◽

2021 ◽

Vol 10 (3) ◽

pp. 14-25

Author(s):

Parilkumar Shiroya

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Logistic Regression ◽

Nearest Neighbor ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Support Vector ◽

K Nearest Neighbor

Download Full-text

Outlier Detection for Soft-Sensor Modeling Data Based on k-Nearest Neighbor

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.468-471.2504 ◽

2012 ◽

Vol 468-471 ◽

pp. 2504-2509

Author(s):

Qiang Da Yang ◽

Zhen Quan Liu

Keyword(s):

Outlier Detection ◽

Nearest Neighbor ◽

Detection Method ◽

Soft Sensor ◽

K Nearest Neighbor ◽

Sensor Model ◽

Sensor Technique ◽

On Line ◽

Modeling Data ◽

Sensor Modeling

The on-line estimation of some key hard-to-measure process variables by using soft-sensor technique has received extensive concern in industrial production process. The precision of on-line estimation is closely related to the accuracy of soft-sensor model, while the accuracy of soft-sensor model depends strongly on the accuracy of modeling data. Aiming at the special character of the definition for outliers in soft-sensor modeling process, an outlier detection method based on k-nearest neighbor (k-NN) is proposed in this paper. The proposed method can be realized conveniently from data without priori knowledge and assumption of the process. The simulation result and practical application show that the proposed outlier detection method based on k-NN has good detection effect and high application value.

Download Full-text

Investigating the Performance of Naive- Bayes Classifiers and K- Nearest Neighbor Classifiers

Journal of Convergence Information Technology ◽

10.4156/jcit.vol5.issue2.15 ◽

2010 ◽

Vol 5 (2) ◽

pp. 133-137 ◽

Cited By ~ 1

Author(s):

Mohammed J. Islam ◽

Q. M. Jonathan Wu ◽

Majid Ahmadi ◽

Maher A. SidAhmed

Keyword(s):

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

K Nearest Neighbor ◽

Nearest Neighbor Classifiers

Download Full-text

Modeling Barrier Island Habitats Using Landscape Position Information

Remote Sensing ◽

10.3390/rs11080976 ◽

2019 ◽

Vol 11 (8) ◽

pp. 976

Author(s):

Nicholas M. Enwright ◽

Lei Wang ◽

Hongqing Wang ◽

Michael J. Osland ◽

Laura C. Feher ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Nearest Neighbor ◽

Learning Algorithms ◽

Barrier Island ◽

Barrier Islands ◽

Machine Learning Algorithms ◽

Landscape Position ◽

K Nearest Neighbor ◽

Island Habitats

Barrier islands are dynamic environments because of their position along the marine–estuarine interface. Geomorphology influences habitat distribution on barrier islands by regulating exposure to harsh abiotic conditions. Researchers have identified linkages between habitat and landscape position, such as elevation and distance from shore, yet these linkages have not been fully leveraged to develop predictive models. Our aim was to evaluate the performance of commonly used machine learning algorithms, including K-nearest neighbor, support vector machine, and random forest, for predicting barrier island habitats using landscape position for Dauphin Island, Alabama, USA. Landscape position predictors were extracted from topobathymetric data. Models were developed for three tidal zones: subtidal, intertidal, and supratidal/upland. We used a contemporary habitat map to identify landscape position linkages for habitats, such as beach, dune, woody vegetation, and marsh. Deterministic accuracy, fuzzy accuracy, and hindcasting were used for validation. The random forest algorithm performed best for intertidal and supratidal/upland habitats, while the K-nearest neighbor algorithm performed best for subtidal habitats. A posteriori application of expert rules based on theoretical understanding of barrier island habitats enhanced model results. For the contemporary model, deterministic overall accuracy was nearly 70%, and fuzzy overall accuracy was over 80%. For the hindcast model, deterministic overall accuracy was nearly 80%, and fuzzy overall accuracy was over 90%. We found machine learning algorithms were well-suited for predicting barrier island habitats using landscape position. Our model framework could be coupled with hydrodynamic geomorphologic models for forecasting habitats with accelerated sea-level rise, simulated storms, and restoration actions.

Download Full-text

Speculate-correct error bounds for k-nearest neighbor classifiers

Machine Learning ◽

10.1007/s10994-019-05814-1 ◽

2019 ◽

Vol 108 (12) ◽

pp. 2087-2111 ◽

Cited By ~ 1

Author(s):

Eric Bax ◽

Lingjie Weng ◽

Xu Tian

Keyword(s):

Error Bounds ◽

Nearest Neighbor ◽

K Nearest Neighbor ◽

Nearest Neighbor Classifiers

Download Full-text

Using a Genetic Algorithm for Editing k-Nearest Neighbor Classifiers

Intelligent Data Engineering and Automated Learning - IDEAL 2007 - Lecture Notes in Computer Science ◽

10.1007/978-3-540-77226-2_114 ◽

2007 ◽

pp. 1141-1150 ◽

Cited By ~ 7

Author(s):

R. Gil-Pita ◽

X. Yao

Keyword(s):

Genetic Algorithm ◽

Nearest Neighbor ◽

K Nearest Neighbor ◽

Nearest Neighbor Classifiers

Download Full-text

Identification of Leukemia Subtypes from Microscopic Images Using Convolutional Neural Network

Diagnostics ◽

10.3390/diagnostics9030104 ◽

2019 ◽

Vol 9 (3) ◽

pp. 104 ◽

Cited By ~ 11

Author(s):

Ahmed ◽

Yigit ◽

Isik ◽

Alpkocak

Keyword(s):

Machine Learning ◽

Data Augmentation ◽

Nearest Neighbor ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Training Data ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Set ◽

Leukemia Data

Leukemia is a fatal cancer and has two main types: Acute and chronic. Each type has two more subtypes: Lymphoid and myeloid. Hence, in total, there are four subtypes of leukemia. This study proposes a new approach for diagnosis of all subtypes of leukemia from microscopic blood cell images using convolutional neural networks (CNN), which requires a large training data set. Therefore, we also investigated the effects of data augmentation for an increasing number of training samples synthetically. We used two publicly available leukemia data sources: ALL-IDB and ASH Image Bank. Next, we applied seven different image transformation techniques as data augmentation. We designed a CNN architecture capable of recognizing all subtypes of leukemia. Besides, we also explored other well-known machine learning algorithms such as naive Bayes, support vector machine, k-nearest neighbor, and decision tree. To evaluate our approach, we set up a set of experiments and used 5-fold cross-validation. The results we obtained from experiments showed that our CNN model performance has 88.25% and 81.74% accuracy, in leukemia versus healthy and multiclass classification of all subtypes, respectively. Finally, we also showed that the CNN model has a better performance than other wellknown machine learning algorithms.

Download Full-text

On-line gradient learning algorithms for K-nearest neighbor classifiers

Performance Evaluation of Different Machine Learning Classification Algorithms for Diseases Diagnosis

Ensembling evidential k-nearest neighbor classifiers through multi-modal perturbation

A Comparative Analysis of Machine Learning Algorithms Modeled from Machine Vision-Based Lettuce Growth Stage Classification in Smart Aquaponics

Book Genre Categorization Using Machine Learning Algorithms (K-Nearest Neighbor, Support Vector Machine and Logistic Regression) using Customized Dataset﻿

Outlier Detection for Soft-Sensor Modeling Data Based on k-Nearest Neighbor

Investigating the Performance of Naive- Bayes Classifiers and K- Nearest Neighbor Classifiers

Modeling Barrier Island Habitats Using Landscape Position Information

Speculate-correct error bounds for k-nearest neighbor classifiers

Using a Genetic Algorithm for Editing k-Nearest Neighbor Classifiers

Identification of Leukemia Subtypes from Microscopic Images Using Convolutional Neural Network

Book Genre Categorization Using Machine Learning Algorithms (K-Nearest Neighbor, Support Vector Machine and Logistic Regression) using Customized Dataset