A Hybrid Classification Approach That Combines K-Nearest Neighbor and Helps Vector Machine Will Provide Results That Are Closer to The True Value

International Journal of Scientific Research in Science Engineering and Technology ◽

10.32628/ijsrset2183131 ◽

2021 ◽

pp. 572-580

Author(s):

Rajni Bhalla ◽

Jyoti

Keyword(s):

Support Vector Machine ◽

Classification System ◽

Classification Accuracy ◽

Nearest Neighbor ◽

Classification Model ◽

Support Vector ◽

K Nearest Neighbor ◽

Classification Approach ◽

Knn Classification ◽

Hybrid Classification

To construct a new text message classifier, this paper combines the K-nearest neighbor (KNN) classification approach with the support vector machine (SVM) training algorithm. The hybrid classification system is built by combining KNN and Support Vector Machine is abbreviated as K-VM. Due to its flexibility and reliability in handling different forms of classification activities, the KNN has been stated as one of the most frequently used classification approaches. The KNN faces a significant challenge in determining the acceptable value for parameter K to ensure good classification efficacy. This is because the value of parameter K has a significant effect on the KNN classifier's accuracy. The KNN is a method of learning that is based on laziness that holds the entire training examples before classification time, in addition to deciding the optimum value of parameter K. As a result, as the value of parameter K increases, the KNN's computational method becomes more intensive. This paper proposes the K-VM hybrid classification system to reduce the impact of parameters on classification accuracy. The Euclidean distance function is used to measure the average distance between the testing data point and each range in SVs in various categories. Experiments on a variety of benchmark datasets show that the K-VM approach outperforms the conventional KNN classification model in classification accuracy.

Download Full-text

Comparison of Support Vector Machine and K-Nearest Neighbor for Baby Foot Identification based on Image Geometric Characteristics

Jurnal Ilmiah Merpati (Menara Penelitian Akademika Teknologi Informasi) ◽

10.24843/jim.2021.v09.i01.p08 ◽

2021 ◽

pp. 84

Author(s):

Angga Pratama Nugraha ◽

I Nyoman Piarsa ◽

I Made Suwija Putra

Keyword(s):

Support Vector Machine ◽

Nearest Neighbor ◽

Digital Data ◽

Image Feature ◽

Classification Method ◽

Gaussian Kernel ◽

Support Vector ◽

Identification System ◽

K Nearest Neighbor ◽

Knn Classification

Biometric recognition of infant identification systems is critical in security access for identification and verification systems. However, until now, hospitals or health centres in Indonesia still use conventional biometric identification, such as stamping or inking on the soles of babies' feet affixed to paper and are very vulnerable to the risk of damage or loss of data. To resolve this problem, computer vision technology can accurately identify the baby's feet' soles with the final result in the form of digital data. This study compares the classification method of baby feet using the SVM (Support Vector Machine) algorithm with the K-Nearest Neighbor algorithm. The baby's feet understudy image was taken using a cellphone camera with sample data of 3 months old babies. Comparing the SVM and KNN classification methods obtained high accuracy, precision and recall values, namely 98.80% accuracy, 89.51% precision and 88.00% recall. (for the SVM Gaussian kernel classification), with an accuracy of 99.08%, 92.65% precision and 90.75% recall (for the KNN Ecluidean Distance classification), it can be concluded that the KNN classification method using Euclidean distance is the best for applied in the baby palm identification system using the geometric image feature.

Download Full-text

A hybrid text classification approach with low dependency on parameter by integrating K-nearest neighbor and support vector machine

Expert Systems with Applications ◽

10.1016/j.eswa.2012.02.068 ◽

2012 ◽

Vol 39 (15) ◽

pp. 11880-11888 ◽

Cited By ~ 46

Author(s):

Chin Heng Wan ◽

Lam Hong Lee ◽

Rajprasad Rajkumar ◽

Dino Isa

Keyword(s):

Support Vector Machine ◽

Text Classification ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor ◽

Classification Approach

Download Full-text

A Hybrid Classification Approach Based on Support Vector Machine and K-Nearest Neighbor for Remote Sensing Data

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001417500343 ◽

2017 ◽

Vol 31 (10) ◽

pp. 1750034 ◽

Cited By ~ 8

Author(s):

Gulnaz Alimjan ◽

Tieli Sun ◽

Hurxida Jumahun ◽

Yu Guan ◽

Wanting Zhou ◽

...

Keyword(s):

Remote Sensing ◽

Classification Accuracy ◽

Nearest Neighbor ◽

Remote Sensing Data ◽

Support Vector ◽

K Nearest Neighbor ◽

Sensing Data ◽

Knn Classifier ◽

Training Samples ◽

Hybrid Classification

Analysis and classification for remote sensing landscape based on remote sensing imagery is a popular research topic. In this paper, we propose a new remote sensing data classifier by incorporating the support vector machine (SVM) learning information into the K-nearest neighbor (KNN) classifier. The SVM is well known for its extraordinary generalization capability even with limited learning samples, and it is very useful for remote sensing applications as data samples are usually limited. The KNN has been widely used in data classification due to its simplicity and effectiveness. However, the KNN is instance-based and needs to keep all the training samples for classification, which could cause not only high computation complexity but also overfitting problems. Meanwhile, the performance of the KNN classifier is sensitive to the neighborhood size [Formula: see text] and how to select the value of the parameter [Formula: see text] relies heavily on practice and experience. Based on the observations that the SVM can contribute to the KNN on the problems of smaller training samples size as well as the selection of the parameter [Formula: see text], we propose a support vector nearest neighbor (abbreviated as SV-NN) hybrid classification approach which can simplify the parameter selection while maintaining classification accuracy. The proposed approach is consist of two stages. In the first stage, the SVM is performed on the training samples to obtain the reduced support vectors (SVs) for each of the sample categories. In the second stage, a nearest neighbor classifier (NNC) is used to classify a testing sample, i.e. the average Euclidean distance between the testing data point to each set of SVs from different categories is calculated and the NNC identifies the category with minimum distance. To evaluate the effectiveness of the proposed approach, firstly experiments of classification for samples from remote sensing data are evaluated, and then experiments of identifying different land covers regions in the remote sensing images are evaluated. Experimental results show that the SV-NN approach maintains good classification accuracy while reduces the training samples compared with the conventional SVM and KNN classification model.

Download Full-text

Framing Twitter Public Sentiment on Nigerian Government COVID-19 Palliatives Distribution Using Machine Learning

Sustainability ◽

10.3390/su13063497 ◽

2021 ◽

Vol 13 (6) ◽

pp. 3497

Author(s):

Hassan Adamu ◽

Syaheerah Lebai Lutfi ◽

Nurul Hashimah Ahamed Hassain Malim ◽

Rohail Hassan ◽

Assunta Di Vaio ◽

...

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Primary Objective ◽

Support Vector ◽

Standard English ◽

Emotion Classification ◽

K Nearest Neighbor ◽

The Public ◽

The Government

Sustainable development plays a vital role in information and communication technology. In times of pandemics such as COVID-19, vulnerable people need help to survive. This help includes the distribution of relief packages and materials by the government with the primary objective of lessening the economic and psychological effects on the citizens affected by disasters such as the COVID-19 pandemic. However, there has not been an efficient way to monitor public funds’ accountability and transparency, especially in developing countries such as Nigeria. The understanding of public emotions by the government on distributed palliatives is important as it would indicate the reach and impact of the distribution exercise. Although several studies on English emotion classification have been conducted, these studies are not portable to a wider inclusive Nigerian case. This is because Informal Nigerian English (Pidgin), which Nigerians widely speak, has quite a different vocabulary from Standard English, thus limiting the applicability of the emotion classification of Standard English machine learning models. An Informal Nigerian English (Pidgin English) emotions dataset is constructed, pre-processed, and annotated. The dataset is then used to classify five emotion classes (anger, sadness, joy, fear, and disgust) on the COVID-19 palliatives and relief aid distribution in Nigeria using standard machine learning (ML) algorithms. Six ML algorithms are used in this study, and a comparative analysis of their performance is conducted. The algorithms are Multinomial Naïve Bayes (MNB), Support Vector Machine (SVM), Random Forest (RF), Logistics Regression (LR), K-Nearest Neighbor (KNN), and Decision Tree (DT). The conducted experiments reveal that Support Vector Machine outperforms the remaining classifiers with the highest accuracy of 88%. The “disgust” emotion class surpassed other emotion classes, i.e., sadness, joy, fear, and anger, with the highest number of counts from the classification conducted on the constructed dataset. Additionally, the conducted correlation analysis shows a significant relationship between the emotion classes of “Joy” and “Fear”, which implies that the public is excited about the palliatives’ distribution but afraid of inequality and transparency in the distribution process due to reasons such as corruption. Conclusively, the results from this experiment clearly show that the public emotions on COVID-19 support and relief aid packages’ distribution in Nigeria were not satisfactory, considering that the negative emotions from the public outnumbered the public happiness.

Download Full-text

Ovarian cancer classification using K-Nearest Neighbor and Support Vector Machine

Journal of Physics Conference Series ◽

10.1088/1742-6596/1821/1/012007 ◽

2021 ◽

Vol 1821 (1) ◽

pp. 012007

Author(s):

V V P Wibowo ◽

Z Rustam ◽

S Hartini ◽

F Maulidina ◽

I Wirasati ◽

...

Keyword(s):

Ovarian Cancer ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Cancer Classification ◽

Support Vector ◽

K Nearest Neighbor

Download Full-text

Data Mining Approach to Analyze COVID-19 Clinical Dataset

10.53350/pjmhs211561812 ◽

2021 ◽

Vol 15 (6) ◽

pp. 1812-1819

Author(s):

Azita Yazdani ◽

Ramin Ravangard ◽

Roxana Sharifian

Keyword(s):

Artificial Intelligence ◽

Data Mining ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Clinical Signs ◽

Study Data ◽

Mining Machine ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Approach

The new coronavirus has been spreading since the beginning of 2020 and many efforts have been made to develop vaccines to help patients recover. It is now clear that the world needs a rapid solution to curb the spread of COVID-19 worldwide with non-clinical approaches such as data mining, enhanced intelligence, and other artificial intelligence techniques. These approaches can be effective in reducing the burden on the health care system to provide the best possible way to diagnose and predict the COVID-19 epidemic. In this study, data mining models for early detection of Covid-19 in patients were developed using the epidemiological dataset of patients and individuals suspected of having Covid-19 in Iran. C4.5, support vector machine, Naive Bayes, logistic regression, Random Forest, and k-nearest neighbor algorithm were used directly on the dataset using Rapid miner to develop the models. By receiving clinical signs, this model diagnosis the risk of contracting the COVID-19 virus. Examination of the models in this study has shown that the support vector machine with 93.41% accuracy is more efficient in the diagnosis of patients with COVID-19 pandemic, which is the best model among other developed models. Keywords: COVID-19, Data mining, Machine Learning, Artificial Intelligence, Classification

Download Full-text

Recognition of 3D Objects from 2D Views Features

Journal of Electronic Commerce in Organizations ◽

10.4018/jeco.2015040105 ◽

2015 ◽

Vol 13 (2) ◽

pp. 50-58

Author(s):

R. Khadim ◽

R. El Ayachi ◽

Mohamed Fakir

Keyword(s):

Neural Network ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Color Image ◽

Recognition Rate ◽

Experimental Results ◽

Support Vector ◽

K Nearest Neighbor ◽

3D Objects ◽

Color Descriptor

This paper focuses on the recognition of 3D objects using 2D attributes. In order to increase the recognition rate, the present an hybridization of three approaches to calculate the attributes of color image, this hybridization based on the combination of Zernike moments, Gist descriptors and color descriptor (statistical moments). In the classification phase, three methods are adopted: Neural Network (NN), Support Vector Machine (SVM), and k-nearest neighbor (KNN). The database COIL-100 is used in the experimental results.

Download Full-text

Aspect Term Extraction for Aspect Based Opinion Mining

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.k2050.0981119 ◽

2019 ◽

Vol 8 (11) ◽

pp. 2228-2233

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Random Fields ◽

Opinion Mining ◽

Nearest Neighbor ◽

Conditional Random Fields ◽

International Workshop ◽

Support Vector ◽

K Nearest Neighbor ◽

Term Extraction

Opinion Mining (OM) is also called as Sentiment Analysis (SA). Aspect Based Opinion Mining (ABOM) is also called as Aspect Based Sentiment Analysis (ABSA). In this paper, three new features are proposed to extract the aspect term for Aspect Based Sentiment Analysis (ABSA). The influence of the proposed features is evaluated on five classifiers namely Decision Tree (DT), Naive Bayes (NB), K-Nearest Neighbor (KNN), Support Vector Machine (SVM) and Conditional Random Fields (CRF). The proposed features are evaluated on the Two datasets on Restaurant and Laptop domains available in International Workshop on Semantic Evaluation 2014 i.e. SemEval 2014. The influence of proposed features is evaluated using Precision, Recall and F1 measures. The proposed features are highly influencing for aspect term extraction on classifiers. The performance of SVM and CRF classifiers with proposed features is more influencing for aspect term extraction compared with NB, DT and KNN classifiers.

Download Full-text

Book Genre Categorization Using Machine Learning Algorithms (K-Nearest Neighbor, Support Vector Machine and Logistic Regression) using Customized Dataset

International Journal of Computer Science and Mobile Computing ◽

10.47760/ijcsmc.2021.v10i03.002 ◽

2021 ◽

Vol 10 (3) ◽

pp. 14-25

Author(s):

Parilkumar Shiroya

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Logistic Regression ◽

Nearest Neighbor ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Support Vector ◽

K Nearest Neighbor

Download Full-text

Twitter sentiment analysis of the relocation of Indonesia's capital city

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v9i4.2352 ◽

2020 ◽

Vol 9 (4) ◽

pp. 1620-1630

Author(s):

Edi Sutoyo ◽

Ahmad Almaarif

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Point Of View ◽

Capital City ◽

Support Vector ◽

K Nearest Neighbor ◽

Evaluation Algorithm ◽

The Government ◽

The Many

Indonesia has a capital city which is one of the many big cities in the world called Jakarta. Jakarta's role in the dynamics that occur in Indonesia is very central because it functions as a political and government center, and is a business and economic center that drives the economy. Recently the discourse of the government to relocate the capital city has invited various reactions from the community. Therefore, in this study, sentiment analysis of the relocation of the capital city was carried out. The analysis was performed by doing a classification to describe the public sentiment sourced from twitter data, the data is classified into 2 classes, namely positive and negative sentiments. The algorithms used in this study include Naïve Bayes classifier, logistic regression, support vector machine, and K-nearest neighbor. The results of the performance evaluation algorithm showed that support vector machine outperformed as compared to 3 algorithms with the results of Accuracy, Precision, Recall, and F-measure are 97.72%, 96.01%, 99.18%, and 97.57%, respectively. Sentiment analysis of the discourse of relocation of the capital city is expected to provide an overview to the government of public opinion from the point of view of data coming from social media.

Download Full-text

A Hybrid Classification Approach That Combines K-Nearest Neighbor and Helps Vector Machine Will Provide Results That Are Closer to The True Value

Comparison of Support Vector Machine and K-Nearest Neighbor for Baby Foot Identification based on Image Geometric Characteristics

A hybrid text classification approach with low dependency on parameter by integrating K-nearest neighbor and support vector machine

A Hybrid Classification Approach Based on Support Vector Machine and K-Nearest Neighbor for Remote Sensing Data

Framing Twitter Public Sentiment on Nigerian Government COVID-19 Palliatives Distribution Using Machine Learning

Ovarian cancer classification using K-Nearest Neighbor and Support Vector Machine

Data Mining Approach to Analyze COVID-19 Clinical Dataset

Recognition of 3D Objects from 2D Views Features

Aspect Term Extraction for Aspect Based Opinion Mining

Book Genre Categorization Using Machine Learning Algorithms (K-Nearest Neighbor, Support Vector Machine and Logistic Regression) using Customized Dataset﻿

Twitter sentiment analysis of the relocation of Indonesia's capital city

Book Genre Categorization Using Machine Learning Algorithms (K-Nearest Neighbor, Support Vector Machine and Logistic Regression) using Customized Dataset