Classification of Riboswitch Families Using Block Location-Based Feature Extraction (BLBFE) Method

Faegheh Golabi; Mousa Shamsi; Mohammad Hosein Sedaaghi; Abolfazl Barzegar; Mohammad Saeid Hejazi

doi:10.15171/apb.2020.012

Classification of Riboswitch Families Using Block Location-Based Feature Extraction (BLBFE) Method

Advanced Pharmaceutical Bulletin ◽

10.15171/apb.2020.012 ◽

2019 ◽

Vol 10 (1) ◽

pp. 97-105

Author(s):

Faegheh Golabi ◽

Mousa Shamsi ◽

Mohammad Hosein Sedaaghi ◽

Abolfazl Barzegar ◽

Mohammad Saeid Hejazi

Keyword(s):

Feature Extraction ◽

Probabilistic Neural Network ◽

Nearest Neighbors ◽

Correct Classification Rate ◽

K Nearest Neighbors ◽

Classification Rate ◽

Regulate Gene Expression ◽

Linear Discriminant ◽

Sensitivity Specificity

Purpose: Riboswitches are special non-coding sequences usually located in mRNAs’ un-translated regions and regulate gene expression and consequently cellular function. Furthermore, their interaction with antibiotics has been recently implicated. This raises more interest in development of bioinformatics tools for riboswitch studies. Herein, we describe the development and employment of novel block location-based feature extraction (BLBFE) method for classification of riboswitches. Methods: We have already developed and reported a sequential block finding (SBF) algorithm which, without operating alignment methods, identifies family specific sequential blocks for riboswitch families. Herein, we employed this algorithm for 7 riboswitch families including lysine, cobalamin, glycine, SAM-alpha, SAM-IV, cyclic-di-GMP-I and SAH. Then the study was extended toward implementation of BLBFE method for feature extraction. The outcome features were applied in various classifiers including linear discriminant analysis (LDA), probabilistic neural network (PNN), decision tree and k-nearest neighbors (KNN) classifiers for classification of the riboswitch families. The performance of the classifiers was investigated according to performance measures such as correct classification rate (CCR), accuracy, sensitivity, specificity and f-score. Results: As a result, average CCR for classification of riboswitches was 87.87%. Furthermore, application of BLBFE method in 4 classifiers displayed average accuracies of 93.98% to 96.1%, average sensitivities of 76.76% to 83.61%, average specificities of 96.53% to 97.69% and average f-scores of 74.9% to 81.91%. Conclusion: Our results approved that the proposed method of feature extraction; i.e. BLBFE method; can be successfully used for classification and discrimination of the riboswitch families with high CCR, accuracy, sensitivity, specificity and f-score values.

Download Full-text

CLASSIFICATION OF BATIK LAMONGAN BASED ON FEATURES OF COLOR, TEXTURE AND SHAPE

Kursor ◽

10.28961/kursor.v9i1.114 ◽

2018 ◽

Vol 9 (1) ◽

Author(s):

Miftahus Sholihin

Keyword(s):

Feature Extraction ◽

Nearest Neighbors ◽

Gray Level ◽

Accuracy Rate ◽

K Nearest Neighbors ◽

Moment Invariant ◽

Color Features ◽

Color Moment ◽

Color Texture

Classification aims to classify object into specific classes based on the value of the attribute associated with the object being observed. In this research designed a system that serves to classify Lamongan batik cloth based on color features using color moment, texture using Gray Level Co-occurence Matrix (GLCM), and shape using moment invariant, classification using K-Nearest Neighbors (K-NN) method. In outline the system was built consists of three main processes namely pre-processing, feature extraction, and classification. The highest accuracy rate in this study was 90.4% when the value of k = 6.

Download Full-text

Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method

Bioimpacts ◽

10.34172/bi.2021.17 ◽

2020 ◽

Vol 11 (2) ◽

pp. 101-109

Author(s):

Faegheh Golabi ◽

Elnaz Mehdizadeh Aghdam ◽

Mousa Shamsi ◽

Mohammad Hossein Sedaaghi ◽

Abolfazl Barzegar ◽

...

Keyword(s):

Feature Extraction ◽

Cross Validation ◽

Regulatory Elements ◽

Extraction Methods ◽

Untranslated Regions ◽

Classification Rate ◽

Feature Extraction Method ◽

Sensitivity Specificity ◽

Fold Cross Validation

Introduction: Riboswitches are short regulatory elements generally found in the untranslated regions of prokaryotes’ mRNAs and classified into several families. Due to the binding possibility between riboswitches and antibiotics, their usage as engineered regulatory elements and also their evolutionary contribution, the need for bioinformatics tools of riboswitch detection is increasing. We have previously introduced an alignment independent algorithm for the identification of frequent sequential blocks in the families of riboswitches. Herein, we report the application of block location-based feature extraction strategy (BLBFE), which uses the locations of detected blocks on riboswitch sequences as features for classification of seed sequences. Besides, mono- and dinucleotide frequencies, k-mer, DAC, DCC, DACC, PC-PseDNC-General and SC-PseDNC-General methods as some feature extraction strategies were investigated. Methods: The classifiers of the Decision tree, KNN, LDA, and Naïve Bayes, as well as k-fold cross-validation, were employed for all methods of feature extraction to compare their performances based on the criteria of accuracy, sensitivity, specificity, and f-score performance measures. Results: The outcome of the study showed that the BLBFE strategy classified the riboswitches indicating 87.65% average correct classification rate (CCR). Moreover, the performance of the proposed feature extraction method was confirmed with average values of 94.31%, 85.01%, 95.45% and 85.38% for accuracy, sensitivity, specificity, and f-score, respectively. Conclusion: Our result approved the performance of the BLBFE strategy in the classification and discrimination of the riboswitch groups showing remarkable higher values of CCR, accuracy, sensitivity, specificity and f-score relative to previously studied feature extraction methods.

Download Full-text

Food Detection Using Histogram of Oriented Gradient (HOG) as Feature Extraction and K-Nearest Neighbors (K-NN) as Classifier

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2020/3191.52020 ◽

2020 ◽

Vol 9 (1.5) ◽

pp. 219-225

Author(s):

Diah Rahmadani

Keyword(s):

Feature Extraction ◽

Nearest Neighbors ◽

K Nearest Neighbors ◽

Histogram Of Oriented Gradient ◽

Food Detection

Download Full-text

Classification of soil quality using K-Nearest Neighbors methods

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/739/1/012011 ◽

2021 ◽

Vol 739 (1) ◽

pp. 012011

Author(s):

I D Ratih ◽

S M Retnaningsih ◽

V M Dewi

Keyword(s):

Soil Quality ◽

Nearest Neighbors ◽

K Nearest Neighbors

Download Full-text

Classification of EEG Features Extracted from Classroom Experiment using Weighted K-Nearest Neighbors

2020 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE) ◽

10.1109/iccceee49695.2021.9429624 ◽

2021 ◽

Author(s):

Areej Babiker ◽

Eltaf Abdalsalam

Keyword(s):

Nearest Neighbors ◽

K Nearest Neighbors ◽

Classroom Experiment ◽

Eeg Features

Download Full-text

Adaptive Global k-Nearest Neighbors for Hierarchical Classification of Data Streams*

10.1109/smc52423.2021.9658648 ◽

2021 ◽

Author(s):

Eduardo Tieppo ◽

Jean Paul Barddal ◽

Julio Cesar Nievola

Keyword(s):

Data Streams ◽

Hierarchical Classification ◽

Nearest Neighbors ◽

K Nearest Neighbors

Download Full-text

Alzheimer's Disease Classification Based on Multi-feature Fusion

Current Medical Imaging Formerly Current Medical Imaging Reviews ◽

10.2174/1573405614666181012102626 ◽

2019 ◽

Vol 15 (2) ◽

pp. 161-169 ◽

Cited By ~ 3

Author(s):

Nuwan Madusanka ◽

Heung-Kook Choi ◽

Jae-Hong So ◽

Boo-Kyeong Choi

Keyword(s):

Alzheimer’S Disease ◽

Alzheimer's Disease ◽

Feature Fusion ◽

Biological Data ◽

Correct Classification ◽

Mr Images ◽

Correct Classification Rate ◽

Classification Rate ◽

Morphometric Features

Background: In this study, we investigated the fusion of texture and morphometric features as a possible diagnostic biomarker for Alzheimer’s Disease (AD). Methods: In particular, we classified subjects with Alzheimer’s disease, Mild Cognitive Impairment (MCI) and Normal Control (NC) based on texture and morphometric features. Currently, neuropsychiatric categorization provides the ground truth for AD and MCI diagnosis. This can then be supported by biological data such as the results of imaging studies. Cerebral atrophy has been shown to correlate strongly with cognitive symptoms. Hence, Magnetic Resonance (MR) images of the brain are important resources for AD diagnosis. In the proposed method, we used three different types of features identified from structural MR images: Gabor, hippocampus morphometric, and Two Dimensional (2D) and Three Dimensional (3D) Gray Level Co-occurrence Matrix (GLCM). The experimental results, obtained using a 5-fold cross-validated Support Vector Machine (SVM) with 2DGLCM and 3DGLCM multi-feature fusion approaches, indicate that we achieved 81.05% ±1.34, 86.61% ±1.25 correct classification rate with 95% Confidence Interval (CI) falls between (80.75-81.35) and (86.33-86.89) respectively, 83.33%±2.15, 84.21%±1.42 sensitivity and 80.95%±1.52, 85.00%±1.24 specificity in our classification of AD against NC subjects, thus outperforming recent works found in the literature. For the classification of MCI against AD, the SVM achieved a 76.31% ± 2.18, 78.95% ±2.26 correct classification rate, 75.00% ±1.34, 76.19%±1.84 sensitivity and 77.78% ±1.14, 82.35% ±1.34 specificity. Results and Conclusion: The results of the third experiment, with MCI against NC, also showed that the multiclass SVM provided highly accurate classification results. These findings suggest that this approach is efficient and may be a promising strategy for obtaining better AD, MCI and NC classification performance.

Download Full-text

Identification and classification of brain tumor MRI images with feature extraction using DWT and probabilistic neural network

Brain Informatics ◽

10.1007/s40708-017-0075-5 ◽

2018 ◽

Vol 5 (1) ◽

pp. 23-30 ◽

Cited By ~ 69

Author(s):

N. Varuna Shree ◽

T. N. R. Kumar

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Brain Tumor ◽

Probabilistic Neural Network ◽

Tumor Mri

Download Full-text

Bio-Inspired Optimization Algorithms for Arabic Handwritten Characters

Handbook of Research on Machine Learning Innovations and Trends - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-5225-2229-4.ch039 ◽

2017 ◽

pp. 897-914 ◽

Cited By ~ 3

Author(s):

Ahmed.T. Sahlol ◽

Aboul Ella Hassanien

Keyword(s):

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Random Forests ◽

Classification Accuracy ◽

Processing Time ◽

Optimization Algorithms ◽

Nearest Neighbors ◽

Benchmark Dataset ◽

K Nearest Neighbors ◽

Linear Discriminant

There are still many obstacles for achieving high recognition accuracy for Arabic handwritten optical character recognition system, each character has a different shape, as well as the similarities between characters. In this chapter, several feature selection-based bio-inspired optimization algorithms including Bat Algorithm, Grey Wolf Optimization, Whale optimization Algorithm, Particle Swarm Optimization and Genetic Algorithm have been presented and an application of Arabic handwritten characters recognition has been chosen to see their ability and accuracy to recognize Arabic characters. The experiments have been performed using a benchmark dataset, CENPARMI by k-Nearest neighbors, Linear Discriminant Analysis, and random forests. The achieved results show superior results for the selected features when comparing the classification accuracy for the selected features by the optimization algorithms with the whole feature set in terms of the classification accuracy and the processing time. The experiments have been performed using a benchmark dataset, CENPARMI by k-Nearest neighbors, Linear Discriminant Analysis, and random forests. The achieved results show superior results for the selected features when comparing the classification accuracy for the selected features by the optimization algorithms with the whole feature set in terms of the classification accuracy and the processing time.

Download Full-text

Classification of Children with Attention Deficit Hyperactivity Disorder Using PCA and K-Nearest Neighbors During Interference Control Task

Advances in Cognitive Neurodynamics (V) - Advances in Cognitive Neurodynamics ◽

10.1007/978-981-10-0207-6_61 ◽

2016 ◽

pp. 447-453 ◽

Cited By ~ 2

Author(s):

Jiaojiao Yang ◽

Wenjie Li ◽

Suhong Wang ◽

Jieru Lu ◽

Ling Zou

Keyword(s):

Attention Deficit Hyperactivity Disorder ◽

Attention Deficit ◽

Nearest Neighbors ◽

Interference Control ◽

Control Task ◽

K Nearest Neighbors ◽

Hyperactivity Disorder

Download Full-text