scholarly journals Classification of Riboswitch Families Using Block Location-Based Feature Extraction (BLBFE) Method

2019 ◽  
Vol 10 (1) ◽  
pp. 97-105
Author(s):  
Faegheh Golabi ◽  
Mousa Shamsi ◽  
Mohammad Hosein Sedaaghi ◽  
Abolfazl Barzegar ◽  
Mohammad Saeid Hejazi

Purpose: Riboswitches are special non-coding sequences usually located in mRNAs’ un-translated regions and regulate gene expression and consequently cellular function. Furthermore, their interaction with antibiotics has been recently implicated. This raises more interest in development of bioinformatics tools for riboswitch studies. Herein, we describe the development and employment of novel block location-based feature extraction (BLBFE) method for classification of riboswitches. Methods: We have already developed and reported a sequential block finding (SBF) algorithm which, without operating alignment methods, identifies family specific sequential blocks for riboswitch families. Herein, we employed this algorithm for 7 riboswitch families including lysine, cobalamin, glycine, SAM-alpha, SAM-IV, cyclic-di-GMP-I and SAH. Then the study was extended toward implementation of BLBFE method for feature extraction. The outcome features were applied in various classifiers including linear discriminant analysis (LDA), probabilistic neural network (PNN), decision tree and k-nearest neighbors (KNN) classifiers for classification of the riboswitch families. The performance of the classifiers was investigated according to performance measures such as correct classification rate (CCR), accuracy, sensitivity, specificity and f-score. Results: As a result, average CCR for classification of riboswitches was 87.87%. Furthermore, application of BLBFE method in 4 classifiers displayed average accuracies of 93.98% to 96.1%, average sensitivities of 76.76% to 83.61%, average specificities of 96.53% to 97.69% and average f-scores of 74.9% to 81.91%. Conclusion: Our results approved that the proposed method of feature extraction; i.e. BLBFE method; can be successfully used for classification and discrimination of the riboswitch families with high CCR, accuracy, sensitivity, specificity and f-score values.

Kursor ◽  
2018 ◽  
Vol 9 (1) ◽  
Author(s):  
Miftahus Sholihin

Classification aims to classify object into specific classes based on the value of the attribute associated with the object being observed. In this research designed a system that serves to classify Lamongan batik cloth based on color features using color moment, texture using Gray Level Co-occurence Matrix (GLCM), and shape using moment invariant, classification using K-Nearest Neighbors (K-NN) method. In outline the system was built consists of three main processes namely pre-processing, feature extraction, and classification. The highest accuracy rate in this study was 90.4% when the value of k = 6.


Bioimpacts ◽  
2020 ◽  
Vol 11 (2) ◽  
pp. 101-109
Author(s):  
Faegheh Golabi ◽  
Elnaz Mehdizadeh Aghdam ◽  
Mousa Shamsi ◽  
Mohammad Hossein Sedaaghi ◽  
Abolfazl Barzegar ◽  
...  

Introduction: Riboswitches are short regulatory elements generally found in the untranslated regions of prokaryotes’ mRNAs and classified into several families. Due to the binding possibility between riboswitches and antibiotics, their usage as engineered regulatory elements and also their evolutionary contribution, the need for bioinformatics tools of riboswitch detection is increasing. We have previously introduced an alignment independent algorithm for the identification of frequent sequential blocks in the families of riboswitches. Herein, we report the application of block location-based feature extraction strategy (BLBFE), which uses the locations of detected blocks on riboswitch sequences as features for classification of seed sequences. Besides, mono- and dinucleotide frequencies, k-mer, DAC, DCC, DACC, PC-PseDNC-General and SC-PseDNC-General methods as some feature extraction strategies were investigated. Methods: The classifiers of the Decision tree, KNN, LDA, and Naïve Bayes, as well as k-fold cross-validation, were employed for all methods of feature extraction to compare their performances based on the criteria of accuracy, sensitivity, specificity, and f-score performance measures. Results: The outcome of the study showed that the BLBFE strategy classified the riboswitches indicating 87.65% average correct classification rate (CCR). Moreover, the performance of the proposed feature extraction method was confirmed with average values of 94.31%, 85.01%, 95.45% and 85.38% for accuracy, sensitivity, specificity, and f-score, respectively. Conclusion: Our result approved the performance of the BLBFE strategy in the classification and discrimination of the riboswitch groups showing remarkable higher values of CCR, accuracy, sensitivity, specificity and f-score relative to previously studied feature extraction methods.


Author(s):  
Nuwan Madusanka ◽  
Heung-Kook Choi ◽  
Jae-Hong So ◽  
Boo-Kyeong Choi

Background: In this study, we investigated the fusion of texture and morphometric features as a possible diagnostic biomarker for Alzheimer’s Disease (AD). Methods: In particular, we classified subjects with Alzheimer’s disease, Mild Cognitive Impairment (MCI) and Normal Control (NC) based on texture and morphometric features. Currently, neuropsychiatric categorization provides the ground truth for AD and MCI diagnosis. This can then be supported by biological data such as the results of imaging studies. Cerebral atrophy has been shown to correlate strongly with cognitive symptoms. Hence, Magnetic Resonance (MR) images of the brain are important resources for AD diagnosis. In the proposed method, we used three different types of features identified from structural MR images: Gabor, hippocampus morphometric, and Two Dimensional (2D) and Three Dimensional (3D) Gray Level Co-occurrence Matrix (GLCM). The experimental results, obtained using a 5-fold cross-validated Support Vector Machine (SVM) with 2DGLCM and 3DGLCM multi-feature fusion approaches, indicate that we achieved 81.05% ±1.34, 86.61% ±1.25 correct classification rate with 95% Confidence Interval (CI) falls between (80.75-81.35) and (86.33-86.89) respectively, 83.33%±2.15, 84.21%±1.42 sensitivity and 80.95%±1.52, 85.00%±1.24 specificity in our classification of AD against NC subjects, thus outperforming recent works found in the literature. For the classification of MCI against AD, the SVM achieved a 76.31% ± 2.18, 78.95% ±2.26 correct classification rate, 75.00% ±1.34, 76.19%±1.84 sensitivity and 77.78% ±1.14, 82.35% ±1.34 specificity. Results and Conclusion: The results of the third experiment, with MCI against NC, also showed that the multiclass SVM provided highly accurate classification results. These findings suggest that this approach is efficient and may be a promising strategy for obtaining better AD, MCI and NC classification performance.


Author(s):  
Ahmed.T. Sahlol ◽  
Aboul Ella Hassanien

There are still many obstacles for achieving high recognition accuracy for Arabic handwritten optical character recognition system, each character has a different shape, as well as the similarities between characters. In this chapter, several feature selection-based bio-inspired optimization algorithms including Bat Algorithm, Grey Wolf Optimization, Whale optimization Algorithm, Particle Swarm Optimization and Genetic Algorithm have been presented and an application of Arabic handwritten characters recognition has been chosen to see their ability and accuracy to recognize Arabic characters. The experiments have been performed using a benchmark dataset, CENPARMI by k-Nearest neighbors, Linear Discriminant Analysis, and random forests. The achieved results show superior results for the selected features when comparing the classification accuracy for the selected features by the optimization algorithms with the whole feature set in terms of the classification accuracy and the processing time. The experiments have been performed using a benchmark dataset, CENPARMI by k-Nearest neighbors, Linear Discriminant Analysis, and random forests. The achieved results show superior results for the selected features when comparing the classification accuracy for the selected features by the optimization algorithms with the whole feature set in terms of the classification accuracy and the processing time.


Sign in / Sign up

Export Citation Format

Share Document