Fuzzy Ontology Based Document Feature Vector Modification Using Fuzzy Tree Transducer

Deep Convolution Feature Vector for Fast Face Image Retrieval

Journal of Computer-Aided Design & Computer Graphics ◽

10.3724/sp.j.1089.2018.17119 ◽

2018 ◽

Vol 30 (12) ◽

pp. 2311

Author(s):

Zhendong Li ◽

Yong Zhong ◽

Dongping Cao

Keyword(s):

Image Retrieval ◽

Feature Vector ◽

Face Image

Convolutional Neural Networks for ATC Classification

Current Pharmaceutical Design ◽

10.2174/1381612824666181112113438 ◽

2019 ◽

Vol 24 (34) ◽

pp. 4007-4012 ◽

Cited By ~ 5

Author(s):

Alessandra Lumini ◽

Loris Nanni

Keyword(s):

Classification System ◽

Feature Vector ◽

Chemical Interaction ◽

Anatomical Therapeutic Chemical ◽

World Health ◽

Therapeutic Effects ◽

True Rate ◽

Unknown Compound ◽

The Absolute ◽

Learned Features

Background: Anatomical Therapeutic Chemical (ATC) classification of unknown compounds is highly significant for both drug development and basic research. The ATC system is a multi-label classification system proposed by the World Health Organization (WHO), which categorizes drugs into classes according to their therapeutic effects and characteristics. This system comprises five levels and includes several classes in each level; the first level includes 14 main overlapping classes. Because the ATC classification system simultaneously considers anatomical distribution, therapeutic effects, and chemical characteristics, predicting the ATC classes of an unknown compound is an essential problem: such a prediction could be used to deduce not only a compound’s possible active ingredients but also its therapeutic, pharmacological, and chemical properties. Nevertheless, automatic prediction is very challenging due to the high variability of the samples and the overlap among classes, which results in multiple predictions and makes machine learning extremely difficult. Methods: In this paper, we propose a multi-label classifier system based on deep learned features to infer the ATC classification. The system is based on a 2D representation of the samples: first, a 1D feature vector is obtained by extracting information about a compound’s chemical-chemical interaction and its structural and fingerprint similarities to other compounds belonging to the different ATC classes; then the original 1D feature vector is reshaped to obtain a 2D matrix representation of the compound. Finally, a convolutional neural network (CNN) is trained and used as a feature extractor. Two general-purpose classifiers designed for multi-label classification are trained using the deep learned features, and the resulting scores are fused by the average rule.
Results: Experimental evaluation based on rigorous cross-validation demonstrates the superior prediction quality of this method compared to other state-of-the-art approaches developed for this problem. Conclusion: Extensive experiments demonstrate that the new CNN-based predictor outperforms other existing predictors in the literature on almost all of the five metrics used to examine the performance of multi-label systems, particularly the “absolute true” rate and the “absolute false” rate, the two most significant indexes. Matlab code will be available at https://github.com/LorisNanni.
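The two mechanical steps named in the Methods, reshaping a 1D feature vector into a 2D matrix and fusing two classifiers' scores by the average rule, can be illustrated with a minimal Python sketch. This is not the authors' Matlab code: the function names, the zero-padding of a vector that does not fill the matrix, and the 0.5 decision threshold are all assumptions made for illustration, and the CNN feature extractor itself is omitted.

```python
from typing import List, Sequence


def reshape_to_matrix(vec: Sequence[float], n_rows: int, n_cols: int,
                      pad: float = 0.0) -> List[List[float]]:
    """Reshape a 1D feature vector into an n_rows x n_cols matrix (row-major).

    Padding short vectors with `pad` is an assumption; the abstract does not
    say how the reshaping is performed.
    """
    size = n_rows * n_cols
    if len(vec) > size:
        raise ValueError("vector does not fit into the requested matrix")
    padded = list(vec) + [pad] * (size - len(vec))
    return [padded[r * n_cols:(r + 1) * n_cols] for r in range(n_rows)]


def fuse_by_average(scores_a: Sequence[float],
                    scores_b: Sequence[float]) -> List[float]:
    """Average rule: per-class mean of the scores of two classifiers."""
    if len(scores_a) != len(scores_b):
        raise ValueError("score vectors must have the same length")
    return [(a + b) / 2.0 for a, b in zip(scores_a, scores_b)]


def predict_labels(scores: Sequence[float], threshold: float = 0.5) -> List[int]:
    """Multi-label decision: every class whose fused score reaches the threshold.

    The threshold value is a hypothetical choice, not taken from the paper.
    """
    return [i for i, s in enumerate(scores) if s >= threshold]


matrix = reshape_to_matrix([1.0, 2.0, 3.0, 4.0, 5.0], n_rows=2, n_cols=3)
fused = fuse_by_average([0.75, 0.25, 0.5], [0.25, 0.25, 1.0])
labels = predict_labels(fused)
```

In this toy run the five-element vector becomes a 2 x 3 matrix with one padded cell, and the fused scores select classes 0 and 2, which shows how a single compound can receive several labels at once.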

Identifying Alzheimer’s Disease-related miRNA Based on Semi-clustering

Current Gene Therapy ◽

10.2174/1566523219666190924113737 ◽

2019 ◽

Vol 19 (4) ◽

pp. 216-223 ◽

Cited By ~ 2

Author(s):

Tianyi Zhao ◽

Donghua Wang ◽

Yang Hu ◽

Ningyi Zhang ◽

Tianyi Zang ◽

...

Keyword(s):

Alzheimer's Disease ◽

Drug Targets ◽

Molecular Mechanisms ◽

Feature Vector ◽

Mirna Gene ◽

Interaction Network ◽

Gene Interaction ◽

Protein-Protein Interaction ◽

Synaptic Structures

Background: More and more scholars are trying to use miRNA as a specific biomarker for Alzheimer’s Disease (AD) and mild cognitive impairment (MCI). Multiple studies have indicated that miRNAs are associated with poor axonal growth and loss of synaptic structures, both of which are early events in AD. The overall loss of miRNA may be associated with aging, increasing the incidence of AD, and may also be involved in the disease through some specific molecular mechanisms. Objective: Identifying Alzheimer’s disease-related miRNAs can help us find new drug targets and support early diagnosis. Materials and Methods: We used genes as a bridge to connect AD and miRNAs. First, a protein-protein interaction network is used to find more AD-related genes from known AD-related genes. Then, each miRNA’s correlation with these genes is obtained from miRNA-gene interactions. Finally, each miRNA gets a feature vector representing its correlation with AD. Unlike other studies, we do not randomly generate negative samples and then apply a classification method to identify AD-related miRNAs. Instead, we use a semi-clustering method, the one-class SVM. AD-related miRNAs are considered as outliers, and our aim is to identify the miRNAs that are similar to known AD-related miRNAs (outliers). Results and Conclusion: We identified 257 novel AD-related miRNAs and compared our method with an SVM trained on generated negative samples. The AUC of our method is much higher than that of the SVM, and we carried out case studies to show that our results are reliable.
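The "genes as a bridge" construction can be sketched in a few lines of Python. This is a toy illustration only: the gene and miRNA identifiers are hypothetical placeholders, expanding the seed set by direct network neighbours is an assumption, and the binary target indicator stands in for whatever correlation score the authors actually compute. The one-class SVM step is not shown.

```python
from typing import Dict, Iterable, List, Set, Tuple


def expand_genes(seed_genes: Set[str],
                 ppi_edges: Iterable[Tuple[str, str]]) -> Set[str]:
    """Add the direct PPI neighbours of the seed genes (assumed expansion rule)."""
    expanded = set(seed_genes)
    for a, b in ppi_edges:
        if a in seed_genes:
            expanded.add(b)
        if b in seed_genes:
            expanded.add(a)
    return expanded


def mirna_feature_vector(mirna: str, gene_order: List[str],
                         mirna_targets: Dict[str, Set[str]]) -> List[int]:
    """One entry per disease-related gene: 1 if the miRNA targets it, else 0."""
    targets = mirna_targets.get(mirna, set())
    return [1 if gene in targets else 0 for gene in gene_order]


# Hypothetical toy data; none of these identifiers come from the paper.
seeds = {"geneA", "geneB"}
edges = [("geneA", "geneC"), ("geneD", "geneB"), ("geneE", "geneF")]
targets = {"mir-x": {"geneC", "geneE"}}

related = expand_genes(seeds, edges)
gene_order = sorted(related)
vector = mirna_feature_vector("mir-x", gene_order, targets)
```

Vectors built this way for the known AD-related miRNAs would then be the only training input to a one-class model, which is what removes the need for randomly generated negative samples.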

An Integrated Prediction Method for Identifying Protein-Protein Interactions

Current Proteomics ◽

10.2174/1570164616666190306152318 ◽

2020 ◽

Vol 17 (4) ◽

pp. 271-286

Author(s):

Chang Xu ◽

Limin Jiang ◽

Zehua Zhang ◽

Xuyao Yu ◽

Renhai Chen ◽

...

Keyword(s):

Protein Interactions ◽

Feature Vector ◽

Prediction Method ◽

Feature Representation ◽

Protein Interaction Networks ◽

Learning Approach ◽

Biological Processes ◽

Integrated Learning ◽

Protein Protein Interactions ◽

Training Process

Background: Protein-Protein Interactions (PPIs) play a key role in various biological processes. Many methods have been developed to predict protein-protein interactions and protein interaction networks. However, many existing applications are limited because they rely on a large number of homologous proteins and interaction marks. Methods: In this paper, we propose a novel integrated learning approach (RF-Ada-DF) with a sequence-based feature representation for identifying protein-protein interactions. Our method first constructs a sequence-based feature vector to represent each pair of proteins, via Multivariate Mutual Information (MMI) and Normalized Moreau-Broto Autocorrelation (NMBAC). Then, we feed the 638-dimensional features into an integrated learning model that distinguishes interacting pairs from non-interacting pairs. This integrated model embeds Random Forest in the AdaBoost framework and turns weak classifiers into a single strong classifier. We also employ double fault detection in order to suppress over-adaptation during the training process. Results: To evaluate the performance of our method, we conduct several comprehensive tests for PPI prediction. On the H. pylori dataset, our method achieves 88.16% accuracy and 87.68% sensitivity, increasing accuracy by 0.57%. On the S. cerevisiae dataset, our method achieves 95.77% accuracy and 93.36% sensitivity, increasing accuracy by 0.76%. On the Human dataset, our method achieves 98.16% accuracy and 96.80% sensitivity, increasing accuracy by 0.6%. Experiments show that our method achieves better results than other outstanding methods for sequence-based PPI prediction. The datasets and code are available at https://github.com/guofei-tju/RF-Ada-DF.git.
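As a rough illustration of the NMBAC part of the feature vector, the sketch below computes the commonly used form of the normalized Moreau-Broto autocorrelation for one property scale: the scale is standardized to zero mean and unit standard deviation, and for each lag the products of property values at positions i and i + lag are averaged. The abstract does not give the exact variant, lag range, or property scales used, so the function names and the two-letter toy alphabet here are assumptions; a real run would use the 20 amino acids with published physicochemical scales.

```python
from statistics import mean, pstdev
from typing import Dict, List


def normalize_scale(scale: Dict[str, float]) -> Dict[str, float]:
    """Standardize a residue property scale to zero mean and unit std."""
    values = list(scale.values())
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        raise ValueError("property scale has zero variance")
    return {residue: (v - mu) / sigma for residue, v in scale.items()}


def moreau_broto_autocorrelation(sequence: str, scale: Dict[str, float],
                                 max_lag: int) -> List[float]:
    """AC(lag) = 1/(n - lag) * sum_i P(i) * P(i + lag), for lag = 1..max_lag."""
    norm = normalize_scale(scale)
    props = [norm[residue] for residue in sequence]
    n = len(props)
    if max_lag >= n:
        raise ValueError("max_lag must be smaller than the sequence length")
    return [
        sum(props[i] * props[i + lag] for i in range(n - lag)) / (n - lag)
        for lag in range(1, max_lag + 1)
    ]


# Toy alphabet and scale (hypothetical values, chosen for easy checking).
toy_scale = {"A": 1.0, "B": -1.0}
features = moreau_broto_autocorrelation("ABAB", toy_scale, max_lag=2)
```

For the alternating toy sequence the lag-1 value is -1.0 and the lag-2 value is 1.0, reflecting that neighbouring residues have opposite property values while residues two positions apart match. Repeating this over several property scales and concatenating the results gives one block of a sequence-based feature vector.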

Suitability of Sequence-Based Feature Vector for Classification Algorithm Improves Accuracy of Human Protein-Protein Interaction Prediction: A Red Blood Cell Case Study

Current Bioinformatics ◽

10.2174/1574893610666151026215233 ◽

2016 ◽

Vol 11 (2) ◽

pp. 291-300 ◽

Cited By ~ 2

Author(s):

Afsaneh Maali ◽

Mahmood A. Mahdavi ◽

Reza Gheshlaghi

Keyword(s):

Blood Cell ◽

Protein Interaction ◽

Red Blood Cell ◽

Feature Vector ◽

Classification Algorithm ◽

Human Protein ◽

Interaction Prediction ◽

Protein Protein Interaction ◽

Protein Interaction Prediction

Analysis of reduced‐set construction using image reconstruction from a HOG feature vector

IET Computer Vision ◽

10.1049/iet-cvi.2016.0317 ◽

2017 ◽

Vol 11 (8) ◽

pp. 725-732 ◽

Cited By ~ 2

Author(s):

Ho Gi Jung

Keyword(s):

Image Reconstruction ◽

Feature Vector

A Novel Unsupervised Classification Method for Sandy Land Using Fully Polarimetric SAR Data

Remote Sensing ◽

10.3390/rs13030355 ◽

2021 ◽

Vol 13 (3) ◽

pp. 355

Author(s):

Weixian Tan ◽

Borong Sun ◽

Chenyu Xiao ◽

Pingping Huang ◽

Wei Xu ◽

...

Keyword(s):

Spectral Clustering ◽

Large Scale ◽

Clustering Algorithm ◽

Feature Vector ◽

Unsupervised Classification ◽

Classification Method ◽

Sandy Land ◽

Classification Methods ◽

The Many ◽

Representative Points

Classification based on polarimetric synthetic aperture radar (PolSAR) images is an emerging technology, and recent years have seen the introduction of various classification methods that have proven effective at identifying typical features of many terrain types. Among the many study regions, the Hunshandake Sandy Land in Inner Mongolia, China stands out for its vast area of sandy land, variety of ground objects, and intricate structure, with more irregular characteristics than conventional land cover. Accounting for the particular surface features of the Hunshandake Sandy Land, an unsupervised classification method based on new decomposition and large-scale spectral clustering with superpixels (ND-LSC) is proposed in this study. First, the polarization scattering parameters are extracted through a new decomposition, rather than other decomposition approaches, which gives a more accurate feature vector estimate. Second, large-scale spectral clustering is applied to handle the vast area and complex terrain. More specifically, this involves an initial sub-step of superpixel generation via the Adaptive Simple Linear Iterative Clustering (ASLIC) algorithm, in which the feature vector combined with spatial coordinate information is used as input, then a sub-step of representative point selection and bipartite graph formation, followed by the spectral clustering algorithm to complete the classification task. Finally, testing and analysis are conducted on the RADARSAT-2 fully PolSAR dataset acquired over the Hunshandake Sandy Land in 2016. Both qualitative and quantitative experiments comparing several classification methods show that the proposed method can significantly improve classification performance.
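How a feature vector and spatial coordinates can be combined in superpixel generation is sketched below using the distance measure of standard SLIC, where the spatial term is scaled by the grid step and a compactness weight. This is an assumption made for illustration: the paper uses the adaptive ASLIC variant, whose normalization differs, and the abstract does not state the exact formula. The function names and the toy values are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Center = Tuple[Sequence[float], Sequence[float]]  # (feature vector, (x, y))


def slic_distance(feat_p: Sequence[float], xy_p: Sequence[float],
                  feat_c: Sequence[float], xy_c: Sequence[float],
                  grid_step: float, compactness: float) -> float:
    """Standard SLIC distance: D = sqrt(d_feat^2 + (d_xy / S)^2 * m^2)."""
    d_feat = math.dist(feat_p, feat_c)   # distance in feature space
    d_xy = math.dist(xy_p, xy_c)         # distance in image coordinates
    return math.sqrt(d_feat ** 2 + (d_xy / grid_step) ** 2 * compactness ** 2)


def nearest_center(feat: Sequence[float], xy: Sequence[float],
                   centers: List[Center], grid_step: float,
                   compactness: float) -> int:
    """Index of the superpixel center closest to a pixel under slic_distance."""
    return min(
        range(len(centers)),
        key=lambda k: slic_distance(feat, xy, centers[k][0], centers[k][1],
                                    grid_step, compactness),
    )


# Toy centers with 2D feature vectors (hypothetical values).
centers = [((0.0, 0.0), (0, 0)), ((1.0, 1.0), (10, 10))]
assigned = nearest_center((0.9, 1.1), (9, 9), centers,
                          grid_step=10, compactness=1)
```

A larger compactness value gives the spatial term more weight and produces more regular superpixels, while a smaller value lets the polarimetric feature vector dominate the assignment.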

Confronting Sparseness and High Dimensionality in Short Text Clustering via Feature Vector Projections

2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI) ◽

10.1109/ictai50040.2020.00129 ◽

2020 ◽

Author(s):

Leonidas Akritidis ◽

Miltiadis Alamaniotis ◽

Athanasios Fevgas ◽

Panayiotis Bozanis

Keyword(s):

Feature Vector ◽

Text Clustering ◽

High Dimensionality ◽

Short Text ◽

Short Text Clustering

Compressed sensing image restoration algorithm based on improved SURF operator

Open Physics ◽

10.1515/phys-2018-0124 ◽

2018 ◽

Vol 16 (1) ◽

pp. 1033-1045

Author(s):

Guodong Zhou ◽

Huailiang Zhang ◽

Raquel Martínez Lucas

Keyword(s):

Compressed Sensing ◽

Image Restoration ◽

Feature Vector ◽

Optimization Method ◽

Image Feature ◽

Diffusion Method ◽

Local Similarity ◽

Weighting Method ◽

Data Smoothing ◽

Restoration Algorithm

Given the excellent descriptive ability of the SURF operator for local image features and its weakness in global feature description, a compressed sensing image restoration algorithm based on an improved SURF operator is proposed. The SURF feature vector set of the image is extracted, the vector set data is reduced to a single high-dimensional feature vector by using a histogram algorithm, and then the image HSV color histogram is extracted. The MSA image decomposition algorithm is used to obtain a sparse representation of the image feature vectors. The total variation curvature diffusion method and the Bayesian weighting method perform image restoration for the data smoothing feature and the local similarity feature of the texture part, respectively. A compressed sensing image restoration model is obtained by using the Schatten-p norm, and image color supplement is performed on the model. The compressed sensing image is solved iteratively by an alternating optimization method, and the compressed sensing image is restored. The experimental results show that the proposed algorithm has good restoration performance, and the restored image has finer edge and texture structure and a better visual effect.

Anomaly Detection Using Autoencoder with Feature Vector Frequency Map

IEEE Access ◽

10.1109/access.2021.3080330 ◽

2021 ◽

pp. 1-1

Author(s):

Young-Gyu Kim ◽

Tae-Hyoung Park

Keyword(s):

Anomaly Detection ◽

Feature Vector ◽

Frequency Map
