A semi-supervised rough set and random forest approach for pattern classification of gene expression data

Pradeep Kumar Mallick; Debahuti Mishra; Srikanta Patnaik; Kailash Shaw

doi:10.1504/ijris.2016.082976

A semi-supervised rough set and random forest approach for pattern classification of gene expression data

International Journal of Reasoning-based Intelligent Systems ◽

10.1504/ijris.2016.082976 ◽

2016 ◽

Vol 8 (3/4) ◽

pp. 155 ◽

Cited By ~ 1

Author(s):

Pradeep Kumar Mallick ◽

Debahuti Mishra ◽

Srikanta Patnaik ◽

Kailash Shaw

Keyword(s):

Gene Expression ◽

Random Forest ◽

Pattern Classification ◽

Gene Expression Data ◽

Rough Set ◽

Expression Data


A semi-supervised rough set and random forest approach for pattern classification of gene expression data

International Journal of Reasoning-based Intelligent Systems ◽

10.1504/ijris.2016.10003972 ◽

2016 ◽

Vol 8 (3/4) ◽

pp. 155

Author(s):

Kailash Shaw ◽

Debahuti Mishra ◽

Srikanta Patnaik ◽

Pradeep Kumar Mallick

Keyword(s):

Gene Expression ◽

Random Forest ◽

Pattern Classification ◽

Gene Expression Data ◽

Rough Set ◽

Expression Data


A Comparative Performance Evaluation of Random Forest Feature Selection on Classification of Hepatocellular Carcinoma Gene Expression Data

2019 3rd International Conference on Informatics and Computational Sciences (ICICoS) ◽

10.1109/icicos48119.2019.8982435 ◽

2019 ◽

Cited By ~ 1

Author(s):

Moh Abdul Latief ◽

Titin Siswantining ◽

Alhadi Bustamam ◽

Devvi Sarwinda

Keyword(s):

Gene Expression ◽

Hepatocellular Carcinoma ◽

Feature Selection ◽

Performance Evaluation ◽

Random Forest ◽

Gene Expression Data ◽

Expression Data ◽

Comparative Performance


Rough Set based Attribute Clustering for Sample Classification of Gene Expression Data

Procedia Engineering ◽

10.1016/j.proeng.2012.06.219 ◽

2012 ◽

Vol 38 ◽

pp. 1788-1792 ◽

Cited By ~ 6

Author(s):

Rudra Kalyan Nayak ◽

Debahuti Mishra ◽

Kailash Shaw ◽

Sashikala Mishra

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Rough Set ◽

Expression Data ◽

Sample Classification ◽

Attribute Clustering


A class imbalance-aware Relief algorithm for the classification of tumors using microarray gene expression data

Computational Biology and Chemistry ◽

10.1016/j.compbiolchem.2019.03.017 ◽

2019 ◽

Vol 80 ◽

pp. 121-127 ◽

Cited By ~ 3

Author(s):

Yuanyu He ◽

Junhai Zhou ◽

Yaping Lin ◽

Tuanfei Zhu

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Class Imbalance ◽

Microarray Gene Expression Data ◽

Expression Data ◽

Microarray Gene Expression ◽

Relief Algorithm ◽

Classification Of Tumors ◽

Microarray Gene


Improving the Performance of Principal Components for Classification of Gene Expression Data Through Feature Selection

Studies in Classification, Data Analysis, and Knowledge Organization - Data Science and Classification ◽

10.1007/3-540-34416-0_35 ◽

2006 ◽

pp. 325-332

Author(s):

Edgar Acuña ◽

Jaime Porras

Keyword(s):

Gene Expression ◽

Feature Selection ◽

Gene Expression Data ◽

Principal Components ◽

Expression Data


Classification of micro-array gene expression data using neural networks

The 2010 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn.2010.5596568 ◽

2010 ◽

Author(s):

David Tian ◽

Keith Burley

Keyword(s):

Gene Expression ◽

Neural Networks ◽

Gene Expression Data ◽

Expression Data ◽

Micro Array


Classification of Microarray Gene Expression Data by MultiBlock Dimension Reduction

Communications for Statistical Applications and Methods ◽

10.5351/ckss.2006.13.3.567 ◽

2006 ◽

Vol 13 (3) ◽

pp. 567-576

Author(s):

Mi-Ra Oh ◽

Seo-Young Kim ◽

Kyung-Sook Kim ◽

Jang-Sun Baek ◽

Young-Sook Son

Keyword(s):

Gene Expression ◽

Dimension Reduction ◽

Gene Expression Data ◽

Microarray Gene Expression Data ◽

Expression Data ◽

Microarray Gene Expression ◽

Microarray Gene


Inference of Genetic Networks From Time-Series and Static Gene Expression Data: Combining a Random-Forest-Based Inference Method With Feature Selection Methods

Frontiers in Genetics ◽

10.3389/fgene.2020.595912 ◽

2020 ◽

Vol 11 ◽

Author(s):

Shuhei Kimura ◽

Ryo Fukutomi ◽

Masato Tokuhisa ◽

Mariko Okada

Keyword(s):

Gene Expression ◽

Feature Selection ◽

Random Forest ◽

Gene Expression Data ◽

Computational Cost ◽

Expression Data ◽

Selection Methods ◽

Inference Method ◽

Combined Application ◽

Inference Methods

Several researchers have focused on random-forest-based inference methods because of their excellent performance. Some of these methods can also analyze both time-series and static gene expression data. However, they only rank the candidate regulations by assigning them confidence values; none can detect the regulations that actually affect a gene of interest. In this study, we propose a method that removes unpromising candidate regulations by combining the random-forest-based inference method with a series of feature selection methods. In addition to detecting unpromising regulations, the proposed method uses the outputs of the feature selection methods to adjust the confidence values that the random-forest-based inference method computes for all candidate regulations. Numerical experiments showed that combining the feature selection methods with the random-forest-based inference method improved its performance in 99 of the 100 trials performed on the artificial problems. However, the improvement tends to be small, since the combined method removed at most 19% of the candidate regulations. The combination also increases the computational cost. While a bigger improvement at a lower computational cost would be ideal, we see no impediments to our investigation, given that our aim is to extract as much useful information as possible from a limited amount of gene expression data.


Incorporating Feature Ranking and Evolutionary Methods for the Classification of High-Dimensional DNA Microarray Gene Expression Data

Australasian Medical Journal ◽

10.21767/amj.2013.1641 ◽

2013 ◽

Vol 06 (05) ◽

Author(s):

Mani Abedini ◽

Michael Kirley ◽

Raymond Chiong

Keyword(s):

Gene Expression ◽

DNA Microarray ◽

Gene Expression Data ◽

Microarray Gene Expression Data ◽

High Dimensional ◽

Feature Ranking ◽

Expression Data ◽

Microarray Gene Expression ◽

Microarray Gene


Combining Generalized NMF and Discriminative Mixture Models for Classification of Gene Expression Data

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001408006892 ◽

2008 ◽

Vol 22 (08) ◽

pp. 1587-1598 ◽

Cited By ~ 3

Author(s):

Weixiang Liu ◽

Kehong Yuan ◽

Jian Wu ◽

Datian Ye ◽

Zhen Ji ◽

...

Keyword(s):

Gene Expression ◽

Mixture Model ◽

Gene Expression Data ◽

Small Sample Size ◽

Data Classification ◽

Small Sample ◽

Training Data ◽

Microarray Data Analysis ◽

Expression Data

Classification of gene expression samples is a core task in microarray data analysis. How to reduce thousands of genes and how to select a suitable classifier are two key issues in gene expression data classification. This paper introduces a framework that combines feature extraction and classification. Considering the non-negativity, high dimensionality and small sample size of the data, we apply a discriminative mixture model designed for non-negative gene expression data classification, using non-negative matrix factorization (NMF) for dimension reduction. To enhance the sparseness of the training data for fast learning of the mixture model, a generalized NMF is also adopted. Experimental results on several real gene expression datasets show that the generalized method significantly improves classification accuracy, stability and decision quality, and that the proposed method performs better than some previously reported results on the same datasets.
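The dimension-reduction step mentioned in this abstract can be illustrated with a minimal sketch of plain NMF. This is not the paper's generalized NMF or its discriminative mixture model: it uses the standard Lee-Seung multiplicative updates for squared error, written in dependency-free Python. The function names (`nmf`, `frobenius_error`), the toy `expression` matrix, and all parameter values are assumptions made for illustration only.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols_b = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols_b] for row in a]


def transpose(a):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]


def nmf(v, rank, n_iter=200, seed=0, eps=1e-9):
    """Approximate a non-negative matrix v (samples x genes) as w @ h.

    Standard multiplicative updates (illustrative, not the paper's variant).
    Returns w (samples x rank) and h (rank x genes), both non-negative.
    """
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + eps for _ in range(rank)] for _ in range(n)]
    h = [[rng.random() + eps for _ in range(m)] for _ in range(rank)]
    for _ in range(n_iter):
        # H <- H * (W^T V) / (W^T W H), element-wise
        wt = transpose(w)
        num = matmul(wt, v)
        den = matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(rank)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        ht = transpose(h)
        num = matmul(v, ht)
        den = matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(rank)]
             for i in range(n)]
    return w, h


def frobenius_error(v, w, h):
    """Frobenius norm of the residual v - w @ h."""
    approx = matmul(w, h)
    return sum((x - y) ** 2
               for row_v, row_a in zip(v, approx)
               for x, y in zip(row_v, row_a)) ** 0.5


# Hypothetical toy data: 4 samples x 6 genes with two rough expression patterns.
expression = [
    [5, 4, 5, 0, 0, 1],
    [4, 5, 4, 1, 0, 0],
    [0, 1, 0, 5, 4, 5],
    [1, 0, 0, 4, 5, 4],
]

w, h = nmf(expression, rank=2)
# Each row of w is a 2-dimensional representation of one sample.
print("reduced samples:", [[round(x, 2) for x in row] for row in w])
print("reconstruction error:", round(frobenius_error(expression, w, h), 3))
```

In a pipeline of the kind the abstract describes, the rows of `w` would be passed to a classifier in place of the original gene-level measurements. In practice a numerical library would replace the list-based matrix helpers.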
