Using dimension reduction with feature selection to enhance accuracy of tumor classification

Author(s):  
Thuy Hang Dang ◽  
Dung Pham Trung ◽  
Hoai Linh Tran ◽  
Quang Le Van
2017 ◽  
Vol 27 (1) ◽  
pp. 169-180 ◽  
Author(s):  
Marton Szemenyei ◽  
Ferenc Vajda

Abstract Dimension reduction and feature selection are fundamental tools for machine learning and data mining. Most existing methods, however, assume that objects are represented by a single vectorial descriptor. In reality, some description methods assign unordered sets or graphs of vectors to a single object, where each vector is assumed to have the same number of dimensions, but is drawn from a different probability distribution. Moreover, some applications (such as pose estimation) may require the recognition of individual vectors (nodes) of an object. In such cases it is essential that the nodes within a single object remain distinguishable after dimension reduction. In this paper we propose new discriminant analysis methods that are able to satisfy two criteria at the same time: separating between classes and between the nodes of an object instance. We analyze and evaluate our methods on several different synthetic and real-world datasets.


2020 ◽  
Vol 21 (1) ◽  
Author(s):  
F. William Townes ◽  
Stephanie C. Hicks ◽  
Martin J. Aryee ◽  
Rafael A. Irizarry

2019 ◽  
Vol 20 (1) ◽  
Author(s):  
F. William Townes ◽  
Stephanie C. Hicks ◽  
Martin J. Aryee ◽  
Rafael A. Irizarry

AbstractSingle-cell RNA-Seq (scRNA-Seq) profiles gene expression of individual cells. Recent scRNA-Seq datasets have incorporated unique molecular identifiers (UMIs). Using negative controls, we show UMI counts follow multinomial sampling with no zero inflation. Current normalization procedures such as log of counts per million and feature selection by highly variable genes produce false variability in dimension reduction. We propose simple multinomial methods, including generalized principal component analysis (GLM-PCA) for non-normal distributions, and feature selection using deviance. These methods outperform the current practice in a downstream clustering assessment using ground truth datasets.


2017 ◽  
Vol 84 ◽  
pp. 24-36 ◽  
Author(s):  
Laith Mohammad Abualigah ◽  
Ahamad Tajudin Khader ◽  
Mohammed Azmi Al-Betar ◽  
Osama Ahmad Alomari

2019 ◽  
Author(s):  
Peter Washington ◽  
Kelley Marie Paskov ◽  
Haik Kalantarian ◽  
Nathaniel Stockham ◽  
Catalin Voss ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document