Unsupervised feature extraction and band subset selection techniques based on relative entropy criteria for hyperspectral data analysis

AbstractMultiomics data analysis is the central issue of genomics science. In spite of that, there are not well defined methods that can integrate multomics data sets, which are formatted as matrices with different sizes. In this paper, I propose the usage of tensor decomposition based unsupervised feature extraction as a data mining tool for multiomics data set. It can successfully integrate miRNA expression, mRNA expression and proteome, which were used as a demonstration example of DIABLO that is the recently proposed advanced method for the integrated analysis of multiomics data set.

Download Full-text

Classification of hyperspectral data using extended attribute profiles based on supervised and unsupervised feature extraction techniques

International Journal of Image and Data Fusion ◽

10.1080/19479832.2012.702687 ◽

2012 ◽

Vol 3 (3) ◽

pp. 269-298 ◽

Cited By ~ 45

Author(s):

Prashanth Reddy Marpu ◽

Mattia Pedergnana ◽

Mauro Dalla Mura ◽

Stijn Peeters ◽

Jon Atli Benediktsson ◽

...

Keyword(s):

Feature Extraction ◽

Hyperspectral Data ◽

Extraction Techniques ◽

Unsupervised Feature Extraction

Download Full-text

AN EXTENDED SPECTRAL–SPATIAL CLASSIFICATION APPROACH FOR HYPERSPECTRAL DATA

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-4-w4-37-2017 ◽

2017 ◽

Vol IV-4/W4 ◽

pp. 37-41

Author(s):

D. Akbari

Keyword(s):

Feature Extraction ◽

Spatial Information ◽

Principal Component ◽

Component Analysis ◽

Extraction Methods ◽

Hyperspectral Data ◽

Classification Approach ◽

Supervised Feature Extraction ◽

Noise Fraction ◽

Unsupervised Feature Extraction

In this paper an extended classification approach for hyperspectral imagery based on both spectral and spatial information is proposed. The spatial information is obtained by an enhanced marker-based minimum spanning forest (MSF) algorithm. Three different methods of dimension reduction are first used to obtain the subspace of hyperspectral data: (1) unsupervised feature extraction methods including principal component analysis (PCA), independent component analysis (ICA), and minimum noise fraction (MNF); (2) supervised feature extraction including decision boundary feature extraction (DBFE), discriminate analysis feature extraction (DAFE), and nonparametric weighted feature extraction (NWFE); (3) genetic algorithm (GA). The spectral features obtained are then fed into the enhanced marker-based MSF classification algorithm. In the enhanced MSF algorithm, the markers are extracted from the classification maps obtained by both SVM and watershed segmentation algorithm. To evaluate the proposed approach, the Pavia University hyperspectral data is tested. Experimental results show that the proposed approach using GA achieves an approximately 8&thinsp;% overall accuracy higher than the original MSF-based algorithm.

Download Full-text

Novel feature selection via kernel tensor decomposition for improved multi-omics data analysis

10.1101/2021.05.21.445049 ◽

2021 ◽

Author(s):

Y-h. Taguchi ◽

Turki Turki

Keyword(s):

Feature Extraction ◽

Feature Selection ◽

Data Analysis ◽

Tensor Decomposition ◽

Processing Unit ◽

Omics Data ◽

Central Processing ◽

Unsupervised Feature Extraction ◽

Selection Of ◽

Omics Data Analysis

Motivation: Feature selection of multi-omics data analysis remains challenging since omics data include 102-105 features. How to weight an individual omics dataset is unclear and greatly affects feature selection consequences. In this study, a recently proposed kernel tensor decomposition (KTD)-based unsupervised feature extraction (FE) was extended to integrate multi-omics datasets measured over common samples in a weight-free manner. Results: KTD-based unsupervised FE was reformatted as the collection of kernelized tensors sharing common samples and was applied to synthetic, as well as real, datasets. The proposed advanced KTD-based unsupervised FE performed comparatively with the previously proposed KTD, as well as TD-based unsupervised FE, with reduced memory and central processing unit time. This advanced KTD method, specifically designed for multi-omics analysis, attributes P-values to features, which other multi-omics-oriented methods rarely do. Availability: Sample R code is available in https://github.com/tagtag/MultiR/

Download Full-text

RNA-Seq data analysis for Planarian with tensor decomposition-based unsupervised feature extraction

10.1101/2021.06.15.448531 ◽

2021 ◽

Author(s):

Makoto Kashima ◽

Nobuyoshi Kumagai ◽

Hiromi Hirata ◽

Y-h. Taguchi

Keyword(s):

Feature Extraction ◽

Data Analysis ◽

De Novo ◽

Model Organism ◽

Tensor Decomposition ◽

Time Development ◽

Model Organisms ◽

Rna Seq ◽

Experimental Conditions ◽

Unsupervised Feature Extraction

RNA-Seq data analysis of non-model organisms is often difficult because of the lack of a well-annotated genome. In model organisms, after short reads are mapped to the genome, it is possible to focus on the analysis of regions well-annotated regions. However, in non-model organisms, contigs can be generated by de novo assembling. This can result in a large number of transcripts, making it difficult to easily remove redundancy. A large number of transcripts can also lead to difficulty in the recognition of differentially expressed transcripts (DETs) between more than two experimental conditions, because P-values must be corrected by considering multiple comparison corrections whose effect is enhanced as the number of transcripts increases. Heavily corrected P-values often fail to take sufficiently small P-values as significant. In this study, we applied a recently proposed tensor decomposition (TD)-based unsupervised feature extraction (FE) to the RNA-seq data obtained for a non-model organism, Planarian; we successfully obtained a limited number of transcripts whose expression was altered between normal and defective samples as well as during time development. TD-based unsupervised FE is expected to be an effective tool that can identify a limited number of DETs, even when a poorly annotated genome is available.

Download Full-text

(2D)2UFFCA: Two-directional Two-dimensional Unsupervised Feature Extraction Method with Fuzzy Clustering Ability

ACTA AUTOMATICA SINICA ◽

10.3724/sp.j.1004.2012.00549 ◽

2012 ◽

Vol 38 (4) ◽

pp. 549-562 ◽

Cited By ~ 1

Author(s):

Jun GAO ◽

Chang-Yin SUN ◽

Shi-Tong WANG

Keyword(s):

Feature Extraction ◽

Fuzzy Clustering ◽

Extraction Method ◽

Two Dimensional ◽

Feature Extraction Method ◽

Unsupervised Feature Extraction

Download Full-text

Human motion data analysis and retrieval based on 3D feature extraction

Journal of Computer Applications ◽

10.3724/sp.j.1087.2008.01344 ◽

2008 ◽

Vol 28 (5) ◽

pp. 1344-1346 ◽

Cited By ~ 1

Author(s):

Jian XIANG

Keyword(s):

Feature Extraction ◽

Data Analysis ◽

Human Motion ◽

Motion Data ◽

3D Feature Extraction

Download Full-text

Unsupervised Classification System for Hyperspectral Data Analysis

10.21236/ada398803 ◽

2001 ◽

Author(s):

Luis O. Jimenez ◽

Miguel Velez ◽

Shawn Hunt

Keyword(s):

Data Analysis ◽

Classification System ◽

Unsupervised Classification ◽

Hyperspectral Data

Download Full-text