New Application of Life Rank Algorithm: A Case Study

2021 ◽  
Vol 23 (06) ◽  
pp. 438-447
Author(s):  
Neha Sharma ◽  
Dr. Rashi Agarwal ◽  
Dr. Narendra Kohli ◽  
Dr. Shubha Jain

The past few years have seen the emergence of learning-to-rank (LTR) in the field of machine learning. In information retrieval, datasets are very large, and training a learning-to-rank model on them is a costly and time-consuming process. High-dimensional data contains irrelevant and redundant features, which leads to overfitting. Dimensionality reduction methods are used to manage this issue and fall into two categories: feature selection and feature extraction. There is extensive research on learning-to-rank algorithms themselves, but not on dimensionality reduction approaches for LTR, despite their importance. Feature selection techniques designed for classification are typically used directly for ranking, and to the best of our understanding, feature extraction techniques have not been explored much for ranking problems to date. We make an effort to fill this void and explore feature extraction in the context of LTR problems. The LifeRank algorithm is a linear feature extraction algorithm for ranking; its performance has previously been analyzed only with RankSVM and linear regression, not with other learning-to-rank algorithms. In this work, we therefore study the effect of applying the LifeRank algorithm to other LTR algorithms, specifically RankNet and RankBoost, and analyze the performance of several LTR algorithms on the LETOR dataset before and after feature extraction.
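
The abstract describes a two-stage pipeline: a linear feature-extraction step applied to LETOR feature vectors, followed by a pairwise LTR algorithm trained on the projected features. Below is a minimal Python sketch of that pipeline, using plain PCA as a stand-in for the LifeRank projection (the actual LifeRank transform is learned from a ranking objective and is not reproduced here) and a RankNet-style pairwise logistic model as the ranker; the data is synthetic.

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 46))      # 200 documents with LETOR-style feature vectors
    y = rng.integers(0, 3, size=200)    # graded relevance labels (0, 1, 2)

    # Stage 1 -- linear feature extraction: project onto the top-k principal
    # directions (a stand-in for the learned LifeRank projection).
    k = 10
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    Z = Xc @ Vt[:k].T                   # reduced (200, 10) representation

    # Stage 2 -- RankNet-style pairwise training: logistic loss on score differences.
    w = np.zeros(k)
    pairs = [(i, j) for i in range(len(y)) for j in range(len(y)) if y[i] > y[j]]
    lr = 0.01
    for _ in range(20):
        for i, j in pairs:
            diff = Z[i] - Z[j]
            p = 1.0 / (1.0 + np.exp(-(w @ diff)))  # P(doc i ranked above doc j)
            w += lr * (1.0 - p) * diff             # gradient step on the pairwise log-loss

    scores = Z @ w                      # final ranking scores in the reduced space
    print("top 5 documents:", np.argsort(-scores)[:5])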

2011 ◽  
Vol 08 (02) ◽  
pp. 161-169
Author(s):  
E. SIVASANKAR ◽  
H. SRIDHAR ◽  
V. BALAKRISHNAN ◽  
K. ASHWIN ◽  
R. S. RAJESH

Data mining methods are used to extract useful information from voluminous data. The data to be mined may have a large number of dimensions, so the mining process can take a lot of time; in general, the computation time is an exponential function of the number of dimensions. It is in this context that we use dimensionality reduction techniques to speed up the decision-making process. Dimensionality reduction techniques can be categorized as feature selection and feature extraction techniques, and in this paper we compare the two categories. Feature selection has been implemented using the Information Gain and Goodman–Kruskal measures, while Principal Component Analysis has been used for feature extraction. To compare the accuracy of the methods, we have also implemented a classifier using a back-propagation neural network. In general, we find that feature extraction methods are more accurate than feature selection methods in the framework of credit risk analysis.
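
A hedged sketch of this comparison using scikit-learn stand-ins: SelectKBest with mutual information approximates the information-gain ranking (the Goodman–Kruskal measure has no stock implementation and is omitted), PCA performs the feature extraction, and MLPClassifier plays the role of the back-propagation network. The synthetic data merely mimics a credit-risk table.

    from sklearn.datasets import make_classification
    from sklearn.decomposition import PCA
    from sklearn.feature_selection import SelectKBest, mutual_info_classif
    from sklearn.model_selection import train_test_split
    from sklearn.neural_network import MLPClassifier

    # Synthetic stand-in for a credit-risk dataset: 30 features, 8 informative.
    X, y = make_classification(n_samples=1000, n_features=30, n_informative=8,
                               random_state=0)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

    for name, reducer in [("feature selection (mutual info)",
                           SelectKBest(mutual_info_classif, k=8)),
                          ("feature extraction (PCA)", PCA(n_components=8))]:
        Z_tr = reducer.fit_transform(X_tr, y_tr)
        Z_te = reducer.transform(X_te)
        # Back-propagation classifier trained on the reduced features.
        clf = MLPClassifier(hidden_layer_sizes=(16,), max_iter=1000,
                            random_state=0).fit(Z_tr, y_tr)
        print(f"{name}: accuracy = {clf.score(Z_te, y_te):.3f}")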


2020 ◽  
Vol 1 (2) ◽  
pp. 56-70 ◽  
Author(s):  
Rizgar Zebari ◽  
Adnan Abdulazeez ◽  
Diyar Zeebaree ◽  
Dilovan Zebari ◽  
Jwan Saeed

Due to sharp increases in data dimensions, every data mining or machine learning (ML) task requires more efficient techniques to get the desired results. In recent years, therefore, researchers have proposed and developed many methods and techniques to reduce the high dimensionality of data while attaining the required accuracy. To improve the accuracy of learned features and to decrease training time, dimensionality reduction (DR) is used as a pre-processing step that can eliminate irrelevant data, noise, and redundant features. DR is performed with two main families of methods: feature selection (FS) and feature extraction (FE). FS is an important method because data is generated continuously at an ever-increasing rate; it mitigates serious dimensionality problems by decreasing redundancy effectively, eliminating irrelevant data, and improving the comprehensibility of results. FE, in turn, addresses the problem of finding the most distinctive, informative, and reduced set of features to improve the efficiency of both the processing and the storage of data. This paper offers a comprehensive review of FS and FE within the scope of DR. The details of each surveyed paper, such as the algorithms/approaches used, datasets, classifiers, and achieved results, are comprehensively analyzed and summarized. In addition, a systematic discussion of all the reviewed methods highlights authors' trends, identifies the methods that most significantly reduce computational time, and selects the most accurate classifiers. Finally, the different types of both method families are discussed and their findings analyzed.
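
As a minimal illustration of DR as the pre-processing step the survey describes, the sketch below chains a feature-selection stage and a feature-extraction stage ahead of a classifier. The scorers, component counts, and synthetic data are illustrative assumptions, not choices drawn from any reviewed paper.

    from sklearn.datasets import make_classification
    from sklearn.decomposition import TruncatedSVD
    from sklearn.feature_selection import SelectKBest, f_classif
    from sklearn.linear_model import LogisticRegression
    from sklearn.pipeline import Pipeline

    # 100-dimensional synthetic data, of which only 10 features are informative.
    X, y = make_classification(n_samples=500, n_features=100, n_informative=10,
                               random_state=1)

    pipe = Pipeline([
        ("fs", SelectKBest(f_classif, k=40)),       # FS: keep the 40 top-scoring features
        ("fe", TruncatedSVD(n_components=10)),      # FE: compress them to 10 components
        ("clf", LogisticRegression(max_iter=1000)), # learn on the reduced representation
    ])
    pipe.fit(X, y)
    print("training accuracy:", pipe.score(X, y))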


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Joshua T. Vogelstein ◽  
Eric W. Bridgeford ◽  
Minh Tang ◽  
Da Zheng ◽  
Christopher Douville ◽  
...  

To solve key biomedical problems, experimentalists now routinely measure millions or billions of features (dimensions) per sample, with the hope that data science techniques will be able to build accurate data-driven inferences. Because sample sizes are typically orders of magnitude smaller than the dimensionality of these data, valid inferences require finding a low-dimensional representation that preserves the discriminating information (e.g., whether the individual suffers from a particular disease). There is a lack of interpretable supervised dimensionality reduction methods that scale to millions of dimensions with strong statistical theoretical guarantees. We introduce an approach to extending principal components analysis by incorporating class-conditional moment estimates into the low-dimensional projection. The simplest version, Linear Optimal Low-Rank Projection, incorporates the class-conditional means. We prove, and substantiate with both synthetic and real data benchmarks, that Linear Optimal Low-Rank Projection and its generalizations lead to improved data representations for subsequent classification, while maintaining computational efficiency and scalability. Using multiple brain imaging datasets consisting of more than 150 million features, and several genomics datasets with more than 500,000 features, Linear Optimal Low-Rank Projection outperforms other scalable linear dimensionality reduction techniques in terms of accuracy, while only requiring a few minutes on a standard desktop computer.
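
A hedged sketch of the core idea: augment the leading principal directions with the class-conditional mean difference before projecting, so that the low-dimensional embedding retains the direction that separates the classes. This is a simplification for illustration, not the authors' implementation, and the dimensions and data below are synthetic.

    import numpy as np

    rng = np.random.default_rng(0)
    n, d, k = 300, 1000, 10                 # samples, ambient dimension, target rank
    y = rng.integers(0, 2, size=n)          # two-class labels
    # High-dimensional data where only the first coordinate separates the classes.
    X = rng.normal(size=(n, d)) + 2.0 * y[:, None] * (np.arange(d) == 0)

    # Class-conditional moment: the difference of class means.
    delta = X[y == 1].mean(axis=0) - X[y == 0].mean(axis=0)

    # Leading principal directions of the centered data.
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)

    # Stack the mean-difference direction on top of the top principal
    # directions, then orthonormalize so the projection is well conditioned.
    A = np.vstack([delta, Vt[:k - 1]])
    Q, _ = np.linalg.qr(A.T)                # (d, k) orthonormal basis
    Z = X @ Q                               # supervised low-rank embedding
    print(Z.shape)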


2019 ◽  
Vol 2019 ◽  
pp. 1-19
Author(s):  
Yaolong Li ◽  
Hongru Li ◽  
Bing Wang ◽  
He Yu ◽  
Weiguo Wang

Degradation features are crucial for assessing performance degradation and predicting the remaining useful life of rolling bearings. Numerous degradation features have been proposed, and many researchers have applied dimensionality reduction methods to reduce the redundancy of those features; however, these efforts have not considered the properties and similarity of the features themselves. In this paper, we present a simple way to reduce dimensionality by classifying features according to their trends, into two subdivisions: uptrends and downtrends. Within each subdivision there is visible trend similarity, and we introduce two indexes to measure it. By selecting a representative feature for each subdivision, the multiple features can be reduced in dimensionality. Through comparison, root mean square and sample entropy emerge as good representatives of the uptrend and downtrend features, respectively. This method gives an alternative way to reduce the dimensionality of rolling bearings' degradation features.
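
A minimal sketch of the grouping idea: score each degradation feature's monotonic trend against time (Spearman's rho is used here as one plausible trend measure; the paper's two similarity indexes are not reproduced), split the features into uptrend and downtrend subdivisions, and keep one representative per subdivision. The feature series are synthetic stand-ins.

    import numpy as np
    from scipy.stats import spearmanr

    t = np.arange(500)                       # operating time index
    rng = np.random.default_rng(0)
    features = {
        "rms":            0.010 * t + rng.normal(scale=1.0, size=t.size),
        "kurtosis":       0.008 * t + rng.normal(scale=2.0, size=t.size),
        "sample_entropy": -0.010 * t + rng.normal(scale=1.0, size=t.size),
    }

    # Classify each feature by the sign of its trend against time.
    uptrend, downtrend = {}, {}
    for name, series in features.items():
        rho, _ = spearmanr(t, series)
        (uptrend if rho > 0 else downtrend)[name] = abs(rho)

    # Representative = the feature with the strongest trend in each subdivision.
    print("uptrend representative:  ", max(uptrend, key=uptrend.get))
    print("downtrend representative:", max(downtrend, key=downtrend.get))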


2019 ◽  
Vol 8 (2) ◽  
pp. 4800-4807

Recently, engineers have been concentrating on designing effective prediction models for student admission rates in order to support the educational growth of the nation. Predicting student admission to higher education is a challenging task for any educational organization: admissions are in a highly visible state of crisis, the admission rate is a major risk to educational institutions worldwide, and it affects the economic, social, academic, cultural, and financial standing of the nation. The admission rate also depends on the admission procedures and policies of the institution and on the feedback given by all stakeholders of the educational sector, which makes forecasting admissions a major task for protecting the profit and wealth of the organization. This paper analyzes the performance of student admission prediction using machine learning dimensionality reduction algorithms. The Admission Predict dataset from the Kaggle machine learning repository is used for the prediction analysis, and its features are reduced by feature reduction methods. The prediction of the chance of admit proceeds in four steps. First, the correlations between the dataset attributes are computed and depicted as a histogram. Second, the most highly correlated features, which contribute directly to predicting the chance of admit, are identified. Third, the Admission Predict dataset is subjected to dimensionality reduction methods: principal component analysis (PCA), Sparse PCA, Incremental PCA, Kernel PCA, and Mini Batch Sparse PCA. Fourth, each reduced dataset is used to compute and compare the mean squared error (MSE), mean absolute error (MAE), and R2 score of each method. The implementation is done in Python in the Anaconda Spyder integrated development environment. Experimental results show that CGPA, GRE score, and TOEFL score are the most highly correlated features for predicting the chance of admit, and that Incremental PCA achieves the most effective prediction, with a minimum MSE of 0.09, an MAE of 0.24, and a reasonable R2 score of 0.26.
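
A hedged sketch of the comparison loop in steps three and four: each PCA variant reduces the feature set, a linear regressor predicts the chance of admit, and MSE, MAE, and R2 are reported. Synthetic regression data stands in for the Kaggle Admission Predict CSV; swapping in the real file would reproduce the study's setup.

    from sklearn.datasets import make_regression
    from sklearn.decomposition import (PCA, SparsePCA, IncrementalPCA,
                                       KernelPCA, MiniBatchSparsePCA)
    from sklearn.linear_model import LinearRegression
    from sklearn.metrics import (mean_absolute_error, mean_squared_error,
                                 r2_score)
    from sklearn.model_selection import train_test_split

    # Synthetic stand-in with 7 features, like the Admission Predict dataset.
    X, y = make_regression(n_samples=400, n_features=7, noise=10.0,
                           random_state=0)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

    variants = {
        "PCA": PCA(n_components=3),
        "Sparse PCA": SparsePCA(n_components=3, random_state=0),
        "Incremental PCA": IncrementalPCA(n_components=3),
        "Kernel PCA": KernelPCA(n_components=3, kernel="rbf"),
        "Mini Batch Sparse PCA": MiniBatchSparsePCA(n_components=3,
                                                    random_state=0),
    }
    for name, dr in variants.items():
        Z_tr, Z_te = dr.fit_transform(X_tr), dr.transform(X_te)
        pred = LinearRegression().fit(Z_tr, y_tr).predict(Z_te)
        print(f"{name:22s} MSE={mean_squared_error(y_te, pred):8.2f} "
              f"MAE={mean_absolute_error(y_te, pred):6.2f} "
              f"R2={r2_score(y_te, pred):5.2f}")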


2019 ◽  
Author(s):  
Cody N. Heiser ◽  
Ken S. Lau

High-dimensional data, such as those generated using single-cell RNA sequencing, present challenges in interpretation and visualization. Numerical and computational methods for dimensionality reduction allow for low-dimensional representation of genome-scale expression data for downstream clustering, trajectory reconstruction, and biological interpretation. However, a comprehensive and quantitative evaluation of the performance of these techniques has not been established. We present an unbiased framework that defines metrics of global and local structure preservation in dimensionality reduction transformations. Using discrete and continuous scRNA-seq datasets, we find that input cell distribution and method parameters largely determine how well eleven published dimensionality reduction methods preserve global, local, and organizational data structure. Code available at github.com/KenLauLab/DR-structure-preservation allows for rapid evaluation of further datasets and methods.
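
A minimal sketch of the kind of structure-preservation metrics such a framework formalizes: a global score (rank correlation of pairwise cell-cell distances before and after embedding) and a local score (k-nearest-neighbor overlap). These are generic stand-ins, not the paper's exact metrics; the repository linked above contains the real implementation.

    import numpy as np
    from scipy.spatial.distance import pdist
    from scipy.stats import spearmanr
    from sklearn.decomposition import PCA
    from sklearn.neighbors import NearestNeighbors

    rng = np.random.default_rng(0)
    X = rng.normal(size=(300, 2000))           # cells x genes, synthetic
    Z = PCA(n_components=10).fit_transform(X)  # any DR method could go here

    # Global: do pairwise cell-cell distances keep their ordering?
    rho, _ = spearmanr(pdist(X), pdist(Z))
    print(f"global distance correlation: {rho:.3f}")

    # Local: how many of each cell's k nearest neighbors survive the embedding?
    k = 15
    nn_X = NearestNeighbors(n_neighbors=k).fit(X).kneighbors(X, return_distance=False)
    nn_Z = NearestNeighbors(n_neighbors=k).fit(Z).kneighbors(Z, return_distance=False)
    overlap = np.mean([len(set(a) & set(b)) / k for a, b in zip(nn_X, nn_Z)])
    print(f"mean {k}-NN neighborhood overlap: {overlap:.3f}")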

