Semi-supervised Feature Importance Evaluation with Ensemble Learning

Author(s):  
Hasna Barkia ◽  
Haytham Elghazel ◽  
Alex Aussem
Author(s):  
Ameni Filali ◽  
Chiraz Jlassi ◽  
Najet Arous

To uncover an appropriate latent subspace for data representation, we propose in this paper a new extension of the random forests method, leading to an unsupervised feature selection approach called Feature Selection with Random Forests (RFS), based on SOM variants, that evaluates the out-of-bag feature importance from a set of partitions. Each partition is created using a bootstrap sample and a random subset of features. Empirical results on 19 benchmark datasets indicate that RFS, combined with a recursive feature elimination (RFE) method, can yield significant improvements in clustering accuracy with a very small subset of features. Simulations are performed on nine different benchmarks, including face data, handwritten digit data, and document data. Promising experimental results and theoretical analysis demonstrate the efficiency and effectiveness of the proposed method for feature selection in comparison with competitive representative algorithms.
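The abstract above describes scoring features by out-of-bag importance over an ensemble of partitions, each built from a bootstrap sample and a random feature subset. The following is a minimal, hedged sketch of that idea: KMeans stands in for the SOM variant, its cluster labels serve as pseudo-targets for a random forest, and impurity-based importances are accumulated across partitions. All names and parameters here are illustrative assumptions, not the authors' actual RFS algorithm.

```python
# Illustrative sketch of partition-based unsupervised feature scoring.
# Assumptions: KMeans replaces the SOM variant; impurity-based random
# forest importances replace the paper's out-of-bag importance measure.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

X, _ = load_iris(return_X_y=True)  # labels are ignored: unsupervised setting
n_samples, n_features = X.shape

rng = np.random.default_rng(0)
scores = np.zeros(n_features)
n_partitions = 10

for _ in range(n_partitions):
    idx = rng.integers(0, n_samples, n_samples)            # bootstrap sample
    feats = rng.choice(n_features, size=3, replace=False)  # random feature subset
    # Cluster the bootstrap sample on the feature subset to get a partition.
    labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(
        X[idx][:, feats]
    )
    # Fit a forest on all features against the partition's pseudo-labels
    # and accumulate its feature importances.
    rf = RandomForestClassifier(n_estimators=50, random_state=0).fit(X[idx], labels)
    scores += rf.feature_importances_

ranking = np.argsort(scores)[::-1]  # most important features first
print(ranking)
```

A recursive feature elimination step, as in the paper, would repeat this loop while dropping the lowest-ranked feature each round.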


Author(s):  
Much Aziz Muslim ◽  
Yosza Dasril

<span>Bankruptcy can inflict heavy losses on a company's stakeholders, including owners, investors, employees, and consumers. One way to mitigate this risk is to predict the likelihood of bankruptcy from the company's financial data. This study therefore aims to find the best predictive model or method for company bankruptcy using the Polish companies bankruptcy dataset. The prediction analysis combines feature selection with ensemble learning. Features are selected using XGBoost feature importance with a weight value filter of 10. The ensemble learning method used is stacking, which is composed of base models and a meta learner. The base models are K-nearest neighbor, decision tree, SVM, and random forest, while the meta learner is LightGBM. The stacking model outperforms the base models, reaching an accuracy of 97%.</span>
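The stacking setup described above can be sketched with scikit-learn's `StackingClassifier`. This is a hedged illustration, not the study's pipeline: the dataset is a built-in stand-in for the Polish bankruptcy data, the XGBoost importance filter is omitted, and `GradientBoostingClassifier` is swapped in for LightGBM so the example stays self-contained.

```python
# Illustrative stacking ensemble: KNN, decision tree, SVM and random
# forest as base models, a gradient-boosted meta learner on top.
# Assumption: GradientBoostingClassifier stands in for LightGBM.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import (GradientBoostingClassifier,
                              RandomForestClassifier, StackingClassifier)
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)  # stand-in binary dataset
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

base_models = [
    ("knn", make_pipeline(StandardScaler(), KNeighborsClassifier())),
    ("dt", DecisionTreeClassifier(random_state=0)),
    ("svm", make_pipeline(StandardScaler(), SVC(probability=True))),
    ("rf", RandomForestClassifier(n_estimators=100, random_state=0)),
]

# The meta learner is trained on out-of-fold predictions of the base models.
stack = StackingClassifier(
    estimators=base_models,
    final_estimator=GradientBoostingClassifier(random_state=0),
)
stack.fit(X_tr, y_tr)
acc = stack.score(X_te, y_te)
print(f"stacking accuracy: {acc:.3f}")
```

By default `StackingClassifier` uses 5-fold cross-validation to generate the meta learner's training features, which is what keeps the second level from overfitting to the base models' in-sample predictions.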


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Haowen Deng ◽  
Youyou Zhou ◽  
Lin Wang ◽  
Cheng Zhang

Abstract Background Neonatal jaundice may cause severe neurological damage if poorly evaluated and diagnosed when high bilirubin occurs. This study explored how to effectively integrate high-dimensional genetic features into the prediction of neonatal jaundice. Methods The study recruited 984 neonates from the Suzhou Municipal Central Hospital in China and applied an ensemble learning approach combining high-dimensional genetic features and clinical risk factors (CRF) to predict physiological neonatal jaundice of full-term newborns within one week after birth. Sigmoid recalibration was then applied to validate the reliability of the method. Results Prediction with CRF alone reached a maximum Area Under the Curve (AUC) of 79.5%, which was marginally improved, by 3.5%, when genetic variants (GV) were included. Feature importance showed that 36 GVs contributed 55.5% of the prediction of neonatal jaundice in terms of gain from splits. Further analysis revealed that the main contribution of GV was to reduce the false-positive rate, i.e., to increase the specificity of the prediction. Conclusions Our study sheds light on the theoretical and practical value of GV in the prediction of neonatal jaundice.
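The sigmoid recalibration mentioned in the Methods (Platt scaling: fitting a logistic function to a classifier's scores on held-out data) can be sketched with scikit-learn's `CalibratedClassifierCV`. The dataset and base model below are illustrative assumptions, not the study's cohort or ensemble.

```python
# Illustrative sigmoid (Platt) recalibration of a classifier's scores.
# Assumptions: synthetic data and GradientBoostingClassifier stand in
# for the study's neonatal cohort and ensemble model.
from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=600, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# method="sigmoid" fits a logistic mapping from raw scores to
# calibrated probabilities on cross-validated held-out folds.
calibrated = CalibratedClassifierCV(
    GradientBoostingClassifier(random_state=0), method="sigmoid", cv=3
)
calibrated.fit(X_tr, y_tr)

proba = calibrated.predict_proba(X_te)[:, 1]  # calibrated probabilities
print(proba.min(), proba.max())
```

Well-calibrated probabilities matter here because the study's claimed benefit of genetic variants is a lower false-positive rate, which depends on trustworthy probability thresholds.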


Author(s):  
Ruijie Du ◽  
Shuangcheng Wang ◽  
Cuiping Leng ◽  
Yunbin Fu
