PSO Based Optimized Ensemble Learning and Feature Selection Approach for Efficient Energy Forecast

Wafa Shafqat; Sehrish Malik; Kyu-Tae Lee; Do-Hyeun Kim

doi:10.3390/electronics10182188

PSO Based Optimized Ensemble Learning and Feature Selection Approach for Efficient Energy Forecast

Electronics ◽

10.3390/electronics10182188 ◽

2021 ◽

Vol 10 (18) ◽

pp. 2188

Author(s):

Wafa Shafqat ◽

Sehrish Malik ◽

Kyu-Tae Lee ◽

Do-Hyeun Kim

Keyword(s):

Feature Selection ◽

Ensemble Learning ◽

Optimization Technique ◽

Ensemble Model ◽

Success Rates ◽

Proposed Model ◽

Random Samples ◽

Feature Selection Approach ◽

Energy Forecast ◽

Fine Tune

Swarm intelligence techniques with incredible success rates are broadly used for various irregular and interdisciplinary topics. However, their impact on ensemble models is considerably unexplored. This study proposes an optimized-ensemble model integrated for smart home energy consumption management based on ensemble learning and particle swarm optimization (PSO). The proposed model exploits PSO in two distinct ways; first, PSO-based feature selection is performed to select the essential features from the raw dataset. Secondly, with larger datasets and comprehensive range problems, it can become a cumbersome task to tune hyper-parameters in a trial-and-error manner manually. Therefore, PSO was used as an optimization technique to fine-tune hyper-parameters of the selected ensemble model. A hybrid ensemble model is built by using combinations of five different baseline models. Hyper-parameters of each combination model were optimized using PSO followed by training on different random samples. We compared our proposed model with our previously proposed ANN-PSO model and a few other state-of-the-art models. The results show that optimized-ensemble learning models outperform individual models and the ANN-PSO model by minimizing RMSE to 6.05 from 9.63 and increasing the prediction accuracy by 95.6%. Moreover, our results show that random sampling can help improve prediction results compared to the ANN-PSO model from 92.3% to around 96%.

Download Full-text

Enterprise Credit Risk Assessment Using Feature Selection Approach and Ensemble Learning Technique

2020 16th International Conference on Computational Intelligence and Security (CIS) ◽

10.1109/cis52066.2020.00056 ◽

2020 ◽

Author(s):

Di Wang ◽

Zuoquan Zhang

Keyword(s):

Risk Assessment ◽

Feature Selection ◽

Credit Risk ◽

Ensemble Learning ◽

Credit Risk Assessment ◽

Enterprise Credit ◽

Selection Approach ◽

Learning Technique ◽

Feature Selection Approach

Download Full-text

A Novel Feature Selection Approach based on Binary Particle Swarm Optimization and Ensemble Learning for Heterogeneous Defect Prediction

2021 3rd Asia Pacific Information Technology Conference ◽

10.1145/3449365.3449384 ◽

2021 ◽

Author(s):

Ruchika Malhotra ◽

Anmol Budhiraja ◽

Abhinav Kumar Singh ◽

Ishani Ghoshal

Keyword(s):

Feature Selection ◽

Particle Swarm Optimization ◽

Ensemble Learning ◽

Particle Swarm ◽

Defect Prediction ◽

Binary Particle Swarm Optimization ◽

Swarm Optimization ◽

Selection Approach ◽

Feature Selection Approach ◽

Heterogeneous Defect Prediction

Download Full-text

Multi-objective Evolutionary Approach for the Performance Improvement of Learners using Ensembling Feature Selection and Discretization Technique on Medical Data

Current Medical Imaging Formerly Current Medical Imaging Reviews ◽

10.2174/1573405614666180903114534 ◽

2020 ◽

Vol 16 (4) ◽

pp. 355-370

Author(s):

Deepak Singh ◽

Dilip Singh Sisodia ◽

Pradeep Singh

Keyword(s):

Feature Selection ◽

Information Gain ◽

Feature Reduction ◽

Misclassification Rate ◽

Original Form ◽

Feature Subset ◽

Ensemble Model ◽

Biomedical Data ◽

Multi Objective ◽

Proposed Model

Background: Biomedical data is filled with continuous real values; these values in the feature set tend to create problems like underfitting, the curse of dimensionality and increase in misclassification rate because of higher variance. In response, pre-processing techniques on dataset minimizes the side effects and have shown success in maintaining the adequate accuracy. Aims: Feature selection and discretization are the two necessary preprocessing steps that were effectively employed to handle the data redundancies in the biomedical data. However, in the previous works, the absence of unified effort by integrating feature selection and discretization together in solving the data redundancy problem leads to the disjoint and fragmented field. This paper proposes a novel multi-objective based dimensionality reduction framework, which incorporates both discretization and feature reduction as an ensemble model for performing feature selection and discretization. Selection of optimal features and the categorization of discretized and non-discretized features from the feature subset is governed by the multi-objective genetic algorithm (NSGA-II). The two objectives, minimizing the error rate during the feature selection and maximizing the information gain, while discretization is considered as fitness criteria. Methods: The proposed model used wrapper-based feature selection algorithm to select the optimal features and categorized these selected features into two blocks namely discretized and nondiscretized blocks. The feature belongs to the discretized block will participate in the binary discretization while the second block features will not be discretized and used in its original form. Results: For the establishment and acceptability of the proposed ensemble model, the experiment is conducted on the fifteen medical datasets, and the metric such as accuracy, mean and standard deviation are computed for the performance evaluation of the classifiers. Conclusion: After an extensive experiment conducted on the dataset, it can be said that the proposed model improves the classification rate and outperform the base learner.

Download Full-text

A new forecasting model with wrapper-based feature selection approach using multi-objective optimization technique for chaotic crude oil time series

Energy ◽

10.1016/j.energy.2020.118750 ◽

2020 ◽

Vol 212 ◽

pp. 118750 ◽

Cited By ~ 15

Author(s):

Seçkin Karasu ◽

Aytaç Altan ◽

Stelios Bekiros ◽

Wasim Ahmad

Keyword(s):

Time Series ◽

Feature Selection ◽

Crude Oil ◽

Optimization Technique ◽

Forecasting Model ◽

Multi Objective Optimization ◽

Multi Objective ◽

Selection Approach ◽

Feature Selection Approach

Download Full-text

Feature selection approach using ensemble learning for network anomaly detection

CAAI Transactions on Intelligence Technology ◽

10.1049/trit.2020.0073 ◽

2020 ◽

Vol 5 (4) ◽

pp. 283-293

Author(s):

Doreswamy ◽

Mohammad Kazim Hooshmand ◽

Ibrahim Gad

Keyword(s):

Feature Selection ◽

Anomaly Detection ◽

Ensemble Learning ◽

Selection Approach ◽

Feature Selection Approach ◽

Network Anomaly Detection

Download Full-text

Ensemble Learning for Skeleton-Based Body Mass Index Classification

Applied Sciences ◽

10.3390/app10217812 ◽

2020 ◽

Vol 10 (21) ◽

pp. 7812

Author(s):

Beom Kwon ◽

Sanghoon Lee

Keyword(s):

Body Mass Index ◽

Ensemble Learning ◽

Body Mass ◽

Average Length ◽

Average Height ◽

Ensemble Model ◽

Body Parts ◽

K Nearest Neighbor ◽

Proposed Model ◽

Anthropometric Features

In this study, we performed skeleton-based body mass index (BMI) classification by developing a unique ensemble learning method for human healthcare. Traditionally, anthropometric features, including the average length of each body part and average height, have been utilized for this kind of classification. Average values are generally calculated for all frames because the length of body parts and the subject height vary over time, as a result of the inaccuracy in pose estimation. Thus, traditionally, anthropometric features are measured over a long period. In contrast, we controlled the window used to measure anthropometric features over short/mid/long-term periods. This approach enables our proposed ensemble model to obtain robust and accurate BMI classification results. To produce final results, the proposed ensemble model utilizes multiple k-nearest neighbor classifiers trained using anthropometric features measured over several different time periods. To verify the effectiveness of the proposed model, we evaluated it using a public dataset. The simulation results demonstrate that the proposed model achieves state-of-the-art performance when compared with benchmark methods.

Download Full-text

An Improved Intelligent Approach to Enhance the Sentiment Classifier for Knowledge Discovery Using Machine Learning

International Journal of Sensors Wireless Communications and Control ◽

10.2174/2210327910999200528114552 ◽

2020 ◽

Vol 10 (4) ◽

pp. 582-593

Author(s):

Midde Venkateswarlu Naik ◽

D. Vasumathi ◽

A.P. Siva Kumar

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

Global Warming ◽

Particle Swarm Optimization ◽

Sentiment Analysis ◽

Optimization Technique ◽

Particle Swarm ◽

Sentiment Classification ◽

Support Vector ◽

Swarm Optimization

Aims: The proposed research work is on an evolutionary enhanced method for sentiment or emotion classification on unstructured review text in the big data field. The sentiment analysis plays a vital role for current generation of people for extracting valid decision points about any aspect such as movie ratings, education institute or politics ratings, etc. The proposed hybrid approach combined the optimal feature selection using Particle Swarm Optimization (PSO) and sentiment classification through Support Vector Machine (SVM). The current approach performance is evaluated with statistical measures, such as precision, recall, sensitivity, specificity, and was compared with the existing approaches. The earlier authors have achieved an accuracy of sentiment classifier in the English text up to 94% as of now. In the proposed scheme, an average accuracy of sentiment classifier on distinguishing datasets outperformed as 99% by tuning various parameters of SVM, such as constant c value and kernel gamma value in association with PSO optimization technique. The proposed method utilized three datasets, such as airline sentiment data, weather, and global warming datasets, that are publically available. The current experiment produced results that are trained and tested based on 10- Fold Cross-Validations (FCV) and confusion matrix for predicting sentiment classifier accuracy. Background: The sentiment analysis plays a vital role for current generation people for extracting valid decisions about any aspect such as movie rating, education institute or even politics ratings, etc. Sentiment Analysis (SA) or opinion mining has become fascinated scientifically as a research domain for the present environment. The key area is sentiment classification on semi-structured or unstructured data in distinguish languages, which has become a major research aspect. User-Generated Content [UGC] from distinguishing sources has been hiked significantly with rapid growth in a web environment. The huge user-generated data over social media provides substantial value for discovering hidden knowledge or correlations, patterns, and trends or sentiment extraction about any specific entity. SA is a computational analysis to determine the actual opinion of an entity which is expressed in terms of text. SA is also called as computation of emotional polarity expressed over social media as natural text in miscellaneous languages. Usually, the automatic superlative sentiment classifier model depends on feature selection and classification algorithms. Methods: The proposed work used Support vector machine as classification technique and particle swarm optimization technique as feature selection purpose. In this methodology, we tune various permutations and combination parameters in order to obtain expected desired results with kernel and without kernel technique for sentiment classification on three datasets, including airline, global warming, weather sentiment datasets, that are freely hosted for research practices. Results: In the proposed scheme, The proposed method has outperformed with 99.2% of average accuracy to classify the sentiment on different datasets, among other machine learning techniques. The attained high accuracy in classifying sentiment or opinion about review text proves superior effectiveness over existing sentiment classifiers. The current experiment produced results that are trained and tested based on 10- Fold Cross-Validations (FCV) and confusion matrix for predicting sentiment classifier accuracy. Conclusion: The objective of the research issue sentiment classifier accuracy has been hiked with the help of Kernel-based Support Vector Machine (SVM) based on parameter optimization. The optimal feature selection to classify sentiment or opinion towards review documents has been determined with the help of a particle swarm optimization approach. The proposed method utilized three datasets to simulate the results, such as airline sentiment data, weather sentiment data, and global warming data that are freely available datasets.

Download Full-text

Hybrid GA-PCA Feature Selection Approach for Inertial Human Activity Recognition

2018 IEEE Symposium Series on Computational Intelligence (SSCI) ◽

10.1109/ssci.2018.8628702 ◽

2018 ◽

Author(s):

Ayman M. Abo El-Maaty ◽

Amr G. Wassal

Keyword(s):

Feature Selection ◽

Activity Recognition ◽

Human Activity ◽

Human Activity Recognition ◽

Selection Approach ◽

Feature Selection Approach

Download Full-text

Distributed Rough Set Based Feature Selection Approach to Analyse Deep and Hand-crafted Features for Mammography Mass Classification

2018 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata.2018.8621962 ◽

2018 ◽

Cited By ~ 2

Author(s):

Azam Hamidinekoo ◽

Zaineb Chelly Dagdia ◽

Zobia Suhail ◽

Reyer Zwiggelaar

Keyword(s):

Feature Selection ◽

Rough Set ◽

Selection Approach ◽

Mass Classification ◽

Feature Selection Approach

Download Full-text

Feature Selection and Ensemble Learning Techniques in One-Class Classifiers: An Empirical Study of Two-Class Imbalanced Datasets

IEEE Access ◽

10.1109/access.2021.3051969 ◽

2021 ◽

Vol 9 ◽

pp. 13717-13726

Author(s):

Chih-Fong Tsai ◽

Wei-Chao Lin

Keyword(s):

Feature Selection ◽

Empirical Study ◽

Ensemble Learning ◽

Imbalanced Datasets ◽

Learning Techniques

Download Full-text