A NOVEL HYBRID FRAMEWORK OF COEVOLUTIONARY GA AND MACHINE LEARNING

Author(s):  
HISASHI HANDA ◽  
MITSURU BABA ◽  
TADASHI HORIUCHI ◽  
OSAMU KATAI

In this paper, we will propose a novel framework of hybridization of Coevolutionary Genetic Algorithm and Machine Learning. The Coevolutionary Genetic Algorithm (CGA) which has already been proposed by Handa et al. consists of two GA populations: the first GA (H-GA) population searches for the solutions in given problems, and the second GA (P-GA) population searches for effective schemata of the H-GA. The CGA adopts the notion of commensalism, a kind of co-evolution. The new hybrid framework incorporates a schema extraction mechanism by Machine Learning techniques into the CGA. Considerable improvement in its search ability is obtained by extracting more efficient and useful schemata from the H-GA population and then by incorporating those extracted schemata into the P-GA. We will examine and compare two kinds of machine learning techniques in extracting schema information: C4.5 and CN2. Several computational simulations on multidimensional knapsack problems, constraint satisfaction problems and function optimization problems will reveal the effectiveness of the proposed methods.

Polymers ◽  
2021 ◽  
Vol 13 (18) ◽  
pp. 3100
Author(s):  
Anusha Mairpady ◽  
Abdel-Hamid I. Mourad ◽  
Mohammad Sayem Mozumder

The selection of nanofillers and compatibilizing agents, and their size and concentration, are always considered to be crucial in the design of durable nanobiocomposites with maximized mechanical properties (i.e., fracture strength (FS), yield strength (YS), Young’s modulus (YM), etc). Therefore, the statistical optimization of the key design factors has become extremely important to minimize the experimental runs and the cost involved. In this study, both statistical (i.e., analysis of variance (ANOVA) and response surface methodology (RSM)) and machine learning techniques (i.e., artificial intelligence-based techniques (i.e., artificial neural network (ANN) and genetic algorithm (GA)) were used to optimize the concentrations of nanofillers and compatibilizing agents of the injection-molded HDPE nanocomposites. Initially, through ANOVA, the concentrations of TiO2 and cellulose nanocrystals (CNCs) and their combinations were found to be the major factors in improving the durability of the HDPE nanocomposites. Further, the data were modeled and predicted using RSM, ANN, and their combination with a genetic algorithm (i.e., RSM-GA and ANN-GA). Later, to minimize the risk of local optimization, an ANN-GA hybrid technique was implemented in this study to optimize multiple responses, to develop the nonlinear relationship between the factors (i.e., the concentration of TiO2 and CNCs) and responses (i.e., FS, YS, and YM), with minimum error and with regression values above 95%.


2021 ◽  
Author(s):  
Hugo Abreu Mendes ◽  
João Fausto Lorenzato Oliveira ◽  
Paulo Salgado Gomes Mattos Neto ◽  
Alex Coutinho Pereira ◽  
Eduardo Boudoux Jatoba ◽  
...  

Within the context of clean energy generation, solar radiation forecast is applied for photovoltaic plants to increase maintainability and reliability. Statistical models of time series like ARIMA and machine learning techniques help to improve the results. Hybrid Statistical + ML are found in all sorts of time series forecasting applications. This work presents a new way to automate the SARIMAX modeling, nesting PSO and ACO optimization algorithms, differently from R's AutoARIMA, its searches optimal seasonality parameter and combination of the exogenous variables available. This work presents 2 distinct hybrid models that have MLPs as their main elements, optimizing the architecture with Genetic Algorithm. A methodology was used to obtain the results, which were compared to LSTM, CLSTM, MMFF and NARNN-ARMAX topologies found in recent works. The obtained results for the presented models is promising for use in automatic radiation forecasting systems since it outperformed the compared models on at least two metrics.


2021 ◽  
Author(s):  
Asad Mustafa Elmgerbi ◽  
Clemens Peter Ettinger ◽  
Peter Mbah Tekum ◽  
Gerhard Thonhauser ◽  
Andreas Nascimento

Abstract Over the past decade, several models have been generated to predict Rate of Penetration (ROP) in real-time. In general, these models can be classified into two categories, model-driven (analytical models) and data-driven models (based on machine learning techniques), which is considered as cutting-edge technology in terms of predictive accuracy and minimal human interfering. Nevertheless, most existing machine learning models are mainly used for prediction, not optimization. The ROP ahead of the bit for a certain formation layer can be predicted with such methods, but the limitation of the applications of these techniques is to find an optimum set of operating parameters for the optimization of ROP. In this regard, two data-driven models for ROP prediction have been developed and thereafter have been merged into an optimizer model. The purpose of the optimization process is to seek the ideal combinations of drilling parameters that would lead to an improvement in the ROP in real-time for a given formation. This paper is mainly focused on describing the process of development to create smart data-driven models (built on MATLAB software environment) for real-time rate of penetration prediction and optimization within a sufficient time span and without disturbing the drilling process, as it is typically required by a drill-off test. The used models here can be classified into two groups: two predictive models, Artificial Neural Network (ANN) and Random Forest (RF), in addition to one optimizer, namely genetic algorithm. The process started by developing, optimizing, and validation of the predictive models, which subsequently were linked to the genetic algorithm (GA) for real-time optimization. Automated optimization algorithms were integrated into the process of developing the productive models to improve the model efficiency and to reduce the errors. In order to validate the functionalities of the developed ROP optimization model, two different cases were studied. For the first case, historical drilling data from different wells were used, and the results confirmed that for the three known controllable surface drilling parameters, weight on bit (WOB) has the highest impact on ROP, followed by flow rate (FR) and finally rotation per minute (RPM), which has the least impact. In the second case, a laboratory scaled drilling rig "CDC miniRig" was utilized to validate the developed model, during the validation only the previous named parameters were used. Several meters were drilled through sandstone cubes at different weights on bit, rotations per minute, and flow rates to develop the productive models; then the optimizer was activated to propose the optimal set of the used parameters, which likely maximize the ROP. The proposed parameters were implemented, and the results showed that ROP improved as expected.


2015 ◽  
Vol 9s3 ◽  
pp. BBI.S29469 ◽  
Author(s):  
Lucas J. Adams ◽  
Ghalib Bello ◽  
Gerard G. Dumancas

The problem of selecting important variables for predictive modeling of a specific outcome of interest using questionnaire data has rarely been addressed in clinical settings. In this study, we implemented a genetic algorithm (GA) technique to select optimal variables from questionnaire data for predicting a five-year mortality. We examined 123 questions (variables) answered by 5,444 individuals in the National Health and Nutrition Examination Survey. The GA iterations selected the top 24 variables, including questions related to stroke, emphysema, and general health problems requiring the use of special equipment, for use in predictive modeling by various parametric and nonparametric machine learning techniques. Using these top 24 variables, gradient boosting yielded the nominally highest performance (area under curve [AUC] = 0.7654), although there were other techniques with lower but not significantly different AUC. This study shows how GA in conjunction with various machine learning techniques could be used to examine questionnaire data to predict a binary outcome.


Author(s):  
Tales Lima Fonseca ◽  
Yulia Gorodetskaya ◽  
Gisele Goulart Tavares ◽  
Celso Bandeira de Melo Ribeiro ◽  
Leonardo Goliatt da Fonseca

The short-term streamflow forecast is an important parameter in studies related to energy generation and the prediction of possible floods. Flowing through three Brazilian states, the Paraíba do Sul river is responsible for the supply and energy generation in several municipalities.  Machine learning techniques have been studied with the aim of improving these predictions through the use of hydrological and hydrometeorological parameters. Furthermore, the predictive performance of the machine learning techniques are directly related to the quality of the training base and, moreover, to the set of hyperparameters used. The present study explores the combination of the Gradient Boosting technique coupled with a Genetic Algorithm to found the best set of hyperparameter to maximize the predicting performance of the Paraíba do Sul river streamflow.


Author(s):  
Mushtaq Talb Tally ◽  
Haleh Amintoosi

With the development of web applications nowadays, intrusions represent a crucial aspect in terms of violating the security policies. Intrusions can be defined as a specific change in the normal behavior of the network operations that intended to violate the security policies of a particular network and affect its performance. Recently, several researchers have examined the capabilities of machine learning techniques in terms of detecting intrusions. One of the important issues behind using the machine learning techniques lies on employing proper set of features. Since the literature has shown diversity of feature types, there is a vital demand to apply a feature selection approach in order to identify the most appropriate features for intrusion detection. This study aims to propose a hybrid method of Genetic Algorithm and Support Vector Machine. GA has been as a feature selection in order to select the best features, while SVM has been used as a classification method to categorize the behavior into normal and intrusion based on the selected features from GA. A benchmark dataset of intrusions (NSS-KDD) has been in the experiment. In addition, the proposed method has been compared with the traditional SVM. Results showed that GA has significantly improved the SVM classification by achieving 0.927 of f-measure.


Sign in / Sign up

Export Citation Format

Share Document