Estimating the Total Organic Carbon for Unconventional Shale Resources During the Drilling Process: A Machine Learning Approach

Journal of Energy Resources Technology ◽

10.1115/1.4051737 ◽

2021 ◽

pp. 1-26

Author(s):

Ahmed Mahmoud ◽

Hany Gamal ◽

Salaheldin Elkatatny ◽

Ahmed Alsaihati

Keyword(s):

Machine Learning ◽

Organic Carbon ◽

Total Organic Carbon ◽

Gamma Ray ◽

Fuzzy Inference ◽

Support Vector ◽

Drilling Process ◽

Validation Data ◽

Inference System ◽

Data Points

Abstract Total organic carbon (TOC) is an essential parameter that indicates the quality of unconventional reservoirs. In this study, four machine learning (ML) algorithms of the adaptive neuro-fuzzy inference system (ANFIS), support vector regression (SVR), functional neural networks (FNN), and random forests (RF) were optimized to evaluate the TOC. The novelty of this work is that the optimized models predict the TOC from the bulk gamma-ray (GR) and spectral GR logs of uranium, thorium, and potassium only. The ML algorithms were trained on 749 datasets from Well-1, tested on 226 datasets from Well-2, and validated on 73 data points from Well-3. The predictability of the optimized algorithms was also compared with the available equations. The results of this study indicated that the optimized ANFIS, SVR, and RF models overperformed the available empirical equations in predicting the TOC. For validation data of Well-3, the optimized ANFIS, SVR, and RF algorithms predicted the TOC with AAPE's of 10.6%, 12.0%, and 8.9%, respectively, compared with the AAPE of 21.1% when the FNN model was used. While for the same data, the TOC was assessed with AAPE's of 48.6%, 24.6%, 20.2%, and 17.8% when Schmoker model, ΔlogR method, Zhao et al. correlation, and Mahmoud et al. correlation was used, respectively. The optimized models could be applied to estimate the TOC during the drilling process if the drillstring is provided with GR and spectral GR logging tools.

Download Full-text

Application of Various Machine Learning Techniques in Predicting Total Organic Carbon from Well Logs

Computational Intelligence and Neuroscience ◽

10.1155/2021/7390055 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Osama Siddig ◽

Ahmed Farid Ibrahim ◽

Salaheldin Elkatatny

Keyword(s):

Machine Learning ◽

Organic Carbon ◽

Total Organic Carbon ◽

The Other ◽

Well Logs ◽

Machine Learning Techniques ◽

Percentage Error ◽

Average Error ◽

Support Vector ◽

Empirical Correlations

Unconventional resources have recently gained a lot of attention, and as a consequence, there has been an increase in research interest in predicting total organic carbon (TOC) as a crucial quality indicator. TOC is commonly measured experimentally; however, due to sampling restrictions, obtaining continuous data on TOC is difficult. Therefore, different empirical correlations for TOC have been presented. However, there are concerns about the generalization and accuracy of these correlations. In this paper, different machine learning (ML) techniques were utilized to develop models that predict TOC from well logs, including formation resistivity (FR), spontaneous potential (SP), sonic transit time (Δt), bulk density (RHOB), neutron porosity (CNP), gamma ray (GR), and spectrum logs of thorium (Th), uranium (Ur), and potassium (K). Over 1250 data points from the Devonian Duvernay shale were utilized to create and validate the model. These datasets were obtained from three wells; the first was used to train the models, while the data sets from the other two wells were utilized to test and validate them. Support vector machine (SVM), random forest (RF), and decision tree (DT) were the ML approaches tested, and their predictions were contrasted with three empirical correlations. Various AI methods’ parameters were tested to assure the best possible accuracy in terms of correlation coefficient (R) and average absolute percentage error (AAPE) between the actual and predicted TOC. The three ML methods yielded good matches; however, the RF-based model has the best performance. The RF model was able to predict the TOC for the different datasets with R values range between 0.93 and 0.99 and AAPE values less than 14%. In terms of average error, the ML-based models outperformed the other three empirical correlations. This study shows the capability and robustness of ML models to predict the total organic carbon from readily available logging data without the need for core analysis or additional well interventions.

Download Full-text

Using Machine Learning-Based Algorithms to Analyze Erosion Rates of a Watershed in Northern Taiwan

Sustainability ◽

10.3390/su12052022 ◽

2020 ◽

Vol 12 (5) ◽

pp. 2022 ◽

Cited By ~ 1

Author(s):

Kieu Anh Nguyen ◽

Walter Chen ◽

Bor-Shiun Lin ◽

Uma Seeboonruang

Keyword(s):

Machine Learning ◽

Random Forest ◽

Fuzzy Inference ◽

Absolute Error ◽

Organic Content ◽

Rank Test ◽

Support Vector ◽

Inference System ◽

Erosion Rates ◽

Northern Taiwan

This study continues a previous study with further analysis of watershed-scale erosion pin measurements. Three machine learning (ML) algorithms—Support Vector Machine (SVM), Adaptive Neuro-Fuzzy Inference System (ANFIS), and Artificial Neural Network (ANN)—were used to analyze depth of erosion of a watershed (Shihmen reservoir) in northern Taiwan. In addition to three previously used statistical indexes (Mean Absolute Error, Root Mean Square of Error, and R-squared), Nash–Sutcliffe Efficiency (NSE) was calculated to compare the predictive performances of the three models. To see if there was a statistical difference between the three models, the Wilcoxon signed-rank test was used. The research utilized 14 environmental attributes as the input predictors of the ML algorithms. They are distance to river, distance to road, type of slope, sub-watershed, slope direction, elevation, slope class, rainfall, epoch, lithology, and the amount of organic content, clay, sand, and silt in the soil. Additionally, measurements of a total of 550 erosion pins installed on 55 slopes were used as the target variable of the model prediction. The dataset was divided into a training set (70%) and a testing set (30%) using the stratified random sampling with sub-watershed as the stratification variable. The results showed that the ANFIS model outperforms the other two algorithms in predicting the erosion rates of the study area. The average RMSE of the test data is 2.05 mm/yr for ANFIS, compared to 2.36 mm/yr and 2.61 mm/yr for ANN and SVM, respectively. Finally, the results of this study (ANN, ANFIS, and SVM) were compared with the previous study (Random Forest, Decision Tree, and multiple regression). It was found that Random Forest remains the best predictive model, and ANFIS is the second-best among the six ML algorithms.

Download Full-text

Machine Learning Approaches for Accurate Prediction of Relative Humidity based on Temperature and Wet-Bulb Depression

10.20944/preprints202002.0075.v2 ◽

2020 ◽

Author(s):

Mahdi Ghadiri ◽

Azam Marjani ◽

Samira Mohammadinia ◽

Manouchehr Shokri

Keyword(s):

Relative Humidity ◽

Statistical Learning ◽

Air Conditioning ◽

Fuzzy Inference ◽

Least Square ◽

Support Vector ◽

Cooling Towers ◽

Learning Approaches ◽

Inference System ◽

Data Points

The main parameters for calculation of relative humidity are the wet-bulb depression and dry bulb temperature. In this work, easy-to-used predictive tools based on statistical learning concepts, i.e., the Adaptive Network-Based Fuzzy Inference System (ANFIS) and Least Square Support Vector Machine (LSSVM) are developed for calculating relative humidity in terms of wet bulb depression and dry bulb temperature. To evaluate the aforementioned models, some statistical analyses have been done between the actual and estimated data points. Results obtained from the present models showed their capabilities to calculate relative humidity for divers values of dry bulb temperatures and also wet-bulb depression. The obtained values of MSE and MRE were 0.132 and 0.931, 0.193 and 1.291 for the LSSVM and ANFIS approaches respectively. These developed tools are user-friend and can be of massive value for scientists especially, those dealing with air conditioning and wet cooling towers systems to have a noble check of the relative humidity in terms of wet bulb depression and dry bulb temperatures.

Download Full-text

Prediction of Compressive Strength of Rice Husk Ash Concrete through Different Machine Learning Processes

Crystals ◽

10.3390/cryst11040352 ◽

2021 ◽

Vol 11 (4) ◽

pp. 352

Author(s):

Ammar Iqtidar ◽

Niaz Bahadur Khan ◽

Sardar Kashif-ur-Rehman ◽

Muhmmad Faisal Javed ◽

Fahid Aslam ◽

...

Keyword(s):

Machine Learning ◽

Compressive Strength ◽

Rice Husk ◽

Carbon Dioxide Emissions ◽

Rice Husk Ash ◽

Fuzzy Inference ◽

Correlation Factor ◽

Inference System ◽

Neuro Fuzzy ◽

Data Points

Cement is among the major contributors to the global carbon dioxide emissions. Thus, sustainable alternatives to the conventional cement are essential for producing greener concrete structures. Rice husk ash has shown promising characteristics to be a sustainable option for further research and investigation. Since the experimental work required for assessing its properties is both time consuming and complex, machine learning can be used to successfully predict the properties of concrete containing rice husk ash. A total of 192 data points are used in this study to assess the compressive strength of rice husk ash blended concrete. Input parameters include age, amount of cement, rice husk ash, super plasticizer, water, and aggregates. Four soft computing and machine learning methods, i.e., artificial neural networks (ANN), adaptive neuro-fuzzy inference system (ANFIS), multiple nonlinear regression (NLR), and linear regression are employed in this research. Sensitivity analysis, parametric analysis, and correlation factor (R2) are used to evaluate the obtained results. The ANN and ANFIS outperformed other methods.

Download Full-text

Prediction of drug synergy in cancer using ensemble-based machine learning techniques

Modern Physics Letters B ◽

10.1142/s0217984918501324 ◽

2018 ◽

Vol 32 (11) ◽

pp. 1850132 ◽

Cited By ~ 9

Author(s):

Harpreet Singh ◽

Prashant Singh Rana ◽

Urvinder Singh

Keyword(s):

Machine Learning ◽

Fuzzy Inference System ◽

Fuzzy Inference ◽

Machine Learning Techniques ◽

Support Vector ◽

Prediction Errors ◽

Learning Approaches ◽

Inference System ◽

Drug Synergy ◽

Learning Techniques

Drug synergy prediction plays a significant role in the medical field for inhibiting specific cancer agents. It can be developed as a pre-processing tool for therapeutic successes. Examination of different drug–drug interaction can be done by drug synergy score. It needs efficient regression-based machine learning approaches to minimize the prediction errors. Numerous machine learning techniques such as neural networks, support vector machines, random forests, LASSO, Elastic Nets, etc., have been used in the past to realize requirement as mentioned above. However, these techniques individually do not provide significant accuracy in drug synergy score. Therefore, the primary objective of this paper is to design a neuro-fuzzy-based ensembling approach. To achieve this, nine well-known machine learning techniques have been implemented by considering the drug synergy data. Based on the accuracy of each model, four techniques with high accuracy are selected to develop ensemble-based machine learning model. These models are Random forest, Fuzzy Rules Using Genetic Cooperative-Competitive Learning method (GFS.GCCL), Adaptive-Network-Based Fuzzy Inference System (ANFIS) and Dynamic Evolving Neural-Fuzzy Inference System method (DENFIS). Ensembling is achieved by evaluating the biased weighted aggregation (i.e. adding more weights to the model with a higher prediction score) of predicted data by selected models. The proposed and existing machine learning techniques have been evaluated on drug synergy score data. The comparative analysis reveals that the proposed method outperforms others in terms of accuracy, root mean square error and coefficient of correlation.

Download Full-text

Novel Empirical Correlation for Estimation of the Total Organic Carbon in Devonian Shale from the Spectral Gamma-Ray and Based on the Artificial Neural Networks

Journal of Energy Resources Technology ◽

10.1115/1.4050777 ◽

2021 ◽

pp. 1-28

Author(s):

Ahmed Abdulhamid Mahmoud ◽

Salaheldin Elkatatny

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Organic Carbon ◽

Total Organic Carbon ◽

Gamma Ray ◽

Linear Regression Analysis ◽

Empirical Correlation ◽

Training Data ◽

Validation Data ◽

Artificial Neural

Abstract Evaluation of the quality of unconventional hydrocarbon resources becomes a critical stage toward characterizing these resources, this evaluation requires evaluation of the total organic carbon (TOC). Generally, TOC is determined from laboratory experiments, however, it is hard to obtain a continuous profile for the TOC along the drilled formations using these experiments. Another way to evaluate the TOC is through the use of empirical correlation, the currently available correlations lack the accuracy especially when used in formations other than the ones used to develop these correlations. This study introduces an empirical equation for evaluation of the TOC in Devonian Duvernay shale from only gamma-ray and spectral gamma-ray logs of uranium, thorium, and potassium as well as a newly developed term that accounts for the TOC from the linear regression analysis. This new correlation was developed based on the artificial neural networks (ANN) algorithm which was learned on 750 datasets from Well-A. The developed correlation was tested and validated on 226 and 73 datasets from Well-B and Well-C, respectively. The results of this study indicated that for the training data, the TOC was predicted by the ANN with an AAPE of only 8.5%. Using the developed equation, the TOC was predicted with an AAPE of only 11.5% for the testing data. For the validation data, the developed equation overperformed the previous models in estimating the TOC with an AAPE of only 11.9%.

Download Full-text

Real-Time Prediction of Rate of Penetration in S-Shape Well Profile Using Artificial Intelligence Models

Sensors ◽

10.3390/s20123506 ◽

2020 ◽

Vol 20 (12) ◽

pp. 3506 ◽

Cited By ~ 1

Author(s):

Salaheldin Elkatatny

Keyword(s):

Artificial Intelligence ◽

Real Time ◽

Support Vector ◽

Empirical Correlations ◽

Validation Data ◽

Inference System ◽

Rate Of Penetration ◽

Drilling Parameters ◽

Svm Model ◽

Data Points

Rate of penetration (ROP) is defined as the amount of removed rock per unit area per unit time. It is affected by several factors which are inseparable. Current established models for determining the ROP include the basic mathematical and physics equations, as well as the use of empirical correlations. Given the complexity of the drilling process, the use of artificial intelligence (AI) has been a game changer because most of the unknown parameters can now be accounted for entirely at the modeling process. The objective of this paper is to evaluate the ability of the optimized adaptive neuro-fuzzy inference system (ANFIS), functional neural networks (FN), random forests (RF), and support vector machine (SVM) models to predict the ROP in real time from the drilling parameters in the S-shape well profile, for the first time, based on the drilling parameters of weight on bit (WOB), drillstring rotation (DSR), torque (T), pumping rate (GPM), and standpipe pressure (SPP). Data from two wells were used for training and testing (Well A and Well B with 4012 and 1717 data points, respectively), and one well for validation (Well C) with 2500 data points. Well A and Well B data were combined in the training-testing phase and were randomly divided into a 70:30 ratio for training/testing. The results showed that the ANFIS, FN, and RF models could effectively predict the ROP from the drilling parameters in the S-shape well profile, while the accuracy of the SVM model was very low. The ANFIS, FN, and RF models predicted the ROP for the training data with average absolute percentage errors (AAPEs) of 9.50%, 13.44%, and 3.25%, respectively. For the testing data, the ANFIS, FN, and RF models predicted the ROP with AAPEs of 9.57%, 11.20%, and 8.37%, respectively. The ANFIS, FN, and RF models overperformed the available empirical correlations for ROP prediction. The ANFIS model estimated the ROP for the validation data with an AAPE of 9.06%, whereas the FN model predicted the ROP with an AAPE of 10.48%, and the RF model predicted the ROP with an AAPE of 10.43%. The SVM model predicted the ROP for the validation data with a very high AAPE of 30.05% and all empirical correlations predicted the ROP with AAPEs greater than 25%.

Download Full-text

Evaluation of the Total Organic Carbon (TOC) Using Different Artificial Intelligence Techniques

Sustainability ◽

10.3390/su11205643 ◽

2019 ◽

Vol 11 (20) ◽

pp. 5643 ◽

Cited By ~ 5

Author(s):

Ahmed Abdulhamid Mahmoud ◽

Salaheldin Elkatatny ◽

Abdulwahab Z. Ali ◽

Mohamed Abouelresh ◽

Abdulazeez Abdulraheem

Keyword(s):

Artificial Intelligence ◽

Organic Carbon ◽

Total Organic Carbon ◽

Well Logs ◽

Support Vector ◽

Validation Data ◽

Fuzzy Interference System ◽

Interference System ◽

Unseen Data ◽

Devonian Shale

Total organic carbon (TOC) is an essential parameter used in unconventional shale resources evaluation. Current methods that are used for TOC estimation are based, either on conducting time-consuming laboratory experiments, or on using empirical correlations developed for specific formations. In this study, four artificial intelligence (AI) models were developed to estimate the TOC using conventional well logs of deep resistivity, gamma-ray, sonic transit time, and bulk density. These models were developed based on the Takagi-Sugeno-Kang fuzzy interference system (TSK-FIS), Mamdani fuzzy interference system (M-FIS), functional neural network (FNN), and support vector machine (SVM). Over 800 data points of the conventional well logs and core data collected from Barnett shale were used to train and test the AI models. The optimized AI models were validated using unseen data from Devonian shale. The developed AI models showed accurate predictability of TOC in both Barnett and Devonian shale. FNN model overperformed others in estimating TOC for the validation data with average absolute percentage error (AAPE) and correlation coefficient (R) of 12.02%, and 0.879, respectively, followed by M-FIS and SVM, while TSK-FIS model showed the lowest predictability of TOC, with AAPE of 15.62% and R of 0.832. All AI models overperformed Wang models, which have recently developed to evaluate the TOC for Devonian formation.

Download Full-text

Improving signal detection accuracy at FC of a CRN using machine learning and fuzzy rules

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v21.i2.pp1140-1150 ◽

2021 ◽

Vol 21 (2) ◽

pp. 1140

Author(s):

Md Abul Kalam Azad ◽

Anup Majumder ◽

Jugal Krishna Das ◽

Md Imdadul Islam

Keyword(s):

Machine Learning ◽

Signal Detection ◽

Fuzzy Inference ◽

Fuzzy Rule ◽

Energy Detection ◽

Decision Time ◽

Machine Learning Techniques ◽

Support Vector ◽

Detection Accuracy ◽

Inference System

<span>The performance of a cognitive radio network (CRN) mainly depends on the faithful signal detection at fusion center (FC). In this paper, the concept of weighted Fuzzy rule in Iris data classification, as well as, four machine learning techniques named fuzzy inference system (FIS), fuzzy <em>c</em>-means clustering (FCMC), support vector machine (SVM) and convolutional neural network (CNN) are applied in signal detection at FC taking signal-to-interference plus noise ratio of secondary users as parameter. The weighted Fuzzy rule gave the detection accuracy of 86.6%, which resembles the energy detection model of majority rule of FC; however, CNN gave an accuracy of 91.3% at the expense of more decision time. The FIS, FCMC and SVM gave some intermediate results; however, the combined method gave the best result compared to that of any individual technique.</span>

Download Full-text

Total Organic Carbon Content Prediction in Lacustrine Shale Using Extreme Gradient Boosting Machine Learning Based on Bayesian Optimization

Geofluids ◽

10.1155/2021/6155663 ◽

2021 ◽

Vol 2021 ◽

pp. 1-18

Author(s):

Xingzhou Liu ◽

Zhi Tian ◽

Chang Chen

Keyword(s):

Machine Learning ◽

Organic Carbon ◽

Total Organic Carbon ◽

Bayesian Optimization ◽

Gradient Boosting ◽

Support Vector ◽

Shale Oil ◽

The Core ◽

Extreme Gradient Boosting ◽

Lacustrine Shale

The total organic carbon (TOC) content is a critical parameter for estimating shale oil resources. However, common TOC prediction methods rely on empirical formulas, and their applicability varies widely from region to region. In this study, a novel data-driven Bayesian optimization extreme gradient boosting (XGBoost) model was proposed to predict the TOC content using wireline log data. The lacustrine shale in the Damintun Sag, Bohai Bay Basin, China, was used as a case study. Firstly, correlation analysis was used to analyze the relationship between the well logs and the core-measured TOC data. Based on the degree of correlation, six logging curves reflecting TOC content were selected to construct training dataset for machine learning. Then, the performance of the XGBoost model was tested using K -fold cross-validation, and the hyperparameters of the model were determined using a Bayesian optimization method to improve the search efficiency and reduce the uncertainty caused by the rule of thumb. Next, through the analysis of prediction errors, the coefficient of determination ( R 2 ) of the TOC content predicted by the XGBoost model and the core-measured TOC content reached 0.9135. The root mean square error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE) were 0.63, 0.77, and 12.55%, respectively. In addition, five commonly used methods, namely, Δ log R method, random forest, support vector machine, K -nearest neighbors, and multiple linear regression, were used to predict the TOC content to confirm that the XGBoost model has higher prediction accuracy and better robustness. Finally, the proposed approach was applied to predict the TOC curves of 20 exploration wells in the Damintun Sag. We obtained quantitative contour maps of the TOC content of this block for the first time. The results of this study facilitate the rapid detection of the sweet spots of the lacustrine shale oil.

Download Full-text