Prediction of Reservoir Saturation Pressure and Reservoir Type in a Niger Delta Field using Supervised Machine Learning ML Algorithms

2020 ◽  
Author(s):  
Stella I. Eyitayo ◽  
Jamiu M. Ekundayo ◽  
Eyitayo O. Mumuney
2021 ◽  
Author(s):  
Robert Shelley ◽  
Oladapo Oduba ◽  
Howard Melcher

Abstract The subject of this paper is the application of a unique machine learning approach to the evaluation of Wolfcamp B completions. A database consisting of Reservoir, Completion, Frac and Production information from 301 Multi-Fractured Horizontal Wolfcamp B Completions was assembled. These completions were from a 10-County area located in the Texas portion of the Permian Basin. Within this database there is a wide variation in completion design from many operators; lateral lengths ranging from a low of about 4,000 ft to a high of almost 15,000 ft, proppant intensities from 500 to 4,000 lb/ft and frac stage spacing from 59 to 769 ft. Two independent self-organizing data mappings (SOM) were performed; the first on completion and frac stage parameters, the second on reservoir and geology. Characteristics for wells assigned to each SOM bin were determined. These two mappings were then combined into a reservoir type vs completion type matrix. This type of approach is intended to remove systematic errors in measurement, bias and inconsistencies in the database so that more realistic assessments about well performance can be made. Production for completion and reservoir type combinations was determined. As a final step, a feed forward neural network (ANN) model was developed from the mapped data. This model was used to estimate Wolfcamp B production and economics for completion and frac designs. In the performance of this project, it became apparent that the incorporation of reservoir data was essential to understanding the impact of completion and frac design on multi-fractured horizontal Wolfcamp B well production and economic performance. As we would expect, wells with the most permeability, higher pore pressure, effective porosity and lower water saturation have the greatest potential for hydrocarbon production.
The most effective completion types have an optimum combination of proppant intensity, fluid intensity, treatment rate, frac stage spacing and perforation clustering. This paper will be of interest to anyone optimizing hydraulically fractured Wolfcamp B completion design or evaluating Permian Basin prospects. Also of interest is the impact of reservoir and completion characteristics such as permeability, porosity, water saturation, pressure, offset well production, proppant intensity, fluid intensity, frac stage spacing and lateral length on well production and economics. The methodology used to evaluate the impact of reservoir and completion parameters for this Wolfcamp project is unique and novel. In addition, compared to other methodologies, it is low cost and fast. Although the focus of this paper is on the Wolfcamp B Formation in the Midland Basin, this approach and workflow can be applied to any formation in any basin, provided sufficient data is available.
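
The step of combining the two mappings into a reservoir type vs completion type matrix can be pictured with a minimal Python sketch. It assumes each well has already been assigned a reservoir bin and a completion bin; the bin labels, the production figures and the function name `production_matrix` are hypothetical and are not taken from the paper.

```python
from collections import defaultdict

def production_matrix(wells):
    """Average production for each (reservoir bin, completion bin) pair.

    `wells` is an iterable of (reservoir_bin, completion_bin, production)
    tuples. The bin assignments are assumed to come from two earlier,
    independent mappings; this sketch does not perform the mapping itself.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for reservoir_bin, completion_bin, production in wells:
        key = (reservoir_bin, completion_bin)
        totals[key] += production
        counts[key] += 1
    return {key: totals[key] / counts[key] for key in totals}

# Hypothetical wells: (reservoir type, completion type, production)
wells = [
    ("R1", "C1", 100.0),
    ("R1", "C1", 120.0),
    ("R1", "C2", 150.0),
    ("R2", "C1", 80.0),
]
matrix = production_matrix(wells)
for (reservoir_bin, completion_bin), mean_production in sorted(matrix.items()):
    print(reservoir_bin, completion_bin, mean_production)
```

Reading along one reservoir type shows how production changes with completion type while the reservoir character is held roughly constant, which is the comparison the abstract describes.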


2020 ◽  
Vol 14 (2) ◽  
pp. 140-159
Author(s):  
Anthony-Paul Cooper ◽  
Emmanuel Awuni Kolog ◽  
Erkki Sutinen

This article builds on previous research around the exploration of the content of church-related tweets. It does so by exploring whether the qualitative thematic coding of such tweets can, in part, be automated by the use of machine learning. It compares three supervised machine learning algorithms to understand how useful each algorithm is at a classification task, based on a dataset of human-coded church-related tweets. The study finds that one such algorithm, Naïve-Bayes, performs better than the other algorithms considered, returning Precision, Recall and F-measure values which each exceed an acceptable threshold of 70%. This has far-reaching consequences at a time when the high volume of social media data, in this case Twitter data, means that the resource-intensity of manual coding approaches can act as a barrier to understanding how the online community interacts with, and talks about, church. The findings presented in this article offer a way forward for scholars of digital theology to better understand the content of online church discourse.
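
For readers unfamiliar with the three reported metrics, the following Python sketch shows how Precision, Recall and F-measure are computed for a single theme from human-coded and machine-predicted labels. The theme names and the five example labels are invented for illustration and do not come from the study.

```python
def precision_recall_f(true_labels, predicted_labels, positive):
    """Precision, recall and F-measure (F1) for one class label."""
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure

# Hypothetical human codes versus classifier output for five tweets
human_codes = ["prayer", "prayer", "event", "event", "prayer"]
model_codes = ["prayer", "event", "event", "prayer", "prayer"]

precision, recall, f_measure = precision_recall_f(human_codes, model_codes, "prayer")
meets_threshold = all(v > 0.70 for v in (precision, recall, f_measure))
print(precision, recall, f_measure, meets_threshold)
```

In this made-up example each metric is about 0.67, so the theme would fall just short of the 70% threshold mentioned in the abstract.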


2017 ◽  
Author(s):  
Sabrina Jaeger ◽  
Simone Fulle ◽  
Samo Turk

Inspired by natural language processing techniques, we here introduce Mol2vec, which is an unsupervised machine learning approach to learn vector representations of molecular substructures. Similarly to the Word2vec models, where vectors of closely related words are in close proximity in the vector space, Mol2vec learns vector representations of molecular substructures that are pointing in similar directions for chemically related substructures. Compounds can finally be encoded as vectors by summing up vectors of the individual substructures and, for instance, fed into supervised machine learning approaches to predict compound properties. The underlying substructure vector embeddings are obtained by training an unsupervised machine learning approach on a so-called corpus of compounds that consists of all available chemical matter. The resulting Mol2vec model is pre-trained once, yields dense vector representations and overcomes drawbacks of common compound feature representations such as sparseness and bit collisions. The prediction capabilities are demonstrated on several compound property and bioactivity data sets and compared with results obtained for Morgan fingerprints as reference compound representation. Mol2vec can be easily combined with ProtVec, which employs the same Word2vec concept on protein sequences, resulting in a proteochemometric approach that is alignment independent and can thus also be easily used for proteins with low sequence similarities.
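
The idea of encoding a compound by summing the vectors of its substructures can be sketched in a few lines of Python. This is only an illustration of the summation step: the substructure identifiers, the three-dimensional embeddings and the function `encode_compound` are hypothetical, it does not use the Mol2vec package, and skipping unknown substructures is an assumption of this sketch.

```python
def encode_compound(substructures, embeddings, dim):
    """Sum the embedding vectors of a compound's substructures.

    `embeddings` maps a substructure identifier to a list of floats.
    Unknown identifiers are skipped here (an assumption of this sketch).
    """
    vector = [0.0] * dim
    for identifier in substructures:
        embedding = embeddings.get(identifier)
        if embedding is None:
            continue
        for i, value in enumerate(embedding):
            vector[i] += value
    return vector

# Hypothetical pre-trained substructure embeddings (3 dimensions)
embeddings = {
    "sub_a": [1.0, 0.0, 2.0],
    "sub_b": [0.5, 1.0, -1.0],
}

# A compound containing sub_a twice and sub_b once
compound_vector = encode_compound(["sub_a", "sub_b", "sub_a"], embeddings, dim=3)
print(compound_vector)
```

The resulting fixed-length vector is what would then be passed as a feature row to a supervised model.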


2020 ◽  
Vol 28 (2) ◽  
pp. 253-265 ◽  
Author(s):  
Gabriela Bitencourt-Ferreira ◽  
Amauri Duarte da Silva ◽  
Walter Filgueira de Azevedo

Background: The elucidation of the structure of cyclin-dependent kinase 2 (CDK2) made it possible to develop targeted scoring functions for virtual screening aimed at identifying new inhibitors for this enzyme. CDK2 is a protein target for the development of drugs intended to modulate cell-cycle progression and control. Such drugs have potential anticancer activities. Objective: Our goal here is to review recent applications of machine learning methods to predict ligand-binding affinity for protein targets. To assess the predictive performance of classical scoring functions and targeted scoring functions, we focused our analysis on CDK2 structures. Methods: We have experimental structural data for hundreds of binary complexes of CDK2 with different ligands, many of them with inhibition constant information. We investigate here computational methods to calculate the binding affinity of CDK2 through classical scoring functions and machine-learning models. Results: Analysis of the predictive performance of classical scoring functions available in docking programs such as Molegro Virtual Docker, AutoDock4, and AutoDock Vina indicated that these methods failed to predict binding affinity with significant correlation with experimental data. Targeted scoring functions developed through supervised machine learning techniques showed a significant correlation with experimental data. Conclusion: Here, we described the application of supervised machine learning techniques to generate a scoring function to predict binding affinity. Machine learning models showed superior predictive performance when compared with classical scoring functions. Analysis of the computational models obtained through machine learning could capture essential structural features responsible for binding affinity against CDK2.
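
Correlation between predicted and experimental binding affinity is the yardstick used in the Results above. The abstract does not say which correlation coefficient was used; the Python sketch below assumes a Pearson coefficient, and the affinity values are made up purely to show the calculation.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical predicted scores and experimental affinities for four ligands
predicted = [1.0, 2.0, 3.0, 4.0]
experimental = [2.0, 4.0, 6.0, 8.0]

r = pearson_r(predicted, experimental)
print(r)
```

A scoring function whose predictions track the experimental values closely gives a coefficient near 1, while one with no predictive value gives a coefficient near 0.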


2019 ◽  
Vol 23 (1) ◽  
pp. 12-21 ◽  
Author(s):  
Shikha N. Khera ◽  
Divya

The information technology (IT) industry in India has been facing a systemic issue of high attrition in the past few years, resulting in monetary and knowledge-based losses to the companies. The aim of this research is to develop a model to predict employee attrition and provide organizations with opportunities to address any issue and improve retention. A predictive model was developed based on a supervised machine learning algorithm, the support vector machine (SVM). Archival employee data (consisting of 22 input features) were collected from Human Resource databases of three IT companies in India, including their employment status (response variable) at the time of collection. Accuracy results from the confusion matrix for the SVM model showed that the model has an accuracy of 85 per cent. Also, results show that the model performs better in predicting who will leave the firm as compared to predicting who will not leave the company.
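
The relationship between overall accuracy and per-class performance in a confusion matrix can be shown with a short Python sketch. The four counts below are invented so that the arithmetic is easy to follow; they are not the study's data, and the function name `summarize_confusion` is hypothetical.

```python
def summarize_confusion(tp, fn, fp, tn):
    """Accuracy and per-class recall from a 2x2 confusion matrix.

    tp: leavers predicted to leave     fn: leavers predicted to stay
    fp: stayers predicted to leave     tn: stayers predicted to stay
    """
    total = tp + fn + fp + tn
    accuracy = (tp + tn) / total
    recall_leavers = tp / (tp + fn)
    recall_stayers = tn / (tn + fp)
    return accuracy, recall_leavers, recall_stayers

# Hypothetical counts for 100 employees (50 leavers, 50 stayers)
accuracy, recall_leavers, recall_stayers = summarize_confusion(tp=45, fn=5, fp=10, tn=40)
print(accuracy, recall_leavers, recall_stayers)
```

With these illustrative counts the overall accuracy is 0.85 while the model identifies leavers (0.9) more reliably than stayers (0.8), the same kind of asymmetry the abstract reports.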

