scholarly journals Determination of the quantitative content of chlorophylls in leaves by reflection spectra using the random forest algorithm

2021 ◽  
Vol 25 (1) ◽  
pp. 64-70
Author(s):  
E. A. Urbanovich ◽  
D. A. Afonnikov ◽  
S. V. Nikolaev
2020 ◽  
Vol 222 (2) ◽  
pp. 978-988
Author(s):  
Yury Meshalkin ◽  
Anuar Shakirov ◽  
Evgeniy Popov ◽  
Dmitry Koroteev ◽  
Irina Gurbatova

SUMMARY Rock thermal conductivity is an essential input parameter for enhanced oil recovery methods design and optimization and for basin and petroleum system modelling. Absence of any effective technique for direct in situ measurements of rock thermal conductivity makes the development of well-log based methods for rock thermal conductivity determination highly desirable. A major part of the existing problem solutions is regression model-based approaches. Literature review revealed that there are only several studies performed to assess the applicability of neural network-based algorithms to predict rock thermal conductivity from well-logging data. In this research, we aim to define the most effective machine-learning algorithms for well-log based determination of rock thermal conductivity. Well-logging data acquired at a heavy oil reservoir together with results of thermal logging on cores extracted from two wells were the basis for our research. Eight different regression models were developed and tested to predict vertical variations of rock conductivity from well-logging data. Additionally, rock thermal conductivity was determined based on Lichtenecker–Asaad model. Comparison study of regression-based and theoretical-based approaches was performed. Among considered machine learning techniques Random Forest algorithm was found to be the most accurate at well-log based determination of rock thermal conductivity. From a comparison of the thermal conductivity—depth profile predicted from well-logging data with the experimental data, and it can be concluded that thermal conductivity can be determined with a total relative error of 12.54 per cent. The obtained results prove that rock thermal conductivity can be inferred from well-logging data for wells that are drilled in a similar geological setting based on the Random Forest algorithm with an accuracy sufficient for industrial needs.


Author(s):  
Yessi Yunitasari ◽  
Aina Musdholifah ◽  
Anny Kartika Sari

Twitter is one of the social medias that are widely used at the moment. Tweet conversations can be classified according to their sentiments. The existence of sarcasm contained in a tweet sometimes causes incorrect determination of the tweet’s sentiment because sarcasm is difficult to analyze automatically, even by humans. Hence, sarcasm detection needs to be conducted, which is expected to improve the results of sentiment analysis. The effect of sarcasm detection on sentiment analysis can be seen in terms of accuracy, precision and recall. In this paper, detection of sarcasm is applied to Indonesian tweets. The feature extraction of sarcasm detection uses unigram and 4 Boazizi feature sets which consist of sentiment-relate features, punctuation-relate features, lexical and syntactic features, and top word features. Detection of sarcasm uses the Random Forest algorithm. The feature extraction of sentiment analysis uses TF-IDF, while the classification uses Naïve Bayes algorithm. The evaluation shows that sentiment analysis with sarcasm detection improves the  accuracy of sentiment analysis about 5.49%. The accuracy of the model is 80.4%, while the precision is 83.2%, and the recall is 91.3%.


2021 ◽  
Vol 5 (2) ◽  
pp. 369-378
Author(s):  
Eka Pandu Cynthia ◽  
M. Afif Rizky A. ◽  
Alwis Nazir ◽  
Fadhilah Syafria

This paper explains the use of the Random Forest Algorithm to investigate the Case of Acute Coronary Syndrome (ACS). The objectives of this study are to review the evaluation of the use of data science techniques and machine learning algorithms in creating a model that can classify whether or not cases of acute coronary syndrome occur. The research method used in this study refers to the IBM Foundational Methodology for Data Science, include: i) inventorying dataset about ACS, ii) preprocessing for the data into four sub-processes, i.e. requirements, collection, understanding, and preparation, iii) determination of RFA, i.e. the "n" of the tree which will form a forest and forming trees from the random forest that has been created, and iv) determination of the model evaluation and result in analysis based on Python programming language. Based on the experiments that the learning have been conducted using a random forest machine-learning algorithm with an n-estimator value of 100 and each tree's depth (max depth) with a value of 4, learning scenarios of 70:30, 80:20, and 90:10 on 444 cases of acute coronary syndrome data. The results show that the 70:30 scenario model has the best results, with an accuracy value of 83.45%, a precision value of 85%, and a recall value of 92.4%. Conclusions obtained from the experiment results were evaluated with various statistical metrics (accuracy, precision, and recall) in each learning scenario on 444 cases of acute coronary syndrome data with a cross-validation value of 10 fold.


Author(s):  
A.E. Semenov

The method of pedestrian navigation in the cities illustrated by the example of Saint-Petersburg was investigated. The factors influencing people when they choose a route for their walk were determined. Based on acquired factors corresponding data was collected and used to develop model determining attractiveness of a street in the city using Random Forest algorithm. The results obtained shows that routes provided by the method are 14% more attractive and just 6% longer compared with the shortest ones.


Sign in / Sign up

Export Citation Format

Share Document