scholarly journals Classification Based on Decision Tree Algorithm for Machine Learning

2021 ◽  
Vol 2 (01) ◽  
pp. 20-28
Author(s):  
Bahzad Charbuty ◽  
Adnan Abdulazeez

Decision tree classifiers are regarded to be a standout of the most well-known methods to data classification representation of classifiers. Different researchers from various fields and backgrounds have considered the problem of extending a decision tree from available data, such as machine study, pattern recognition, and statistics. In various fields such as medical disease analysis, text classification, user smartphone classification, images, and many more the employment of Decision tree classifiers has been proposed in many ways. This paper provides a detailed approach to the decision trees. Furthermore, paper specifics, such as algorithms/approaches used, datasets, and outcomes achieved, are evaluated and outlined comprehensively. In addition, all of the approaches analyzed were discussed to illustrate the themes of the authors and identify the most accurate classifiers. As a result, the uses of different types of datasets are discussed and their findings are analyzed.

Author(s):  
Giuseppe Nuti ◽  
Lluís Antoni Jiménez Rugama ◽  
Andreea-Ingrid Cross

Bayesian Decision Trees provide a probabilistic framework that reduces the instability of Decision Trees while maintaining their explainability. While Markov Chain Monte Carlo methods are typically used to construct Bayesian Decision Trees, here we provide a deterministic Bayesian Decision Tree algorithm that eliminates the sampling and does not require a pruning step. This algorithm generates the greedy-modal tree (GMT) which is applicable to both regression and classification problems. We tested the algorithm on various benchmark classification data sets and obtained similar accuracies to other known techniques. Furthermore, we show that we can statistically analyze how was the GMT derived from the data and demonstrate this analysis with a financial example. Notably, the GMT allows for a technique that provides explainable simpler models which is often a prerequisite for applications in finance or the medical industry.


Author(s):  
Mr. G Ragu

Abstract: Recently the methods of Data mining and machine learning are widely used in medical field. These methods/techniques have given better results in the prediction of respective diseases. Hepatitis B is a Liver inflammation; it can affect people of all age groups. Lakhs of people across the globe are thought to be affected by Hepatitis B. Early prediction of Hepatitis B with accurate results can save many people. Hepatitis B is a tough challenge for public health care system because of limited clinical diagnosis in the early stages of disease. This paper presents the decision tree algorithm to diagnose the Hepatitis B. The proposed algorithm includes collection of datasets, pre-processing, EDA (Exploratory Data Analysis), Feature Selection, data visualizing, Interpreting, saving and evaluating the model. After the data visualization process decision tree algorithm is implemented to diagnose the disease along with the patient chances of living. Keywords: Hepatitis B virus, Machine Learning, Decision Tree, Public Health, EDA


2020 ◽  
Author(s):  
Juan Luis Delgado-Gallegos ◽  
Gener Avilés-Rodriguez ◽  
Gerardo R. Padilla-Rivas ◽  
María De los Ángeles Cosio-León ◽  
Héctor Franco-Villareal ◽  
...  

AbstractStress and anxiety have shown to be indirect effects of the COVID-19 pandemic, therefore managing stress becomes essential. One of the most affected populations by the pandemic are healthcare professionals. Thus, it is paramount to understand and categorize their perceived levels of stress, as it can be a detonating factor leading to mental illness. In our study, we used a machine learning prediction model to help measure perceived stress; a C5.0 decision tree algorithm was used to analyze and classify datasets obtained from healthcare professionals of the northeast region of Mexico. Our analysis showed that 6 out of 102 instances were incorrectly classified. Missing two cases for mild, three for moderate and 1 for severe (accuracy of 94.1%), statistical correlation analysis was performed to ensure integrity of the method, in addition we concluded that severe stress cases can be related mostly to high levels of Xenophobia and Compulsive stress.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Hamidreza Abbasianjahromi ◽  
Mehdi Aghakarimi

PurposeUnsafe behavior accounts for a major part of high accident rates in construction projects. The awareness of unsafe circumstances can help modify unsafe behaviors. To improve awareness in project teams, the present study proposes a framework for predicting safety performance before the implementation of projects.Design/methodology/approachThe machine learning approach was adopted in this work. The proposed framework consists of two major phases: (1) data collection and (2) model development. The first phase involved several steps, including the identification of safety performance criteria, using a questionnaire to collect data, and converting the data into useful information. The second phase, on the other hand, included the use of the decision tree algorithm coupled with the k-Nearest Neighbors algorithm as the predictive tool along with the proposing modification strategies.FindingsA total of nine safety performance criteria were identified. The results showed that safety employees, training, rule adherence and management commitment were key criteria for safety performance prediction. It was also found that the decision tree algorithm is capable of predicting safety performance.Originality/valueThe main novelty of the present study is developing an integrated model to propose strategies for the safety enhancement of projects in the case of incorrect predictions.


As a wrongdoing of utilizing specialized intends to take sensitive data of clients and users in the internet, phishing is as of now an advanced risk confronting the Internet, and misfortunes due to phishing are developing consistently. Recognition of these phishing scams is a very testing issue on the grounds that phishing is predominantly a semantics based assault, which particularly manhandles human vulnerabilities, anyway not system or framework vulnerabilities. Phishing costs. As a product discovery plot, two primary methodologies are generally utilized: blacklists/whitelists and machine learning approaches. Every phishing technique has different parameters and type of attack. Using decision tree algorithm we find out whether the attack is legitimate or a scam. We measure this by grouping them with diverse parameters and features, thereby assisting the machine learning algorithm to edify.


Heart disease is a common problem which can be very severe in old ages and also in people not having a healthy lifestyle. With regular check-up and diagnosis in addition to maintaining a decent eating habit can prevent it to some extent. In this paper we have tried to implement the most sought after and important machine learning algorithm to predict the heart disease in a patient. The decision tree classifier is implemented based on the symptoms which are specifically the attributes required for the purpose of prediction. Using the decision tree algorithm, we will be able to identify those attributes which are the best one that will lead us to a better prediction of the datasets. The decision tree algorithm works in a way where it tries to solve the problem by the help of tree representation. Here each internal node of the tree represents an attribute, and each leaf node corresponds to a class label. The support vector machine algorithm helps us to classify the datasets on the basis of kernel and it also groups the dataset using hyperplane. The main objective of this project is to try and reduce the number of occurrences of the heart diseases in patients


Sign in / Sign up

Export Citation Format

Share Document