An Initial Screening Method for Tuberculosis Diseases Using a Multi-objective Gradient Evolution-Based Support Vector Machine and C5.0 Decision Tree

Predicting preeclampsia and related risk factors using data mining approaches: A cross-sectional study

International Journal of Reproductive BioMedicine ◽

10.18502/ijrm.v19i11.9911 ◽

2021 ◽

Author(s):

Zohreh Manoochehri ◽

Sara Manoochehri ◽

Farzaneh Soltani ◽

Majid Sadeghifar

Keyword(s):

Risk Factors ◽

Data Mining ◽

Support Vector Machine ◽

Logistic Regression ◽

Random Forest ◽

Decision Tree ◽

Cross Sectional Study ◽

Support Vector ◽

Cross Sectional ◽

C5.0 Decision Tree

Background: Preeclampsia is a type of pregnancy hypertension disorder that has adverse effects on both the mother and the fetus. Despite recent advances in the etiology of preeclampsia, no adequate clinical screening tests have been identified to diagnose the disorder. Objective: We aimed to provide a model based on data mining approaches that can be used as a screening tool to identify patients with this syndrome and also to identify the risk factors associated with it. Materials and Methods: The data used to perform this cross-sectional study were extracted from the clinical records of 726 mothers with preeclampsia and 726 mothers without preeclampsia who were referred to Fatemieh Hospital in Hamadan City during April 2005–March 2015. In this study, six data mining methods were adopted, including logistic regression, k-nearest neighborhood, C5.0 decision tree, discriminant analysis, random forest, and support vector machine, and their performance was compared using the criteria of accuracy, sensitivity, and specificity. Results: Underlying condition, age, pregnancy season and the number of pregnancies were the most important risk factors for diagnosing preeclampsia. The accuracy of the models were as follows: logistic regression (0.713), k-nearest neighborhood (0.742), C5.0 decision tree (0.788), discriminant analysis (0.687), random forest (0.758) and support vector machine (0.791). Conclusion: Among the data mining methods employed in this study, support vector machine was the most accurate in predicting preeclampsia. Therefore, this model can be considered as a screening tool to diagnose this disorder. Key words: Preeclampsia, Random forest, C5.0 decision tree, Support vector machine, Logistic regression.

Download Full-text

Landslide Susceptibility Zoning Using C5.0 Decision Tree, Random Forest, Support Vector Machine and Comparison of Their Performance in a Coal Mine Area

Frontiers in Earth Science ◽

10.3389/feart.2021.781472 ◽

2021 ◽

Vol 9 ◽

Author(s):

Qiaomei Su ◽

Weiheng Tao ◽

Shiguang Mei ◽

Xiaoyuan Zhang ◽

Kaixin Li ◽

...

Keyword(s):

Support Vector Machine ◽

Random Forest ◽

Decision Tree ◽

Coal Mine ◽

Landslide Susceptibility ◽

Shanxi Province ◽

Support Vector ◽

Mine Area ◽

Ground Collapse ◽

C5.0 Decision Tree

The main purpose of this study is to establish an effective landslide susceptibility zoning model and test whether underground mined areas and ground collapse in coal mine areas seriously affect the occurrence of landslides. Taking the Fenxi Coal Mine Area of Shanxi Province in China as the research area, landslide data has been investigated by the Shanxi Geological Environment Monitoring Center; adopting the 5-fold cross-validation method, and through Geostatistics analysis means the datasets of all non-landslides and landslides were divided into 80:20 proportions randomly for training and validating models. A set of 15 condition factors including terrain, geological, hydrological, land cover, and human engineering activity factors (distance to road, distance to mined area, ground collapse density) were selected as the evaluation indices to construct the susceptibility assessment model. Three machine learning algorithms for landslide susceptibility prediction (LSP) including C5.0 Decision Tree (C5.0), Random Forest (RF), and Support Vector Machine (SVM) have been selected and compared through the Areas under the Receiver Operating Characteristics (ROC) Curves (AUC), and several statistical estimates. The study revealed that for these three models the value range of prediction accuracies vary from 83.49 to 99.29% (in the training stage), and 62.26–73.58% (in the validation stage). In the two stages, AUCs are between 0.92 to 0.99 and 0.71 to 0.80 respectively. Using Jenks Natural Breaks algorithm, three LSPs levels are established as very low, low, medium, high, and very high probability of landslide by dividing the indices of the LSP. Compared with RF and SVM, C5.0 is considered better in five categories according to quantities and distribution of the landslides and their area percentage for different LSP zones. Four factors such as distance to road, lithology, profile curvature, and ground collapse density are the most suitable condition factors for LSP. The distance to mine area factor has a medium contribution and plays an obvious role in the occurrence of landslides in all the models. The result reveals that C5.0 possesses better prediction efficiency than RF and SVM, and underground mined area and ground collapse sifnigicantly affect significantly the occurrence of landslides in the Fenxi Coal Mine Area.

Download Full-text

KLASIFIKASI SMS SPAM MENGGUNAKAN SUPPORT VECTOR MACHINE

Jurnal Pilar Nusa Mandiri ◽

10.33480/pilar.v15i2.693 ◽

2019 ◽

Vol 15 (2) ◽

pp. 275-280

Author(s):

Agus Setiyono ◽

Hilman F Pardede

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Spam Detection ◽

Support Vector Machine Algorithm ◽

Data Mining Techniques ◽

To Receive

It is now common for a cellphone to receive spam messages. Great number of received messages making it difficult for human to classify those messages to Spam or no Spam. One way to overcome this problem is to use Data Mining for automatic classifications. In this paper, we investigate various data mining techniques, named Support Vector Machine, Multinomial Naïve Bayes and Decision Tree for automatic spam detection. Our experimental results show that Support Vector Machine algorithm is the best algorithm over three evaluated algorithms. Support Vector Machine achieves 98.33%, while Multinomial Naïve Bayes achieves 98.13% and Decision Tree is at 97.10 % accuracy.

Download Full-text

Support vector machine based decision tree for very high resolution multispectral forest mapping

2011 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2011.6048893 ◽

2011 ◽

Cited By ~ 7

Author(s):

Petra Krahwinkler ◽

Juergen Rossmann ◽

Bjoern Sondermann

Keyword(s):

Support Vector Machine ◽

High Resolution ◽

Decision Tree ◽

Support Vector ◽

Forest Mapping ◽

Very High

Download Full-text

Shape optimization of GFRP elastic gridshells by the weighted Lagrange ε-twin support vector machine and multi-objective particle swarm optimization algorithm considering structural weight

Structures ◽

10.1016/j.istruc.2021.05.077 ◽

2021 ◽

Vol 33 ◽

pp. 2066-2084

Author(s):

Soheila Kookalani ◽

Bin Cheng ◽

Sheng Xiang

Keyword(s):

Support Vector Machine ◽

Particle Swarm Optimization ◽

Shape Optimization ◽

Optimization Algorithm ◽

Particle Swarm Optimization Algorithm ◽

Particle Swarm ◽

Twin Support Vector Machine ◽

Support Vector ◽

Swarm Optimization ◽

Multi Objective

Download Full-text

Support Vector Machine and Decision Tree-Based Elective Course Suggestion System: A Case Study

10.1109/3ict53449.2021.9581846 ◽

2021 ◽

Author(s):

M. Fatih Adak ◽

Serpil Ercan

Keyword(s):

Support Vector Machine ◽

Decision Tree ◽

Support Vector ◽

System A ◽

Elective Course

Download Full-text

A comparison of pixel-based decision tree and object-based Support Vector Machine methods for land-cover classification based on aerial images and airborne lidar data

International Journal of Remote Sensing ◽

10.1080/01431161.2017.1371864 ◽

2017 ◽

Vol 38 (23) ◽

pp. 7176-7195 ◽

Cited By ~ 12

Author(s):

Qiong Wu ◽

Ruofei Zhong ◽

Wenji Zhao ◽

Han Fu ◽

Kai Song

Keyword(s):

Support Vector Machine ◽

Land Cover ◽

Decision Tree ◽

Land Cover Classification ◽

Airborne Lidar ◽

Aerial Images ◽

Support Vector ◽

Lidar Data ◽

Object Based ◽

Airborne Lidar Data

Download Full-text

Analysis of Decision Tree and Smooth Support Vector Machine Methods on Data Mining

Journal of Physics Conference Series ◽

10.1088/1742-6596/1255/1/012067 ◽

2019 ◽

Vol 1255 ◽

pp. 012067

Author(s):

Natalina Br Sitepu ◽

Sawaluddin ◽

M Zarlis ◽

Syahril Efendi ◽

Hanna Willa Dhany

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Decision Tree ◽

Support Vector ◽

Smooth Support Vector Machine

Download Full-text

Tinjauan Algoritma RoI (Region of Interest) Dengan Metode Pengambangan Otsu Dan Klasterisasi K-Mean; Hasil Dan Tantangannya

Informatik : Jurnal Ilmu Komputer ◽

10.52958/iftk.v16i2.1961 ◽

2020 ◽

Vol 16 (2) ◽

pp. 75

Author(s):

Didit Widiyanto

Keyword(s):

Support Vector Machine ◽

Decision Tree ◽

Naive Bayes ◽

Region Of Interest ◽

Naïve Bayes ◽

Support Vector ◽

Gray Level

Akurasi sebuah klasifikasi citra ditentukan oleh pengklasifikasi. Meskipun RoI (Region of Interest) tidak menentukan secara langsung akurasi, namun RoI menentukan lingkup klasifikasi citra. Terdapat tiga algoritma yang dapat digunakan sebagai algoritma RoI yaitu; Balanced Histogram Thresholding (BHT), algoritma Otsu, dan algoritma klasterisasi K-Means. Paper ini meninjau algoritma Otsu dan algoritma klasterisasi K-Means yang digunakan oleh lima peneliti. Dari ke lima peneliti; tiga peneliti menerapkan algoritma Otsu dan dua peneliti menerapkan algoritma K-Means sebagai algoritma RoI. Setelah operasi RoI, ke lima peneliti menerapkan algoritma GLCM (Gray Level Co-occurance Matrix) sebagai pengekstraksi ciri tekstur. Hasil ekstraksi ciri diklasifikasi dengan menggunakan berbagai pengklasifikasi antara lain SVM (Support Vector Machine), Naive Bayes, dan Decision Tree. Akhirnya dengan membandingkan hasil dari ke lima peneliti, akurasi tertinggi diperoleh sebesar 100% dengan pengklasifikasi SVM menggunakan algoritma Otsu sebagai algoritma RoI, dan akurasi terendah adalah sebesar52% yang menggunakan algoritma Otsu pada kanal S dari citra HSV (Hue, Saturation Value).

Download Full-text

Multi-objective online optimization of a marine diesel engine using NSGA-II coupled with enhancing trained support vector machine

Applied Thermal Engineering ◽

10.1016/j.applthermaleng.2018.03.080 ◽

2018 ◽

Vol 137 ◽

pp. 218-227 ◽

Cited By ~ 16

Author(s):

Xiaoxiao Niu ◽

Hechun Wang ◽

Song Hu ◽

Chuanlei Yang ◽

Yinyan Wang

Keyword(s):

Support Vector Machine ◽

Diesel Engine ◽

Online Optimization ◽

Support Vector ◽

Marine Diesel Engine ◽

Nsga Ii ◽

Multi Objective ◽

Marine Diesel

Download Full-text