scholarly journals Application of Support Vector Machine and Gene Expression Programming on Tropospheric ozone Prognosticating for Tehran Metropolitan

2017 ◽  
Vol 3 (8) ◽  
pp. 557 ◽  
Author(s):  
Vahid Mehdipour ◽  
Mahsa Memarianfard

Air pollution became fatal issue for humanity and all environment and developed countries unanimously allocated vast investments on monitoring and researches about air pollutants. Soft computing as a novel way for pollutants prediction can be used for measurement tools calibration which can coincidently decrease the expenditures and enhance their ability to adapt quickly. In this paper support vector machine (SVM) and gene expression programming (GEP) as two powerful approaches with reliable results in previous studies, used to predict tropospheric ozone in Tehran metropolitan by using the photochemical precursors and meteorological parameters as predictors. In a comparison between the two approaches, the best model of SVM gave superior results as it depicted the RMSE= 0.0774 and R= 0.8459 while these results of gene expression programming, respectively, are 0.0883 and 0.7938. Sensitivity of O3 against photochemical precursors and meteorological parameters and also for every input parameter, has been analysed discreetly and the gained results imply that PM2.5, PM10, temperature, CO and NO2 are the most effective parameters for O3 values tolerances. For SVM, several kernel tricks used and the best appropriate kernel selected due to its result. Nonetheless, gamma and sin2 values varied for every kernel and in the last radial basis function kernel opted as the best trick in this study. Finally, the best model of both applications revealed, and the resulted models evaluated as reliable and acceptable.

2021 ◽  
Vol 15 (4) ◽  
pp. 68-74
Author(s):  
Alireza Afradi ◽  
Arash Ebrahimabadi ◽  
Tahereh Hallajian

Purpose. Disc cutters are the main cutting tools for the Tunnel Boring Machines (TBMs). Prediction of the number of consumed disc cutters of TBMs is one of the most significant factors in the tunneling projects. Choosing the right model for predicting the number of consumed disc cutters in mechanized tunneling projects has been the most important mechanized tunneling topics in recent years. Methods. In this research, the prediction of the number of consumed disc cutters considering machine and ground conditions such as Power (KW), Revolutions per minute (RPM) (Cycle/Min), Thrust per Cutter (KN), Geological Strength Index (GSI) in the Sabzkooh water conveyance tunnel has been conducted by multiple linear regression analysis and multiple nonlinear regression, Gene Expression Programming (GEP) method and Support Vector Machine (SVM) approaches. Findings. Results showed that the number of consumed disc cutters for linear regression method is R2 = 0.95 and RMSE = 0.83, nonlinear regression method is – R2 = 0.95 and RMSE = 0.84, Gene Expression Programming (GEP) method is – R2 = 0.94 and RMSE = 0.95, Support Vector Machine (SVM) method is – R2 = 0.98 and RMSE = 0.45. Originality. During the analyses, in order to evaluate the accuracy and efficiency of predictive models, the coefficient of determination (R2) and root mean square error (RMSE) have been used. Practical implications. Results demonstrated that all four methods are effective and have high accuracy but the method of support vector machine has a special superiority over other methods.


2016 ◽  
Vol 24 (1) ◽  
pp. 54-65 ◽  
Author(s):  
Stefano Parodi ◽  
Chiara Manneschi ◽  
Damiano Verda ◽  
Enrico Ferrari ◽  
Marco Muselli

This study evaluates the performance of a set of machine learning techniques in predicting the prognosis of Hodgkin’s lymphoma using clinical factors and gene expression data. Analysed samples from 130 Hodgkin’s lymphoma patients included a small set of clinical variables and more than 54,000 gene features. Machine learning classifiers included three black-box algorithms ( k-nearest neighbour, Artificial Neural Network, and Support Vector Machine) and two methods based on intelligible rules (Decision Tree and the innovative Logic Learning Machine method). Support Vector Machine clearly outperformed any of the other methods. Among the two rule-based algorithms, Logic Learning Machine performed better and identified a set of simple intelligible rules based on a combination of clinical variables and gene expressions. Decision Tree identified a non-coding gene ( XIST) involved in the early phases of X chromosome inactivation that was overexpressed in females and in non-relapsed patients. XIST expression might be responsible for the better prognosis of female Hodgkin’s lymphoma patients.


Water ◽  
2019 ◽  
Vol 11 (3) ◽  
pp. 582 ◽  
Author(s):  
Sultan Noman Qasem ◽  
Saeed Samadianfard ◽  
Hamed Sadri Nahand ◽  
Amir Mosavi ◽  
Shahaboddin Shamshirband ◽  
...  

In the current study, the ability of three data-driven methods of Gene Expression Programming (GEP), M5 model tree (M5), and Support Vector Regression (SVR) were investigated in order to model and estimate the dew point temperature (DPT) at Tabriz station, Iran. For this purpose, meteorological parameters of daily average temperature (T), relative humidity (RH), actual vapor pressure (Vp), wind speed (W), and sunshine hours (S) were obtained from the meteorological organization of East Azerbaijan province, Iran for the period 1998 to 2016. Following this, the methods mentioned above were examined by defining 15 different input combinations of meteorological parameters. Additionally, root mean square error (RMSE) and the coefficient of determination (R2) were implemented to analyze the accuracy of the proposed methods. The results showed that the GEP-10 method, using three input parameters of T, RH, and S, with RMSE of 0.96°, the SVR-5, using two input parameters of T and RH, with RMSE of 0.44, and M5-15, using five input parameters of T, RH, Vp, W, and S with RMSE of 0.37 present better performance in the estimation of the DPT. As a conclusion, the M5-15 is recommended as the most precise model in the estimation of DPT in comparison with other considered models. As a conclusion, the obtained results proved the high capability of proposed M5 models in DPT estimation.


2019 ◽  
Vol 21 (6) ◽  
pp. 1014-1029 ◽  
Author(s):  
Kiyoumars Roushangar ◽  
Ghazaleh Nasssaji Matin ◽  
Roghayeh Ghasempour ◽  
Seyed Mahdi Saghebian

Abstract Energy dissipation in culverts is a complex phenomenon due to the nonlinearity and uncertainties of the process. In the current study, the capability of Gaussian process regression (GPR) and support vector machine (SVM) as kernel-based approaches and the gene expression programming (GEP) method was assessed in predicting energy losses in culverts. Two types of bend loss in rectangular culverts and entrance loss in circular culverts with different inlet end treatments were considered. Various input combinations were developed and tested using experimental data. The OAT (one-at-a-time), factorial sensitivity analysis and Monte Carlo uncertainty analysis were used to select the effective parameters in modeling. The results of performance criteria proved the capability of the applied methods (i.e. high correlation coefficient (R) and determination coefficient (DC) and low root mean square error (RSME)). For rectangular culverts, the model with parameters Fr (Froude number) and θ (bend angle), and for circular culverts, the model with parameters Fr and Hw/D (depth ratio), were the superior models. It showed that using the bend downstream Froude number caused an increment in model efficiency. Among the four end inlet treatments, mitered flush to 1.5:1 fill slope inlet yielded more accurate prediction. The sensitivity and uncertainty analysis showed that θ and Hw/D had the most significant impact on modeling, and Fr had the highest uncertainty.


Author(s):  
JUANA CANUL-REICH ◽  
LAWRENCE O. HALL ◽  
DMITRY B. GOLDGOF ◽  
JOHN N. KORECKI ◽  
STEVEN ESCHRICH

Gene-expression microarray datasets often consist of a limited number of samples with a large number of gene-expression measurements, usually on the order of thousands. Therefore, dimensionality reduction is critical prior to any classification task. In this work, the iterative feature perturbation method (IFP), an embedded gene selector, is introduced and applied to four microarray cancer datasets: colon cancer, leukemia, Moffitt colon cancer, and lung cancer. We compare results obtained by IFP to those of support vector machine-recursive feature elimination (SVM-RFE) and the t-test as a feature filter using a linear support vector machine as the base classifier. Analysis of the intersection of gene sets selected by the three methods across the four datasets was done. Additional experiments included an initial pre-selection of the top 200 genes based on their p values. IFP and SVM-RFE were then applied on the reduced feature sets. These results showed up to 3.32% average performance improvement for IFP across the four datasets. A statistical analysis (using the Friedman/Holm test) for both scenarios showed the highest accuracies came from the t-test as a filter on experiments without gene pre-selection. IFP and SVM-RFE had greater classification accuracy after gene pre-selection. Analysis showed the t-test is a good gene selector for microarray data. IFP and SVM-RFE showed performance improvement on a reduced by t-test dataset. The IFP approach resulted in comparable or superior average class accuracy when compared to SVM-RFE on three of the four datasets. The same or similar accuracies can be obtained with different sets of genes.


2016 ◽  
Vol 16 (4) ◽  
pp. 1002-1016 ◽  
Author(s):  
Hazi Mohammad Azamathulla ◽  
Amir Hamzeh Haghiabi ◽  
Abbas Parsaie

Side weirs have many possible applications in the field of hydraulic engineering. They are also considered an important structure in hydro systems. In this study, the support vector machine (SVM) technique was employed to predict the side weir discharge coefficient. The performance of SVM was compared with other types of soft computing techniques such as artificial neural networks (ANN) and adaptive neuro fuzzy inference systems (ANFIS). While ANN and ANFIS models provided a good prediction performance, the SVM model with a radial basis function kernel function outperforms them. The best SVM model was developed with a gamma coefficient and epsilon of 15 and 0.3, respectively. The SVM yielded a coefficient of determination (R2) equal to 0.96 and 0.93 for the training and testing data. Sensitivity analyses of the ANN, ANFIS and SVM models showed that the Froude number and ratio of weir length to the flow depth upstream of the weir are the most effective parameters for the prediction of the discharge coefficient.


2021 ◽  
Vol 18 (17) ◽  
Author(s):  
Micheal Olaolu AROWOLO ◽  
Marion Olubunmi ADEBIYI ◽  
Chiebuka Timothy NNODIM ◽  
Sulaiman Olaniyi ABDULSALAM ◽  
Ayodele Ariyo ADEBIYI

As mosquito parasites breed across many parts of the sub-Saharan Africa part of the world, infected cells embrace an unpredictable and erratic life period. Millions of individual parasites have gene expressions. Ribonucleic acid sequencing (RNA-seq) is a popular transcriptional technique that has improved the detection of major genetic probes. The RNA-seq analysis generally requires computational improvements of machine learning techniques since it computes interpretations of gene expressions. For this study, an adaptive genetic algorithm (A-GA) with recursive feature elimination (RFE) (A-GA-RFE) feature selection algorithms was utilized to detect important information from a high-dimensional gene expression malaria vector RNA-seq dataset. Support Vector Machine (SVM) kernels were used as the classification algorithms to evaluate its predictive performances. The feasibility of this study was confirmed by using an RNA-seq dataset from the mosquito Anopheles gambiae. The technique results in related performance had 98.3 and 96.7 % accuracy rates, respectively. HIGHLIGHTS Dimensionality reduction method based of feature selection Classification using Support vector machine Classification of malaria vector dataset using an adaptive GA-RFE-SVM GRAPHICAL ABSTRACT


Sign in / Sign up

Export Citation Format

Share Document