Sparse Ensemble Machine Learning to improve robustness of long-term decoding in iBMIs

Mapping Intimacies ◽

10.1101/834028 ◽

2019 ◽

Author(s):

Shoeb Shaikh ◽

Rosa So ◽

Tafadzwa Sibindi ◽

Camilo Libedinsky ◽

Arindam Basu

Keyword(s):

Machine Learning ◽

Support Vector ◽

Linear Discriminant ◽

Ensemble Machine Learning ◽

Machine Learning Approach ◽

Recording Channels ◽

The Face ◽

Learning Machine ◽

Decoding Accuracy

AbstractThis paper presents a novel sparse ensemble based machine learning approach to enhance robustness of intracortical Brain Machine Interfaces (iBMIs) in the face of non-stationary distribution of input neural data across time. Each classifier in the ensemble is trained on a randomly sampled (with replacement) set of input channels. These sparse connections ensure that with a high chance, few of the base classifiers should be less affected by the variations in some of the recording channels. We have tested the generality of this technique on different base classifiers - linear discriminant analysis (LDA), support vector machine (SVM), extreme learning machine (ELM) and multilayer perceptron (MLP). Results show decoding accuracy improvements of up to ≈ 21%, 13%, 19%, 10% in non-human primate (NHP) A and 7%, 9%, 7%, 9% in NHP B across test days while using the sparse ensemble approach over a single classifier model for LDA, SVM, ELM and MLP algorithms respectively. The technique also holds ground when the most informative electrode on the test day is dropped. Accordingly, improvements of up to ≈ 24%, 11%, 22%, 9% in NHP A and 14%, 19%, 7%, 28% in NHP B are obtained for LDA, SVM, ELM and MLP respectively.

Download Full-text

Comparative Study of Machine Learning Approaches for Predicting Creep Behavior of Polyurethane Elastomer

Polymers ◽

10.3390/polym13111768 ◽

2021 ◽

Vol 13 (11) ◽

pp. 1768

Author(s):

Chunhao Yang ◽

Wuning Ma ◽

Jianlin Zhong ◽

Zhendong Zhang

Keyword(s):

Machine Learning ◽

Support Vector ◽

Polyurethane Elastomer ◽

Learning Approaches ◽

Creep Stress ◽

Machine Learning Approach ◽

Creep Time ◽

Multilayer Perceptron Network ◽

Testing Set

The long-term mechanical properties of viscoelastic polymers are among their most important aspects. In the present research, a machine learning approach was proposed for creep properties’ prediction of polyurethane elastomer considering the effect of creep time, creep temperature, creep stress and the hardness of the material. The approaches are based on multilayer perceptron network, random forest and support vector machine regression, respectively. While the genetic algorithm and k-fold cross-validation were used to tune the hyper-parameters. The results showed that the three models all proposed excellent fitting ability for the training set. Moreover, the three models had different prediction capabilities for the testing set by focusing on various changing factors. The correlation coefficient values between the predicted and experimental strains were larger than 0.913 (mostly larger than 0.998) on the testing set when choosing the reasonable model.

Download Full-text

Implementation of machine learning algorithms to create diabetic patient re-admission profiles

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-019-0990-x ◽

2019 ◽

Vol 19 (S9) ◽

Cited By ~ 3

Author(s):

Mohamed Alloghani ◽

Ahmed Aljaaf ◽

Abir Hussain ◽

Thar Baker ◽

Jamila Mustafina ◽

...

Keyword(s):

Machine Learning ◽

Nearest Neighbor ◽

Machine Learning Algorithms ◽

Support Vector ◽

Diabetic Patients ◽

K Nearest Neighbor ◽

Critical Approach ◽

Linear Discriminant ◽

Applied Machine Learning ◽

Learning Machine

Abstract Background Machine learning is a branch of Artificial Intelligence that is concerned with the design and development of algorithms, and it enables today’s computers to have the property of learning. Machine learning is gradually growing and becoming a critical approach in many domains such as health, education, and business. Methods In this paper, we applied machine learning to the diabetes dataset with the aim of recognizing patterns and combinations of factors that characterizes or explain re-admission among diabetes patients. The classifiers used include Linear Discriminant Analysis, Random Forest, k–Nearest Neighbor, Naïve Bayes, J48 and Support vector machine. Results Of the 100,000 cases, 78,363 were diabetic and over 47% were readmitted.Based on the classes that models produced, diabetic patients who are more likely to be readmitted are either women, or Caucasians, or outpatients, or those who undergo less rigorous lab procedures, treatment procedures, or those who receive less medication, and are thus discharged without proper improvements or administration of insulin despite having been tested positive for HbA1c. Conclusion Diabetic patients who do not undergo vigorous lab assessments, diagnosis, medications are more likely to be readmitted when discharged without improvements and without receiving insulin administration, especially if they are women, Caucasians, or both.

Download Full-text

An Ensemble Machine Learning Approach to Understanding the Effect of a Global Pandemic on Twitter Users’ Attitudes

International Journal of Computers Communications & Control ◽

10.15837/ijccc.2021.2.4207 ◽

2021 ◽

Vol 16 (2) ◽

Author(s):

Bokang Jia ◽

Domnica Dzitac ◽

Samridha Shrestha ◽

Komiljon Turdaliev ◽

Nurgazy Seidaliev

Keyword(s):

Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Approach ◽

Logistic Regression Models ◽

Global Pandemic ◽

Ensemble Machine Learning ◽

Machine Learning Approach ◽

Twitter Users ◽

Asian People

It is thought that the COVID-19 outbreak has significantly fuelled racism and discrimination, especially towards Asian individuals[10]. In order to test this hypothesis, in this paper, we build upon existing work in order to classify racist tweets before and after COVID-19 was declared a global pandemic. To overcome the difficult linguistic and unbalanced nature of the classification task, we combine an ensemble of machine learning techniques such as a Linear Support Vector Classifiers, Logistic Regression models, and Deep Neural Networks. We fill the gap in existing literature by (1) using a combined Machine Learning approach to understand the effect of COVID-19 on Twitter users’ attitudes and by (2) improving on the performance of automatic racism detectors. Here we show that there has not been a sharp increase in racism towards Asian people on Twitter and that users that posted racist Tweets before the pandemic are prone to post an approximately equal amount during the outbreak. Previous research on racism and other virus outbreaks suggests that racism towards communities associated with the region of the origin of the virus is not exclusively attributed to the outbreak but rather it is a continued symptom of deep-rooted biases towards minorities[13]. Our research supports these previous findings. We conclude that the COVID-19 outbreak is an additional outlet to discriminate against Asian people, instead of it being the main cause.

Download Full-text

Application of Machine Learning Approaches for the Design and Study of Anticancer Drugs

Current Drug Targets ◽

10.2174/1389450119666180809122244 ◽

2019 ◽

Vol 20 (5) ◽

pp. 488-500 ◽

Cited By ~ 6

Author(s):

Yan Hu ◽

Yi Lu ◽

Shuo Wang ◽

Mengying Zhang ◽

Xiaosheng Qu ◽

...

Keyword(s):

Machine Learning ◽

Drug Design ◽

Anticancer Drugs ◽

Nearest Neighbor ◽

Cost Effective ◽

Support Vector ◽

Learning Approaches ◽

K Nearest Neighbor ◽

Activity Prediction ◽

Linear Discriminant

Background: Globally the number of cancer patients and deaths are continuing to increase yearly, and cancer has, therefore, become one of the world's highest causes of morbidity and mortality. In recent years, the study of anticancer drugs has become one of the most popular medical topics. Objective: In this review, in order to study the application of machine learning in predicting anticancer drugs activity, some machine learning approaches such as Linear Discriminant Analysis (LDA), Principal components analysis (PCA), Support Vector Machine (SVM), Random forest (RF), k-Nearest Neighbor (kNN), and Naïve Bayes (NB) were selected, and the examples of their applications in anticancer drugs design are listed. Results: Machine learning contributes a lot to anticancer drugs design and helps researchers by saving time and is cost effective. However, it can only be an assisting tool for drug design. Conclusion: This paper introduces the application of machine learning approaches in anticancer drug design. Many examples of success in identification and prediction in the area of anticancer drugs activity prediction are discussed, and the anticancer drugs research is still in active progress. Moreover, the merits of some web servers related to anticancer drugs are mentioned.

Download Full-text

Cloud based ensemble machine learning approach for smart detection of epileptic seizures using higher order spectral analysis

Physical and Engineering Sciences in Medicine ◽

10.1007/s13246-021-00970-y ◽

2021 ◽

Author(s):

Kuldeep Singh ◽

Jyoteesh Malhotra

Keyword(s):

Machine Learning ◽

Spectral Analysis ◽

Epileptic Seizures ◽

Higher Order ◽

Learning Approach ◽

Ensemble Machine Learning ◽

Machine Learning Approach

Download Full-text

Comparison of Ensemble Machine Learning Methods for Soil Erosion Pin Measurements

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10010042 ◽

2021 ◽

Vol 10 (1) ◽

pp. 42

Author(s):

Kieu Anh Nguyen ◽

Walter Chen ◽

Bor-Shiun Lin ◽

Uma Seeboonruang

Keyword(s):

Machine Learning ◽

Soil Erosion ◽

Ensemble Methods ◽

Machine Learning Algorithms ◽

Multivariate Adaptive Regression Splines ◽

Gradient Boosting ◽

Support Vector ◽

Ensemble Machine Learning ◽

Boosting Method ◽

Bagging Method

Although machine learning has been extensively used in various fields, it has only recently been applied to soil erosion pin modeling. To improve upon previous methods of quantifying soil erosion based on erosion pin measurements, this study explored the possible application of ensemble machine learning algorithms to the Shihmen Reservoir watershed in northern Taiwan. Three categories of ensemble methods were considered in this study: (a) Bagging, (b) boosting, and (c) stacking. The bagging method in this study refers to bagged multivariate adaptive regression splines (bagged MARS) and random forest (RF), and the boosting method includes Cubist and gradient boosting machine (GBM). Finally, the stacking method is an ensemble method that uses a meta-model to combine the predictions of base models. This study used RF and GBM as the meta-models, decision tree, linear regression, artificial neural network, and support vector machine as the base models. The dataset used in this study was sampled using stratified random sampling to achieve a 70/30 split for the training and test data, and the process was repeated three times. The performance of six ensemble methods in three categories was analyzed based on the average of three attempts. It was found that GBM performed the best among the ensemble models with the lowest root-mean-square error (RMSE = 1.72 mm/year), the highest Nash-Sutcliffe efficiency (NSE = 0.54), and the highest index of agreement (d = 0.81). This result was confirmed by the spatial comparison of the absolute differences (errors) between model predictions and observations using GBM and RF in the study area. In summary, the results show that as a group, the bagging method and the boosting method performed equally well, and the stacking method was third for the erosion pin dataset considered in this study.

Download Full-text

Prediction and Analysis of Gold Prices using Ensemble Machine Learning Algorithms

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.36028 ◽

2021 ◽

Vol 9 (VI) ◽

pp. 4367-4374

Author(s):

Gudipally Chandrashakar

Keyword(s):

Machine Learning ◽

Time Series ◽

Time Series Data ◽

Gold Price ◽

Machine Learning Algorithms ◽

Series Data ◽

Gradient Boosting ◽

Support Vector ◽

Average Value ◽

Ensemble Machine Learning

In this article, we used historical time series data up to the current day gold price. In this study of predicting gold price, we consider few correlating factors like silver price, copper price, standard, and poor’s 500 value, dollar-rupee exchange rate, Dow Jones Industrial Average Value. Considering the prices of every correlating factor and gold price data where dates ranging from 2008 January to 2021 February. Few algorithms of machine learning are used to analyze the time-series data are Random Forest Regression, Support Vector Regressor, Linear Regressor, ExtraTrees Regressor and Gradient boosting Regression. While seeing the results the Extra Tree Regressor algorithm gives the predicted value of gold prices more accurately.

Download Full-text

Long-term streamflow forecasting for the Cascade Reservoir System of Han River using SWAT with CFS output

Hydrology Research ◽

10.2166/nh.2018.114 ◽

2018 ◽

Vol 50 (2) ◽

pp. 655-671

Author(s):

Tian Liu ◽

Yuanfang Chen ◽

Binquan Li ◽

Yiming Hu ◽

Hui Qiu ◽

...

Keyword(s):

Machine Learning ◽

Assessment Tool ◽

Flow Simulation ◽

Machine Learning Algorithms ◽

Support Vector ◽

Streamflow Forecasting ◽

Distribution Models ◽

Han River ◽

Reservoir System

Abstract Due to the large uncertainties of long-term precipitation prediction and reservoir operation, it is difficult to forecast long-term streamflow for large basins with cascade reservoirs. In this paper, a framework coupling the original Climate Forecasting System (CFS) precipitation with the Soil and Water Assessment Tool (SWAT) was proposed to forecast the nine-month streamflow for the Cascade Reservoir System of Han River (CRSHR) including Shiquan, Ankang and Danjiangkou reservoirs. First, CFS precipitation was tested against the observation and post-processed through two machine learning algorithms, random forest and support vector regression. Results showed the correlation coefficients between the monthly areal CFS precipitation (post-processed) and observation were 0.91–0.96, confirming that CFS precipitation post-processing using machine learning was not affected by the extended forecast period. Additionally, two precipitation spatio-temporal distribution models, original CFS and similar historical observation, were adopted to disaggregate the processed monthly areal CFS precipitation to daily subbasin-scale precipitation. Based on the reservoir restoring flow, the regional SWAT was calibrated for CRSHR. The Nash–Sutcliffe efficiencies for three reservoirs flow simulation were 0.86, 0.88 and 0.84, respectively, meeting the accuracy requirement. The experimental forecast showed that for three reservoirs, long-term streamflow forecast with similar historical observed distribution was more accurate than that with original CFS.

Download Full-text

New-Onset Diabetes and Preexisting Diabetes Are Associated With Comparable Reduction in Long-Term Survival After Liver Transplant: A Machine Learning Approach

Mayo Clinic Proceedings ◽

10.1016/j.mayocp.2018.06.020 ◽

2018 ◽

Vol 93 (12) ◽

pp. 1794-1802 ◽

Cited By ~ 9

Author(s):

Venkat Bhat ◽

Mahmood Tazari ◽

Kymberly D. Watt ◽

Mamatha Bhat

Keyword(s):

Machine Learning ◽

Liver Transplant ◽

Learning Approach ◽

Term Survival ◽

Long Term Survival ◽

Onset Diabetes ◽

Machine Learning Approach ◽

New Onset

Download Full-text

Distribution Grids Fault Location employing ST based Optimized Machine Learning Approach

Energies ◽

10.3390/en11092328 ◽

2018 ◽

Vol 11 (9) ◽

pp. 2328 ◽

Cited By ~ 12

Author(s):

Md Shafiullah ◽

M. Abido ◽

Taher Abdel-Fattah

Keyword(s):

Machine Learning ◽

Fault Location ◽

Percentage Error ◽

Support Vector ◽

Learning Approach ◽

Efficiency Coefficient ◽

Learning Tools ◽

Performance Indices ◽

Machine Learning Approach ◽

Distribution Grids

Precise information of fault location plays a vital role in expediting the restoration process, after being subjected to any kind of fault in power distribution grids. This paper proposed the Stockwell transform (ST) based optimized machine learning approach, to locate the faults and to identify the faulty sections in the distribution grids. This research employed the ST to extract useful features from the recorded three-phase current signals and fetches them as inputs to different machine learning tools (MLT), including the multilayer perceptron neural networks (MLP-NN), support vector machines (SVM), and extreme learning machines (ELM). The proposed approach employed the constriction-factor particle swarm optimization (CF-PSO) technique, to optimize the parameters of the SVM and ELM for their better generalization performance. Hence, it compared the obtained results of the test datasets in terms of the selected statistical performance indices, including the root mean squared error (RMSE), mean absolute percentage error (MAPE), percent bias (PBIAS), RMSE-observations to standard deviation ratio (RSR), coefficient of determination (R2), Willmott’s index of agreement (WIA), and Nash–Sutcliffe model efficiency coefficient (NSEC) to confirm the effectiveness of the developed fault location scheme. The satisfactory values of the statistical performance indices, indicated the superiority of the optimized machine learning tools over the non-optimized tools in locating faults. In addition, this research confirmed the efficacy of the faulty section identification scheme based on overall accuracy. Furthermore, the presented results validated the robustness of the developed approach against the measurement noise and uncertainties associated with pre-fault loading condition, fault resistance, and inception angle.

Download Full-text