A LIME-Based Explainable Machine Learning Model for Predicting the Severity Level of COVID-19 Diagnosed Patients

Freddy Gabbay; Shirly Bar-Lev; Ofer Montano; Noam Hadad

doi:10.3390/app112110417

A LIME-Based Explainable Machine Learning Model for Predicting the Severity Level of COVID-19 Diagnosed Patients

Applied Sciences ◽

10.3390/app112110417 ◽

2021 ◽

Vol 11 (21) ◽

pp. 10417

Author(s):

Freddy Gabbay ◽

Shirly Bar-Lev ◽

Ofer Montano ◽

Noam Hadad

Keyword(s):

Machine Learning ◽

The Novel ◽

Learning Models ◽

Severity Level ◽

Patient Deterioration ◽

Machine Learning Model ◽

Laboratory Test Results ◽

Hospital Resources ◽

Novel Coronavirus ◽

Machine Learning Models

The fast and seemingly uncontrollable spread of the novel coronavirus disease (COVID-19) poses great challenges to an already overloaded health system worldwide. It thus exemplifies an urgent need for fast and effective triage. Such triage can help in the implementation of the necessary measures to prevent patient deterioration and conserve strained hospital resources. We examine two types of machine learning models, a multilayer perceptron artificial neural networks and decision trees, to predict the severity level of illness for patients diagnosed with COVID-19, based on their medical history and laboratory test results. In addition, we combine the machine learning models with a LIME-based explainable model to provide explainability of the model prediction. Our experimental results indicate that the model can achieve up to 80% prediction accuracy for the dataset we used. Finally, we integrate the explainable machine learning models into a mobile application to enable the usage of the proposed models by medical staff worldwide.

Download Full-text

Short-Term Prediction of COVID-19 Cases Using Machine Learning Models

Applied Sciences ◽

10.3390/app11094266 ◽

2021 ◽

Vol 11 (9) ◽

pp. 4266

Author(s):

Md. Shahriare Satu ◽

Koushik Chandra Howlader ◽

Mufti Mahmud ◽

M. Shamim Kaiser ◽

Sheikh Mohammad Shariful Islam ◽

...

Keyword(s):

Machine Learning ◽

Web Application ◽

Learning Models ◽

Short Term ◽

Infected People ◽

Infected Case ◽

First Case ◽

Novel Coronavirus ◽

Short Term Prediction ◽

Machine Learning Models

The first case in Bangladesh of the novel coronavirus disease (COVID-19) was reported on 8 March 2020, with the number of confirmed cases rapidly rising to over 175,000 by July 2020. In the absence of effective treatment, an essential tool of health policy is the modeling and forecasting of the progress of the pandemic. We, therefore, developed a cloud-based machine learning short-term forecasting model for Bangladesh, in which several regression-based machine learning models were applied to infected case data to estimate the number of COVID-19-infected people over the following seven days. This approach can accurately forecast the number of infected cases daily by training the prior 25 days sample data recorded on our web application. The outcomes of these efforts could aid the development and assessment of prevention strategies and identify factors that most affect the spread of COVID-19 infection in Bangladesh.

Download Full-text

Comparison of machine learning models based on time domain and frequency domain features for faults diagnosis in rotating machines

MATEC Web of Conferences ◽

10.1051/matecconf/201821117009 ◽

2018 ◽

Vol 211 ◽

pp. 17009

Author(s):

Natalia Espinoza Sepulveda ◽

Jyoti Sinha

Keyword(s):

Machine Learning ◽

Frequency Domain ◽

Time Domain ◽

Intelligent Systems ◽

Learning Models ◽

Machine Vibration ◽

Vibration Data ◽

Machine Learning Model ◽

The Time Domain ◽

Machine Learning Models

The development of technologies for the maintenance industry has taken an important role to meet the demanding challenges. One of the important challenges is to predict the defects, if any, in machines as early as possible to manage the machines downtime. The vibration-based condition monitoring (VCM) is well-known for this purpose but requires the human experience and expertise. The machine learning models using the intelligent systems and pattern recognition seem to be the future avenue for machine fault detection without the human expertise. Several such studies are published in the literature. This paper is also on the machine learning model for the different machine faults classification and detection. Here the time domain and frequency domain features derived from the measured machine vibration data are used separated in the development of the machine learning models using the artificial neutral network method. The effectiveness of both the time and frequency domain features based models are compared when they are applied to an experimental rig. The paper presents the proposed machine learning models and their performance in terms of the observations and results.

Download Full-text

Automated Retraining of Machine Learning Models

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.l3322.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 445-452

Keyword(s):

Machine Learning ◽

Input Data ◽

Research Work ◽

Learning Models ◽

Machine Learning Methods ◽

Machine Learning Model ◽

Crucial Component ◽

Conventional Machine ◽

Over Time ◽

Machine Learning Models

Data is the most crucial component of a successful ML system. Once a machine learning model is developed, it gets obsolete over time due to presence of new input data being generated every second. In order to keep our predictions accurate we need to find a way to keep our models up to date. Our research work involves finding a mechanism which can retrain the model with new data automatically. This research also involves exploring the possibilities of automating machine learning processes. We started this project by training and testing our model using conventional machine learning methods. The outcome was then compared with the outcome of those experiments conducted using the AutoML methods like TPOT. This helped us in finding an efficient technique to retrain our models. These techniques can be used in areas where people do not deal with the actual working of a ML model but only require the outputs of ML processes

Download Full-text

TOPICAL ISSUES OF APPLICATION OF MACHINE LEARNING METHODS IN ECONOMY

Инновационные аспекты развития науки и техники. Сборник статей VIII Международной научно-практической конференции: сборник статей, [электронное издание сетевого распространения] / Под ред. Н.В. Емельянова. – М.: “КДУ”, “Добросвет”, 2021. – 149 с. ◽

10.31453/kdu.ru.978-5-7913-1176-4-2021-28-33 ◽

2021 ◽

Author(s):

Natalia Pavlovna Persteneva ◽

◽

Darya Dmitrievn Skryleva ◽

Keyword(s):

Machine Learning ◽

Unsupervised Learning ◽

Supervised Learning ◽

Learning Model ◽

Learning Models ◽

Learning Methods ◽

Machine Learning Methods ◽

Machine Learning Model ◽

Popular Classes ◽

Machine Learning Models

The article discusses machine learning methods. Using the example of two popular classes: supervised learning and unsupervised learning. Variants of the main types of machine learning models for each method are presented. A generalized algorithm for building any machine learning model is formed.

Download Full-text

Improving Logging Prediction on Imbalanced Datasets

International Journal of Open Source Software and Processes ◽

10.4018/ijossp.2016040103 ◽

2016 ◽

Vol 7 (2) ◽

pp. 43-71 ◽

Cited By ~ 3

Author(s):

Sangeeta Lal ◽

Neetu Sardana ◽

Ashish Sureka

Keyword(s):

Machine Learning ◽

Open Source ◽

Class Imbalance ◽

Learning Model ◽

Learning Models ◽

Class Imbalance Problem ◽

Imbalanced Datasets ◽

Imbalance Problem ◽

Machine Learning Model ◽

Machine Learning Models

Logging is an important yet tough decision for OSS developers. Machine-learning models are useful in improving several steps of OSS development, including logging. Several recent studies propose machine-learning models to predict logged code construct. The prediction performances of these models are limited due to the class-imbalance problem since the number of logged code constructs is small as compared to non-logged code constructs. No previous study analyzes the class-imbalance problem for logged code construct prediction. The authors first analyze the performances of J48, RF, and SVM classifiers for catch-blocks and if-blocks logged code constructs prediction on imbalanced datasets. Second, the authors propose LogIm, an ensemble and threshold-based machine-learning model. Third, the authors evaluate the performance of LogIm on three open-source projects. On average, LogIm model improves the performance of baseline classifiers, J48, RF, and SVM, by 7.38%, 9.24%, and 4.6% for catch-blocks, and 12.11%, 14.95%, and 19.13% for if-blocks logging prediction.

Download Full-text

GPT-3 AI language tool calls for cautious optimism

Emerald Expert Briefings ◽

10.1108/oxan-db256373 ◽

2020 ◽

Keyword(s):

Machine Learning ◽

Public Funding ◽

Computational Effort ◽

Learning Models ◽

Content Type ◽

Business Applications ◽

Machine Learning Model ◽

Explainable Ai ◽

Advance Research ◽

Machine Learning Models

Significance It required arguably the single largest computational effort for a machine learning model to date, and is it capable of producing text at times indistinguishable from the work of a human author. This has generated considerable excitement about potentially transformative business applications -- and concerns about the system's weaknesses and possible misuse. Impacts Stereotypes and biases in machine learning models will become increasingly problematic as they are adopted by businesses and governments. The use of flawed AI tools that result in embarrassing failures risk cuts to public funding for AI research. Academia and industry face pressure to advance research into explainable AI, but progress is slow.

Download Full-text

Unscented Kalman Filter-Aided Long Short-Term Memory Approach for Wind Nowcasting

Aerospace ◽

10.3390/aerospace8090236 ◽

2021 ◽

Vol 8 (9) ◽

pp. 236

Author(s):

Junghyun Kim ◽

Kyuman Lee

Keyword(s):

Machine Learning ◽

Short Term Memory ◽

Unscented Kalman Filter ◽

Aviation Industry ◽

Learning Models ◽

Short Term ◽

Machine Learning Model ◽

Long Short Term Memory ◽

Lstm Network ◽

Machine Learning Models

Obtaining reliable wind information is critical for efficiently managing air traffic and airport operations. Wind forecasting has been considered one of the most challenging tasks in the aviation industry. Recently, with the advent of artificial intelligence, many machine learning techniques have been widely used to address a variety of complex phenomena in wind predictions. In this paper, we propose a hybrid framework that combines a machine learning model with Kalman filtering for a wind nowcasting problem in the aviation industry. More specifically, this study has three objectives as follows: (1) compare the performance of the machine learning models (i.e., Gaussian process, multi-layer perceptron, and long short-term memory (LSTM) network) to identify the most appropriate model for wind predictions, (2) combine the machine learning model selected in step (1) with an unscented Kalman filter (UKF) to improve the fidelity of the model, and (3) perform Monte Carlo simulations to quantify uncertainties arising from the modeling process. Results show that short-term time-series wind datasets are best predicted by the LSTM network compared to the other machine learning models and the UKF-aided LSTM (UKF-LSTM) approach outperforms the LSTM network only, especially when long-term wind forecasting needs to be considered.

Download Full-text

Hybrid Machine Learning Model for Body Fat Percentage Prediction Based on Support Vector Regression and Emotional Artificial Neural Networks

Applied Sciences ◽

10.3390/app11219797 ◽

2021 ◽

Vol 11 (21) ◽

pp. 9797

Author(s):

Solaf A. Hussain ◽

Nadire Cavus ◽

Boran Sekeroglu

Keyword(s):

Machine Learning ◽

Body Fat ◽

Support Vector ◽

Body Fat Percentage ◽

Learning Models ◽

Fat Percentage ◽

Machine Learning Model ◽

Proposed Model ◽

Hybrid Machine ◽

Machine Learning Models

Obesity or excessive body fat causes multiple health problems and diseases. However, obesity treatment and control need an accurate determination of body fat percentage (BFP). The existing methods for BFP estimation require several procedures, which reduces their cost-effectivity and generalization. Therefore, developing cost-effective models for BFP estimation is vital for obesity treatment. Machine learning models, particularly hybrid models, have a strong ability to analyze challenging data and perform predictions by combining different characteristics of the models. This study proposed a hybrid machine learning model based on support vector regression and emotional artificial neural networks (SVR-EANNs) for accurate recent BFP prediction using a primary BFP dataset. SVR was applied as a consistent attribute selection model on seven properties and measurements, using the left-out sensitivity analysis, and the regression ability of the EANN was considered in the prediction phase. The proposed model was compared to seven benchmark machine learning models. The obtained results show that the proposed hybrid model (SVR-EANN) outperformed other machine learning models by achieving superior results in the three considered evaluation metrics. Furthermore, the proposed model suggested that abdominal circumference is a significant factor in BFP prediction, while age has a minor effect.

Download Full-text

Value of the Application of CE-MRI Radiomics and Machine Learning in Preoperative Prediction of Sentinel Lymph Node Metastasis in Breast Cancer

Frontiers in Oncology ◽

10.3389/fonc.2021.757111 ◽

2021 ◽

Vol 11 ◽

Author(s):

Yadi Zhu ◽

Ling Yang ◽

Hailin Shen

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Lymph Node ◽

Sentinel Lymph Node ◽

Breast Cancer Patients ◽

Learning Models ◽

Preoperative Prediction ◽

Machine Learning Model ◽

Validation Set ◽

Machine Learning Models

PurposeTo explore the value of machine learning model based on CE-MRI radiomic features in preoperative prediction of sentinel lymph node (SLN) metastasis of breast cancer.MethodsThe clinical, pathological and MRI data of 177 patients with pathologically confirmed breast cancer (81 with SLN positive and 96 with SLN negative) and underwent conventional DCE-MRI before surgery in the First Affiliated Hospital of Soochow University from January 2015 to May 2021 were analyzed retrospectively. The samples were randomly divided into the training set (n=123) and validation set (n= 54) according to the ratio of 7:3. The radiomic features were derived from DCE-MRI phase 2 images, and 1,316 original eigenvectors are normalized by maximum and minimum normalization. The optimal feature filter and selection operator (LASSO) algorithm were used to obtain the optimal features. Five machine learning models of Support Vector Machine, Random Forest, Logistic Regression, Gradient Boosting Decision Tree, and Decision Tree were constructed based on the selected features. Radiomics signature and independent risk factors were incorporated to build a combined model. The receiver operating characteristic curve and area under the curve were used to evaluate the performance of the above models, and the accuracy, sensitivity, and specificity were calculated.ResultsThere is no significant difference between all clinical and histopathological variables in breast cancer patients with and without SLN metastasis (P >0.05), except tumor size and BI-RADS classification (P< 0.01). Thirteen features were obtained as optimal features for machine learning model construction. In the validation set, the AUC (0.86) of SVM was the highest among the five machine learning models. Meanwhile, the combined model showed better performance in sentinel lymph node metastasis (SLNM) prediction and achieved a higher AUC (0.88) in the validation set.ConclusionsWe revealed the clinical value of machine learning models established based on CE-MRI radiomic features, providing a highly accurate, non-invasive, and convenient method for preoperative prediction of SLNM in breast cancer patients.

Download Full-text

An Introduction on Interpretable Machine Learning

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.g1023.0597s20 ◽

2020 ◽

Vol 9 (7S) ◽

pp. 107-111

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Human Life ◽

Accuracy Score ◽

Learning Models ◽

Ethical Practices ◽

Interpretable Machine Learning ◽

Machine Learning Model ◽

Machine Learning Applications ◽

Machine Learning Models

As Artificial Intelligence penetrates all aspects of human life, more and more questions about ethical practices and fair uses arise, which has motivated the research community to look inside and develop methods to interpret these Artificial Intelligence/Machine Learning models. This concept of interpretability can not only help with the ethical questions but also can provide various insights into the working of these machine learning models, which will become crucial in trust-building and understanding how a model makes decisions. Furthermore, in many machine learning applications, the feature of interpretability is the primary value that they offer. However, in practice, many developers select models based on the accuracy score and disregarding the level of interpretability of that model, which can be chaotic as predictions by many high accuracy models are not easily explainable. In this paper, we introduce the concept of Machine Learning Model Interpretability, Interpretable Machine learning, and the methods used for interpretation and explanations.

Download Full-text