Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression

Belisario Panay; Nelson Baloian; José A. Pino; Sergio Peñafiel; Horacio Sanson; Nicolas Bersano

doi:10.3390/s20164392

Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression

Sensors ◽

10.3390/s20164392 ◽

2020 ◽

Vol 20 (16) ◽

pp. 4392

Author(s):

Belisario Panay ◽

Nelson Baloian ◽

José A. Pino ◽

Sergio Peñafiel ◽

Horacio Sanson ◽

...

Keyword(s):

Neural Network ◽

Health Care ◽

Artificial Neural Network ◽

Health Care Costs ◽

Black Box ◽

Health Costs ◽

Gradient Boosting ◽

Healthcare Budget ◽

Artificial Neural ◽

Care Costs

Although many authors have highlighted the importance of predicting people’s health costs to improve healthcare budget management, most of them do not address the frequent need to know the reasons behind this prediction, i.e., knowing the factors that influence this prediction. This knowledge allows avoiding arbitrariness or people’s discrimination. However, many times the black box methods (that is, those that do not allow this analysis, e.g., methods based on deep learning techniques) are more accurate than those that allow an interpretation of the results. For this reason, in this work, we intend to develop a method that can achieve similar returns as those obtained with black box methods for the problem of predicting health costs, but at the same time it allows the interpretation of the results. This interpretable regression method is based on the Dempster-Shafer theory using Evidential Regression (EVREG) and a discount function based on the contribution of each dimension. The method “learns” the optimal weights for each feature using a gradient descent technique. The method also uses the nearest k-neighbor algorithm to accelerate calculations. It is possible to select the most relevant features for predicting a patient’s health care costs using this approach and the transparency of the Evidential Regression model. We can obtain a reason for a prediction with a k-NN approach. We used the Japanese health records at Tsuyama Chuo Hospital to test our method, which included medical examinations, test results, and billing information from 2013 to 2018. We compared our model to methods based on an Artificial Neural Network, Gradient Boosting, Regression Tree and Weighted k-Nearest Neighbors. Our results showed that our transparent model performed like the Artificial Neural Network and Gradient Boosting with an R2 of 0.44.

Download Full-text

Predicting Health Care Costs Using Evidence Regression

Proceedings ◽

10.3390/proceedings2019031074 ◽

2019 ◽

Vol 31 (1) ◽

pp. 74 ◽

Cited By ~ 1

Author(s):

Belisario Panay ◽

Nelson Baloian ◽

José Pino ◽

Sergio Peñafiel ◽

Horacio Sanson ◽

...

Keyword(s):

Neural Network ◽

Health Care ◽

Artificial Neural Network ◽

Regression Model ◽

Health Care Costs ◽

Health Care Cost ◽

Gradient Boosting ◽

Boosting Method ◽

Artificial Neural ◽

Care Costs

People’s health care cost prediction is nowadays a valuable tool to improve accountability in health care. In this work, we study if an interpretable method can reach the performance of black-box methods for the problem of predicting health care costs. We present an interpretable regression method based on the Dempster-Shafer theory, using the Evidence Regression model and a discount function based on the contribution of each dimension. Optimal parameters are learned using gradient descent. The k-nearest neighbors’ algorithm was also used to speed up computations. With the transparency of the evidence regression model, it is possible to create a set of rules based on a patient’s vicinity. When making a prediction, the model gives a set of rules for such a result. We used Japanese health records from Tsuyama Chuo Hospital to test our method, which includes medical checkups, exam results, and billing information from 2016 to 2017. We compared our model to an Artificial Neural Network and Gradient Boosting method. Our results showed that our transparent model outperforms the Artificial Neural Network and Gradient Boosting with an R 2 of 0 . 44 .

Download Full-text

Prediction of seismic drift responses of planar steel moment frames using artificial neural network and extreme gradient boosting

Engineering Structures ◽

10.1016/j.engstruct.2021.112518 ◽

2021 ◽

Vol 242 ◽

pp. 112518

Author(s):

Hoang D. Nguyen ◽

Nhan D. Dao ◽

Myoungsu Shin

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Gradient Boosting ◽

Moment Frames ◽

Steel Moment Frames ◽

Extreme Gradient Boosting ◽

Artificial Neural ◽

Seismic Drift

Download Full-text

Desain Sistem Informasi Cerdas pada Smart Grid Berbasis Internet of Things dan Artificial Neural Network

JSAI (Journal Scientific and Applied Informatics) ◽

10.36085/jsai.v4i1.1219 ◽

2021 ◽

Vol 4 (1) ◽

pp. 11-19

Author(s):

Muhammad Ridwan Arif Cahyono ◽

Surya Wirawan

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Internet Of Things ◽

Smart Grid ◽

Black Box ◽

Raspberry Pi ◽

Mean Square ◽

Layer Neuron ◽

Artificial Neural ◽

Artificial Neural Network Ann

Smart grid merupakan sistem kelistrikan yang memungkinkan pengguna untuk melakukan proses menjual dan membeli daya listrik. Pada penelitian ini dirancang model smart grid dengan sumber daya dari listrik PLN dan panel surya yang terhubung dengan beban. Beban yang digunakan memiliki daya maksimal 40 W dan panel surya yang digunakan memiliki kapasitas 100 Wp. ESP32 digunakan sebagai perangkat Internet of Things, yang digunakan sebagai pengukur dan pengontrol daya listrik yang akan dijual atau dibeli. Raspberry Pi digunakan sebagai web server pengolah data dari smart grid. Aplikasi “Smart Grid Dikti” merupakan aplikasi berbasis android yang dapat digunakan untuk melakukan pemantauan serta pengaturan dalam sistem smart grid tersebut. Aplikasi android tersebut telah diuji coba dengan metode Black Box, dengan hasil pengujian 100% berhasil. Kecerdasan buatan berbasis Artificial Neural Network (ANN) dengan metode backpropagation diimplementasikan dalam sistem smart grid yang berfungsi sebagai pengaturan otomatis dalam proses jual dan beli daya listrik. ANN yang digunakan memiliki 3 input, 2 layer neuron, 3 output, dan masing-masing layer memiliki 4 neuron yang diimplementasikan ke dalam bahasa Python. Setelah pelatihan sebanyak 11.000 kali, didapatkan Root Mean Square Error (RMSE) sebesar 0,12151 dan pada saat uji coba didapatkan RMSE sebesar 0,10500 dengan akurasi rata-rata sebesar 89,50%.

Download Full-text

Application of Entropy-Based Attribute Reduction and an Artificial Neural Network in Medicine: A Case Study of Estimating Medical Care Costs Associated with Myocardial Infarction

Entropy ◽

10.3390/e16094788 ◽

2014 ◽

Vol 16 (9) ◽

pp. 4788-4800 ◽

Cited By ~ 4

Author(s):

Qingyun Du ◽

Ke Nie ◽

Zhensheng Wang

Keyword(s):

Neural Network ◽

Myocardial Infarction ◽

Artificial Neural Network ◽

Medical Care ◽

Attribute Reduction ◽

Medical Care Costs ◽

Artificial Neural ◽

Care Costs

Download Full-text

Application of Extreme Gradient Boosting Based on Grey Relation Analysis for Prediction of Compressive Strength of Concrete

Advances in Civil Engineering ◽

10.1155/2021/8878396 ◽

2021 ◽

Vol 2021 ◽

pp. 1-14

Author(s):

Liyun Cui ◽

Peiyuan Chen ◽

Liang Wang ◽

Jin Li ◽

Hao Ling

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Compressive Strength ◽

Gradient Boosting ◽

Grey Relation Analysis ◽

Extreme Gradient Boosting ◽

Compressive Strength Of Concrete ◽

Relation Analysis ◽

Artificial Neural ◽

Grey Relation

The prediction of concrete strength is an interesting point of investigation and could be realized well, especially for the concrete with the complex system, with the development of machine learning and artificial intelligence. Therefore, an excellent algorithm should put emphasis to receiving increased attention from researchers. This study presents a novel predictive system as follows: extreme gradient boosting (XGBoost) based on grey relation analysis (GRA) for predicting the compressive strength of concrete containing slag and metakaolin. One of its highlights is a feature selection methodology, i.e., GRA, which was used to determine the main input variables. Another highlight is that its performance was compared with the frequently used artificial neural network (ANN) and genetic algorithm-artificial neural network (GA-ANN) by using random dataset and the same testing datasets. For three same testing datasets, the average R2 values of ANN, GA-ANN, and XGBoost are 0.674, 0.829, and 0.880, respectively, indicating that XGBoost has the highest absolute fraction of variance (R2). XGBoost can provide best result by testing the root mean squared error (RMSE) and mean absolute percentage error (MAPE). The average RMSE values of ANN, GA-ANN, and XGBoost are 15.569 MPa, 10.530 MPa, and 9.532 MPa, respectively, and those of MAPE of ANN, GA-ANN, and XGBoost are 11.224%, 9.140%, and 8.718%, respectively. Thus, the XGBoost definitely performed better than the ANN and GA-ANN. Finally, a type of application software based on XGBoost was developed for practical applications. This vivid software interfaces could help users in prediction and easy and efficient analysis.

Download Full-text

Predicting 30-day Hospital Readmissions Using Artificial Neural Networks with Medical Code Embedding

10.1101/741504 ◽

2019 ◽

Cited By ~ 1

Author(s):

Wenshuo Liu ◽

Karandeep Singh ◽

Andrew M. Ryan ◽

Devraj Sukul ◽

Elham Mahmoudi ◽

...

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Logistic Regression ◽

Claims Data ◽

Prediction Models ◽

Hospital Performance ◽

Gradient Boosting ◽

Administrative Claims ◽

Hierarchical Logistic Regression ◽

Artificial Neural

ABSTRACTReducing unplanned readmissions is a major focus of current hospital quality efforts. In order to avoid unfair penalization, administrators and policymakers use prediction models to adjust for the performance of hospitals from healthcare claims data. Regression-based models are a commonly utilized method for such risk-standardization across hospitals; however, these models often suffer in accuracy. In this study we, compare four prediction models for unplanned patient readmission for patients hospitalized with acute myocardial infarction (AMI), congestive health failure (HF), and pneumonia (PNA) within the Nationwide Readmissions Database in 2014. We evaluated hierarchical logistic regression and compared its performance with gradient boosting and two models that utilize artificial neural network. We show that unsupervised Global Vector for Word Representations embedding representations of administrative claims data combined with artificial neural network classification models significantly improves prediction of 30-day readmission. Our best models increased the AUC for prediction of 30-day readmissions from 0.68 to 0.72 for AMI, 0.60 to 0.64 for HF, and 0.63 to 0.68 for PNA compared to hierarchical logistic regression. Furthermore, risk-standardized hospital readmission rates calculated from our artificial neural network model that employed embeddings led to reclassification of approximately 10% of hospitals across categories of hospital performance. This finding suggests that prediction models that incorporate new methods classify hospitals differently than traditional regression-based approaches and that their role in assessing hospital performance warrants further investigation.

Download Full-text

Pengembangan Smart Home System Berbasis Kecerdasan Buatan untuk Memanajemen Konsumsi Energi Rumah Tangga dengan Pendekatan Finansial

JSAI (Journal Scientific and Applied Informatics) ◽

10.36085/jsai.v4i1.1218 ◽

2021 ◽

Vol 4 (1) ◽

pp. 1-10

Author(s):

Ihsan Auditia Akhinov ◽

Muhammad Ridwan Arif Cahyono

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Internet Of Things ◽

Smart Home ◽

Web Server ◽

Black Box ◽

Raspberry Pi ◽

Artificial Neural ◽

Online Web

Teknologi rumah pintar yang dikembangkan saat ini belum sepenuhnya mampu mendukung program konservasi energi yang dicanangkan pemerintah. Selain Saat ini kontrol untuk pengaturan rumah pintar masih dilakukan secara manual, belum sepenuhnya otomatis. Pada penelitian ini akan dibangun sistem rumah pintar yang dikendalikan oleh kecerdasan buatan untuk mengendalikan pemakaian energi berdasarkan besaran nilai tagihan bulanan. ESP32 digunakan sebagai perangkat Internet of Things (IoT) yang berfungsi mendeteksi keberadaan manusia dan mengukur energi listrik yang dikonsumsi. Data-data tersebut disimpan dalam online web server yang dibangun dari Raspberry Pi. Sistem ini dapat dimonitor dan dikendalikan oleh aplikasi berbasis Web. Aplikasi ini sudah diuji dengan menggunakan metode Black Box, hasilnya 100% aplikasi berjalan lancar. Artificial Neural Network diimplementasikan menggunakan bahasa Python, dengan 4 input, 2 layer, dan 4 output dimana masing-masing layer terdiri dari 4 neuron. Variabel masukan yang digunakan dalam ANN yaitu intensitas cahaya, temperatur ruangan, durasi waktu penggunaan ruangan, dan target biaya bulanan, sedangkan keluaran dari ANN ini yaitu durasi penggunaan peralatan listrik, dalam purwarupa ini yaitu durasi penggunaan AC, TV, refrigerator, dan lampu. Sistem sudah mampu berjalan dengan baik, mampu memberikan rekomendasi durasi maksimal penggunaan peralatan listrik dengan tingkat kesalahan sebesar 1,64%.

Download Full-text

Artificial Neural Network and Machine Learning Based Methods for Population Estimation of Rohingya Refugees: Comparing Data-Driven and Satellite Image-Driven Approaches

Vietnam Journal of Computer Science ◽

10.1142/s2196888819500246 ◽

2019 ◽

Vol 06 (04) ◽

pp. 439-455 ◽

Cited By ~ 1

Author(s):

Nahian Ahmed ◽

Nazmul Alam Diptu ◽

M. Sakil Khan Shadhin ◽

M. Abrar Fahim Jaki ◽

M. Ferdous Hasan ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Artificial Neural Network ◽

Satellite Image ◽

Population Estimation ◽

Data Driven ◽

Gradient Boosting ◽

Ann Model ◽

Data Driven Approach ◽

Artificial Neural

Manual field-based population census data collection method is slow and expensive, especially for refugee management situations where more frequent censuses are necessary. This study aims to explore the approaches of population estimation of Rohingya migrants using remote sensing and machine learning. Two different approaches of population estimation viz., (i) data-driven approach and (ii) satellite image-driven approach have been explored. A total of 11 machine learning models including Artificial Neural Network (ANN) are applied for both approaches. It is found that, in situations where the surface population distribution is unknown, a smaller satellite image grid cell length is required. For data-driven approach, ANN model is placed fourth, Linear Regression model performed the worst and Gradient Boosting model performed the best. For satellite image-driven approach, ANN model performed the best while Ada Boost model has the worst performance. Gradient Boosting model can be considered as a suitable model to be applied for both the approaches.

Download Full-text

Rule Extraction from Training Data Using Neural Network

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213017500063 ◽

2016 ◽

Vol 26 (03) ◽

pp. 1750006 ◽

Cited By ~ 8

Author(s):

Saroj Kumar Biswas ◽

Manomita Chakraborty ◽

Biswajit Purkayastha ◽

Pinki Roy ◽

Dalton Meitei Thounaojam

Keyword(s):

Neural Network ◽

Data Mining ◽

Artificial Neural Network ◽

High Performance ◽

Rule Extraction ◽

Black Box ◽

Training Data ◽

Extraction Algorithm ◽

Artificial Neural ◽

Trained Neural Network

Data Mining is a powerful technology to help organization to concentrate on most important data by extracting useful information from large database. One of the most commonly used techniques in data mining is Artificial Neural Network due to its high performance in many application domains. Despite many advantages of Artificial Neural Network, one of its main drawbacks is its inherent black box nature which is the main problem of using Artificial Neural Network in data mining. Therefore, this paper proposes a rule extraction algorithm from neural network using classified and misclassified data to convert the black box nature of Artificial Neural Network into a white box. The proposed algorithm is a modification of the existing algorithm, Rule Extraction by Reverse Engineering (RxREN). The proposed algorithm extracts rules from trained neural network for datasets with mixed mode attributes using pedagogical approach. The proposed algorithm uses both classified as well as misclassified data to find out the data ranges of significant attributes in respective classes, which is the innovation of the proposed algorithm. The experimental results clearly show that the performance of the proposed algorithm is superior to existing algorithms.

Download Full-text

Gender Effects on Health Care Costs in Cardiovascular Medicine—A Black Box?

The Thoracic and Cardiovascular Surgeon ◽

10.1055/s-0032-1328931 ◽

2012 ◽

Vol 61 (01) ◽

pp. 074-078 ◽

Cited By ~ 1

Author(s):

Beatrice Vogel ◽

Hermann Reichenspurner ◽

Helmut Gulbins

Keyword(s):

Health Care ◽

Health Care Costs ◽

Black Box ◽

Gender Effects ◽

Cardiovascular Medicine ◽

Care Costs

Download Full-text