scholarly journals Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression

Sensors ◽  
2020 ◽  
Vol 20 (16) ◽  
pp. 4392
Author(s):  
Belisario Panay ◽  
Nelson Baloian ◽  
José A. Pino ◽  
Sergio Peñafiel ◽  
Horacio Sanson ◽  
...  

Although many authors have highlighted the importance of predicting people’s health costs to improve healthcare budget management, most of them do not address the frequent need to know the reasons behind this prediction, i.e., knowing the factors that influence this prediction. This knowledge allows avoiding arbitrariness or people’s discrimination. However, many times the black box methods (that is, those that do not allow this analysis, e.g., methods based on deep learning techniques) are more accurate than those that allow an interpretation of the results. For this reason, in this work, we intend to develop a method that can achieve similar returns as those obtained with black box methods for the problem of predicting health costs, but at the same time it allows the interpretation of the results. This interpretable regression method is based on the Dempster-Shafer theory using Evidential Regression (EVREG) and a discount function based on the contribution of each dimension. The method “learns” the optimal weights for each feature using a gradient descent technique. The method also uses the nearest k-neighbor algorithm to accelerate calculations. It is possible to select the most relevant features for predicting a patient’s health care costs using this approach and the transparency of the Evidential Regression model. We can obtain a reason for a prediction with a k-NN approach. We used the Japanese health records at Tsuyama Chuo Hospital to test our method, which included medical examinations, test results, and billing information from 2013 to 2018. We compared our model to methods based on an Artificial Neural Network, Gradient Boosting, Regression Tree and Weighted k-Nearest Neighbors. Our results showed that our transparent model performed like the Artificial Neural Network and Gradient Boosting with an R2 of 0.44.

Proceedings ◽  
2019 ◽  
Vol 31 (1) ◽  
pp. 74 ◽  
Author(s):  
Belisario Panay ◽  
Nelson Baloian ◽  
José Pino ◽  
Sergio Peñafiel ◽  
Horacio Sanson ◽  
...  

People’s health care cost prediction is nowadays a valuable tool to improve accountability in health care. In this work, we study if an interpretable method can reach the performance of black-box methods for the problem of predicting health care costs. We present an interpretable regression method based on the Dempster-Shafer theory, using the Evidence Regression model and a discount function based on the contribution of each dimension. Optimal parameters are learned using gradient descent. The k-nearest neighbors’ algorithm was also used to speed up computations. With the transparency of the evidence regression model, it is possible to create a set of rules based on a patient’s vicinity. When making a prediction, the model gives a set of rules for such a result. We used Japanese health records from Tsuyama Chuo Hospital to test our method, which includes medical checkups, exam results, and billing information from 2016 to 2017. We compared our model to an Artificial Neural Network and Gradient Boosting method. Our results showed that our transparent model outperforms the Artificial Neural Network and Gradient Boosting with an R 2 of 0 . 44 .


2021 ◽  
Vol 4 (1) ◽  
pp. 11-19
Author(s):  
Muhammad Ridwan Arif Cahyono ◽  
Surya Wirawan

Smart grid merupakan sistem kelistrikan yang memungkinkan pengguna untuk melakukan proses menjual dan membeli daya listrik. Pada penelitian ini dirancang model smart grid dengan sumber daya dari listrik PLN dan panel surya yang terhubung dengan beban. Beban yang digunakan memiliki daya maksimal 40 W dan panel surya yang digunakan memiliki kapasitas 100 Wp. ESP32 digunakan sebagai perangkat Internet of Things, yang digunakan sebagai pengukur dan pengontrol daya listrik yang akan dijual atau dibeli. Raspberry Pi digunakan sebagai web server pengolah data dari smart grid. Aplikasi “Smart Grid Dikti” merupakan aplikasi berbasis android yang dapat digunakan untuk melakukan pemantauan serta pengaturan dalam sistem smart grid tersebut. Aplikasi android tersebut telah diuji coba dengan metode Black Box, dengan hasil pengujian 100% berhasil. Kecerdasan buatan berbasis Artificial Neural Network (ANN) dengan metode backpropagation diimplementasikan dalam sistem smart grid yang berfungsi sebagai pengaturan otomatis dalam proses jual dan beli daya listrik. ANN yang digunakan memiliki 3 input, 2 layer neuron, 3 output, dan masing-masing layer memiliki 4 neuron yang diimplementasikan ke dalam bahasa Python. Setelah pelatihan sebanyak 11.000 kali, didapatkan Root Mean Square  Error (RMSE) sebesar 0,12151 dan pada saat uji coba didapatkan RMSE sebesar 0,10500 dengan akurasi rata-rata sebesar 89,50%.


2021 ◽  
Vol 2021 ◽  
pp. 1-14
Author(s):  
Liyun Cui ◽  
Peiyuan Chen ◽  
Liang Wang ◽  
Jin Li ◽  
Hao Ling

The prediction of concrete strength is an interesting point of investigation and could be realized well, especially for the concrete with the complex system, with the development of machine learning and artificial intelligence. Therefore, an excellent algorithm should put emphasis to receiving increased attention from researchers. This study presents a novel predictive system as follows: extreme gradient boosting (XGBoost) based on grey relation analysis (GRA) for predicting the compressive strength of concrete containing slag and metakaolin. One of its highlights is a feature selection methodology, i.e., GRA, which was used to determine the main input variables. Another highlight is that its performance was compared with the frequently used artificial neural network (ANN) and genetic algorithm-artificial neural network (GA-ANN) by using random dataset and the same testing datasets. For three same testing datasets, the average R2 values of ANN, GA-ANN, and XGBoost are 0.674, 0.829, and 0.880, respectively, indicating that XGBoost has the highest absolute fraction of variance (R2). XGBoost can provide best result by testing the root mean squared error (RMSE) and mean absolute percentage error (MAPE). The average RMSE values of ANN, GA-ANN, and XGBoost are 15.569 MPa, 10.530 MPa, and 9.532 MPa, respectively, and those of MAPE of ANN, GA-ANN, and XGBoost are 11.224%, 9.140%, and 8.718%, respectively. Thus, the XGBoost definitely performed better than the ANN and GA-ANN. Finally, a type of application software based on XGBoost was developed for practical applications. This vivid software interfaces could help users in prediction and easy and efficient analysis.


2019 ◽  
Author(s):  
Wenshuo Liu ◽  
Karandeep Singh ◽  
Andrew M. Ryan ◽  
Devraj Sukul ◽  
Elham Mahmoudi ◽  
...  

ABSTRACTReducing unplanned readmissions is a major focus of current hospital quality efforts. In order to avoid unfair penalization, administrators and policymakers use prediction models to adjust for the performance of hospitals from healthcare claims data. Regression-based models are a commonly utilized method for such risk-standardization across hospitals; however, these models often suffer in accuracy. In this study we, compare four prediction models for unplanned patient readmission for patients hospitalized with acute myocardial infarction (AMI), congestive health failure (HF), and pneumonia (PNA) within the Nationwide Readmissions Database in 2014. We evaluated hierarchical logistic regression and compared its performance with gradient boosting and two models that utilize artificial neural network. We show that unsupervised Global Vector for Word Representations embedding representations of administrative claims data combined with artificial neural network classification models significantly improves prediction of 30-day readmission. Our best models increased the AUC for prediction of 30-day readmissions from 0.68 to 0.72 for AMI, 0.60 to 0.64 for HF, and 0.63 to 0.68 for PNA compared to hierarchical logistic regression. Furthermore, risk-standardized hospital readmission rates calculated from our artificial neural network model that employed embeddings led to reclassification of approximately 10% of hospitals across categories of hospital performance. This finding suggests that prediction models that incorporate new methods classify hospitals differently than traditional regression-based approaches and that their role in assessing hospital performance warrants further investigation.


2021 ◽  
Vol 4 (1) ◽  
pp. 1-10
Author(s):  
Ihsan Auditia Akhinov ◽  
Muhammad Ridwan Arif Cahyono

Teknologi rumah pintar yang dikembangkan saat ini belum sepenuhnya mampu mendukung program konservasi energi yang dicanangkan pemerintah. Selain Saat ini kontrol untuk pengaturan rumah pintar masih dilakukan secara manual, belum sepenuhnya otomatis. Pada penelitian ini akan dibangun sistem rumah pintar yang dikendalikan oleh kecerdasan buatan untuk mengendalikan pemakaian energi berdasarkan besaran nilai tagihan bulanan. ESP32 digunakan sebagai perangkat Internet of Things (IoT) yang berfungsi mendeteksi keberadaan manusia dan mengukur energi listrik yang dikonsumsi. Data-data tersebut disimpan dalam online web server yang dibangun dari Raspberry Pi. Sistem ini dapat dimonitor dan dikendalikan oleh aplikasi berbasis Web. Aplikasi ini sudah diuji dengan menggunakan metode Black Box, hasilnya 100% aplikasi berjalan lancar. Artificial Neural Network diimplementasikan menggunakan bahasa Python, dengan 4 input, 2 layer, dan 4 output dimana masing-masing layer terdiri dari 4 neuron. Variabel masukan yang digunakan dalam ANN yaitu intensitas cahaya, temperatur ruangan, durasi waktu penggunaan ruangan, dan target biaya bulanan, sedangkan keluaran dari ANN ini yaitu durasi penggunaan peralatan listrik, dalam purwarupa ini yaitu durasi penggunaan AC, TV, refrigerator, dan lampu. Sistem sudah mampu berjalan dengan baik, mampu memberikan rekomendasi durasi maksimal penggunaan peralatan listrik dengan tingkat kesalahan sebesar 1,64%.


2019 ◽  
Vol 06 (04) ◽  
pp. 439-455 ◽  
Author(s):  
Nahian Ahmed ◽  
Nazmul Alam Diptu ◽  
M. Sakil Khan Shadhin ◽  
M. Abrar Fahim Jaki ◽  
M. Ferdous Hasan ◽  
...  

Manual field-based population census data collection method is slow and expensive, especially for refugee management situations where more frequent censuses are necessary. This study aims to explore the approaches of population estimation of Rohingya migrants using remote sensing and machine learning. Two different approaches of population estimation viz., (i) data-driven approach and (ii) satellite image-driven approach have been explored. A total of 11 machine learning models including Artificial Neural Network (ANN) are applied for both approaches. It is found that, in situations where the surface population distribution is unknown, a smaller satellite image grid cell length is required. For data-driven approach, ANN model is placed fourth, Linear Regression model performed the worst and Gradient Boosting model performed the best. For satellite image-driven approach, ANN model performed the best while Ada Boost model has the worst performance. Gradient Boosting model can be considered as a suitable model to be applied for both the approaches.


2016 ◽  
Vol 26 (03) ◽  
pp. 1750006 ◽  
Author(s):  
Saroj Kumar Biswas ◽  
Manomita Chakraborty ◽  
Biswajit Purkayastha ◽  
Pinki Roy ◽  
Dalton Meitei Thounaojam

Data Mining is a powerful technology to help organization to concentrate on most important data by extracting useful information from large database. One of the most commonly used techniques in data mining is Artificial Neural Network due to its high performance in many application domains. Despite many advantages of Artificial Neural Network, one of its main drawbacks is its inherent black box nature which is the main problem of using Artificial Neural Network in data mining. Therefore, this paper proposes a rule extraction algorithm from neural network using classified and misclassified data to convert the black box nature of Artificial Neural Network into a white box. The proposed algorithm is a modification of the existing algorithm, Rule Extraction by Reverse Engineering (RxREN). The proposed algorithm extracts rules from trained neural network for datasets with mixed mode attributes using pedagogical approach. The proposed algorithm uses both classified as well as misclassified data to find out the data ranges of significant attributes in respective classes, which is the innovation of the proposed algorithm. The experimental results clearly show that the performance of the proposed algorithm is superior to existing algorithms.


2012 ◽  
Vol 61 (01) ◽  
pp. 074-078 ◽  
Author(s):  
Beatrice Vogel ◽  
Hermann Reichenspurner ◽  
Helmut Gulbins

Sign in / Sign up

Export Citation Format

Share Document