Data Mining Risk Score Models for Big Biomedical and Healthcare Data

2015 ◽  
Vol 08 (06) ◽  
Author(s):  
Emad Elsebakhi ◽  
Ognian Asparouhov ◽  
Anton Berisha
Author(s):  
M. Nandhini ◽  
S. N. Sivanandam ◽  
S. Renugadevi

Data mining explores hidden patterns in large quantities of data and provides a way of analyzing and categorizing them. Associative classification (AC) integrates two data mining tasks, association rule mining and classification, and is used to classify unknown data. Although association rule mining techniques have been used successfully to construct classifiers, they fall short in generating a small set of significant class association rules (CARs) from which to build an accurate associative classifier. In this work, an attempt is made to generate significant CARs using the Artificial Bee Colony (ABC) algorithm, an optimization technique, in order to construct an efficient associative classifier. The associative classifier built from the ABC-discovered CARs achieves high predictive accuracy and interestingness values. The ABC-based AC gave promising results in experiments on health care datasets from the UCI machine learning repository.
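Any search for significant CARs, whether by ABC or another optimizer, needs a way to score a candidate rule. The sketch below is not the ABC algorithm itself. It shows only the two standard rule measures, support and confidence, on a toy dataset. The abstract does not say which measures the authors optimize, and the function name `rule_quality` and the records are hypothetical.

```python
from typing import FrozenSet, List, Tuple

# A record is (set of "attribute=value" items, class label).
Record = Tuple[FrozenSet[str], str]


def rule_quality(records: List[Record],
                 antecedent: FrozenSet[str],
                 label: str) -> Tuple[float, float]:
    """Return (support, confidence) of the rule `antecedent -> label`.

    support    = fraction of all records matching antecedent AND label
    confidence = fraction of antecedent-matching records with that label
    """
    # Class labels of every record whose items contain the antecedent.
    covered = [cls for items, cls in records if antecedent <= items]
    if not records or not covered:
        return 0.0, 0.0
    hits = sum(1 for cls in covered if cls == label)
    return hits / len(records), hits / len(covered)


# Toy data, invented purely for illustration.
records: List[Record] = [
    (frozenset({"bp=high", "age=old"}), "sick"),
    (frozenset({"bp=high", "age=young"}), "sick"),
    (frozenset({"bp=normal", "age=old"}), "healthy"),
    (frozenset({"bp=high", "age=old"}), "healthy"),
]

support, confidence = rule_quality(records, frozenset({"bp=high"}), "sick")
print(support, confidence)
```

Here the rule `bp=high -> sick` covers three of the four records and is correct for two of them, so its support is 2/4 and its confidence is 2/3. An optimizer could use a combination of such values as the fitness of a candidate rule.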


Author(s):  
Güney Gürsel

Data mining contributes greatly to healthcare, for example by supporting effective treatment, healthcare management, customer relationship management, fraud and abuse detection, and decision making. The data mining methods commonly used in healthcare are artificial neural networks, decision trees, genetic algorithms, the nearest neighbor method, logistic regression, fuzzy logic, fuzzy-based neural networks, Bayesian networks and support vector machines. The most frequently used task is classification. Because of the complexity of the medical domain, data mining is not an easy task to accomplish. In addition, the privacy and security of patient data are a major concern because healthcare data are sensitive. Other serious challenges exist as well. This chapter is a descriptive study that aims to introduce data mining and its usage and applications in the healthcare domain. The use of data mining in healthcare informatics, and the challenges it faces, are examined.


High volumes and varieties of data accumulate every day from healthcare and related fields. These big data sources, if managed and analysed properly, will provide vital knowledge. Data mining and data analytics have played an important role in extracting useful information from healthcare and related data sources. The knowledge extracted from these data sources guides patients and healthcare personnel towards improved health conditions. Analytical techniques from statistics and functionalities from data mining and machine learning have already proved their capability through significant contributions to the healthcare industry. The dominant functionality of data mining is classification, which has long been used in mining healthcare data. Although classification is a good learning technique, it may not provide a causal model, which would be a more reliable basis for decision making, particularly in the medical field. Present models for causality are limited in scalability and reliability. The present study examines causal models for causal relationship mining and concludes with some proposals for causal relationship discovery that are efficient, reliable and scalable. The proposed model makes use of some qualities of decision trees along with statistical tests and analytics. It is proposed to build the learning models on healthcare big data sources.


2021 ◽  
Vol 13 (16) ◽  
pp. 8900
Author(s):  
Naeem Ahmed Mahoto ◽  
Asadullah Shaikh ◽  
Mana Saleh Al Reshan ◽  
Muhammad Ali Memon ◽  
Adel Sulaiman

The medical history of a patient is an essential piece of information for healthcare agencies, which keep records of patients. Because each person may have different medical complications, healthcare data remain sparse, high-dimensional and possibly inconsistent. Discovering knowledge about patient behavior from such data is not easy, and it is a challenge for both physicians and healthcare agencies to discover knowledge from large numbers of electronic healthcare records. Data mining, as the existing published literature shows, has proven effective in transforming large data collections into meaningful information and knowledge. This paper presents an overview of the data mining techniques used for knowledge discovery in medical records. Furthermore, based on real healthcare data, it demonstrates a case study of discovering knowledge with the help of three data mining techniques: (1) association analysis; (2) sequential pattern mining; (3) clustering. In particular, association analysis is used to extract frequent correlations among examinations undergone by patients with a specific disease, sequential pattern mining extracts frequent patterns of medical events, and clustering is used to find groups of similar patients. The discovered knowledge may enrich healthcare guidelines, improve their processes and detect anomalous patient behavior with respect to the medical guidelines.
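The association-analysis step can be illustrated with a minimal sketch that counts how often pairs of examinations occur together in the same patient record. This is a simplified stand-in that finds frequent pairs only, not the paper's actual procedure. The function name `frequent_pairs`, the examination names and the support threshold are hypothetical.

```python
from collections import Counter
from itertools import combinations
from typing import Dict, List, Tuple


def frequent_pairs(visits: List[List[str]],
                   min_support: float) -> Dict[Tuple[str, str], float]:
    """Return examination pairs whose co-occurrence support is at least
    `min_support`, where support = share of records containing both."""
    counts: Counter = Counter()
    for exams in visits:
        # Sort and deduplicate so each pair has one canonical ordering.
        for pair in combinations(sorted(set(exams)), 2):
            counts[pair] += 1
    n = len(visits)
    return {pair: c / n for pair, c in counts.items() if c / n >= min_support}


# Toy records, invented for illustration: examinations per patient.
visits = [
    ["glucose", "hba1c", "ecg"],
    ["glucose", "hba1c"],
    ["glucose", "ecg"],
    ["hba1c", "glucose"],
]

pairs = frequent_pairs(visits, min_support=0.5)
print(pairs)
```

With this toy data, glucose and hba1c appear together in three of four records and ecg and glucose in two of four, so both pairs pass the 0.5 threshold, while ecg and hba1c (one of four) does not.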


Processes ◽  
2019 ◽  
Vol 7 (4) ◽  
pp. 222 ◽  
Author(s):  
Bodur ◽  
Atsa’am

This research developed and tested a filter algorithm that serves to reduce the feature space in healthcare datasets. The algorithm binarizes the dataset, and then separately evaluates the risk ratio of each predictor with the response, and outputs ratios that represent the association between a predictor and the class attribute. The value of the association translates to the importance rank of the corresponding predictor in determining the outcome. Using Random Forest and Logistic regression classification, the performance of the developed algorithm was compared against the regsubsets and varImp functions, which are unsupervised methods of variable selection. Equally, the proposed algorithm was compared with the supervised Fisher score and Pearson’s correlation feature selection methods. Different datasets were used for the experiment, and, in the majority of the cases, the predictors selected by the new algorithm outperformed those selected by the existing algorithms. The proposed filter algorithm is therefore a reliable alternative for variable ranking in data mining classification tasks with a dichotomous response.
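The core of such a filter can be sketched as follows, assuming the predictors and the response have already been binarized to 0/1. The risk ratio is the standard epidemiological quantity: the outcome rate among records with the predictor present, divided by the rate among records with it absent. Ranking by distance from 1 on the log scale is an assumption made here for illustration, since the abstract does not state how ratios are turned into ranks. The names `risk_ratio`, `rank_predictors` and the toy columns are hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple


def risk_ratio(x: Sequence[int], y: Sequence[int]) -> float:
    """Risk ratio of binary outcome y for binary predictor x:
    P(y=1 | x=1) / P(y=1 | x=0). Returns nan when undefined."""
    exposed = [yi for xi, yi in zip(x, y) if xi == 1]
    unexposed = [yi for xi, yi in zip(x, y) if xi == 0]
    if not exposed or not unexposed:
        return math.nan
    risk_exposed = sum(exposed) / len(exposed)
    risk_unexposed = sum(unexposed) / len(unexposed)
    if risk_unexposed == 0:
        return math.inf if risk_exposed > 0 else math.nan
    return risk_exposed / risk_unexposed


def rank_predictors(features: Dict[str, Sequence[int]],
                    y: Sequence[int]) -> List[Tuple[str, float]]:
    """Sort predictors by strength of association with y.
    Assumption: strength is |log(RR)|, so RR = 1 means no association."""
    def strength(rr: float) -> float:
        if math.isnan(rr):
            return 0.0
        if rr == 0 or math.isinf(rr):
            return math.inf
        return abs(math.log(rr))

    scored = [(name, risk_ratio(col, y)) for name, col in features.items()]
    return sorted(scored, key=lambda item: strength(item[1]), reverse=True)


# Toy binarized data, invented for illustration.
y = [1, 1, 1, 0, 0, 0, 1, 0]
features = {
    "diabetic": [1, 0, 1, 0, 1, 0, 0, 1],
    "smoker":   [1, 1, 1, 1, 0, 0, 0, 0],
}

ranking = rank_predictors(features, y)
print(ranking)
```

In the toy data the outcome rate is 0.75 among smokers and 0.25 among non-smokers, giving a risk ratio of 3, while the rate is 0.5 in both diabetic groups, giving a ratio of 1. The sketch therefore ranks `smoker` above `diabetic`.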


2018 ◽  
Vol 2018 ◽  
pp. 1-11 ◽  
Author(s):  
Yuxia Lei ◽  
Yushu Yan ◽  
Yonghua Han ◽  
Feng Jiang

In mobile computing, machine learning models for natural language processing (NLP) have become one of the most attractive research focus areas. Association rules among attributes are common knowledge patterns, which can often provide potentially useful information such as mobile users' interests. In practice, almost every attribute is associated with a hierarchy in the domain. Given a relation R = (U, A) and any cut α_a on the hierarchy for every attribute a, there is another rough relation R_Φ, where Φ = (α_a : a ∈ A). This paper establishes the connection between the functional dependencies in R and R_Φ, proposes a method for extracting reducts in R_Φ, and demonstrates the implementation of the proposed method in an application to data mining of association rules. The method for acquiring association rules consists of the following three steps: (1) translating natural texts into relations, by NLP; (2) translating relations into rough ones, by attribute analysis or fuzzy k-means (FKM) clustering; and (3) extracting association rules from concept lattices, by formal concept analysis (FCA). The experimental results show that the proposed methods, which can be applied directly to regular mobile data such as healthcare data, improved the quality and relevance of the rules.
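The idea of applying a cut to each attribute hierarchy and then comparing functional dependencies in the original and coarsened relations can be illustrated with a small sketch. It assumes that a cut can be represented as a mapping from each original value to the coarser concept chosen on the hierarchy, which is a simplification of the paper's formal definition. The names `apply_cuts`, `holds` and the toy relation are hypothetical.

```python
from typing import Dict, List, Sequence

Row = Dict[str, str]


def apply_cuts(rows: List[Row], cuts: Dict[str, Dict[str, str]]) -> List[Row]:
    """Replace each value by its coarser concept under the attribute's cut.
    Attributes or values with no cut entry are left unchanged."""
    return [{a: cuts.get(a, {}).get(v, v) for a, v in row.items()}
            for row in rows]


def holds(rows: List[Row], lhs: Sequence[str], rhs: Sequence[str]) -> bool:
    """Check the functional dependency lhs -> rhs: rows that agree on
    every lhs attribute must also agree on every rhs attribute."""
    seen: Dict[tuple, tuple] = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        val = tuple(row[a] for a in rhs)
        # Remember the first rhs value seen for this key, compare later ones.
        if seen.setdefault(key, val) != val:
            return False
    return True


# Toy relation and cut, invented for illustration.
rows = [
    {"city": "Paris", "plan": "gold"},
    {"city": "Lyon", "plan": "silver"},
    {"city": "Berlin", "plan": "gold"},
]
cuts = {"city": {"Paris": "France", "Lyon": "France", "Berlin": "Germany"}}

coarse = apply_cuts(rows, cuts)
print(holds(rows, ["city"], ["plan"]), holds(coarse, ["city"], ["plan"]))
```

In this toy case the dependency city -> plan holds in the original relation but fails after coarsening, because Paris and Lyon collapse into one value with different plans. It shows only that dependencies in the two relations can differ, which is why their connection needs to be established.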


Author(s):  
Nancy Masih ◽  
Sachin Ahuja

Health care organizations accumulate large amounts of healthcare data, but these data are not mined for hidden patterns that could make the decision-making process more efficient. Data mining techniques can be used to gain insights by discovering hidden patterns that would remain undetected by manual inspection. Data analytics proves useful in the detection and identification of diseases. A complete analysis was conducted on the Framingham Heart Study (FHS) dataset using various data analytic techniques, namely decision tree, Naïve Bayes, support vector machine (SVM) and artificial neural network (ANN), and the results were ranked by accuracy. ANN produced better results than the other classification algorithms. The output helps to identify the prominent features that cause heart disease and the most common features that must be analyzed to predict deaths due to heart disease. Although various studies have been carried out on heart disease, the main focus of this study is predicting heart disease on the FHS dataset using various classification algorithms to achieve high accuracy.

