Data Mining Risk Score Models for Big Biomedical and Healthcare Data

Data mining is likely to explore hidden patterns from the huge quantity of data and provides a way of analyzing and categorizing the data. Associative classification (AC) is an integration of two data mining tasks, association rule mining, and classification which is used to classify the unknown data. Though association rule mining techniques are successfully utilized to construct classifiers, it lacks in generating a small set of significant class association rules (CARs) to build an accurate associative classifier. In this work, an attempt is made to generate significant CARs using Artificial Bee Colony (ABC) algorithm, an optimization technique to construct an efficient associative classifier. Associative classifier, thus built using ABC discovered CARs achieve high prognostic accurateness and interestingness value. Promising results were provided by the ABC based AC when experiments were conducted using health care datasets from the UCI machine learning repository.

Download Full-text

For Better Healthcare Mining Health Data

Advances in Data Mining and Database Management - Intelligent Multidimensional Data Clustering and Analysis ◽

10.4018/978-1-5225-1776-4.ch006 ◽

2017 ◽

pp. 135-158

Author(s):

Güney Gürsel

Keyword(s):

Data Mining ◽

Nearest Neighbor ◽

Healthcare Management ◽

Support Vector ◽

Privacy And Security ◽

Healthcare Data ◽

Management Fraud ◽

Use Of Data ◽

Customer Relation Management ◽

Vector Machines

Data mining has great contributions to the healthcare such as support for effective treatment, healthcare management, customer relation management, fraud and abuse detection and decision making. The common data mining methods used in healthcare are Artificial Neural Network, Decision trees, Genetic Algorithms, Nearest neighbor method, Logistic regression, Fuzzy logic, Fuzzy based Neural Networks, Bayesian Networks and Support Vector Machines. The most used task is classification. Because of the complexity and toughness of medical domain, data mining is not an easy task to accomplish. In addition, privacy and security of patient data is a big issue to deal with because of the sensitivity of healthcare data. There exist additional serious challenges. This chapter is a descriptive study aimed to provide an acquaintance to data mining and its usage and applications in healthcare domain. The use of Data mining in healthcare informatics and challenges will be examined.

Download Full-text

Validity of association rules extracted by healthcare-data-mining

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society ◽

10.1109/embc.2014.6944737 ◽

2014 ◽

Cited By ~ 2

Author(s):

Hiroshi Takeuchi ◽

Naoki Kodama

Keyword(s):

Data Mining ◽

Association Rules ◽

Healthcare Data

Download Full-text

Healthcare data mining from multi-source data

10.14264/uql.2017.365 ◽

2017 ◽

Author(s):

Ling Chen

Keyword(s):

Data Mining ◽

Healthcare Data ◽

Source Data

Download Full-text

Data Analytics and Mining in Healthcare with Emphasis on Causal Relationship Mining

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.d6492.118419 ◽

2019 ◽

Vol 8 (4) ◽

pp. 195-204

Keyword(s):

Data Mining ◽

Big Data ◽

Causal Relationship ◽

Data Analytics ◽

Statistical Tests ◽

Analytical Techniques ◽

Data Sources ◽

Healthcare Personnel ◽

Related Data ◽

Healthcare Data

High volumes and varieties of data is piling every day from healthcare and related fields. This big data sources if managed and analysed properly will provide vital knowledge. Data mining and data analytics have been playing an important role in extracting useful information from healthcare and related data sources. The knowledge extracted from these data sources guiding patients and healthcare personnel towards improved health conditions. Analytical techniques from statistics, functionalities from data mining and machine learning already proved their capability with significant contributions to healthcare industry. The dominant functionality of data mining is classification which has been in use in mining healthcare data. Though classification is a good learning technique it may not provide a causation model which will be a reliable model for better decision making particularly in the medical field. The present models for causality have limitations in terms of scalability and reliability. The present study is targeted to study causal models for causal relationship mining. This study tried to conclude with some proposals for causal relationship discovery which are efficient, reliable and scalable. The proposed model is going to make use of some qualities of decision trees along with statistical tests and analytics. It is proposed to build the learning models on healthcare big data sources.

Download Full-text

Knowledge Discovery from Healthcare Electronic Records for Sustainable Environment

Sustainability ◽

10.3390/su13168900 ◽

2021 ◽

Vol 13 (16) ◽

pp. 8900

Author(s):

Naeem Ahmed Mahoto ◽

Asadullah Shaikh ◽

Mana Saleh Al Reshan ◽

Muhammad Ali Memon ◽

Adel Sulaiman

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Association Analysis ◽

Pattern Mining ◽

Large Data ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Electronic Records ◽

Data Mining Techniques ◽

Healthcare Data

The medical history of a patient is an essential piece of information in healthcare agencies, which keep records of patients. Due to the fact that each person may have different medical complications, healthcare data remain sparse, high-dimensional and possibly inconsistent. The knowledge discovery from such data is not easily manageable for patient behaviors. It becomes a challenge for both physicians and healthcare agencies to discover knowledge from many healthcare electronic records. Data mining, as evidenced from the existing published literature, has proven its effectiveness in transforming large data collections into meaningful information and knowledge. This paper proposes an overview of the data mining techniques used for knowledge discovery in medical records. Furthermore, based on real healthcare data, this paper also demonstrates a case study of discovering knowledge with the help of three data mining techniques: (1) association analysis; (2) sequential pattern mining; (3) clustering. Particularly, association analysis is used to extract frequent correlations among examinations done by patients with a specific disease, sequential pattern mining allows extracting frequent patterns of medical events and clustering is used to find groups of similar patients. The discovered knowledge may enrich healthcare guidelines, improve their processes and detect anomalous patients’ behavior with respect to the medical guidelines.

Download Full-text

Filter Variable Selection Algorithm Using Risk Ratios for Dimensionality Reduction of Healthcare Data for Classification

Processes ◽

10.3390/pr7040222 ◽

2019 ◽

Vol 7 (4) ◽

pp. 222 ◽

Cited By ~ 4

Author(s):

Bodur ◽

Atsa’am

Keyword(s):

Data Mining ◽

Variable Selection ◽

Feature Space ◽

Selection Methods ◽

Selection Algorithm ◽

Fisher Score ◽

Healthcare Data ◽

Classification Tasks ◽

Risk Ratios ◽

Variable Ranking

This research developed and tested a filter algorithm that serves to reduce the feature space in healthcare datasets. The algorithm binarizes the dataset, and then separately evaluates the risk ratio of each predictor with the response, and outputs ratios that represent the association between a predictor and the class attribute. The value of the association translates to the importance rank of the corresponding predictor in determining the outcome. Using Random Forest and Logistic regression classification, the performance of the developed algorithm was compared against the regsubsets and varImp functions, which are unsupervised methods of variable selection. Equally, the proposed algorithm was compared with the supervised Fisher score and Pearson’s correlation feature selection methods. Different datasets were used for the experiment, and, in the majority of the cases, the predictors selected by the new algorithm outperformed those selected by the existing algorithms. The proposed filter algorithm is therefore a reliable alternative for variable ranking in data mining classification tasks with a dichotomous response.

Download Full-text

Study on Security and Privacy in Healthcare Data Mining

10.9734/bpi/idhr/v6/14237d ◽

2021 ◽

pp. 108-114

Author(s):

E. Kesavulu Reddy

Keyword(s):

Data Mining ◽

Security And Privacy ◽

Healthcare Data

Download Full-text

The Hierarchies of Multivalued Attribute Domains and Corresponding Applications in Data Mining

Wireless Communications and Mobile Computing ◽

10.1155/2018/1789121 ◽

2018 ◽

Vol 2018 ◽

pp. 1-11 ◽

Cited By ~ 3

Author(s):

Yuxia Lei ◽

Yushu Yan ◽

Yonghua Han ◽

Feng Jiang

Keyword(s):

Data Mining ◽

Association Rules ◽

Language Processing ◽

Common Knowledge ◽

Formal Concept ◽

Research Association ◽

Functional Dependencies ◽

Mobile Data ◽

Healthcare Data ◽

Machine Learning Models

In mobile computing, machine learning models for natural language processing (NLP) have become one of the most attractive focus areas in research. Association rules among attributes are common knowledge patterns, which can often provide potential and useful information such as mobile users' interests. Actually, almost each attribute is associated with a hierarchy of the domain. Given an relation R=(U,A) and any cut αa on the hierarchy for every attribute a, there is another rough relation RΦ, where Φ=(αa:a∈A). This paper will establish the connection between the functional dependencies in R and RΦ, propose the method for extracting reducts in RΦ, and demonstrate the implementation of proposed method on an application in data mining of association rules. The method for acquiring association rules consists of the following three steps: (1) translating natural texts into relations, by NLP; (2) translating relations into rough ones, by attributes analysis or fuzzy k-means (FKM) clustering; and (3) extracting association rules from concept lattices, by formal concept analysis (FCA). Our experimental results show that the proposed methods, which can be applied directly to regular mobile data such as healthcare data, improved quality, and relevance of rules.

Download Full-text

Prediction of Heart Diseases Using Data Mining Techniques

International Journal of Big Data and Analytics in Healthcare ◽

10.4018/ijbdah.2018070101 ◽

2018 ◽

Vol 3 (2) ◽

pp. 1-9 ◽

Cited By ~ 1

Author(s):

Nancy Masih ◽

Sachin Ahuja

Keyword(s):

Data Mining ◽

Heart Disease ◽

Heart Diseases ◽

Complete Analysis ◽

Support Vector ◽

Classification Algorithms ◽

Process Data ◽

Data Mining Techniques ◽

Healthcare Data ◽

Hidden Patterns

Health care organizations accumulate large amount of healthcare data, but it is not ‘extracted' to draw hidden patterns which can prove efficient for the decision making process. Data mining techniques can be used to gain insights by discovering hidden patterns which remain undetected manually. Data analytics proves to be useful in detection and identification of the diseases. A complete analysis has been conducted on the FHS (Framingham Heart Study) using various data analytic techniques viz. Decision tree, Naïve Bayes, Support vector machine (SVM) and Artificial neural network (ANN) and the results were ranked according to the accuracy. ANN produce better results than other classification algorithms. The output helps to find out the prominent features that cause heart disease and also identifies the most common features that must be analyzed for prediction of deaths due to heart disease. Despite various studies carried out on heart diseases, the main focus of this study is prediction of heart disease on the dataset of FHS by using various classification algorithms to achieve high accuracy.

Download Full-text