Big data, big results: Knowledge discovery in output from large-scale analytics

Tyler H. McCormick; Rebecca Ferrell; Alan F. Karr; Patrick B. Ryan

doi:10.1002/sam.11237

Effective Statistical Methods for Big Data Analytics

Handbook of Research on Applied Cybernetics and Systems Science - Advances in Computational Intelligence and Robotics

10.4018/978-1-5225-2498-4.ch014

2017

pp. 280-299

Cited By ~ 3

Author(s): Cheng Meng, Ye Wang, Xinlian Zhang, Abhyuday Mandal, Wenxuan Zhong, ...

Keyword(s): Decision Making, Big Data, Knowledge Discovery, Statistical Methods, Large Scale, Big Data Analytics, Divide And Conquer, Data Driven, The Past, Large Scale Dataset

With advances in technology over the past decade, the amount of data generated and recorded has grown enormously in virtually all fields of industry and science. This extraordinary amount of data provides unprecedented opportunities for data-driven decision-making and knowledge discovery. However, analyzing such large-scale datasets poses significant challenges and calls for innovative statistical methods specifically designed for greater speed and efficiency. In this chapter, we review currently available methods for big data, with a focus on subsampling methods that use statistical leveraging and on divide-and-conquer methods.
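
The two families of methods named in this abstract can be illustrated on a toy problem. The following is a minimal sketch, not the chapter's own implementation: it assumes a simple linear regression on synthetic data, and the function names (`fit_line`, `leverage_subsample_fit`, `divide_and_conquer_fit`), the subsample size, and the block count are hypothetical choices made for illustration.

```python
import random


def fit_line(xs, ys, ws=None):
    """Weighted least-squares fit of y = a + b*x; returns (a, b)."""
    if ws is None:
        ws = [1.0] * len(xs)
    sw = sum(ws)
    xbar = sum(w * x for w, x in zip(ws, xs)) / sw
    ybar = sum(w * y for w, y in zip(ws, ys)) / sw
    sxx = sum(w * (x - xbar) ** 2 for w, x in zip(ws, xs))
    sxy = sum(w * (x - xbar) * (y - ybar) for w, x, y in zip(ws, xs, ys))
    b = sxy / sxx
    return ybar - b * xbar, b


def leverage_scores(xs):
    """Leverage h_i = 1/n + (x_i - xbar)^2 / Sxx for a line with intercept."""
    n = len(xs)
    xbar = sum(xs) / n
    sxx = sum((x - xbar) ** 2 for x in xs)
    return [1.0 / n + (x - xbar) ** 2 / sxx for x in xs]


def leverage_subsample_fit(xs, ys, r, rng):
    """Sample r rows with probability proportional to leverage, then fit
    with weights 1/pi_i to offset the unequal sampling probabilities."""
    h = leverage_scores(xs)
    total = sum(h)  # equals 2 for this two-parameter model
    probs = [hi / total for hi in h]
    idx = rng.choices(range(len(xs)), weights=probs, k=r)
    return fit_line([xs[i] for i in idx], [ys[i] for i in idx],
                    [1.0 / probs[i] for i in idx])


def divide_and_conquer_fit(xs, ys, k):
    """Split the rows into k blocks, fit each block, average the estimates."""
    fits = [fit_line(xs[j::k], ys[j::k]) for j in range(k)]
    a = sum(f[0] for f in fits) / k
    b = sum(f[1] for f in fits) / k
    return a, b


# Synthetic data (an assumption for this sketch): y = 1 + 2x + noise.
rng = random.Random(0)
n = 20000
xs = [rng.gauss(0.0, 1.0) for _ in range(n)]
ys = [1.0 + 2.0 * x + rng.gauss(0.0, 0.5) for x in xs]

full = fit_line(xs, ys)                                 # uses all rows
sub = leverage_subsample_fit(xs, ys, r=2000, rng=rng)   # uses 2000 sampled rows
dac = divide_and_conquer_fit(xs, ys, k=10)              # averages 10 block fits
print(full, sub, dac)
```

On this synthetic data, all three slope estimates should land near the true value of 2. The subsample fit touches only a tenth of the rows, and the block fits are independent of one another, so they could run in parallel.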

Big Data Mining or Turning Data Mining into Predictive Analytics from Large-Scale 3Vs Data: The Future Challenge for Knowledge Discovery

Model and Data Engineering - Lecture Notes in Computer Science

10.1007/978-3-319-11587-0_2

2014

pp. 4-8

Cited By ~ 8

Author(s): Alfredo Cuzzocrea

Keyword(s): Data Mining, Big Data, Knowledge Discovery, Large Scale, Predictive Analytics, Future Challenge, Big Data Mining, The Future

A Cognitive Adopted Framework for IoT Big-Data Management and Knowledge Discovery Prospective

International Journal of Distributed Sensor Networks

10.1155/2015/718390

2015

Vol 2015

pp. 1-12

Cited By ~ 11

Author(s): Nilamadhab Mishra, Chung-Chih Lin, Hsien-Tsung Chang

Keyword(s): Big Data, Data Management, Knowledge Discovery, Large Scale, Industrial Applications, Sensor Technology, Industrial Automation, Data Framework, Industrial Internet, Knowledge Exploration

The industrial internet is becoming steadily more important for future IoT big-data management and knowledge discovery in large-scale industrial automation applications. Several diverse technologies, such as IoT (Internet of Things), computational intelligence, machine-type communication, big data, and sensor technology, can be combined to improve the efficiency of data management and knowledge discovery in large-scale automation applications. In this work, we propose a Cognitive Oriented IoT Big-data Framework (COIB-framework), along with an implementation architecture, an IoT big-data layering architecture, and a data organization and knowledge exploration subsystem, for effective data management and knowledge discovery that is well suited to large-scale industrial automation applications. The discussion and analysis show that the proposed framework and architectures offer a reasonable solution for implementing smart industrial applications based on IoT big data.

Big Data Analytics Using Local Exceptionality Detection

Advances in Business Information Systems and Analytics - Enterprise Big Data Engineering, Analytics, and Management

10.4018/978-1-5225-0293-7.ch007

2016

pp. 108-125

Cited By ~ 4

Author(s): Martin Atzmueller, Dennis Mollenhauer, Andreas Schmidt

Keyword(s): Big Data, Knowledge Discovery, Large Scale, Pattern Mining, Subgroup Discovery, Data Sets, Basic Algorithm, Real World Datasets, Large Scale Data Processing, Complex Target

Large-scale data processing is one of the key challenges in many application domains, especially given ubiquitous and big data. In these contexts, subgroup discovery provides a flexible method for both data analysis and knowledge discovery. Subgroup discovery and pattern mining are important descriptive data mining tasks. They can be applied, for example, to obtain an overview of the relations in the data, for automatic hypothesis generation, and for a number of knowledge discovery applications. This chapter presents the novel SD-MapR algorithmic framework for large-scale local exceptionality detection, implemented using subgroup discovery on the Map/Reduce framework. We describe the basic algorithm in detail and provide an experimental evaluation using several real-world datasets. We tackle two algorithmic variants focusing on simple and more complex target concepts, i.e., presenting an implementation of exceptional model mining on large attributed graphs. The results of our evaluation show the scalability of the presented approach for large data sets.

A NOVEL ANALYTIC APPROACH FOR LARGE SCALE POWER PLANT WIDE PROCESSES WITH BIG DATA

Advances in Mathematics: Scientific Journal

10.37418/amsj.9.6.30

2020

Vol 9 (6)

pp. 3509-3517

Author(s): K. Malakonda Rayudu, A. Kumar

Keyword(s): Big Data, Power Plant, Large Scale, Analytic Approach

Multi Disease-Prediction Framework Using Hybrid Deep Learning: An Optimal Prediction Model (Preprint)

10.2196/preprints.22865

2020

Author(s): Anusha Ampavathi, Vijaya Saradhi T

Keyword(s): Feature Extraction, Big Data, Deep Learning, Weight Function, Optimization Algorithm, Large Scale, Heuristic Algorithms, Disease Prediction, Health Care Decisions, Proposed Model

Big data and its approaches are generally helpful to the healthcare and biomedical sectors for predicting disease. For trivial symptoms, it is difficult to see a doctor in the hospital at any time. Thus, big data provides essential information about diseases on the basis of the patient’s symptoms. For several medical organizations, disease prediction is important for making the best feasible health care decisions. Conversely, the conventional medical care model takes structured input, which requires more accurate and consistent prediction. This paper aims to develop multi-disease prediction using an improved deep learning concept. Here, different datasets pertaining to “Diabetes, Hepatitis, lung cancer, liver tumor, heart disease, Parkinson’s disease, and Alzheimer’s disease” are gathered from the benchmark UCI repository for the experiment. The proposed model involves three phases: (a) data normalization, (b) weighted normalized feature extraction, and (c) prediction. Initially, the dataset is normalized to bring each attribute's range to a common level. Next, weighted feature extraction is performed, in which a weight function is multiplied with each attribute value to produce large-scale deviation. Here, the weight function is optimized using a combination of two meta-heuristic algorithms, termed the Jaya Algorithm-based Multi-Verse Optimization algorithm (JA-MVO). The optimally extracted features are passed to hybrid deep learning algorithms, namely “Deep Belief Network (DBN) and Recurrent Neural Network (RNN)”. As a modification to the hybrid deep learning architecture, the weights of both the DBN and the RNN are optimized using the same hybrid optimization algorithm. Finally, a comparative evaluation of the proposed prediction against existing models confirms its effectiveness across various performance measures.
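
The first two phases described in this abstract, normalization followed by per-attribute weighting, can be sketched briefly. This is an illustrative sketch only: the abstract does not say which normalization is used, so min-max scaling is an assumption, the toy records are invented, and the weights are fixed by hand here rather than tuned by JA-MVO as in the paper.

```python
def min_max_normalize(rows):
    """Rescale each column to [0, 1]; a constant column maps to 0.0."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    highs = [max(c) for c in cols]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]


def weighted_features(rows, weights):
    """Multiply each attribute value by its per-attribute weight."""
    return [[w * v for w, v in zip(weights, row)] for row in rows]


# Hypothetical toy records: (age, glucose, resting heart rate).
records = [[50.0, 100.0, 60.0], [30.0, 140.0, 80.0], [70.0, 180.0, 70.0]]
# Hypothetical fixed weights; the abstract optimizes these with JA-MVO instead.
weights = [0.5, 2.0, 1.0]

normalized = min_max_normalize(records)
features = weighted_features(normalized, weights)
print(features)
```

The weighting step stretches or shrinks each normalized attribute, so attributes with larger weights contribute more spread to whatever model consumes the features.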

Analysis of the Influence of Big Data Background on the Spread of Large-Scale Sports Events

Journal of Physics Conference Series

10.1088/1742-6596/1744/3/032003

2021

Vol 1744 (3)

pp. 032003

Author(s): Tieniu Xia

Keyword(s): Big Data, Large Scale, Sports Events

BDF-SDN: A Big Data Framework for DDoS Attack Detection in Large-Scale SDN-Based Cloud

2021 IEEE Conference on Dependable and Secure Computing (DSC)

10.1109/dsc49826.2021.9346269

2021

Author(s): Phuc Trinh Dinh, Minho Park

Keyword(s): Big Data, Large Scale, Attack Detection, DDoS Attack, Data Framework, DDoS Attack Detection

Guest Editorial for ACM TECS Special Issue on Effective Divide-and-Conquer, Incremental, or Distributed Mechanisms of Embedded Designs for Extremely Big Data in Large-Scale Devices

ACM Transactions on Embedded Computing Systems

10.1145/3068457

2017

Vol 16 (3)

pp. 1-2

Keyword(s): Big Data, Large Scale, Guest Editorial, Divide And Conquer, Special Issue

Circular supply chain management with large scale group decision making in the big data era: The macro-micro model

Technological Forecasting and Social Change

10.1016/j.techfore.2021.120791

2021

Vol 169

pp. 120791

Author(s): Tsan-Ming Choi, Yue Chen

Keyword(s): Decision Making, Big Data, Supply Chain, Supply Chain Management, Group Decision Making, Large Scale, Group Decision, Micro Model, Chain Management
