Data Processing Model to Perform Big Data Analytics in Hybrid Infrastructures

Modeling Big Data Analytics with a Real-Time Executable Specification Language

Handbook of Research on Trends and Future Directions in Big Data and Web Intelligence - Advances in Data Mining and Database Management ◽

10.4018/978-1-4666-8505-5.ch014 ◽

2015 ◽

pp. 289-312

Author(s):

Amir A. Khwaja

Keyword(s):

Big Data ◽

Data Processing ◽

Real Time ◽

Data Analytics ◽

Big Data Analytics ◽

Exception Handling ◽

Specification Language ◽

Big Data Processing ◽

Business Decisions ◽

Concurrent Processes

The big data explosion has already happened, and the situation will only intensify as a growing number of data sources and ubiquitous high-end technology generate data at a frantic pace. One of the most important aspects of big data is the ability to capture, process, and analyze data in real time, as it is generated, to support real-time business decisions. Alternative approaches must be investigated, especially those built on highly parallel, real-time computation for big data processing. The chapter presents RealSpec, a real-time specification language that may be used to model big data analytics because it provides the language features that real-time big data processing needs, such as concurrent processes, multi-threading, resource modeling, timing constraints, and exception handling. The chapter gives an overview of RealSpec and applies the language to a detailed big data event recognition case study to demonstrate its applicability to big data framework and analytics modeling.

Big Data Analytics in Cloud Computing

Advances in Computer and Electrical Engineering - Novel Practices and Trends in Grid and Cloud Computing ◽

10.4018/978-1-5225-9023-1.ch018 ◽

2019 ◽

pp. 325-341

Author(s):

Rajganesh Nagarajan ◽

Ramkumar Thirunavukarasu

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Processing ◽

Data Visualization ◽

Data Analytics ◽

Big Data Analytics ◽

Future Research ◽

Big Data Processing ◽

Investment Cost

In this chapter, the authors consider the different categories of data that are processed by big data analytics tools. The challenges of big data processing are identified, and a solution based on cloud computing is highlighted. Because cloud computing is widely advocated for its pay-per-use model, data processing tools can be deployed effectively in the cloud, reducing investment cost. In addition, the chapter discusses big data platforms, tools, and applications, along with data visualization concepts. Finally, applications of data analytics are discussed as directions for future research.

Modeling Big Data Analytics with a Real-Time Executable Specification Language

Big Data ◽

10.4018/978-1-4666-9840-6.ch021 ◽

2016 ◽

pp. 418-440

Author(s):

Amir A. Khwaja

Keyword(s):

Big Data ◽

Data Processing ◽

Real Time ◽

Data Analytics ◽

Big Data Analytics ◽

Exception Handling ◽

Specification Language ◽

Big Data Processing ◽

Business Decisions ◽

Concurrent Processes

The big data explosion has already happened, and the situation will only intensify as a growing number of data sources and ubiquitous high-end technology generate data at a frantic pace. One of the most important aspects of big data is the ability to capture, process, and analyze data in real time, as it is generated, to support real-time business decisions. Alternative approaches must be investigated, especially those built on highly parallel, real-time computation for big data processing. The chapter presents RealSpec, a real-time specification language that may be used to model big data analytics because it provides the language features that real-time big data processing needs, such as concurrent processes, multi-threading, resource modeling, timing constraints, and exception handling. The chapter gives an overview of RealSpec and applies the language to a detailed big data event recognition case study to demonstrate its applicability to big data framework and analytics modeling.

Urban Planning and Smart City Decision Management Empowered by Real-Time Data Processing Using Big Data Analytics

Sensors ◽

10.3390/s18092994 ◽

2018 ◽

Vol 18 (9) ◽

pp. 2994 ◽

Cited By ~ 28

Author(s):

Bhagya Silva ◽

Murad Khan ◽

Changsu Jung ◽

Jihun Seo ◽

Diyan Muhammad ◽

...

Keyword(s):

Big Data ◽

Data Processing ◽

Real Time ◽

Smart City ◽

Data Analytics ◽

Smart Cities ◽

Big Data Analytics ◽

Time Data ◽

Real Time Data

The Internet of Things (IoT), inspired by the tremendous growth of connected heterogeneous devices, has pioneered the notion of the smart city. Various components integrated within the smart city architecture, e.g., smart transportation, smart community, smart healthcare, and smart grid, aim to enrich the quality of life (QoL) of urban citizens. However, real-time processing requirements and exponential data growth hinder smart city realization. Therefore, herein we propose a Big Data analytics (BDA)-embedded experimental architecture for smart cities. The BDA-embedded smart city serves two major purposes. Firstly, it facilitates the exploitation of urban Big Data (UBD) in planning, designing, and maintaining smart cities. Secondly, it employs BDA to manage and process voluminous UBD to enhance the quality of urban services. The three tiers of the proposed architecture are responsible for data aggregation, real-time data management, and service provisioning. Moreover, offline and online data processing tasks are further expedited by integrating data normalizing and data filtering techniques into the proposed work. By analyzing authenticated datasets, we obtained the threshold values required for urban planning and city operation management. Performance metrics for online and offline data processing on the proposed dual-node Hadoop cluster are obtained using the aforementioned authentic datasets. Throughput and processing time analyses performed against existing works demonstrate the performance superiority of the proposed work. Hence, we can claim the applicability and reliability of implementing the proposed BDA-embedded smart city architecture in the real world.

Big Data Overview

Big Data ◽

10.4018/978-1-4666-9840-6.ch001 ◽

2016 ◽

pp. 1-29 ◽

Cited By ~ 3

Author(s):

Yushi Shen ◽

Yale Li ◽

Ling Wu ◽

Shaofeng Liu ◽

Qian Wen

Keyword(s):

Big Data ◽

Data Processing ◽

Data Analytics ◽

Big Data Analytics ◽

Big Data Processing ◽

Data Scientist ◽

Definition Of

This chapter provides an overview of big data and its environment and opportunities. It starts with a definition of big data and describes the unique characteristics, structure, and value of big data, and the business drivers for big data analytics. It defines the role of the data scientist and describes the new ecosystem for big data processing and analysis.

An Approach in Big Data Analytics to Improve the Velocity of Unstructured Data Using MapReduce

International Journal of System Dynamics Applications ◽

10.4018/ijsda.20211001.oa6 ◽

2021 ◽

Vol 10 (4) ◽

pp. 1-25

Author(s):

Sundarakumar M. R. ◽

Mahadevan G. ◽

Ramasubbareddy Somula ◽

Sankar Sennan ◽

Bharat S. Rawal

Keyword(s):

Big Data ◽

Time Delay ◽

Data Processing ◽

Data Analytics ◽

Big Data Analytics ◽

Data Retrieval ◽

High Volume ◽

Minimum Latency ◽

Hadoop Clusters ◽

Search Index

Big data analytics is an innovative approach for extracting data from high-volume data warehouse systems. It provides a method for compressing large volumes of data into clusters using MapReduce and HDFS. However, extracting and storing data in Hadoop clusters takes considerable time. The proposed system addresses the time delay in the shuffle phase of MapReduce caused by scheduling and sequencing. To improve the speed of big data processing, the proposed work uses the Compressed Elastic Search Index (CESI) and the MapReduce-Based Next Generation Sequencing Approach (MRBNGSA). This approach increases the speed of data retrieval from HDFS clusters because of the way the data is stored: only the metadata is stored in HDFS, which takes less memory at runtime than storing the full data volume. The approach reduces the CPU utilization and memory allocation of the resource manager in the Hadoop framework and improves data processing speed, so that time delay is reduced and latency is kept to a minimum.

An Approach in Big Data Analytics to improve the velocity of unstructured data Using Map Reduce

International Journal of System Dynamics Applications ◽

10.4018/ijsda.20211001oa06 ◽

2021 ◽

Vol 10 (4) ◽

pp. 0-0

Keyword(s):

Big Data ◽

Time Delay ◽

Data Processing ◽

Data Analytics ◽

Big Data Analytics ◽

Data Retrieval ◽

High Volume ◽

Map Reduce ◽

Hadoop Clusters ◽

Search Index

Big data analytics is an innovative approach for extracting data from high-volume data warehouse systems. It provides a method for compressing large volumes of data into clusters using MapReduce and HDFS. However, extracting and storing data in Hadoop clusters takes considerable time. The proposed system addresses the time delay in the shuffle phase of MapReduce caused by scheduling and sequencing. To improve the speed of big data processing, the proposed work uses the Compressed Elastic Search Index (CESI) and the MapReduce-Based Next Generation Sequencing Approach (MRBNGSA). This approach increases the speed of data retrieval from HDFS clusters because of the way the data is stored: only the metadata is stored in HDFS, which takes less memory at runtime than storing the full data volume. The approach reduces the CPU utilization and memory allocation of the resource manager in the Hadoop framework and improves data processing speed, so that time delay is reduced and latency is kept to a minimum.

Big Data Analytics for Concurrent Data Processing

International Journal of Computer Applications ◽

10.5120/21211-3912 ◽

2015 ◽

Vol 120 (3) ◽

pp. 36-41

Author(s):

A Samydurai ◽

C Vijayakumaran ◽

G Kumaresan ◽

B Muthusenthil

Keyword(s):

Big Data ◽

Data Processing ◽

Data Analytics ◽

Big Data Analytics

A System for Big Data Analytics over Diverse Data Processing Platforms

10.5339/qfarc.2016.ictop2798 ◽

2016 ◽

Author(s):

Jorge Quiane ◽

Divy Agrawal ◽

Sanjay Chawla ◽

Ahmed Elmagarmid ◽

Zoi Kaoudi ◽

...

Keyword(s):

Big Data ◽

Data Processing ◽

Data Analytics ◽

Big Data Analytics ◽

Diverse Data

Big data analytics with swarm intelligence

Industrial Management & Data Systems ◽

10.1108/imds-06-2015-0222 ◽

2016 ◽

Vol 116 (4) ◽

pp. 646-666 ◽

Cited By ~ 29

Author(s):

Shi Cheng ◽

Qingyu Zhang ◽

Quande Qin

Keyword(s):

Big Data ◽

Data Processing ◽

Swarm Intelligence ◽

Data Analytics ◽

Large Scale ◽

Big Data Analytics ◽

Underlying Mechanism ◽

Research Area ◽

Future Research ◽

Content Type

Purpose – The quality and quantity of data are vital for effective problem solving. Big data analytics, which requires managing an immense amount of data rapidly, has attracted increasing attention. It is a new research area in the field of information processing techniques. It faces the challenges of large data volumes, high dimensionality, and dynamically changing data. However, such issues might be addressed with help from other research fields, e.g., swarm intelligence (SI), which is a collection of nature-inspired searching techniques. The paper aims to discuss these issues. Design/methodology/approach – In this paper, the potential application of SI in big data analytics is analyzed. The correspondence and association between big data analytics and SI techniques are discussed. As an example of the application of SI algorithms in big data processing, a commodity routing system in a port in China is introduced. Another example is the economic load dispatch problem in the planning of a modern power system. Findings – The characteristics of big data include volume, variety, velocity, veracity, and value. In the SI algorithms, these features can be represented, respectively, as large-scale, high-dimensional, dynamic, noise/surrogate, and fitness/objective problems, which have been effectively solved. Research limitations/implications – In the current research, the example problem of the port is formulated but not yet solved, given the ongoing nature of the project. The example could be understood as advanced IT or data processing technology; however, its underlying mechanism could be the SI algorithms. This paper is the first step in research on applying SI algorithms to big data analytics problems. Future research will compare the performance of the method and fit it to a dynamic real system.
Originality/value – Based on the combination of SI and data mining techniques, the authors can gain a better understanding of big data analytics problems and design more effective algorithms to solve real-world big data analytical problems.
