Geographic spatiotemporal big data correlation analysis via the Hilbert–Huang transformation

2017 · Vol. 89 · pp. 130–141
Author(s): Weijing Song, Lizhe Wang, Yang Xiang, Albert Y. Zomaya
2018 · Vol. 2018 · pp. 1–14
Author(s): Hao Hu, Yuling Liu, Hongqi Zhang, Yuchen Zhang

Network security metrics allow quantitatively evaluating the overall resilience of networked systems against attacks. To this end, security metrics are of great importance to the security-related decision-making process of enterprises. In this paper, we employ an absorbing Markov chain (AMC), combined with big data correlation analysis, to estimate network security. Specifically, we construct the AMC model from a large amount of alert data to describe real-world multistep attack scenarios. In addition, we apply big data correlation analysis to generate the transition probability matrix from the alert stream; this matrix defines the probabilities of moving from one attack action to another in a given scenario before one of the attack targets is reached. Based on probabilistic reasoning, two metric algorithms are designed to evaluate the attack scenario as well as the attackers: the expected number of visits (ENV) and the expected success probability (ESP). The advantage of the proposed model and algorithms is that they assist the administrator in building new scenarios, prioritizing alerts, and ranking them.
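The two metrics named above are standard quantities of absorbing Markov chain theory: ENV comes from the fundamental matrix N = (I − Q)⁻¹, and ESP from the absorption probabilities B = NR. The sketch below illustrates this with a made-up toy transition matrix (three transient attack actions, two absorbing targets); in the paper these probabilities would be mined from the alert stream, not hand-written:

```python
import numpy as np

# Toy AMC over attack states in canonical form P = [[Q, R], [0, I]]:
# 3 transient attack actions, 2 absorbing attack targets.
# All probabilities are illustrative placeholders.
Q = np.array([[0.0, 0.5, 0.2],
              [0.1, 0.0, 0.4],
              [0.0, 0.3, 0.0]])   # transient -> transient
R = np.array([[0.3, 0.0],
              [0.0, 0.5],
              [0.2, 0.5]])        # transient -> absorbing (targets)

# Fundamental matrix N = (I - Q)^{-1}: N[i, j] is the expected number
# of visits (ENV) to attack action j when starting from action i.
N = np.linalg.inv(np.eye(3) - Q)

# B = N R: B[i, k] is the probability of eventually reaching target k
# from action i -- the expected success probability (ESP).
# Each row of B sums to 1, since absorption is certain in an AMC.
B = N @ R
```

Ranking the columns of N orders attack actions by how often an attacker is expected to revisit them, while rows of B rank attackers (starting states) by their chance of reaching each target.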


2021 · Vol. 8 (1)
Author(s): Sreemoyee Biswas, Nilay Khare, Pragati Agrawal, Priyank Jain

Abstract With data becoming a salient asset worldwide, dependence amongst data has kept growing; hence, the real-world datasets one works with today are highly correlated. Over the past few years, researchers have paid attention to this aspect of data privacy and have found correlations among data. The privacy guarantees of existing algorithms were sufficient when no relation existed between records in a dataset; once data correlation is taken into account, those guarantees no longer hold as expected. There is therefore a dire need to reconsider privacy algorithms in light of data correlation. Some research has utilized a well-known machine learning concept, data correlation analysis, to better understand the relationships between data, with promising results. Though the body of work is still small, researchers have done a considerable amount of research on correlated data privacy, providing solutions based on probabilistic models, behavioral analysis, sensitivity analysis, information-theoretic models, statistical correlation analysis, exhaustive combination analysis, temporal privacy leakage, and weighted hierarchical graphs. Nevertheless, researchers work on real-world datasets that are often large (technically termed big data) and house a high amount of data correlation. The data correlation in big data must first be studied; researchers are exploring different analysis techniques to find the most suitable one, after which they can propose measures that guarantee privacy for correlated big data. This survey paper presents a detailed survey of the methods proposed by different researchers to deal with the problem of correlated data privacy and correlated big data privacy, and highlights the future scope in this area. The quantitative analysis of the reviewed articles suggests that data correlation is a significant threat to data privacy, a threat that is further magnified by big data. When data correlation is considered and analyzed, parameters such as the maximum number of queries executed and the mean average error show better results than other methods. Hence, there is a grave need to understand and propose solutions for correlated big data privacy.
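As a toy illustration of why correlation undermines standard privacy guarantees, consider the Laplace mechanism for a counting query: noise calibrated to per-record sensitivity under-protects when groups of records are perfectly correlated, and the noise scale must grow with the group size. The group size of 10 and the ε value below are arbitrary choices for the sketch, not values from any surveyed paper:

```python
import numpy as np

rng = np.random.default_rng(0)

def noisy_count(data, epsilon, sensitivity=1.0):
    """Laplace mechanism for a counting query: release the true count
    plus Laplace noise of scale sensitivity / epsilon."""
    return float(data.sum() + rng.laplace(scale=sensitivity / epsilon))

# Correlated dataset: 10 groups of 10 identical binary records
# (e.g., family members sharing an attribute). Changing one underlying
# individual flips 10 records at once, so the counting query's true
# sensitivity is 10, not 1.
groups = rng.integers(0, 2, size=10)
correlated = np.repeat(groups, 10)

eps = 0.5
naive = noisy_count(correlated, eps, sensitivity=1.0)    # under-protects
aware = noisy_count(correlated, eps, sensitivity=10.0)   # correlation-aware
```

The correlation-aware release pays a 10× larger noise scale for the same ε; the surveyed approaches aim to quantify such dependence more precisely than this worst-case group bound.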


2021 · Vol. 9 (1) · pp. 95–104
Author(s): Fubo Shao, Hui Liu

Abstract In the era of big data, correlation analysis is significant because it can quickly detect correlations between factors, and it has therefore received much attention. Due to its generality and equitability, the maximal information coefficient (MIC) is a hotspot in correlation analysis research. However, if the original approximate algorithm for MIC is applied directly to mining correlations in big data, the computation time is very long. We therefore analyze the theoretical time complexity of the original approximate algorithm in depth and show that it is O(n^2.4) with default parameters. Experiments further show that it is the large number of candidate partitions of random relationships that results in the long computation time. This analysis is a good preparation for the next step of designing new, faster algorithms.
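To make the object of the complexity analysis concrete, the sketch below brute-forces the MIC definition: maximize normalized grid mutual information over all grids with nx·ny ≤ n^0.6 (the default grid-size bound). For simplicity it uses fixed equal-frequency bins on both axes, whereas the original approximate algorithm (the one the O(n^2.4) bound refers to) dynamically optimizes the partition on one axis, so this is an illustration of the definition rather than that algorithm:

```python
import numpy as np

def grid_mutual_info(x, y, nx, ny):
    """Mutual information (bits) of an equal-frequency nx-by-ny grid."""
    xe = np.quantile(x, np.linspace(0, 1, nx + 1))
    ye = np.quantile(y, np.linspace(0, 1, ny + 1))
    pxy, _, _ = np.histogram2d(x, y, bins=[xe, ye])
    pxy = pxy / pxy.sum()
    px = pxy.sum(axis=1, keepdims=True)      # marginal of x, shape (nx, 1)
    py = pxy.sum(axis=0, keepdims=True)      # marginal of y, shape (1, ny)
    nz = pxy > 0                             # skip empty cells (0 log 0 = 0)
    return float((pxy[nz] * np.log2(pxy[nz] / (px @ py)[nz])).sum())

def mic_sketch(x, y):
    """Brute-force MIC: max over grids with nx*ny <= n**0.6 of
    MI normalized by log2(min(nx, ny))."""
    n = len(x)
    bound = n ** 0.6
    best = 0.0
    for nx in range(2, int(bound // 2) + 1):
        for ny in range(2, int(bound // nx) + 1):
            mi = grid_mutual_info(x, y, nx, ny) / np.log2(min(nx, ny))
            best = max(best, mi)
    return best
```

On a noiseless linear relationship this score reaches 1 (MIC's equitability anchor), while statistically independent data scores close to 0; the cost of enumerating every candidate grid, multiplied over many variable pairs, is exactly what makes the naive approach impractical on big data.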


2021 · Vol. 8 (1)
Author(s): Sara Migliorini, Alberto Belussi, Elisa Quintarelli, Damiano Carra

Abstract The MapReduce programming paradigm is frequently used to process and analyse huge amounts of data. This paradigm relies on the ability to apply the same operation in parallel to independent chunks of data. Consequently, overall performance greatly depends on the way data are partitioned among the various computation nodes. The default partitioning technique provided by systems like Hadoop or Spark basically performs a random subdivision of the input records, without considering their nature or the correlations between them. While such an approach can be appropriate in the simplest case, where all input records always have to be analyzed, it becomes a limitation for sophisticated analyses, in which correlations between records can be exploited to prune unnecessary computations in advance. In this paper we design a context-based multi-dimensional partitioning technique, called CoPart, which takes data correlation into account in order to determine how records are subdivided between splits (i.e., units of work assigned to a computation node). More specifically, it considers not only the correlation of data w.r.t. contextual attributes, but also the distribution of each contextual dimension in the dataset. We experimentally compare our approach with existing ones, considering both quality criteria and query execution times.
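The flavor of context-based multi-dimensional partitioning can be sketched as follows: derive a split id from each record's contextual key using per-dimension equal-frequency cut points, so records that are close in context are co-located and splits stay balanced even when a contextual dimension is skewed. This is an illustrative simplification under assumed two-dimensional keys, not CoPart's actual algorithm:

```python
import numpy as np

def context_partition(keys, splits_per_dim):
    """Assign each record to a split based on its contextual key.
    Each dimension is cut at equal-frequency quantiles, so splits stay
    balanced under skewed context distributions -- unlike a random/hash
    partitioner, which ignores context entirely."""
    keys = np.asarray(keys, dtype=float)          # shape (n, d)
    n, d = keys.shape
    cell = np.zeros(n, dtype=int)
    for j in range(d):
        cuts = np.quantile(keys[:, j],
                           np.linspace(0, 1, splits_per_dim + 1)[1:-1])
        cell = cell * splits_per_dim + np.searchsorted(cuts, keys[:, j])
    return cell                                   # id in [0, splits_per_dim**d)

rng = np.random.default_rng(42)
# Hypothetical skewed contextual attributes (e.g., timestamp and a
# sensor reading); a random subdivision would scatter correlated
# records, forcing a contextual range query to touch every split.
keys = np.column_stack([rng.exponential(size=1000),
                        rng.normal(size=1000)])
splits = context_partition(keys, 4)               # 4 x 4 = 16 splits
sizes = np.bincount(splits, minlength=16)
```

With this layout, a query selecting a small contextual range maps to a few split ids and the remaining splits can be pruned before any computation runs.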

