A Novel on Transmission Line Tower Big Data Analysis Model Using Altered K-means and ADQL

Se-Hoon Jung; Jun-Ho Huh

doi:10.3390/su11133499

A Novel on Transmission Line Tower Big Data Analysis Model Using Altered K-means and ADQL

Sustainability ◽

10.3390/su11133499 ◽

2019 ◽

Vol 11 (13) ◽

pp. 3499 ◽

Cited By ~ 5

Author(s):

Se-Hoon Jung ◽

Jun-Ho Huh

Keyword(s):

Big Data ◽

Data Analysis ◽

Transmission Line ◽

Clustering Algorithm ◽

Learning Algorithm ◽

Principal Component ◽

Big Data Analysis ◽

Standard Normal Distribution ◽

Analysis Model ◽

Q Learning

This study sought to propose a big data analysis and prediction model for transmission line tower outliers to assess when something is wrong with transmission line tower big data based on deep reinforcement learning. The model enables choosing automatic cluster K values based on non-labeled sensor big data. It also allows measuring the distance of action between data inside a cluster with the Q-value representing network output in the altered transmission line tower big data clustering algorithm containing transmission line tower outliers and old Deep Q Network. Specifically, this study performed principal component analysis to categorize transmission line tower data and proposed an automatic initial central point approach through standard normal distribution. It also proposed the A-Deep Q-Learning algorithm altered from the deep Q-Learning algorithm to explore policies based on the experiences of clustered data learning. It can be used to perform transmission line tower outlier data learning based on the distance of data within a cluster. The performance evaluation results show that the proposed model recorded an approximately 2.29%~4.19% higher prediction rate and around 0.8% ~ 4.3% higher accuracy rate compared to the old transmission line tower big data analysis model.

Download Full-text

A Parallel Clustering Algorithm for Power Big Data Analysis

Communications in Computer and Information Science - Parallel Architecture, Algorithm and Programming ◽

10.1007/978-981-10-6442-5_51 ◽

2017 ◽

pp. 533-540

Author(s):

Xiangjun Meng ◽

Liang Chen ◽

Yidong Li

Keyword(s):

Big Data ◽

Data Analysis ◽

Clustering Algorithm ◽

Big Data Analysis ◽

Parallel Clustering

Download Full-text

Big Data Analysis Using Principal Component Analysis

Journal of Korean institute of intelligent systems ◽

10.5391/jkiis.2015.25.6.592 ◽

2015 ◽

Vol 25 (6) ◽

pp. 592-599 ◽

Cited By ~ 3

Author(s):

Seung-Joo Lee

Keyword(s):

Principal Component Analysis ◽

Big Data ◽

Data Analysis ◽

Principal Component ◽

Component Analysis ◽

Big Data Analysis

Download Full-text

A Wearable Device-Based Personalized Big Data Analysis Model

Ubiquitous Computing and Ambient Intelligence. Personalisation and User Adapted Services - Lecture Notes in Computer Science ◽

10.1007/978-3-319-13102-3_39 ◽

2014 ◽

pp. 236-242 ◽

Cited By ~ 3

Author(s):

Shujaat Hussain ◽

Byeong Ho Kang ◽

Sungyoung Lee

Keyword(s):

Big Data ◽

Data Analysis ◽

Big Data Analysis ◽

Wearable Device ◽

Analysis Model

Download Full-text

K-Nearest Robust Active Learning on Big Data and Application in Epitope Prediction

Wireless Communications and Mobile Computing ◽

10.1155/2021/8752022 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Tianchi Lu

Keyword(s):

Big Data ◽

Data Analysis ◽

Active Learning ◽

Learning Algorithm ◽

Epitope Prediction ◽

Big Data Analysis ◽

Epitope Region ◽

Specific Antibodies ◽

Active Learning Method

B-cells that induce antigen-specific immune responses in vivo produce large numbers of antigen-specific antibodies by recognizing subregions (epitopes) of antigenic proteins, in which they can inhibit the function of antigen protein. Epitope region prediction facilitates the design and development of vaccines that induce the production of antigen-specific antibodies. There are many diseases which are difficult to treat without vaccines. And the COVID-19 has destroyed many people’s lives. Therefore, making vaccines to COVID-19 is very important. Making vaccines needs a large number of experiments to get labeled targets. However, obtaining tremendous labeled data from experiments is a challenge for humans. Big data analysis has proposed some solutions to deal with this challenge. Big data technology has developed very fast and has been applied in many areas. In the bioinformatics area, big data analysis solves a large number of problems, particularly in the area of active learning. Active learning is a method of building more predictive models with less labeled data. Active learning establishes models with less data by asking the oracle (human) for the most valuable samples to train models. Hence, active learning’s application in making vaccines is meaningful that the scientists do not need to do tremendous experiments. This paper proposed a more robust active learning method based on uncertainty sampling and K-nearest density and applies it to the vaccine manufacture. This paper evaluates the new algorithm with accuracy and robustness. In order to evaluate the robustness of active learners, a new robustness index is designed in this paper. And this paper compares the new algorithm with a pool-based active learning algorithm, density-weighted active learning algorithm, and traditional machine learning algorithm. Finally, the new algorithm is applied to epitope prediction of B-cell data, which is significant to making vaccines.

Download Full-text

THE MODELING OF "MUSTAHIQ" DATA USING K-MEANS CLUSTERING ALGORITHM AND BIG DATA ANALYSIS (CASE STUDY: LAZ)

JURNAL TEKNIK INFORMATIKA ◽

10.15408/jti.v13i2.19610 ◽

2021 ◽

Vol 13 (2) ◽

pp. 213-230

Author(s):

Nurhayati Buslim ◽

Rayi Pradono Iswara ◽

Fajar Agustian

Keyword(s):

Big Data ◽

Data Analysis ◽

Clustering Algorithm ◽

System Modeling ◽

Big Data Analysis ◽

Graphical Form ◽

Data Types ◽

Online Data ◽

Combining Data ◽

Data Survey

There are a lot of Mustahiq data in LAZ (Lembaga Amil Zakat) which is spread in many locations today. Each LAZ has Mustahiq data that is different in type from other LAZ. There are differences in Mustahiq data types so that data that is so large cannot be used together even though the purpose of the data is the same to determine Mustahiq data. And to find out whether the Mustahiq data is still up to date (renewable), of course it will be very difficult due to the types of data types that are not uniform or different, long time span, and the large amount of data. To give zakat to Mustahiq certainly requires speed of information. So, in giving zakat to Mustahiq, LAZ will find it difficult to monitor the progress of the Mustahiq. It is possible that a Mustahiq will change his condition to become a Muzaki. This is the reason for the researcher to take this theme in order to help the existing LAZ to make it easier to cluster Mustahiq data. Furthermore, the data already in the cluster can be used by LAZ managers to develop the organization. This can also be a reference for determining the zakat recipient cluster to those who are entitled later. The research is "Modeling using K-Means Algorithm and Big Data analysis in determine Mustahiq data ". We got data Mustahiq with random sample from online and offline survey. Online data survey with Google form and Offline Data survey we got from BAZNAS (National Amil Zakat Agency) in Indonesia and another zakat agency (LAZ) in Jakarta. We conducted by combining data to analyzed using Big Data and K-Means Algorithm. K-Means algorithm is an algorithm for cluster n objects based on attributes into k partitions according to criteria that will be determined from large and diverse Mustahiq data. This research focuses on modeling that applies K-Means Algorithms and Big Data Analysis. The first we made tools for grouping simulation test data. We do several experimental and simulation scenarios to find a model in mapping Mustahiq data to developed best model for processing the data. The results of this study are displayed in tabular and graphical form, namely the proposed Mustahiq data processing model at Zakat Agency (LAZ). The simulation result from a total of 1109 correspondents, 300 correspondents are included in the Mustahiq cluster and 809 correspondents are included in the Non-Mustahiq cluster and have an accuracy rate of 83.40%. That means accuracy of the system modeling able to determine data Mustahiq. Result filtering based on Gender is “Male” accuracy 83.93%, based on Age is ”30-39” accuracy 71,03%, based on Job is “PNS” accuracy 83.39%, based on Education is “S1” accuracy 83.79%. The advantaged of research expected to be able to determine quickly whether the person meets the criteria as a mustahik or Muzaki for LAZ (Amil Zakat Agency). The result of modeling is K-Means clustering algorithm application program can be used if UIN Syarif Hidayatullah Jakarta want to develop LAZ (Amil Zakat Agency) too.

Download Full-text

Research and Application on Spark Clustering Algorithm in Campus Big Data Analysis

Journal of Computer Science Research ◽

10.30564/jcsr.v2i1.1808 ◽

2020 ◽

Vol 2 (1) ◽

Author(s):

Qing Hou ◽

Guangjian Wang ◽

Xiaozheng Wang ◽

Jiaxi Xu ◽

Yang Xin

Keyword(s):

Big Data ◽

Data Analysis ◽

Clustering Algorithm ◽

Big Data Analysis ◽

Psychological Status ◽

Student Management ◽

Multiple Dimensions ◽

Daily Behavior ◽

Teachers And Students ◽

Management Work

Big data analysis has penetrated into all fields of society and has brought about profound changes. However, there is relatively little research on big data supporting student management regarding college and university’s big data. Taking the student card information as the research sample, using spark big data mining technology and K-Means clustering algorithm, taking scholarship evaluation as an example, the big data is analyzed. Data includes analysis of students' daily behavior from multiple dimensions, and it can prevent the unreasonable scholarship evaluation caused by unfair factors such as plagiarism, votes of teachers and students, etc. At the same time, students' absenteeism, physical health and psychological status in advance can be predicted, which makes student management work more active, accurate and effective.

Download Full-text

A method of constructing distributed big data analysis model for machine learning based on Cloud Computing

10.1145/3482632.3484007 ◽

2021 ◽

Author(s):

Jicai Li ◽

Dan Liu

Keyword(s):

Machine Learning ◽

Cloud Computing ◽

Big Data ◽

Data Analysis ◽

Big Data Analysis ◽

Analysis Model

Download Full-text

Big Data Analysis Model of Customer Appeal Based on Power Enterprise Service Platform

2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA) ◽

10.1109/icaica50127.2020.9182539 ◽

2020 ◽

Author(s):

Zhenhua Liu ◽

Liwei Su

Keyword(s):

Big Data ◽

Data Analysis ◽

Big Data Analysis ◽

Analysis Model ◽

Service Platform ◽

Power Enterprise

Download Full-text

An efficient unstructured big data analysis method for enhancing performance using machine learning algorithm

2015 International Conference on Circuits, Power and Computing Technologies [ICCPCT-2015] ◽

10.1109/iccpct.2015.7159492 ◽

2015 ◽

Cited By ~ 1

Author(s):

A. K. Reshmy ◽

D. Paulraj

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analysis ◽

Learning Algorithm ◽

Big Data Analysis ◽

Machine Learning Algorithm ◽

Analysis Method ◽

Data Analysis Method

Download Full-text

Feature Extraction of Ancient Chinese Characters Based on Deep Convolution Neural Network and Big Data Analysis

Computational Intelligence and Neuroscience ◽

10.1155/2021/2491116 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Cheng Zhang ◽

Xingjun Liu

Keyword(s):

Neural Network ◽

Big Data ◽

Data Analysis ◽

Ming Dynasty ◽

Learning Algorithm ◽

Recognition Rate ◽

Big Data Analysis ◽

Convolution Neural Network ◽

Chinese Characters ◽

Deep Convolution Neural Network

In recent years, deep learning has made good progress and has been applied to face recognition, video monitoring, image processing, and other fields. In this big data background, deep convolution neural network has also received more and more attention. In order to extract the ancient Chinese characters effectively, the paper will discuss the structure model, pool process, and network training of deep convolution neural network and compare the algorithm with the traditional machine learning algorithm. The results show that the accuracy and recall rate of the Chinese characters in the plaque of Ming Dynasty can reach the peak, 81.38% and 81.31%, respectively. When the number of training samples increases to 50, the recognition rate of MFA is 99.72%, which is much higher than other algorithms. This shows that the algorithm based on deep convolution neural network and big data analysis has excellent performance and can effectively identify the Chinese characters under different dynasties, different sample sizes, and different interference factors, which can provide a powerful reference for the extraction of ancient Chinese characters.

Download Full-text