K-Nearest Robust Active Learning on Big Data and Application in Epitope Prediction

Wireless Communications and Mobile Computing ◽

10.1155/2021/8752022 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Tianchi Lu

Keyword(s):

Big Data ◽

Data Analysis ◽

Active Learning ◽

Learning Algorithm ◽

Epitope Prediction ◽

Big Data Analysis ◽

Epitope Region ◽

Specific Antibodies ◽

Active Learning Method

B cells that induce antigen-specific immune responses in vivo produce large numbers of antigen-specific antibodies by recognizing subregions (epitopes) of antigenic proteins, and these antibodies can inhibit the function of the antigen protein. Epitope region prediction facilitates the design and development of vaccines that induce the production of antigen-specific antibodies. Many diseases are difficult to treat without vaccines, and COVID-19 has devastated many people's lives, so developing vaccines against COVID-19 is very important. Developing vaccines requires a large number of experiments to obtain labeled targets, and obtaining large amounts of labeled data from experiments is a challenge. Big data analysis offers some solutions to this challenge. Big data technology has developed rapidly and has been applied in many areas. In bioinformatics, big data analysis solves a large number of problems, particularly through active learning. Active learning is a method of building more predictive models with less labeled data: it asks an oracle (a human) to label the most valuable samples and trains the model on them. Applying active learning to vaccine development is therefore meaningful, because scientists do not need to run a very large number of experiments. This paper proposes a more robust active learning method based on uncertainty sampling and K-nearest density and applies it to vaccine manufacture. The new algorithm is evaluated on accuracy and robustness, and a new robustness index is designed to evaluate the robustness of active learners. The paper compares the new algorithm with a pool-based active learning algorithm, a density-weighted active learning algorithm, and a traditional machine learning algorithm. Finally, the new algorithm is applied to epitope prediction on B-cell data, which is significant for making vaccines.
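The abstract above names two ingredients, uncertainty sampling and K-nearest density, without giving formulas. The following is a minimal sketch of how such a query rule is commonly built, not the paper's actual algorithm: the entropy measure, the inverse-mean-distance density, the product used to combine them, and all function names are assumptions made for illustration.

```python
import numpy as np

def entropy_uncertainty(proba):
    """Shannon entropy of each row of predicted class probabilities."""
    p = np.clip(proba, 1e-12, 1.0)  # avoid log(0)
    return -(p * np.log(p)).sum(axis=1)

def knn_density(X, k):
    """Inverse of the mean distance to the k nearest neighbours (self excluded).

    Assumes k is smaller than the number of pool points.
    """
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    np.fill_diagonal(dist, np.inf)          # a point is not its own neighbour
    nearest = np.sort(dist, axis=1)[:, :k]  # k smallest distances per point
    return 1.0 / (nearest.mean(axis=1) + 1e-12)

def select_queries(proba, X_pool, k=5, batch=1):
    """Indices of the pool points with the highest uncertainty * density score.

    Combining the two terms by a product is an assumption of this sketch.
    """
    score = entropy_uncertainty(proba) * knn_density(X_pool, k)
    return np.argsort(-score)[:batch]
```

Weighting uncertainty by local density steers queries away from isolated points, which are often outliers whose labels say little about the rest of the pool. In a full loop, `proba` would come from the current model's predictions on the unlabeled pool, the selected points would be labeled by the oracle, and the model would be retrained.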


A Novel on Transmission Line Tower Big Data Analysis Model Using Altered K-means and ADQL

Sustainability ◽

10.3390/su11133499 ◽

2019 ◽

Vol 11 (13) ◽

pp. 3499 ◽

Cited By ~ 5

Author(s):

Se-Hoon Jung ◽

Jun-Ho Huh

Keyword(s):

Big Data ◽

Data Analysis ◽

Transmission Line ◽

Clustering Algorithm ◽

Learning Algorithm ◽

Principal Component ◽

Big Data Analysis ◽

Standard Normal Distribution ◽

Analysis Model ◽

Q Learning

This study proposes a big data analysis and prediction model for transmission line tower outliers, based on deep reinforcement learning, to assess when something is wrong with transmission line tower big data. The model automatically chooses the number of clusters K from unlabeled sensor big data. It also measures the distance of action between data inside a cluster, using the Q-value that represents the network output, in the altered clustering algorithm for transmission line tower big data containing outliers and in the old Deep Q Network. Specifically, the study performs principal component analysis to categorize transmission line tower data and proposes an automatic approach to choosing initial central points through the standard normal distribution. It also proposes the A-Deep Q-Learning algorithm, altered from the deep Q-Learning algorithm, to explore policies based on the experience of learning from clustered data. The algorithm can be used to learn transmission line tower outlier data based on the distance of data within a cluster. The performance evaluation shows that the proposed model achieved a prediction rate approximately 2.29% to 4.19% higher and an accuracy rate around 0.8% to 4.3% higher than the old transmission line tower big data analysis model.
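The abstract above combines principal component analysis, clustering, and distance-based outlier learning. As a rough illustration of that general pipeline only, the sketch below uses plain PCA, standard Lloyd's K-means with caller-supplied initial centers, and a z-score on distance to the cluster center. It does not reproduce the paper's altered K-means, its automatic choice of K or initial centers, or the A-Deep Q-Learning component; every function name and the `z_thresh` parameter are assumptions.

```python
import numpy as np

def pca_project(X, n_components):
    """Project centered data onto its top principal components (via SVD)."""
    Xc = X - X.mean(axis=0)
    _, _, vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ vt[:n_components].T

def assign(X, centers):
    """Index of the nearest center for each row of X."""
    dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    return dist.argmin(axis=1)

def kmeans(X, centers, n_iter=50):
    """Standard Lloyd's iterations starting from the given centers."""
    centers = np.asarray(centers, dtype=float).copy()
    for _ in range(n_iter):
        labels = assign(X, centers)
        for j in range(len(centers)):
            if np.any(labels == j):  # leave an empty cluster's center unchanged
                centers[j] = X[labels == j].mean(axis=0)
    return assign(X, centers), centers

def flag_outliers(X, labels, centers, z_thresh=3.0):
    """Flag points whose distance to their own center is unusually large.

    The z-score is computed per cluster; z_thresh is an assumed cutoff.
    """
    d = np.linalg.norm(X - centers[labels], axis=1)
    flags = np.zeros(len(X), dtype=bool)
    for j in range(len(centers)):
        m = labels == j
        if m.sum() > 1 and d[m].std() > 0:
            flags[m] = (d[m] - d[m].mean()) / d[m].std() > z_thresh
    return flags
```

A typical call chain would be `Z = pca_project(X, 2)`, then `labels, centers = kmeans(Z, initial_centers)`, then `flag_outliers(Z, labels, centers)`, where `initial_centers` is whatever starting guess is available.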


Active Learning with an Adaptive Classifier for Inaccessible Big Data Analysis

10.1109/ijcnn52387.2021.9534046 ◽

2021

Author(s):

Sadia Jahan ◽

Md Rafiqul Islam ◽

Khan Md. Hasib ◽

Usman Naseem ◽

Md. Saiful Islam

Keyword(s):

Big Data ◽

Data Analysis ◽

Active Learning ◽

Big Data Analysis


An efficient unstructured big data analysis method for enhancing performance using machine learning algorithm

2015 International Conference on Circuits, Power and Computing Technologies [ICCPCT-2015] ◽

10.1109/iccpct.2015.7159492 ◽

2015 ◽

Cited By ~ 1

Author(s):

A. K. Reshmy ◽

D. Paulraj

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analysis ◽

Learning Algorithm ◽

Big Data Analysis ◽

Machine Learning Algorithm ◽

Analysis Method ◽

Data Analysis Method


Feature Extraction of Ancient Chinese Characters Based on Deep Convolution Neural Network and Big Data Analysis

Computational Intelligence and Neuroscience ◽

10.1155/2021/2491116 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Cheng Zhang ◽

Xingjun Liu

Keyword(s):

Neural Network ◽

Big Data ◽

Data Analysis ◽

Ming Dynasty ◽

Learning Algorithm ◽

Recognition Rate ◽

Big Data Analysis ◽

Convolution Neural Network ◽

Chinese Characters ◽

Deep Convolution Neural Network

In recent years, deep learning has made good progress and has been applied to face recognition, video monitoring, image processing, and other fields. Against this big data background, deep convolution neural networks have received increasing attention. To extract ancient Chinese characters effectively, the paper discusses the structure model, pooling process, and network training of a deep convolution neural network and compares the algorithm with a traditional machine learning algorithm. The results show that the accuracy and recall rate for Chinese characters on Ming Dynasty plaques peak at 81.38% and 81.31%, respectively. When the number of training samples increases to 50, the recognition rate of MFA is 99.72%, which is much higher than that of other algorithms. This shows that the algorithm based on a deep convolution neural network and big data analysis performs well and can effectively identify Chinese characters across different dynasties, sample sizes, and interference factors, which provides a useful reference for the extraction of ancient Chinese characters.


Comparing shopping experiences in department stores and street markets: a big data analysis of TripAdvisor reviews

International Journal of Culture Tourism and Hospitality Research ◽

10.1108/ijcthr-10-2020-0228 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print)

Author(s):

Chayanon Phucharoen ◽

Tatiyaporn Jarumaneerat ◽

Nichapat Sangkaew

Keyword(s):

Big Data ◽

Data Analysis ◽

Learning Algorithm ◽

Big Data Analysis ◽

Department Stores ◽

Shopping Malls ◽

Content Type ◽

Shopping Experience ◽

The Mean

Purpose: Based on big data analytical and statistical techniques, this study aims to examine tourists’ shopping experiences at department stores and street markets in Phuket.

Design/methodology/approach: A Naïve Bayes machine learning algorithm was used to identify the most frequently used terms in TripAdvisor reviews of both department stores and street markets contributed by the same pool of 729 tourists.

Findings: A total of 18 out of 62 terms used were common in reviews of both shopping settings. However, the study found significant differences in the mean use of the 18 common terms and the likelihood of those terms being used in overall positive reviews.

Practical implications: The study’s findings indicate differences in tourist shopping experiences at department stores and street markets. Several concrete recommendations are made, including a greater focus on the linkage to the national characteristic of street markets, and particularly the quality of local fruit, to enhance the tourist shopping experience.

Originality/value: Understanding the differences between shopping malls and street markets from the tourist’s perspective would further enhance the coexistence of shopping malls and street markets in tourism-led growth cities. As such, using reviews of both shopping malls and street markets from an identical pool of tourists, the present study will analyse and compare tourists’ actual shopping experiences, thereby addressing this gap in the research canon via integrated statistical and big data analysis techniques.
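The abstract above reports the likelihood of terms appearing in overall positive reviews but does not describe the computation. One common way to get such a figure from a Naïve Bayes model is the smoothed per-class term likelihood; the sketch below shows that idea with the standard library only. It is an assumption about the general technique, not the study's pipeline: the function name, the whitespace tokenization, the smoothing constant `alpha`, and the toy reviews are all hypothetical.

```python
from collections import Counter
import math

def term_log_odds(reviews, labels, alpha=1.0):
    """Per-term log P(term | positive) minus log P(term | negative).

    Uses multinomial term counts with Laplace smoothing (alpha).
    A positive value means the term is more typical of positive reviews.
    """
    counts = {True: Counter(), False: Counter()}
    for text, positive in zip(reviews, labels):
        counts[bool(positive)].update(text.lower().split())
    vocab = set(counts[True]) | set(counts[False])
    totals = {c: sum(counts[c].values()) for c in counts}
    v = len(vocab)
    return {
        t: math.log((counts[True][t] + alpha) / (totals[True] + alpha * v))
        - math.log((counts[False][t] + alpha) / (totals[False] + alpha * v))
        for t in vocab
    }

# Made-up toy reviews, purely for illustration (not data from the study).
reviews = ["fresh fruit great", "crowded expensive", "great fresh"]
labels = [True, False, True]  # True = overall positive review
scores = term_log_odds(reviews, labels)
for term in sorted(scores, key=scores.get, reverse=True):
    print(f"{term:10s} {scores[term]:+.2f}")
```

Running the same computation separately on department store reviews and street market reviews would give two scores per shared term, which is the kind of comparison the findings describe.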


A Study on Effectiveness of a New Network Marketing Model with M-Code Compensation System Using Big Data Analysis

10.33645/cnc.2018.12.40.8.1015 ◽

2018 ◽

Vol 40 (8) ◽

pp. 1015-1042

Author(s):

Koono Kim ◽

Hyebong Choi

Keyword(s):

Big Data ◽

Data Analysis ◽

Big Data Analysis ◽

Compensation System ◽

Network Marketing ◽

Marketing Model


Insights into seismic hazard from big data analysis of ground motion simulations

International Journal of Safety and Security Engineering ◽

10.2495/safe-v9-n1-01-12 ◽

2019 ◽

Vol 9 (1) ◽

pp. 01-12 ◽

Cited By ~ 1

Author(s):

Kristy F. Tiampo ◽

Javad Kazemian ◽

Hadi Ghofrani ◽

Yelena Kropivnitskaya ◽

Gero Michel

Keyword(s):

Big Data ◽

Data Analysis ◽

Seismic Hazard ◽

Ground Motion ◽

Big Data Analysis ◽

Ground Motion Simulations


A Study on Social Cognitions of ‘Chinese Education’ in Korea Using Big Data Analysis

The Journal of Chinese Language and Literature ◽

10.25021/jcll.2019.6.116.83 ◽

2019 ◽

Vol 116 ◽

pp. 83-112

Author(s):

Eun-jae Choi

Keyword(s):

Big Data ◽

Data Analysis ◽

Big Data Analysis ◽

Social Cognitions ◽

Chinese Education


AN EFFICIENT DEDUPLICATION MECHANISM FOR BIG DATA ANALYSIS IN CLOUD ENVIRONMENTS

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i4.389395 ◽

2018 ◽

Vol 6 (4) ◽

pp. 389-395

Author(s):

M. Murugesan ◽

A. Kalaiyarasi

Keyword(s):

Big Data ◽

Data Analysis ◽

Big Data Analysis ◽

Cloud Environments


A Social Big Data Analysis on Sport Participation

Korean Journal of Sport Management ◽

10.31308/kssm.25.2.2 ◽

2020 ◽

Vol 25 (2) ◽

pp. 18-30

Author(s):

Seung Wook Oh ◽

Jin-Wook Han ◽

Min Soo Kim

Keyword(s):

Big Data ◽

Data Analysis ◽

Sport Participation ◽

Big Data Analysis ◽

Social Big Data
