Research on Methodology of Correlation Analysis of Sci-Tech Literature Based on Deep Learning Technology in the Big Data

Wen Zeng; Hongjiao Xu; Hui Li; Xiang Li

doi:10.4018/jdm.2018070104

Research on Methodology of Correlation Analysis of Sci-Tech Literature Based on Deep Learning Technology in the Big Data

Deep Learning and Neural Networks ◽

10.4018/978-1-7998-0414-7.ch085 ◽

2020 ◽

pp. 1524-1546

Author(s):

Wen Zeng ◽

Hongjiao Xu ◽

Hui Li ◽

Xiang Li

Keyword(s):

Big Data ◽

Deep Learning ◽

Correlation Analysis ◽

Vector Space ◽

Input Word ◽

Learning Technology ◽

Depth Analysis ◽

Presentation Method ◽

High Level ◽

Deep Learning Model

In the big data era, it is a great challenge to identify high-level abstract features out of a flood of sci-tech literature to achieve in-depth analysis of data. The deep learning technology has developed rapidly and achieved applications in many fields, but has rarely been utilized in the research of sci-tech literature data. This article introduced the presentation method of vector space of terminologies in sci-tech literature based on the deep learning model. It explored and adopted a deep AE model to reduce the dimensionality of input word vector feature. Also put forward is the methodology of correlation analysis of sci-tech literature based on deep learning technology. The experimental results showed that the processing of sci-tech literature data could be simplified into the computation of vectors in the multi-dimensional vector space, and the similarity in vector space could be used to represent similarity in text semantics. The correlation analysis of subject contents between sci-tech literatures of the same or different types can be made using this method.

Download Full-text

A hybrid deep learning model for efficient intrusion detection in big data environment

Information Sciences ◽

10.1016/j.ins.2019.10.069 ◽

2020 ◽

Vol 513 ◽

pp. 386-396 ◽

Cited By ~ 26

Author(s):

Mohammad Mehedi Hassan ◽

Abdu Gumaei ◽

Ahmed Alsanad ◽

Majed Alrubaian ◽

Giancarlo Fortino

Keyword(s):

Big Data ◽

Deep Learning ◽

Intrusion Detection ◽

Learning Model ◽

Data Environment ◽

Deep Learning Model

Download Full-text

A novel two-stage method of plant seedlings classification based on deep learning

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-211507 ◽

2021 ◽

pp. 1-11

Author(s):

Tianhong Dai ◽

Shijie Cong ◽

Jianping Huang ◽

Yanwen Zhang ◽

Xinwang Huang ◽

...

Keyword(s):

Deep Learning ◽

Learning Technology ◽

Two Stage ◽

Second Stage ◽

Stage Classification ◽

Different Types ◽

Two Stages ◽

Plant Seedlings

In agricultural production, weed removal is an important part of crop cultivation, but inevitably, other plants compete with crops for nutrients. Only by identifying and removing weeds can the quality of the harvest be guaranteed. Therefore, the distinction between weeds and crops is particularly important. Recently, deep learning technology has also been applied to the field of botany, and achieved good results. Convolutional neural networks are widely used in deep learning because of their excellent classification effects. The purpose of this article is to find a new method of plant seedling classification. This method includes two stages: image segmentation and image classification. The first stage is to use the improved U-Net to segment the dataset, and the second stage is to use six classification networks to classify the seedlings of the segmented dataset. The dataset used for the experiment contained 12 different types of plants, namely, 3 crops and 9 weeds. The model was evaluated by the multi-class statistical analysis of accuracy, recall, precision, and F1-score. The results show that the two-stage classification method combining the improved U-Net segmentation network and the classification network was more conducive to the classification of plant seedlings, and the classification accuracy reaches 97.7%.

Download Full-text

Prediction of Merchandise Sales on E-Commerce Platforms Based on Data Mining and Deep Learning

Scientific Programming ◽

10.1155/2021/2179692 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Xiaoting Yin ◽

Xiaosha Tao

Keyword(s):

Data Mining ◽

Deep Learning ◽

Learning Algorithm ◽

Research Process ◽

Deep Learning Algorithm ◽

Sales Prediction ◽

Different Types ◽

Online Business ◽

Product Sales ◽

Deep Learning Model

Online business has grown exponentially during the last decade, and the industries are focusing on online business more than before. However, just setting up an online store and starting selling might not work. Different machine learning and data mining techniques are needed to know the users’ preferences and know what would be best for business. According to the decision-making needs of online product sales, combined with the influencing factors of online product sales in various industries and the advantages of deep learning algorithm, this paper constructs a sales prediction model suitable for online products and focuses on evaluating the adaptability of the model in different types of online products. In the research process, the full connection model is compared with the training results of CNN, which proves the accuracy and generalization ability of CNN model. By selecting the non-deep learning model as the comparison baseline, the performance advantages of CNN model under different categories of products are proved. In addition, the experiment concludes that the unsupervised pretrained CNN model is more effective and adaptable in sales forecasting.

Download Full-text

In Machines We Trust: Are Robo-Advisers More Trustworthy Than Human Financial Advisers?

Law, Technology and Humans ◽

10.5204/lthj.v1i0.1261 ◽

2019 ◽

pp. 129-141 ◽

Cited By ~ 1

Author(s):

Hui Xian Chia

Keyword(s):

Deep Learning ◽

Black Box ◽

Best Interests ◽

Decision Making Process ◽

Learning Technology ◽

Technical Expertise ◽

Learning Agents ◽

The People ◽

Self Interest ◽

Deep Learning Model

This article examines the use of artificial intelligence (AI) and deep learning, specifically, to create financial robo-advisers. These machines have the potential to be perfectly honest fiduciaries, acting in their client’s best interests without conflicting self-interest or greed, unlike their human counterparts. However, the application of AI technology to create financial robo-advisers is not without risk. This article will focus on the unique risks posed by deep learning technology. One of the main fears regarding deep learning is that it is a “black box”, its decision-making process is opaque and not open to scrutiny even by the people who developed it. This poses a significant challenge to financial regulators, whom would not be able to examine the underlying rationale and rules of the robo-adviser to determine its safety for public use. The rise of deep learning has been met with calls for ‘explainability’ of how deep learning agents make their decisions. This paper argues that greater explainability can be achieved by describing the ‘personality’ of deep learning robo-advisers, and further proposes a framework for describing the parameters of the deep learning model using concepts that can be readily understood by people without technical expertise. This regards whether the robo-adviser is ‘greedy’, ‘selfish’ or ‘prudent’. Greater understanding will enable regulators and consumers to better judge the safety and suitability of deep learning financial robo-advisers.

Download Full-text

Study on Feature Selection and Feature Deep Learning Model For Big Data

2018 3rd International Conference on Smart City and Systems Engineering (ICSCSE) ◽

10.1109/icscse.2018.00171 ◽

2018 ◽

Cited By ~ 2

Author(s):

Ping Yu ◽

Hui Yan

Keyword(s):

Feature Selection ◽

Big Data ◽

Deep Learning ◽

Learning Model ◽

Deep Learning Model

Download Full-text

Deep Learning Model and Its Application in Big Data

Design, User Experience, and Usability: Theory and Practice - Lecture Notes in Computer Science ◽

10.1007/978-3-319-91797-9_55 ◽

2018 ◽

pp. 795-806

Author(s):

Yuanming Zhou ◽

Shifeng Zhao ◽

Xuesong Wang ◽

Wei Liu

Keyword(s):

Big Data ◽

Deep Learning ◽

Learning Model ◽

Deep Learning Model

Download Full-text

A High-Order CFS Algorithm for Clustering Big Data

Mobile Information Systems ◽

10.1155/2016/4356127 ◽

2016 ◽

Vol 2016 ◽

pp. 1-8 ◽

Cited By ~ 4

Author(s):

Fanyu Bu ◽

Zhikui Chen ◽

Peng Li ◽

Tong Tang ◽

Ying Zhang

Keyword(s):

Big Data ◽

Deep Learning ◽

Clustering Algorithm ◽

Big Data Analytics ◽

Learning Model ◽

Heterogeneous Data ◽

High Order ◽

Tensor Model ◽

Industrial Internet ◽

Deep Learning Model

With the development of Internet of Everything such as Internet of Things, Internet of People, and Industrial Internet, big data is being generated. Clustering is a widely used technique for big data analytics and mining. However, most of current algorithms are not effective to cluster heterogeneous data which is prevalent in big data. In this paper, we propose a high-order CFS algorithm (HOCFS) to cluster heterogeneous data by combining the CFS clustering algorithm and the dropout deep learning model, whose functionality rests on three pillars: (i) an adaptive dropout deep learning model to learn features from each type of data, (ii) a feature tensor model to capture the correlations of heterogeneous data, and (iii) a tensor distance-based high-order CFS algorithm to cluster heterogeneous data. Furthermore, we verify our proposed algorithm on different datasets, by comparison with other two clustering schemes, that is, HOPCM and CFS. Results confirm the effectiveness of the proposed algorithm in clustering heterogeneous data.

Download Full-text

BIG DATA ENSEMBLE CLINICAL PREDICTION FOR HEALTHCARE DATA BY USING DEEP LEARNING MODEL

International Journal of Big Data Intelligence ◽

10.1504/ijbdi.2018.10008867 ◽

2018 ◽

Vol 1 (1) ◽

pp. 1

Author(s):

Gondkar R R ◽

Sreekanth Rallapalli

Keyword(s):

Big Data ◽

Deep Learning ◽

Learning Model ◽

Clinical Prediction ◽

Healthcare Data ◽

Deep Learning Model

Download Full-text

A Screening Method to Identify Potential Endocrine Disruptors Using Chemical Toxicity Big Data and a Deep Learning Model with a Focus on Cleaning and Laundry Products

Korean Journal of Environmental Health Sciences ◽

10.5668/jehs.2021.47.5.462 ◽

2021 ◽

Vol 47 (5) ◽

pp. 462-471

Author(s):

Inhye Lee ◽

Sujin Lee ◽

Kyunghee Ji

Keyword(s):

Big Data ◽

Deep Learning ◽

Endocrine Disruptors ◽

Screening Method ◽

Learning Model ◽

Chemical Toxicity ◽

Laundry Products ◽

Deep Learning Model

Download Full-text