Open Relation Extraction in Patent Claims with a Hybrid Network

2021 ◽  
Vol 2021 ◽  
pp. 1-7
Author(s):  
Boting Geng

Research on relation extraction from patent documents, a high-priority topic in natural language processing in recent years, is of great significance to a series of downstream patent applications, such as patent content mining, patent retrieval, and patent knowledge base construction. Owing to lengthy sentences, cross-domain technical terms, and the complex structure of patent claims, it is extremely difficult to extract open triples with traditional Natural Language Processing (NLP) parsers. In this paper, we propose an Open Relation Extraction (ORE) approach that recasts relation extraction in patent claims as a sequence labeling problem and extracts non-predefined relationship triples from patent claims with a hybrid neural network architecture based on a multi-head attention mechanism. The hybrid framework combines a Bi-LSTM and a CNN to extract argument-phrase and relation-phrase features simultaneously: the Bi-LSTM captures long-distance dependency features, the CNN obtains local content features, and the multi-head attention mechanism captures potential dependencies across the time steps of the recurrent model. Applied to our constructed open patent relation dataset, the proposed network outperforms both traditional machine-learning classification algorithms and state-of-the-art neural network classification models in terms of Precision, Recall, and F1.
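The abstract does not include an implementation; a minimal PyTorch sketch of the general idea (a Bi-LSTM branch for long-distance features, a CNN branch for local features, multi-head attention over the recurrent outputs, and a token-level tagging head for argument and relation phrases) could look as follows. The layer sizes, vocabulary size, and tag count are illustrative assumptions, not values from the paper.

```python
import torch
import torch.nn as nn

class HybridORETagger(nn.Module):
    """Hybrid Bi-LSTM + CNN sequence labeller with multi-head attention (illustrative sketch)."""
    def __init__(self, vocab_size, num_tags, emb_dim=128, hidden=128, heads=4):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        # Bi-LSTM branch: long-distance dependency features
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)
        # CNN branch: local n-gram features (padding keeps the sequence length unchanged)
        self.conv = nn.Conv1d(emb_dim, 2 * hidden, kernel_size=3, padding=1)
        # Multi-head attention over the recurrent outputs
        self.attn = nn.MultiheadAttention(2 * hidden, num_heads=heads, batch_first=True)
        self.classifier = nn.Linear(4 * hidden, num_tags)

    def forward(self, token_ids):
        x = self.embed(token_ids)                                # (B, T, E)
        rnn_out, _ = self.bilstm(x)                              # (B, T, 2H)
        attn_out, _ = self.attn(rnn_out, rnn_out, rnn_out)       # dependencies across time steps
        cnn_out = self.conv(x.transpose(1, 2)).transpose(1, 2)   # (B, T, 2H)
        feats = torch.cat([attn_out, cnn_out], dim=-1)
        return self.classifier(feats)                            # per-token tag scores

# Toy usage with a hypothetical BIO-style tag set for argument and relation phrases
model = HybridORETagger(vocab_size=5000, num_tags=7)
scores = model(torch.randint(1, 5000, (2, 40)))                  # (2, 40, 7)
```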

2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Venkateswara Rao Kota ◽  
Shyamala Devi Munisamy

Purpose: A neural network (NN)-based deep learning (DL) approach is considered for sentiment analysis (SA), incorporating a convolutional neural network (CNN), bi-directional long short-term memory (Bi-LSTM) and attention methods. Unlike conventional supervised machine-learning NLP algorithms, the authors use unsupervised deep-learning algorithms.

Design/methodology/approach: The sentiment analysis method is designed using a CNN, a Bi-LSTM and an attention mechanism, with Word2vec word embeddings for natural language processing (NLP). The approach targets sentence-level SA and consists of one embedding layer, two convolutional layers with max-pooling, one LSTM layer and two fully connected (FC) layers. The overall system training time is 30 min.

Findings: Performance is analyzed using precision, recall, F1 score and accuracy. The CNN helps reduce complexity, and the Bi-LSTM helps process long input text sequences.

Originality/value: The attention mechanism is adopted to decide the significance of every hidden state and to produce a weighted sum of all the features fed as input.
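No code accompanies the abstract; the sketch below assembles the described stack (embedding, two convolutional layers with max-pooling, a Bi-LSTM, attention over hidden states, and two fully connected layers) in PyTorch. Vocabulary size, channel counts and the two-class output are assumptions for illustration; pretrained Word2vec vectors would normally be loaded into the embedding layer.

```python
import torch
import torch.nn as nn

class CnnBiLstmAttnSA(nn.Module):
    """Sentence-level sentiment classifier sketch: embedding -> two conv+max-pool blocks ->
    Bi-LSTM -> attention pooling -> two fully connected layers."""
    def __init__(self, vocab_size, emb_dim=300, channels=128, hidden=128, classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)  # Word2vec weights could be loaded here
        self.conv1 = nn.Conv1d(emb_dim, channels, kernel_size=3, padding=1)
        self.conv2 = nn.Conv1d(channels, channels, kernel_size=3, padding=1)
        self.pool = nn.MaxPool1d(2)
        self.bilstm = nn.LSTM(channels, hidden, batch_first=True, bidirectional=True)
        self.attn_score = nn.Linear(2 * hidden, 1)         # scores the significance of each hidden state
        self.fc1 = nn.Linear(2 * hidden, hidden)
        self.fc2 = nn.Linear(hidden, classes)

    def forward(self, token_ids):
        x = self.embed(token_ids).transpose(1, 2)           # (B, E, T)
        x = self.pool(torch.relu(self.conv1(x)))
        x = self.pool(torch.relu(self.conv2(x)))
        h, _ = self.bilstm(x.transpose(1, 2))                # (B, T', 2H)
        weights = torch.softmax(self.attn_score(h), dim=1)   # attention weights over time steps
        context = (weights * h).sum(dim=1)                   # weighted sum of all features
        return self.fc2(torch.relu(self.fc1(context)))

# Toy forward pass on a batch of four 64-token sentences
logits = CnnBiLstmAttnSA(vocab_size=20000)(torch.randint(1, 20000, (4, 64)))
```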


CONVERTER ◽  
2021 ◽  
pp. 579-590
Author(s):  
Weirong Xiu

A tandem joint algorithm (CATIR) combining a convolutional neural network based on an attention mechanism with a bidirectional independent recurrent neural network is proposed. Drawing on natural language processing techniques, word-vector features are extracted from URLs, and the extracted URL information features are merged with host information features. The proposed CATIR algorithm uses a CNN (Convolutional Neural Network) to obtain deep local features in the data, an attention mechanism to adjust the weights, and an IndRNN (Independent Recurrent Neural Network) to obtain global features. The experimental results show that the CATIR algorithm significantly improves the accuracy of malicious URL detection over traditional algorithms, reaching 96.9%.
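PyTorch has no built-in IndRNN, so the sketch below hand-rolls a minimal IndRNN cell (element-wise recurrence) and wires it after a CNN feature extractor and an attention re-weighting step, roughly mirroring the described CATIR pipeline over character-level URL input. All dimensions and the character vocabulary are assumptions; the real system also merges host-information features, which are omitted here.

```python
import torch
import torch.nn as nn

class IndRNNCell(nn.Module):
    """Minimal IndRNN step: h_t = relu(W x_t + u * h_{t-1}), element-wise recurrence."""
    def __init__(self, in_dim, hidden):
        super().__init__()
        self.w = nn.Linear(in_dim, hidden)
        self.u = nn.Parameter(torch.ones(hidden) * 0.5)

    def forward(self, x_seq):                        # x_seq: (B, T, in_dim)
        h = x_seq.new_zeros(x_seq.size(0), self.u.numel())
        outs = []
        for t in range(x_seq.size(1)):
            h = torch.relu(self.w(x_seq[:, t]) + self.u * h)
            outs.append(h)
        return torch.stack(outs, dim=1)              # (B, T, hidden)

class CATIRLikeDetector(nn.Module):
    """CNN local features -> attention re-weighting -> bidirectional IndRNN -> URL score (sketch)."""
    def __init__(self, vocab_size=128, emb_dim=64, channels=64, hidden=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        self.conv = nn.Conv1d(emb_dim, channels, kernel_size=3, padding=1)
        self.attn_score = nn.Linear(channels, 1)
        self.fwd = IndRNNCell(channels, hidden)
        self.bwd = IndRNNCell(channels, hidden)
        self.out = nn.Linear(2 * hidden, 1)

    def forward(self, char_ids):                     # char_ids: (B, T) URL characters
        x = torch.relu(self.conv(self.embed(char_ids).transpose(1, 2))).transpose(1, 2)
        x = x * torch.softmax(self.attn_score(x), dim=1)   # attention adjusts feature weights
        h = torch.cat([self.fwd(x)[:, -1], self.bwd(x.flip(1))[:, -1]], dim=-1)
        return torch.sigmoid(self.out(h))            # probability the URL is malicious

prob = CATIRLikeDetector()(torch.randint(1, 128, (8, 100)))
```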


Author(s):  
Jie Liu ◽  
Shaowei Chen ◽  
Bingquan Wang ◽  
Jiaxin Zhang ◽  
Na Li ◽  
...  

Joint entity and relation extraction is critical for many natural language processing (NLP) tasks and has attracted increasing research interest. However, it still faces the challenges of identifying overlapping relation triplets together with the entire entity boundaries and of detecting multi-type relations. In this paper, we propose an attention-based joint model, which mainly contains an entity extraction module and a relation detection module, to address these challenges. The key of our model is a supervised multi-head self-attention mechanism devised as the relation detection module, which learns the token-level correlation for each relation type separately. With this attention mechanism, our model can effectively identify overlapping relations and flexibly predict the relation type with its corresponding intensity. To verify the effectiveness of our model, we conduct comprehensive experiments on two benchmark datasets. The experimental results demonstrate that our model achieves state-of-the-art performance.
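The core idea, learning token-level correlations separately for each relation type with a supervised multi-head self-attention mechanism, can be sketched as one attention head per relation producing a token-by-token score map. The snippet below is a hedged approximation of that module only (not the full joint model), with hidden size, head dimension and relation count chosen arbitrarily.

```python
import torch
import torch.nn as nn

class RelationDetectionHead(nn.Module):
    """Supervised multi-head self-attention sketch: one head per relation type, producing a
    (token x token) score map whose entries give the intensity of that relation between tokens."""
    def __init__(self, hidden=256, num_relations=5, head_dim=64):
        super().__init__()
        self.q = nn.Linear(hidden, num_relations * head_dim)
        self.k = nn.Linear(hidden, num_relations * head_dim)
        self.num_relations, self.head_dim = num_relations, head_dim

    def forward(self, token_repr):                      # (B, T, hidden) from any encoder
        B, T, _ = token_repr.shape
        q = self.q(token_repr).view(B, T, self.num_relations, self.head_dim).transpose(1, 2)
        k = self.k(token_repr).view(B, T, self.num_relations, self.head_dim).transpose(1, 2)
        # (B, R, T, T): per-relation correlation between every pair of tokens
        scores = torch.matmul(q, k.transpose(-2, -1)) / self.head_dim ** 0.5
        return torch.sigmoid(scores)                    # trained against per-pair binary labels

# Toy usage on random encoder outputs
intensity = RelationDetectionHead()(torch.randn(2, 30, 256))   # (2, 5, 30, 30)
```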


2020 ◽  
Vol 10 (17) ◽  
pp. 5841 ◽  
Author(s):  
Beakcheol Jang ◽  
Myeonghwi Kim ◽  
Gaspard Harerimana ◽  
Sang-ug Kang ◽  
Jong Wook Kim

There is a need to extract meaningful information from big data, classify it into different categories, and predict end-user behavior or emotions. Large amounts of data are generated from various sources such as social media and websites. Text classification is a representative research topic in natural-language processing (NLP) that categorizes unstructured text data into meaningful classes. The long short-term memory (LSTM) model and the convolutional neural network for sentence classification produce accurate results and have recently been used in various NLP tasks. Convolutional neural network (CNN) models use convolutional layers and maximum-pooling or max-over-time-pooling layers to extract higher-level features, while LSTM models can capture long-term dependencies between word sequences and are therefore well suited to text classification. However, even with a hybrid approach that leverages the strengths of these two deep-learning models, the number of features to remember for classification remains huge, hindering the training process. In this study, we propose an attention-based Bi-LSTM+CNN hybrid model that capitalizes on the advantages of LSTM and CNN with an additional attention mechanism. We trained the model on the Internet Movie Database (IMDB) movie review data to evaluate its performance, and the test results showed that the proposed hybrid attention Bi-LSTM+CNN model produces more accurate classification results, as well as higher recall and F1 scores, than the individual multi-layer perceptron (MLP), CNN and LSTM models, as well as the hybrid models.
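As a rough illustration of the described hybrid, the following PyTorch sketch runs a Bi-LSTM over the embedded review, re-weights its hidden states with a simple attention layer, and applies a convolution with max-over-time pooling before classification. Embedding dimension, hidden sizes and vocabulary size are placeholders rather than the paper's settings.

```python
import torch
import torch.nn as nn

class AttnBiLstmCnnClassifier(nn.Module):
    """Attention-based Bi-LSTM + CNN text classifier sketch: the Bi-LSTM captures long-term
    dependencies, attention re-weights its hidden states, and a CNN with max-over-time
    pooling extracts higher-level features before classification."""
    def __init__(self, vocab_size, emb_dim=100, hidden=128, channels=100, classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)
        self.attn = nn.Linear(2 * hidden, 1)
        self.conv = nn.Conv1d(2 * hidden, channels, kernel_size=3, padding=1)
        self.fc = nn.Linear(channels, classes)

    def forward(self, token_ids):                        # (B, T) movie-review tokens
        h, _ = self.bilstm(self.embed(token_ids))         # (B, T, 2H)
        h = h * torch.softmax(self.attn(h), dim=1)        # attention focuses on salient words
        c = torch.relu(self.conv(h.transpose(1, 2)))       # (B, C, T)
        pooled = c.max(dim=-1).values                      # max-over-time pooling
        return self.fc(pooled)

# Toy forward pass with hypothetical sizes (positive vs. negative review)
logits = AttnBiLstmCnnClassifier(vocab_size=30000)(torch.randint(1, 30000, (4, 200)))
```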


Author(s):  
Xinyu He ◽  
Bo Yu ◽  
Yonggong Ren

As an important part of biomedical text mining, biomedical events play a key role in improving biomedical research and disease prevention. Trigger identification, i.e., extracting the words that describe event types, is a critical prerequisite step for biomedical event extraction. Traditional methods rely excessively on natural language processing tools in the feature extraction process, incurring a significant manual cost. In addition, because of the particularities of the biomedical literature, the long-distance dependency problem is pronounced. To solve these problems, we propose a hybrid structure, SWACG, which consists of a ReCNN-BiGRU (Residual CNN and Bidirectional Gated Recurrent Unit) hybrid neural network and an MH-attention (Multi-Head attention) mechanism. The proposed model uses the ReCNN to extract vocabulary-level features and the BiGRU to obtain contextual semantic information. Furthermore, a sliding window divides long sentences into equal-length short sentences without destroying context information, which mitigates long-distance dependency. Experimental results show that our method advances the state-of-the-art performance on the commonly used Multi-Level Event Extraction (MLEE) corpus, achieving an F-score of 82.20%.
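A hedged PyTorch sketch of the two ingredients easiest to reconstruct from the abstract, the sliding-window segmentation and a residual-CNN + BiGRU encoder with multi-head attention, is given below. Window length, stride, and the trigger tag set are invented for illustration and do not come from the paper.

```python
import torch
import torch.nn as nn

def sliding_windows(token_ids, window=32, stride=16):
    """Split a long token sequence into equal-length, overlapping windows so that context
    is preserved while distances inside each window stay short (illustrative sketch)."""
    pieces = [token_ids[i:i + window] for i in range(0, len(token_ids), stride)]
    return [p + [0] * (window - len(p)) for p in pieces]   # zero-pad the trailing windows

class SWACGLikeTagger(nn.Module):
    """Residual CNN + BiGRU + multi-head attention trigger tagger (illustrative sketch)."""
    def __init__(self, vocab_size, num_trigger_types, emb_dim=128, hidden=128, heads=4):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        self.conv = nn.Conv1d(emb_dim, emb_dim, kernel_size=3, padding=1)   # residual CNN block
        self.bigru = nn.GRU(emb_dim, hidden, batch_first=True, bidirectional=True)
        self.attn = nn.MultiheadAttention(2 * hidden, num_heads=heads, batch_first=True)
        self.tagger = nn.Linear(2 * hidden, num_trigger_types)

    def forward(self, window_ids):                          # (B, W) one window per row
        x = self.embed(window_ids)
        x = x + torch.relu(self.conv(x.transpose(1, 2))).transpose(1, 2)    # residual connection
        h, _ = self.bigru(x)                                 # contextual semantic information
        h, _ = self.attn(h, h, h)
        return self.tagger(h)                                # per-token trigger-type scores

# Toy usage: segment a 79-token "sentence" into windows, then tag each window
windows = torch.tensor(sliding_windows(list(range(1, 80)), window=32, stride=16))
scores = SWACGLikeTagger(vocab_size=10000, num_trigger_types=9)(windows)
```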


Author(s):  
Yue Yuan ◽  
Xiaofei Zhou ◽  
Shirui Pan ◽  
Qiannan Zhu ◽  
Zeliang Song ◽  
...  

Joint extraction of entities and relations is an important task in natural language processing (NLP), which aims to capture all relational triplets in plain text. This is challenging because some of the triplets extracted from one sentence may have overlapping entities. Most existing methods perform entity recognition followed by relation detection between every possible entity pair, which usually involves numerous redundant operations. In this paper, we propose a relation-specific attention network (RSAN) to handle this issue. Our RSAN utilizes a relation-aware attention mechanism to construct a specific sentence representation for each relation and then performs sequence labeling to extract the corresponding head and tail entities. Experiments on two public datasets show that our model can effectively extract overlapping triplets and achieves state-of-the-art performance.
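To make the relation-specific attention idea concrete, the sketch below lets each relation embedding attend over the encoded sentence to build a relation-aware representation and then tags head and tail entities per relation. It is an approximation of the mechanism described above, with an assumed BIO-style tag set and arbitrary sizes.

```python
import torch
import torch.nn as nn

class RSANLikeExtractor(nn.Module):
    """Relation-specific attention sketch: each relation embedding attends over the sentence
    to build a relation-aware representation, then a tagger labels head/tail entities."""
    def __init__(self, vocab_size, num_relations, emb_dim=128, hidden=128, num_tags=5):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        self.encoder = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)
        self.rel_embed = nn.Embedding(num_relations, 2 * hidden)
        self.tagger = nn.Linear(4 * hidden, num_tags)        # e.g. O, B-head, I-head, B-tail, I-tail

    def forward(self, token_ids):                            # (B, T)
        h, _ = self.encoder(self.embed(token_ids))           # (B, T, 2H)
        rel = self.rel_embed.weight                          # (R, 2H)
        # relation-aware attention: relevance of each token to each relation
        attn = torch.softmax(torch.einsum('bth,rh->brt', h, rel), dim=-1)   # (B, R, T)
        rel_ctx = torch.einsum('brt,bth->brh', attn, h)                      # (B, R, 2H)
        # fuse token features with the relation-specific context, then tag per relation
        fused = torch.cat([h.unsqueeze(1).expand(-1, rel.size(0), -1, -1),
                           rel_ctx.unsqueeze(2).expand(-1, -1, h.size(1), -1)], dim=-1)
        return self.tagger(fused)                            # (B, R, T, num_tags)

scores = RSANLikeExtractor(vocab_size=8000, num_relations=4)(torch.randint(1, 8000, (2, 25)))
```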


Author(s):  
M I Heywood ◽  
M-C Chan ◽  
C R Chatwin

This paper details the design and application of a hybrid neural network architecture for the rescheduling problem in batch manufacturing. Design issues include the selection of an appropriate neural network paradigm, specification of the network architecture, and support for multistep prediction. Application issues include decoupling the network dimension from that of the problem and the definition of suitable rescheduling operators. The resulting hybrid network is tested against heuristics previously identified as representing typical best- and worst-case performance estimates across a cross-section of batch rescheduling problems.


2020 ◽  
Author(s):  
Vadim V. Korolev ◽  
Artem Mitrofanov ◽  
Kirill Karpov ◽  
Valery Tkachenko

The main advantage of modern natural language processing methods is the possibility of turning an amorphous human-readable task into a strict mathematical form. This makes it possible to extract chemical data and insights from articles and to find new semantic relations. We propose a universal engine for processing chemical and biological texts. We successfully tested it on various use cases and applied it to the search for a therapeutic agent for COVID-19 by analyzing the PubMed archive.


Sensors ◽  
2021 ◽  
Vol 21 (4) ◽  
pp. 1230
Author(s):  
Anda Stoica ◽  
Tibor Kadar ◽  
Camelia Lemnaru ◽  
Rodica Potolea ◽  
Mihaela Dînşoreanu

As virtual home assistants become more popular, there is an emerging need to support languages other than English. While more widespread or popular languages such as Spanish, French or Hindi are already integrated into existing home assistants like Google Home or Alexa, integration of lesser-known languages such as Romanian is still missing. This paper explores the problem of Natural Language Understanding (NLU) applied to a Romanian home assistant. We propose a customized capsule neural network architecture that performs intent detection and slot filling jointly, and we evaluate how well it handles utterances of varying complexity. The capsule network model shows a significant improvement in intent detection compared to models built using the well-known Rasa NLU tool. Through error analysis, we observe clear error patterns that occur systematically. Variability in the language used to express an intent proves to be the biggest challenge encountered by the model.
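The paper's exact capsule architecture is not reproduced here; the sketch below shows one plausible shape of such a joint model, i.e., a shared Bi-LSTM encoder, a per-token slot-filling head, and intent capsules formed from token representations with a few dynamic-routing iterations. The routing scheme, sizes, and the numbers of slots and intents are assumptions for illustration.

```python
import torch
import torch.nn as nn

def squash(s, dim=-1, eps=1e-8):
    """Capsule squashing non-linearity: keeps direction, maps length into (0, 1)."""
    n2 = (s ** 2).sum(dim=dim, keepdim=True)
    return (n2 / (1.0 + n2)) * s / torch.sqrt(n2 + eps)

class JointCapsuleNLU(nn.Module):
    """Joint NLU sketch: Bi-LSTM encoder, per-token slot head, and intent capsules built
    from token capsules with a few dynamic-routing iterations (illustrative assumptions)."""
    def __init__(self, vocab_size, num_slots, num_intents, emb=128, hidden=128, caps=16, iters=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb, padding_idx=0)
        self.encoder = nn.LSTM(emb, hidden, batch_first=True, bidirectional=True)
        self.slot_head = nn.Linear(2 * hidden, num_slots)            # BIO slot tags per token
        self.route_w = nn.Parameter(torch.randn(num_intents, 2 * hidden, caps) * 0.01)
        self.iters = iters

    def forward(self, token_ids):                                     # (B, T)
        h, _ = self.encoder(self.embed(token_ids))                    # (B, T, 2H) token capsules
        slot_logits = self.slot_head(h)
        u_hat = torch.einsum('bth,jhc->btjc', h, self.route_w)        # predictions per intent capsule
        logits = torch.zeros(u_hat.shape[:3], device=h.device)        # routing logits (B, T, J)
        for _ in range(self.iters):                                   # dynamic routing by agreement
            c = torch.softmax(logits, dim=-1)
            v = squash((c.unsqueeze(-1) * u_hat).sum(dim=1))           # (B, J, caps) intent capsules
            logits = logits + torch.einsum('btjc,bjc->btj', u_hat, v)
        intent_scores = v.norm(dim=-1)                                 # capsule length = intent confidence
        return slot_logits, intent_scores

slots, intents = JointCapsuleNLU(6000, num_slots=12, num_intents=7)(torch.randint(1, 6000, (2, 20)))
```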

