An Analysis Method for Interpretability of CNN Text Classification Model

Peng Ce; Bao Tie

doi:10.3390/fi12120228

An Analysis Method for Interpretability of CNN Text Classification Model

Future Internet ◽

10.3390/fi12120228 ◽

2020 ◽

Vol 12 (12) ◽

pp. 228

Author(s):

Peng Ce ◽

Bao Tie

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Text Classification ◽

Sequence Data ◽

Classification Model ◽

Learning Technology ◽

Analysis Method ◽

Text Data ◽

Data Set ◽

Backtracking Analysis

With continuous development of artificial intelligence, text classification has gradually changed from a knowledge-based method to a method based on statistics and machine learning. Among them, it is a very important and efficient way to classify text based on the convolutional neural network (CNN) model. Text data are a kind of sequence data, while time sequentiality of the general text data is relatively weak, so text classification is usually less relevant to the sequential structure of the full text. Therefore, CNN-based text classification has gradually become a research hotspot when dealing with issues of text classification. For machine learning, especially deep learning, model interpretability has increasingly become the focus of academic research and industrial applications, and also become a key issue for further development and application of deep learning technology. Therefore, we recommend using the backtracking analysis method to conduct in-depth research on deep learning models. This paper proposes an analysis method for interpretability of a CNN text classification model. The method proposed by us can perform multi-angle analysis on the discriminant results of multi-classified text and multi-label classification tasks through backtracking analysis on model prediction results. Finally, the analysis results of the model can be displayed using visualization technology from multiple dimensions based on interpretability. The representative data set IMDB (Internet Movie Database) in text classification is verified by examples, and the results show that the model can be effectively analyzed when using our method.

Download Full-text

Deep Learning for text in limted data settings

10.36227/techrxiv.12100692 ◽

2020 ◽

Author(s):

Pathikkumar Patel ◽

Bhargav Lad ◽

Jinan Fiaidhi

Keyword(s):

Machine Learning ◽

Time Series ◽

Deep Learning ◽

Sentiment Analysis ◽

Transfer Learning ◽

Text Classification ◽

State Of The Art ◽

Time Series Forecasting ◽

Text Data ◽

Performance Levels

During the last few years, RNN models have been extensively used and they have proven to be better for sequence and text data. RNNs have achieved state-of-the-art performance levels in several applications such as text classification, sequence to sequence modelling and time series forecasting. In this article we will review different Machine Learning and Deep Learning based approaches for text data and look at the results obtained from these methods. This work also explores the use of transfer learning in NLP and how it affects the performance of models on a specific application of sentiment analysis.

Download Full-text

Text classification models for the automatic detection of nonmedical prescription medication use from social media

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-021-01394-0 ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Mohammed Ali Al-Garadi ◽

Yuan-Chi Yang ◽

Haitao Cai ◽

Yucheng Ruan ◽

Karen O’Connor ◽

...

Keyword(s):

Machine Learning ◽

Social Media ◽

Deep Learning ◽

Text Classification ◽

Prescription Medication ◽

The United States ◽

Language Models ◽

Classification Model ◽

Future Research ◽

Classification Models

Abstract Background Prescription medication (PM) misuse/abuse has emerged as a national crisis in the United States, and social media has been suggested as a potential resource for performing active monitoring. However, automating a social media-based monitoring system is challenging—requiring advanced natural language processing (NLP) and machine learning methods. In this paper, we describe the development and evaluation of automatic text classification models for detecting self-reports of PM abuse from Twitter. Methods We experimented with state-of-the-art bi-directional transformer-based language models, which utilize tweet-level representations that enable transfer learning (e.g., BERT, RoBERTa, XLNet, AlBERT, and DistilBERT), proposed fusion-based approaches, and compared the developed models with several traditional machine learning, including deep learning, approaches. Using a public dataset, we evaluated the performances of the classifiers on their abilities to classify the non-majority “abuse/misuse” class. Results Our proposed fusion-based model performs significantly better than the best traditional model (F1-score [95% CI]: 0.67 [0.64–0.69] vs. 0.45 [0.42–0.48]). We illustrate, via experimentation using varying training set sizes, that the transformer-based models are more stable and require less annotated data compared to the other models. The significant improvements achieved by our best-performing classification model over past approaches makes it suitable for automated continuous monitoring of nonmedical PM use from Twitter. Conclusions BERT, BERT-like and fusion-based models outperform traditional machine learning and deep learning models, achieving substantial improvements over many years of past research on the topic of prescription medication misuse/abuse classification from social media, which had been shown to be a complex task due to the unique ways in which information about nonmedical use is presented. Several challenges associated with the lack of context and the nature of social media language need to be overcome to further improve BERT and BERT-like models. These experimental driven challenges are represented as potential future research directions.

Download Full-text

Text Classification Models for the Automatic Detection of Nonmedical Prescription Medication Use from Social Media

10.21203/rs.3.rs-58679/v1 ◽

2020 ◽

Cited By ~ 1

Author(s):

Mohammed Al-Garadi ◽

Yuan-Chi Yang ◽

Haitao Cai ◽

Yucheng Ruan ◽

Karen O’Connor ◽

...

Keyword(s):

Machine Learning ◽

Social Media ◽

Deep Learning ◽

Text Classification ◽

Prescription Medication ◽

The United States ◽

Language Models ◽

Classification Model ◽

Future Research ◽

Classification Models

Abstract Background Prescription medication (PM) misuse/abuse has emerged as a national crisis in the United States, and social media has been suggested as a potential resource for performing active monitoring. However, automating a social media-based monitoring system is challenging—requiring advanced natural language processing (NLP) and machine learning methods. In this paper, we describe the development and evaluation of automatic text classification models for detecting self-reports of PM abuse from Twitter. Methods We experimented with state-of-the-art bi-directional transformer-based language models, which utilize tweet-level representations that enable transfer learning (e.g., BERT, RoBERTa, XLNet, AlBERT, and DistilBERT), proposed fusion-based approaches, and compared the developed models with several traditional machine learning, including deep learning, approaches. Using a public dataset, we evaluated the performances of the classifiers on their abilities to classify the non-majority “abuse/misuse” class. Results Our proposed fusion-based model performs significantly better than the best traditional model (F1-score [95% CI]: 0.67 [0.64–0.69] vs. 0.45 [0.42–0.48]). We illustrate, via experimentation using differing training set sizes, that the transformer-based models are more stable and require less annotated data compared to the other models. The significant improvements achieved by our best-performing classification model over past approaches makes it suitable for automated continuous monitoring of nonmedical PM use from Twitter. Conclusions BERT, BERT-like and fusion-based models not only outperform traditional machine learning and deep learning models, but also show substantial improvements over many years of past research on the topic of prescription medication misuse/abuse classification from social media, which had been shown to be a complex task due to the unique ways in which information about nonmedical use is presented. However, several challenges, such as lack of complete context and the nature of social media language, must be overcome to further improve BERT and BERT-like models despite their advantages over other approaches. These experimental driven challenges are represented as potential future research directions.

Download Full-text

Deep Learning for text in limted data settings

10.36227/techrxiv.12100692.v1 ◽

2020 ◽

Author(s):

Pathikkumar Patel ◽

Bhargav Lad ◽

Jinan Fiaidhi

Keyword(s):

Machine Learning ◽

Time Series ◽

Deep Learning ◽

Sentiment Analysis ◽

Transfer Learning ◽

Text Classification ◽

State Of The Art ◽

Time Series Forecasting ◽

Text Data ◽

Performance Levels

Download Full-text

A New Text Classification Model Based on Contrastive Word Embedding for Detecting Cybersecurity Intelligence in Twitter

Electronics ◽

10.3390/electronics9091527 ◽

2020 ◽

Vol 9 (9) ◽

pp. 1527 ◽

Cited By ~ 1

Author(s):

Han-Sub Shin ◽

Hyuk-Yoon Kwon ◽

Seung-Jin Ryu

Keyword(s):

Deep Learning ◽

Text Classification ◽

Area Under The Curve ◽

Word Embedding ◽

Classification Model ◽

Data Set ◽

Feature Vectors ◽

Model Based ◽

Proposed Model ◽

The Difference

Detecting cybersecurity intelligence (CSI) on social media such as Twitter is crucial because it allows security experts to respond cyber threats in advance. In this paper, we devise a new text classification model based on deep learning to classify CSI-positive and -negative tweets from a collection of tweets. For this, we propose a novel word embedding model, called contrastive word embedding, that enables to maximize the difference between base embedding models. First, we define CSI-positive and -negative corpora, which are used for constructing embedding models. Here, to supplement the imbalance of tweet data sets, we additionally employ the background knowledge for each tweet corpus: (1) CVE data set for CSI-positive corpus and (2) Wikitext data set for CSI-negative corpus. Second, we adopt the deep learning models such as CNN or LSTM to extract adequate feature vectors from the embedding models and integrate the feature vectors into one classifier. To validate the effectiveness of the proposed model, we compare our method with two baseline classification models: (1) a model based on a single embedding model constructed with CSI-positive corpus only and (2) another model with CSI-negative corpus only. As a result, we indicate that the proposed model shows high accuracy, i.e., 0.934 of F1-score and 0.935 of area under the curve (AUC), which improves the baseline models by 1.76∼6.74% of F1-score and by 1.64∼6.98% of AUC.

Download Full-text

Text Classification Models for the Automatic Detection of Nonmedical Prescription Medication Use from Social Media

10.21203/rs.3.rs-58679/v2 ◽

2021 ◽

Author(s):

Mohammed Al-Garadi ◽

Yuan-Chi Yang ◽

Haitao Cai ◽

Yucheng Ruan ◽

Karen O’Connor ◽

...

Keyword(s):

Machine Learning ◽

Social Media ◽

Deep Learning ◽

Text Classification ◽

Prescription Medication ◽

The United States ◽

Language Models ◽

Classification Model ◽

Future Research ◽

Classification Models

Abstract BackgroundPrescription medication (PM) misuse/abuse has emerged as a national crisis in the United States, and social media has been suggested as a potential resource for performing active monitoring. However, automating a social media-based monitoring system is challenging—requiring advanced natural language processing (NLP) and machine learning methods. In this paper, we describe the development and evaluation of automatic text classification models for detecting self-reports of PM abuse from Twitter.MethodsWe experimented with state-of-the-art bi-directional transformer-based language models, which utilize tweet-level representations that enable transfer learning (e.g., BERT, RoBERTa, XLNet, AlBERT, and DistilBERT), proposed fusion-based approaches, and compared the developed models with several traditional machine learning, including deep learning, approaches. Using a public dataset, we evaluated the performances of the classifiers on their abilities to classify the non-majority “abuse/misuse” class.ResultsOur proposed fusion-based model performs significantly better than the best traditional model (F1-score [95% CI]: 0.67 [0.64-0.69] vs. 0.45 [0.42-0.48]). We illustrate, via experimentation using differing training set sizes, that the transformer-based models are more stable and require less annotated data compared to the other models. The significant improvements achieved by our best-performing classification model over past approaches makes it suitable for automated continuous monitoring of nonmedical PM use from Twitter.ConclusionsBERT, BERT-like and fusion-based models not only outperform traditional machine learning and deep learning models, but also show substantial improvements over many years of past research on the topic of prescription medication misuse/abuse classification from social media, which had been shown to be a complex task due to the unique ways in which information about nonmedical use is presented. Several challenges, such as lack of complete context and the nature of social media language, need to be overcome to further improve BERT and BERT-like models. These experimental driven challenges are represented as potential future research directions.

Download Full-text

Swarm Intelligence Optimization: An Exploration and Application of Machine Learning Technology

Journal of Intelligent Systems ◽

10.1515/jisys-2020-0084 ◽

2021 ◽

Vol 30 (1) ◽

pp. 460-469

Author(s):

Yinying Cai ◽

Amit Sharma

Keyword(s):

Machine Learning ◽

Swarm Intelligence ◽

Research Result ◽

Machine Learning Algorithms ◽

Learning Technology ◽

Data Set ◽

Rice Pests ◽

Machine Leaning ◽

Smart Agriculture ◽

Swarm Intelligence Optimization

Abstract In the agriculture development and growth, the efficient machinery and equipment plays an important role. Various research studies are involved in the implementation of the research and patents to aid the smart agriculture and authors and reviewers that machine leaning technologies are providing the best support for this growth. To explore machine learning technology and machine learning algorithms, the most of the applications are studied based on the swarm intelligence optimization. An optimized V3CFOA-RF model is built through V3CFOA. The algorithm is tested in the data set collected concerning rice pests, later analyzed and compared in detail with other existing algorithms. The research result shows that the model and algorithm proposed are not only more accurate in recognition and prediction, but also solve the time lagging problem to a degree. The model and algorithm helped realize a higher accuracy in crop pest prediction, which ensures a more stable and higher output of rice. Thus they can be employed as an important decision-making instrument in the agricultural production sector.

Download Full-text

Comparison of Sampling Methods Using Machine Learning and Deep Learning Algorithms with an Imbalanced Data Set for the Prevention of Violence Against Physicians

10.1109/uyms54260.2021.9659758 ◽

2021 ◽

Author(s):

Hilal Cakir ◽

Nilgun Incereis ◽

Bekir Tevfik Akgun ◽

Av. S. Yazgulu Tastemir

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Sampling Methods ◽

Learning Algorithms ◽

Imbalanced Data ◽

Data Set ◽

Prevention Of Violence

Download Full-text

A Novel ECG Automatic Detection Using LongShort-Term Memory Network and Internet of Things Technology

Journal of Medical Imaging and Health Informatics ◽

10.1166/jmihi.2021.3684 ◽

2021 ◽

Vol 11 (6) ◽

pp. 1592-1598

Author(s):

Xufei Liu

Keyword(s):

Machine Learning ◽

Cardiovascular Diseases ◽

Internet Of Things ◽

Network Structure ◽

Recognition Rate ◽

Classification Model ◽

Data Set ◽

Public Data ◽

Lstm Network ◽

Internet Of Things Technology

The early detection of cardiovascular diseases based on electrocardiogram (ECG) is very important for the timely treatment of cardiovascular patients, which increases the survival rate of patients. ECG is a visual representation that describes changes in cardiac bioelectricity and is the basis for detecting heart health. With the rise of edge machine learning and Internet of Things (IoT) technologies, small machine learning models have received attention. This study proposes an ECG automatic classification method based on Internet of Things technology and LSTM network to achieve early monitoring and early prevention of cardiovascular diseases. Specifically, this paper first proposes a single-layer bidirectional LSTM network structure. Make full use of the timing-dependent features of the sampling points before and after to automatically extract features. The network structure is more lightweight and the calculation complexity is lower. In order to verify the effectiveness of the proposed classification model, the relevant comparison algorithm is used to verify on the MIT-BIH public data set. Secondly, the model is embedded in a wearable device to automatically classify the collected ECG. Finally, when an abnormality is detected, the user is alerted by an alarm. The experimental results show that the proposed model has a simple structure and a high classification and recognition rate, which can meet the needs of wearable devices for monitoring ECG of patients.

Download Full-text

Birds Sound Classification Based on Machine Learning Algorithms

Asian Journal of Research in Computer Science ◽

10.9734/ajrcos/2021/v9i430227 ◽

2021 ◽

pp. 1-11

Author(s):

Aska E. Mehyadin ◽

Adnan Mohsin Abdulazeez ◽

Dathar Abas Hasan ◽

Jwan N. Saeed

Keyword(s):

Machine Learning ◽

Noise Suppression ◽

Bird Species ◽

Machine Learning Algorithms ◽

Data Sets ◽

Learning Technology ◽

Species Classification ◽

Data Set ◽

Sound Classification ◽

Mel Frequency Cepstral Coefficient

The bird classifier is a system that is equipped with an area machine learning technology and uses a machine learning method to store and classify bird calls. Bird species can be known by recording only the sound of the bird, which will make it easier for the system to manage. The system also provides species classification resources to allow automated species detection from observations that can teach a machine how to recognize whether or classify the species. Non-undesirable noises are filtered out of and sorted into data sets, where each sound is run via a noise suppression filter and a separate classification procedure so that the most useful data set can be easily processed. Mel-frequency cepstral coefficient (MFCC) is used and tested through different algorithms, namely Naïve Bayes, J4.8 and Multilayer perceptron (MLP), to classify bird species. J4.8 has the highest accuracy (78.40%) and is the best. Accuracy and elapsed time are (39.4 seconds).

Download Full-text