Sentiment Analysis towards Actionable Intelligence via Deep Learning

TEM Journal ◽

10.18421/tem94-44 ◽

2020 ◽

pp. 1663-1668

Author(s):

Shorouq Fathi Eletter

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Service Providers ◽

Unstructured Data ◽

Classification Model ◽

Online Review ◽

Support Vector ◽

Competitive Advantages ◽

Accuracy Rate ◽

Vector Machines

The exponential growth of unstructured data and the ability of businesses to utilize such data in decision-making have led to competitive advantages. The knowledge provided by analyzing unstructured data is crucial for product developers or service providers because it might affect the sustainability of the business. Sentiment analysis is used to gain an understanding of the attitudes, opinions, and emotions expressed within an online review. Naïve Bayes (NB), logistic regression (LR), decision trees (DT), deep learning (DL), and support vector machines (SVM) were used to build a classification model. In the data mining settings, the classification accuracy is the best metric to highlight the best classifier. The DL classifier outperformed other models in terms of accuracy rate. Classifying customers' feelings toward a product or service is critical for providing actionable insights. Utilizing such models will help to analyze huge volumes of reviews, saving both time and costs.

Download Full-text

Twitter sentiment analysis for the estimation of voting intention in the 2017 Chilean elections

Intelligent Data Analysis ◽

10.3233/ida-194768 ◽

2020 ◽

Vol 24 (5) ◽

pp. 1141-1160

Author(s):

Tomás Alegre Sepúlveda ◽

Brian Keith Norambuena

Keyword(s):

Machine Learning ◽

Support Vector Machines ◽

Sentiment Analysis ◽

Classification Model ◽

Machine Learning Techniques ◽

Support Vector ◽

Traditional Methods ◽

Actual Result ◽

Learning Techniques ◽

Vector Machines

In this paper, we apply sentiment analysis methods in the context of the first round of the 2017 Chilean elections. The purpose of this work is to estimate the voting intention associated with each candidate in order to contrast this with the results from classical methods (e.g., polls and surveys). The data are collected from Twitter, because of its high usage in Chile and in the sentiment analysis literature. We obtained tweets associated with the three main candidates: Sebastián Piñera (SP), Alejandro Guillier (AG) and Beatriz Sánchez (BS). For each candidate, we estimated the voting intention and compared it to the traditional methods. To do this, we first acquired the data and labeled the tweets as positive or negative. Afterward, we built a model using machine learning techniques. The classification model had an accuracy of 76.45% using support vector machines, which yielded the best model for our case. Finally, we use a formula to estimate the voting intention from the number of positive and negative tweets for each candidate. For the last period, we obtained a voting intention of 35.84% for SP, compared to a range of 34–44% according to traditional polls and 36% in the actual elections. For AG we obtained an estimate of 37%, compared with a range of 15.40% to 30.00% for traditional polls and 20.27% in the elections. For BS we obtained an estimate of 27.77%, compared with the range of 8.50% to 11.00% given by traditional polls and an actual result of 22.70% in the elections. These results are promising, in some cases providing an estimate closer to reality than traditional polls. Some differences can be explained due to the fact that some candidates have been omitted, even though they held a significant number of votes.

Download Full-text

Quality Assessment in Vocational Education Based on a Hierarchical Classification Technique

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.403-408.3724 ◽

2011 ◽

Vol 403-408 ◽

pp. 3724-3728

Author(s):

Chantima Ekwong ◽

Sageemas Na Wichain ◽

Choochart Haruechaiyasak

Keyword(s):

Quality Assessment ◽

Vocational Education ◽

Hierarchical Classification ◽

Unstructured Data ◽

Classification Model ◽

Support Vector ◽

Education Standards ◽

National Education ◽

Vector Machines

According to the laws of education in Thailand, the Office for National Education Standards and Quality Assessment is responsible for assessing the external educational institutes in order to develop the quality and educational standards. The external quality assessment reports are represented in both structured and unstructured data. In this paper, we focus on the analysis of unstructured data, i.e., to automatically classify strength and weakness points. We propose and evaluate two different classification models: Flat Classification and Hierarchical Classification. Three algorithms, Naive Bayes, Support Vector Machines (SVM) and Decision Tree, were used in the experiments. The results showed that classification viathe Hierarchical Classification model by using the SVM yielded the best performance. The classification of strength and weakness points yielded the F-measure equal to 0.843 and 0.893, respectively. The proposed approach can be applied as a decision support function for quality assessment in vocational education.

Download Full-text

Modelos de aprendizaje automático para el apoyo en la clasificación de tipos de cáncer a partir de datos estructurados y no estructurados de expedientes clínicos

10.24275/uama.6732.7811 ◽

2020 ◽

Author(s):

◽

Erick Esteven Montelongo González

Keyword(s):

Unstructured Data ◽

Classification Model ◽

Support Vector ◽

Classification Models ◽

Multi Layer Perceptron ◽

Health Area ◽

Vector Machines ◽

Clinical Records ◽

Unstructured Information

The existence of large volumes of data generated by the health area presents an important opportunity for analysis. This can obtain information to support physicians in the decisionmaking process for the diagnosis or treatment of diseases, such as cancer. The present work shows a methodology for the classification of patients with liver, lung and breast cancer, through machine learning models, to obtain the model that performs best in the classification. The methodology considers three classification models: Support Vector Machines (SVM), Multi-Layer Perceptron (MLP) and AdaBoost using both structured and unstructured information from the patient's clinical records. Results show that the best classification model is MLP using only unstructured data, obtaining 89% of precision, showing the usefulness of this type of data in the classification of cancer patients.

Download Full-text

Study on Product Opinion Analysis for Customer Satisfaction on E-Commerce Websites

10.3233/apc210206 ◽

2021 ◽

Author(s):

Vijaya Sagvekar ◽

Prashant Sharma

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Learning Strategy ◽

Classification Model ◽

Online Review ◽

Electronic Trade ◽

Typical Application ◽

Opinion Analysis ◽

Human Reaction ◽

Client Care

The E-commerce websites have been emerged in a high range of marketing benefits for the users to publish or share the experience of the received product by posting review that contain useful comments, opinions and feedback on the product. These days, a large number of clients acquire freedoms to look at comparative items in online sites and pick their top choices in computerized retailers, like Amazon.com and Taobao.com. Client audits in online media and electronic trade Websites contain important electronic word data of items. Sentiment Analysis is broadly applied as voice of clients for applications that target showcasing and client care. Sentiment extractors in their most essential structure classify messages as either having a good or negative or once in a while neutral supposition. A typical application of sentiment investigation is the programmed assurance of whether an online review contains a positive or negative review. Subsequently, in this paper, with the use of the strategies on sentiment analysis, obstinate sentences alluding to a particular element are first recognized from item online audits. We have proposed deep learning strategy as a classification model for discovering the condition of review. The outcomes showed suggested site for the client dependent on the early audits, past reviews and answer given to inquiry audit for the client. Additionally, it is seen that the proposed strategy can ready to answer every one of the reviews with a superior closeness like a human reaction to the client.

Download Full-text

Deep Learning Based Sentiment Analysis for Phishing SMS Detection

10.4018/978-1-7998-8061-5.ch001 ◽

2021 ◽

pp. 1-28

Author(s):

Aakanksha Sharaff ◽

Ramya Allenki ◽

Rakhi Seth

Keyword(s):

Neural Network ◽

Deep Learning ◽

Sentiment Analysis ◽

Classification Model ◽

Machine Learning Techniques ◽

Support Vector ◽

Classification Models ◽

Learning Techniques ◽

Artificial Neural Network Ann ◽

Baseline Algorithm

Sentiment analysis works on the principle of categorizing and identifying the text-based content and the process of classifying documents into one of the predefined classes commonly known as text classification. Hackers deploy a strategy by sending malicious content as an advertisement link and attack the user system to gain information. For protecting the system from this type of phishing attack, one needs to classify the spam data. This chapter is based on a discussion and comparison of various classification models that are used for phishing SMS detection through sentiment analysis. In this chapter, SMS data is collected from Kaggle, which is classified as ham or spam; while implementing the deep learning techniques like Convolutional Neural Network (CNN), CNN with 7 layers, and CNN with 11 layers, different results are generated. For evaluating these results, different machine learning techniques are used as a baseline algorithm like Naive Bayes, Decision Trees, Support Vector Machine (SVM), and Artificial Neural Network (ANN). After evaluation, CNN showed the highest accuracy of 99.47% as a classification model.

Download Full-text

A Novel Approach to Component Assembly Inspection Based on Mask R-CNN and Support Vector Machines

Information ◽

10.3390/info10090282 ◽

2019 ◽

Vol 10 (9) ◽

pp. 282 ◽

Cited By ~ 2

Author(s):

Haisong Huang ◽

Zhongyu Wei ◽

Liguo Yao

Keyword(s):

Industry 4.0 ◽

Feature Vector ◽

Classification Model ◽

Support Vector ◽

Accuracy Rate ◽

Invariant Moment ◽

Experimental Platform ◽

Novel Approach ◽

Vector Machines ◽

Component Assembly

Assembly is a very important manufacturing process in the age of Industry 4.0. Aimed at the problems of part identification and assembly inspection in industrial production, this paper proposes a method of assembly inspection based on machine vision and a deep neural network. First, the image acquisition platform is built to collect the part and assembly images. We use the Mask R-CNN model to identify and segment the shape from each part image, and to obtain the part category and position coordinates in the image. Then, according to the image segmentation results, the area, perimeter, circularity, and Hu invariant moment of the contour are extracted to form the feature vector. Finally, the SVM classification model is constructed to identify the assembly defects, with a classification accuracy rate of over 86.5%. The accuracy of the method is verified by constructing an experimental platform. The results show that the method effectively completes the identification of missing and misaligned parts in the assembly, and has good robustness.

Download Full-text

Detection of Fake News Text Classification on COVID-19 Using Deep Learning Approaches

Computational and Mathematical Methods in Medicine ◽

10.1155/2021/5514220 ◽

2021 ◽

Vol 2021 ◽

pp. 1-14

Author(s):

Waqas Haider Bangyal ◽

Rukhma Qasim ◽

Najeeb ur Rehman ◽

Zeeshan Ahmad ◽

Hafsa Dar ◽

...

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Confusion Matrix ◽

Research Work ◽

Noise Removal ◽

Machine Learning Algorithms ◽

Classification Model ◽

Support Vector ◽

Primary Concern ◽

Fake News

A vast amount of data is generated every second for microblogs, content sharing via social media sites, and social networking. Twitter is an essential popular microblog where people voice their opinions about daily issues. Recently, analyzing these opinions is the primary concern of Sentiment analysis or opinion mining. Efficiently capturing, gathering, and analyzing sentiments have been challenging for researchers. To deal with these challenges, in this research work, we propose a highly accurate approach for SA of fake news on COVID-19. The fake news dataset contains fake news on COVID-19; we started by data preprocessing (replace the missing value, noise removal, tokenization, and stemming). We applied a semantic model with term frequency and inverse document frequency weighting for data representation. In the measuring and evaluation step, we applied eight machine-learning algorithms such as Naive Bayesian, Adaboost, K -nearest neighbors, random forest, logistic regression, decision tree, neural networks, and support vector machine and four deep learning CNN, LSTM, RNN, and GRU. Afterward, based on the results, we boiled a highly efficient prediction model with python, and we trained and evaluated the classification model according to the performance measures (confusion matrix, classification rate, true positives rate...), then tested the model on a set of unclassified fake news on COVID-19, to predict the sentiment class of each fake news on COVID-19. Obtained results demonstrate a high accuracy compared to the other models. Finally, a set of recommendations is provided with future directions for this research to help researchers select an efficient sentiment analysis model on Twitter data.

Download Full-text

Comparison of SVM, RF and SGD Methods for Determination of Programmer's Performance Classification Model in Social Media Activities

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v4i2.1770 ◽

2020 ◽

Vol 4 (2) ◽

pp. 329-335

Author(s):

Rusydi Umar ◽

Imam Riadi ◽

Purwono

Keyword(s):

Social Media ◽

Gradient Descent ◽

Classification Model ◽

Stochastic Gradient Descent ◽

Support Vector ◽

Svm Algorithm ◽

Vector Machines ◽

Performance Patterns ◽

A Company

The failure of most startups in Indonesia is caused by team performance that is not solid and competent. Programmers are an integral profession in a startup team. The development of social media can be used as a strategic tool for recruiting the best programmer candidates in a company. This strategic tool is in the form of an automatic classification system of social media posting from prospective programmers. The classification results are expected to be able to predict the performance patterns of each candidate with a predicate of good or bad performance. The classification method with the best accuracy needs to be chosen in order to get an effective strategic tool so that a comparison of several methods is needed. This study compares classification methods including the Support Vector Machines (SVM) algorithm, Random Forest (RF) and Stochastic Gradient Descent (SGD). The classification results show the percentage of accuracy with k = 10 cross validation for the SVM algorithm reaches 81.3%, RF at 74.4%, and SGD at 80.1% so that the SVM method is chosen as a model of programmer performance classification on social media activities.

Download Full-text

Why Deep Learning Is More Efficient than Support Vector Machines, and How it is Related to Sparsity Techniques in Signal Processing

Proceedings of the 2020 4th International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence ◽

10.1145/3396474.3396478 ◽

2020 ◽

Author(s):

Laxman Bokati ◽

Olga Kosheleva ◽

Vladik Kreinovich ◽

Anibal Sosa

Keyword(s):

Signal Processing ◽

Deep Learning ◽

Support Vector Machines ◽

Support Vector ◽

Vector Machines

Download Full-text

A semiautomatic annotation approach for sentiment analysis

Journal of Information Science ◽

10.1177/01655515211006594 ◽

2021 ◽

pp. 016555152110065

Author(s):

Rahma Alahmary ◽

Hmood Al-Dossari

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Short Term Memory ◽

Machine Learning Algorithms ◽

Support Vector ◽

Short Term ◽

Term Memory ◽

Annotation Process ◽

Learning Classifiers ◽

Long Short Term Memory

Sentiment analysis (SA) aims to extract users’ opinions automatically from their posts and comments. Almost all prior works have used machine learning algorithms. Recently, SA research has shown promising performance in using the deep learning approach. However, deep learning is greedy and requires large datasets to learn, so it takes more time for data annotation. In this research, we proposed a semiautomatic approach using Naïve Bayes (NB) to annotate a new dataset in order to reduce the human effort and time spent on the annotation process. We created a dataset for the purpose of training and testing the classifier by collecting Saudi dialect tweets. The dataset produced from the semiautomatic model was then used to train and test deep learning classifiers to perform Saudi dialect SA. The accuracy achieved by the NB classifier was 83%. The trained semiautomatic model was used to annotate the new dataset before it was fed into the deep learning classifiers. The three deep learning classifiers tested in this research were convolutional neural network (CNN), long short-term memory (LSTM) and bidirectional long short-term memory (Bi-LSTM). Support vector machine (SVM) was used as the baseline for comparison. Overall, the performance of the deep learning classifiers exceeded that of SVM. The results showed that CNN reported the highest performance. On one hand, the performance of Bi-LSTM was higher than that of LSTM and SVM, and, on the other hand, the performance of LSTM was higher than that of SVM. The proposed semiautomatic annotation approach is usable and promising to increase speed and save time and effort in the annotation process.

Download Full-text