Influence factor based opinion mining of Twitter data using supervised learning

In recent years, there is a rapid growth in online communication. There are many social networking sites and related mobile applications, and some more are still emerging. Huge amount of data is generated by these sites everyday and this data can be used as a source for various analysis purposes. Twitter is one of the most popular networking sites with millions of users. There are users with different views and varieties of reviews in the form of tweets are generated by them. Nowadays Opinion Mining has become an emerging topic of research due to lot of opinionated data available on Blogs & social networking sites. Tracking different types of opinions & summarizing them can provide valuable insight to different types of opinions to users who use Social networking sites to get reviews about any product, service or any topic. Analysis of opinions & its classification on the basis of polarity (positive, negative, neutral) is a challenging task. Lot of work has been done on sentiment analysis of twitter data and lot needs to be done. In this paper we discuss the levels, approaches of sentiment analysis, sentiment analysis of twitter data, existing tools available for sentiment analysis and the steps involved for same. Two approaches are discussed with an example which works on machine learning and lexicon based respectively.

Download Full-text

A gradient boosted decision tree-based sentiment classification of twitter data

International Journal of Wavelets Multiresolution and Information Processing ◽

10.1142/s0219691320500277 ◽

2020 ◽

Vol 18 (04) ◽

pp. 2050027

Author(s):

S. Neelakandan ◽

D. Paulraj

Keyword(s):

Decision Tree ◽

Opinion Mining ◽

Research Topic ◽

Sentiment Classification ◽

Decision Tree Classifier ◽

Twitter Data ◽

Tree Classifier ◽

Boosted Decision Tree ◽

Text Sentiment Analysis

People communicate their views, arguments and emotions about their everyday life on social media (SM) platforms (e.g. Twitter and Facebook). Twitter stands as an international micro-blogging service that features a brief message called tweets. Freestyle writing, incorrect grammar, typographical errors and abbreviations are some noises that occur in the text. Sentiment analysis (SA) centered on a tweet posted by the user, and also opinion mining (OM) of the customers review is another famous research topic. The texts are gathered from users’ tweets by means of OM and automatic-SA centered on ternary classifications, namely positive, neutral and negative. It is very challenging for the researchers to ascertain sentiments as a result of its limited size, misspells, unstructured nature, abbreviations and slangs for Twitter data. This paper, with the aid of the Gradient Boosted Decision Tree classifier (GBDT), proposes an efficient SA and Sentiment Classification (SC) of Twitter data. Initially, the twitter data undergoes pre-processing. Next, the pre-processed data is processed using HDFS MapReduce. Now, the features are extracted from the processed data, and then efficient features are selected using the Improved Elephant Herd Optimization (I-EHO) technique. Now, score values are calculated for each of those chosen features and given to the classifier. At last, the GBDT classifier classifies the data as negative, positive, or neutral. Experiential results are analyzed and contrasted with the other conventional techniques to show the highest performance of the proposed method.

Download Full-text

Real Time Opinion Mining and Analysis of Twitter Data

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i3.12.16104 ◽

2018 ◽

Vol 7 (3.12) ◽

pp. 351

Author(s):

K Senthil Kumar ◽

Mohammad Musab Trumboo ◽

Vaibhav . ◽

Satyajai Ahlawat

Keyword(s):

Sentiment Analysis ◽

Opinion Mining ◽

Far East ◽

Lexical Database ◽

Training Set ◽

Twitter Data ◽

The Far East ◽

The People ◽

Single Person ◽

Mass Information

This era, in which we currently stand, is an era of public opinion and mass information. People from all around the globe are joined together through various information junctions to create a global community, where one thing from the far east reaches to the people of the far west within seconds. Nothing is hidden, everything and anything can be scrutinized to its core and through these global criticisms and mass discussions of gigantic magnitude, we have reached to the pinnacle of correct decisions and better choices. These pseudo social groups and data junctions have bombarded our society so much that they now hold the forelock of our opinions and sentiments, ergo, we reach out to these groups to achieve a better outcome. But, all this enormous data and all these opinions cannot be researched by a single person, hence, comes the need of sentiment analysis. In this paper we’ll try to accomplish this by creating a system that will enable us to fetch tweets from twitter and use those tweets against a lexical database which will create a training set and then compare it with the pre-fetched tweets. Through this we will be able to assign a polarity to all the tweets by means of which we can address them as negative, positive or neutral and this is the very foundation of sentiment analysis, so subtle yet so magnificent.

Download Full-text

Detecting a Risk Signal in Stock Investment Through Opinion Mining and Graph-Based Semi-Supervised Learning

IEEE Access ◽

10.1109/access.2020.3021182 ◽

2020 ◽

Vol 8 ◽

pp. 161943-161957

Author(s):

Byungun Yoon ◽

Yujin Jeong ◽

Sunhye Kim

Keyword(s):

Supervised Learning ◽

Opinion Mining ◽

Stock Investment

Download Full-text

Opinion Mining and Classification of Music Lyrics Using Supervised Learning Algorithms

2018 First International Conference on Secure Cyber Computing and Communication (ICSCCC) ◽

10.1109/icsccc.2018.8703292 ◽

2018 ◽

Author(s):

Mahesh Ahuja ◽

A. L. Sangal

Keyword(s):

Supervised Learning ◽

Opinion Mining ◽

Learning Algorithms ◽

Supervised Learning Algorithms ◽

Music Lyrics

Download Full-text

Real-Time Streaming Data Analysis Using a Three-Way Classification Method for Sentimental Analysis

International Journal of Information Technology and Web Engineering ◽

10.4018/ijitwe.2018070107 ◽

2018 ◽

Vol 13 (3) ◽

pp. 99-111 ◽

Cited By ~ 2

Author(s):

Srinidhi Hiriyannaiah ◽

G.M. Siddesh ◽

K.G. Srinivasa

Keyword(s):

Social Media ◽

Opinion Mining ◽

Streaming Data ◽

Classification Method ◽

Emotion Classification ◽

Knowledge Based ◽

Twitter Data ◽

Internet Users ◽

Word Classification ◽

Three Stages

This article describes how recent advances in computing have led to an increase in the generation of data in fields such as social media, medical, power and others. With the rapid increase in internet users, social media has given power for sentiment analysis or opinion mining. It is a highly challenging task for storing, querying and analyzing such types of data. This article aims at providing a solution to store, query and analyze streaming data using Apache Kafka as the platform and twitter data as an example for analysis. A three-way classification method is proposed for sentimental analysis of twitter data that combines both the approaches for knowledge-based and machine-learning using three stages namely emotion classification, word classification and sentiment classification. The hybrid three-way classification approach was evaluated using a sample of five query strings on twitter and compared with existing emotion classifier, polarity classifier and Naïve Bayes classifier for sentimental analysis. The accuracy of the results of the proposed approach is superior when compared to existing approaches.

Download Full-text

Feature Selection for Ordinal Text Classification

Neural Computation ◽

10.1162/neco_a_00558 ◽

2014 ◽

Vol 26 (3) ◽

pp. 557-591 ◽

Cited By ~ 23

Author(s):

Stefano Baccianella ◽

Andrea Esuli ◽

Fabrizio Sebastiani

Keyword(s):

Feature Selection ◽

Supervised Learning ◽

Opinion Mining ◽

Rating Scale ◽

Learning Algorithms ◽

Learning Task ◽

Support Vector ◽

Data Sets ◽

Product Review ◽

Ordinal Classification

Ordinal classification (also known as ordinal regression) is a supervised learning task that consists of estimating the rating of a data item on a fixed, discrete rating scale. This problem is receiving increased attention from the sentiment analysis and opinion mining community due to the importance of automatically rating large amounts of product review data in digital form. As in other supervised learning tasks such as binary or multiclass classification, feature selection is often needed in order to improve efficiency and avoid overfitting. However, although feature selection has been extensively studied for other classification tasks, it has not for ordinal classification. In this letter, we present six novel feature selection methods that we have specifically devised for ordinal classification and test them on two data sets of product review data against three methods previously known from the literature, using two learning algorithms from the support vector regression tradition. The experimental results show that all six proposed metrics largely outperform all three baseline techniques (and are more stable than these others by an order of magnitude), on both data sets and for both learning algorithms.

Download Full-text

Opinion mining and sentiment analysis on a Twitter data stream

International Conference on Advances in ICT for Emerging Regions (ICTer2012) ◽

10.1109/icter.2012.6423033 ◽

2012 ◽

Cited By ~ 56

Author(s):

Balakrishnan Gokulakrishnan ◽

Pavalanathan Priyanthan ◽

Thiruchittampalam Ragavan ◽

Nadarajah Prasath ◽

AShehan Perera

Keyword(s):

Sentiment Analysis ◽

Data Stream ◽

Opinion Mining ◽

Twitter Data

Download Full-text

The Effect of Exposure to (Non-)Like-Minded Information on the Use of Political Incivility on Twitter

10.31124/advance.14191046 ◽

2021 ◽

Author(s):

Kohei Nishi

Keyword(s):

Supervised Learning ◽

Negative Emotions ◽

Political Information ◽

Learning Method ◽

Twitter Data ◽

Competing Hypotheses

Does exposure to like-minded/non-like-minded information lead to the use of political incivility? Few studies have investigated this question, and the results have been mixed. There are two conflicting possibilities: (i) if individuals are frequently exposed to like-minded political information, they reinforce their pre-existing beliefs and are, thus, more likely to use uncivil language, and (ii) if individuals are frequently exposed to non-like-minded information, they often feel negative emotions and are, therefore, more likely to use incivility. To evaluate these two competing hypotheses, I analyze Japanese Twitter data using a semi-supervised learning method. The results show that individuals who are exposed to non-like-minded information are more likely to use political incivility.

Download Full-text