Influence factor based opinion mining of Twitter data using supervised learning

Author(s):  
Malhar Anjaria ◽  
Ram Mohana Reddy Guddeti
Author(s):  
Vishnu VardanReddy ◽  
Mahesh Maila ◽  
Sai Sri Raghava ◽  
Yashwanth Avvaru ◽  
Sri. V. Koteswarao

In recent years, there is a rapid growth in online communication. There are many social networking sites and related mobile applications, and some more are still emerging. Huge amount of data is generated by these sites everyday and this data can be used as a source for various analysis purposes. Twitter is one of the most popular networking sites with millions of users. There are users with different views and varieties of reviews in the form of tweets are generated by them. Nowadays Opinion Mining has become an emerging topic of research due to lot of opinionated data available on Blogs & social networking sites. Tracking different types of opinions & summarizing them can provide valuable insight to different types of opinions to users who use Social networking sites to get reviews about any product, service or any topic. Analysis of opinions & its classification on the basis of polarity (positive, negative, neutral) is a challenging task. Lot of work has been done on sentiment analysis of twitter data and lot needs to be done. In this paper we discuss the levels, approaches of sentiment analysis, sentiment analysis of twitter data, existing tools available for sentiment analysis and the steps involved for same. Two approaches are discussed with an example which works on machine learning and lexicon based respectively.


Author(s):  
S. Neelakandan ◽  
D. Paulraj

People communicate their views, arguments and emotions about their everyday life on social media (SM) platforms (e.g. Twitter and Facebook). Twitter stands as an international micro-blogging service that features a brief message called tweets. Freestyle writing, incorrect grammar, typographical errors and abbreviations are some noises that occur in the text. Sentiment analysis (SA) centered on a tweet posted by the user, and also opinion mining (OM) of the customers review is another famous research topic. The texts are gathered from users’ tweets by means of OM and automatic-SA centered on ternary classifications, namely positive, neutral and negative. It is very challenging for the researchers to ascertain sentiments as a result of its limited size, misspells, unstructured nature, abbreviations and slangs for Twitter data. This paper, with the aid of the Gradient Boosted Decision Tree classifier (GBDT), proposes an efficient SA and Sentiment Classification (SC) of Twitter data. Initially, the twitter data undergoes pre-processing. Next, the pre-processed data is processed using HDFS MapReduce. Now, the features are extracted from the processed data, and then efficient features are selected using the Improved Elephant Herd Optimization (I-EHO) technique. Now, score values are calculated for each of those chosen features and given to the classifier. At last, the GBDT classifier classifies the data as negative, positive, or neutral. Experiential results are analyzed and contrasted with the other conventional techniques to show the highest performance of the proposed method.


2018 ◽  
Vol 7 (3.12) ◽  
pp. 351
Author(s):  
K Senthil Kumar ◽  
Mohammad Musab Trumboo ◽  
Vaibhav . ◽  
Satyajai Ahlawat

This era, in which we currently stand, is an era of public opinion and mass information. People from all around the globe are joined together through various information junctions to create a global community, where one thing from the far east reaches to the people of the far west within seconds. Nothing is hidden, everything and anything can be scrutinized to its core and through these global criticisms and mass discussions of gigantic magnitude, we have reached to the pinnacle of correct decisions and better choices. These pseudo social groups and data junctions have bombarded our society so much that they now hold the forelock of our opinions and sentiments, ergo, we reach out to these groups to achieve a better outcome. But, all this enormous data and all these opinions cannot be researched by a single person, hence, comes the need of sentiment analysis. In this paper we’ll try to accomplish this by creating a system that will enable us to fetch tweets from twitter and use those tweets against a lexical database which will create a training set and then compare it with the pre-fetched tweets. Through this we will be able to assign a polarity to all the tweets by means of which we can address them as negative, positive or neutral and this is the very foundation of sentiment analysis, so subtle yet so magnificent.  


Author(s):  
Srinidhi Hiriyannaiah ◽  
G.M. Siddesh ◽  
K.G. Srinivasa

This article describes how recent advances in computing have led to an increase in the generation of data in fields such as social media, medical, power and others. With the rapid increase in internet users, social media has given power for sentiment analysis or opinion mining. It is a highly challenging task for storing, querying and analyzing such types of data. This article aims at providing a solution to store, query and analyze streaming data using Apache Kafka as the platform and twitter data as an example for analysis. A three-way classification method is proposed for sentimental analysis of twitter data that combines both the approaches for knowledge-based and machine-learning using three stages namely emotion classification, word classification and sentiment classification. The hybrid three-way classification approach was evaluated using a sample of five query strings on twitter and compared with existing emotion classifier, polarity classifier and Naïve Bayes classifier for sentimental analysis. The accuracy of the results of the proposed approach is superior when compared to existing approaches.


2014 ◽  
Vol 26 (3) ◽  
pp. 557-591 ◽  
Author(s):  
Stefano Baccianella ◽  
Andrea Esuli ◽  
Fabrizio Sebastiani

Ordinal classification (also known as ordinal regression) is a supervised learning task that consists of estimating the rating of a data item on a fixed, discrete rating scale. This problem is receiving increased attention from the sentiment analysis and opinion mining community due to the importance of automatically rating large amounts of product review data in digital form. As in other supervised learning tasks such as binary or multiclass classification, feature selection is often needed in order to improve efficiency and avoid overfitting. However, although feature selection has been extensively studied for other classification tasks, it has not for ordinal classification. In this letter, we present six novel feature selection methods that we have specifically devised for ordinal classification and test them on two data sets of product review data against three methods previously known from the literature, using two learning algorithms from the support vector regression tradition. The experimental results show that all six proposed metrics largely outperform all three baseline techniques (and are more stable than these others by an order of magnitude), on both data sets and for both learning algorithms.


Author(s):  
Balakrishnan Gokulakrishnan ◽  
Pavalanathan Priyanthan ◽  
Thiruchittampalam Ragavan ◽  
Nadarajah Prasath ◽  
AShehan Perera

2021 ◽  
Author(s):  
Kohei Nishi

Does exposure to like-minded/non-like-minded information lead to the use of political incivility? Few studies have investigated this question, and the results have been mixed. There are two conflicting possibilities: (i) if individuals are frequently exposed to like-minded political information, they reinforce their pre-existing beliefs and are, thus, more likely to use uncivil language, and (ii) if individuals are frequently exposed to non-like-minded information, they often feel negative emotions and are, therefore, more likely to use incivility. To evaluate these two competing hypotheses, I analyze Japanese Twitter data using a semi-supervised learning method. The results show that individuals who are exposed to non-like-minded information are more likely to use political incivility.


Sign in / Sign up

Export Citation Format

Share Document