scholarly journals Multilingual Sentiment Analysis Using Latent Semantic Indexing and Machine Learning

Author(s):  
Brett W. Bader ◽  
W. Philip Kegelmeyer ◽  
Peter A. Chew
Author(s):  
Jesus M. Meneses ◽  
Karen W. Cantilang ◽  
Delbert A. Dala ◽  
Jovito B. Madeja

The purpose of this study was to decode the hidden views and sentiments from the collated written responses of Eastern Samar State University’s Program Heads regarding supervision of instructions amidst the COVID-19 pandemic. This study utilized Exploratory Sequential Mixed Method to explore and understand the perspective or sentiments of Eastern Samar State University program heads towards supervision of instruction in the midst of the COVID-19 pandemic. Data were collected/collated from the participants indirectly using an interview questionnaire containing an open-ended question. The same were processed and analyzed using an open-source machine learning software called Orange toolbox (Demsar et al., 2013) wherein pre-processing, sentiment analysis and topic modelling built-in tools were utilized. The results showed that the most prominent words generated by the machine learning tool from the text file of responses are the words pandemic, performance, program, learning, difficult, supervision, instruction, internet, faculty, online students, teaching, delivery confusing, challenging, poor and connectivity. The dominant sentiment associated thereof lean towards negative polarity which implicate negative sentiments. Hidden topics were automatically generated by the machine which allowed the researchers to come up with the following related themes: “Impact of pandemic in the supervision of instruction of faculty and learning of students”, “Challenges in the delivery of instruction and supervision due to poor internet connectivity”, and “Strategic role of online modalities and connectivity in supervision and delivery of instruction”. There are limited researches navigating in text mining and sentiment analysis with the use of Orange toolbox particularly those that deals with supervision of instruction in a Philippine State University. There are related studies using machine learning software, but nothing like this study directed towards a specific gap in specific locale. KEYWORDS: Pandemic, Latent Semantic Indexing, Orange Toolbox, Sentiment Analysis, Thematic Analysis.


2021 ◽  
Author(s):  
Adebayo Abayomi-Alli ◽  
Olusola Abayomi-Alli ◽  
Sanjay Misra ◽  
Luis Fernandez-Sanz

Abstract BackgroundSocial media opinion has become a medium to quickly access large, valuable, and rich details of information on any subject matter within a short period. Twitter being a social microblog site, generate over 330 million tweets monthly across different countries. Analyzing trending topics on Twitter presents opportunities to extract meaningful insight into different opinions on various issues.AimThis study aims to gain insights into the trending yahoo-yahoo topic on Twitter using content analysis of selected historical tweets.MethodologyThe widgets and workflow engine in the Orange Data mining toolbox were employed for all the text mining tasks. 5500 tweets were collected from Twitter using the 'yahoo yahoo' hashtag. The corpus was pre-processed using a pre-trained tweet tokenizer, Valence Aware Dictionary for Sentiment Reasoning (VADER) was used for the sentiment and opinion mining, Latent Dirichlet Allocation (LDA) and Latent Semantic Indexing (LSI) was used for topic modeling. In contrast, Multidimensional scaling (MDS) was used to visualize the modeled topics. ResultsResults showed that "yahoo" appeared in the corpus 9555 times, 175 unique tweets were returned after duplicate removal. Contrary to expectation, Spain had the highest number of participants tweeting on the 'yahoo yahoo' topic within the period. The result of Vader sentiment analysis returned 35.85%, 24.53%, 15.09%, and 24.53%, negative, neutral, no-zone, and positive sentiment tweets, respectively. The word yahoo was highly representative of the LDA topics 1, 3, 4, 6, and LSI topic 1.ConclusionIt can be concluded that emojis are even more representative of the sentiments in tweets faster than the textual contents. Also, despite popular belief, a significant number of youths regard cybercrime as a detriment to society.


Author(s):  
Ajeet Ram Pathak ◽  
Manjusha Pandey ◽  
Siddharth Rautaray

Background: The large amount of data emanated from social media platforms need scalable topic modeling in order to get current trends and themes of events discussed on such platforms. Topic modeling play crucial role in many natural language processing applications like sentiment analysis, recommendation systems, event tracking, summarization, etc. Objectives: The aim of the proposed work is to adaptively extract the dynamically evolving topics over streaming data, and infer the current trends and get the notion of trend of topics over time. Because of various world level events, many uncorrelated streaming channels tend to start discussion on similar topics. We aim to find the effect of uncorrelated streaming channels on topic modeling when they tend to start discussion on similar topics. Method: An adaptive framework for dynamic and temporal topic modeling using deep learning has been put forth in this paper. The framework approximates online latent semantic indexing constrained by regularization on streaming data using adaptive learning method. The framework is designed using deep layers of feedforward neural network. Results: This framework supports dynamic and temporal topic modeling. The proposed approach is scalable to large collection of data. We have performed exploratory data analysis and correspondence analysis on real world Twitter dataset. Results state that our approach works well to extract topic topics associated with a given hashtag. Given the query, the approach is able to extract both implicit and explicit topics associated with the terms mentioned in the query. Conclusion: The proposed approach is a suitable solution for performing topic modeling over Big Data. We are approximating the Latent Semantic Indexing model with regularization using deep learning with differentiable ℓ1 regularization, which makes the model work on streaming data adaptively at real-time. The model also supports the extraction of aspects from sentences based on interrelation of topics and thus, supports aspect modeling in aspect-based sentiment analysis.


2010 ◽  
Vol 14 (2) ◽  
pp. 159-181
Author(s):  
MUNPYO HONG ◽  
MIYOUNG SHIN ◽  
Shinhye Park ◽  
Hyungmin Lee

2020 ◽  
Author(s):  
Pathikkumar Patel ◽  
Bhargav Lad ◽  
Jinan Fiaidhi

During the last few years, RNN models have been extensively used and they have proven to be better for sequence and text data. RNNs have achieved state-of-the-art performance levels in several applications such as text classification, sequence to sequence modelling and time series forecasting. In this article we will review different Machine Learning and Deep Learning based approaches for text data and look at the results obtained from these methods. This work also explores the use of transfer learning in NLP and how it affects the performance of models on a specific application of sentiment analysis.


2020 ◽  
Vol 9 (3) ◽  
pp. 1239-1250
Author(s):  
D. Yadav ◽  
A. Sharma ◽  
S. Ahmad ◽  
U. Chandra

Author(s):  
Farrikh Alzami ◽  
Erika Devi Udayanti ◽  
Dwi Puji Prabowo ◽  
Rama Aria Megantara

Sentiment analysis in terms of polarity classification is very important in everyday life, with the existence of polarity, many people can find out whether the respected document has positive or negative sentiment so that it can help in choosing and making decisions. Sentiment analysis usually done manually. Therefore, an automatic sentiment analysis classification process is needed. However, it is rare to find studies that discuss extraction features and which learning models are suitable for unstructured sentiment analysis types with the Amazon food review case. This research explores some extraction features such as Word Bags, TF-IDF, Word2Vector, as well as a combination of TF-IDF and Word2Vector with several machine learning models such as Random Forest, SVM, KNN and Naïve Bayes to find out a combination of feature extraction and learning models that can help add variety to the analysis of polarity sentiments. By assisting with document preparation such as html tags and punctuation and special characters, using snowball stemming, TF-IDF results obtained with SVM are suitable for obtaining a polarity classification in unstructured sentiment analysis for the case of Amazon food review with a performance result of 87,3 percent.


Sign in / Sign up

Export Citation Format

Share Document