Characterisation of COVID-19-Related Tweets in the Croatian Language: Framework Based on the Cro-CoV-cseBERT Model

Karlo Babić; Milan Petrović; Slobodan Beliga; Sanda Martinčić-Ipšić; Mihaela Matešić; Ana Meštrović

doi:10.3390/app112110442

Applied Sciences ◽

10.3390/app112110442 ◽

2021 ◽

Vol 11 (21) ◽

pp. 10442

Author(s):

Karlo Babić ◽

Milan Petrović ◽

Slobodan Beliga ◽

Sanda Martinčić-Ipšić ◽

Mihaela Matešić ◽

...

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Crisis Communication ◽

Learning Algorithms ◽

Language Model ◽

Machine Learning Algorithms ◽

Large Dataset ◽

Communication Problems ◽

The Republic ◽

Over Time

This study aims to provide insights into COVID-19-related communication on Twitter in the Republic of Croatia. For that purpose, we developed an NLP-based framework that enables automatic analysis of a large dataset of tweets in the Croatian language. We collected and analysed 206,196 tweets related to COVID-19 and constructed a dataset of 10,000 tweets, which we manually annotated with a sentiment label. We trained the Cro-CoV-cseBERT language model for the representation and clustering of tweets. Additionally, we compared the performance of four machine learning algorithms on the task of sentiment classification. After identifying the best-performing setup of NLP methods, we applied the proposed framework to the characterisation of COVID-19 tweets in Croatia. More precisely, we performed sentiment analysis and tracked the sentiment over time. Furthermore, we detected how tweets are grouped into clusters with similar themes across three pandemic waves. Finally, we characterised the tweets by analysing the distribution of sentiment polarity (in each thematic cluster and over time) and the number of retweets (in each thematic cluster and sentiment class). These results could be useful for additional research and interpretation in the domains of sociology, psychology or other sciences, as well as for the authorities, who could use them to address crisis communication problems.
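
The two characterisation steps named in the abstract, sentiment distribution over time and retweet counts per sentiment class, can be illustrated with a minimal sketch. It is not the authors' implementation: the toy records, the three label names, the monthly granularity, and the function names `sentiment_share_by_month` and `mean_retweets_by_sentiment` are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from datetime import date

# Hypothetical toy records: (posting date, sentiment label, retweet count).
# The label names and values are illustrative, not taken from the paper's dataset.
tweets = [
    (date(2020, 3, 14), "negative", 12),
    (date(2020, 3, 20), "neutral", 3),
    (date(2020, 3, 28), "negative", 7),
    (date(2020, 4, 2), "positive", 5),
    (date(2020, 4, 9), "neutral", 0),
    (date(2020, 4, 21), "positive", 9),
]


def sentiment_share_by_month(records):
    """Return {(year, month): {label: fraction of that month's tweets}}."""
    counts = defaultdict(Counter)
    for day, label, _retweets in records:
        counts[(day.year, day.month)][label] += 1
    return {
        month: {label: n / sum(c.values()) for label, n in c.items()}
        for month, c in sorted(counts.items())
    }


def mean_retweets_by_sentiment(records):
    """Return {label: mean retweet count of tweets carrying that label}."""
    totals, sizes = Counter(), Counter()
    for _day, label, retweets in records:
        totals[label] += retweets
        sizes[label] += 1
    return {label: totals[label] / sizes[label] for label in totals}


shares = sentiment_share_by_month(tweets)
retweets = mean_retweets_by_sentiment(tweets)
```

The same aggregation could be keyed by thematic cluster instead of month once each tweet carries a cluster identifier.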

Sentiment Analysis of Movie Reviews: A Study of Machine Learning Algorithms with Various Feature Selection Methods

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v5i9.113121 ◽

2017 ◽

Vol 5 (9) ◽

Cited By ~ 1

Author(s):

Rajwinder Kaur

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Selection Methods

Twitter Sentiment Analysis Using Machine Learning Algorithms: A Case Study

2020 International Conference on Advances in Computing, Communication & Materials (ICACCM) ◽

10.1109/icaccm50413.2020.9213011 ◽

2020 ◽

Author(s):

Sheresh Zahoor ◽

Rajesh Rohilla

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms

Comparative Study of Machine Learning Algorithms for Twitter Sentiment Analysis

2021 International Conference on Emerging Smart Computing and Informatics (ESCI) ◽

10.1109/esci50559.2021.9396925 ◽

2021 ◽

Author(s):

Yash Indulkar ◽

Abhijit Patil

Keyword(s):

Machine Learning ◽

Comparative Study ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms

Performance Analysis of Machine Learning Algorithms and Feature Extraction Methods for Sentiment Analysis

10.1109/icses52305.2021.9633882 ◽

2021 ◽

Author(s):

Anshumaan Chauhan ◽

Ayushi Agarwal ◽

Razia Sulthana

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Performance Analysis ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Extraction Methods ◽

Machine Learning Algorithms

Sentiment Analysis Using Machine Learning Algorithms

2021 International Wireless Communications and Mobile Computing (IWCMC) ◽

10.1109/iwcmc51323.2021.9498965 ◽

2021 ◽

Author(s):

Fatma Jemai ◽

Mohamed Hayouni ◽

Sahbi Baccar

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms

Dynamic Sentiment Analysis Using Multiple Machine Learning Algorithms: A Comparative Knowledge Methodology

Advances in Data and Information Sciences - Lecture Notes in Networks and Systems ◽

10.1007/978-981-10-8360-0_26 ◽

2018 ◽

pp. 273-286

Author(s):

Manmeet Kaur ◽

Krishna Kant Agrawal ◽

Deepak Arora

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms

Comparison of the efficiency of Machine Learning algorithms on Twitter Sentiment Analysis of Pathao

2019 22nd International Conference on Computer and Information Technology (ICCIT) ◽

10.1109/iccit48885.2019.9038208 ◽

2019 ◽

Author(s):

Mahamudul Islam Sajib ◽

Shoeib Mahmud Shargo ◽

Md. Alomgir Hossain

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms

Experimental investigation of automated system for twitter sentiment analysis to predict the public emotions using machine learning algorithms

Materials Today Proceedings ◽

10.1016/j.matpr.2020.09.351 ◽

2020 ◽

Author(s):

Priti Sharma ◽

A.K. Sharma

Keyword(s):

Machine Learning ◽

Experimental Investigation ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Automated System ◽

The Public

Sentiment Analysis for Scraping of Product Reviews from Multiple Web Pages Using Machine Learning Algorithms

Advances in Intelligent Systems and Computing - Intelligent Systems Design and Applications ◽

10.1007/978-3-030-16660-1_66 ◽

2019 ◽

pp. 677-685

Author(s):

E. Suganya ◽

S. Vijayarani

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Web Pages ◽

Product Reviews

A Surveillance on Machine Learning Algorithms and Its Applications

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2020.9064 ◽

2020 ◽

Vol 17 (9) ◽

pp. 4294-4298

Author(s):

B. R. Sunil Kumar ◽

B. S. Siddhartha ◽

S. N. Shwetha ◽

K. Arpitha

Keyword(s):

Machine Learning ◽

Health Care ◽

Sentiment Analysis ◽

Learning Algorithm ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Machine Learning Algorithm ◽

Data Set ◽

Pros And Cons ◽

Primary Advantage

This paper examines distinct machine learning algorithms and explores their features. The primary advantage of machine learning is that an algorithm can learn from data what to do and then make predictions automatically. The paper introduces the concept of machine learning and the algorithms that can be used for applications such as health care, sentiment analysis and many more. Programmers are sometimes unsure which algorithm to apply to their application. This paper offers guidance on choosing an algorithm according to how accurately it fits the data. Given the collected data, one of the algorithms can be selected by weighing its pros and cons. A base model is then developed, trained and tested on the data set. The trained model is then ready for prediction and can be deployed if feasible.
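
The selection procedure described above (train candidate algorithms, test them, keep the one that fits most accurately) can be sketched as follows. This is only an illustration under stated assumptions: the two candidates (a majority-class baseline and a nearest-centroid classifier), the hand-made two-dimensional data, and all function names are hypothetical and do not come from the paper.

```python
def fit_majority(features, labels):
    """Baseline: always predict the most frequent training label."""
    most_common = max(set(labels), key=labels.count)
    return lambda x: most_common


def fit_nearest_centroid(features, labels):
    """Predict the label whose mean training point is closest to x."""
    sums, counts = {}, {}
    for x, label in zip(features, labels):
        acc = sums.setdefault(label, [0.0] * len(x))
        for i, value in enumerate(x):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    centroids = {
        label: [s / counts[label] for s in acc] for label, acc in sums.items()
    }

    def predict(x):
        # Squared Euclidean distance to each class centroid.
        return min(
            centroids,
            key=lambda lab: sum((a - b) ** 2 for a, b in zip(x, centroids[lab])),
        )

    return predict


def accuracy(predict, features, labels):
    """Fraction of held-out examples predicted correctly."""
    return sum(predict(x) == y for x, y in zip(features, labels)) / len(labels)


# Hypothetical toy data: class "a" near (0, 0), class "b" near (5, 5).
train_X = [(0.0, 0.2), (0.3, 0.1), (0.1, 0.4), (0.2, 0.0), (5.0, 5.1), (4.8, 5.3)]
train_y = ["a", "a", "a", "a", "b", "b"]
test_X = [(0.2, 0.3), (0.4, 0.1), (5.2, 4.9), (4.7, 5.0)]
test_y = ["a", "a", "b", "b"]

candidates = {"majority": fit_majority, "nearest_centroid": fit_nearest_centroid}
scores = {
    name: accuracy(fit(train_X, train_y), test_X, test_y)
    for name, fit in candidates.items()
}
best = max(scores, key=scores.get)  # candidate with the highest held-out accuracy
```

In practice, accuracy on a single held-out split is only one criterion; the pros and cons mentioned in the abstract (for example training cost or interpretability) would also inform the choice.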