Multimodal Hate Speech Detection in Greek Social Media

Konstantinos Perifanos; Dionysis Goutsos

doi:10.3390/mti5070034

Multimodal Hate Speech Detection in Greek Social Media

Multimodal Technologies and Interaction ◽

10.3390/mti5070034 ◽

2021 ◽

Vol 5 (7) ◽

pp. 34

Author(s):

Konstantinos Perifanos ◽

Dionysis Goutsos

Keyword(s):

Social Media ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Hate Speech ◽

Language Model ◽

Fine Tuning ◽

Accuracy Score ◽

Speech Detection ◽

Online Social Media

Hateful and abusive speech presents a major challenge for all online social media platforms. Recent advances in Natural Language Processing and Natural Language Understanding allow for more accurate detection of hate speech in textual streams. This study presents a new multimodal approach to hate speech detection by combining Computer Vision and Natural Language processing models for abusive context detection. Our study focuses on Twitter messages and, more specifically, on hateful, xenophobic, and racist speech in Greek aimed at refugees and migrants. In our approach, we combine transfer learning and fine-tuning of Bidirectional Encoder Representations from Transformers (BERT) and Residual Neural Networks (Resnet). Our contribution includes the development of a new dataset for hate speech classification, consisting of tweet IDs, along with the code to obtain their visual appearance, as they would have been rendered in a web browser. We have also released a pre-trained Language Model trained on Greek tweets, which has been used in our experiments. We report a consistently high level of accuracy (accuracy score = 0.970, f1-score = 0.947 in our best model) in racist and xenophobic speech detection.

Download Full-text

Detection of Cyberbullying on Social Media Using Machine learning

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.38635 ◽

2021 ◽

Vol 9 (10) ◽

pp. 1401-1409

Author(s):

Mitta Roja

Keyword(s):

Machine Learning ◽

Social Media ◽

Feature Extraction ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Hate Speech ◽

Text Data ◽

Model Based

Abstract: Cyberbullying is a major problem encountered on internet that affects teenagers and also adults. It has lead to mishappenings like suicide and depression. Regulation of content on Social media platorms has become a growing need. The following study uses data from two different forms of cyberbullying, hate speech tweets from Twittter and comments based on personal attacks from Wikipedia forums to build a model based on detection of Cyberbullying in text data using Natural Language Processing and Machine learning. Threemethods for Feature extraction and four classifiers are studied to outline the best approach. For Tweet data the model provides accuracies above 90% and for Wikipedia data it givesaccuracies above 80%. Keywords: Cyberbullying, Hate speech, Personal attacks,Machine learning, Feature extraction, Twitter, Wikipedia

Download Full-text

Research Journey of Hate Content Detection From Cyberspace

Advances in Business Information Systems and Analytics - Natural Language Processing for Global and Local Business ◽

10.4018/978-1-7998-4240-8.ch009 ◽

2021 ◽

pp. 200-225

Author(s):

Sayani Ghosal ◽

Amita Jain

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Digital Media ◽

Language Processing ◽

Social Networking Sites ◽

Hate Speech ◽

Detection System ◽

Machine Learning Algorithms ◽

Speech Detection ◽

Healthy Environment

Hate content detection is the most prospective and challenging research area under the natural language processing domain. Hate speech abuse individuals or groups of people based on religion, caste, language, or sex. Enormous growth of digital media and cyberspace has encouraged researchers to work on hatred speech detection. A commonly acceptable automatic hate detection system is required to stop flowing hate-motivated data. Anonymous hate content is affecting the young generation and adults on social networking sites. Through numerous studies and review papers, the chapter identifies the need for artificial intelligence (AI) in hate speech research. The chapter explores the current state-of-the-art and prospects of AI in natural language processing (NLP) and machine learning algorithms. The chapter aims to identify the most successful methods or techniques for hate speech detection to date. Revolution in this research helps social media to provide a healthy environment for everyone.

Download Full-text

A Survey on Hate Speech Detection using Natural Language Processing

10.18653/v1/w17-1101 ◽

2017 ◽

Cited By ~ 107

Author(s):

Anna Schmidt ◽

Michael Wiegand

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Hate Speech ◽

Speech Detection

Download Full-text

Automatic hate speech detection using killer natural language processing optimizing ensemble deep learning approach

Computing ◽

10.1007/s00607-019-00745-0 ◽

2019 ◽

Vol 102 (2) ◽

pp. 501-522 ◽

Cited By ~ 3

Author(s):

Zafer Al-Makhadmeh ◽

Amr Tolba

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Hate Speech ◽

Learning Approach ◽

Speech Detection

Download Full-text

Fake News Detector in Online Social Media

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a1089.1291s419 ◽

2019 ◽

Vol 9 (1S4) ◽

pp. 58-60

Keyword(s):

Social Media ◽

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Fake News ◽

The Public ◽

Online Social Media ◽

Learning Techniques ◽

State Of Art

Spreading of fake news in online social media is a major nuisance to the public and there is no state of art tool to detect whether a news is a fake or an original one in an automated manner. Hence, this paper analyses the online social media and the news feeds for detection of fake news. The work proposes solution using Natural Language Processing and Deep Learning techniques for detecting the fake news in online social media.

Download Full-text

Augment BERT with average pooling layer for Chinese summary generation

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-211229 ◽

2021 ◽

pp. 1-10

Author(s):

Shuai Zhao ◽

Fucheng You ◽

Wen Chang ◽

Tianyu Zhang ◽

Man Hu

Keyword(s):

Experimental Data ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Semantic Information ◽

Chinese Language ◽

Language Model ◽

Fine Tuning ◽

Generation Model ◽

Expected Effect

The BERT pre-trained language model has achieved good results in various subtasks of natural language processing, but its performance in generating Chinese summaries is not ideal. The most intuitive reason is that the BERT model is based on character-level composition, while the Chinese language is mostly in the form of phrases. Directly fine-tuning the BERT model cannot achieve the expected effect. This paper proposes a novel summary generation model with BERT augmented by the pooling layer. In our model, we perform an average pooling operation on token embedding to improve the model’s ability to capture phrase-level semantic information. We use LCSTS and NLPCC2017 to verify our proposed method. Experimental data shows that the average pooling model’s introduction can effectively improve the generated summary quality. Furthermore, different data needs to be set with varying pooling kernel sizes to achieve the best results through comparative analysis. In addition, our proposed method has strong generalizability. It can be applied not only to the task of generating summaries, but also to other natural language processing tasks.

Download Full-text

Hate Speech Detection Using Natural Language Processing: Applications and Challenges

2021 5th International Conference on Trends in Electronics and Informatics (ICOEI) ◽

10.1109/icoei51242.2021.9452882 ◽

2021 ◽

Author(s):

Anil Singh Parihar ◽

Surendrabikram Thapa ◽

Sushruti Mishra

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Hate Speech ◽

Speech Detection

Download Full-text

A Semantic Web Platform for Online Vaccine Sentiment Surveillance

Online Journal of Public Health Informatics ◽

10.5210/ojphi.v7i1.5823 ◽

2015 ◽

Vol 7 (1) ◽

Author(s):

Arash Shaban-Nejad ◽

Sonia Menon ◽

David Buckeridge

Keyword(s):

Social Media ◽

Natural Language Processing ◽

Semantic Web ◽

Natural Language ◽

Language Processing ◽

Concept Extraction ◽

Vaccine Refusal ◽

Online Social Media ◽

Web Platform

The Vaccon Sentiment Ontology (VASON) provides knowledge on the factors driving vaccine refusal by analyzing content of online social media. VASON facilitates concept extraction and analysis of the extracted concepts using an Natural Language Processing (NLP) module.

Download Full-text