A Language Model for Misogyny Detection in Latin American Spanish Driven by Multisource Feature Extraction and Transformers

Applied Sciences ◽

10.3390/app112110467 ◽

2021 ◽

Vol 11 (21) ◽

pp. 10467

Author(s):

Edwin Aldana-Bobadilla ◽

Alejandro Molina-Villegas ◽

Yuridia Montelongo-Padilla ◽

Ivan Lopez-Arevalo ◽

Oscar S. Sordia

Keyword(s):

Deep Learning ◽

Latin American ◽

Data Augmentation ◽

Physical Violence ◽

Language Model ◽

Learning Models ◽

Exact Figure ◽

Latin American Spanish ◽

American Spanish ◽

Statistical Systems

Creating effective mechanisms to automatically detect misogyny online poses significant scientific and technological challenges. Misogyny is difficult to recognize with computer models because it is a subtle type of violence: it is not always explicitly aggressive, and it can hide behind seemingly flattering words, jokes, parodies, and other expressions. It is currently difficult even to obtain an exact figure for the rate of misogynistic comments online because, unlike other types of violence such as physical violence, these events are not registered by any statistical system. This research contributes to the development of models for the automatic detection of misogynistic texts in Latin American Spanish and, because deep learning models require a considerable amount of data, to the design of data augmentation methodologies.

A cross-cultural analysis of the Test of Memory Malingering among Latin American Spanish-speaking adults.

Law and Human Behavior ◽

10.1037/lhb0000250 ◽

2017 ◽

Vol 41 (5) ◽

pp. 422-428 ◽

Cited By ~ 6

Author(s):

Alicia Nijdam-Jones ◽

Diego Rivera ◽

Barry Rosenfeld ◽

Juan Carlos Arango-Lasprilla

Keyword(s):

Latin American ◽

Cultural Analysis ◽

Cross Cultural ◽

Test Of Memory Malingering ◽

Spanish Speaking ◽

Latin American Spanish ◽

American Spanish ◽

Cross Cultural Analysis

Levenshtein Augmentation Improves Performance of SMILES Based Deep-Learning Synthesis Prediction

10.26434/chemrxiv.12562121 ◽

2020 ◽

Author(s):

Dean Sumner ◽

Jiazhen He ◽

Amol Thakkar ◽

Ola Engkvist ◽

Esben Jannik Bjerrum

Keyword(s):

Neural Networks ◽

Pattern Recognition ◽

Deep Learning ◽

Recurrent Neural Networks ◽

Data Augmentation ◽

State Of The Art ◽

Sequence Similarity ◽

Learning Models ◽

Underlying Network

SMILES randomization, a form of data augmentation, has previously been shown to increase the performance of deep learning models compared to non-augmented baselines. Here, we propose a novel data augmentation method, which we call "Levenshtein augmentation", that considers local SMILES sub-sequence similarity between reactants and their respective products when creating training pairs. The performance of Levenshtein augmentation was tested using two state-of-the-art models: a transformer and a sequence-to-sequence recurrent neural network with attention. When used to train baseline models, Levenshtein augmentation increased performance over both non-augmented data and data augmented by conventional SMILES randomization. Furthermore, Levenshtein augmentation seemingly results in what we define as attentional gain, an enhancement in the pattern recognition capabilities of the underlying network with respect to molecular motifs.
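
The abstract names the Levenshtein (edit) distance but does not spell out how training pairs are selected, so the following is only a minimal sketch of the underlying idea, not the authors' procedure. It assumes that equivalent SMILES strings for a reactant are supplied by the caller (here a hand-written toy list for ethanol) and that similarity is measured character by character, which treats multi-character atom symbols such as `Cl` as two symbols. The helper name `closest_variant` is hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute, or keep on a match
            ))
        prev = curr
    return prev[-1]


def closest_variant(variants: list[str], target: str) -> str:
    """Hypothetical helper: pick the variant with the smallest edit distance to target."""
    return min(variants, key=lambda s: levenshtein(s, target))


# Toy example (an assumption for illustration): three equivalent SMILES
# strings for ethanol, paired with acetaldehyde as the product.
reactant_variants = ["OCC", "C(O)C", "CCO"]
product = "CC=O"
best = closest_variant(reactant_variants, product)
print(best, levenshtein(best, product))
```

In a real pipeline the candidate strings would come from a cheminformatics toolkit rather than a fixed list, and a token-level distance might be preferable to the character-level one used here.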

Validation of the Latin American‐Spanish version of the scale ‘Quality of Life in Life‐Threatening Illness–Family Caregiver Version’ (QOLLTI‐F)

Health & Social Care in the Community ◽

10.1111/hsc.13453 ◽

2021 ◽

Author(s):

Mauricio Arias‐Rojas ◽

Edith Arredondo Holgín ◽

Sonia Carreño Moreno ◽

Carolina Posada López ◽

Bertha Tellez

Keyword(s):

Quality Of Life ◽

Latin American ◽

Family Caregiver ◽

Spanish Version ◽

Life Threatening Illness ◽

Life Threatening ◽

Latin American Spanish ◽

American Spanish

Face-Name Associative Memory Exam--Latin American Spanish Version

PsycTESTS Dataset ◽

10.1037/t78967-000 ◽

2020 ◽

Author(s):

Clara Vila-Castelar ◽

Kathryn V. Papp ◽

Rebecca E. Amariglio ◽

Valeria L. Torres ◽

Ana Baena ◽

...

Keyword(s):

Associative Memory ◽

Latin American ◽

Spanish Version ◽

Latin American Spanish ◽

American Spanish

Data Augmentation for Improving Deep Learning Models in Building Inspections or Postdisaster Evaluation

Journal of Performance of Constructed Facilities ◽

10.1061/(asce)cf.1943-5509.0001594 ◽

2021 ◽

Vol 35 (4) ◽

Author(s):

Samuel Leach ◽

Yunhe Xue ◽

Rahul Sridhar ◽

Stephanie Paal ◽

Zhangyang Wang ◽

...

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Learning Models

Migrations and globalization: Their effects on contact varieties of Latin American Spanish

Español en Estados Unidos y otros contextos de contacto ◽

10.31819/9783865279033-004 ◽

2009 ◽

pp. 39-66 ◽

Cited By ~ 6

Keyword(s):

Latin American ◽

Latin American Spanish ◽

American Spanish

Introduction. Contemporary research on Latin American Spanish dialectology

Aspects of Latin American Spanish Dialectology - Issues in Hispanic and Lusophone Linguistics ◽

10.1075/ihll.32.int ◽

2021 ◽

pp. 1-8

Author(s):

Manuel Díaz-Campos ◽

Sandro Sessarego

Keyword(s):

Latin American ◽

Contemporary Research ◽

Latin American Spanish ◽

American Spanish

Assessing the measurement invariance of a Latin-American Spanish translation of the Body Appreciation Scale-2 in Mexican, Argentinean, and Colombian adolescents

Body Image ◽

10.1016/j.bodyim.2020.01.004 ◽

2020 ◽

Vol 32 ◽

pp. 180-189 ◽

Cited By ~ 7

Author(s):

Vanesa C. Góngora ◽

Verónica Cruz Licea ◽

Moises R. Mebarak Chams ◽

Tracey Thornborrow

Keyword(s):

Measurement Invariance ◽

Latin American ◽

The Body ◽

Body Appreciation ◽

Spanish Translation ◽

Latin American Spanish ◽

American Spanish

(Dis)continuity in language change: ser and estar + age in Latin-American Spanish

Linguistics in the Netherlands ◽

10.1075/avt.10.09jon ◽

1993 ◽

Vol 10 ◽

pp. 69-80

Author(s):

Bob de Jonge

Keyword(s):

Latin American ◽

Language Change ◽

Latin American Spanish ◽

American Spanish ◽

Ser And Estar

La variation dans les Constructions Verbales Figées de l’espagnol d’Amérique

Lingvisticae Investigationes ◽

10.1075/li.38.2.05mog ◽

2015 ◽

Vol 38 (2) ◽

pp. 276-300 ◽

Cited By ~ 2

Author(s):

Pedro Mogorrón Huerta

Keyword(s):

Latin American ◽

Complete Data ◽

Research Papers ◽

Data Bases ◽

Syntactic Structures ◽

Fixed Expressions ◽

Latin American Spanish ◽

American Spanish ◽

Peninsular Spanish ◽

Near Future

Traditionally, research papers on fixed expressions emphasize that those sequences are fixed, in contrast to constructions with free components. Following a 2010 study in which we showed that a considerable number of verbal fixed expressions in common Peninsular Spanish allow changes in some of their components without changing their meaning or losing their fixed status, this paper analyzes verbal fixed expressions in the Latin American Spanish variety. The analysis lets us observe the modes of variation in Latin American Spanish verbal fixed expressions (paradigmatic, lexical, morphological, grammatical), which follow the same patterns and syntactic structures as in common Peninsular Spanish. We find this variation both in diatopic expressions formed from the verbal fixed expressions of common Peninsular Spanish and in new diatopic verbal fixed expressions. The large number of verbal fixed expressions in the Latin American Spanish variety, a number that will only increase in the near future, reinforces the idea that we should build very complete databases.
