Cross-lingual Adaptation Using Universal Dependencies

Author(s):  
Nasrin Taghizadeh ◽  
Heshaam Faili

We describe a cross-lingual adaptation method based on syntactic parse trees obtained from Universal Dependencies (UD), which are consistent across languages, to develop classifiers in low-resource languages. The idea of UD parsing is to capture similarities as well as idiosyncrasies among typologically different languages. In this article, we show that models trained using UD parse trees for complex NLP tasks can characterize very different languages. We study paraphrase identification and relation extraction as case studies. Based on UD parse trees, we develop several models using tree kernels and show that these models, trained on English data, can correctly classify data from other languages, e.g., French, Farsi, and Arabic. The proposed approach opens up avenues for exploiting UD parsing in similar cross-lingual tasks, which is especially useful for languages with no available labeled data.
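The abstract rests on the observation that UD part-of-speech and dependency labels form a shared feature space across languages. A toy sketch of that idea (not the authors' actual tree-kernel implementation, which operates on full subtree structures) reduces each parse to a multiset of (head UPOS, deprel, dependent UPOS) triples and counts matches:

```python
# Illustrative simplification of a kernel over UD dependency trees: because UD
# labels are language-independent, an English and a French parse live in the
# same feature space. This is a sketch, not the paper's tree-kernel method.
from collections import Counter

def ud_triples(edges):
    """edges: list of (head_upos, deprel, dep_upos) tuples from a UD parse."""
    return Counter(edges)

def triple_kernel(tree_a, tree_b):
    """Count the (head, deprel, dependent) triples shared by two parses."""
    ca, cb = ud_triples(tree_a), ud_triples(tree_b)
    return sum(min(ca[t], cb[t]) for t in ca)

# English "She reads books" vs. French "Elle lit des livres" (hand-built parses)
en = [("VERB", "nsubj", "PRON"), ("VERB", "obj", "NOUN")]
fr = [("VERB", "nsubj", "PRON"), ("VERB", "obj", "NOUN"), ("NOUN", "det", "DET")]
print(triple_kernel(en, fr))  # 2 shared triples
```

A kernel of this shape can be plugged into an SVM trained on English parses and applied unchanged to parses of another language, which is the transfer setting the article studies.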

2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Chuanming Yu ◽  
Haodong Xue ◽  
Manyi Wang ◽  
Lu An

Purpose
Owing to the uneven distribution of annotated corpora among different languages, it is necessary to bridge the gap between low-resource and high-resource languages. From the perspective of entity relation extraction, this paper aims to extend the knowledge acquisition task from a single-language context to a cross-lingual context, and to improve relation extraction performance for low-resource languages.

Design/methodology/approach
This paper proposes a cross-lingual adversarial relation extraction (CLARE) framework, which decomposes cross-lingual relation extraction into parallel corpus acquisition and adversarial adaptation relation extraction. Based on the proposed framework, this paper conducts extensive experiments on two tasks, i.e. English-to-Chinese and English-to-Arabic cross-lingual entity relation extraction.

Findings
The Macro-F1 values of the optimal models in the two tasks are 0.8801 and 0.7899, respectively, indicating that the proposed CLARE framework can significantly improve the effectiveness of entity relation extraction in low-resource languages. The experimental results suggest that the framework can effectively transfer the corpus as well as the annotated tags from English to Chinese and Arabic. This study reveals that the proposed approach is less labour-intensive and more effective in cross-lingual entity relation extraction than the manual method, and that it generalizes well across languages.

Originality/value
The research results are of great significance for improving the performance of cross-lingual knowledge acquisition. Cross-lingual transfer may greatly reduce the time and cost of manually constructing multi-lingual corpora. It sheds light on knowledge acquisition and organization from unstructured text in the era of big data.
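A common core of adversarial adaptation, as used in frameworks of this kind, is a gradient reversal layer: the forward pass is the identity, while the backward pass flips and scales the gradient, pushing the feature extractor toward language-invariant representations that a language discriminator cannot separate. The sketch below illustrates that general mechanism, not the exact CLARE implementation:

```python
# Hedged sketch of a gradient reversal layer (GRL), the standard building block
# of adversarial domain/language adaptation. Forward: identity. Backward: the
# incoming gradient is negated and scaled by a trade-off weight lambda.
import numpy as np

class GradientReversal:
    def __init__(self, lam=1.0):
        self.lam = lam  # trade-off weight, often scheduled during training

    def forward(self, x):
        return x  # features pass through unchanged

    def backward(self, grad_output):
        # The discriminator's gradient is reversed before reaching the
        # feature extractor, so the extractor learns to confuse it.
        return -self.lam * grad_output

grl = GradientReversal(lam=0.5)
x = np.array([1.0, -2.0])
print(grl.forward(x))                      # unchanged features
print(grl.backward(np.array([0.4, 0.6])))  # [-0.2 -0.3]
```

In a full system, the relation classifier is trained on labeled English examples while the discriminator (behind the GRL) is trained to distinguish English from Chinese or Arabic inputs; at the optimum the shared encoder yields features useful for relation extraction in both languages.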


2021 ◽  
Author(s):  
Mengzhou Xia ◽  
Guoqing Zheng ◽  
Subhabrata Mukherjee ◽  
Milad Shokouhi ◽  
Graham Neubig ◽  
...  

2021 ◽  
Vol 1 (2) ◽  
Author(s):  
Garrett Nicolai ◽  
Edith Coates ◽  
Ming Zhang ◽  
Miika Silfverberg

We present an extension to the JHU Bible corpus, collecting and normalizing more than thirty Bible translations in thirty Indigenous languages of North America. These exhibit a wide variety of interesting syntactic and morphological phenomena that are understudied in the computational community. Neural translation experiments demonstrate significant gains obtained through cross-lingual, many-to-many translation, with improvements of up to 8.4 BLEU over monolingual models for extremely low-resource languages.
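Many-to-many multilingual translation of the kind described above is typically realized by training a single model on all language pairs, marking each source sentence with its target language. The tag format and function name below are illustrative, not taken from the paper:

```python
# Minimal sketch of the standard many-to-many NMT data preparation: prepend a
# target-language token to each source sentence so one shared model can
# translate between all pairs. Tag format "<2xxx>" is an assumption here.
def tag_for_target(src_sentence, tgt_lang):
    """Prefix the source with a target-language tag (ISO 639-3 code)."""
    return f"<2{tgt_lang}> {src_sentence}"

# Hypothetical training pairs into two Indigenous target languages
pairs = [("In the beginning", "crk"),   # Plains Cree, illustrative
         ("In the beginning", "iku")]   # Inuktitut, illustrative
for sent, lang in pairs:
    print(tag_for_target(sent, lang))
# <2crk> In the beginning
# <2iku> In the beginning
```

Training on pooled tagged data lets high-resource pairs share parameters with extremely low-resource ones, which is the mechanism behind the cross-lingual BLEU gains reported.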

