Comparing Different Approaches to Treat Translation Ambiguity in CLIR: Structured Queries vs. Target Co-occurrence Based Selection

Author(s):  
Xabier Saralegi ◽  
Maddalen Lopez de Lacalle
2013 ◽  
Author(s):  
Katharine Donnelly Adams ◽  
Janet G. Van Hell ◽  
Natasha Tokowicz

2016 ◽  
Vol 6 (3) ◽  
pp. 290-307 ◽  
Author(s):  
Tamar Degani ◽  
Anat Prior ◽  
Chelsea M. Eddington ◽  
Ana B. Arêas da Luz Fontes ◽  
Natasha Tokowicz

Abstract Ambiguity in translation is highly prevalent, and has consequences for second-language learning and for bilingual lexical processing. To better understand this phenomenon, the current study compared the determinants of translation ambiguity across four sets of translation norms from English to Spanish, Dutch, German and Hebrew. The number of translations an English word received was correlated across these different languages, and was also correlated with the number of senses the word has in English, demonstrating that translation ambiguity is partially determined by within-language semantic ambiguity. For semantically-ambiguous English words, the probability of the different translations in Spanish and Hebrew was predicted by the meaning-dominance structure in English, beyond the influence of other lexical and semantic factors, for bilinguals translating from their L1, and translating from their L2. These findings are consistent with models postulating direct access to meaning from L2 words for moderately-proficient bilinguals.


2019 ◽  
Vol 9 (16) ◽  
pp. 3318
Author(s):  
Azmat Anwar ◽  
Xiao Li ◽  
Yating Yang ◽  
Yajuan Wang

Although considerable effort has been devoted to building commonsense knowledge bases (CKB), it is still not available for many low-resource languages such as Uyghur because of expensive construction cost. Focusing on this issue, we proposed a cross-lingual knowledge-projection method to construct an Uyghur CKB by projecting ConceptNet’s Chinese facts into Uyghur. We used a Chinese–Uyghur bilingual dictionary to get high-quality entity translation in facts and employed a back-translation method to eliminate the entity-translation ambiguity. Moreover, to tackle the inner relation ambiguity in translated facts, we made a hand-crafted rule to convert the structured facts into natural-language phrases and built the Chinese–Uyghur lingual phrases based on the similarity of phrases that corresponded to the bilingual semantic similarity scoring model. Experimental results show that the accuracy of our semantic similarity scoring model reached 94.75% for our task, and they successfully project 55,872 Chinese facts into Uyghur as well as obtain 67,375 Uyghur facts within a very short period.


2018 ◽  
Vol 10 (4) ◽  
pp. 559-586 ◽  
Author(s):  
Dana M. Basnight-Brown ◽  
Stephanie A. Kazanas ◽  
Jeanette Altarriba

Abstract Research focused on the cognitive processes surrounding bilingual language representation has revealed the important role that translation ambiguity plays in how languages are stored in memory (Tokowicz & Kroll, 2007). In addition, translation of emotionally related information has been shown to be challenging because a direct translation does not always exist (Basnight-Brown & Altarriba, 2014). The focus of the current study was to explore the processing of ambiguous words for translations that differ in orthography. In Experiment 1, Chinese-English bilinguals translated concrete and abstract words that differed in the number of translations across languages. In Experiment 2, emotion words were introduced into the context, in order to examine differences in emotion translation across languages. The results revealed that words with a single translation were produced faster and more accurately than words that had multiple translations. Finally, translation of emotional stimuli was faster when translating Chinese words as compared to English words.


2014 ◽  
Author(s):  
Natasha Tokowicz ◽  
Alba Tuninetti ◽  
Tessa Warren ◽  
Karla Rivera-Torres

2012 ◽  
Vol 16 (2) ◽  
pp. 458-474 ◽  
Author(s):  
ANAT PRIOR ◽  
JUDITH F. KROLL ◽  
BRIAN MACWHINNEY

We investigated the influence of word class and translation ambiguity on cross-linguistic representation and processing. Bilingual speakers of English and Spanish performed translation production and translation recognition tasks on nouns and verbs in both languages. Words either had a single translation or more than one translation. Translation probability, as determined by normative data, was the strongest predictor of translation production and translation recognition, after controlling for psycholinguistic variables. Word class did not explain additional variability in translation performance, raising the possibility that previous findings of differences between nouns and verbs might be attributed to the greater translation ambiguity of verbs relative to nouns. Proficiency in the second language was associated with quicker and more successful production of translations for ambiguous words, and with more accurate recognition of translations for ambiguous words. Working memory capacity was related to the speed of recognizing low probability translations for ambiguous words. These results underscore the importance of considering translation ambiguity in research on bilingual lexical and conceptual knowledge.


2010 ◽  
Vol 32 (1) ◽  
pp. 93-111 ◽  
Author(s):  
ANAT PRIOR ◽  
SHULY WINTNER ◽  
BRIAN MACWHINNEY ◽  
ALON LAVIE

ABSTRACTWe compare translations of single words, made by bilingual speakers in a laboratory setting, with contextualized translation choices of the same items, made by professional translators and extracted from parallel language corpora. The translation choices in both cases show moderate convergence, demonstrating that decontextualized translation probabilities partially reflect bilinguals’ life experience regarding the conditional distributions of alternative translations. Lexical attributes of the target word differ in their ability to predict translation probability: form similarity is a stronger predictor in decontextualized translation choice, whereas word frequency and semantic salience are stronger predictors for context-embedded translation choice. These findings establish the utility of parallel language corpora as important tools in psycholinguistic investigations of bilingual language processing.


Sign in / Sign up

Export Citation Format

Share Document