spoken language translation Latest Research Papers

Abstract Multimodal machine translation involves drawing information from more than one modality, based on the assumption that the additional modalities will contain useful alternative views of the input data. The most prominent tasks in this area are spoken language translation, image-guided translation, and video-guided translation, which exploit audio and visual modalities, respectively. These tasks are distinguished from their monolingual counterparts of speech recognition, image captioning, and video captioning by the requirement of models to generate outputs in a different language. This survey reviews the major data resources for these tasks, the evaluation campaigns concentrated around them, the state of the art in end-to-end and pipeline approaches, and also the challenges in performance evaluation. The paper concludes with a discussion of directions for future research in these areas: the need for more expansive and challenging datasets, for targeted evaluations of model performance, and for multimodality in both the input and output space.

Social Applications of Speech-Translation Technology

The Oxford Handbook of Translation and Social Practices ◽ 10.1093/oxfordhb/9780190067205.013.6 ◽ 2020 ◽ pp. 559-586

Author(s): Mark Seligman

Keyword(s): Speech Recognition ◽ Language Translation ◽ Spoken Language ◽ Speech Translation ◽ Spoken Language Translation ◽ Education Academic ◽ Translation Accuracy ◽ Social Applications ◽ Translation Systems ◽ Current Systems

Automatic spoken language translation has finally entered widespread use. Still emerging, however, are speech-translation systems directed at various demanding and socially significant use cases. Because speech-recognition and -translation technologies remain error-prone, speech-translation output is often below the threshold of usability when accuracy is essential; and present use is still largely restricted to areas like social networking or travel in which no representation concerning accuracy is demanded. This chapter, while recognizing the importance of continued improvements in speech-recognition and -translation accuracy per se, aims to support the conviction that the path toward widespread use of socially substantial spoken language translation can be shortened by emphasizing reliability (accountability and user confidence) and customization (tight adaptation for the targeted use case). Examined here are several socially significant speech-translation systems which aim to meet those requirements, with focus upon current systems in (1) healthcare and (2) presentations for education, academic conferences, and government.

Re-Translation Strategies for Long Form, Simultaneous, Spoken Language Translation

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽ 10.1109/icassp40776.2020.9054585 ◽ 2020

Author(s): Naveen Arivazhagan ◽ Colin Cherry ◽ I Te ◽ Wolfgang Macherey ◽ Pallavi Baljekar ◽ ...

Keyword(s): Language Translation ◽ Spoken Language ◽ Translation Strategies ◽ Long Form ◽ Spoken Language Translation

Proceedings of the 17th International Conference on Spoken Language Translation

10.18653/v1/2020.iwslt-1 ◽ 2020

Keyword(s): Language Translation ◽ Spoken Language ◽ International Conference ◽ Spoken Language Translation

Adapting Transformer to End-to-End Spoken Language Translation

10.21437/interspeech.2019-3045 ◽ 2019 ◽ Cited By ~ 5

Author(s): Mattia A. Di Gangi ◽ Matteo Negri ◽ Marco Turchi

Keyword(s): Language Translation ◽ Spoken Language ◽ Spoken Language Translation ◽ End To End

Bilingual Prosodic Dataset Compilation for Spoken Language Translation

10.21437/iberspeech.2018-5 ◽ 2018 ◽ Cited By ~ 1

Author(s): Alp Öktem ◽ Mireia Farrús ◽ Antonio Bonafonte

Keyword(s): Language Translation ◽ Spoken Language ◽ Spoken Language Translation

Gender aware spoken language translation applied to English-Arabic

2018 2nd International Conference on Natural Language and Speech Processing (ICNLSP) ◽ 10.1109/icnlsp.2018.8374387 ◽ 2018 ◽ Cited By ~ 1

Author(s): Mostafa Elaraby ◽ Ahmed Y. Tawfik ◽ Mahmoud Khaled ◽ Hany Hassan ◽ Aly Osama

Keyword(s): Language Translation ◽ Spoken Language ◽ Spoken Language Translation

spoken language translation
Recently Published Documents

Jointly Trained Transformers Models for Spoken Language Translation

Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021)

SLTEV: Comprehensive Evaluation of Spoken Language Translation

Multimodal machine translation through visuals and speech

Social Applications of Speech-Translation Technology

Re-Translation Strategies for Long Form, Simultaneous, Spoken Language Translation

Proceedings of the 17th International Conference on Spoken Language Translation

Adapting Transformer to End-to-End Spoken Language Translation

Bilingual Prosodic Dataset Compilation for Spoken Language Translation

Gender aware spoken language translation applied to English-Arabic
