Mongolian speech corpus for text-to-speech development

Author(s): Chatchawarn Hansakunbuntheung, Ausdang Thangthai, Nattanun Thatphithakkul, Altangerel Chagnaa

IFOST, 2013
Author(s): Chagnaa Altangerel, Kerey Esenbek, Jaimai Purev

Author(s): Marc Freixes, Francesc Alías, Joan Claudi Socoró

Abstract
Text-to-speech (TTS) synthesis systems have been widely used in general-purpose applications based on the generation of speech. Nonetheless, some domains, such as storytelling or voice-output aid devices, may also require singing. To enable a corpus-based TTS system to sing, a supplementary singing database would normally have to be recorded. This solution, however, might be too costly for occasional singing needs, or even unfeasible if the original speaker is unavailable or unable to sing properly. This work introduces a unit-selection-based text-to-speech-and-singing (US-TTS&S) synthesis framework that integrates speech-to-singing (STS) conversion to generate both speech and singing from an input text and a score, respectively, using the same neutral speech corpus. The viability of the proposal is evaluated on a proof-of-concept implementation using a 2.6-h Spanish neutral speech corpus, considering three vocal ranges and two tempos. The experiments show that challenging STS transformation factors are required to sing beyond the corpus vocal range and/or with notes longer than 150 ms. While score-driven US configurations allow the reduction of pitch-scale factors, time-scale factors are not reduced because of the short duration of the spoken vowels. Moreover, in the MUSHRA test, text-driven and score-driven US configurations obtain similar naturalness ratings of around 40 for all the analysed scenarios. Although these naturalness scores are far from those of Vocaloid, the singing scores of around 60 validate that the framework could reasonably address occasional singing needs.
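To make the abstract's point about transformation factors concrete, the following is a minimal sketch of how the pitch-scale and time-scale factors of a speech-to-singing conversion can be computed: the spoken vowel must be pitch-shifted to the target note's frequency and time-stretched to the note's duration. The function names and the example numbers are illustrative assumptions, not the paper's actual implementation; the standard MIDI-to-frequency formula (A4 = MIDI 69 = 440 Hz) is used for the note pitch.

```python
# Hypothetical illustration of speech-to-singing (STS) transformation
# factors: a spoken vowel is mapped onto a sung note by a pitch-scale
# (frequency ratio) and a time-scale (duration ratio). Illustrative
# sketch only, not the US-TTS&S framework's implementation.

def midi_to_hz(midi_note: int) -> float:
    """Convert a MIDI note number to frequency in Hz (A4 = 69 = 440 Hz)."""
    return 440.0 * 2.0 ** ((midi_note - 69) / 12.0)

def sts_factors(vowel_f0_hz: float, vowel_dur_ms: float,
                note_midi: int, note_dur_ms: float) -> tuple[float, float]:
    """Return (pitch_scale, time_scale) mapping a spoken vowel to a note."""
    pitch_scale = midi_to_hz(note_midi) / vowel_f0_hz
    time_scale = note_dur_ms / vowel_dur_ms
    return pitch_scale, time_scale

# Example: a spoken vowel at ~120 Hz lasting 80 ms, mapped onto a
# C4 note (MIDI 60, ~261.6 Hz) lasting 500 ms.
pitch, time = sts_factors(120.0, 80.0, 60, 500.0)
print(f"pitch-scale ~ {pitch:.2f}, time-scale ~ {time:.2f}")
```

With these assumed values the time-scale factor exceeds 6, which illustrates why the abstract reports that spoken vowels, being short, force large time-scale factors for notes longer than 150 ms even when score-driven unit selection reduces the pitch-scale factors.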


2019, Vol. 53 (3), pp. 419-447
Author(s): Humberto M. Torres, Jorge A. Gurlekian, Diego A. Evin, Christian G. Cossio Mercado

2012, Vol. 21 (2), pp. 60-71
Author(s): Ashley Alliano, Kimberly Herriger, Anthony D. Koutsoftas, Theresa E. Bartolotta

Abstract
Using the iPad tablet for Augmentative and Alternative Communication (AAC) purposes can address many communicative needs, is cost-effective, and is socially acceptable. Many individuals with communication difficulties can use iPad applications (apps) to augment communication, provide an alternative form of communication, or target receptive and expressive language goals. In this paper, we review a collection of iPad apps that can be used to address a variety of receptive and expressive communication needs. Based on recommendations from Gosnell, Costello, and Shane (2011), we systematically identified 21 apps, spanning those that use symbols only, symbols and text-to-speech, and text-to-speech only, and describe their features as a reference guide for speech-language pathologists. For each app we describe its purpose along with the following features: speech settings, representation, display, feedback features, rate enhancement, access, motor competencies, and cost. We explain how individuals with complex communication needs can use these apps for a variety of communication purposes and to target a variety of treatment goals, and we present the information in a user-friendly table format that clinicians can use as a reference guide.

