Applying scalable phonetic context similarity in unit selection of concatenative text-to-speech

Author(s):  
Wei Zhang ◽  
Xiaodong Cui
Gipan ◽  
2019 ◽  
Vol 4 ◽  
pp. 106-116
Author(s):  
Roop Shree Ratna Bajracharya ◽  
Santosh Regmi ◽  
Bal Krishna Bal ◽  
Balaram Prasain

Text-to-Speech (TTS) synthesis has come far from its primitive synthetic monotone voices to more natural and intelligible sounding voices. One of the direct applications of a natural sounding TTS systems is the screen reader applications for the visually impaired and the blind community. The Festival Speech Synthesis System uses a concatenative speech synthesis method together with the unit selection process to generate a natural sounding voice. This work primarily gives an account of the efforts put towards developing a Natural sounding TTS system for Nepali using the Festival system. We also shed light on the issues faced and the solutions derived which can be quite overlapping across other similar under-resourced languages in the region.


2009 ◽  
Vol 55 (2) ◽  
pp. 613-621 ◽  
Author(s):  
Sotiris Karabetsos ◽  
Pirros Tsiakoulis ◽  
Aimilios Chalamandaris ◽  
Spyros Raptis

2011 ◽  
Vol 19 (22) ◽  
pp. 6633-6638 ◽  
Author(s):  
John A. Kalaitzis ◽  
Qian Cheng ◽  
Dario Meluzzi ◽  
Longkuan Xiang ◽  
Miho Izumikawa ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document