Chapter 12. From pitch stylization to automatic tonal annotation of speech corpora

2019 ◽  
pp. 233-250
Author(s):  
Piet Mertens
Keyword(s):  
Author(s):  
Agha Ali Raza ◽  
Awais Athar ◽  
Shan Randhawa ◽  
Zain Tariq ◽  
Muhammad Bilal Saleem ◽  
...  

2021 ◽  
Vol 7 (s1) ◽  
Author(s):  
Nanna Haug Hilton

Abstract This paper presents the project Stimmen fan Fryslân ‘Voices of Fryslân’. The project relies on a smartphone application developed to involve local communities in the creation of speech corpora, particularly of lesser used languages. This paper lays out the scientific and societal context of the project, showcases the smartphone application and gives an overview of the results from the project that attracted more than 15,000 users. Some key methodological issues are considered, and the paper discusses the role of smartphone technology for citizen science in minority language areas while also showing new maps with distributions of lexical and phonological variation in Frisian.


2017 ◽  
Vol 142 (1) ◽  
pp. 406-421 ◽  
Author(s):  
Margaret E. L. Renwick ◽  
Rachel M. Olsen

Author(s):  
Abdelaziz A. Abdelhamid ◽  
Waleed H. Abdulla

Motivated by the inherent correlation between the speech features and their lexical words, we propose in this paper a new framework for learning the parameters of the corresponding acoustic and language models jointly. The proposed framework is based on discriminative training of the models' parameters using minimum classification error criterion. To verify the effectiveness of the proposed framework, a set of four large decoding graphs is constructed using weighted finite-state transducers as a composition of two sets of context-dependent acoustic models and two sets of n-gram-based language models. The experimental results conducted on this set of decoding graphs validated the effectiveness of the proposed framework when compared with four baseline systems based on maximum likelihood estimation and separate discriminative training of acoustic and language models in benchmark testing of two speech corpora, namely TIMIT and RM1.


Sign in / Sign up

Export Citation Format

Share Document