Chapter 12. From pitch stylization to automatic tonal annotation of speech corpora

Rapid Collection of Spontaneous Speech Corpora Using Telephonic Community Forums

10.21437/interspeech.2018-1139 ◽

2018 ◽

Cited By ~ 3

Author(s):

Agha Ali Raza ◽

Awais Athar ◽

Shan Randhawa ◽

Zain Tariq ◽

Muhammad Bilal Saleem ◽

...

Keyword(s):

Spontaneous Speech ◽

Speech Corpora

Download Full-text

A cross-linguistic, longitudinal case study of pauses and interpausal units in spontaneous speech corpora of older speakers of German and French

10.21437/speechprosody.2018-43 ◽

2018 ◽

Cited By ~ 1

Author(s):

Annette Gerstenberg ◽

Susanne Fuchs ◽

Julie Marie Kairet ◽

Johannes Schröder ◽

Claudia Frankenberg

Keyword(s):

Spontaneous Speech ◽

Longitudinal Case Study ◽

Speech Corpora

Download Full-text

Prosodic phrasing in Russian spontaneous and read speech: evidence from large speech corpora

10.21437/speechprosody.2020-34 ◽

2020 ◽

Author(s):

Tatiana Kachkovskaia ◽

Pavel Skrelin

Keyword(s):

Prosodic Phrasing ◽

Speech Corpora ◽

Read Speech

Download Full-text

Stimmen: A citizen science approach to minority language sociolinguistics

Linguistics Vanguard ◽

10.1515/lingvan-2019-0017 ◽

2021 ◽

Vol 7 (s1) ◽

Author(s):

Nanna Haug Hilton

Keyword(s):

Citizen Science ◽

Smartphone Application ◽

Minority Language ◽

Phonological Variation ◽

Methodological Issues ◽

Speech Corpora ◽

Societal Context ◽

Smartphone Technology ◽

Language Areas

Abstract This paper presents the project Stimmen fan Fryslân ‘Voices of Fryslân’. The project relies on a smartphone application developed to involve local communities in the creation of speech corpora, particularly of lesser used languages. This paper lays out the scientific and societal context of the project, showcases the smartphone application and gives an overview of the results from the project that attracted more than 15,000 users. Some key methodological issues are considered, and the paper discusses the role of smartphone technology for citizen science in minority language areas while also showing new maps with distributions of lexical and phonological variation in Frisian.

Download Full-text

Analyzing dialect variation in historical speech corpora

The Journal of the Acoustical Society of America ◽

10.1121/1.4991009 ◽

2017 ◽

Vol 142 (1) ◽

pp. 406-421 ◽

Cited By ~ 3

Author(s):

Margaret E. L. Renwick ◽

Rachel M. Olsen

Keyword(s):

Speech Corpora ◽

Dialect Variation

Download Full-text

A Framework for Recording Audio-Visual Speech Corpora with a Microphone and a High-Speed Camera

Speech and Computer - Lecture Notes in Computer Science ◽

10.1007/978-3-319-11581-8_6 ◽

2014 ◽

pp. 50-57 ◽

Cited By ~ 4

Author(s):

Alexey Karpov ◽

Irina Kipyatkova ◽

Miloš Železný

Keyword(s):

High Speed ◽

Visual Speech ◽

High Speed Camera ◽

Speech Corpora

Download Full-text

CRF-Based Phrase Boundary Detection Trained on Large-Scale TTS Speech Corpora

Speech and Computer - Lecture Notes in Computer Science ◽

10.1007/978-3-319-66429-3_26 ◽

2017 ◽

pp. 272-281 ◽

Cited By ~ 1

Author(s):

Markéta Jůzová

Keyword(s):

Large Scale ◽

Boundary Detection ◽

Phrase Boundary ◽

Speech Corpora

Download Full-text

Emotion recognition of mandarin speech for different speech corpora based on nonlinear features

2012 IEEE 11th International Conference on Signal Processing ◽

10.1109/icosp.2012.6491552 ◽

2012 ◽

Cited By ~ 2

Author(s):

Hui Gao ◽

Shanguang Chen ◽

Ping An ◽

Guangchuan Su

Keyword(s):

Emotion Recognition ◽

Speech Corpora ◽

Nonlinear Features

Download Full-text

Analysis of the Influence of Speech Corpora in the PLDA Verification in the Task of Speaker Recognition

Text, Speech and Dialogue - Lecture Notes in Computer Science ◽

10.1007/978-3-642-32790-2_56 ◽

2012 ◽

pp. 464-471 ◽

Cited By ~ 3

Author(s):

Lukáš Machlica ◽

Zbyněk Zajíc

Keyword(s):

Speaker Recognition ◽

Speech Corpora

Download Full-text

Joint optimization on decoding graphs using minimum classification error criterion

APSIPA Transactions on Signal and Information Processing ◽

10.1017/atsip.2014.5 ◽

2014 ◽

Vol 3 ◽

Author(s):

Abdelaziz A. Abdelhamid ◽

Waleed H. Abdulla

Keyword(s):

Likelihood Estimation ◽

Discriminative Training ◽

Language Models ◽

Classification Error ◽

Error Criterion ◽

Speech Corpora ◽

Finite State ◽

Minimum Classification Error ◽

Speech Features ◽

Weighted Finite State Transducers

Motivated by the inherent correlation between the speech features and their lexical words, we propose in this paper a new framework for learning the parameters of the corresponding acoustic and language models jointly. The proposed framework is based on discriminative training of the models' parameters using minimum classification error criterion. To verify the effectiveness of the proposed framework, a set of four large decoding graphs is constructed using weighted finite-state transducers as a composition of two sets of context-dependent acoustic models and two sets of n-gram-based language models. The experimental results conducted on this set of decoding graphs validated the effectiveness of the proposed framework when compared with four baseline systems based on maximum likelihood estimation and separate discriminative training of acoustic and language models in benchmark testing of two speech corpora, namely TIMIT and RM1.

Download Full-text