Restoration of Missing Voiced Speech Signal Segments

Author(s):  
Sarunas Paulikas
2007 · Vol 2007 · pp. 1-5
Author(s):  
Aïcha Bouzid ◽  
Noureddine Ellouze

This paper describes a multiscale product method (MPM) for measuring the open quotient in voiced speech. The method is based on determining the glottal closing and opening instants. The proposed approach multiplies the wavelet transforms of the speech signal at different scales in order to enhance edge detection and parameter estimation. We show that the proposed method is effective and robust for detecting speech singularities. Accurate estimation of glottal closing instants (GCIs) and glottal opening instants (GOIs) is important in a wide range of speech processing tasks. In this paper, accurate estimation of GCIs and GOIs is used to measure the local open quotient (Oq), the ratio of the open time to the pitch period. The multiscale product operates automatically on the speech signal; a reference electroglottogram (EGG) signal is used for performance evaluation. The rate of correct GCI detection is 95.5% and that of GOI detection is 76%. The pitch period relative error is 2.6% and the open phase relative error is 5.6%. The relative error measured on the open quotient reaches 3% over the whole Keele database.
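The core operation, multiplying wavelet-transform coefficients of the speech signal across a few scales so that true singularities (glottal closures) reinforce while noise cancels, can be sketched as follows. This is an illustrative stand-in, not the authors' implementation: it uses smoothed Gaussian first derivatives in place of the paper's wavelet, and a synthetic pulse train in place of real speech.

```python
import numpy as np
from scipy.ndimage import gaussian_filter1d
from scipy.signal import lfilter

def multiscale_product(x, scales=(1, 2, 4)):
    # Multiply smoothed first derivatives of x across scales: edges present
    # at every scale (glottal closures) reinforce, incoherent noise cancels.
    p = np.ones_like(x)
    for s in scales:
        p = p * gaussian_filter1d(x, sigma=s, order=1)
    return p

# Synthetic "voiced" signal: a decaying exponential restarted every
# pitch period of 80 samples, plus additive noise.
period = 80
pulses = np.zeros(800)
pulses[::period] = 1.0
x = lfilter([1.0], [1.0, -0.9], pulses)
x += 0.05 * np.random.default_rng(0).standard_normal(x.size)

p = multiscale_product(x)
# Candidate glottal closing instants: samples where |p| is large.
cands = np.flatnonzero(np.abs(p) > 0.5 * np.abs(p).max())
```

On this toy signal the largest product magnitudes line up with the abrupt "closures" at multiples of the pitch period, which is the property the MPM exploits for GCI/GOI detection.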


2014 · Vol 8 (1) · pp. 508-511
Author(s):  
Zhongbao Chen ◽  
Zhigang Fang ◽  
Jie Xu ◽  
Pengying Du ◽  
Xiaoping Luo

Speech can be broadly categorized into voiceless, voiced, and mute signals, and voiced speech can be further classified into vowels and voiced consonants. With the ever-increasing demand for speech synthesis applications, an effective method for distinguishing vowel from voiced-consonant signals is urgently needed, since they are two distinct components that affect the naturalness of synthetic speech. State-of-the-art algorithms for speech signal classification are effective at separating voiceless, voiced, and mute speech, but not at further classifying the voiced signal. In view of this issue, a new speech classification algorithm based on the Gaussian mixture model (GMM) is proposed, which directly classifies speech into voiceless, voiced-consonant, vowel, and mute signals. Simulation results demonstrate that the proposed algorithm is effective even in noisy environments.


2012 · Vol 532-533 · pp. 1253-1257
Author(s):  
Li Hai Yao ◽  
Jie Xu ◽  
Hao Jiang

Speech can be broadly categorized into voiceless, voiced, and mute signals, and voiced speech can be further classified into vowels and voiced consonants. With the ever-increasing demand for speech synthesis applications, an effective method for distinguishing vowel from voiced-consonant signals is urgently needed, since they are two distinct components that affect the naturalness of synthetic speech. State-of-the-art algorithms for speech signal classification are effective at separating voiceless, voiced, and mute speech, but not at further classifying the voiced signal. In view of this issue, a new speech classification algorithm based on the Gaussian mixture model (GMM) is proposed, which directly classifies speech into voiceless, voiced-consonant, vowel, and mute signals. Specifically, a new speech feature is proposed, and the GMM is also modified for speech classification. Simulation results demonstrate that the proposed algorithm is effective even in noisy environments.
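The underlying classification scheme, fit one generative model per class and label a feature vector by maximum likelihood, can be sketched as below. This is a minimal stand-in under assumed details: the paper's actual feature and GMM modification are not reproduced, so the sketch uses synthetic 2-D features and a single diagonal Gaussian per class (the one-component case of a GMM).

```python
import numpy as np

# Toy 2-D frame features (stand-ins for e.g. log-energy and zero-crossing
# rate), drawn around a distinct mean for each of the four classes.
rng = np.random.default_rng(1)
class_means = {
    "mute": (0.0, 0.0),
    "voiceless": (0.0, 4.0),
    "voiced_consonant": (4.0, 2.0),
    "vowel": (6.0, 0.0),
}
train = {c: rng.normal(m, 0.5, size=(200, 2)) for c, m in class_means.items()}

def fit_gaussian(X):
    # One diagonal Gaussian per class: the single-component case of a GMM.
    return X.mean(axis=0), X.var(axis=0) + 1e-6

models = {c: fit_gaussian(X) for c, X in train.items()}

def log_likelihood(x, mu, var):
    # Diagonal-covariance Gaussian log-density.
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mu) ** 2 / var)

def classify(x):
    # Maximum-likelihood decision over the four class models.
    return max(models, key=lambda c: log_likelihood(x, *models[c]))
```

A full GMM classifier would replace `fit_gaussian` with EM over several mixture components per class; the decision rule stays the same.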


Author(s):  
Martin Chavant ◽  
Alexis Hervais-Adelman ◽  
Olivier Macherey

Purpose: An increasing number of individuals with residual or even normal contralateral hearing are being considered for cochlear implantation. It remains unknown whether the presence of contralateral hearing is beneficial or detrimental to their perceptual learning of cochlear implant (CI)–processed speech. The aim of this experiment was to provide a first insight into this question using acoustic simulations of CI processing.

Method: Sixty normal-hearing listeners took part in an auditory perceptual learning experiment. Each subject was randomly assigned to one of three groups of 20, referred to as NORMAL, LOWPASS, and NOTHING. The experiment consisted of two test phases separated by a training phase. In the test phases, all subjects were tested on recognition of monosyllabic words passed through a six-channel "PSHC" vocoder presented to a single ear. In the training phase, which consisted of listening to a 25-min audio book, all subjects were also presented with the same vocoded speech in one ear, but the signal they received in their other ear differed across groups. The NORMAL group was presented with the unprocessed speech signal, the LOWPASS group with a low-pass filtered version of the speech signal, and the NOTHING group with no sound at all.

Results: The improvement in speech scores following training was significantly smaller for the NORMAL group than for the LOWPASS and NOTHING groups.

Conclusions: This study suggests that the presentation of normal speech in the contralateral ear reduces or slows down perceptual learning of vocoded speech, but that an unintelligible low-pass filtered contralateral signal does not have this effect. Potential implications for the rehabilitation of CI patients with partial or full contralateral hearing are discussed.
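The acoustic simulation in such experiments, passing speech through a multichannel vocoder, follows a standard recipe: band-pass analysis, temporal-envelope extraction, and resynthesis on carriers. The sketch below is a generic six-channel noise vocoder with assumed, illustrative details only; it is not the PSHC vocoder used in the study, and the band edges and filter orders are arbitrary choices.

```python
import numpy as np
from scipy.signal import butter, sosfiltfilt, hilbert

def noise_vocode(x, fs, n_channels=6, lo=100.0, hi=7000.0):
    # Generic noise vocoder: split x into log-spaced bands, extract each
    # band's envelope, and impose it on a noise carrier in the same band.
    edges = np.geomspace(lo, hi, n_channels + 1)
    rng = np.random.default_rng(0)
    out = np.zeros_like(x)
    for f1, f2 in zip(edges[:-1], edges[1:]):
        sos = butter(4, [f1, f2], btype="band", fs=fs, output="sos")
        band = sosfiltfilt(sos, x)
        envelope = np.abs(hilbert(band))  # temporal envelope of the band
        carrier = sosfiltfilt(sos, rng.standard_normal(x.size))
        out = out + envelope * carrier
    return out

fs = 16000
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 220 * t) * np.exp(-3.0 * t)  # toy one-second token
y = noise_vocode(x, fs)
```

The output preserves the per-band envelopes but discards fine temporal structure, which is what makes vocoded speech a useful simulation of CI processing for normal-hearing listeners.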


2011 · Vol 21 (2) · pp. 44-54
Author(s):  
Kerry Callahan Mandulak

Spectral moment analysis (SMA) is an acoustic analysis tool that shows promise for enhancing our understanding of normal and disordered speech production. It can augment auditory-perceptual analysis used to investigate differences across speakers and groups and can provide unique information regarding specific aspects of the speech signal. The purpose of this paper is to illustrate the utility of SMA as a clinical measure for both clinical speech production assessment and research applications documenting speech outcome measurements. Although acoustic analysis has become more readily available and accessible, clinicians need training with, and exposure to, acoustic analysis methods in order to integrate them into traditional methods used to assess speech production.
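Spectral moment analysis treats the magnitude spectrum of a windowed frame as a probability distribution over frequency and summarizes it with its first four moments: centroid, variance, skewness, and kurtosis. A minimal sketch, assuming a Hann window and power-spectrum weighting (details on which SMA implementations vary):

```python
import numpy as np

def spectral_moments(frame, fs):
    # Normalize the power spectrum of one windowed frame to a distribution
    # over frequency, then compute its first four moments.
    spec = np.abs(np.fft.rfft(frame * np.hanning(frame.size))) ** 2
    freqs = np.fft.rfftfreq(frame.size, d=1.0 / fs)
    p = spec / spec.sum()
    m1 = np.sum(freqs * p)                             # centroid (Hz)
    m2 = np.sum((freqs - m1) ** 2 * p)                 # variance (Hz^2)
    m3 = np.sum((freqs - m1) ** 3 * p) / m2 ** 1.5     # skewness
    m4 = np.sum((freqs - m1) ** 4 * p) / m2 ** 2 - 3   # excess kurtosis
    return m1, m2, m3, m4

# Sanity check on a pure tone: the centroid should sit at the tone frequency.
fs = 16000
t = np.arange(512) / fs
m1, m2, m3, m4 = spectral_moments(np.sin(2 * np.pi * 3000.0 * t), fs)
```

For fricatives, which SMA is most often applied to, the centroid and skewness of the noise spectrum are the measures typically reported.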

