Consonant recognition in noise with temporal cues. III. Effects of temporal envelope enhancement on identification thresholds

2001 ◽  
Vol 109 (5) ◽  
pp. 2468-2468
Author(s):  
Frederic Apoux ◽  
Stephane Garnier ◽  
Christian Lorenzi
1999 ◽  
Vol 105 (3) ◽  
pp. 1801-1809 ◽  
Author(s):  
René van der Horst ◽  
A. Rens Leeuw ◽  
Wouter A. Dreschler

1992 ◽  
Vol 92 (3) ◽  
pp. 1247-1257 ◽  
Author(s):  
Dianne J. Van Tasell ◽  
Donna G. Greenfield ◽  
Joelle J. Logemann ◽  
David A. Nelson

2021 ◽  
Vol 15 ◽  
Author(s):  
Zhong Zheng ◽  
Keyi Li ◽  
Gang Feng ◽  
Yang Guo ◽  
Yinan Li ◽  
...  

Objectives: Mandarin-speaking users of cochlear implants (CI) perform poorer than their English counterpart. This may be because present CI speech coding schemes are largely based on English. This study aims to evaluate the relative contributions of temporal envelope (E) cues to Mandarin phoneme (including vowel, and consonant) and lexical tone recognition to provide information for speech coding schemes specific to Mandarin.Design: Eleven normal hearing subjects were studied using acoustic temporal E cues that were extracted from 30 continuous frequency bands between 80 and 7,562 Hz using the Hilbert transform and divided into five frequency regions. Percent-correct recognition scores were obtained with acoustic E cues presented in three, four, and five frequency regions and their relative weights calculated using the least-square approach.Results: For stimuli with three, four, and five frequency regions, percent-correct scores for vowel recognition using E cues were 50.43–84.82%, 76.27–95.24%, and 96.58%, respectively; for consonant recognition 35.49–63.77%, 67.75–78.87%, and 87.87%; for lexical tone recognition 60.80–97.15%, 73.16–96.87%, and 96.73%. For frequency region 1 to frequency region 5, the mean weights in vowel recognition were 0.17, 0.31, 0.22, 0.18, and 0.12, respectively; in consonant recognition 0.10, 0.16, 0.18, 0.23, and 0.33; in lexical tone recognition 0.38, 0.18, 0.14, 0.16, and 0.14.Conclusion: Regions that contributed most for vowel recognition was Region 2 (502–1,022 Hz) that contains first formant (F1) information; Region 5 (3,856–7,562 Hz) contributed most to consonant recognition; Region 1 (80–502 Hz) that contains fundamental frequency (F0) information contributed most to lexical tone recognition.


1991 ◽  
Vol 34 (2) ◽  
pp. 415-426 ◽  
Author(s):  
Richard L. Freyman ◽  
G. Patrick Nerbonne ◽  
Heather A. Cote

This investigation examined the degree to which modification of the consonant-vowel (C-V) intensity ratio affected consonant recognition under conditions in which listeners were forced to rely more heavily on waveform envelope cues than on spectral cues. The stimuli were 22 vowel-consonant-vowel utterances, which had been mixed at six different signal-to-noise ratios with white noise that had been modulated by the speech waveform envelope. The resulting waveforms preserved the gross speech envelope shape, but spectral cues were limited by the white-noise masking. In a second stimulus set, the consonant portion of each utterance was amplified by 10 dB. Sixteen subjects with normal hearing listened to the unmodified stimuli, and 16 listened to the amplified-consonant stimuli. Recognition performance was reduced in the amplified-consonant condition for some consonants, presumably because waveform envelope cues had been distorted. However, for other consonants, especially the voiced stops, consonant amplification improved recognition. Patterns of errors were altered for several consonant groups, including some that showed only small changes in recognition scores. The results indicate that when spectral cues are compromised, nonlinear amplification can alter waveform envelope cues for consonant recognition.


2012 ◽  
Vol 43 (01) ◽  
Author(s):  
L Timm ◽  
D Agrawal ◽  
M Wittfoth ◽  
R Dengler

2021 ◽  
Vol 27 (5) ◽  
pp. 2566-2576
Author(s):  
Erwin Wu ◽  
Mitski Piekenbrock ◽  
Takuto Nakumura ◽  
Hideki Koike

Sign in / Sign up

Export Citation Format

Share Document