Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing

Søren Jørgensen; Torsten Dau

doi:10.1121/1.3621502

Predicting speech intelligibility based on the envelope power signal‐to‐noise ratio after modulation‐frequency selective processing.

The Journal of the Acoustical Society of America ◽

10.1121/1.3587737 ◽

2011 ◽

Vol 129 (4) ◽

pp. 2384-2384 ◽

Cited By ~ 1

Author(s):

Torsten Dau ◽

Søren Jørgensen

Keyword(s):

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Modulation Frequency ◽

Signal To Noise ◽

Selective Processing ◽

Frequency Selective ◽

Noise Ratio


Effects of manipulating the signal-to-noise envelope power ratio on speech intelligibility

The Journal of the Acoustical Society of America ◽

10.1121/1.4908240 ◽

2015 ◽

Vol 137 (3) ◽

pp. 1401-1410 ◽

Cited By ~ 11

Author(s):

Søren Jørgensen ◽

Rémi Decorsière ◽

Torsten Dau

Keyword(s):

Speech Intelligibility ◽

Power Ratio ◽

Signal To Noise


Evaluation of an Adaptive Dynamic Compensation System in Cochlear Implant Listeners

Trends in Hearing ◽

10.1177/2331216520970349 ◽

2020 ◽

Vol 24 ◽

pp. 233121652097034

Author(s):

Florian Langner ◽

Andreas Büchner ◽

Waldo Nogueira

Keyword(s):

Signal Processing ◽

Cochlear Implant ◽

Speech Intelligibility ◽

Discrimination Task ◽

Signal To Noise Ratio ◽

Compensation System ◽

Signal To Noise ◽

Dynamic Compensation ◽

Front End ◽

Noise Ratio

Cochlear implant (CI) sound processing typically uses a front-end automatic gain control (AGC), reducing the acoustic dynamic range (DR) to control the output level and protect the signal processing against large amplitude changes. It can also introduce distortions into the signal and does not allow a direct mapping between acoustic input and electric output. For speech in noise, a reduction in DR can result in lower speech intelligibility due to compressed modulations of speech. This study proposes to implement a CI signal processing scheme consisting of a full acoustic DR with adaptive properties to improve the signal-to-noise ratio and overall speech intelligibility. Measurements based on the Short-Time Objective Intelligibility measure and an electrodogram analysis, as well as behavioral tests in up to 10 CI users, were used to compare performance with a single-channel, dual-loop, front-end AGC and with an adaptive back-end multiband dynamic compensation system (Voice Guard [VG]). Speech intelligibility in quiet and at a +10 dB signal-to-noise ratio was assessed with the Hochmair–Schulz–Moser sentence test. A logatome discrimination task with different consonants was performed in quiet. Speech intelligibility was significantly higher in quiet for VG than for AGC, but intelligibility was similar in noise. Participants obtained significantly better scores with VG than AGC in the logatome discrimination task. The objective measurements predicted significantly better performance estimates for VG. Overall, a dynamic compensation system can outperform a single-stage compression (AGC + linear compression) for speech perception in quiet.


Energetic and Informational Components of Speech-on-Speech Masking in Binaural Speech Intelligibility and Perceived Listening Effort

Trends in Hearing ◽

10.1177/2331216519854597 ◽

2019 ◽

Vol 23 ◽

pp. 233121651985459 ◽

Cited By ~ 8

Author(s):

Jan Rennies ◽

Virginia Best ◽

Elin Roverud ◽

Gerald Kidd

Keyword(s):

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Spatial Separation ◽

Signal To Noise ◽

Listening Effort ◽

Complex Sound ◽

Time Frequency ◽

Sound Fields ◽

Energetic Masking

Speech perception in complex sound fields can greatly benefit from different unmasking cues to segregate the target from interfering voices. This study investigated the role of three unmasking cues (spatial separation, gender differences, and masker time reversal) on speech intelligibility and perceived listening effort in normal-hearing listeners. Speech intelligibility and categorically scaled listening effort were measured for a female target talker masked by two competing talkers with no unmasking cues or one to three unmasking cues. In addition to natural stimuli, all measurements were also conducted with glimpsed speech—which was created by removing the time–frequency tiles of the speech mixture in which the maskers dominated the mixture—to estimate the relative amounts of informational and energetic masking as well as the effort associated with source segregation. The results showed that all unmasking cues as well as glimpsing improved intelligibility and reduced listening effort and that providing more than one cue was beneficial in overcoming informational masking. The reduction in listening effort due to glimpsing corresponded to increases in signal-to-noise ratio of 8 to 18 dB, indicating that a significant amount of listening effort was devoted to segregating the target from the maskers. Furthermore, the benefit in listening effort for all unmasking cues extended well into the range of positive signal-to-noise ratios at which speech intelligibility was at ceiling, suggesting that listening effort is a useful tool for evaluating speech-on-speech masking conditions at typical conversational levels.
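The glimpsing manipulation described in this abstract can be illustrated with a minimal sketch. It assumes the target and masker powers are known separately for every time–frequency tile and that a tile counts as masker-dominated when the local target-to-masker ratio is at or below 0 dB. The criterion value, the function names `glimpse_mask` and `apply_mask`, and the toy numbers are illustrative assumptions, not the authors' implementation.

```python
import math


def glimpse_mask(target_power, masker_power, criterion_db=0.0):
    """Return a binary mask over time-frequency tiles.

    A tile is kept (1) when the local target-to-masker ratio exceeds
    criterion_db, and removed (0) otherwise. The 0 dB default is an
    assumed criterion, not a value taken from the abstract.
    """
    mask = []
    for t_row, m_row in zip(target_power, masker_power):
        row = []
        for t, m in zip(t_row, m_row):
            if t <= 0.0:
                keep = False          # no target energy in this tile
            elif m <= 0.0:
                keep = True           # target present, masker absent
            else:
                keep = 10.0 * math.log10(t / m) > criterion_db
            row.append(1 if keep else 0)
        mask.append(row)
    return mask


def apply_mask(mixture, mask):
    """Zero out the mixture tiles that the mask marks as masker-dominated."""
    return [[x * k for x, k in zip(x_row, k_row)]
            for x_row, k_row in zip(mixture, mask)]


# Toy 2x2 example (rows: frequency bands, columns: time frames).
target = [[4.0, 1.0], [0.5, 9.0]]
masker = [[1.0, 2.0], [0.5, 3.0]]
mixture = [[5.0, 3.0], [1.0, 12.0]]
glimpsed = apply_mask(mixture, glimpse_mask(target, masker))
```

In this toy case only the two tiles where the target power exceeds the masker power survive. A tile with equal target and masker power is removed under the assumed strict criterion.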


An Evaluation of Output Signal to Noise Ratio as a Predictor of Cochlear Implant Speech Intelligibility

Ear and Hearing ◽

10.1097/aud.0000000000000556 ◽

2018 ◽

Vol 39 (5) ◽

pp. 958-968 ◽

Cited By ~ 3

Author(s):

Greg D. Watkins ◽

Brett A. Swanson ◽

Gregg J. Suaning

Keyword(s):

Cochlear Implant ◽

Output Signal ◽

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Signal To Noise ◽

Noise Ratio


Effect of Visual Cuing on Synthetic Speech Intelligibility: A Comparison of Native and Nonnative Speakers of English

Perceptual and Motor Skills ◽

10.2466/pms.1997.84.2.695 ◽

1997 ◽

Vol 84 (2) ◽

pp. 695-698 ◽

Cited By ~ 2

Author(s):

Mary E. Reynolds ◽

Donald Fucci ◽

Z. S. Bond

Keyword(s):

Background Noise ◽

Speech Intelligibility ◽

Native Speakers ◽

Nonnative Speakers ◽

Synthetic Speech ◽

Signal To Noise ◽

Noise Condition ◽

Nonnative Speakers Of English

This study compared the effect of visual cuing on the intelligibility of DECtalk for native and nonnative speakers of English, both in ideal listening conditions and in the presence of background noise at a signal-to-noise (S/N) ratio of +10 dB. Visual cuing improved DECtalk's intelligibility for nonnative speakers more than for native speakers, especially in the background noise condition. Implications of these findings and the need for further research are discussed.


A Test to Measure Subjective and Objective Speech Intelligibility

Journal of the American Academy of Audiology ◽

10.1055/s-0040-1715946 ◽

2002 ◽

Vol 13 (01) ◽

pp. 038-049 ◽

Cited By ~ 4

Author(s):

Gabrielle H. Saunders ◽

Kathleen M. Cienkowski

Keyword(s):

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Hearing Aid ◽

Clinical Tool ◽

Signal To Noise ◽

Speech Reception ◽

Measurement Signal ◽

Perceptual Performance ◽

Noise Ratio ◽

Performance Discrepancy

Measurement of hearing aid outcome is particularly difficult because there are numerous dimensions to consider (e.g., performance, satisfaction, benefit). Often there are discrepancies between scores in these dimensions. It is difficult to reconcile these discrepancies because the materials and formats used to measure each dimension are so very different. We report data obtained with an outcome measure that examines both objective and subjective dimensions with the same test format and materials and gives results in the same unit of measurement (signal-to-noise ratio). Two variables are measured: a “performance” speech reception threshold and a “perceptual” speech reception threshold. The signal-to-noise ratio difference between these is computed to determine the perceptual-performance discrepancy (PPDIS). The results showed that, on average, 48 percent of the variance in subjective ratings of a hearing aid could be explained by a combination of the performance speech reception threshold and the PPDIS. These findings suggest that the measure is potentially a valuable clinical tool.


Discrimination of Stochastic Frequency Modulation by Cochlear Implant Users

Journal of the American Academy of Audiology ◽

10.3766/jaaa.14067 ◽

2015 ◽

Vol 26 (06) ◽

pp. 572-581 ◽

Cited By ~ 3

Author(s):

Stanley Sheft ◽

Min-Yu Cheng ◽

Valeriy Shafiro

Keyword(s):

Speech Perception ◽

Cochlear Implant ◽

Stimulus Duration ◽

Frequency Modulation ◽

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Normal Hearing ◽

Signal To Noise ◽

Common Basis ◽

Low Rate

Background: Past work has shown that low-rate frequency modulation (FM) may help preserve signal coherence, aid segmentation at word and syllable boundaries, and benefit speech intelligibility in the presence of a masker. Purpose: This study evaluated whether difficulties in speech perception by cochlear implant (CI) users relate to a deficit in the ability to discriminate among stochastic low-rate patterns of FM. Research Design: This is a correlational study assessing the association between the ability to discriminate stochastic patterns of low-rate FM and the intelligibility of speech in noise. Study Sample: Thirteen postlingually deafened adult CI users participated in this study. Data Collection and Analysis: Using modulators derived from 5-Hz lowpass noise applied to a 1-kHz carrier, thresholds were measured in terms of frequency excursion both in quiet and with a speech-babble masker present, stimulus duration, and signal-to-noise ratio in the presence of a speech-babble masker. Speech perception ability was assessed in the presence of the same speech-babble masker. Relationships were evaluated with Pearson product–moment correlation analysis with correction for family-wise error, and commonality analysis to determine the unique and common contributions across psychoacoustic variables to the association with speech ability. Results: Significant correlations were obtained between masked speech intelligibility and three metrics of FM discrimination involving either signal-to-noise ratio or stimulus duration, with shared variance among the three measures accounting for much of the effect. Compared to past results from young normal-hearing adults and older adults with either normal hearing or a mild-to-moderate hearing loss, mean FM discrimination thresholds obtained from CI users were higher in all conditions. 
Conclusions: The ability to process the pattern of frequency excursions of stochastic FM may, in part, have a common basis with speech perception in noise. Discrimination of differences in the temporally distributed place coding of the stimulus could serve as this common basis for CI users.


Predicting binaural speech intelligibility using the signal-to-noise ratio in the envelope power spectrum domain

The Journal of the Acoustical Society of America ◽

10.1121/1.4954254 ◽

2016 ◽

Vol 140 (1) ◽

pp. 192-205 ◽

Cited By ~ 9

Author(s):

Alexandre Chabot-Leclerc ◽

Ewen N. MacDonald ◽

Torsten Dau

Keyword(s):

Power Spectrum ◽

Speech Intelligibility ◽

Signal To Noise Ratio ◽

Signal To Noise ◽

Noise Ratio


Design and Simulation of OFDM System Based on Matlab

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.635-637.1081 ◽

2014 ◽

Vol 635-637 ◽

pp. 1081-1085

Author(s):

Xin Xin Sha ◽

Jian Zhou ◽

Yuan Xue Song

Keyword(s):

Bit Error Rate ◽

Error Rate ◽

Channel Coding ◽

Signal To Noise Ratio ◽

Average Power ◽

System Structure ◽

Power Ratio ◽

Basic System ◽

Ofdm System ◽

Signal To Noise

OFDM is a key modulation and multiplexing technique. The paper first introduces the basic structure of an OFDM system. It then selects implementation schemes for channel coding, peak-to-average power ratio (PAPR) reduction, and synchronization of the OFDM system on the basis of minimum bit error rate (BER). Finally, the system is simulated in MATLAB and the BER is obtained at different signal-to-noise ratios (SNRs).
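To make the PAPR quantity named in this abstract concrete, the following minimal sketch computes the peak-to-average power ratio of one OFDM symbol after a naive inverse DFT. It is written in Python rather than MATLAB, uses no PAPR-reduction scheme, and the helper names `idft` and `papr_db` and the 8-subcarrier examples are illustrative assumptions, not the paper's simulation.

```python
import cmath
import math


def idft(symbols):
    """Naive inverse DFT: map subcarrier symbols to time-domain samples."""
    n = len(symbols)
    return [
        sum(s * cmath.exp(2j * math.pi * k * t / n)
            for k, s in enumerate(symbols)) / n
        for t in range(n)
    ]


def papr_db(samples):
    """Peak-to-average power ratio of a block of samples, in dB."""
    powers = [abs(x) ** 2 for x in samples]
    mean_power = sum(powers) / len(powers)
    return 10.0 * math.log10(max(powers) / mean_power)


# Worst case for 8 subcarriers: identical symbols add coherently,
# so the time-domain signal is a single impulse.
worst_case = papr_db(idft([1] * 8))

# A single active subcarrier gives a constant-envelope signal.
single_tone = papr_db(idft([1, 0, 0, 0, 0, 0, 0, 0]))
```

With N identical subcarrier symbols the PAPR reaches its maximum of 10·log10(N) dB, which is the kind of peak that PAPR-reduction schemes aim to lower. A single active subcarrier yields a PAPR of 0 dB.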
