Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing

2011 ◽  
Vol 130 (3) ◽  
pp. 1475-1487 ◽  
Author(s):  
Søren Jørgensen ◽  
Torsten Dau
2015 ◽  
Vol 137 (3) ◽  
pp. 1401-1410 ◽  
Author(s):  
Søren Jørgensen ◽  
Rémi Decorsière ◽  
Torsten Dau

2020 ◽  
Vol 24 ◽  
pp. 233121652097034
Author(s):  
Florian Langner ◽  
Andreas Büchner ◽  
Waldo Nogueira

Cochlear implant (CI) sound processing typically uses a front-end automatic gain control (AGC), reducing the acoustic dynamic range (DR) to control the output level and protect the signal processing against large amplitude changes. It can also introduce distortions into the signal and does not allow a direct mapping between acoustic input and electric output. For speech in noise, a reduction in DR can result in lower speech intelligibility due to compressed modulations of speech. This study proposes to implement a CI signal processing scheme consisting of a full acoustic DR with adaptive properties to improve the signal-to-noise ratio and overall speech intelligibility. Measurements based on the Short-Time Objective Intelligibility measure and an electrodogram analysis, as well as behavioral tests in up to 10 CI users, were used to compare performance with a single-channel, dual-loop, front-end AGC and with an adaptive back-end multiband dynamic compensation system (Voice Guard [VG]). Speech intelligibility in quiet and at a +10 dB signal-to-noise ratio was assessed with the Hochmair–Schulz–Moser sentence test. A logatome discrimination task with different consonants was performed in quiet. Speech intelligibility was significantly higher in quiet for VG than for AGC, but intelligibility was similar in noise. Participants obtained significantly better scores with VG than AGC in the logatome discrimination task. The objective measurements predicted significantly better performance estimates for VG. Overall, a dynamic compensation system can outperform a single-stage compression (AGC + linear compression) for speech perception in quiet.


2019 ◽  
Vol 23 ◽  
pp. 233121651985459 ◽  
Author(s):  
Jan Rennies ◽  
Virginia Best ◽  
Elin Roverud ◽  
Gerald Kidd

Speech perception in complex sound fields can greatly benefit from different unmasking cues to segregate the target from interfering voices. This study investigated the role of three unmasking cues (spatial separation, gender differences, and masker time reversal) on speech intelligibility and perceived listening effort in normal-hearing listeners. Speech intelligibility and categorically scaled listening effort were measured for a female target talker masked by two competing talkers with no unmasking cues or one to three unmasking cues. In addition to natural stimuli, all measurements were also conducted with glimpsed speech—which was created by removing the time–frequency tiles of the speech mixture in which the maskers dominated the mixture—to estimate the relative amounts of informational and energetic masking as well as the effort associated with source segregation. The results showed that all unmasking cues as well as glimpsing improved intelligibility and reduced listening effort and that providing more than one cue was beneficial in overcoming informational masking. The reduction in listening effort due to glimpsing corresponded to increases in signal-to-noise ratio of 8 to 18 dB, indicating that a significant amount of listening effort was devoted to segregating the target from the maskers. Furthermore, the benefit in listening effort for all unmasking cues extended well into the range of positive signal-to-noise ratios at which speech intelligibility was at ceiling, suggesting that listening effort is a useful tool for evaluating speech-on-speech masking conditions at typical conversational levels.


1997 ◽  
Vol 84 (2) ◽  
pp. 695-698 ◽  
Author(s):  
Mary E. Reynolds ◽  
Donald Fucci ◽  
Z. S. Bond

This study compared the effect of visual cuing on the intelligibility of DECtalk for native and nonnative speakers of English in both ideal listening conditions and in the presence of background noise at a signal to noise (S/N) ratio of + 10dB. Visual cuing improved DECtalk's intelligibility for normative speakers more than for native speakers, especially in the background noise condition. Implications of these findings and the need for further research are discussed.


2002 ◽  
Vol 13 (01) ◽  
pp. 038-049 ◽  
Author(s):  
Gabrielle H. Saunders ◽  
Kathleen M. Cienkowski

Measurement of hearing aid outcome is particularly difficult because there are numerous dimensions to consider (e.g., performance, satisfaction, benefit). Often there are discrepancies between scores in these dimensions. It is difficult to reconcile these discrepancies because the materials and formats used to measure each dimension are so very different. We report data obtained with an outcome measure that examines both objective and subjective dimensions with the same test format and materials and gives results in the same unit of measurement (signal-to-noise ratio). Two variables are measured: a “performance” speech reception threshold and a “perceptual” speech reception threshold. The signal-to-noise ratio difference between these is computed to determine the perceptual-performance discrepancy (PPDIS). The results showed that, on average, 48 percent of the variance in subjective ratings of a hearing aid could be explained by a combination of the performance speech reception threshold and the PPDIS. These findings suggest that the measure is potentially a valuable clinical tool.


2015 ◽  
Vol 26 (06) ◽  
pp. 572-581 ◽  
Author(s):  
Stanley Sheft ◽  
Min-Yu Cheng ◽  
Valeriy Shafiro

Background: Past work has shown that low-rate frequency modulation (FM) may help preserve signal coherence, aid segmentation at word and syllable boundaries, and benefit speech intelligibility in the presence of a masker. Purpose: This study evaluated whether difficulties in speech perception by cochlear implant (CI) users relate to a deficit in the ability to discriminate among stochastic low-rate patterns of FM. Research Design: This is a correlational study assessing the association between the ability to discriminate stochastic patterns of low-rate FM and the intelligibility of speech in noise. Study Sample: Thirteen postlingually deafened adult CI users participated in this study. Data Collection and Analysis: Using modulators derived from 5-Hz lowpass noise applied to a 1-kHz carrier, thresholds were measured in terms of frequency excursion both in quiet and with a speech-babble masker present, stimulus duration, and signal-to-noise ratio in the presence of a speech-babble masker. Speech perception ability was assessed in the presence of the same speech-babble masker. Relationships were evaluated with Pearson product–moment correlation analysis with correction for family-wise error, and commonality analysis to determine the unique and common contributions across psychoacoustic variables to the association with speech ability. Results: Significant correlations were obtained between masked speech intelligibility and three metrics of FM discrimination involving either signal-to-noise ratio or stimulus duration, with shared variance among the three measures accounting for much of the effect. Compared to past results from young normal-hearing adults and older adults with either normal hearing or a mild-to-moderate hearing loss, mean FM discrimination thresholds obtained from CI users were higher in all conditions. Conclusions: The ability to process the pattern of frequency excursions of stochastic FM may, in part, have a common basis with speech perception in noise. Discrimination of differences in the temporally distributed place coding of the stimulus could serve as this common basis for CI users.


2014 ◽  
Vol 635-637 ◽  
pp. 1081-1085
Author(s):  
Xin Xin Sha ◽  
Jian Zhou ◽  
Yuan Xue Song

OFDM is a key modulation and multiplexing technique. The basic system structure of OFDM is introduced firstly. This paper chose appropriate implementation schemes for channel coding, PAPR(Peak To Average Power Ratio) reducing and synchronization of the OFDM system based on the minimum BER(Bit Error Rate). Finally, the paper realized the simulation and got the BER in different SNR(Signal To Noise Ratio) in the matlab environment .


Sign in / Sign up

Export Citation Format

Share Document