Spatial Release From Masking Using Clinical Corpora: Sentence Recognition In a Colocated or Spatially Separated Speech Masker

Author(s):  
Grant King ◽  
Nicole E. Corbin ◽  
Lori J. Leibold ◽  
Emily Buss

Background: Speech recognition in complex multisource environments is challenging, particularly forlisteners with hearing loss. One source of difficulty is the reduced ability of listeners with hearing loss tobenefit from spatial separation of the target and masker, an effect called spatial release from masking(SRM). Despite the prevalence of complex multisource environments in everyday life, SRM is not routinelyevaluated in the audiology clinic.<br />Purpose: The purpose of this study was to demonstrate the feasibility of assessing SRM in adults usingwidely available tests of speech-in-speech recognition that can be conducted using standard clinicalequipment.<br />Research Design: Participants were 22 young adults with normal hearing. The task was masked sentencerecognition, using each of five clinically available corpora with speech maskers. The target alwayssounded like it originated from directly in front of the listener, and the masker either sounded like it originatedfrom the front (colocated with the target) or from the side (separated from the target). In the realspatial manipulation conditions, source location was manipulated by routing the target and masker toeither a single speaker or to two speakers: one directly in front of the participant, and one mountedin an adjacent corner, 90° to the right. In the perceived spatial separation conditions, the target andmasker were presented from both speakers with delays that made them sound as if they were eithercolocated or separated.<br />Results: With real spatial manipulations, the mean SRM ranged from 7.1 to 11.4 dB, depending on thespeech corpus. With perceived spatial manipulations, the mean SRM ranged from 1.8 to 3.1 dB. Whereasreal separation improves the signal-to-noise ratio in the ear contralateral to the masker, SRM in the perceivedspatial separation conditions is based solely on interaural timing cues.<br />Conclusions: The finding of robust SRM with widely available speech corpora supports the feasibility ofmeasuring this important aspect of hearing in the audiology clinic. The finding of a small but significantSRM in the perceived spatial separation conditions suggests that modified materials could be used toevaluate the use of interaural timing cues specifically.<br />

2020 ◽  
Vol 31 (04) ◽  
pp. 271-276
Author(s):  
Grant King ◽  
Nicole E. Corbin ◽  
Lori J. Leibold ◽  
Emily Buss

Abstract Background Speech recognition in complex multisource environments is challenging, particularly for listeners with hearing loss. One source of difficulty is the reduced ability of listeners with hearing loss to benefit from spatial separation of the target and masker, an effect called spatial release from masking (SRM). Despite the prevalence of complex multisource environments in everyday life, SRM is not routinely evaluated in the audiology clinic. Purpose The purpose of this study was to demonstrate the feasibility of assessing SRM in adults using widely available tests of speech-in-speech recognition that can be conducted using standard clinical equipment. Research Design Participants were 22 young adults with normal hearing. The task was masked sentence recognition, using each of five clinically available corpora with speech maskers. The target always sounded like it originated from directly in front of the listener, and the masker either sounded like it originated from the front (colocated with the target) or from the side (separated from the target). In the real spatial manipulation conditions, source location was manipulated by routing the target and masker to either a single speaker or to two speakers: one directly in front of the participant, and one mounted in an adjacent corner, 90° to the right. In the perceived spatial separation conditions, the target and masker were presented from both speakers with delays that made them sound as if they were either colocated or separated. Results With real spatial manipulations, the mean SRM ranged from 7.1 to 11.4 dB, depending on the speech corpus. With perceived spatial manipulations, the mean SRM ranged from 1.8 to 3.1 dB. Whereas real separation improves the signal-to-noise ratio in the ear contralateral to the masker, SRM in the perceived spatial separation conditions is based solely on interaural timing cues. Conclusions The finding of robust SRM with widely available speech corpora supports the feasibility of measuring this important aspect of hearing in the audiology clinic. The finding of a small but significant SRM in the perceived spatial separation conditions suggests that modified materials could be used to evaluate the use of interaural timing cues specifically.


2018 ◽  
Vol 61 (2) ◽  
pp. 428-435 ◽  
Author(s):  
Navin Viswanathan ◽  
Kostas Kokkinakis ◽  
Brittany T. Williams

Purpose The purpose of this study was to evaluate whether listeners with normal hearing perceiving noise-vocoded speech-in-speech demonstrate better intelligibility of target speech when the background speech was mismatched in language (linguistic release from masking [LRM]) and/or location (spatial release from masking [SRM]) relative to the target. We also assessed whether the spectral resolution of the noise-vocoded stimuli affected the presence of LRM and SRM under these conditions. Method In Experiment 1, a mixed factorial design was used to simultaneously manipulate the masker language (within-subject, English vs. Dutch), the simulated masker location (within-subject, right, center, left), and the spectral resolution (between-subjects, 6 vs. 12 channels) of noise-vocoded target–masker combinations presented at +25 dB signal-to-noise ratio (SNR). In Experiment 2, the study was repeated using a spectral resolution of 12 channels at +15 dB SNR. Results In both experiments, listeners' intelligibility of noise-vocoded targets was better when the background masker was Dutch, demonstrating reliable LRM in all conditions. The pattern of results in Experiment 1 was not reliably different across the 6- and 12-channel noise-vocoded speech. Finally, a reliable spatial benefit (SRM) was detected only in the more challenging SNR condition (Experiment 2). Conclusion The current study is the first to report a clear LRM benefit in noise-vocoded speech-in-speech recognition. Our results indicate that this benefit is available even under spectrally degraded conditions and that it may augment the benefit due to spatial separation of target speech and competing backgrounds.


2019 ◽  
Author(s):  
Ysabel Domingo ◽  
Emma Holmes ◽  
Ewan Macpherson ◽  
Ingrid Johnsrude

The ability to segregate simultaneous speech streams is crucial for successful communication. Recent studies have demonstrated that participants can report 10–20% more words spoken by naturally familiar (e.g., friends or spouses) than unfamiliar talkers in two-voice mixtures. This benefit is commensurate with one of the largest benefits to speech intelligibility currently known—that gained by spatially separating two talkers. However, because of differences in the methods of these previous studies, the relative benefits of spatial separation and voice familiarity are unclear. Here, we directly compared the familiar-voice benefit and spatial release from masking, and examined if and how these two cues interact with one another. We recorded talkers speaking sentences from a published closed-set “matrix” task and then presented listeners with three different sentences played simultaneously. Each target sentence was played at 0° azimuth, and two masker sentences were symmetrically separated about the target. On average, participants reported 10–30% more words correctly when the target sentence was spoken in a familiar than unfamiliar voice (collapsed over spatial separation conditions); we found that participants gain a similar benefit from a familiar target as when an unfamiliar voice is separated from two symmetrical maskers by approximately 15° azimuth.


2021 ◽  
Vol 29 (2) ◽  
Author(s):  
Lubna Bushara ◽  
Mohamed Yousef ◽  
Ikhlas Abdelaziz ◽  
Mogahid Zidan ◽  
Dalia Bilal ◽  
...  

This study aimed to determine the measurements of the cochlea among healthy subjects and hearing deafness subjects using a High Resolution Computed Tomography (HRCT). A total of 230 temporal bone HRCT cases were retrospectively investigated in the period spanning from 2011 to 2015. Three 64-slice units were used to examine patients with clinical complaints of hearing loss conditions at three Radiology departments in Khartoum, Sudan. For the control group (A) healthy subjects, the mean width of the right and left cochlear were 5.61±0.40 mm and 5.56±0.58 mm, the height were 3.56±0.36 mm and 3.54±0.36 mm, the basal turn width were 1.87±0.19 mm and 1.88 ±0.18 mm, the width of the cochlear nerve canal were 2.02±1.23 and 1.93±0.20, cochlear nerve density was 279.41±159.02 and 306.84±336.9 HU respectively. However, for the experimental group (B), the mean width of the right and left cochlear width were 5.38±0.46 mm and 5.34±0.30 mm, the height were 3.53±0.25 mm and 3.49±0.28mm, the basal turn width were 1.76±0.13 mm, and 1.79±0.13 mm, the width of the cochlear nerve canal were 1.75±0.18mm and 1.73±0.18mm, and cochlear nerve density were 232.84±316.82 and 196.58±230.05 HU, respectively. The study found there was a significant difference in cochlea’s measurement between the two groups with a p-value < 0.05. This study had established baseline measurements for the cochlear for the healthy Sudanese population. Furthermore, it found that HRCT of the temporal bone was the best for investigation of the cochlear and could provide a guide for the clinicians to manage congenital hearing loss.


OTO Open ◽  
2019 ◽  
Vol 3 (2) ◽  
pp. 2473974X1984185 ◽  
Author(s):  
Hilary McCrary ◽  
Veronica del Calvo ◽  
Jeremy Purser ◽  
Geoff Casazza ◽  
Albert Park

Objective Antioxidants have been used as a therapeutic measure for several causes of hearing loss, and this study aims to examine the use of antioxidants in children with congenital cytomegalovirus (cCMV)–related hearing loss. Study Design Case-control study. Setting Academic pediatric hospital. Subjects and Methods A retrospective chart review of pediatric patients with cCMV-related hearing loss treated with and without antioxidants (vitamins A, C, and E and magnesium, known as ACE-Mg) was completed. The primary end point was the mean change in hearing thresholds for the right and left ears after therapy. An evaluation of the mean change in thresholds was evaluated at the following frequencies: 500 Hz, 1000 Hz, 2000 Hz, and 4000 Hz. A 2-sample t test and multiple linear regression were used to evaluate the data. Results A total of 78 children with cCMV-related hearing loss were included in the study, of whom 10 were treated with antioxidants. The average amount of time in which antioxidants were taken was 387 days. When comparing cases and controls, there was no differences in the mean change of hearing thresholds at each frequency for both the right and left ears ( P > .05). Length of antioxidant therapy and age at which therapy was initiated had no effect on hearing scores ( P > .05). Conclusions Oxidative stress plays a role in the pathogenesis of cCMV-related hearing loss. ACE-Mg is a safe adjuvant therapy for the treatment of hearing loss in children; however, this study demonstrates no hearing-related benefit from ACE-Mg antioxidant therapy.


2003 ◽  
Vol 12 (1) ◽  
pp. 41-51 ◽  
Author(s):  
Paula Henry ◽  
Todd Ricketts

Improving the signal-to-noise ratio (SNR) for individuals with hearing loss who are listening to speech in noise provides an obvious benefit. Although binaural hearing provides the greatest advantage over monaural hearing in noise, some individuals with symmetrical hearing loss choose to wear only one hearing aid. The present study tested the hypothesis that individuals with symmetrical hearing loss fit with one hearing aid would demonstrate improved speech recognition in background noise with increases in head turn. Fourteen individuals were fit monaurally with a Starkey Gemini in-the-ear (ITE) hearing aid with directional and omnidirectional microphone modes. Speech recognition performance in noise was tested using the audiovisual version of the Connected Speech Test (CST v.3). The test was administered in auditory-only conditions as well as with the addition of visual cues for each of three head angles: 0°, 20°, and 40°. Results indicated improvement in speech recognition performance with changes in head angle for the auditory-only presentation mode at the 20° and 40° head angles when compared to 0°. Improvement in speech recognition performance for the auditory + visual mode was noted for the 20° head angle when compared to 0°. Additionally, a decrement in speech recognition performance for the auditory + visual mode was noted for the 40° head angle when compared to 0°. These results support a speech recognition advantage for listeners fit with one ITE hearing aid listening in a close listener-to-speaker distance when they turn their head slightly in order to increase signal intensity.


2012 ◽  
Vol 23 (03) ◽  
pp. 171-181 ◽  
Author(s):  
Rachel A. McArdle ◽  
Mead Killion ◽  
Monica A. Mennite ◽  
Theresa H. Chisolm

Background: The decision to fit one or two hearing aids in individuals with binaural hearing loss has been debated for years. Although some 78% of U.S. hearing aid fittings are binaural (Kochkin , 2010), Walden and Walden (2005) presented data showing that 82% (23 of 28 patients) of their sample obtained significantly better speech recognition in noise scores when wearing one hearing aid as opposed to two. Purpose: To conduct two new experiments to fuel the monaural/binaural debate. The first experiment was a replication of Walden and Walden (2005), whereas the second experiment examined the use of binaural cues to improve speech recognition in noise. Research Design: A repeated measures experimental design. Study Sample: Twenty veterans (aged 59–85 yr), with mild to moderately severe binaurally symmetrical hearing loss who wore binaural hearing aids were recruited from the Audiology Department at the Bay Pines VA Healthcare System. Data Collection and Analysis: Experiment 1 followed the procedures of the Walden and Walden study, where signal-to-noise ratio (SNR) loss was measured using the Quick Speech-in-Noise (QuickSIN) test on participants who were aided with their current hearing aids. Signal and noise were presented in the sound booth at 0° azimuth under five test conditions: (1) right ear aided, (2) left ear aided, (3) both ears aided, (4) right ear aided, left ear plugged, and (5) unaided. The opposite ear in (1) and (2) was left open. In Experiment 2, binaural Knowles Electronics Manikin for Acoustic Research (KEMAR) manikin recordings made in Lou Malnati's pizza restaurant during a busy period provided a typical real-world noise, while prerecorded target sentences were presented through a small loudspeaker located in front of the KEMAR manikin. Subjects listened to the resulting binaural recordings through insert earphones under the following four conditions: (1) binaural, (2) diotic, (3) monaural left, and (4) monaural right. Results: Results of repeated measures ANOVAs demonstrated that the best speech recognition in noise performance was obtained by most participants with both ears aided in Experiment 1 and in the binaural condition in Experiment 2. Conclusions: In both experiments, only 20% of our subjects did better in noise with a single ear, roughly similar to the earlier Jerger et al (1993) finding that 8–10% of elderly hearing aid users preferred one hearing aid.


2020 ◽  
Vol 14 (2) ◽  
pp. 106-114
Author(s):  
Abdullahi Musa Kirfi ◽  
◽  
Mohammed Bello Fufore ◽  
Garba Mohammed Mainasara ◽  
Abdulrazak Ajiya ◽  
...  

Background: Prisoners, due to confinement are isolated from contact with the society and access to many of the facilities, including medical care. This study aimed to evaluate the hearing threshold of inmates of Kaduna convict prison. Method: It was a cross-sectional study of prison inmates at the Kaduna convict prison between April 2017 and February 2019. Ethical approvals were obtained from relevant bodies and all consented inmates aged 18 – 55years in the Kaduna convict prison were enrolled. Equal number of control matched for age and gender were enrolled from the communities in Kaduna North Local Government Area. Data were collated using a structured questionnaire. A diagnostic Pure Tone Audiometry was performed to assess their hearing threshold. Statistical Package for Social Sciences (SPSS) version 20.0 was used for analysis. Results: Four hundred and thirty inmates and equal number of control group were enrolled. The mean age for the inmates and controls were 30.2±7.5 and 30.4±8.02 years respectively. There were 383 males and 47 females in both groups. Among the 860 ears of the inmates, 238(27.7%) ears had hearing loss while in the control group, 95/ (11.1%) ears had impaired hearing. Conductive hearing loss was the commonest among the inmates 111(46.6%) while sensorineural was commoner among the controls 57(60.0%). The mean pure tone average among the inmates was 25.6±11.3dBHL and 26.1±11.2dBHL on the right and left ears respectively while in the control group, it was 18.4±7.8dBHL on the right and 17.9±7.9dBHL on the left. Conclusion: This study revealed that hearing loss was more prevalent among prison inmates than the general population. In majority of the inmates, the hearing loss was mild, conductive and mostly affecting all the frequencies.


2018 ◽  
Vol 27 (4) ◽  
pp. 529-538 ◽  
Author(s):  
Kasey M. Jakien ◽  
Frederick J. Gallun

Purpose The purpose of this study is to report normative data and predict thresholds for a rapid test of spatial release from masking for speech perception. The test is easily administered and has good repeatability, with the potential to be used in clinics and laboratories. Normative functions were generated for adults varying in age and amounts of hearing loss. Method The test of spatial release presents a virtual auditory scene over headphones with 2 conditions: colocated (with target and maskers at 0°) and spatially separated (with target at 0° and maskers at ± 45°). Listener thresholds are determined as target-to-masker ratios, and spatial release from masking (SRM) is determined as the difference between the colocated condition and spatially separated condition. Multiple linear regression was used to fit the data from 82 adults 18–80 years of age with normal to moderate hearing loss (0–40 dB HL pure-tone average [PTA]). The regression equations were then used to generate normative functions that relate age (in years) and hearing thresholds (as PTA) to target-to-masker ratios and SRM. Results Normative functions were able to predict thresholds with an error of less than 3.5 dB in all conditions. In the colocated condition, the function included only age as a predictive parameter, whereas in the spatially separated condition, both age and PTA were included as parameters. For SRM, PTA was the only significant predictor. Different functions were generated for the 1st run, the 2nd run, and the average of the 2 runs. All 3 functions were largely similar in form, with the smallest error being associated with the function on the basis of the average of 2 runs. Conclusion With the normative functions generated from this data set, it would be possible for a researcher or clinician to interpret data from a small number of participants or even a single patient without having to first collect data from a control group, substantially reducing the time and resources needed. Supplemental Material https://doi.org/10.23641/asha.7080878


Sign in / Sign up

Export Citation Format

Share Document