Speech intelligibility model including room and loudspeaker influences

L. Faiget; R. Ruiz

doi:10.1121/1.424663

Binaural Speech Intelligibility in a Real Elementary Classroom

Journal of Physics Conference Series ◽

10.1088/1742-6596/2069/1/012165 ◽

2021 ◽

Vol 2069 (1) ◽

pp. 012165

Author(s):

G Minelli ◽

G E Puglisi ◽

A Astolfi ◽

C Hauth ◽

A Warzybok

Keyword(s):

Speech Intelligibility ◽

Noise Source ◽

Signal To Noise Ratio ◽

Spatial Separation ◽

Elementary Classrooms ◽

Acoustic Environment ◽

Acoustic Measures ◽

Speech Reception ◽

Speech Intelligibility Model ◽

Speech Reception Thresholds

Abstract Since the fundamental phases of the learning process take place in elementary classrooms, it is necessary to guarantee a proper acoustic environment for the listening activity to children immersed in them. In this framework, speech intelligibility is especially important. In order to better understand and objectively quantify the effect of background noise and reverberation on speech intelligibility various models have been developed. Here, a binaural speech intelligibility model (BSIM) is investigated for speech intelligibility predictions in a real classroom considering the effect of talker-to-listener distance and binaural unmasking due to the spatial separation of noise and speech source. BSIM predictions are compared to the well-established room acoustic measures as reverberation time (T30), clarity or definition. Objective acoustical measurements were carried out in one Italian primary school classroom before (T30= 1.43s±0.03 s) and after (T30= 0.45±0.02 s) the acoustical treatment. Speech reception thresholds (SRTs) corresponding to signal-to-noise ratio yielding 80% of speech intelligibility will be obtained through the BSIM simulations using the measured binaural room impulse responses (BRIRs). A focus on the effect of different speech and noise source spatial positions on the SRT values will aim to show the importance of a model able to deal with the binaural aspects of the auditory system. In particular, it will be observed how the position of the noise source influences speech intelligibility when the target speech source lies always in the same position.

Download Full-text

Revision, extension, and evaluation of a binaural speech intelligibility model

The Journal of the Acoustical Society of America ◽

10.1121/1.3295575 ◽

2010 ◽

Vol 127 (4) ◽

pp. 2479-2497 ◽

Cited By ~ 75

Author(s):

Rainer Beutelmann ◽

Thomas Brand ◽

Birger Kollmeier

Keyword(s):

Speech Intelligibility ◽

Speech Intelligibility Model

Download Full-text

Measurement and Prediction of Binaural-Temporal Integration of Speech Reflections

Trends in Hearing ◽

10.1177/2331216519854267 ◽

2019 ◽

Vol 23 ◽

pp. 233121651985426

Author(s):

Jan Rennies ◽

Anna Warzybok ◽

Thomas Brand ◽

Birger Kollmeier

Keyword(s):

Speech Recognition ◽

Speech Intelligibility ◽

Temporal Integration ◽

Variable Number ◽

Interaural Phase ◽

Speech Intelligibility Model ◽

Speech Information ◽

Early Late ◽

Late Part ◽

Phase Differences

For speech intelligibility in rooms, the temporal integration of speech reflections is typically modeled by separating the room impulse response (RIR) into an early (assumed beneficial for speech intelligibility) and a late part (assumed detrimental). This concept was challenged in this study by employing binaural RIRs with systematically varied interaural phase differences (IPDs) and amplitude of the direct sound and a variable number of reflections delayed by up to 200 ms. Speech recognition thresholds in stationary noise were measured in normal-hearing listeners for 86 conditions. The data showed that direct sound and one or several early speech reflections could be perfectly integrated when they had the same IPD. Early reflections with the same IPD as the noise (but not as the direct sound) could not be perfectly integrated with the direct sound. All conditions in which the dominant speech information was within the early RIR components could be well predicted by a binaural speech intelligibility model using classic early/late separation. In contrast, when amplitude or IPD favored late RIR components, listeners appeared to be capable of focusing on these components rather than on the precedent direct sound. This could not be modeled by an early/late separation window but required a temporal integration window that can be flexibly shifted along the RIR.

Download Full-text

A Family With Autosomal-Dominant Progressive Sensorineural Hearing Loss

American Journal of Audiology ◽

10.1044/1059-0889.0501.23 ◽

1996 ◽

Vol 5 (1) ◽

pp. 23-32 ◽

Cited By ~ 3

Author(s):

Chris Halpin ◽

Barbara Herrmann ◽

Margaret Whearty

Keyword(s):

Speech Production ◽

Hearing Aids ◽

Role Models ◽

Speech Intelligibility ◽

Large Scale ◽

Speech Language Pathology ◽

The Family ◽

Patient Will ◽

Language Pathology

The family described in this article provides an unusual opportunity to relate findings from genetic, histological, electrophysiological, psychophysical, and rehabilitative investigation. Although the total number evaluated is large (49), the known, living affected population is smaller (14), and these are spread from age 20 to age 59. As a result, the findings described above are those of a large-scale case study. Clearly, more data will be available through longitudinal study of the individuals documented in the course of this investigation but, given the slow nature of the progression in this disease, such studies will be undertaken after an interval of several years. The general picture presented to the audiologist who must rehabilitate these cases is that of a progressive cochlear degeneration that affects only thresholds at first, and then rapidly diminishes speech intelligibility. The expected result is that, after normal language development, the patient may accept hearing aids well, encouraged by the support of the family. Performance and satisfaction with the hearing aids is good, until the onset of the speech intelligibility loss, at which time the patient will encounter serious difficulties and may reject hearing aids as unhelpful. As the histological and electrophysiological results indicate, however, the eighth nerve remains viable, especially in the younger affected members, and success with cochlear implantation may be expected. Audiologic counseling efforts are aided by the presence of role models and support from the other affected members of the family. Speech-language pathology services were not considered important by the members of this family since their speech production developed normally and has remained very good. Self-correction of speech was supported by hearing aids and cochlear implants (Case 5’s speech production was documented in Perkell, Lane, Svirsky, & Webster, 1992). These patients received genetic counseling and, due to the high penetrance of the disease, exhibited serious concerns regarding future generations and the hope of a cure.

Download Full-text

Comparison of In-the-Ear and Over-the-Ear Hearing Aid Fittings

Journal of Speech and Hearing Disorders ◽

10.1044/jshd.5104.362 ◽

1986 ◽

Vol 51 (4) ◽

pp. 362-369 ◽

Cited By ~ 4

Author(s):

Donna M. Risberg ◽

Robyn M. Cox

Keyword(s):

Hearing Aids ◽

High Frequency ◽

Comparative Evaluation ◽

Speech Intelligibility ◽

Hearing Aid ◽

Hearing Aid Fitting ◽

The Difference ◽

Functional Gain ◽

The Relationship

A custom in-the-ear (ITE) hearing aid fitting was compared to two over-the-ear (OTE) hearing aid fittings for each of 9 subjects with mild to moderately severe hearing losses. Speech intelligibility via the three instruments was compared using the Speech Intelligibility Rating (SIR) test. The relationship between functional gain and coupler gain was compared for the ITE and the higher rated OTE instruments. The difference in input received at the microphone locations of the two types of hearing aids was measured for 10 different subjects and compared to the functional gain data. It was concluded that (a) for persons with mild to moderately severe hearing losses, appropriately adjusted custom ITE fittings typically yield speech intelligibility that is equal to the better OTE fitting identified in a comparative evaluation; and (b) gain prescriptions for ITE hearing aids should be adjusted to account for the high-frequency emphasis associated with in-the-concha microphone placement.

Download Full-text

Dysarthric Sentence Intelligibility

Journal of Speech Language and Hearing Research ◽

10.1044/jslhr.4106.1282 ◽

1998 ◽

Vol 41 (6) ◽

pp. 1282-1293 ◽

Cited By ~ 32

Author(s):

Jane Mertz Garcia ◽

Paul A. Dagenais

Keyword(s):

Speech Intelligibility ◽

Presentation Mode ◽

Contextual Influences ◽

Motor Functioning ◽

Video Presentation ◽

Independent Information ◽

Communicative Process ◽

Presentation Formats ◽

Sentence Intelligibility ◽

Audio Video

This study examined changes in the sentence intelligibility scores of speakers with dysarthria in association with different signal-independent factors (contextual influences). This investigation focused on the presence or absence of iconic gestures while speaking sentences with low or high semantic predictiveness. The speakers were 4 individuals with dysarthria, who varied from one another in terms of their level of speech intelligibility impairment, gestural abilities, and overall level of motor functioning. Ninety-six inexperienced listeners (24 assigned to each speaker) orthographically transcribed 16 test sentences presented in an audio + video or audio-only format. The sentences had either low or high semantic predictiveness and were spoken by each speaker with and without the corresponding gestures. The effects of signal-independent factors (presence or absence of iconic gestures, low or high semantic predictiveness, and audio + video or audio-only presentation formats) were analyzed for individual speakers. Not all signal-independent information benefited speakers similarly. Results indicated that use of gestures and high semantic predictiveness improved sentence intelligibility for 2 speakers. The other 2 speakers benefited from high predictive messages. The audio + video presentation mode enhanced listener understanding for all speakers, although there were interactions related to specific speaking situations. Overall, the contributions of relevant signal-independent information were greater for the speakers with more severely impaired intelligibility. The results are discussed in terms of understanding the contribution of signal-independent factors to the communicative process.

Download Full-text

Translating Principles of Speech Science to Clinical Practice: Current and Future Trends in Craniofacial Disorders

Perspectives on Speech Science and Orofacial Disorders ◽

10.1044/ssod18.1.31 ◽

2008 ◽

Vol 18 (1) ◽

pp. 31-40 ◽

Cited By ~ 2

Author(s):

David J. Zajac

Keyword(s):

Clinical Practice ◽

Speech Intelligibility ◽

Current Practice ◽

Computer Assisted ◽

Behavioral Management ◽

Single Word ◽

Future Directions ◽

Speech Science ◽

The Impact ◽

Opinion Article

Abstract The purpose of this opinion article is to review the impact of the principles and technology of speech science on clinical practice in the area of craniofacial disorders. Current practice relative to (a) speech aerodynamic assessment, (b) computer-assisted single-word speech intelligibility testing, and (c) behavioral management of hypernasal resonance are reviewed. Future directions and/or refinement of each area are also identified. It is suggested that both challenging and rewarding times are in store for clinical researchers in craniofacial disorders.

Download Full-text