scholarly journals When spectral smearing can increase speech intelligibility

2013 ◽  
Author(s):  
James A. Bashford ◽  
Richard M. Warren ◽  
Peter W. Lenz
2013 ◽  
Vol 133 (5) ◽  
pp. 3389-3389
Author(s):  
James A. Bashford ◽  
Richard M. Warren ◽  
Peter W. Lenz

2021 ◽  
Vol 25 ◽  
pp. 233121652110301
Author(s):  
Waldo Nogueira ◽  
Nawal El Boghdady ◽  
Florian Langner ◽  
Etienne Gaudrain ◽  
Deniz Başkent

Speech intelligibility in multitalker settings is challenging for most cochlear implant (CI) users. One possibility for this limitation is the suboptimal representation of vocal cues in implant processing, such as the fundamental frequency (F0), and the vocal tract length (VTL). Previous studies suggested that while F0 perception depends on spectrotemporal cues, VTL perception relies largely on spectral cues. To investigate how spectral smearing in CIs affects vocal cue perception in speech-on-speech (SoS) settings, adjacent electrodes were simultaneously stimulated using current steering in 12 Advanced Bionics users to simulate channel interaction. In current steering, two adjacent electrodes are simultaneously stimulated forming a channel of parallel stimulation. Three such stimulation patterns were used: Sequential (one current steering channel), Paired (two channels), and Triplet stimulation (three channels). F0 and VTL just-noticeable differences (JNDs; Task 1), in addition to SoS intelligibility (Task 2) and comprehension (Task 3), were measured for each stimulation strategy. In Tasks 2 and 3, four maskers were used: the same female talker, a male voice obtained by manipulating both F0 and VTL (F0+VTL) of the original female speaker, a voice where only F0 was manipulated, and a voice where only VTL was manipulated. JNDs were measured relative to the original voice for the F0, VTL, and F0+VTL manipulations. When spectral smearing was increased from Sequential to Triplet, a significant deterioration in performance was observed for Tasks 1 and 2, with no differences between Sequential and Paired stimulation. Data from Task 3 were inconclusive. These results imply that CI users may tolerate certain amounts of channel interaction without significant reduction in performance on tasks relying on voice perception. This points to possibilities for using parallel stimulation in CIs for reducing power consumption.


2010 ◽  
Vol 261 (1-2) ◽  
pp. 63-66 ◽  
Author(s):  
Ewen N. MacDonald ◽  
M. Kathleen Pichora-Fuller ◽  
Bruce A. Schneider

2020 ◽  
Author(s):  
Waldo Nogueira ◽  
Nawal El Boghdady ◽  
Florian Langner ◽  
Etienne Gaudrain ◽  
Deniz Baskent

Speech intelligibility in multi-talker settings is challenging for most CI users. One possibility for this limitation is the suboptimal representation of vocal cues in implant processing, such as the fundamental frequency (F0), and the vocal tract length (VTL). Previous studies suggested that while F0 perception depends on both spectral and temporal cues, VTL perception relies largely on spectral cues. To investigate how spectral smearing in CIs affects vocal cue perception in speech-on-speech (SoS) settings, parallel channels were simultaneously stimulated in 14 Advanced Bionics users to simulate channel interaction. Three such patterns were created: Sequential (stimulation of 2 simultaneous electrodes), Paired (4 electrodes), and Triplet stimulation (6 electrodes). F0 and VTL just-noticeable differences (JNDs; task 1), in addition to SoS intelligibility (task 2) and comprehension (task 3) were measured for each stimulation strategy. In tasks 2 and 3, four maskers were used: the same female talker, a male voice obtained by manipulating both F0 and VTL (F0+VTL) of the original female speaker, a voice where only F0 was manipulated, and a voice where only VTL was manipulated. JNDs were measured relative to the original voice for the F0, VTL, and F0+VTL manipulations. When spectral smearing was increased, a significant deterioration in performance was observed for all tasks, with no differences between Sequential and Paired stimulation. These results imply that CI users may tolerate certain amounts of channel interaction without significant reduction in performance on tasks relying on voice perception. This points to possibilities for utilizing parallel stimulation in CIs for reducing power consumption.


1996 ◽  
Vol 5 (1) ◽  
pp. 23-32 ◽  
Author(s):  
Chris Halpin ◽  
Barbara Herrmann ◽  
Margaret Whearty

The family described in this article provides an unusual opportunity to relate findings from genetic, histological, electrophysiological, psychophysical, and rehabilitative investigation. Although the total number evaluated is large (49), the known, living affected population is smaller (14), and these are spread from age 20 to age 59. As a result, the findings described above are those of a large-scale case study. Clearly, more data will be available through longitudinal study of the individuals documented in the course of this investigation but, given the slow nature of the progression in this disease, such studies will be undertaken after an interval of several years. The general picture presented to the audiologist who must rehabilitate these cases is that of a progressive cochlear degeneration that affects only thresholds at first, and then rapidly diminishes speech intelligibility. The expected result is that, after normal language development, the patient may accept hearing aids well, encouraged by the support of the family. Performance and satisfaction with the hearing aids is good, until the onset of the speech intelligibility loss, at which time the patient will encounter serious difficulties and may reject hearing aids as unhelpful. As the histological and electrophysiological results indicate, however, the eighth nerve remains viable, especially in the younger affected members, and success with cochlear implantation may be expected. Audiologic counseling efforts are aided by the presence of role models and support from the other affected members of the family. Speech-language pathology services were not considered important by the members of this family since their speech production developed normally and has remained very good. Self-correction of speech was supported by hearing aids and cochlear implants (Case 5’s speech production was documented in Perkell, Lane, Svirsky, & Webster, 1992). These patients received genetic counseling and, due to the high penetrance of the disease, exhibited serious concerns regarding future generations and the hope of a cure.


1986 ◽  
Vol 51 (4) ◽  
pp. 362-369 ◽  
Author(s):  
Donna M. Risberg ◽  
Robyn M. Cox

A custom in-the-ear (ITE) hearing aid fitting was compared to two over-the-ear (OTE) hearing aid fittings for each of 9 subjects with mild to moderately severe hearing losses. Speech intelligibility via the three instruments was compared using the Speech Intelligibility Rating (SIR) test. The relationship between functional gain and coupler gain was compared for the ITE and the higher rated OTE instruments. The difference in input received at the microphone locations of the two types of hearing aids was measured for 10 different subjects and compared to the functional gain data. It was concluded that (a) for persons with mild to moderately severe hearing losses, appropriately adjusted custom ITE fittings typically yield speech intelligibility that is equal to the better OTE fitting identified in a comparative evaluation; and (b) gain prescriptions for ITE hearing aids should be adjusted to account for the high-frequency emphasis associated with in-the-concha microphone placement.


1998 ◽  
Vol 41 (6) ◽  
pp. 1282-1293 ◽  
Author(s):  
Jane Mertz Garcia ◽  
Paul A. Dagenais

This study examined changes in the sentence intelligibility scores of speakers with dysarthria in association with different signal-independent factors (contextual influences). This investigation focused on the presence or absence of iconic gestures while speaking sentences with low or high semantic predictiveness. The speakers were 4 individuals with dysarthria, who varied from one another in terms of their level of speech intelligibility impairment, gestural abilities, and overall level of motor functioning. Ninety-six inexperienced listeners (24 assigned to each speaker) orthographically transcribed 16 test sentences presented in an audio + video or audio-only format. The sentences had either low or high semantic predictiveness and were spoken by each speaker with and without the corresponding gestures. The effects of signal-independent factors (presence or absence of iconic gestures, low or high semantic predictiveness, and audio + video or audio-only presentation formats) were analyzed for individual speakers. Not all signal-independent information benefited speakers similarly. Results indicated that use of gestures and high semantic predictiveness improved sentence intelligibility for 2 speakers. The other 2 speakers benefited from high predictive messages. The audio + video presentation mode enhanced listener understanding for all speakers, although there were interactions related to specific speaking situations. Overall, the contributions of relevant signal-independent information were greater for the speakers with more severely impaired intelligibility. The results are discussed in terms of understanding the contribution of signal-independent factors to the communicative process.


2008 ◽  
Vol 18 (1) ◽  
pp. 31-40 ◽  
Author(s):  
David J. Zajac

Abstract The purpose of this opinion article is to review the impact of the principles and technology of speech science on clinical practice in the area of craniofacial disorders. Current practice relative to (a) speech aerodynamic assessment, (b) computer-assisted single-word speech intelligibility testing, and (c) behavioral management of hypernasal resonance are reviewed. Future directions and/or refinement of each area are also identified. It is suggested that both challenging and rewarding times are in store for clinical researchers in craniofacial disorders.


Sign in / Sign up

Export Citation Format

Share Document