Speechreading Supplemented by Single-Channel and Multichannel Tactile Displays of Voice Fundamental Frequency

1995 ◽  
Vol 38 (3) ◽  
pp. 690-705 ◽  
Author(s):  
Robin S. Waldstein ◽  
Arthur Boothroyd

The benefits of two tactile codes of voice fundamental frequency (F o ) were evaluated as supplements to the speechreading of sentences in two short-term training studies, each using 12 adults with normal hearing. In Experiment 1, a multichannel spatiotemporal display of F o , known as Portapitch, was used to stimulate the index finger. In an attempt to improve on past performance with this display, the coding scheme was modified to better cover the F o range of the talker in the training materials. For Experiment 2, to engage kinesthetic/proprioceptive pathways, a novel single-channel positional display was built, in which F o was coded as the vertical displacement of a small finger-rest. Input to both displays consisted of synthesized replicas of the F o contours of the sentences, prepared and perfected off-line. Training with the two tactile F o displays included auditory presentation of the synthesized F o contours in conjunction with the tactile patterns on alternate trials. Speechreading enhancement by the two tactile F o displays was compared to the enhancement provided when auditory F o information was available in conjunction with the tactile patterns, by auditory presentation of a sinusoidal indication of the presence or absence of voicing, and by a single-channel tactile display of the speech waveform presented to the index finger. Despite the modified coding strategy, the multichannel Portapitch provided a mean tactile speechreading enhancement of 7 percentage points, which was no greater than that found in previous studies. The novel positional F o display provided only a 4 percentage point enhancement. Neither F o display was better than the simple single-channel tactile transform of the full speech waveform, which gave a 7 percentage point enhancement effect. Auditory speechreading enhancement effects were 17 percentage points with the voicing indicator and approximately 35 percentage points when the auditory F o contour was provided in conjunction with the tactile displays. The findings are consistent with the hypothesis that subjects were not taking full advantage of the F o variation information available in the outputs of the two experimental tactile displays.

1992 ◽  
Vol 35 (5) ◽  
pp. 1160-1168 ◽  
Author(s):  
Theresa Hnath-Chisolm ◽  
Arthur Boothroyd

Recognition of words in sentences of known topic was measured in normally hearing adults via speechreading alone and speechreading supplemented with auditory presentation of signals intended to convey variations of voice fundamental frequency (F o ) over time. Three signals were used: (a) the low-pass filtered output of an electroglottograph (unprocessed F o ), (b) a constant amplitude sine wave whose instantaneous frequency was intended to equal that of F o (processed F o ), and (c) the same sine wave restricted to a small number of discrete frequency steps (quantized F o ). As the number of steps in the quantized F o contours increased from 1 to 12, the speechreading enhancement effect increased. The quantized F o contour with 12 steps was as effective as the processed F o contour (without quantization), but this processed contour was significantly less effective than the unprocessed electroglottograph signal. The results show that the auditory F o speechreading enhancement effect is sensitive to the errors introduced by the F o extraction and regeneration process used in this study. It is also sensitive to the quantization of F o contours into less than 12 steps. Whether more than 12 steps are required for the full enhancement effect remains to be determined.


1987 ◽  
Vol 96 (1_suppl) ◽  
pp. 74-76
Author(s):  
J. R. Walliker ◽  
A. J. Fourcin

We have developed a family of single-channel signal-processing aids for the profoundly and totally deaf. Common to them all are the analysis of speech into the components most important to the deaf lipreader; the synthesis of stimuli which make the best use of the patient's sensory abilities; and facilities to ensure accurate matching of the aid to the patient. The totally deaf are electrically stimulated by electrodes on the promontory or on the round window of the cochlea using charge-balanced controlled current square waves automatically adjusted to be at a comfortable level. Many potential candidates for electrocochlear stimulation have significant low frequency residual hearing, but do not find conventional hearing aids to be useful. We have found that they can often make very effective use of the voice fundamental frequency presented as an acoustic sinusoid. Our approach to these patients avoids the need for implant surgery but preserves that option should total loss of hearing occur in the future. Both electrocochlear and acoustic methods of signal presentation are implemented with similar hardware. The speech signal from a microphone or other source is analyzed by a voice fundamental frequency extractor and a voiceless sound detector. Their outputs are processed by a single chip microcomputer that synthesizes the output waveform. In both devices the aid is tailored to the patient using a desktop computer that stores amplitude-frequency characteristics and frequency mapping tables into a read-only memory.


1987 ◽  
Vol 82 (S1) ◽  
pp. S22-S22
Author(s):  
Lynne E. Bernstein ◽  
Silvio P. Eberhardt ◽  
Marilyn E. Demorest

1988 ◽  
Vol 9 (6) ◽  
pp. 342-350 ◽  
Author(s):  
Eddy Young ◽  
Arthur Boothroyd ◽  
Cecil Redmond

1990 ◽  
Vol 88 (3) ◽  
pp. 1274-1285 ◽  
Author(s):  
Silvio P. Eberhardt ◽  
Lynne E. Bernstein ◽  
Marilyn E. Demorest ◽  
Moise H. Goldstein

1980 ◽  
Vol 23 (2) ◽  
pp. 274-283 ◽  
Author(s):  
David Sorensen ◽  
Yoshiyuki Horii ◽  
Rebecca Leonard

Fundamental frequency perturbation (jitter) during sustained vowel phonations of speakers under topical anesthesia of the larynx was investigated for five adult males. The results showed that the average jitter was significantly greater under the anesthesia than normal conditions, and that the jitter difference between the two conditions was more prominent at high frequency phonations. Implications of these data for tactile and proprioceptive feedback in phonatory frequency control are discussed.


1971 ◽  
Vol 14 (3) ◽  
pp. 652-658 ◽  
Author(s):  
Bernd Weinberg ◽  
Jan Westerhouse

An intensive study of a normal-speaking subject, proficient in the use of buccal speech, was conducted. With respect to voice fundamental frequency variability, phonation time, and speaking rate his buccal speech characteristics compared favorably with those reported for excellent esophageal speakers. However, the reduced intelligibility of his buccal speech on rhyme-test words, the high average fundamental frequency of his buccal voice, and his conspicuous buccal gestures during speech represent distinct vocal liabilities.


Sign in / Sign up

Export Citation Format

Share Document