The Effects of Digital Quantization Error on Speech Intelligibility and Perceived Speech Quality

Richard W. Harris; Robert H. Brey; Yuan-Shu Chang; B. Diann Soria; Laurence M. Hilton

doi:10.1044/jshr.3401.189

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3401.189 ◽

1991 ◽

Vol 34 (1) ◽

pp. 189-196 ◽

Cited By ~ 3

Author(s):

Richard W. Harris ◽

Robert H. Brey ◽

Yuan-Shu Chang ◽

B. Diann Soria ◽

Laurence M. Hilton

Keyword(s):

Speech Intelligibility ◽

Perceived Quality ◽

Speech Quality ◽

Quantization Error ◽

Floating Point ◽

Point Data

The effects of digital quantization error upon speech intelligibility and perceived speech quality, for normally hearing subjects, were investigated for digitized speech processed to simulate 6-, 8-, 10-, 12-, 14-, and 16-bit integer conversion and 2-, 3-, 4-, 5-, 6-, and 7-bit floating-point conversion. For the integer data, there were no significant differences in speech intelligibility for 8- to 16-bit conversion. Only 6-bit integer conversion at 55 dB SPL resulted in a significant degradation in speech intelligibility. For the floating-point data, there were no significant differences in speech intelligibility for 2- to 7-bit floating-point conversion. However, results of the perceived quality experiment appeared to be more sensitive to differences among the various conditions. Speech processed using 12-, 14-, and 16-bit integer conversion was judged to be superior to speech processed using the 6-, 8-, and 10-bit integer conditions. Speech processed using 5-, 6-, and 7-bit floating-point conversion was judged to be superior to speech processed using 2-, 3-, and 4-bit floating-point conversion.
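The abstract does not say how the integer conversions were simulated, so the following is only a minimal sketch of one plausible approach: rounding each sample to the nearest level of a signed N-bit grid and measuring the resulting signal-to-quantization-noise ratio. The function names, the 440 Hz test tone, and the 16 kHz sampling rate are illustrative assumptions, not details from the study.

```python
import math

def quantize_uniform(samples, bits, full_scale=1.0):
    """Round each sample to the nearest level of a signed `bits`-bit integer grid."""
    levels = 2 ** (bits - 1)              # e.g. 32768 codes per polarity for 16 bits
    step = full_scale / levels            # size of one quantization step
    out = []
    for x in samples:
        code = round(x / step)
        code = max(-levels, min(levels - 1, code))  # clip to the signed integer range
        out.append(code * step)
    return out

def sqnr_db(clean, quantized):
    """Signal-to-quantization-noise ratio in dB."""
    signal = sum(x * x for x in clean)
    noise = sum((x - q) ** 2 for x, q in zip(clean, quantized))
    return float("inf") if noise == 0 else 10 * math.log10(signal / noise)

# Hypothetical test signal: one second of a 440 Hz tone sampled at 16 kHz.
tone = [0.9 * math.sin(2 * math.pi * 440 * n / 16000) for n in range(16000)]

# The six integer word lengths named in the abstract.
for bits in (6, 8, 10, 12, 14, 16):
    print(bits, "bits:", round(sqnr_db(tone, quantize_uniform(tone, bits)), 1), "dB")
```

Each additional bit halves the step size, so the ratio rises by roughly 6 dB per bit. A figure like this describes the signal only; the listening results above are what indicate which word lengths listeners could actually tell apart.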

A Robust Dual-Microphone Generalized Sidelobe Canceller Using a Bone-Conduction Sensor for Speech Enhancement

Sensors ◽

10.3390/s21051878 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1878

Author(s):

Yi Zhou ◽

Haiping Wang ◽

Yijing Chu ◽

Hongqing Liu

Keyword(s):

Speech Enhancement ◽

Speech Intelligibility ◽

Bone Conduction ◽

Speech Quality ◽

Generalized Sidelobe Canceller ◽

Spatially Distributed ◽

Interference Signals ◽

Adaptive Noise ◽

Adaptive Noise Canceller ◽

Sidelobe Canceller

The use of multiple spatially distributed microphones makes it possible to combine spatial filtering with conventional temporal filtering, which better rejects interference signals and improves overall speech quality. In this paper, we propose a novel dual-microphone generalized sidelobe canceller (GSC) algorithm assisted by a bone-conduction (BC) sensor for speech enhancement, named the BC-assisted GSC (BCA-GSC) algorithm. The BC sensor is relatively insensitive to ambient noise compared with a conventional air-conduction (AC) microphone. Hence, BC speech can be analyzed to produce very accurate voice activity detection (VAD), even in a high-noise environment. The proposed algorithm incorporates the VAD information obtained from the BC speech into the adaptive blocking matrix (ABM) and adaptive noise canceller (ANC) of the GSC. By using VAD to control the ABM and combining VAD with the signal-to-interference ratio (SIR) to control the ANC, the proposed method can suppress interference and significantly improve the overall performance of the GSC. Experiments verify that the proposed GSC system not only improves speech quality remarkably but also boosts speech intelligibility.
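The idea of letting a VAD decision control an adaptive noise canceller can be sketched as follows. This is not the BCA-GSC algorithm itself. It is a simplified single-reference NLMS canceller that, by assumption, updates its weights only while the VAD flag reports no speech. It omits the blocking matrix and the SIR-based control mentioned in the abstract, and all names, the toy signals, and the two-tap leakage path are hypothetical.

```python
import math
import random

def vad_gated_nlms(primary, reference, speech_active, taps=8, mu=0.5, eps=1e-8):
    """Subtract an adaptive noise estimate from `primary` using `reference`.

    The weights adapt only on samples where speech_active[n] is False, so the
    filter learns the noise path without trying to cancel the talker.
    """
    w = [0.0] * taps      # adaptive filter weights
    buf = [0.0] * taps    # most recent reference samples, newest first
    out = []
    for d, x, active in zip(primary, reference, speech_active):
        buf = [x] + buf[:-1]
        y = sum(wi * bi for wi, bi in zip(w, buf))   # current noise estimate
        e = d - y                                    # enhanced output sample
        if not active:                               # VAD gate: noise-only frames
            norm = eps + sum(b * b for b in buf)
            w = [wi + mu * e * bi / norm for wi, bi in zip(w, buf)]
        out.append(e)
    return out

random.seed(0)
n_samples = 4000
half = n_samples // 2
noise_ref = [random.gauss(0.0, 1.0) for _ in range(n_samples)]
# Hypothetical path by which the noise leaks into the primary channel.
leaked = [0.6 * noise_ref[n] + (0.3 * noise_ref[n - 1] if n > 0 else 0.0)
          for n in range(n_samples)]
# Toy "speech": a tone that is present only in the second half.
speech = [math.sin(2 * math.pi * 0.05 * n) if n >= half else 0.0
          for n in range(n_samples)]
primary = [s + v for s, v in zip(speech, leaked)]
vad = [n >= half for n in range(n_samples)]   # idealized VAD flags

enhanced = vad_gated_nlms(primary, noise_ref, vad)
```

In this toy setup the filter converges during the noise-only first half and is then frozen, so the tone in the second half passes through with the leaked noise removed. A VAD that wrongly reported silence during speech would let the filter adapt on the talker and start cancelling it, which is why an accurate VAD matters for this kind of control.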

JPEG2000 Compliant Lossless Coding of Floating Point Data

Data Compression Conference ◽

10.1109/dcc.2005.49 ◽

2005 ◽

Cited By ~ 3

Author(s):

B. Usevitch

Keyword(s):

Floating Point ◽

Lossless Coding ◽

Point Data

A Versatile Compression Method for Floating-Point Data Stream

2013 Fourth International Conference on Networking and Distributed Computing ◽

10.1109/icndc.2013.32 ◽

2013 ◽

Cited By ~ 2

Author(s):

Songbin Liu ◽

Xiaomeng Huang ◽

Yufang Ni ◽

Haohuan Fu ◽

Guangwen Yang

Keyword(s):

Data Stream ◽

Floating Point ◽

Compression Method ◽

Point Data

Subjective speech quality and speech intelligibility evaluation of single-channel dereverberation algorithms

2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC) ◽

10.1109/iwaenc.2014.6954313 ◽

2014 ◽

Cited By ~ 11

Author(s):

Anna Warzybok ◽

Ina Kodrasi ◽

Jan Ole Jungmann ◽

Emanuel Habets ◽

Timo Gerkmann ◽

...

Keyword(s):

Speech Intelligibility ◽

Single Channel ◽

Speech Quality

Quantifying the Relation Between Speech Quality and Speech Intelligibility

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3803.714 ◽

1995 ◽

Vol 38 (3) ◽

pp. 714-725 ◽

Cited By ~ 45

Author(s):

Jill E. Preminger ◽

Dianne J. Van Tasell

Keyword(s):

Speech Intelligibility ◽

Normal Hearing ◽

Speech Quality ◽

Category Rating ◽

Single Dimension ◽

Quality Dimensions ◽

Highly Correlated ◽

Quality Measurements

The purpose of the present research was to examine the relation between speech quality and speech intelligibility. Speech quality measurements were made using continuous discourse and a category rating procedure for the following dimensions: intelligibility, pleasantness, loudness, effort, and total impression. Measurements were made using a group of listeners with normal hearing for a set of stimulus conditions in which intelligibility varied, and for a set of stimulus conditions in which intelligibility was held constant near 100%. When ratings were made for a set of stimulus conditions in which intelligibility was allowed to vary, (a) intersubject reliability was high (i.e., different listeners interpreted the dimensions in a similar manner); and (b) the speech quality dimensions of intelligibility, effort, and loudness were indistinguishable. When ratings were made for a set of stimulus conditions in which intelligibility was held constant, (a) intersubject reliability was reduced, indicating that different listeners interpreted the dimensions in different ways; (b) most listeners rated each dimension differently, indicating that the dimensions were unique; and (c) across listeners, no single dimension was highly correlated with total impression. These results can be used to examine the relation between speech quality and speech intelligibility.

Generating Test Data Using Symbolic Execution: Challenges with Floating Point Data Types

Communications in Computer and Information Science - Information and Software Technologies ◽

10.1007/978-3-642-33308-8_22 ◽

2012 ◽

pp. 267-274

Author(s):

Justinas Prelgauskas ◽

Eduardas Bareisa

Keyword(s):

Test Data ◽

Symbolic Execution ◽

Floating Point ◽

Data Types ◽

Point Data

Read-Write Operation on Floating Point Data Program Design Between MCU and KingView

Lecture Notes in Electrical Engineering - Proceedings of the 9th International Symposium on Linear Drives for Industry Applications, Volume 3 ◽

10.1007/978-3-642-40633-1_89 ◽

2013 ◽

pp. 717-723

Author(s):

Congcong Fang ◽

Xiaojing Yang

Keyword(s):

Program Design ◽

Floating Point ◽

Point Data ◽

Data Program

Effect of Slow-Acting Wide Dynamic Range Compression on Measures of Intelligibility and Ratings of Speech Quality in Simulated-Loss Listeners

Journal of Speech Language and Hearing Research ◽

10.1044/1092-4388(2005/048) ◽

2005 ◽

Vol 48 (3) ◽

pp. 702-714 ◽

Cited By ~ 8

Author(s):

Peninah S. Rosengard ◽

Karen L. Payton ◽

Louis D. Braida

Keyword(s):

Hearing Loss ◽

Speech Intelligibility ◽

Dynamic Range ◽

Speech Quality ◽

Subjective Ratings ◽

Subjective Measures ◽

Wide Dynamic Range ◽

Dynamic Range Compression ◽

Amplitude Compression ◽

Moderate Hearing Loss

The purpose of this study was twofold: (a) to determine the extent to which 4-channel, slow-acting wide dynamic range amplitude compression (WDRC) can counteract the perceptual effects of reduced auditory dynamic range and (b) to examine the relation between objective measures of speech intelligibility and categorical ratings of speech quality for sentences processed with slow-acting WDRC. Multiband expansion was used to simulate the effects of elevated thresholds and loudness recruitment in normal hearing listeners. While some previous studies have shown that WDRC can improve both speech intelligibility and quality, others have found no benefit. The current experiment shows that moderate amounts of compression can provide a small but significant improvement in speech intelligibility, relative to linear amplification, for simulated-loss listeners with small dynamic ranges (i.e., flat, moderate hearing loss). This benefit was found for speech at conversational levels, both in quiet and in a background of babble. Simulated-loss listeners with large dynamic ranges (i.e., sloping, mild-to-moderate hearing loss) did not show any improvement. Comparison of speech intelligibility scores and subjective ratings of intelligibility showed that listeners with simulated hearing loss could accurately judge the overall intelligibility of speech. However, in all listeners, ratings of pleasantness decreased as the compression ratio increased. These findings suggest that subjective measures of speech quality should be used in conjunction with either objective or subjective measures of speech intelligibility to ensure that participant-selected hearing aid parameters optimize both comfort and intelligibility.
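The role of the compression ratio can be illustrated with a static input-output rule for a single channel. This is a minimal sketch under stated assumptions: the 45 dB threshold and the example ratios are hypothetical values chosen for illustration, and the study's 4-channel, slow-acting processing, with its attack and release behavior, is not modeled.

```python
def compressed_level_db(input_db, threshold_db=45.0, ratio=2.0):
    """Static compression rule: linear below threshold, compressed above it.

    Above the threshold, every `ratio` dB of input change produces 1 dB of
    output change. Threshold and ratio defaults are illustrative assumptions.
    """
    if input_db <= threshold_db:
        return input_db
    return threshold_db + (input_db - threshold_db) / ratio

# A 40 dB span of input levels (45 to 85 dB) mapped through several ratios.
for ratio in (1.0, 2.0, 3.0):
    low = compressed_level_db(45.0, ratio=ratio)
    high = compressed_level_db(85.0, ratio=ratio)
    print(f"ratio {ratio}: output span {high - low:.1f} dB")
```

A higher ratio squeezes the same range of input levels into a narrower output range, which is how compression can fit speech into a reduced dynamic range. The abstract's finding that pleasantness ratings fell as the ratio rose is a reminder that this squeezing has a perceptual cost.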

Lossless Compression of Double-Precision Floating-Point Data for Numerical Simulations: Highly Parallelizable Algorithms for GPU Computing

IEICE Transactions on Information and Systems ◽

10.1587/transinf.e95.d.2778 ◽

2012 ◽

Vol E95.D (12) ◽

pp. 2778-2786

Author(s):

Mamoru OHARA ◽

Takashi YAMAGUCHI

Keyword(s):

Numerical Simulations ◽

Gpu Computing ◽

Lossless Compression ◽

Floating Point ◽

Double Precision ◽

Point Data

A Methodology for the Evaluation of Real-Time Speech Digitization

Proceedings of the Human Factors Society Annual Meeting ◽

10.1177/154193128302700130 ◽

1983 ◽

Vol 27 (1) ◽

pp. 104-107 ◽

Cited By ~ 1

Author(s):

Thomas R. Edman ◽

Stephen V. Metz

Keyword(s):

Human Factors ◽

Experimental Method ◽

Real Time ◽

Speech Intelligibility ◽

Speech Quality ◽

Design Issues ◽

Commercial Speech ◽

Store And Forward

Real-time speech digitizing technologies underlie such modern communications products as voice store-and-forward systems and digital PBXs. Among the human factors design issues associated with this technology, three of particular importance can be identified: (i) speaker identifiability, (ii) acceptability of speech quality, and (iii) speech intelligibility. An experimental method for addressing issues of identifiability and intelligibility was developed and used to compare a commercial speech digitizing device with a standard toll-quality telephone channel. It was found that the identifiability and acceptability of the telephone channel were slightly superior to those of the digitized speech. Additionally, results on a Modified Rhyme Test (MRT) showed intelligibility scores somewhat below optimal.
