scholarly journals Residual Echo Suppression Considering Harmonic Distortion and Temporal Correlation

2020 ◽  
Vol 10 (15) ◽  
pp. 5291
Author(s):  
Hyungchan Song ◽  
Jong Won Shin

In acoustic echo cancellation, a certain level of residual echo resides in the output of the linear echo canceller because of the nonlinearity of the power amplifier, loudspeaker, and acoustic transfer function in addition to the estimation error of the linear echo canceller. The residual echo in the current frame is correlated not only to the linear echo estimates for the harmonically-related frequency bins in the current frame, but also with linear echo estimates, residual echo estimates, and microphone signals in adjacent frames. In this paper, we propose a residual echo suppression scheme considering harmonic distortion and temporal correlation in the short-time Fourier transform domain. To exploit residual echo estimates and microphone signals in past frames without the adverse effect of the near-end speech and noise, we adopt a double-talk detector which is tuned to have a low false rejection rate of double-talks. Experimental results show that the proposed method outperformed the conventional approach in terms of the echo return loss enhancement during single-talk periods and the perceptual evaluation of speech quality scores during double-talk periods.

Author(s):  
Futing Luo ◽  
Mingliang Zhou ◽  
Bing Fang

In this paper, we propose a strong spatio-temporal mechanism with correlation filters to solve multi-modality tracking tasks. First, we use the features of the previous four frames as spatio-temporal features, then aggregate the spatio-temporal features into the filters learning and positioning of the adjacent frame. Second, we enhance the temporal and spatial characteristics of the current frame filter by learning the previous four frame filters and spatial penalty. From the experimental results on the GTOT, VOT-TIR2019 and RGBT234 datasets, our strong spatio-temporal correlation filters has achieved excellent performance.


Author(s):  
Jiangping Nan

In order to improve the algorithm of time-varying parameters and unknown parameters adaptability, avoid assuming the approximate part deviation caused by the algorithm, this paper proposes a adaptive control algorithm, the algorithm based on lyapunov direct method to predict the output voltage in the process of estimating each parameter in a reasonable manner to parameter estimation error with the actual output current and current automatic adjustment. The adaptive control of current tracking is realized and the error caused by assuming voltage or current and neglecting line resistance is avoided in the predictive current control algorithm. The simulation results show that the tracking current can track the target current with high precision from t = 0 in the presence of random noise, and the power factor is close to 1, showing a good steady-state performance. Frequency domain waveform, the calculated harmonic distortion rate is 2.2418%, waveform quality is good and each harmonic amplitude is small. Conclusion: adaptive control algorithm can quickly and accurately realize current tracking and greatly suppress the noise.


2013 ◽  
Vol 303-306 ◽  
pp. 2042-2045
Author(s):  
Ya Ting Wu ◽  
Y.Y. Zhao ◽  
Fei Yu

A low-complexity echo canceller integrated with vocoder is proposed in this paper to speed up the convergence process. By making full use of the linear prediction parameters retrieved from decoder and the voice active detection feature of the vocoder, the new echo canceller avoids the need to calculate decorrelation filter coefficients and prewhiten the received signal separately. Simulation results show performance improvement of the proposed algorithm in terms of convergence rate and echo return loss enhancement.


2021 ◽  
Vol 11 (13) ◽  
pp. 6208
Author(s):  
Yicheng Luo ◽  
Yajing Xu ◽  
Si Li ◽  
Qifeng Qian ◽  
Bo Xiao

Convolutional neural networks have achieved great success in analyzing potential features inside tropical cyclones (TCs) using satellite images for intensity estimation. However, due to the high similarity of visual features in TC images, it is still a challenge to learn the accurate mapping between TC images and numerical intensity. Existing works mainly focus on the visual features of a single TC, ignoring the impact of intensity continuity and time evolution among TCs on decision making. Therefore, we propose a DR-transformer framework for temporal TC intensity estimation. Inside DR-transformers, a novel DR-extractor can extract Distance-consistency(DC) and Rotation-invariance (RI) features between TC images, and therefore can better learn the contours, structures, and other visual features of each TC image. DC features can reduce the estimation error between adjacent intensities, and RI features can eliminate feature deviation caused by shooting angles and TC rotation. Additionally, a transformer with a DR-extractor as the backbone is applied to aggregate the temporal correlation in a series of TC images, which can learn the evolution from intensity to the visual features of TC. Experiments show that the final result, an RMSE of 7.76 knots, outperforms the baseline, and is better than any previously reported method trained on the TCIR dataset.


Energies ◽  
2018 ◽  
Vol 11 (10) ◽  
pp. 2515 ◽  
Author(s):  
Ya Zhang ◽  
Maurice Roes ◽  
Marcel Hendrix ◽  
Jorge Duarte

A single-phase grid-connected converter is considered in this paper in the presence of harmonic problems introduced non-linear loads. In order to compensate the harmonics caused by the loads, a local voltage support scheme is proposed. This is an added feature because its implementation is in parallel with a conventional current control method. Distinctively, the measurements of the grid or load current are not needed since the scheme is based on only local measurements. On top of a fundamental part for desired power injection, the converter output current comprises a harmonic part for compensation. Thus, the grid current harmonic distortion is minimized and the enhancement of the local voltage quality is achieved. A comprehensive model analysis indicates that the proposed strategy can help to attenuate harmonics of the local voltage without compromising on the quality of the fundamental current injection. Experimental results validate the effectiveness of the proposed control scheme. Moreover, the impact of grid frequency estimation error on the control strategy’s performance is quantified theoretically and experimentally.


2020 ◽  
Vol 63 (4) ◽  
pp. 1018-1032
Author(s):  
Chia-Hsin Wu ◽  
Roger W. Chan

Purpose Semi-occluded vocal tract (SOVT) exercises with tubes or straws have been widely used for a variety of voice disorders. Yet, the effects of longer periods of SOVT exercises (lasting for weeks) on the aging voice are not well understood. This study investigated the effects of a 6-week straw phonation in water (SPW) exercise program. Method Thirty-seven elderly subjects with self-perceived voice problems were assigned into two groups: (a) SPW exercises with six weekly sessions and home practice (experimental group) and (b) vocal hygiene education (control group). Before and after intervention (2 weeks after the completion of the exercise program), acoustic analysis, auditory–perceptual evaluation, and self-assessment of vocal impairment were conducted. Results Analysis of covariance revealed significant differences between the two groups in smoothed cepstral peak prominence measures, harmonics-to-noise ratio, the auditory–perceptual parameter of breathiness, and Voice Handicap Index-10 scores postintervention. No significant differences between the two groups were found for other measures. Conclusions Our results supported the positive effects of SOVT exercises for the aging voice, with a 6-week SPW exercise program being a clinical option. Future studies should involve long-term follow-up and additional outcome measures to better understand the efficacy of SOVT exercises, particularly SPW exercises, for the aging voice.


2020 ◽  
Vol 63 (7) ◽  
pp. 2054-2069
Author(s):  
Brandon Merritt ◽  
Tessa Bent

Purpose The purpose of this study was to investigate how speech naturalness relates to masculinity–femininity and gender identification (accuracy and reaction time) for cisgender male and female speakers as well as transmasculine and transfeminine speakers. Method Stimuli included spontaneous speech samples from 20 speakers who are transgender (10 transmasculine and 10 transfeminine) and 20 speakers who are cisgender (10 male and 10 female). Fifty-two listeners completed three tasks: a two-alternative forced-choice gender identification task, a speech naturalness rating task, and a masculinity/femininity rating task. Results Transfeminine and transmasculine speakers were rated as significantly less natural sounding than cisgender speakers. Speakers rated as less natural took longer to identify and were identified less accurately in the gender identification task; furthermore, they were rated as less prototypically masculine/feminine. Conclusions Perceptual speech naturalness for both transfeminine and transmasculine speakers is strongly associated with gender cues in spontaneous speech. Training to align a speaker's voice with their gender identity may concurrently improve perceptual speech naturalness. Supplemental Material https://doi.org/10.23641/asha.12543158


2020 ◽  
Vol 63 (12) ◽  
pp. 3974-3981
Author(s):  
Ashwini Joshi ◽  
Isha Baheti ◽  
Vrushali Angadi

Aim The purpose of this study was to develop and assess the reliability of a Hindi version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). Reliability was assessed by comparing Hindi CAPE-V ratings with English CAPE-V ratings and by the Grade, Roughness, Breathiness, Asthenia and Strain (GRBAS) scale. Method Hindi sentences were created to match the phonemic load of the corresponding English CAPE-V sentences. The Hindi sentences were adapted for linguistic content. The original English and adapted Hindi CAPE-V and GRBAS were completed for 33 bilingual individuals with normal voice quality. Additionally, the Hindi CAPE-V and GRBAS were completed for 13 Hindi speakers with disordered voice quality. The agreement of CAPE-V ratings was assessed between language versions, GRBAS ratings, and two rater pairs (three raters in total). Pearson product–moment correlation was completed for all comparisons. Results A strong correlation ( r > .8, p < .01) was found between the Hindi CAPE-V scores and the English CAPE-V scores for most variables in normal voice participants. A weak correlation was found for the variable of strain ( r < .2, p = .400) in the normative group. A strong correlation ( r > .6, p < .01) was found between the overall severity/grade, roughness, and breathiness scores in the GRBAS scale and the CAPE-V scale in normal and disordered voice samples. Significant interrater reliability ( r > .75) was present in overall severity and breathiness. Conclusions The Hindi version of the CAPE-V demonstrates good interrater reliability and concurrent validity with the English CAPE-V and the GRBAS. The Hindi CAPE-V can be used for the auditory-perceptual voice assessment of Hindi speakers.


2013 ◽  
Vol 23 (2) ◽  
pp. 49-61 ◽  
Author(s):  
Jamie Perry ◽  
Graham Schenck

Despite advances in surgical management, it is estimated that 20–30% of children with repaired cleft palate will continue to have hypernasal speech and require a second surgery to create normal velopharyngeal function (Bricknell, McFadden, & Curran, 2002; Härtel, Karsten, & Gundlach, 1994; McWilliams, 1990). A qualitative perceptual assessment by a speech-language pathologist is considered the most important step of the evaluation for children with resonance disorders (Peterson-Falzone, Hardin-Jones, & Karnell, 2010). Direct and indirect instrumental analyses should be used to confirm or validate the perceptual evaluation of an experienced speech-language pathologist (Paal, Reulbach, Strobel-Schwarthoff, Nkenke, & Schuster, 2005). The purpose of this article is to provide an overview of current instrumental assessment methods used in cleft palate care. Both direct and indirect instrumental procedures will be reviewed with descriptions of the advantages and disadvantages of each. Lastly, new developments for evaluating velopharyngeal structures and function will be provided.


Sign in / Sign up

Export Citation Format

Share Document