Relations Between Prosodic Variables and Emotions in Normal American English Utterances

1968 ◽  
Vol 11 (3) ◽  
pp. 481-487 ◽  
Author(s):  
George L. Huttar

The emotional states of an adult male American speaker, as reflected in 30 utterances, were evaluated by 12 subjects on nine 7-point semantic differential scales. The subjects also evaluated the utterances on similar scales for pitch, loudness, and speed. Significant correlations were found between some acoustic variables and the judgments of some types of emotion. Higher correlations were found between the acoustic variables and judgments of degree of emotion. Correlation coefficients between judgments of emotion and judgments of prosodic features were in general higher than the correlations involving the acoustic variables. Degree of perceived emotion was found to be highly and positively correlated with fundamental frequency range and intensity range. A causal explanation of these relations in terms of human physiology is suggested.

2013 ◽  
Vol 1 (1) ◽  
pp. 54-67
Author(s):  
Kanu Boku ◽  
Taro Asada ◽  
Yasunari Yoshitomi ◽  
Masayoshi Tabuse

Recently, methods for adding emotion to synthetic speech have received considerable attention in the field of speech synthesis research. For generating emotional synthetic speech, it is necessary to control the prosodic features of the utterances. The authors propose a case-based method for generating emotional synthetic speech by exploiting the characteristics of the maximum amplitude and the utterance time of vowels, and the fundamental frequency of emotional speech. As an initial investigation, they adopted the utterance of Japanese names, which are semantically neutral. By using the proposed method, emotional synthetic speech made from the emotional speech of one male subject was discriminable with a mean accuracy of 70% when ten subjects listened to the emotional synthetic utterances of “angry,” “happy,” “neutral,” “sad,” or “surprised” when the utterance was the Japanese name “Taro.”


2020 ◽  
Vol 10 (14) ◽  
pp. 4751
Author(s):  
Gee Won Shin ◽  
Sunghwan Park ◽  
Yong Min Kim ◽  
Yushin Lee ◽  
Myung Hwan Yun

When developing a user-oriented product, it is crucial to consider users’ affective needs. Various semantic differential (SD) methods have been used to identify affect regarding materials, and this is the most important property in products. This study aims to determine which of the three conventional SD methods (absolute evaluation 1 [AE 1], absolute evaluation 2 [AE 2], or relative evaluation [RE]) is most effective for affective evaluation. Affective evaluation was performed for vehicle instrument panels by each of these three SD methods. Two quantitative analysis methods (correlation analysis and repeated-measures ANOVA) were used to examine the performance (sample distinguishability) of each evaluation method, and it was found that both AE 2 and RE produced better results than AE 1. The correlation coefficients and p-values in correlation analysis were slightly better for RE than for AE 2. In conclusion, an affective evaluation produced better results when pairwise samples (especially one sample pair) were presented, indicating that maintaining distinct samples is very important. The clearer the difference in comparison targets is, the more accurate the evaluation results.


1971 ◽  
Vol 14 (2) ◽  
pp. 301-306 ◽  
Author(s):  
Barbara V. Fishman ◽  
Robert E. McGlone ◽  
Thomas Shipp

Five young adult male subjects with normal larynxes were recorded as they sustained phonation during one normal and three drug conditions. The vocal tasks included obtaining the total fundamental frequency range, tone-matching ability, and vocal fry production, and were performed by the subject (1) before drugs, (2) following injection of a tranquilizer-narcotic premedication, (3) after topical anesthesia of the larynx, and (4) during a drug-recovery period. Results showed no significant differences between conditions on any of the measures of sustained phonation. It was concluded that neither depressed cortical function nor sensory deprivation of the laryngeal mucosa alters the subject’s phonatory capabilities essential to the performance of the selected vocal tasks.


2016 ◽  
Vol 38 (3) ◽  
pp. 541-570 ◽  
Author(s):  
ZHEN QIN ◽  
YU-FU CHIEN ◽  
ANNIE TREMBLAY

ABSTRACTThis study investigates whether second language learners’ processing of stress can be explained by the degree to which suprasegmental cues contribute to lexical identity in the native language. It focuses on Standard Mandarin, Taiwan Mandarin, and American English listeners’ processing of stress in English nonwords. In Mandarin, fundamental frequency contributes to lexical identity by signaling lexical tones, but only in Standard Mandarin does duration distinguish stressed–unstressed and stressed–stressed words. Participants completed sequence-recall tasks containing English disyllabic nonwords contrasting in stress. Experiment 1 used natural stimuli; Experiment 2 used resynthesized stimuli that isolated fundamental frequency and duration cues. Experiment 1 revealed no difference among the groups; in Experiment 2, Standard Mandarin listeners used duration more than Taiwan Mandarin listeners did. These results are interpreted within a cue-weighting theory of speech perception.


1988 ◽  
Vol 31 (4) ◽  
pp. 723-725 ◽  
Author(s):  
Marica C. Wheat ◽  
Amelia I. Hudson

The fundamental vocal frequency characteristics were measured from 50 male and 50 female Black 6-year-old children during prompted spontaneous speech. Boys had a mean fundamental frequency of 219.50 Hz, with a mean range of 134.80–298.70 Hz. Girls had a mean fundamental frequency (F 0 ) of 211.30 Hz and a mean frequency range of 137.60–297.50 Hz. No significant differences in mean or range values were found as a function of speaker sex. Nonsignificant relationships were found between physical variables of speaker height and weight and these F 0 measures for either sex and for the speaker group combined. The results of the present study were compared to previous research concerning the speaking fundamental vocal frequency of White children.


2017 ◽  
Vol 0 (17) ◽  
pp. 32-39
Author(s):  
Величко Олег Миколайович ◽  
Шевкун Сергій Миколайович ◽  
Юлія Миколаївна Куліш ◽  
Марина Валеріївна Добролюбова

Rhema ◽  
2020 ◽  
pp. 9-17
Author(s):  
Natalia M. Solntseva ◽  
Zhang Rui

The article deals with the phonostylistic and semantic features of acoustic images in the early realist stories of Alexander Grin. The conclusion is made about their important compositional role. Attention is focused on vocabulary with acoustic semantics, as well as onomatopoeia understood as the imitation of both the sound and its meaning. The functions of sound polyphony in the depicting of landscapes and emotional states and in the plot structure are discussed. The role of alliteration, the temporal characteristics of acoustic images, a combination of autologous and associative images, and variable functionality of remarks in dialogs are described. The prosodic features of the narrative are analyzed. The motifs of silence and music and their role in the semantic structure of Grin’s stories are noted.


Sign in / Sign up

Export Citation Format

Share Document