A longitudinal study of within-individual variation in human voice pitch

2016 ◽  
Vol 140 (4) ◽  
pp. 3397-3397
Author(s):  
Katarzyna Pisanski ◽  
Meddy Fouquet ◽  
Nicolas Mathevon ◽  
David Reby
2006 ◽  
Vol 27 (4) ◽  
pp. 283-296 ◽  
Author(s):  
David Andrew Puts ◽  
Steven J.C. Gaulin ◽  
Katherine Verdolini

2018 ◽  
Vol 14 (7) ◽  
pp. 20180065 ◽  
Author(s):  
Florence Levrero ◽  
Nicolas Mathevon ◽  
Katarzyna Pisanski ◽  
Erik Gustafsson ◽  
David Reby

Voice pitch (fundamental frequency, F 0 ) is a key dimension of our voice that varies between sexes after puberty, and also among individuals of the same sex both before and after puberty. While a recent longitudinal study indicates that inter-individual differences in voice pitch remain stable in men during adulthood and may even be determined before puberty (Fouquet et al. 2016 R. Soc. open sci. 3 , 160395. ( doi:10.1098/rsos.160395 )), whether these differences emerge in infancy remains unknown. Here, using a longitudinal study design, we investigate the hypothesis that inter-individual differences in F 0 are already present in the cries of pre-verbal babies. While based on a small sample ( n = 15), our results indicate that the F 0 of babies' cries at 4 months of age may predict the F 0 of their speech utterances at 5 years of age, explaining 41% of the inter-individual variance in voice pitch at that age in our sample. We also found that the right-hand ratio of the length of their index to ring finger (2D : 4D digit ratio), which has been proposed to constitute an index of prenatal testosterone exposure, was positively correlated with F 0 at both 4 months and 5 years of age. These findings suggest that a substantial proportion of between-individual differences in voice pitch, which convey important biosocial information about speakers, may partly originate in utero and thus already be present soon after birth.


2018 ◽  
Vol 285 (1893) ◽  
pp. 20181634 ◽  
Author(s):  
Katarzyna Pisanski ◽  
Anna Oleszkiewicz ◽  
Justyna Plachetka ◽  
Marzena Gmiterek ◽  
David Reby

Inter-individual differences in human fundamental frequency ( F 0, perceived as voice pitch) predict mate quality and reproductive success, and affect listeners' social attributions. Although humans can readily and volitionally manipulate their vocal apparatus and resultant voice pitch, for instance, in the production of speech sounds and singing, little is known about whether humans exploit this capacity to adjust the non-verbal dimensions of their voices during social (including sexual) interactions. Here, we recorded full-length conversations of 30 adult men and women taking part in real speed-dating events and tested whether their voice pitch (mean, range and variability) changed with their personal mate choice preferences and the overall desirability of each dating partner. Within-individual analyses indicated that men lowered the minimum pitch of their voices when interacting with women who were overall highly desired by other men. Men also lowered their mean voice pitch on dates with women they selected as potential mates, particularly those who indicated a mutual preference (matches). Interestingly, although women spoke with a higher and more variable voice pitch towards men they selected as potential mates, women lowered both voice pitch parameters towards men who were most desired by other women and whom they also personally preferred. Between-individual analyses indicated that men in turn preferred women with lower-pitched voices, wherein women's minimum voice pitch explained up to 55% of the variance in men's mate preferences. These results, derived in an ecologically valid setting, show that individual- and group-level mate preferences can interact to affect vocal behaviour, and support the hypothesis that human voice modulation functions in non-verbal communication to elicit favourable judgements and behaviours from others, including potential mates.


2001 ◽  
Vol 297 (1) ◽  
pp. 61-63 ◽  
Author(s):  
Jean-Louis Millot ◽  
Gérard Brand
Keyword(s):  

2020 ◽  
Vol 7 (2) ◽  
pp. 191642 ◽  
Author(s):  
Katarzyna Pisanski ◽  
Jordan Raine ◽  
David Reby

Fundamental frequency ( F 0, perceived as voice pitch) predicts sex and age, hormonal status, mating success and a range of social traits, and thus functions as an important biosocial marker in modal speech. Yet, the role of F 0 in human nonverbal vocalizations remains unclear, and given considerable variability in F 0 across call types, it is not known whether F 0 cues to vocalizer attributes are shared across speech and nonverbal vocalizations. Here, using a corpus of vocal sounds from 51 men and women, we examined whether individual differences in F 0 are retained across neutral speech, valenced speech and nonverbal vocalizations (screams, roars and pain cries). Acoustic analyses revealed substantial variability in F 0 across vocal types, with mean F 0 increasing as much as 10-fold in screams compared to speech in the same individual. Despite these extreme pitch differences, sexual dimorphism was preserved within call types and, critically, inter-individual differences in F 0 correlated across vocal types ( r = 0.36–0.80) with stronger relationships between vocal types of the same valence (e.g. 38% of the variance in roar F 0 was predicted by aggressive speech F 0). Our results indicate that biologically and socially relevant indexical cues in the human voice are preserved in simulated valenced speech and vocalizations, including vocalizations characterized by extreme F 0 modulation, suggesting that voice pitch may function as a reliable individual and biosocial marker across disparate communication contexts.


Analysis of human voice based on its pitch can be used in detecting the pauses. The available algorithms for pause detection could succeed to some extent and lot of scope for better performance still exists. The proposed intelligent pause detection protocol (IPDP) is the convergence of (i) calculation of Mean/RMS peak value from human voice pitch (ii) estimating the pause using MLE algorithm and (iii) optimizing the bandwidth utilization of Vocoders using DTX algorithm. The work carried projects better pause removal than the existing standard methods.


2018 ◽  
Author(s):  
Kosuke Motoki ◽  
Toshiki Saito ◽  
Rui Nouchi ◽  
Ryuta Kawashima ◽  
Motoaki Sugiura

Crossmodal correspondences have been increasingly reported in recent scholarship, and pitch–taste associations have been observed. People consistently associate high-pitched vocal tones with sweet/sour foods, while low-pitched tones tend to be associated with bitter foods. The human voice is key in broadcast advertising, and the role of voice in communication generally is partly characterized by acoustic parameters of pitch. However, it remains unknown whether voice pitch and other senses relevant to product attributes (e.g., taste) interactively influence consumer behavior. Since congruent sensory information is desirable, it is plausible that voice pitch and taste interactively guide consumers’ responses to advertising. Based on the crossmodal correspondence phenomenon, this study aimed to elucidate the role played by voice pitch/taste correspondences in advertising effectiveness. Participants listened to voiceover advertisements (at a high or low pitch) for three food products with distinct tastes (sweet, sour, and bitter) and rated their buying intention (an indicator of advertising effectiveness). The results show that the participants were likely to exhibit greater buying intention toward both sweet and sour food when they listened to high-pitched (vs. low-pitched) voiceover advertisements. The effects for sweet food occurred when the vocal pitch was considerably high (Studies 2 and 3), but not when pitch was only moderately high (Study 1). The influence of high pitch on sour food preferences was somewhat inconsistent. These findings emphasize the role that voice pitch/taste correspondence plays in preference formation, and advance the applicability of crossmodal correspondences to business.


2019 ◽  
Vol 32 (4-5) ◽  
pp. 401-427 ◽  
Author(s):  
Kosuke Motoki ◽  
Toshiki Saito ◽  
Rui Nouchi ◽  
Ryuta Kawashima ◽  
Motoaki Sugiura

Abstract We have seen a rapid growth of interest in cross-modal correspondences between sound and taste over recent years. People consistently associate higher-pitched sounds with sweet/sour foods, while lower-pitched sounds tend to be associated with bitter foods. The human voice is key in broadcast advertising, and the role of voice in communication generally is partly characterized by acoustic parameters of pitch. However, it remains unknown whether voice pitch and taste interactively influence consumer behavior. Since consumers prefer congruent sensory information, it is plausible that voice pitch and taste interactively influence consumers’ responses to advertising stimuli. Based on the cross-modal correspondence phenomenon, this study aimed to elucidate the role played by voice pitch–taste correspondences in advertising effectiveness. Participants listened to voiceover advertisements (at a higher or lower pitch than the original narrator’s voice) for three food products with distinct tastes (sweet, sour, and bitter) and rated their buying intention (an indicator of advertising effectiveness). The results show that the participants were likely to exhibit greater buying intention toward both sweet and sour food when they listened to higher-pitched (vs lower-pitched) voiceover advertisements. The influence of a higher pitch on sweet and sour food preferences was observed in only two of the three studies: studies 1 and 2 for sour food, and studies 2 and 3 for sweet food. These findings emphasize the role that voice pitch–taste correspondence plays in preference formation, and advance the applicability of cross-modal correspondences to business.


2020 ◽  
Vol 6 (3) ◽  
pp. 352-379
Author(s):  
Beth Zielinski ◽  
Elizabeth Pryor

Abstract In this longitudinal study we tracked change in comprehensibility and English use over a 10-month period in 14 L2 English learners (8 beginner, 6 intermediate) settling in Australia. They were interviewed 4 times during the 10 months as part of a larger longitudinal study. English use was reported at each interview using a language map and excerpts from recordings of Interviews 1 and 4 were rated for comprehensibility. Intermediate participants tended to be more comprehensible and maintain a higher level of English use over time than the beginners. Exploration of individual variation revealed a range of comprehensibility outcomes, the variable and non-linear nature of English use trajectories, and a possible relationship between comprehensibility change and English use for some participants. Important methodological implications for future studies relate to the measurement of comprehensibility and English use, the speech samples used for comprehensibility ratings, and the importance of individual variation.


Sign in / Sign up

Export Citation Format

Share Document