Comparative Acoustic Analyses of L2 English: The Search for Systematic Variation

Phonetica ◽  
2020 ◽  
Vol 77 (6) ◽  
pp. 441-479
Author(s):  
Rebecca Laturnus

Background/Aims: Previous research has shown that exposure to multiple foreign accents facilitates adaptation to an untrained novel accent. One explanation is that L2 speech varies systematically such that there are commonalities in the productions of nonnative speakers, regardless of their language background. Methods: A systematic acoustic comparison was conducted between 3 native English speakers and 6 nonnative accents. Voice onset time, unstressed vowel duration, and formant values of stressed and unstressed vowels were analyzed, comparing each nonnative accent to the native English talkers. A subsequent perception experiment tested what effect training on regionally accented voices has on participants’ comprehension of nonnative accented speech, to investigate the importance of within-speaker variation on attunement and generalization. Results: Data for each measure show substantial variability across speakers, reflecting phonetic transfer from individual L1s, as well as substantial inconsistency and variability in pronunciation, rather than commonalities in their productions. Training on native English varieties did not improve participants’ accuracy in understanding nonnative speech. Conclusion: These findings are more consistent with a hypothesis of accent attunement wherein listeners track general patterns of nonnative speech rather than relying on overlapping acoustic signals between speakers.

2018 ◽  
Vol 40 (1) ◽  
pp. 93-109
Author(s):  
YI ZHENG ◽  
ARTHUR G. SAMUEL

Abstract: It has been documented that lipreading facilitates the understanding of difficult speech, such as noisy speech and time-compressed speech. However, relatively little work has addressed the role of visual information in perceiving accented speech, another type of difficult speech. In this study, we specifically focus on accented word recognition. One hundred forty-two native English speakers made lexical decision judgments on English words or nonwords produced by speakers with Mandarin Chinese accents. The stimuli were presented either as videos of a relatively distant speaker or as videos in which we zoomed in on the speaker’s head. Consistent with studies of degraded speech, listeners were more accurate at recognizing accented words when they saw lip movements from the closer apparent distance. The effect of apparent distance tended to be larger under nonoptimal conditions: when stimuli were nonwords rather than words, and when stimuli were produced by a speaker who had a relatively strong accent. However, we did not find any influence of listeners’ prior experience with Chinese accented speech, suggesting that cross-talker generalization is limited. The current study provides practical suggestions for effective communication between native and nonnative speakers: visual information is useful, and it is more useful in some circumstances than others.


2020 ◽  
Vol 13 (2) ◽  
pp. 361-389 ◽  
Author(s):  
Timothy L. Face ◽  
Mandy R. Menke

Abstract: Previous studies of native English speakers learning Spanish as a second language (L2) document compromise voice onset time (VOT) values; however, the focus has been predominantly on voiceless stops and has almost exclusively investigated beginning and intermediate learners. This study fills a gap in the literature by considering the acquisition of VOT in both voiceless and voiced Spanish stops by long-time native English-speaking residents of Spain. Overall, the results show that the L2 speakers’ VOT values differ from those of native speakers across all stop consonants; yet L2 speakers’ productions of voiceless, as opposed to voiced, stops more closely approximate those of native speakers. Considerable individual variation is observed as no speaker achieves native-like performance overall, and no consonant is mastered by more than half of the speakers. Results are considered in light of what they contribute to our understanding of ultimate attainment of Spanish VOT, specifically, and L2 phonology more generally.


Author(s):  
Miriam Geiss ◽  
Sonja Gumbsheimer ◽  
Anika Lloyd-Smith ◽  
Svenja Schmid ◽  
Tanja Kupisch

Abstract: This study brings together two previously largely independent fields of multilingual language acquisition: heritage language and third language (L3) acquisition. We investigate the production of fortis and lenis stops in semi-naturalistic speech in the three languages of 20 heritage speakers (HSs) of Italian with German as a majority language and English as L3. The study aims to identify the extent to which the HSs produce distinct values across all three languages, or whether crosslinguistic influence (CLI) occurs. To this end, we compare the HSs’ voice onset time (VOT) values with those of L2 English speakers from Italy and Germany. The language triad exhibits overlapping and distinct VOT realizations, making VOT a potentially vulnerable category. Results indicate CLI from German into Italian, although a systemic difference is maintained. When speaking English, the HSs show an advantage over the Italian L2 control group, with less prevoicing and longer fortis stops, indicating a specific bilingual advantage.


1979 ◽  
Vol 66 (3) ◽  
pp. 654-662 ◽  
Author(s):  
Robert F. Port ◽  
Rosemarie Rotunno

1986 ◽  
Vol 29 (4) ◽  
pp. 499-504 ◽  
Author(s):  
Joanne Robbins ◽  
John Christensen ◽  
Gail Kempster

Voice onset time (VOT) and vowel duration characteristics of speakers following the Singer-Blom technique of tracheoesophageal puncture (1980) were compared to those of traditional esophageal and laryngeal speakers. Fifteen subjects in each of the three speaker groups produced the words /pik/, /kup/, and /kup/ in a carrier phrase while audio recordings were obtained. Broadband spectrograms were made of the consonant-vowel-consonant (CVC) utterances, and vowel duration and VOT were measured. Analysis of variance (ANOVA) procedures revealed that the tracheoesophageal speakers produced significantly shorter VOTs and longer vowel durations than the laryngeal speakers. However, the longer vowel durations for the tracheoesophageal speakers were not completely accounted for by the shorter VOTs found for that group. Spectrographic examination suggests that delayed voice offset time for the tracheoesophageal speakers also contributes to their longer vowel durations. Overall findings indicate that the physical characteristics and motor control properties of the neoglottis, even when driven by pulmonary air as in tracheoesophageal speakers, exert a major influence on alaryngeal voice production.


Author(s):  
Aslı Altan ◽  
Erika Hoff

Children in bilingual communities are frequently exposed to speech from nonnative speakers, but little research has described how that input might differ from the input of native speakers. There is evidence that input from nonnative speakers might be less useful to language learning children, but little research has asked why. This chapter analyzes the frequency of complex structures in the child-directed speech of 30 native English speakers and 36 nonnative speakers who were late learners of English, all speaking English to their two-and-a-half-year-old children. All instances of nine categories of complex structures were coded in transcripts of mother-child interaction. The frequency of all but one category was greater in the speech of native speakers. These findings suggest that input provided by native speakers provides more frequent models of complex structures than nonnative input.


2018 ◽  
Vol 10 (6) ◽  
pp. 193
Author(s):  
Yaseen A. Azi

Based on the literature review, three patterns (fillers, repairs, and repetitions) in the conversations of native English speakers are generally regarded as a normal part of speaking. By contrast, the same patterns in the conversations of L2 speakers are always seen as markers of disfluency and linguistic disabilities of the nonnative speakers. This study therefore examines how the three disfluency patterns are used by Saudi English speakers at different levels of fluency. The sample comprises two groups of participants at different fluency levels. Transcription and discourse analysis of a one-hour recording of the two groups showed that the three patterns (fillers, repairs, and repetitions) should not be generally associated with disfluency. Instead, repetitions and self-repairs were used equally by the two groups, and such patterns can serve as a conversational device. However, the filler “uh” with longer pausing can clearly predict disfluency among the Saudi English speakers.


2011 ◽  
Vol 15 (2) ◽  
pp. 275-287 ◽  
Author(s):  
SUE ANN S. LEE ◽  
GREGORY K. IVERSON

The purpose of this study was to conduct an acoustic examination of the obstruent stops produced by Korean–English bilingual children in connection with the question of whether bilinguals establish distinct categories of speech sounds across languages. Stop productions were obtained from ninety children in two age ranges, five and ten years: thirty Korean–English bilinguals, thirty monolingual Koreans and thirty monolingual English speakers. Voice-Onset-Time (VOT) lag at word-initial stop and fundamental frequency (f0) in the following vowel (hereafter vowel-onset f0) were measured. The bilingual children showed different patterns of VOT in comparison to both English and Korean monolinguals, with longer VOT in their production of Korean stop consonants and shorter VOT for English. Moreover, the ten-year-old bilinguals distinguished all stop categories using both VOT and vowel-onset f0, whereas the five-year-olds tended to make stop distinctions based on VOT but not vowel-onset f0. The results of this study suggest that bilingual children at around five years of age do not yet have fully separate stop systems, and that the systems continue to evolve during the developmental period.


2019 ◽  
Vol 63 (2) ◽  
pp. 436-452
Author(s):  
Navin Viswanathan ◽  
Annie J. Olmstead ◽  
M. Pilar Aivar

Among other characteristics, voiced and voiceless consonants differ in voice onset time (VOT; Lisker & Abramson, 1964). In addition, in English, voiced consonants are typically followed by longer vowels than their unvoiced counterparts (Allen & Miller, 1999). In Spanish, this relationship is less systematic (Zimmerman & Sapon, 1958). In two experiments, we investigated perceptual sensitivities of English and Spanish native speakers to following vowel length (VL) in categorizing syllables that ranged from a prevoiced bilabial stop [ba] to a long-lag bilabial stop [pa]. According to our results, English speakers show sensitivity to following vowels with VLs falling within an English-typical range (Experiment 1), but not when vowels are shorter and in a Spanish-typical range (Experiment 2). Interestingly, Spanish native speakers do not show sensitivity to following VL in either condition. These results suggest that VOT-VL tradeoffs in perception reflect phonological sensitivities of listeners and are not reducible to speech rate compensation.


2015 ◽  
Vol 37 (2) ◽  
pp. 303-322 ◽  
Author(s):  
JOAN SERENO ◽  
LYNNE LAMMERS ◽  
ALLARD JONGMAN

Abstract: The present study examines the relative impact of segments and intonation on accentedness, comprehensibility, and intelligibility, specifically investigating the separate contribution of segmental and intonational information to perceived foreign accent in Korean-accented English. Two English speakers and two Korean speakers recorded 40 English sentences. The sentences were manipulated by combining segments from one speaker with intonation (fundamental frequency contour and duration) from another speaker. Four versions of each sentence were created: one English control (English segments and English intonation), one Korean control (Korean segments and Korean intonation), and two Korean–English combinations (one with English segments and Korean intonation; the other with Korean segments and English intonation). Forty native English speakers transcribed the sentences for intelligibility and rated their comprehensibility and accentedness. The data show that segments had a significant effect on accentedness, comprehensibility, and intelligibility, but intonation only had an effect on intelligibility. Contrary to previous studies, the present study, separating segments from intonation, suggests that segmental information contributes substantially more to the perception of foreign accentedness than intonation. Native speakers seem to rely mainly on segments when determining foreign accentedness.

