Comparative Acoustic Analyses of L2 English: The Search for Systematic Variation

Rebecca Laturnus

doi:10.1159/000508387

Comparative Acoustic Analyses of L2 English: The Search for Systematic Variation

Phonetica ◽

10.1159/000508387 ◽

2020 ◽

Vol 77 (6) ◽

pp. 441-479

Author(s):

Rebecca Laturnus

Keyword(s):

Voice Onset Time ◽

Nonnative Speakers ◽

Onset Time ◽

Native English Speakers ◽

English Speakers ◽

Vowel Duration ◽

Language Background ◽

Accented Speech ◽

Foreign Accents ◽

Acoustic Analyses

Background/Aims: Previous research has shown that exposure to multiple foreign accents facilitates adaptation to an untrained novel accent. One explanation is that L2 speech varies systematically such that there are commonalities in the productions of nonnative speakers, regardless of their language background. Methods: A systematic acoustic comparison was conducted between 3 native English speakers and 6 nonnative accents. Voice onset time, unstressed vowel duration, and formant values of stressed and unstressed vowels were analyzed, comparing each nonnative accent to the native English talkers. A subsequent perception experiment tests what effect training on regionally accented voices has on the participant’s comprehension of nonnative accented speech to investigate the importance of within-speaker variation on attunement and generalization. Results: Data for each measure show substantial variability across speakers, reflecting phonetic transfer from individual L1s, as well as substantial inconsistency and variability in pronunciation, rather than commonalities in their productions. Training on native English varieties did not improve participants’ accuracy in understanding nonnative speech. Conclusion: These findings are more consistent with a hypothesis of accent attunement wherein listeners track general patterns of nonnative speech rather than relying on overlapping acoustic signals between speakers.

Download Full-text

How much do visual cues help listeners in perceiving accented speech?

Applied Psycholinguistics ◽

10.1017/s0142716418000462 ◽

2018 ◽

Vol 40 (1) ◽

pp. 93-109

Author(s):

YI ZHENG ◽

ARTHUR G. SAMUEL

Keyword(s):

Visual Information ◽

Visual Cues ◽

Nonnative Speakers ◽

Apparent Distance ◽

Native English Speakers ◽

English Speakers ◽

Accented Speech ◽

Compressed Speech ◽

Degraded Speech

AbstractIt has been documented that lipreading facilitates the understanding of difficult speech, such as noisy speech and time-compressed speech. However, relatively little work has addressed the role of visual information in perceiving accented speech, another type of difficult speech. In this study, we specifically focus on accented word recognition. One hundred forty-two native English speakers made lexical decision judgments on English words or nonwords produced by speakers with Mandarin Chinese accents. The stimuli were presented as either as videos that were of a relatively far speaker or as videos in which we zoomed in on the speaker’s head. Consistent with studies of degraded speech, listeners were more accurate at recognizing accented words when they saw lip movements from the closer apparent distance. The effect of apparent distance tended to be larger under nonoptimal conditions: when stimuli were nonwords than words, and when stimuli were produced by a speaker who had a relatively strong accent. However, we did not find any influence of listeners’ prior experience with Chinese accented speech, suggesting that cross-talker generalization is limited. The current study provides practical suggestions for effective communication between native and nonnative speakers: visual information is useful, and it is more useful in some circumstances than others.

Download Full-text

L2 Acquisition of Spanish VOT by English-Speaking Immigrants in Spain

Studies in Hispanic and Lusophone Linguistics ◽

10.1515/shll-2020-2034 ◽

2020 ◽

Vol 13 (2) ◽

pp. 361-389 ◽

Cited By ~ 1

Author(s):

Timothy L. Face ◽

Mandy R. Menke

Keyword(s):

Native Speakers ◽

Voice Onset Time ◽

Onset Time ◽

Native English Speakers ◽

English Speakers ◽

L2 Acquisition ◽

L2 Phonology ◽

Long Time ◽

English Speaking ◽

Considerable Individual Variation

AbstractPrevious studies of native English speakers learning Spanish as a second language (L2) document compromise voice onset time (VOT) values; however, the focus has been predominantly on voiceless stops and has almost exclusively investigated beginning and intermediate learners. This study fills a gap in the literature by considering the acquisition of VOT in both voiceless and voiced Spanish stops by long-time native English-speaking residents of Spain. Overall, the results show that the L2 speakers’ VOT values differ from those of native speakers across all stop consonants; yet L2 speakers’ productions of voiceless, as opposed to voiced, stops more closely approximate those of native speakers. Considerable individual variation is observed as no speaker achieves native-like performance overall, and no consonant is mastered by more than half of the speakers. Results are considered in light of what they contribute to our understanding of ultimate attainment of Spanish VOT, specifically, and L2 phonology more generally.

Download Full-text

VOICE ONSET TIME IN MULTILINGUAL SPEAKERS: ITALIAN HERITAGE SPEAKERS IN GERMANY WITH L3 ENGLISH

Studies in Second Language Acquisition ◽

10.1017/s0272263121000280 ◽

2021 ◽

pp. 1-25

Author(s):

Miriam Geiss ◽

Sonja Gumbsheimer ◽

Anika Lloyd-Smith ◽

Svenja Schmid ◽

Tanja Kupisch

Keyword(s):

Heritage Language ◽

Voice Onset Time ◽

Onset Time ◽

English Speakers ◽

Control Group ◽

Heritage Speakers ◽

Third Language ◽

Bilingual Advantage ◽

L3 Acquisition ◽

Crosslinguistic Influence

Abstract This study brings together two previously largely independent fields of multilingual language acquisition: heritage language and third language (L3) acquisition. We investigate the production of fortis and lenis stops in semi-naturalistic speech in the three languages of 20 heritage speakers (HSs) of Italian with German as a majority language and English as L3. The study aims to identify the extent to which the HSs produce distinct values across all three languages, or whether crosslinguistic influence (CLI) occurs. To this end, we compare the HSs’ voice onset time (VOT) values with those of L2 English speakers from Italy and Germany. The language triad exhibits overlapping and distinct VOT realizations, making VOT a potentially vulnerable category. Results indicate CLI from German into Italian, although a systemic difference is maintained. When speaking English, the HSs show an advantage over the Italian L2 control group, with less prevoicing and longer fortis stops, indicating a specific bilingual advantage.

Download Full-text

Relation between voice‐onset time and vowel duration

The Journal of the Acoustical Society of America ◽

10.1121/1.383692 ◽

1979 ◽

Vol 66 (3) ◽

pp. 654-662 ◽

Cited By ~ 55

Author(s):

Robert F. Port ◽

Rosemarie Rotunno

Keyword(s):

Voice Onset Time ◽

Onset Time ◽

Vowel Duration

Download Full-text

Characteristics of Speech Production after Tracheoesophageal Puncture

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.2904.499 ◽

1986 ◽

Vol 29 (4) ◽

pp. 499-504 ◽

Cited By ~ 18

Author(s):

Joanne Robbins ◽

John Christensen ◽

Gail Kempster

Keyword(s):

Motor Control ◽

Speech Production ◽

Voice Onset Time ◽

Onset Time ◽

Major Influence ◽

Vowel Duration ◽

Voice Production ◽

Audio Recordings ◽

Tracheoesophageal Puncture ◽

Offset Time

Voice onset time (VOT) and vowel duration characteristics of speakers following the Singer-Blom technique of tracheoesophageal puncture (1980) were compared to those of traditional esophageal and laryngeal speakers. Fifteen subjects in each of the three speaker groups produced the words /pik/, /kup/, and /kup/ in a carrier phrase while audio recordings were obtained. Broadband spectrograms were made of the consonant-vowel-consonant (CVC) utterances and vowel duration and VOT were measured. Analysis of variance (ANOVA) procedures revealed that the tracheoesophageal speakers produced significantly shorter VOTs and longer vowel durations than the laryngeal speakers. However, the longer vowel durations for the traeheoesophageal speakers were not completely accounted for by the shorter VOTs found for that group. Spectrographic examination suggests that delayed voice offset time for the tracheoesophageal speakers also contributes to their longer vowel durations. Overall findings indicate that the physical characteristics and motor control properties of the neoglottis, even when driven by pulmonary air as in tracheoesophageal speakers, exert a major influence on alaryngeal voice production.

Download Full-text

Complex Structures in the Child-Directed Speech of Native and Nonnative Speakers

Psycholinguistics and Cognition in Language Processing - Advances in Linguistics and Communication Studies ◽

10.4018/978-1-5225-4009-0.ch007 ◽

2018 ◽

pp. 127-139 ◽

Cited By ~ 2

Author(s):

Aslı Altan ◽

Erika Hoff

Keyword(s):

Language Learning ◽

Native Speakers ◽

Nonnative Speakers ◽

Native English Speakers ◽

English Speakers ◽

Complex Structures ◽

Child Interaction ◽

Mother Child Interaction ◽

Bilingual Communities

Children in bilingual communities are frequently exposed to speech from nonnative speakers, but little research has described how that input might differ from the input of native speakers. There is evidence that input from nonnative speakers might be less useful to language learning children, but little research has asked why. This chapter analyzes the frequency of complex structures in the child-directed speech of 30 native English speakers and 36 nonnative speakers who were late learners of English, all speaking English to their two-and-a-half-year-old children. All instances of nine categories of complex structures were coded in transcripts of mother-child interaction. The frequency of all but one category was greater in the speech of native speakers. These findings suggest that input provided by native speakers provides more frequent models of complex structures than nonnative input.

Download Full-text

Fillers, Repairs and Repetitions in the Conversations of Saudi English Speakers: Conversational Device or Disfluency Markers

International Journal of Linguistics ◽

10.5296/ijl.v10i6.14016 ◽

2018 ◽

Vol 10 (6) ◽

pp. 193

Author(s):

Yaseen A. Azi

Keyword(s):

Discourse Analysis ◽

Literature Review ◽

Nonnative Speakers ◽

Native English Speakers ◽

The Other ◽

English Speakers ◽

Other Hand ◽

Different Levels ◽

Normal Speaking

Based on the literature review, the three patterns (fillers, repairs and repetitions) in the conversations of the native English speakers are generally regarded as results of the normal speaking between people. On the other hand, the same patterns in the conversations of the L2 speakers are always seen as a marker of disfluency and linguistic disabilities of the nonnative speakers. Therefore, this study simply focuses on finding how the three disfluency patterns are used by the Saudi English speakers from different levels of fluency. The sampling of the study includes two groups of participants from different fluency levels. Through the transcriptions and the discourse analysis of one hour recoding of the two groups, the results showed that the three patterns (fillers, repairs and repetitions) should not be generally associated with disfluency. Instead, repetitions and self-repairs have been equally used by the two groups and such patterns can be used as a conversational device. However, the filler “uh” with longer pausing can clearly predict disfluency among the Saudi English speakers.

Download Full-text

Stop consonant productions of Korean–English bilingual children

Bilingualism Language and Cognition ◽

10.1017/s1366728911000083 ◽

2011 ◽

Vol 15 (2) ◽

pp. 275-287 ◽

Cited By ~ 17

Author(s):

SUE ANN S. LEE ◽

GREGORY K. IVERSON

Keyword(s):

Fundamental Frequency ◽

Voice Onset Time ◽

Onset Time ◽

Stop Consonant ◽

Developmental Period ◽

English Speakers ◽

Bilingual Children ◽

Stop Consonants ◽

Speech Sounds ◽

English Bilingual

The purpose of this study was to conduct an acoustic examination of the obstruent stops produced by Korean–English bilingual children in connection with the question of whether bilinguals establish distinct categories of speech sounds across languages. Stop productions were obtained from ninety children in two age ranges, five and ten years: thirty Korean–English bilinguals, thirty monolingual Koreans and thirty monolingual English speakers. Voice-Onset-Time (VOT) lag at word-initial stop and fundamental frequency (f0) in the following vowel (hereafter vowel-onset f0) were measured. The bilingual children showed different patterns of VOT in comparison to both English and Korean monolinguals, with longer VOT in their production of Korean stop consonants and shorter VOT for English. Moreover, the ten-year-old bilinguals distinguished all stop categories using both VOT and vowel-onset f0,whereas the five-year-olds tended to make stop distinctions based on VOT but not vowel-onset f0. The results of this study suggest that bilingual children at around five years of age do not yet have fully separate stop systems, and that the systems continue to evolve during the developmental period.

Download Full-text

The Use of Vowel Length in Making Voicing Judgments by Native Listeners of English and Spanish: Implications for Rate Normalization

Language and Speech ◽

10.1177/0023830919851529 ◽

2019 ◽

Vol 63 (2) ◽

pp. 436-452

Author(s):

Navin Viswanathan ◽

Annie J. Olmstead ◽

M. Pilar Aivar

Keyword(s):

Native Speakers ◽

Voice Onset Time ◽

Speech Rate ◽

Onset Time ◽

English Speakers ◽

Vowel Length ◽

Typical Range

Among other characteristics, voiced and voiceless consonants differ in voice onset time (VOT; Lisker & Abramson, 1964). In addition, in English, voiced consonants are typically followed by longer vowels than their unvoiced counterparts (Allen & Miller, 1999). In Spanish, this relationship is less systematic (Zimmerman & Sapon, 1958). In two experiments, we investigated perceptual sensitivities of English and Spanish native speakers to following vowel length (VL) in categorizing syllables that ranged from a prevoiced bilabial stop [ba] to a long-lag bilabial stop [pa]. According to our results, English speakers show sensitivity to following vowels with VLs falling within an English-typical range (Experiment 1), but not when vowels are shorter and in a Spanish-typical range (Experiment 2). Interestingly, Spanish native speakers do not show sensitivity to following VL in either condition. These results suggest that VOT-VL tradeoffs in perception reflect phonological sensitivities of listeners and are not reducible to speech rate compensation.

Download Full-text

The relative contribution of segments and intonation to the perception of foreign-accented speech

Applied Psycholinguistics ◽

10.1017/s0142716414000575 ◽

2015 ◽

Vol 37 (2) ◽

pp. 303-322 ◽

Cited By ~ 20

Author(s):

JOAN SERENO ◽

LYNNE LAMMERS ◽

ALLARD JONGMAN

Keyword(s):

Fundamental Frequency ◽

Native Speakers ◽

Native English Speakers ◽

The Other ◽

Foreign Accent ◽

English Speakers ◽

Relative Impact ◽

Accented Speech ◽

Relative Contribution ◽

Accented English

ABSTRACTThe present study examines the relative impact of segments and intonation on accentedness, comprehensibility, and intelligibility, specifically investigating the separate contribution of segmental and intonational information to perceived foreign accent in Korean-accented English. Two English speakers and two Korean speakers recorded 40 English sentences. The sentences were manipulated by combining segments from one speaker with intonation (fundamental frequency contour and duration) from another speaker. Four versions of each sentence were created: one English control (English segments and English intonation), one Korean control (Korean segments and Korean intonation), and two Korean–English combinations (one with English segments and Korean intonation; the other with Korean segments and English intonation). Forty native English speakers transcribed the sentences for intelligibility and rated their comprehensibility and accentedness. The data show that segments had a significant effect on accentedness, comprehensibility, and intelligibility, but intonation only had an effect on intelligibility. Contrary to previous studies, the present study, separating segments from intonation, suggests that segmental information contributes substantially more to the perception of foreign accentedness than intonation. Native speakers seem to rely mainly on segments when determining foreign accentedness.

Download Full-text