scholarly journals Non-native Listeners’ Recognition of High-Variability Speech Using PRESTO

2014 ◽  
Vol 25 (09) ◽  
pp. 869-892 ◽  
Author(s):  
Terrin N. Tamati ◽  
David B. Pisoni

Background: Natural variability in speech is a significant challenge to robust successful spoken word recognition. In everyday listening environments, listeners must quickly adapt and adjust to multiple sources of variability in both the signal and listening environments. High-variability speech may be particularly difficult to understand for non-native listeners, who have less experience with the second language (L2) phonological system and less detailed knowledge of sociolinguistic variation of the L2. Purpose: The purpose of this study was to investigate the effects of high-variability sentences on non-native speech recognition and to explore the underlying sources of individual differences in speech recognition abilities of non-native listeners. Research Design: Participants completed two sentence recognition tasks involving high-variability and low-variability sentences. They also completed a battery of behavioral tasks and self-report questionnaires designed to assess their indexical processing skills, vocabulary knowledge, and several core neurocognitive abilities. Study Sample: Native speakers of Mandarin (n = 25) living in the United States recruited from the Indiana University community participated in the current study. A native comparison group consisted of scores obtained from native speakers of English (n = 21) in the Indiana University community taken from an earlier study. Data Collection and Analysis: Speech recognition in high-variability listening conditions was assessed with a sentence recognition task using sentences from PRESTO (Perceptually Robust English Sentence Test Open-Set) mixed in 6-talker multitalker babble. Speech recognition in low-variability listening conditions was assessed using sentences from HINT (Hearing In Noise Test) mixed in 6-talker multitalker babble. Indexical processing skills were measured using a talker discrimination task, a gender discrimination task, and a forced-choice regional dialect categorization task. Vocabulary knowledge was assessed with the WordFam word familiarity test, and executive functioning was assessed with the BRIEF-A (Behavioral Rating Inventory of Executive Function – Adult Version) self-report questionnaire. Scores from the non-native listeners on behavioral tasks and self-report questionnaires were compared with scores obtained from native listeners tested in a previous study and were examined for individual differences. Results: Non-native keyword recognition scores were significantly lower on PRESTO sentences than on HINT sentences. Non-native listeners’ keyword recognition scores were also lower than native listeners’ scores on both sentence recognition tasks. Differences in performance on the sentence recognition tasks between non-native and native listeners were larger on PRESTO than on HINT, although group differences varied by signal-to-noise ratio. The non-native and native groups also differed in the ability to categorize talkers by region of origin and in vocabulary knowledge. Individual non-native word recognition accuracy on PRESTO sentences in multitalker babble at more favorable signal-to-noise ratios was found to be related to several BRIEF-A subscales and composite scores. However, non-native performance on PRESTO was not related to regional dialect categorization, talker and gender discrimination, or vocabulary knowledge. Conclusions: High-variability sentences in multitalker babble were particularly challenging for non-native listeners. Difficulty under high-variability testing conditions was related to lack of experience with the L2, especially L2 sociolinguistic information, compared with native listeners. Individual differences among the non-native listeners were related to weaknesses in core neurocognitive abilities affecting behavioral control in everyday life.

2013 ◽  
Vol 24 (07) ◽  
pp. 616-634 ◽  
Author(s):  
Terrin N. Tamati ◽  
Jaimie L. Gilbert ◽  
David B. Pisoni

Background: Previous studies investigating speech recognition in adverse listening conditions have found extensive variability among individual listeners. However, little is currently known about the core underlying factors that influence speech recognition abilities. Purpose: To investigate sensory, perceptual, and neurocognitive differences between good and poor listeners on the Perceptually Robust English Sentence Test Open-set (PRESTO), a new high-variability sentence recognition test under adverse listening conditions. Research Design: Participants who fell in the upper quartile (HiPRESTO listeners) or lower quartile (LoPRESTO listeners) on key word recognition on sentences from PRESTO in multitalker babble completed a battery of behavioral tasks and self-report questionnaires designed to investigate real-world hearing difficulties, indexical processing skills, and neurocognitive abilities. Study Sample: Young, normal-hearing adults (N = 40) from the Indiana University community participated in the current study. Data Collection and Analysis: Participants' assessment of their own real-world hearing difficulties was measured with a self-report questionnaire on situational hearing and hearing health history. Indexical processing skills were assessed using a talker discrimination task, a gender discrimination task, and a forced-choice regional dialect categorization task. Neurocognitive abilities were measured with the Auditory Digit Span Forward (verbal short-term memory) and Digit Span Backward (verbal working memory) tests, the Stroop Color and Word Test (attention/inhibition), the WordFam word familiarity test (vocabulary size), the Behavioral Rating Inventory of Executive Function–Adult Version (BRIEF-A) self-report questionnaire on executive function, and two performance subtests of the Wechsler Abbreviated Scale of Intelligence (WASI) Performance Intelligence Quotient (IQ; nonverbal intelligence). Scores on self-report questionnaires and behavioral tasks were tallied and analyzed by listener group (HiPRESTO and LoPRESTO). Results: The extreme groups did not differ overall on self-reported hearing difficulties in real-world listening environments. However, an item-by-item analysis of questions revealed that LoPRESTO listeners reported significantly greater difficulty understanding speakers in a public place. HiPRESTO listeners were significantly more accurate than LoPRESTO listeners at gender discrimination and regional dialect categorization, but they did not differ on talker discrimination accuracy or response time, or gender discrimination response time. HiPRESTO listeners also had longer forward and backward digit spans, higher word familiarity ratings on the WordFam test, and lower (better) scores for three individual items on the BRIEF-A questionnaire related to cognitive load. The two groups did not differ on the Stroop Color and Word Test or either of the WASI performance IQ subtests. Conclusions: HiPRESTO listeners and LoPRESTO listeners differed in indexical processing abilities, short-term and working memory capacity, vocabulary size, and some domains of executive functioning. These findings suggest that individual differences in the ability to encode and maintain highly detailed episodic information in speech may underlie the variability observed in speech recognition performance in adverse listening conditions using high-variability PRESTO sentences in multitalker babble.


2020 ◽  
Vol 31 (05) ◽  
pp. 324-335
Author(s):  
Terrin N. Tamati ◽  
Christin Ray ◽  
Kara J. Vasil ◽  
David B. Pisoni ◽  
Aaron C. Moberly

Abstract Background Postlingually deafened adult cochlear implant (CI) users routinely display large individual differences in the ability to recognize and understand speech, especially in adverse listening conditions. Although individual differences have been linked to several sensory (‘‘bottom-up’') and cognitive (‘‘top-down’') factors, little is currently known about the relative contributions of these factors in high- and low-performing CI users. Purpose The aim of the study was to investigate differences in sensory functioning and neurocognitive functioning between high- and low-performing CI users on the Perceptually Robust English Sentence Test Open-set (PRESTO), a high-variability sentence recognition test containing sentence materials produced by multiple male and female talkers with diverse regional accents. Research Design CI users with accuracy scores in the upper (HiPRESTO) or lower quartiles (LoPRESTO) on PRESTO in quiet completed a battery of behavioral tasks designed to assess spectral resolution and neurocognitive functioning. Study Sample Twenty-one postlingually deafened adult CI users, with 11 HiPRESTO and 10 LoPRESTO participants. Data Collection and Analysis A discriminant analysis was carried out to determine the extent to which measures of spectral resolution and neurocognitive functioning discriminate HiPRESTO and LoPRESTO CI users. Auditory spectral resolution was measured using the Spectral-Temporally Modulated Ripple Test (SMRT). Neurocognitive functioning was assessed with visual measures of working memory (digit span), inhibitory control (Stroop), speed of lexical/phonological access (Test of Word Reading Efficiency), and nonverbal reasoning (Raven's Progressive Matrices). Results HiPRESTO and LoPRESTO CI users were discriminated primarily by performance on the SMRT and secondarily by the Raven's test. No other neurocognitive measures contributed substantially to the discriminant function. Conclusions High- and low-performing CI users differed by spectral resolution and, to a lesser extent, nonverbal reasoning. These findings suggest that the extreme groups are determined by global factors of richness of sensory information and domain-general, nonverbal intelligence, rather than specific neurocognitive processing operations related to speech perception and spoken word recognition. Thus, although both bottom-up and top-down information contribute to speech recognition performance, low-performing CI users may not be sufficiently able to rely on neurocognitive skills specific to speech recognition to enhance processing of spectrally degraded input in adverse conditions involving high talker variability.


1997 ◽  
Vol 85 (3_suppl) ◽  
pp. 1428-1430 ◽  
Author(s):  
Sy-Ying Lee ◽  
Stephen Krashen ◽  
Lucy Tse

Scores on the Author Recognition Test and self-report of number of books read (free voluntary reading) made independent contributions to scores on a passive vocabulary measure for 30 adult native-speakers of English.


2015 ◽  
Vol 26 (06) ◽  
pp. 582-594 ◽  
Author(s):  
Kathleen F. Faulkner ◽  
Terrin N. Tamati ◽  
Jaimie L. Gilbert ◽  
David B. Pisoni

Background: There is a pressing clinical need for the development of ecologically valid and robust assessment measures of speech recognition. Perceptually Robust English Sentence Test Open-set (PRESTO) is a new high-variability sentence recognition test that is sensitive to individual differences and was designed for use with several different clinical populations. PRESTO differs from other sentence recognition tests because the target sentences differ in talker, gender, and regional dialect. Increasing interest in using PRESTO as a clinical test of spoken word recognition dictates the need to establish equivalence across test lists. Purpose: The purpose of this study was to establish list equivalency of PRESTO for clinical use. Research Design: PRESTO sentence lists were presented to three groups of normal-hearing listeners in noise (multitalker babble [MTB] at 0 dB signal-to-noise ratio) or under eight-channel cochlear implant simulation (CI-Sim). Study Sample: Ninety-one young native speakers of English who were undergraduate students from the Indiana University community participated in this study. Data Collection and Analysis: Participants completed a sentence recognition task using different PRESTO sentence lists. They listened to sentences presented over headphones and typed in the words they heard on a computer. Keyword scoring was completed offline. Equivalency for sentence lists was determined based on the list intelligibility (mean keyword accuracy for each list compared with all other lists) and listener consistency (the relation between mean keyword accuracy on each list for each listener). Results: Based on measures of list equivalency and listener consistency, ten PRESTO lists were found to be equivalent in the MTB condition, nine lists were equivalent in the CI-Sim condition, and six PRESTO lists were equivalent in both conditions. Conclusions: PRESTO is a valuable addition to the clinical toolbox for assessing sentence recognition across different populations. Because the test condition influenced the overall intelligibility of lists, researchers and clinicians should take the presentation conditions into consideration when selecting the best PRESTO lists for their research or clinical protocols.


Author(s):  
Adam K. Bosen ◽  
Victoria A. Sevich ◽  
Shauntelle A. Cannon

Purpose In individuals with cochlear implants, speech recognition is not associated with tests of working memory that primarily reflect storage, such as forward digit span. In contrast, our previous work found that vocoded speech recognition in individuals with normal hearing was correlated with performance on a forward digit span task. A possible explanation for this difference across groups is that variability in auditory resolution across individuals with cochlear implants could conceal the true relationship between speech and memory tasks. Here, our goal was to determine if performance on forward digit span and speech recognition tasks are correlated in individuals with cochlear implants after controlling for individual differences in auditory resolution. Method We measured sentence recognition ability in 20 individuals with cochlear implants with Perceptually Robust English Sentence Test Open-set sentences. Spectral and temporal modulation detection tasks were used to assess individual differences in auditory resolution, auditory forward digit span was used to assess working memory storage, and self-reported word familiarity was used to assess vocabulary. Results Individual differences in speech recognition were predicted by spectral and temporal resolution. A correlation was found between forward digit span and speech recognition, but this correlation was not significant after controlling for spectral and temporal resolution. No relationship was found between word familiarity and speech recognition. Forward digit span performance was not associated with individual differences in auditory resolution. Conclusions Our findings support the idea that sentence recognition in individuals with cochlear implants is primarily limited by individual differences in working memory processing, not storage. Studies examining the relationship between speech and memory should control for individual differences in auditory resolution.


2017 ◽  
Author(s):  
Marc Wittmann ◽  
Henrike Fiedler ◽  
Wilhelm Gros ◽  
Julia Mossbridge ◽  
Cintia Retz Lucci

With this cross-sectional study we investigated how individual differences regarding present- and future-oriented mental processes are related to the experience of time in the seconds and minutes range. A sample of students (N = 100) filled out self-report measures of time perspective (ZTPI), mindfulness (FMI), impulsiveness (BIS), and the daydreaming frequency scale (DDFS). Furthermore they were asked to (a) retrospectively judge the duration of a waiting period of five minutes, and (b) to prospectively perform an visual duration reproduction task with intervals of 3, 6, and 9 seconds. Regression models show that (a) being more present fatalistic (ZTPI) and more impulsive are related to longer duration estimates of the waiting period, and (b) having a stronger propensity to daydream leads to a stronger under-reproduction of temporal intervals. These findings show how personality traits related to present orientation are associated with the state-like perception of duration.


2019 ◽  
Author(s):  
Justin C. Hayes ◽  
Katherine L Alfred ◽  
Rachel Pizzie ◽  
Joshua S. Cetron ◽  
David J. M. Kraemer

Modality specific encoding habits account for a significant portion of individual differences reflected in functional activation during cognitive processing. Yet, little is known about how these habits of thought influence long-term structural changes in the brain. Traditionally, habits of thought have been assessed using self-report questionnaires such as the visualizer-verbalizer questionnaire. Here, rather than relying on subjective reports, we measured habits of thought using a novel behavioral task assessing attentional biases toward picture and word stimuli. Hypothesizing that verbal habits of thought are reflected in the structural integrity of white matter tracts and cortical regions of interest, we used diffusion tensor imaging and volumetric analyses to assess this prediction. Using a whole-brain approach, we show that word bias is associated with increased volume in several bilateral language regions, in both white and grey matter parcels. Additionally, connectivity within white matter tracts within an a priori speech production network increased as a function of word bias. These results demonstrate long-term structural and morphological differences associated with verbal habits of thought.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Davide Marengo ◽  
Kenneth L. Davis ◽  
Gökçe Özkarar Gradwohl ◽  
Christian Montag

AbstractThe Affective Neuroscience Personality Scales (ANPS) were constructed as a self-report assessment to measure individual differences in Jaak Panksepp’s cross-species primary emotional systems: SEEKING, PLAY, CARE (positive emotions) and FEAR, SADNESS, ANGER (negative emotions). Beginning with the first published work on the ANPS in 2003, individual differences on the ANPS measures of these six primary emotional systems have been consistently linked to Big Five personality traits. From a theoretical perspective, these primary emotional systems arising from subcortical regions, shed light on the nature of the Big Five personality traits from an evolutionary perspective, because each of these primary emotional systems represent a tool for survival endowing mammalian species with inherited behavioral programs to react appropriately to complex environments. The present work revisited 21 available samples where both ANPS and Big Five measures have been administered. Our meta-analytical analysis provides solid evidence that high SEEKING relates to high Openness to Experience, high PLAY to high Extraversion, high CARE/low ANGER to high Agreeableness and high FEAR/SADNESS/ANGER to high Neuroticism. This seems to be true regardless of the ANPS inventory chosen, although much more work is needed in this area. Associations between primary emotional systems and Conscientiousness were in the lower effect size area across all six primary emotions, thereby supporting the idea that Conscientiousness rather seems to be less directly related with the subcortical primary emotions and likely is the most cognitive/cortical personality construct out of the Big Five. In sum, the present work underlines the idea that individual differences in primary emotional systems represent evolutionarily ancient foundations of human personality, given their a) meaningful links to the prominent Big Five model and b) their origins lying in subcortical areas of the human brain.


2016 ◽  
Vol 2 (s1) ◽  
Author(s):  
Shiri Lev-Ari

AbstractPeople learn language from their social environment. Therefore, individual differences in the input that their social environment provides could influence their linguistic performance. Nevertheless, investigation of the role of individual differences in input on performance has been mostly restricted to first and second language acquisition. In this paper I argue that individual differences in input can influence linguistic performance even in adult native speakers. Specifically, differences in input can affect performance by influencing people’s knowledgebase, by modulating their processing manner, and by shaping expectations. Therefore, studying the role that individual differences in input play can improve our understanding of how language is learned, processed and represented.


1997 ◽  
Vol 5 (1) ◽  
pp. 83-90 ◽  
Author(s):  
Janine L. Pillitteri ◽  
Lynn T. Kozlowski ◽  
Christine T. Sweeney ◽  
Todd F. Heatherton

Sign in / Sign up

Export Citation Format

Share Document