An audio-visual corpus for speech perception and automatic speech recognition
2006 ◽ Vol 120 (5) ◽ pp. 2421-2424
Author(s): Martin Cooke ◽ Jon Barker ◽ Stuart Cunningham ◽ Xu Shao
2017
Author(s): Thomas Schatz ◽ Francis Bach ◽ Emmanuel Dupoux

We test the potential of standard Automatic Speech Recognition (ASR) systems trained on large corpora of continuous speech as quantitative models of human speech processing. In human adults, speech perception is attuned to efficiently process native speech sounds, at the expense of difficulties in processing non-native sounds. We use ABX-discriminability measures to test whether ASR models can account for the patterns of confusion between speech sounds observed in humans. We show that ASR models reproduce some well-documented effects in non-native phonetic perception. Beyond the immediate results, our methodology opens up the possibility of a more systematic investigation of phonetic category perception in humans.
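The ABX-discriminability measure mentioned in the abstract can be illustrated with a minimal sketch. In an ABX task, a triplet is formed from two tokens A and X of one phonetic category and a token B of another; the triplet counts as correct when X is closer to A than to B under some distance over the model's representations. The score is the fraction of correct triplets (0.5 is chance, 1.0 is perfect discrimination). The function name, the Euclidean distance, and the toy Gaussian data below are illustrative assumptions, not the authors' exact pipeline:

```python
# Hedged sketch of an ABX discriminability score over two phonetic
# categories, each given as an array of feature vectors (hypothetical data).
import itertools
import numpy as np

def abx_score(cat_a, cat_b, dist=lambda u, v: np.linalg.norm(u - v)):
    """Fraction of (A, B, X) triplets where X (same category as A)
    is closer to A than to B. cat_a, cat_b: (n_tokens, n_features)."""
    correct = total = 0
    # A and X are distinct tokens drawn from the same category.
    for a, x in itertools.permutations(range(len(cat_a)), 2):
        for b in range(len(cat_b)):
            correct += dist(cat_a[a], cat_a[x]) < dist(cat_b[b], cat_a[x])
            total += 1
    return correct / total

# Toy example: two well-separated clusters stand in for two phone categories.
rng = np.random.default_rng(0)
ca = rng.normal(0.0, 0.1, size=(5, 3))
cb = rng.normal(2.0, 0.1, size=(5, 3))
print(abx_score(ca, cb))  # near 1.0 for well-separated categories
```

In practice, representations of variable-length speech tokens are compared with a frame-wise distance aggregated along a dynamic-time-warping alignment rather than a plain vector norm; the fixed-length vectors here keep the sketch short.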

