An auditory feature detection circuit for sound pattern recognition

Stefan Schöneich; Konstantinos Kostarakos; Berthold Hedwig

doi:10.1126/sciadv.1500325

An auditory feature detection circuit for sound pattern recognition

Science Advances ◽

10.1126/sciadv.1500325 ◽

2015 ◽

Vol 1 (8) ◽

pp. e1500325 ◽

Cited By ~ 50

Author(s):

Stefan Schöneich ◽

Konstantinos Kostarakos ◽

Berthold Hedwig

Keyword(s):

Feature Detection ◽

Mate Recognition ◽

Delayed Response ◽

Principal Mechanism ◽

Theoretical Concepts ◽

Field Crickets ◽

Temporal Features ◽

Auditory Feature ◽

Sound Pattern Recognition ◽

Central Auditory Neurons

From human language to birdsong and the chirps of insects, acoustic communication is based on amplitude and frequency modulation of sound signals. Whereas frequency processing starts at the level of the hearing organs, temporal features of the sound amplitude such as rhythms or pulse rates require processing by central auditory neurons. Besides several theoretical concepts, brain circuits that detect temporal features of a sound signal are poorly understood. We focused on acoustically communicating field crickets and show how five neurons in the brain of females form an auditory feature detector circuit for the pulse pattern of the male calling song. The processing is based on a coincidence detector mechanism that selectively responds when a direct neural response and an intrinsically delayed response to the sound pulses coincide. This circuit provides the basis for auditory mate recognition in field crickets and reveals a principal mechanism of sensory processing underlying the perception of temporal patterns.

Download Full-text

Auditory feature detection for stimuli presented from different directions

The Journal of the Acoustical Society of America ◽

10.1121/1.4809155 ◽

2002 ◽

Vol 111 (5) ◽

pp. 2466

Author(s):

Adelbert W. Bronkhorst ◽

James T. Townsend

Keyword(s):

Feature Detection ◽

Auditory Feature

Download Full-text

Noise-Robust Speech Recognition Through Auditory Feature Detection and Spike Sequence Decoding

Neural Computation ◽

10.1162/neco_a_00557 ◽

2014 ◽

Vol 26 (3) ◽

pp. 523-556 ◽

Cited By ~ 3

Author(s):

Phillip B. Schafer ◽

Dezhe Z. Jin

Keyword(s):

Speech Recognition ◽

Feature Detection ◽

Training Data ◽

Neural Population ◽

Noise Robustness ◽

Combined System ◽

Spike Sequences ◽

Auditory Feature ◽

Noise Robust ◽

Speech Recognizer

Speech recognition in noisy conditions is a major challenge for computer systems, but the human brain performs it routinely and accurately. Automatic speech recognition (ASR) systems that are inspired by neuroscience can potentially bridge the performance gap between humans and machines. We present a system for noise-robust isolated word recognition that works by decoding sequences of spikes from a population of simulated auditory feature-detecting neurons. Each neuron is trained to respond selectively to a brief spectrotemporal pattern, or feature, drawn from the simulated auditory nerve response to speech. The neural population conveys the time-dependent structure of a sound by its sequence of spikes. We compare two methods for decoding the spike sequences—one using a hidden Markov model–based recognizer, the other using a novel template-based recognition scheme. In the latter case, words are recognized by comparing their spike sequences to template sequences obtained from clean training data, using a similarity measure based on the length of the longest common sub-sequence. Using isolated spoken digits from the AURORA-2 database, we show that our combined system outperforms a state-of-the-art robust speech recognizer at low signal-to-noise ratios. Both the spike-based encoding scheme and the template-based decoding offer gains in noise robustness over traditional speech recognition methods. Our system highlights potential advantages of spike-based acoustic coding and provides a biologically motivated framework for robust ASR development.

Download Full-text

A small, computationally flexible network produces the phenotypic diversity of song recognition in crickets

eLife ◽

10.7554/elife.61475 ◽

2021 ◽

Vol 10 ◽

Author(s):

Jan Clemens ◽

Stefan Schöneich ◽

Konstantin Kostarakos ◽

R Matthias Hennig ◽

Berthold Hedwig

Keyword(s):

Pattern Recognition ◽

Feature Detection ◽

Phenotypic Diversity ◽

Communication Signals ◽

Increase Energy Efficiency ◽

Song Recognition ◽

Electrophysiological Recordings ◽

Temporal Features ◽

Flexible Networks ◽

Species Specific

How neural networks evolved to generate the diversity of species-specific communication signals is unknown. For receivers of the signals one hypothesis is that novel recognition phenotypes arise from parameter variation in computationally flexible feature detection networks. We test this hypothesis in crickets, where males generate and females recognize the mating songs with a species-specific pulse pattern, by investigating whether the song recognition network in the cricket brain has the computational flexibility to recognize different temporal features. Using electrophysiological recordings from the network that recognizes crucial properties of the pulse pattern on the short timescale in the cricket Gryllus bimaculatus, we built a computational model that reproduces the neuronal and behavioral tuning of that species. An analysis of the model's parameter space reveals that the network can provide all recognition phenotypes for pulse duration and pause known in crickets and even other insects. Phenotypic diversity in the model is consistent with known preference types in crickets and other insects, and arise from computations that likely evolved to increase energy efficiency and robustness of pattern recognition. The model's parameter to phenotype mapping is degenerate-different network parameters can create similar changes in the phenotype-which likely supports evolutionary plasticity. Our study suggests that computationally flexible networks underlie the diverse pattern recognition phenotypes and we reveal network properties that constrain and support behavioral diversity.

Download Full-text

Models of Thermodynamic Properties of Prominences

International Astronomical Union Colloquium ◽

10.1017/s0252921100065799 ◽

1979 ◽

Vol 44 ◽

pp. 349-355

Author(s):

R.W. Milkey

Keyword(s):

Thermodynamic Properties ◽

Mass Transport ◽

Emission Spectrum ◽

Thermodynamic Parameters ◽

Working Group ◽

Physical Processes ◽

Theoretical Concepts ◽

Group 3 ◽

Observational Techniques

The focus of discussion in Working Group 3 was on the Thermodynamic Properties as determined spectroscopically, including the observational techniques and the theoretical modeling of physical processes responsible for the emission spectrum. Recent advances in observational techniques and theoretical concepts make this discussion particularly timely. It is wise to remember that the determination of thermodynamic parameters is not an end in itself and that these are interesting chiefly for what they can tell us about the energetics and mass transport in prominences.

Download Full-text

The Mechanism of Erythroblast Denucleation

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100060258 ◽

1967 ◽

Vol 25 ◽

pp. 48-49

Author(s):

Charles F. Simpson ◽

J. W. Carlisle ◽

J. M. Kling

Keyword(s):

Babesia Canis ◽

Phenyl Hydrazine ◽

Principal Mechanism

It is not known whether loss of nuclei from erythroblasts occurs by extrusion or by karyolysis, or both. In this study, blood from anemic dogs was utilized. Dogs were anemic as a result of severe parasitism with a hematozoic parasite, Babesia canis, or were made anemic by treating them with phenyl hydrazine hydrochloride.The mechanism of denucleation was the same in both types of anemia investigated. It was found that extrusion was the principal mechanism of denucleation from the erythroblast. However, probably karyolysis also occurred occasionally, prior to extrusion.

Download Full-text

Temporal Characteristics of Speech in Simultaneous Communication

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3805.1014 ◽

1995 ◽

Vol 38 (5) ◽

pp. 1014-1024 ◽

Cited By ~ 26

Author(s):

Robert L. Whitehead ◽

Nicholas Schiavetti ◽

Brenda H. Whitehead ◽

Dale Evan Metz

Keyword(s):

Pause Duration ◽

Vowel Duration ◽

Vowel Perception ◽

Temporal Characteristics ◽

Spoken English ◽

Temporal Features ◽

Before And After ◽

Word Duration ◽

Rate Of Speech ◽

Speech Condition

The purpose of this investigation was twofold: (a) to determine if there are changes in specific temporal characteristics of speech that occur during simultaneous communication, and (b) to determine if known temporal rules of spoken English are disrupted during simultaneous communication. Ten speakers uttered sentences consisting of a carrier phrase and experimental CVC words under conditions of: (a) speech, (b) speech combined with signed English, and (c) speech combined with signed English for every word except the CVC word that was fingerspelled. The temporal features investigated included: (a) sentence duration, (b) experimental CVC word duration, (c) vowel duration in experimental CVC words, (d) pause duration before and after experimental CVC words, and (e) consonantal effects on vowel duration. Results indicated that for all durational measures, the speech/sign/fingerspelling condition was longest, followed by the speech/sign condition, with the speech condition being shortest. It was also found that for all three speaking conditions, vowels were longer in duration when preceding voiced consonants than vowels preceding their voiceless cognates, and that a low vowel was longer in duration than a high vowel. These findings indicate that speakers consistently reduced their rate of speech when using simultaneous communication, but did not violate these specific temporal rules of English important for consonant and vowel perception.

Download Full-text

What’s Next?

Journal of Media Psychology Theories Methods and Applications ◽

10.1027/1864-1105/a000034 ◽

2011 ◽

Vol 23 (1) ◽

pp. 60-63 ◽

Cited By ~ 32

Author(s):

Peter Vorderer

Keyword(s):

Cultural Differences ◽

Factor Model ◽

New Developments ◽

Theoretical Concepts ◽

Open Questions ◽

Entertainment Products ◽

Entertainment Theory

This paper points to new developments in the context of entertainment theory. Starting from a background of well-established theories that have been proposed and elaborated mainly by Zillmann and his collaborators since the 1980s, a new two-factor model of entertainment is introduced. This model encompasses “enjoyment” and “appreciation” as two independent factors. In addition, several open questions regarding cultural differences in humans’ responses to entertainment products or the usefulness of various theoretical concepts like “presence,” “identification,” or “transportation” are also discussed. Finally, the question of why media users are seeking entertainment is brought to the forefront, and a possibly relevant need such as the “search for meaningfulness” is mentioned as a possible major candidate for such an explanation.

Download Full-text