scholarly journals An auditory feature detection circuit for sound pattern recognition

2015 ◽  
Vol 1 (8) ◽  
pp. e1500325 ◽  
Author(s):  
Stefan Schöneich ◽  
Konstantinos Kostarakos ◽  
Berthold Hedwig

From human language to birdsong and the chirps of insects, acoustic communication is based on amplitude and frequency modulation of sound signals. Whereas frequency processing starts at the level of the hearing organs, temporal features of the sound amplitude such as rhythms or pulse rates require processing by central auditory neurons. Besides several theoretical concepts, brain circuits that detect temporal features of a sound signal are poorly understood. We focused on acoustically communicating field crickets and show how five neurons in the brain of females form an auditory feature detector circuit for the pulse pattern of the male calling song. The processing is based on a coincidence detector mechanism that selectively responds when a direct neural response and an intrinsically delayed response to the sound pulses coincide. This circuit provides the basis for auditory mate recognition in field crickets and reveals a principal mechanism of sensory processing underlying the perception of temporal patterns.

2002 ◽  
Vol 111 (5) ◽  
pp. 2466
Author(s):  
Adelbert W. Bronkhorst ◽  
James T. Townsend

2014 ◽  
Vol 26 (3) ◽  
pp. 523-556 ◽  
Author(s):  
Phillip B. Schafer ◽  
Dezhe Z. Jin

Speech recognition in noisy conditions is a major challenge for computer systems, but the human brain performs it routinely and accurately. Automatic speech recognition (ASR) systems that are inspired by neuroscience can potentially bridge the performance gap between humans and machines. We present a system for noise-robust isolated word recognition that works by decoding sequences of spikes from a population of simulated auditory feature-detecting neurons. Each neuron is trained to respond selectively to a brief spectrotemporal pattern, or feature, drawn from the simulated auditory nerve response to speech. The neural population conveys the time-dependent structure of a sound by its sequence of spikes. We compare two methods for decoding the spike sequences—one using a hidden Markov model–based recognizer, the other using a novel template-based recognition scheme. In the latter case, words are recognized by comparing their spike sequences to template sequences obtained from clean training data, using a similarity measure based on the length of the longest common sub-sequence. Using isolated spoken digits from the AURORA-2 database, we show that our combined system outperforms a state-of-the-art robust speech recognizer at low signal-to-noise ratios. Both the spike-based encoding scheme and the template-based decoding offer gains in noise robustness over traditional speech recognition methods. Our system highlights potential advantages of spike-based acoustic coding and provides a biologically motivated framework for robust ASR development.


eLife ◽  
2021 ◽  
Vol 10 ◽  
Author(s):  
Jan Clemens ◽  
Stefan Schöneich ◽  
Konstantin Kostarakos ◽  
R Matthias Hennig ◽  
Berthold Hedwig

How neural networks evolved to generate the diversity of species-specific communication signals is unknown. For receivers of the signals one hypothesis is that novel recognition phenotypes arise from parameter variation in computationally flexible feature detection networks. We test this hypothesis in crickets, where males generate and females recognize the mating songs with a species-specific pulse pattern, by investigating whether the song recognition network in the cricket brain has the computational flexibility to recognize different temporal features. Using electrophysiological recordings from the network that recognizes crucial properties of the pulse pattern on the short timescale in the cricket Gryllus bimaculatus, we built a computational model that reproduces the neuronal and behavioral tuning of that species. An analysis of the model's parameter space reveals that the network can provide all recognition phenotypes for pulse duration and pause known in crickets and even other insects. Phenotypic diversity in the model is consistent with known preference types in crickets and other insects, and arise from computations that likely evolved to increase energy efficiency and robustness of pattern recognition. The model's parameter to phenotype mapping is degenerate-different network parameters can create similar changes in the phenotype-which likely supports evolutionary plasticity. Our study suggests that computationally flexible networks underlie the diverse pattern recognition phenotypes and we reveal network properties that constrain and support behavioral diversity.


1979 ◽  
Vol 44 ◽  
pp. 349-355
Author(s):  
R.W. Milkey

The focus of discussion in Working Group 3 was on the Thermodynamic Properties as determined spectroscopically, including the observational techniques and the theoretical modeling of physical processes responsible for the emission spectrum. Recent advances in observational techniques and theoretical concepts make this discussion particularly timely. It is wise to remember that the determination of thermodynamic parameters is not an end in itself and that these are interesting chiefly for what they can tell us about the energetics and mass transport in prominences.


Author(s):  
Charles F. Simpson ◽  
J. W. Carlisle ◽  
J. M. Kling

It is not known whether loss of nuclei from erythroblasts occurs by extrusion or by karyolysis, or both. In this study, blood from anemic dogs was utilized. Dogs were anemic as a result of severe parasitism with a hematozoic parasite, Babesia canis, or were made anemic by treating them with phenyl hydrazine hydrochloride.The mechanism of denucleation was the same in both types of anemia investigated. It was found that extrusion was the principal mechanism of denucleation from the erythroblast. However, probably karyolysis also occurred occasionally, prior to extrusion.


1995 ◽  
Vol 38 (5) ◽  
pp. 1014-1024 ◽  
Author(s):  
Robert L. Whitehead ◽  
Nicholas Schiavetti ◽  
Brenda H. Whitehead ◽  
Dale Evan Metz

The purpose of this investigation was twofold: (a) to determine if there are changes in specific temporal characteristics of speech that occur during simultaneous communication, and (b) to determine if known temporal rules of spoken English are disrupted during simultaneous communication. Ten speakers uttered sentences consisting of a carrier phrase and experimental CVC words under conditions of: (a) speech, (b) speech combined with signed English, and (c) speech combined with signed English for every word except the CVC word that was fingerspelled. The temporal features investigated included: (a) sentence duration, (b) experimental CVC word duration, (c) vowel duration in experimental CVC words, (d) pause duration before and after experimental CVC words, and (e) consonantal effects on vowel duration. Results indicated that for all durational measures, the speech/sign/fingerspelling condition was longest, followed by the speech/sign condition, with the speech condition being shortest. It was also found that for all three speaking conditions, vowels were longer in duration when preceding voiced consonants than vowels preceding their voiceless cognates, and that a low vowel was longer in duration than a high vowel. These findings indicate that speakers consistently reduced their rate of speech when using simultaneous communication, but did not violate these specific temporal rules of English important for consonant and vowel perception.


Author(s):  
Peter Vorderer

This paper points to new developments in the context of entertainment theory. Starting from a background of well-established theories that have been proposed and elaborated mainly by Zillmann and his collaborators since the 1980s, a new two-factor model of entertainment is introduced. This model encompasses “enjoyment” and “appreciation” as two independent factors. In addition, several open questions regarding cultural differences in humans’ responses to entertainment products or the usefulness of various theoretical concepts like “presence,” “identification,” or “transportation” are also discussed. Finally, the question of why media users are seeking entertainment is brought to the forefront, and a possibly relevant need such as the “search for meaningfulness” is mentioned as a possible major candidate for such an explanation.


2007 ◽  
Author(s):  
Jan Theeuwes ◽  
Erik van der Burg ◽  
Artem V. Belopolsky

Sign in / Sign up

Export Citation Format

Share Document